Sequence of DPV Saccharomyces cerevisiae Ty1 virus
Saccharomyces cerevisiae (strain JB84A containing plasmid pNN162) Ty1-H3 gene, complete cds.
ACC No: M18706
Dated: 2000-03-04 | Length: 5918 | CRC: 393032074
!!NA_SEQUENCE 1.0 ID SCTY1H3A standard; genomic DNA; FUN; 5918 BP. XX AC M18706; XX SV M18706.1 XX DT 16-JUL-1988 (Rel. 16, Created) DT 04-MAR-2000 (Rel. 63, Last updated, Version 3) XX DE Saccharomyces cerevisiae (strain JB84A containing plasmid pNN162) Ty1-H3 DE gene, complete cds. XX KW long terminal repeat; Ty1 transposon B10. XX OS Saccharomyces cerevisiae (baker's yeast) OC Eukaryota; Fungi; Ascomycota; Saccharomycotina; Saccharomycetes; OC Saccharomycetales; Saccharomycetaceae; Saccharomyces. XX RN [1] RP 1-5918 RX MEDLINE; 88246410. RX PUBMED; 2837641. RA Boeke J.D., Eichinger D., Castrillon D., Fink G.R.; RT "The Saccharomyces cerevisiae genome contains functional and nonfunctional RT copies of transposon Ty1"; RL Mol. Cell. Biol. 8(4):1432-1442(1988). XX DR TRANSFAC; R01476; Y$TY1_01. DR TRANSFAC; R01477; Y$TY1_02. XX CC Submitted in computer readable form by J.D.Boeke 25-MAR-1988. XX FH Key Location/Qualifiers FH FT source 1. .5918 FT /db_xref="taxon:4932" FT /mol_type="genomic DNA" FT /organism="Saccharomyces cerevisiae" FT /strain="JB84A" FT LTR 1. .334 FT /note="5' long terminal repeat (delta domain)" FT mRNA 240. .5880 FT /note="Ty-H3 mRNA (3' end +/- 3 bp)" FT CDS 294. .1616 FT /codon_start=1 FT /db_xref="InterPro:IPR001042" FT /db_xref="UniProt/TrEMBL:Q07155" FT /note="putative" FT /gene="TyA" FT /protein_id="AAA66937.1" FT /translation="MESQQLSQHSPNSHGSACASVTSKEVHTNQDPLDVSASKTEECEK FT ASTKANSQQTTTPASSAVPENPHHASPQPASVPPPQNGPYPQQCMMTQNQANPSGWSFY FT GHPSMIPYTPYQMSPMYFPPGPQSQFPQYPSSVGTPLSTPSPESGNTFTDSSSADSDMT FT STKKYVRPPPMLTSPNDFPNWVKTYIKFLQNSNLGGIIPTVNGKPVRQITDDELTFLYN FT TFQIFAPSQFLPTWVKDILSVDYTDIMKILSKSIEKMQSDTQEANDIVTLANLQYNGST FT PADAFETKVTNIIDRLNNNGIHINNKVACQLIMRGLSGEYKFLRYTRHRHLNMTVAELF FT LDIHAIYEEQQGSRNSKPNYRRNPSDEKNDSRSYTNTTKPKVIARNPQKTNNSKSKTAR FT AHNVSTSNNSPSTDNDSISKSTTEPIQLNNKHDLHLRPETY" FT CDS 2095. .5562 FT /codon_start=1 FT /db_xref="GOA:Q07163" FT /db_xref="InterPro:IPR001584" FT /db_xref="UniProt/TrEMBL:Q07163" FT /note="Base 2095 is the position of the first start codon FT in the ORF; putative" FT /gene="TyB" FT /protein_id="AAA66938.1" FT /translation="MLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDYQCPDCLIGKS FT TKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPNSAPSYFISFTDETTKFRWVYPLH FT DRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGITPCYTTTA FT DSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKSKKSARQH FT AGLAGLDISTLLPFGQPVIVNDHNPNSKIHPRGIPGYALHPSRNSYGYIIYLPSLKKTV FT DTTNYVILQGKESRLDQFNYDALTFDEDLNRLTASYHSFIASNEIQESNDLNIESDHDF FT QSDIELHPEQPRNVLSKAVSPTDSTPPSTHTEDSKRVSKTNIRAPREVDPNISESNILP FT SKKRSSTPQISNIESTGSGGMHKLNVPLLAPMSQSNTHESSHASKSKDFRHSDSYSENE FT TNHTNVPISSTGGTNNKTVPQISDQETEKRIIHRSPSIDASPPENNSSHNIVPIKTPTT FT VSEQNTEESIIADLPLPDLPPESPTEFPDPFKELPPINSRQTNSSLGGIGDSNAYTTIN FT SKKRSLEDNETEIKVSRDTWNTKNMRSLEPPRSKKRIHLIAAVKAVKSIKPIRTTLRYD FT EAITYNKDIKEKEKYIEAYHKEVNQLLKMKTWDTDEYYDRKEIDPKRVINSMFIFNKKR FT DGTHKARFVARGDIQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLY FT ADIKEELYIRPPPHLGMNDKLIRLKKSLYGLKQSGANWYETIKSYLIQQCGMEEVRGWS FT CVFKNSQVTICLFVDDMVLFSKNLNSNKRIIEKLKMQYDTKIINLGESDEEIQYDILGL FT EIKYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQDELEIDEDEYK FT EKVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRDK FT QLIWHKNKPTEPDNKLVAISDASYGNQPYYKSQIGNIYLLNGKVIGGKSTKASLTCTST FT TEAEIHAISESVPLLNNLSYLIQELNKKPIIKGLLTDSRSTISIIKSTNEEKFRNRFFG FT TKAMRLRDEVSGNNLYVYYIETKKNIADVMTKPLPIKTFKLLTNKWIH" FT LTR 5585. .5918 FT /note="3' long terminal repeat (delta domain)" XX SQ Sequence 5918 BP; 2111 A; 1270 C; 942 G; 1595 T; 0 other; M18706 Length: 5918 May 9, 2005 17:26 Type: N Check: 396 .. 1 tgttggaata gaaatcaact atcatctact aactagtatt tacattacta 51 gtatattatc atatacggtg ttagaagatg acgcaaatga tgagaaatag 101 tcatctaaat tagtggaagc tgaaacgcaa ggattgataa tgtaatagga 151 tcaatgaata taaacatata aaatgatgat aataatattt atagaattgt 201 gtagaattgc agattccatt ttgaggattc ctatatcctc gaggagaact 251 tctagtatat tctgtatacc taatattata gcctttatca acaatggaat 301 cccaacaatt atctcaacat tcacccaatt ctcatggtag cgcctgtgct 351 tcggttactt ctaaggaagt ccacacaaat caagatccgt tagacgtttc 401 agcttccaaa acagaagaat gtgagaaggc ttccactaag gctaactctc 451 aacagacaac aacacctgct tcatcagctg ttccagagaa cccccatcat 501 gcctctcctc aacctgcttc agtaccacct ccacagaatg ggccgtaccc 551 acagcagtgc atgatgaccc aaaaccaagc caatccatct ggttggtcat 601 tttacggaca cccatctatg attccgtata caccttatca aatgtcgcct 651 atgtactttc cacctgggcc acaatcacag tttccgcagt atccatcatc 701 agttggaacg cctctgagca ctccatcacc tgagtcaggt aatacattta 751 ctgattcatc ctcagcggac tctgatatga catccactaa aaaatatgtc 801 agaccaccac caatgttaac ctcacctaat gactttccaa attgggttaa 851 aacatacatc aaatttttac aaaactcgaa tctcggtggt attattccga 901 cagtaaacgg aaaacccgta cgtcagatca ctgatgatga actcaccttc 951 ttgtataaca cttttcaaat atttgctccc tctcaattcc tacctacctg 1001 ggtcaaagac atcctatccg ttgattatac ggatatcatg aaaattcttt 1051 ccaaaagtat tgaaaaaatg caatctgata cccaagaggc aaacgacatt 1101 gtgaccctgg caaatttgca atataatggc agtacacctg cagatgcatt 1151 tgaaacaaaa gtcacaaaca ttatcgacag actgaacaat aatggcattc 1201 atatcaataa caaggtcgca tgccaattaa ttatgagagg tctatctggc 1251 gaatataaat ttttacgcta cacacgtcat cgacatctaa atatgacagt 1301 cgctgaactg ttcttagata tccatgctat ttatgaagaa caacagggat 1351 cgagaaacag taaacctaat tacaggagaa atccgagtga tgagaagaat 1401 gattctcgca gctatacgaa tacaaccaaa cccaaagtta tagctcggaa 1451 tcctcaaaaa acaaataatt cgaaatcgaa aacagccagg gctcacaatg 1501 tatccacatc taataactct cccagcacgg acaacgattc catcagtaaa 1551 tcaactactg aaccgattca attgaacaat aagcacgacc ttcatcttag 1601 gccagaaact tactgaatct acagtaaatc atactaatca ttctgatgat 1651 gaactccctg gacacctcct tctcgattca ggagcatcac gaacccttat 1701 aagatctgct catcacatac actcagcatc atctaatcct gacataaacg 1751 tagttgatgc tcaaaaaaga aatataccaa ttaacgctat tggtgaccta 1801 caatttcact tccaggacaa caccaaaaca tcaataaagg tattgcacac 1851 tcctaacata gcctatgact tactcagttt gaatgaattg gctgcagtag 1901 atatcacagc atgctttacc aaaaacgtct tagaacggtc tgacggcact 1951 gtacttgcac ctatcgtaaa atatggagac ttttactggg tatctaaaaa 2001 gtacttgctt ccatcaaata tctccgtacc caccatcaat aatgtccata 2051 caagtgaaag tacacgcaaa tatccttatc ctttcattca tcgaatgctt 2101 gcgcatgcca atgcacagac aattcgatac tcacttaaaa ataacaccat 2151 cacgtatttt aacgaatcag atgtcgactg gtctagtgct attgactatc 2201 aatgtcctga ttgtttaatc ggcaaaagca ccaaacacag acatatcaaa 2251 ggttcacgac taaaatacca aaattcatac gaaccctttc aatacctaca 2301 tactgacata tttggtccag ttcacaacct accaaatagt gcaccatcct 2351 atttcatctc atttactgat gagacaacaa aattccgttg ggtttatcca 2401 ttacacgacc gtcgcgagga ctctatcctc gatgttttta ctacgatact 2451 agcttttatt aaaaaccagt ttcaggccag tgtcttggtt atacaaatgg 2501 accgtggttc tgagtatact aacagaactc tccataaatt ccttgaaaaa 2551 aatggtataa ctccatgcta tacaaccaca gcggattccc gagcacatgg 2601 agtcgctgaa cggctaaacc gtaccttatt agatgactgc cgtactcaac 2651 tgcaatgtag tggtttaccg aaccatttat ggttctctgc aatcgaattt 2701 tctactattg tgagaaattc actagcttca cctaaaagca aaaaatctgc 2751 aagacaacat gctggcttgg caggacttga tatcagtact ttgttacctt 2801 tcggtcaacc tgttatcgtc aatgatcaca accctaactc caaaatacat 2851 cctcgtggca tcccaggcta cgctctacat ccgtctcgaa actcttatgg 2901 atatatcatc tatcttccat ccttaaagaa gacagtagat acaactaact 2951 atgttattct tcagggcaag gaatccagat tagatcaatt caattacgac 3001 gcactcactt tcgatgaaga cttaaaccgt ttaactgctt catatcattc 3051 gttcattgcg tcaaatgaga tccaagaatc caatgatctt aacatagaat 3101 ctgaccatga cttccaatcc gacattgaac tacatcctga gcaaccgaga 3151 aatgtccttt caaaagctgt gagtccaacc gattccacac ctccgtcaac 3201 tcatactgaa gattcgaaac gtgtttctaa aaccaatatt cgcgcaccca 3251 gagaagttga ccccaacata tctgaatcta atattcttcc atcaaagaag 3301 agatctagca ccccccaaat ttccaatatc gagagtaccg gttcgggtgg 3351 tatgcataaa ttaaatgttc ctttacttgc tcccatgtcc caatctaaca 3401 cacatgagtc gtcgcacgcc agtaaatcta aagatttcag acactcagac 3451 tcgtacagtg aaaatgagac taatcataca aacgtaccaa tatccagtac 3501 gggtggtacc aacaacaaaa ctgttccgca gataagtgac caagagactg 3551 agaaaaggat tatacaccgt tcaccttcaa tcgatgcttc tccaccggaa 3601 aataattcat cgcacaatat tgttcctatc aaaacgccaa ctactgtttc 3651 tgaacagaat accgaggaat ctatcatcgc tgatctccca ctccctgatc 3701 tacctccaga atctcctacc gaattccctg acccatttaa agaactccca 3751 ccgataaatt ctcgtcaaac taattccagt ttgggtggta ttggtgactc 3801 taatgcctat actactatca acagtaagaa aagatcatta gaagataatg 3851 aaactgaaat taaggtatca cgagacacat ggaatactaa gaatatgcgt 3901 agtttagaac ctccgagatc gaagaaacga attcacctga ttgcagctgt 3951 aaaagcagta aaatcaatca aaccaatacg gacaacctta cgatacgatg 4001 aggcaatcac ctataataaa gatattaaag aaaaagaaaa atatatcgag 4051 gcataccaca aagaagtcaa tcaactgttg aagatgaaaa cttgggacac 4101 tgacgaatat tatgacagaa aagaaataga ccctaaaaga gtaataaact 4151 caatgtttat cttcaacaag aaacgtgacg gtactcataa agctagattt 4201 gttgcaagag gtgatattca gcatcctgac acttacgact caggcatgca 4251 atccaatacc gtacatcact atgcattaat gacatccctg tcacttgcat 4301 tagacaataa ctactatatt acacaattag acatatcttc ggcatatttg 4351 tatgcagaca tcaaagaaga attatacata agacctccac cacatttagg 4401 aatgaatgat aagttgatac gtttgaagaa atcactttat ggattgaaac 4451 aaagtggagc gaactggtac gaaactatca aatcatacct gatacaacaa 4501 tgtggtatgg aagaagttcg tggatggtca tgcgtattta aaaacagtca 4551 agtgacaatt tgtttattcg tagatgatat ggtattgttt agcaaaaatc 4601 taaattcaaa caaaagaatt atagagaagc ttaagatgca atacgacacc 4651 aagattataa atctaggcga aagtgatgag gaaattcaat atgacatact 4701 tggcttagaa atcaaatatc aaagaggtaa atacatgaaa ttaggtatgg 4751 aaaactcatt aactgagaaa atacccaaat taaacgtacc tttgaatcca 4801 aaaggaagaa aacttagcgc tccaggtcaa ccaggtcttt atatagacca 4851 ggatgaacta gaaatagatg aagatgaata caaagagaag gtacatgaaa 4901 tgcaaaagtt gattggtcta gcttcatatg ttggatataa atttagattt 4951 gacttactat actacatcaa cacacttgct caacatatac tattcccctc 5001 taggcaagtt ttagacatga catatgagtt gatacaattc atgtgggaca 5051 ctagagataa acaactgata tggcacaaaa acaaacctac cgagccagat 5101 aataaactag tcgcaataag tgatgcttcg tatggcaacc aaccgtatta 5151 taaatcacaa attggcaaca tatatttact taatggaaag gtaattggag 5201 gaaagtccac caaggcttca ttaacatgta cttcaactac ggaagcagaa 5251 atacacgcga taagtgaatc tgtcccatta ttaaataatc taagttacct 5301 gatacaagaa cttaacaaga aaccaattat taaaggctta cttactgata 5351 gtagatcaac gatcagtata attaagtcta caaatgaaga gaaatttaga 5401 aacagatttt ttggcacaaa ggcaatgaga cttagagatg aagtatcagg 5451 taataattta tacgtatact acatcgagac caagaagaac attgctgatg 5501 tgatgacaaa acctcttccg ataaaaacat ttaaactatt aactaacaaa 5551 tggattcatt agatctatta cattatgggt ggtatgttgg aatagaaatc 5601 aactatcatc tactaactag tatttacatt actagtatat tatcatatac 5651 ggtgttagaa gatgacgcaa atgatgagaa atagtcatct aaattagtgg 5701 aagctgaaac gcaaggattg ataatgtaat aggatcaatg aatataaaca 5751 tataaaatga tgataataat atttatagaa ttgtgtagaa ttgcagattc 5801 ccttttatgg attcctaaat ccttgaggag aacttctagt atattctgta 5851 tacctaatat tatagccttt atcaacaatg gaatcccaac aattatctca 5901 acattcaccc atttctca