Sequence of DPV Saccharomyces cerevisiae Ty4 virus

S.cerevisciae TY4 retrotransposon endogenous protease, integrase, reverse transcriptase protein (TY4B) gene, complete cds and gag protein (TY4A) pseudogene.

ACC No: M94164

Dated: 2004-12-02 | Length: 6727 | CRC: 541012383

                !!NA_SEQUENCE 1.0
ID   SCTY4A     standard; genomic DNA; FUN; 6727 BP.
XX
AC   M94164;
XX
SV   M94164.1
XX
DT   20-MAY-1992 (Rel. 31, Created)
DT   02-DEC-2004 (Rel. 82, Last updated, Version 6)
XX
DE   S.cerevisciae TY4 retrotransposon endogenous protease, integrase, reverse
DE   transcriptase protein (TY4B) gene, complete cds and gag protein (TY4A)
DE   pseudogene.
XX
KW   gag gene; integrase; protease; retrotransposon; reverse transcriptase.
XX
OS   Saccharomyces cerevisiae (baker's yeast)
OC   Eukaryota; Fungi; Ascomycota; Saccharomycotina; Saccharomycetes;
OC   Saccharomycetales; Saccharomycetaceae; Saccharomyces.
XX
RN   [1]
RP   1-6727
RX   DOI; 10.1016/0378-1119(92)90039-R.
RX   MEDLINE; 93083972.
RX   PUBMED; 1333437.
RA   Stucka R., Schwarzlose C., Lochmuller H., Hacker U., Feldmann H.;
RT   "Molecular analysis of the yeast Ty4 element: homology with Ty1, copia, and
RT   plant retrotransposons";
RL   Gene 122(1):119-128(1992).
XX
RN   [2]
RP   1-6727
RA   Feldmann H.;
RT   ;
RL   Submitted (18-MAY-1992) to the EMBL/GenBank/DDBJ databases.
RL   Horst Feldmann, Adolf-Butenandt-Institut fur Physiologische Chemie,
RL   Schillerstrasse 44, Munchen, D-80336, Germany
XX
RN   [3]
RP   1-6727
RA   Feldmann H.;
RT   ;
RL   Submitted (15-FEB-1996) to the EMBL/GenBank/DDBJ databases.
RL   Horst Feldmann, Adolf-Butenandt-Institut fur Physiologische Chemie,
RL   Schillerstrasse 44, Munchen, D-80336, Germany
XX
CC   On Mar 8, 1996 this sequence version replaced gi:173091.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .6727
FT                   /db_xref="taxon:4932"
FT                   /mol_type="genomic DNA"
FT                   /organism="Saccharomyces cerevisiae"
FT                   /transposon="retrotransposon TY4"
FT                   /strain="C836"
FT                   /tissue_lib="Ty4-90"
FT   LTR             153. .523
FT   repeat_region   153. .157
FT                   /rpt_type=INVERTED
FT   repeat_region   519. .523
FT                   /rpt_type=INVERTED
FT   CDS             523. .1758
FT                   /codon_start=1
FT                   /note="replace 'taa' at 1180-1182 with 'tta' in Ty4-476 and
FT                   Ty4-832"
FT                   /pseudo
FT                   /gene="TY4A"
FT                   /product="gag protein"
FT   CDS             <1529. .5872
FT                   /codon_start=1
FT                   /note="5' end of coding region undetermined"
FT                   /gene="TY4B"
FT                   /product="endogenous protease, integrase, reverse
FT                   transcriptase protein"
FT                   /protein_id="AAA91746.1"
FT                   /translation="KIAACIANLFSIAQLTAKRNQIGNLGLTRPISQKPIIYKVHRDNN
FT                   HLSPVQNEQKSWNKTQKRSNKVYNSKKLVIIDTGSGVNITNDKTLLHNYEDSNRSTRFF
FT                   GIGKNSSVSVKGYGYIKIKNGHNNTDNKCLLTYYVPEEESTIISCYDLAKTTKMVLSRK
FT                   YTRLGNKIIKIKTKIVNGVIHVKMNELIERFLSDDSKINAIKPTSSPGFKLNKRSITLE
FT                   DAHKRMGHTGIQQIENSIKHNHYEESLDLIKEPNEFWCQTCKISKATKRNHYTGSMNNH
FT                   STDHEPGSSWCMDIFGPVSSSNADTKRYMLIMVDNNTRYCMTSTHFNKNAETILAQVRK
FT                   NIQYVETQFDRKVREINSDRGTEFTNDQIEEYFISKGIHHILTSTQDHAANGRAERYIR
FT                   TIITDATTLLRQSNLRVKFWEYAVTSATNIRNCLEHKSTGKLPLKAISRQPVTVRLMSF
FT                   LPFGEKGIIWNHNHKKLKPSGLPSIILCKDPNSYGYKFFIPSKNKIVTSDNYTIPNYTM
FT                   DGRVRNTQNINKSHQFSSHNDDEEDQIETVTNLCEALENYEDDNKPITRLEDLFTEEEL
FT                   SQIDSNAKYPSPSNNLEGDLDYVFSDVEESGDYDVESELSTTNNSISTDKNKILSNKDF
FT                   NSELASTEISISGIDKKGLINTSHIDEDKYDEKVHRIPSIIQEKLVGSKNTIKINDENK
FT                   ISDRIRSKNIGSILNTGLSRCVDITDESITNKDESMHNAKPELIQEQLKKTNHETSFPK
FT                   EGSIGTNVKFRNTNNEISLKTGDTSLPIKTLESINNHHSNDYSTNKVEKFEKENHHPPP
FT                   IEDIVDMSDQTDMESNCQDGNNLKELKVTDKNVPTDNGTNVSPRLEQNIERSGSPVQTV
FT                   NKSAFLNKEFSSLNMKRKRKRHDKNNSLTSYELERDKKRSKKNRVKLIPDNMETVSAPK
FT                   IRAIYYNEAISKNPDLKEKHEYKQAYHKELQNLKDMKVFDVDVKYSRSEIPDNLIVPTN
FT                   TIFTKKRNGIYKARIVCRGDTQSPDTYSVITTESLNHNHIKIFLMMQTTEICLWTLDIN
FT                   HAFLYAKLEEEIYIPHPLIGDVYVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYT
FT                   PGLYQTEDKNLMIAVYVDDCVIAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDIL
FT                   GMDLVYNKRLGTIDLTLKSFINRMDKKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMSE
FT                   EEFRQGVLKLQQLLGELNYVRHKCRYDIEFAVKKVARLVNYPHERVFYMIYKIIQYLVR
FT                   YKDIGIHYDRDCNKDKKVIAITDASVGSEYDAQSRIGVILWYGMNIFNVYSNKSTNRCV
FT                   SSTEAELHAIYEGYRDSETLKVTLKELGEGDNNDIVMITDSKPAIQGLNRSYQQPKEKF
FT                   TWIKTEIIKEKLKRSITVKITGKGNIADLLTNQYQHLILKDLYKY"
FT   repeat_region   5992. .5996
FT                   /rpt_type=INVERTED
FT   LTR             5992. .6362
FT   repeat_region   6358. .6362
FT                   /rpt_type=INVERTED
XX
SQ   Sequence 6727 BP; 2661 A; 1063 C; 1178 G; 1825 T; 0 other;

   M94164  Length: 6727  May 9, 2005 17:26  Type: N  Check: 4792  ..

       1  gaggagatac ctggcaaaac atttcttgtg agcacaacct caattaaagt
      51  tagacaagta ggtgcactat tggttgcttg ttggctcatc tcgttgagga
     101  atgtaataag tacttgttat taacctgttt ttgtgccatc tatagtggag
     151  agtgttggaa cgagagtaat taatagtgac atgagttgct atggtaacaa
     201  tctaatgctt acatcgtata ttaatgtaca actcgtatac gtttaagtgt
     251  gattgcgcct attgcagaag gaatgttaaa cgagaagctc agacaatact
     301  gaagctgtgt taaagaccta ttagttgaac atgttatggt aggtacatat
     351  atgaggaata tgagtcgtca catcaatgta tagtaactac cggaatcact
     401  attatattgg tcataattaa tatgaccaat cggcgtgtgt tttatatacc
     451  tctcttattt agtataagaa gatcagtact cacttcttca ttaatactaa
     501  tttttaacct ctaattatca acatggcgac cccagtgagg ggtgaaacaa
     551  gaaatgttat tgacgacaac atttctgcgc ggattcaatc gaaagtcaaa
     601  acaaatgata ctgtcagaca gacgccatta agaaaagttt ctattaaaga
     651  tgaacaggtg agacaatatc aaagaaattt aaataggttt aaaaccatac
     701  taaatggttt aaaggcagaa gaggaaaaac tttctgaggc tgatgatatt
     751  cagatgctag ctgaaaaatt attaaaactc ggagaaacca ttgacaaggt
     801  tgagaatagg attgtggatc tagttgaaaa gatacaatta ttggaaacaa
     851  acgagaacaa taatatatta catgaacata tagatgctac agggacttac
     901  tatttattcg atacgttaac ttcaaccaac aaaagattct accctaagga
     951  ttgtgttttt gattatagga ctaataatgt cgagaacatt cctattctct
    1001  taaacaattt taaaaaattc atcaagaaat atcaatttga tgatgtcttt
    1051  gaaaatgata tcatagaaat cgatcctcgt gaaaatgaaa tcttgtgcaa
    1101  gataatcaaa gaaggactcg gtgaaagttt agatatcatg aacacaaata
    1151  caactgacat ttttaggata atcgatggtt aaaaaaacaa atatagaagt
    1201  ttgcatggta gagatgtcag aattagagcc tgggaaaagg ttttggttga
    1251  tacaacatgt agaaattccg cattgttaat gaataaactt caaaagttgg
    1301  tactaatgga aaaatggatt ttttctaaat gctgccaaga ttgtcctaat
    1351  ctaaaggatt acctacaaga agctatcatg ggaaccttac atgaatcctt
    1401  aagaaattct gtgaaacaac gtttgtacaa cattccacat gacgtaggaa
    1451  ttgatcacga agaatttcta atcaatactg ttattgaaac agtaattgat
    1501  ttgagcccaa ttgcagacga tcaaatagaa aatagctgca tgtattgcaa
    1551  atctgttttc cattgctcaa ttaactgcaa aaagaaacca aatagggaac
    1601  ttaggcctga ctcgaccaat ttctcaaaaa cctattatct acaaggtgca
    1651  cagagacaac aaccacttaa gtccagtgca aaacgaacaa aagtcttgga
    1701  acaagacaca aaaaaggtcg aacaaagtgt acaacagcaa aaaactggta
    1751  attattgata ccggttccgg cgtaaacatt accaatgaca aaaccttact
    1801  gcataattac gaagacagta atcgcagtac acgatttttt ggtattggga
    1851  aaaacagttc agtgtctgtt aaagggtatg gctatataaa aatcaagaat
    1901  ggtcacaaca atactgacaa taagtgtcta ttaacttact atgtaccgga
    1951  agaagaatcc actataatca gctgttatga cttagccaag acaaccaaaa
    2001  tggttttaag tcgaaaatat accagattgg gaaacaaaat cataaaaatt
    2051  aaaaccaaga tagttaatgg tgtcattcac gtaaaaatga acgagttaat
    2101  tgaacgcttc ctctccgatg attcaaaaat aaatgcaata aaacctactt
    2151  cttctcctgg atttaaacta aataaaaggt ctattacctt ggaagatgct
    2201  cataaaagaa tgggccatac aggaattcaa caaattgaaa attccataaa
    2251  acataatcat tatgaagaat cccttgactt aatcaaagaa ccaaatgaat
    2301  tttggtgtca aacctgtaaa atctctaaag ccacgaaacg aaatcattat
    2351  accgggtcta tgaataatca tagtactgat catgaaccag gctcatcatg
    2401  gtgcatggat atatttggcc ctgtatcaag ttcaaacgcg gacactaaaa
    2451  ggtacatgct tattatggtg gataacaaca cgagatattg catgacctcc
    2501  acacacttca ataagaatgc tgaaactatt ttagctcaag ttagaaagaa
    2551  tattcagtac gtggaaacac aatttgacag gaaagtcaga gaaattaatt
    2601  cagacagagg tactgaattc acaaatgatc agatagaaga atattttatt
    2651  tcaaaaggaa tacatcacat acttacttct acacaagatc atgctgctaa
    2701  cggaagagca gaaagataca taagaacaat aataactgat gcaacaacac
    2751  tcctaagaca aagtaactta agagtaaaat tttgggaata cgcagtaact
    2801  tctgctacca atataagaaa ttgcctggaa cacaaaagta caggtaaact
    2851  accattgaag gcaatctcac gtcaacctgt gacagtgaga ttaatgtcat
    2901  tcttaccatt tggcgaaaaa ggaataattt ggaatcataa tcacaaaaaa
    2951  ttgaaaccat ctggacttcc ttctataatt ctatgcaaag atccaaatag
    3001  ttatggatac aaattcttta taccatccaa aaataaaatt gtcacatctg
    3051  ataattatac aattcccaac tatacaatgg acggtagagt aagaaatact
    3101  cagaatatta acaagagtca tcaattcagt tcacataatg atgatgaaga
    3151  agatcaaatc gaaacggtca caaacttatg tgaagctttg gaaaactacg
    3201  aagatgataa taaaccaatt actcgcctgg aagatttgtt cacagaggaa
    3251  gagttatctc aaatagactc aaacgcaaaa tacccatctc ctagtaataa
    3301  cctagaaggg gacttggatt acgtattttc tgatgttgag gaatctggag
    3351  attatgacgt tgaatctgaa ctttcaacga caaataattc aatctcaact
    3401  gataaaaaca aaattttgtc aaacaaggat tttaattcag aacttgcatc
    3451  gactgaaata tccatcagtg gaatcgataa gaaaggatta ataaatacaa
    3501  gtcatattga tgaagataag tatgatgaaa aagtacacag aattccatcg
    3551  attatacaag agaaactggt aggaagtaaa aatactatta aaatcaatga
    3601  cgaaaacaaa atctccgaca gaattcgtag taaaaacatt gggagtattt
    3651  taaacactgg actcagtaga tgtgtagata tcaccgatga atctattact
    3701  aacaaagatg agtcaatgca caacgcaaaa cccgaactaa ttcaggagca
    3751  gttaaaaaaa acaaatcatg aaacttcgtt tcctaaagaa gggagcattg
    3801  gaacaaatgt aaaattccga aatacaaaca atgagatttc tttaaaaaca
    3851  ggcgatacga gtttaccaat aaaaacttta gaaagcatta acaatcacca
    3901  tagtaatgat tattccacaa acaaagttga aaagtttgag aaggaaaatc
    3951  atcatccgcc cccgattgag gacattgtgg atatgagtga tcaaactgat
    4001  atggaatcaa actgtcagga tggtaataac ttaaaagaat taaaagtcac
    4051  cgataaaaat gtaccaactg acaatggaac aaatgtgtca ccaaggttgg
    4101  aacaaaatat tgaacgatct ggatcaccag tacaaacagt taataaaagt
    4151  gccttcttaa acaaagaatt cagttctttg aacatgaaaa gaaaacggaa
    4201  aagacacgat aaaaacaata gtctaacaag ctatgaatta gaaagagata
    4251  agaagcgttc aaaaaagaat cgagtgaaat taattccaga taatatggaa
    4301  acagtttcag caccaaaaat tcgagccata tattataatg aagctatttc
    4351  aaaaaatcct gacctcaaag aaaaacatga atacaaacag gcatatcata
    4401  aagaattaca gaatttaaaa gatatgaagg tatttgatgt cgatgtgaag
    4451  tacagtagat cagaaattcc tgataattta atagtaccca ccaacacgat
    4501  attcacaaag aaaagaaatg ggatttataa ggctaggata gtctgcagag
    4551  gtgatactca gtcaccagac acttacagtg taataactac agaatcttta
    4601  aatcacaatc atattaagat attcttaatg atgcaaacaa cagaaatatg
    4651  tttatggacc ctggatatca atcatgcatt cctatatgct aaattggaag
    4701  aagaaatata catcccacat ccgctgatag gagatgtgta cgtcaagcta
    4751  aataaggcgt tatatggtct aaaacagagt cctaaagaat ggaatgatca
    4801  tctaagacaa tacttgaatg gaattggact gaaagataac tcttatactc
    4851  cgggattata ccaaaccgag gataaaaatc taatgattgc agtctatgtt
    4901  gatgactgcg taattgcggc aagcaatgaa cagagattgg atgaattcat
    4951  aaacaaattg aaaagtaatt ttgaactgaa aattacagga acattaatag
    5001  acgatgtact cgatacagat atattaggaa tggatctagt atacaacaaa
    5051  agacttggta ctatcgattt aacattaaaa tcattcataa atagaatgga
    5101  taaaaaatac aacgaggaat tgaaaaagat tagaaaaagt tcaattccgc
    5151  atatgtcaac ttataaaata gatcctaaga aagacgtact gcaaatgtca
    5201  gaagaagagt ttagacaagg tgttctaaag ctacaacaat tactaggtga
    5251  actaaactat gtcagacaca aatgcagata cgacattgaa tttgctgtta
    5301  agaaagtggc tagactagta aattacccac atgaaagagt cttttatatg
    5351  atttacaaaa taatccagta cttggttcgg tataaagata ttggaataca
    5401  ctatgaccga gactgtaata aagacaaaaa ggttattgct ataactgatg
    5451  catcagttgg atcagaatat gatgctcaat caaggattgg agttatatta
    5501  tggtacggta tgaatatttt taatgtttat tctaacaaga gcacaaacag
    5551  atgtgtatca tcaacagaag cagagcttca tgccatttat gaaggctatc
    5601  gagactcaga aacgttgaag gtaacattaa aggagctagg agaaggagac
    5651  aataatgaca ttgtcatgat cactgactca aagccagcca ttcaaggatt
    5701  aaatcgcagc tatcaacaac caaaagagaa attcacttgg ataaaaactg
    5751  aaataataaa agaaaaatta aagagaagta taactgttaa aattaccggc
    5801  aaaggtaata ttgctgattt actaacaaac cagtatcagc atctgatttt
    5851  aaaagattta tacaagtatt aaaaaataaa ataacatcac aggatatttt
    5901  ggcctcaaca gactattgat aattaattaa tgaagttcta aacacacaat
    5951  gaatatctgt tgaagtacaa taatatatct ttaagggagc atgttggaac
    6001  gagagtaatt aatagtgaca tgagttgcta tggtaacaat ctaatgctta
    6051  catcgtatat taatgtacaa ctcgtatacg tttaagtgtg attgcgccta
    6101  ttgcagaagg aatgttaaac gagaagctca gacaatactg aagctgtgtt
    6151  aaagacctat tagttgaaca tgttatggta ggtacatata tgaggaatat
    6201  gagtcgtcac atcaatgtat agtaactacc ggaatcacta ttatattggt
    6251  cataattaat atgaccaatc ggcgtgtgtt ttatatacct ctcttattta
    6301  gtataagaag atcagtactc acttcttcat taatactaat ttttaacctc
    6351  taattatcaa cagagagctt attgcaattg tttttatttc ttggctgcat
    6401  atatcagtct tgacaggctg catggggatg acagttagaa actagccaaa
    6451  ttacccctta tgtagataac aatcattgct tattcgctct tcccccattt
    6501  tttttcttgc tcttgctgtt ttttctttta gcgttcgttt caaggaacaa
    6551  gagaggaaaa aaaatcaaaa gtagaaaaga agaagaaaaa aacaacgtaa
    6601  cacaagttaa caccacaact gaaaaaaaaa ataagaggtg aacgaacgag
    6651  taactgggga gaggaaagca gattccacaa tatacattca aattaaagaa
    6701  atggactcac aaccagttga cgttgat