Sequence of DPV Arabidopsis thaliana Ta1 virus

Arabidopsis thaliana DNA for copia-like transposable element Ta1-3

ACC No: X13291

Dated: 2002-07-06 | Length: 5258 | CRC: -1722624655

                !!NA_SEQUENCE 1.0
ID   ATTA13     standard; genomic DNA; PLN; 5258 BP.
XX
AC   X13291;
XX
SV   X13291.1
XX
DT   23-NOV-1989 (Rel. 21, Created)
DT   06-JUL-2002 (Rel. 72, Last updated, Version 7)
XX
DE   Arabidopsis thaliana DNA for copia-like transposable element Ta1-3
XX
KW   copia-like element; integrase; polyprotein; protease; repetitive sequence;
KW   retrotransposon; reverse transcriptase; RNA binding protein.
XX
OS   Arabidopsis thaliana (thale cress)
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; rosids;
OC   eurosids II; Brassicales; Brassicaceae; Arabidopsis.
XX
RN   [1]
RP   1-5258
RA   Voytas D.F.;
RT   ;
RL   Submitted (17-OCT-1988) to the EMBL/GenBank/DDBJ databases.
RL   Voytas D.F., Harvard University, 10 Wellman, Massachusetts General
RL   Hospital, Boston, MA 02114.
XX
RN   [2]
RX   DOI; 10.1038/336242a0.
RX   MEDLINE; 89057095.
RX   PUBMED; 2904123.
RA   Voytas D.F., Ausubel F.M.;
RT   "A copia-like transposable element family in Arabidopsis thaliana";
RL   Nature 336(6196):242-244(1988).
XX
CC   The put. polyprotein contains domains for
CC   RNA-binding (RB)          : starting at AA 257,
CC   protease (P)              : starting at AA 318,
CC   integrase (INT)           : starting at AA 463 and
CC   reverse transcriptase (RT): starting at AA 856.
XX
CC   Data kindly reviewed (22-aug-1989) by Voytas D.F.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .5258
FT                   /db_xref="taxon:3702"
FT                   /mol_type="genomic DNA"
FT                   /organism="Arabidopsis thaliana"
FT                   /strain="La-O"
FT                   /clone_lib="Lambda FIX."
FT                   /clone="L2B"
FT   repeat_region   16. .20
FT                   /note="target site for Ta1-3 integration"
FT   repeat_region   21. .5238
FT                   /transposon="Ta1-3"
FT   misc_feature    21. .534
FT                   /note="5' LTR"
FT   CDS             567. .4442
FT                   /product="polyprotein"
FT                   /protein_id="CAA31653.1"
FT                   /translation="MANDPNQNTILKTSFQVFNENSDFSLWKTCMKAHLGLAGLKGIID
FT                   DFDLTMTVPIPKSEGKKIEDGDEQGDSSQTKIVPDLVKIEKSENAMNIIIAHVGDAVLR
FT                   KIDHCKSAAEMWETLNKQYMETSLPNRIYVQLKFYSFKMNDTKSINENVNEFLKIVAEL
FT                   SSLEINVVEEVRAILFLNRLSSRYSQLKHTLKYGNKALSLKDVISAARSLERELNEQKE
FT                   TDKNTSTVLYTNERSRPQTRNQNHNKGGQGRGRSKSNSNAKLTCWYCKKEGHVKKDYFA
FT                   RKRKLESENPGEAGVITEKLVFSEALSVNDLAVRDIWVLDSGCTSHMSARRDWFCSFRE
FT                   DGGPTILLGDDHSVKSQGQGSIKIETHGGTIIGLENVKYVPELRRNLISTGTLDKRGYK
FT                   HEGGDGKVRYFKNQKTALRGELVNGLYILDGNTVLSETCVAEGSKGKTELWHSRLGHIG
FT                   LNNMKVLAGKGLVSKEEIRVLDFCENCVMGKAKKVSFNVGKHNSEDVLRYVHADLWGST
FT                   NVTPSLSGNKYFLSIIDDKTRKVWLYFLRSKDETFDRFCEWKELVENQQNKKVKCLRTD
FT                   NGLEFCNLKFDAYCKEHGIERHKTCTYTPQQNGVAERMNRTIMEKVRCMLNESGLGEEF
FT                   WAEAAATAAYLINRSPASAIDHNVPEELWLNKKPGYKHLRRFGSIAYVHIDQGKLKPRA
FT                   LKGIFIGYPAGTKGYKIWLLEEHKCVISRNVLFHEESVYKDTMKKERVVESEAEPASHS
FT                   KSTLIKVKTPGNLNSGEVIQVSDEEESDESVEEEQEPETQVELPETQTTSSLANYQLAR
FT                   DRERRQIHPPARFTEESGVAFALVTVETLSMEEPQSYQEATSDKEWKKWKLATHEEMDS
FT                   LIKNGTWVLVDKPQNRKIIGCRWLFKLKSGSPGVEPVRYKAQLVAKGYTHREGVDYQEI
FT                   FALVVKHTSIRILMSVVVDQDLELEQMDVKTAFLHGELEEELYMEQPEGCISEDGENKV
FT                   CLLKKSLYGLKQSPRQWNKRFNRFMIDQNFIRSEHDACVYVKQVSEQEHLYLLLYVDDM
FT                   LIAGKSKSEINKVKEQLSMEFEMKDMGPASRILGIDIIRDMKNGVLRMSQASYIHNVVQ
FT                   RFNMAEAKVTRSPIGAHFKLAAVRDDDECIDNNAVPYASAVGSIMYAMIGIRPDLAYVI
FT                   CLVSRYMARPGSIHWEAVKWILRYMRGSQDLNLVFTKEKEFRVTGYCDSDYAADLDRRR
FT                   SVSGYVFTVGGNTVSWKANLQSVTALSTTEAEFMALTEAAKEALWIKGLMKDLGLEQDK
FT                   VTLWCDS"
FT   misc_feature    4725. .5238
FT                   /note="3'LTR"
FT   repeat_region   5239. .5243
FT                   /note="target site for Ta1-3 integration"
XX
SQ   Sequence 5258 BP; 1682 A; 846 C; 1301 G; 1429 T; 0 other;

   X13291  Length: 5258  May 9, 2005 17:26  Type: N  Check: 4686  ..

       1  tttcccaaaa gggaaatcaa tgttggagtt atgatccaat tcctaagttg
      51  ctaaagtcta atgtcgacta ttaacttaag ttgagtttga atttggattg
     101  gaggagctaa accggtctgg ttaagtttgg ttattgaaag aaggaaggat
     151  tgagttcggt ttaatctttg aagctgaaga ctcgacttgg tttagagggt
     201  ttgacgttga ctataaaaag gactcgtctt cttcttttct gtttcatcct
     251  ctgtaacaaa cattgtatct tcttcttctt cctctgatct tgagcttgta
     301  acggtgtgtg taaaagcttg agaaactcca ttgatatagt gaattgctgg
     351  tcagaatcca gccgagacgt aggcttactc attccgagta gctgaactcg
     401  taaatcctct gtgtcacttt attctttgaa tgtttcttgt tttgagagtg
     451  agagattaca aattgagaga cgagagagag gttcgtgtgc gtgagatcac
     501  aaatcgatca aggtttaagg ttcgtttggt aacaagtggt atcagagcca
     551  ttggttcttg cgagctatgg cgaacgatcc aaatcagaac acgatcctga
     601  agacctcgtt tcaagtcttt aacgagaatt cagatttttc gctatggaag
     651  acgtgtatga aggcacatct gggattggca ggacttaaag gcatcatcga
     701  tgattttgat cttacgatga cagtgccaat tccaaaatct gagggaaaga
     751  agattgaaga tggtgacgaa caaggagatt cgtctcaaac aaagattgtt
     801  cctgatctcg tgaagattga gaaatctgaa aacgcgatga acattatcat
     851  cgctcatgtt ggtgatgcag tattgagaaa gatcgatcac tgcaagagtg
     901  cagctgagat gtgggaaact ttgaacaagc aatacatgga aacctcattg
     951  cctaatcgga tctatgtaca gctcaagttc tattcattca agatgaatga
    1001  tactaagtcg atcaacgaaa acgtgaatga attcttaaag atcgtcgcag
    1051  aattgagtag cttggagatc aatgtggttg aagaagtaag agccatcttg
    1101  ttcttgaatc gtttgtcttc aagatattca caactcaaac atacactcaa
    1151  gtatgggaac aaggcattgt cactgaaaga tgtgatatca gctgcacgtt
    1201  ctcttgaaag agaacttaat gaacaaaagg aaactgataa gaacacctct
    1251  acagttttgt atactaatga gagaagcaga cctcagacta gaaatcaaaa
    1301  tcacaacaaa ggaggtcaag ggagaggcag aagcaaatcc aactctaatg
    1351  caaagcttac gtgctggtac tgcaagaaag agggacatgt caaaaaggac
    1401  tattttgcta ggaaaaggaa actagaaagt gaaaatccag gagaagctgg
    1451  agtcatcact gaaaagctgg tgttttctga agcactcagt gtcaatgatc
    1501  tagcagtaag agacatttgg gtacttgact caggttgcac gtctcacatg
    1551  tctgcaagaa gggattggtt ctgcagtttt agagaagatg gtggccctac
    1601  tattcttctg ggagatgacc actcggttaa atctcaagga caaggatcta
    1651  ttaagataga aactcatgga ggcactataa tagggcttga gaatgtgaag
    1701  tatgtacctg aacttagaag gaacctaatc tccacaggta ctcttgacaa
    1751  aaggggatac aaacatgaag gtggtgatgg taaagtgagg tatttcaaga
    1801  atcagaaaac agctttaaga ggagagcttg ttaacggact atacatactt
    1851  gatggaaaca cagtattatc tgaaacgtgt gttgctgaag gatctaaggg
    1901  aaaaacagaa ctctggcaca gtaggctcgg tcatatcggt ctaaacaata
    1951  tgaaggtgtt agcaggaaaa gggctagtga gcaaagaaga aataagggta
    2001  ctggacttct gtgaaaattg tgtcatggga aaggccaaga aagtgagctt
    2051  taatgtggga aagcacaact cagaagatgt tctccgctat gtccatgcag
    2101  atctgtgggg ttccacaaac gtcacacctt cattgtcagg taacaagtat
    2151  ttcttgtcaa taattgatga taaaacacgc aaagtttggt tgtattttct
    2201  caggtctaaa gacgaaacat ttgatcgctt ctgcgagtgg aaagagctcg
    2251  ttgagaatca acaaaacaag aaagtcaagt gtttgagaac tgacaacgga
    2301  ttggagtttt gcaacttgaa gtttgatgct tactgtaaag agcatggaat
    2351  agaaagacac aagacctgca cctatactcc tcagcagaat ggagtagcag
    2401  aacgcatgaa taggacaatc atggagaagg tgaggtgcat gttgaatgag
    2451  tcagggttgg gagaagagtt ttgggcagaa gctgctgcaa ctgcagccta
    2501  tttgataaac aggtccccag cgtctgcaat tgatcataat gtccctgagg
    2551  aattatggtt gaataagaaa cctggttaca aacatttgag gcggtttggt
    2601  tctattgcat atgtccacat agaccaaggg aagttgaagc ctagagcttt
    2651  aaagggaatc tttattggat acccagctgg aacaaaaggg tataagatct
    2701  ggcttctaga agaacataaa tgtgtgataa gccgaaatgt gttatttcat
    2751  gaggaatcag tgtataagga tactatgaaa aaagaaagag ttgtagaaag
    2801  tgaagcagaa cctgctagtc actcaaagag tacactgata aaagtaaaaa
    2851  ctccagggaa tctgaattca ggtgaagtaa ttcaagtatc agatgaagaa
    2901  gaatctgatg aaagtgttga agaagaacag gaacctgaaa ctcaggtgga
    2951  gttaccagaa actcaaacaa ctagttcttt agctaactat caactagcta
    3001  gagatcgaga aagaaggcag atccatcctc ctgctaggtt tacagaagaa
    3051  agtggtgtag catttgcact agtaactgtt gagactttga gtatggagga
    3101  gccgcagagt tatcaggaag caacttctga taaagaatgg aagaaatgga
    3151  aacttgctac tcatgaggag atggattctc tgattaagaa cggtacatgg
    3201  gtgttggttg ataaacccca gaaccgaaag atcattggtt gcagatggtt
    3251  gtttaaactg aagagtggca gtccaggagt tgagcctgtg agatacaagg
    3301  ctcagttagt ggcaaaaggg tacactcata gagagggtgt tgattaccaa
    3351  gagatctttg ctctagtggt taaacacaca tctatcagga tattgatgtc
    3401  tgttgttgtt gatcaagacc tagagttgga acagatggac gtaaagacag
    3451  ctttccttca tggagagtta gaagaagaac tttatatgga acaaccagag
    3501  ggttgcatat ctgaagatgg tgagaataag gtttgcttat tgaagaagtc
    3551  gttgtatggg ttaaaacaat ccccaagaca gtggaacaaa cgcttcaata
    3601  gattcatgat tgatcaaaac ttcattagaa gtgagcatga tgcttgtgta
    3651  tatgtgaagc aggtcagtga acaagaacac ctgtacctgt tgctatacgt
    3701  ggatgatatg ttgattgcag gaaagagcaa atcagaaatt aacaaggtta
    3751  aagagcagct gagcatggaa tttgaaatga aagatatggg accagcgagt
    3801  agaattctcg gcattgacat tataagagac atgaagaatg gagttctacg
    3851  catgtctcag gctagctaca ttcacaatgt ggtccagcgg ttcaacatgg
    3901  ctgaagccaa agtcacacgg tcaccaatag gagctcattt caagctagct
    3951  gcagtgaggg acgatgatga gtgcattgac aacaatgctg taccttatgc
    4001  cagtgcagtt ggcagtatca tgtacgccat gataggtata cgtcctgact
    4051  tagcttatgt tatatgtctg gtaagcaggt acatggcaag accaggcagt
    4101  attcactggg aagcagtcaa gtggattctc aggtacatgc gaggatctca
    4151  ggacttaaat cttgtgttta caaaagagaa agaattcaga gttacggggt
    4201  attgtgattc ggactatgct gctgatttgg atagaagaag atcagtaagt
    4251  ggatacgtgt ttacagtagg tggtaacaca gtaagttgga aggcaaattt
    4301  gcagtcagtg actgcattat caactacaga agccgagttc atggcactta
    4351  cagaagctgc caaagaagct ttatggatta aaggcttaat gaaggacttg
    4401  ggacttgagc aggataaggt aaccctttgg tgtgattcct agtcagctat
    4451  ttgcttgttt aaaaacagta ctcatcatga aaggactaag catatagatg
    4501  tcagatacaa cttcataaga gatgttgtgg aagcaggaga tgtggatgta
    4551  cttaagatac acacttcaag aaatcctgcg gatgctttaa ccaagagcat
    4601  tccggtaaac aagtttcagt cagctttaga gttgctgaag ctggttaagt
    4651  gggactgagg tgattcagcc actgctatga ccatggagag taattcacgg
    4701  ttggaatagg atcaaggtgg agattgttgg agttatgatc caattcctaa
    4751  gttgctaaag tctaatgtcg actattgact taagctgagt ttgaatttgg
    4801  attggaggag ctaaaccggt ctggttaagt ttggttattg aaagaaggaa
    4851  ggattgagtt cggtttaatc tttgaagctg aagactcgac ttggtttaga
    4901  gggtttgacg ttgactataa aaaggattcg tcttcttctt ttctgtttca
    4951  tcctctgtaa caaacattgt atcttcttct tcttcctctg atcttgagct
    5001  tgtaacggtg tgtgtaaaag cttgagaaac tccactgata tagtgaattg
    5051  ctggtcagaa tccagccgag acgtaggctt actcattccg agtagctgaa
    5101  cccgtaaatc ttctgtgtca ctttattctt tgagtgtttc ctgttttgag
    5151  agtgagagat tacaaattga gagacgagag agaggttcgt gtgcgtgaga
    5201  tcacaaatcg atcaaggttt aaggttcgtt tggtaacaat caatataact
    5251  tcaatgta