Sequence of DPV Arabidopsis thaliana AtRE1 virus

Arabidopsis thaliana copia-like retrotransposon AtRE1 gene for polyprotein, complete cds, strain Landsberg erecta.

ACC No: AB021265

Dated: 2004-02-17 | Length: 7405 | CRC: -319436183

                !!NA_SEQUENCE 1.0
ID   AB021265   standard; genomic DNA; PLN; 7405 BP.
XX
AC   AB021265;
XX
SV   AB021265.1
XX
DT   04-JUN-1999 (Rel. 59, Created)
DT   17-FEB-2004 (Rel. 78, Last updated, Version 4)
XX
DE   Arabidopsis thaliana copia-like retrotransposon AtRE1 gene for polyprotein,
DE   complete cds, strain Landsberg erecta.
XX
KW   copia-like retrotransposon AtRE1 polyprotein.
XX
OS   Arabidopsis thaliana (thale cress)
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; rosids;
OC   eurosids II; Brassicales; Brassicaceae; Arabidopsis.
XX
RN   [1]
RP   1-7405
RA   Kuwahara A., Kato A., Komeda Y.;
RT   ;
RL   Submitted (16-DEC-1998) to the EMBL/GenBank/DDBJ databases.
RL   Ayuko Kuwahara, Hokkaido University, Division of Biological Sciences,
RL   Graduate School of Science; Kita 10, Nishi 8, Kitaku, Sapporo, Hokkaido
RL   060-0810, Japan (E-mail:kuwahara@sci.hokudai.ac.jp, Tel:81-11-706-2740,
RL   Fax:81-11-746-1512)
XX
RN   [2]
RX   DOI; 10.1016/S0378-1119(99)00565-X.
RX   PUBMED; 10689195.
RA   Kuwahara A., Kato A., Komeda Y.;
RT   "Isolation and characterization of copia-type retrotransposons in
RT   Arabidopsis thaliana";
RL   Gene 244(1-2):127-136(2000).
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .7405
FT                   /db_xref="taxon:3702"
FT                   /mol_type="genomic DNA"
FT                   /organism="Arabidopsis thaliana"
FT                   /ecotype="Landsberg erecta"
FT   repeat_region   1. .7405
FT                   /transposon="copia-like retrotransposon AtRE1"
FT   repeat_unit     868. .872
FT                   /note="target site duplication"
FT   LTR             873. .1039
FT   primer_bind     1040. .1053
FT   CDS             1171. .5514
FT                   /codon_start=1
FT                   /db_xref="GOA:Q9SXQ3"
FT                   /db_xref="InterPro:IPR000276"
FT                   /db_xref="InterPro:IPR001584"
FT                   /db_xref="InterPro:IPR001878"
FT                   /db_xref="InterPro:IPR005162"
FT                   /db_xref="UniProt/TrEMBL:Q9SXQ3"
FT                   /transl_table=1
FT                   /gene="AtRE1"
FT                   /product="polyprotein"
FT                   /protein_id="BAA78425.1"
FT                   /translation="MSNVTKLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMPPATIGT
FT                   DAAPRVNPDYTRWKRQDKLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYG
FT                   HVTQFRTQLKQWTKGTKTIDDYMQGFVTHFDQLALLGKPMDHDEQVERVLENLPEEYKP
FT                   VIDQIAAKDTPPTLTEIHERLLNQESKILAVSSATVIPITANAVSHRNTTTTTNNNNGN
FT                   RTNRYDNRNNNNNSKPWQQSSSNFRPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLS
FT                   SVNSQQPPSPFTLWQPRANLALGSPYSSNSWLLDSGATHHITSDFNNLSLHQPYTGGDD
FT                   VMVVDGSTIPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFL
FT                   ASFQVKDLNTGVPLLQGKTKDELYEWPIASSQPVSLFASPSSKATHSSWHARLGHPAPS
FT                   ILNSVISNYSLSVLNPSHKFLSCLDSLINKSNKVPFSQSTINSTRPLEYIYSDVWSSPI
FT                   LSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLVENRFQTRIGTFYSDNGG
FT                   EFVALREYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAF
FT                   AVAVYLINRLPTPLLQLESPCQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQ
FT                   CVFLGYSLTQSAYLCLHLQTSRIYISRHVRFDENCFPFSNYLATLSPVQEQRRESSCVW
FT                   SPHTTLPTRTPVLPAPSCSDPHHAATPPSSPSAPSRNSQVSSSNLDSSFSSSFPSSPEP
FT                   TAPRQNGPQPTTQPTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSSSSSPSPTT
FT                   SASSSSTSPTPPSILIHPPPPLAQIVNNNNQAPINTHSMGTRAKAGIIKPNLKYSLAVS
FT                   LAAESEPRTAIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKY
FT                   NSDGSLNRYKARLVAKGYNQRPGLDYVETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVN
FT                   NAFLQGTLTDDVYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGF
FT                   VNSVSDTSLFVLQRGKSIVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEELHY
FT                   FLGIEAKRVPTGLHLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTDPTE
FT                   YRGIVGSLQYLAFTRPDISYAVNRLSQFMHMPTEEHLQALKRILRYLAGTPNHGIFLKK
FT                   GNTLSLHAYSDADWTGDKDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEYRSVA
FT                   NTSSEMQWICSLLTELGIRLTRPPVIYCDNVGATYLCANPVFHSRMKHIAIDYHFIRNQ
FT                   VQSGALRVVHVSTHDQLADTLTKPLSRTAFQNFASKIGVTRVPPS"
FT   LTR             5521. .5688
FT   repeat_unit     5689. .5693
FT                   /note="target site duplication"
XX
SQ   Sequence 7405 BP; 2119 A; 1867 C; 1253 G; 2166 T; 0 other;

  AB021265  Length: 7405  May 9, 2005 17:26  Type: N  Check: 9847  ..

       1  aaaaaaaaaa atcaaattct caaattcaat aagggaaact tactgatccc
      51  aggaacagaa gaactccctc cctcaatagc agtggcgccg ccgaagagaa
     101  taagacgagg accgtgagtc ttagtagctg caacagccgt aagtgtatga
     151  gcgcatctag gaccaggagc atcgtcttca tcatcccaaa aggtttctag
     201  ggttttatac tgaggagcta gatgtagcca aggcttcgag cccattggaa
     251  accactttac gggtataaat cgatcccgca acagccaaaa tactgaacca
     301  gaagcagaga aatgagaaga cgaaacctag aaagcgaatc aaagaaggaa
     351  agactacggc actggacgaa agaactaggt agaaagttca gatcggagga
     401  gtaagaagaa ttcacggcgg agaacggtgg cgagttacga cggacggact
     451  gtcttctcca caaagtaaaa accctttttt tttttcagag actatttttt
     501  gtttttgaat taaaagagaa gaaaaaataa aataaaaatc tttgcttcgc
     551  cgagttttgg aggacacagt tttcttcgga tagaaaaggc gttttcgctg
     601  agatggactt ttagtcttct tctttagtca acggctttat atgtatgtga
     651  attgtctaga ttaccctttg ttttctttta tttgtatatc tggtaaagtg
     701  gtagtaatat ttaactttaa tttacaaaaa acaaaagatt tgaatctatc
     751  caaatttaat tactgtgtgt agacaattct aagaaaaaca tttctatttc
     801  aacactttat tatcgaaact ttgaaatttg aattttcgtt tagaggaaca
     851  ttctaattct atacacattt tctattaagg atatcaatca tttccttgat
     901  tgattccaat catatcttcc atgtttgtaa tattgtatat actcacattg
     951  attataagtg tgctcagcaa gtctctcttg tgtatatata ttgtaaatcg
    1001  ttgaatggaa taaatcaatc acttctataa acctttatat ggtatcagac
    1051  gcctagatcc aaaataataa taaaaaaaat cgatcttctt atttcttctt
    1101  cgtctctcct tcattggctg cccacgctga agaactcgtt ctcaacaaca
    1151  caaatatcct taacgttaac atgagtaacg ttaccaaact caccagcact
    1201  aactacctca tgtggagccg ccaggttcat gcgctcttcg atgggtatga
    1251  actcgctggt tttctcgatg gctctacaac catgccgcct gcgaccattg
    1301  gcacagatgc ggctccccgt gtcaatcccg actacactcg ctggaaaaga
    1351  caagacaagc tcatctacag tgccgttctt ggagccatct ccatgtcggt
    1401  tcaaccggcg gtgtcaagag ccactaccgc tgctcagatc tgggagactc
    1451  tccgcaagat ctacgccaac ccaagctatg gccatgtcac tcaatttcgg
    1501  acgcaactaa aacaatggac gaaaggtacc aaaacaattg atgactatat
    1551  gcaaggcttt gtaactcatt ttgaccagct tgcactcctc gggaaaccaa
    1601  tggatcatga cgaacaagtc gagcgagtgc tggaaaactt accagaggaa
    1651  tacaaacccg tcatagacca gatcgctgcc aaagatactc ctcccactct
    1701  caccgagatc catgagcgcc tactgaatca agaaagcaaa atcctcgccg
    1751  tcagctcagc cactgtcatt ccaatcaccg ccaacgccgt ctctcatcgc
    1801  aacaccacca ccaccaccaa caacaacaat ggcaaccgca ccaaccgtta
    1851  tgataaccgt aacaataaca acaactcaaa gccgtggcaa cagtcctcct
    1901  cgaatttccg cccaaacaat aaccaatcca aaccatattt gggaaagtgt
    1951  caaatctgtg gcgttcaagg acatagcgcc aaacgatgtt ctcaacttca
    2001  acattttctg tccagtgtca attcacagca accaccatct ccgttcactc
    2051  tgtggcagcc tagagcaaat ctcgcacttg gctctcctta ctcctccaac
    2101  agttggcttc tcgatagtgg cgctacacat catatcacat ccgacttcaa
    2151  caatttgtct cttcatcagc cctatactgg tggtgacgat gtgatggttg
    2201  ttgatgggtc tactatcccc atatcacata ctggttctac ttctctatcc
    2251  acaaaatctc gtcccttaaa tctacacaac attttatatg ttcctaatat
    2301  acacaagaat ttaatttctg tgtatcgctt atgcaatgct aatggtgtct
    2351  ctgtcgaatt ctttctggca tcctttcagg tgaaggatct caacacgggg
    2401  gtcccattac tccaaggcaa aactaaggat gagttgtatg agtggcccat
    2451  agcatcgtct caaccggtct ctctgtttgc gtctccaagc tcgaaagcta
    2501  ctcattcatc ttggcatgct cgtttaggcc atccggctcc ttcaatttta
    2551  aattctgtca tttcaaatta ttctctctct gtgttaaacc catctcacaa
    2601  atttctctct tgcttagatt cccttatcaa taaaagtaat aaagtccctt
    2651  tctctcaatc aaccataaat tcgaccagac cccttgaata catttattct
    2701  gatgtttgga gctcacctat tctttcacat gataattatc gatactatgt
    2751  catctttgtt gatcacttta cccgatatac ttggctctat cctttaaaac
    2801  agaaatctca agttaaagag acgttcatta cattcaaaaa tctggtggaa
    2851  aatcgttttc agacacggat tggtactttt tattcagaca atggtggtga
    2901  gttcgtcgca ctacgggaat acttctcaca acatggtatc tcgcatctca
    2951  cttctccgcc acatacacca gaacacaatg ggctctccga acgtaaacat
    3001  cgtcacattg ttgaaaccgg tctcaccttg ctatctcacg catccattcc
    3051  gaagacgtat tggccttacg catttgctgt agctgtttac ttgatcaatc
    3101  gactaccaac acctctactt cagcttgagt ctccgtgtca gaaattattt
    3151  ggcacatcac caaattacga taagctcaga gtttttggat gcgcctgtta
    3201  tccgtggctg agaccttaca atcaacacaa actcgatgac aaatctagac
    3251  agtgtgtgtt tttgggttac tcacttaccc aaagtgctta tctctgtctt
    3301  catcttcaaa caagccgaat atacatctcc agacatgtcc gctttgatga
    3351  aaattgcttt cccttctcca attatctcgc gactctctct ccggtacaag
    3401  aacagcgtcg tgaatcatct tgtgtttggt cccctcatac aactcttcca
    3451  actcgcactc cagttttgcc ggctccctcg tgttcagatc ctcaccacgc
    3501  tgccacacca ccgtcatcgc cgtcagctcc atcccgcaac tctcaggtat
    3551  cgtcttctaa ccttgactcg tctttctcct cttcttttcc atcttcccct
    3601  gagcccactg ctccaagaca aaatgggccg caacccacga cccagccaac
    3651  ccaaacacaa acccaaacac attcctctca aaacacatca caaaataacc
    3701  caacaaatga aagcccatca caattagccc aatcattgtc cacaccagcc
    3751  caatcttctt cctcttcgcc aagcccgaca acctcagctt cgtcatcttc
    3801  aacatctccg actccaccat caatcctgat acatccacct cctccgcttg
    3851  ctcaaattgt taacaacaac aaccaagctc cgatcaatac tcattccatg
    3901  ggcaccaggg ccaaagctgg catcattaaa ccaaatctca agtattctct
    3951  agcagtttct ttggccgctg aatcagaacc acgaacagct attcaagctc
    4001  tcaaagatga acgatggaga aatgccatgg ggtccgaaat aaatgcacaa
    4051  attggtaacc atacatggga tcttgttccg cctccaccaa gccatgtaac
    4101  gattgttggc tgccgttgga tcttcaccaa gaaatacaat tccgatgggt
    4151  ctctaaatcg atacaaggcg cgtcttgtgg ctaaaggata caatcagcga
    4201  cctggacttg attatgtaga gacattcagc cctgtaatca agtcaacttc
    4251  aattcgaatt gttctgggtg ttgcggttga tcgctcatgg cccatacgac
    4301  agctagacgt caacaacgcc ttccttcaag ggacactcac agacgatgtc
    4351  tacatgtcac agcctccagg attcattgat aaggatcgtc ctaattatgt
    4401  ctgcaaatta aggaaagctc tctatggttt gaaacaggct ccacgtgcct
    4451  ggtatgttga gctacggaat tatcttctca ccattggttt tgttaattcg
    4501  gtttctgata cgtctctctt tgttcttcaa cggggaaaat caattgtcta
    4551  catgcttgtc tatgttgatg acattctgat caccggcaat gacccgacct
    4601  tacttcacaa cactcttgat aacctatctc aacgtttttc agtcaaggat
    4651  catgaggaac ttcactactt tctgggaatc gaagcaaaac gggtgccaac
    4701  tggtttacat ctaagtcaac gtcgctacat tcttgatctg ctggctcgta
    4751  cgaatatgat cacggcaaaa ccggtgacaa ccccaatggc tccctcacca
    4801  aaattgtcgc tctattcggg taccaaacta acagatccta cggagtacag
    4851  aggcattgtt ggcagcctcc aatatcttgc atttacccgt cctgacatat
    4901  cttatgcggt caaccgattg tcacagttta tgcatatgcc tacagaagaa
    4951  catttgcaag ccttgaagcg tattcttcga tatttggctg gtactccaaa
    5001  tcatggtatc tttctcaaga aaggtaatac tctatccttg cacgcatatt
    5051  ctgatgctga ctggacaggg gacaaagatg attatgtttc aacaaatggc
    5101  tacattgttt atcttggtca ccatccaatc tcttggtcat ctaaaaagca
    5151  aaaaggggtt gtacgctcat caaccgaagc tgagtacagg tcggtagcta
    5201  atacttcttc agaaatgcaa tggatatgtt ccttactcac agagcttggt
    5251  attcgactca cacgtccacc agttatctac tgcgataatg ttggcgccac
    5301  gtatctctgt gcaaatccgg tgttccactc tagaatgaag cacatcgcga
    5351  tagattacca cttcatcaga aatcaggtac aatccggagc tcttcgtgtg
    5401  gttcatgtct ctacacacga ccaattagcg gacactctaa cgaagccgtt
    5451  gtcaagaaca gcttttcaaa atttcgcatc caagattgga gttacaagag
    5501  tccctccatc ttgagggggc gtattaagga tatcaatcat ctccttgatt
    5551  gattccaatc atatcttcca tgtttgtaat attgtatata ctcacattga
    5601  ttataagtgt gctcagcaag tctctcttgt gtatatatat tgtaaatcgt
    5651  tgaatggaat aaatcaatca cttctataaa cctttatatt ttcaataagc
    5701  tggagcctta ttggtaattt atctgaatat tcaaaatata atcattagct
    5751  actttatgga attgatttct tcttaaattt actataatta atctccttaa
    5801  atcacgaaaa aaataaaatt tgtttcttta aatagctgta aagtcaacta
    5851  actgagattt ataaagtcaa actattcctt ctgttattac cttcctctca
    5901  aagtctcaac ctttgcctgt aaataaccag agagagacaa agctcacaaa
    5951  acagaggaga atctctaaac agttaaaaat aaacaaaagt caaaaaccta
    6001  ctggtcaaat tcaatgcaac tgagtctaat actgaagctt attcacatcc
    6051  caagccactc tcataactcc aaaccttgaa ttcgcctctg tgaaagtcct
    6101  cccaacatca tctgatccat aagggatcat ctttttgttc ccatcatatg
    6151  actcactaag ctccattaca tcaactacac atatctgtct tgtctccagg
    6201  cttaaaccgt accattcacc gtctgttctc aacaccattc ctttcccgat
    6251  ctcgtctcct ctctcgtcca taagcctcac ttgttgccct tgttttatcc
    6301  cttcttctga tgatgacgat gatctcatca gactctctcc tgttttcggg
    6351  gtttctgttg cagtttgatc tttaattggt gttgatggtt tctgctgcca
    6401  agtgttttca tctccaggac tctccggtag atcccctgag ctgttattat
    6451  catgagctgg ccctgtttgc ttgtttgctc gagctagttt cgcttttcgg
    6501  ttattcagcc tgataataaa ctttcaatgt cagataatga atagtgatta
    6551  cgagattctc tttgcagctg acgaagagga acaaataaaa gcaagtacca
    6601  gtttttcagc tgcgaagatg taataacctc cgaaccctgc aaaatcatgt
    6651  aatcaacgag ggcatcgttc gtaatacgtg gttggaaatt aaaaagagta
    6701  gaattcttga actcaccttt tgacttattt tatcagccca taactgtctc
    6751  gaagctgaat tccgctgcaa atcaggttct tcagcaagcg ccttctcaat
    6801  catccccatt tgatcagcat tcataatact acgcttccgt ttcttcttct
    6851  gcttctcaaa gacaaggaag gtttcagatt tctcatcctc cttcacctct
    6901  cctgatgcac tgcctttaaa tcgcttgctc atattctgaa ccaactctcc
    6951  ctcttcaacc agacccttcc ctctgttaga gcttgtatct gaaccactgg
    7001  tttcaagatt gctagcatct gcatcgctct ctttcaaccg ttcaactgtg
    7051  tctatctcct cgttcacgcc ttgctttgtc atcacacctt caactcggac
    7101  atcacaatcc tctgaagctt cctcattgtt aaggttcaga agctctttta
    7151  gcttaccaga taaattcccg cccctacctt caatatccta gtacaacata
    7201  acatttgaag cttagcaaaa gatctgtctt ttgtagaaaa tcaacatacg
    7251  atggacatga gtaaaacctt gttacaccta cttgtatcag tgtaactaaa
    7301  cagataagcc agagaattgt aaaaccaatg tataagagag caaaaagctt
    7351  tttcaccttc atctgtactt gactttcctc aaactcggaa tggattaacg
    7401  gctgt