Sequence of DPV Oryza australiensis RIRE1 virus

Oryza australiensis retrotransposon RIRE1, complete sequence.

ACC No: D85597

Dated: 2002-03-01 | Length: 8322 | CRC: -541569150

                !!NA_SEQUENCE 1.0
ID   D85597     standard; genomic DNA; PLN; 8322 BP.
XX
AC   D85597;
XX
SV   D85597.1
XX
DT   21-SEP-1997 (Rel. 52, Created)
DT   01-MAR-2002 (Rel. 70, Last updated, Version 3)
XX
DE   Oryza australiensis retrotransposon RIRE1, complete sequence.
XX
KW   aspartic proteinase; gag protein; integrase; polyprotein;
KW   reverse transcriptase; RNase H.
XX
OS   Oryza australiensis
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; Ehrhartoideae;
OC   Oryzeae; Oryza.
XX
RN   [1]
RP   1-8322
RA   Ohtsubo E.;
RT   ;
RL   Submitted (27-MAY-1996) to the EMBL/GenBank/DDBJ databases.
RL   Eiichi Ohtsubo, Institute of Molecular and Cellular Biosciences,
RL   Univ.Tokyo; Yayoi 1-1-1, Bunkyo-ku, Tokyo 113, Japan
RL   (E-mail:eohtsubo@ims.u-tokyo.ac.jp, Tel:03-5684-3269, Fax:03-5684-3269)
XX
RN   [2]
RX   DOI; 10.1266/ggs.72.131.
RX   MEDLINE; 97480925.
RX   PUBMED; 9339541.
RA   Noma K., Nakajima R., Ohtsubo H., Ohtsubo E.;
RT   "RIRE1, a retrotransposon from wild rice Oryza australiensis";
RL   Genes Genet. Syst. 72(3):131-140(1997).
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .8322
FT                   /db_xref="taxon:4532"
FT                   /mol_type="genomic DNA"
FT                   /organism="Oryza australiensis"
FT                   /clone_lib="The pCRII vector was used for the genomic
FT                   library."
FT                   /strain="W1538"
FT   repeat_region   1. .8322
FT                   /transposon="retrotransposon RIRE1"
FT   LTR             1. .1523
FT                   /note="5' LTR"
FT   CAAT_signal     1053. .1057
FT   TATA_signal     1079. .1083
FT   misc_signal     1334. .1337
FT                   /note="termination signal"
FT   primer_bind     1525. .1542
FT                   /note="complementary to 3' end of initiator methionyl tRNA"
FT                   /note="putative (-)-strand primer binding site"
FT   CDS             2826. .6779
FT                   /codon_start=1
FT                   /db_xref="GOA:O23864"
FT                   /db_xref="InterPro:IPR001584"
FT                   /db_xref="InterPro:IPR001878"
FT                   /db_xref="UniProt/TrEMBL:O23864"
FT                   /product="polyprotein"
FT                   /protein_id="BAA22288.1"
FT                   /translation="MAANTTPSTFNLRSILEKEKLNGTNFMDWYRNLRIVLKQERKEYV
FT                   LEVPYPEELPNNATATARRGFEKHTNDALDISCLMLATMSPELQKQYESSDAHTTIQGL
FT                   RGMFENQARDERFNTSKSLFACRLVEGNPVSPHVIKMIGYIESLEKLGFPLSQELATDV
FT                   ILQSLPPSFEPFILNYHMNNMDRTLAELHGMLKTVEESIQKNGHHVMMMQNAKRKPPVK
FT                   KLCTKRKLTPDEIASASNAKKGKKGSAASDAVCFYCKETGHWKRNCKKYMEDLKKKQST
FT                   TSASGINVIDINLATSPTDSWVFDTGSVAHSCKSLQGMRRSRGLRRGEVNLRVGNGASV
FT                   ATVAVGTVPLHLPSGLVLELNNCYCVPTLCQNVISASCLQAEGYDFRSMNNGCSIYLRD
FT                   MFYFHAPLVNGLYVLNLEASPIYNINTERQLSNDINPTFIWHCRLGHINKKRMEKLHKD
FT                   GLLHSFDFESFETCESCLLGKMTKAPFTGHSERASDLLALVHTDVCGPMSSTARGGYQY
FT                   FITFTDDFSRYGYIYLMRHKSESFEKFKEFQNEVQNHLGKTIKFLRSDRGGEYVSQEFG
FT                   NHLKDCGIVPQLTPPGTPQWNGVSERRNRTLLDMVRSMMSQSDLPLSFWGYALETAALT
FT                   LNRVPSKSVEKTPYEIWTGQPPSLSFLKIWGCEAYVKRLQSDKLTPKSDKCFVVGYPKE
FT                   TKGYYFYNREQAKVFVARHGVFLEKEFLSRRVSGIRVHLEEVQETPETVSATTEPQQED
FT                   QSVAPPVVDTPAPRRSERSRRAPDRYTGAEQRDILLLDNDEPKTYEEAMVGHDSNKWLG
FT                   AMKSEIESMYDNQVWNLVDPPDGVKTIECKWLFKKKADMDGNVHIYKARLVAKGFKQIQ
FT                   GVDYDETFSPVAMLKSIRIILAIAAYFDYEIWQMDVKTAFLNGNLSEDVYMIQPQGFVD
FT                   PESPGKICKLQKSIYGLKQASRSWNIRFDEVIKGFGFIKNEEEACVYKKVSGSAIVFLI
FT                   LYVDDILLIGNDIPMLESVKSSLKNSFSMKDLGEAAYILGIRIYRDRSKRLIGLSQSTY
FT                   IDKVLKRFNMHDSKKGFLPMSHGINLSKNQCPQTHDERNKMGMVPYASAIGSIMYAMLC
FT                   TRPDVSYALSATSRYQSDPGEGHWTAVKNILKYLRRTKDMFLVYGGEEDLVVSGYTDAS
FT                   FQTDKDDYRSQSGFVFCLNGGAVSWKSSKQDTVADSTTEAEYIAASEAAKEAVWIKKFV
FT                   SELGVMTSTTGPMSLYCDNSGAIAQAKEPRSHQKSKHILRRYHLIREIVDRGDVKICKV
FT                   HTDLNIADPLTKPLPQPKHEAHTRAMGIRYLHD"
FT   mat_peptide     2826. .3674
FT                   /note="putative"
FT                   /gene="gag"
FT                   /product="gag protein"
FT   misc_feature    3588. .3629
FT                   /note="RNA binding motif"
FT   mat_peptide     3675. .4121
FT                   /note="putative"
FT                   /gene="pro"
FT                   /product="aspartic proteinase"
FT   misc_feature    3738. .3746
FT                   /note="protease motif"
FT   mat_peptide     4122. .5021
FT                   /note="putative"
FT                   /gene="int"
FT                   /product="integrase"
FT   misc_feature    4146. .4251
FT                   /note="zinc finger motif"
FT   misc_feature    4329. .4628
FT                   /note="integrase motif"
FT   mat_peptide     5022. .6287
FT                   /note="putative"
FT                   /gene="rt"
FT                   /product="reverse transcriptase"
FT   misc_feature    5796. .5807
FT                   /note="reverse transcriptase motif"
FT   mat_peptide     6288. .6776
FT                   /note="putative"
FT                   /gene="rh"
FT                   /product="RNase H"
FT   misc_feature    6315. .6701
FT                   /note="RNase H motif"
FT   primer_bind     6789. .6798
FT                   /note="polypurine tract as putative (+)-strand priming
FT                   site"
FT                   /note="putatie (+)-strand primer binding site"
FT   LTR             6800. .8322
FT                   /note="3' LTR"
XX
SQ   Sequence 8322 BP; 2323 A; 1638 C; 2072 G; 2289 T; 0 other;

   D85597  Length: 8322  May 9, 2005 17:26  Type: N  Check: 5781  ..

       1  tgttagtgat atgccctaga ggcaacatgg agatgttgga tatcacgtca
      51  atgtttatgt ataaatgaat aaagtgttat tcagttccat gaaatagaca
     101  tcttgtattg atcaataagt acgtgactta tttatgagac tctatatgta
     151  tgaatctatt tctaaacgat ccctgatcat atgtcatatt gttggaacaa
     201  atatgattct aagatcggca tgtttattga ttgatgatca tgtctcatag
     251  atcatgggta tggagatacc aaatcaataa acatggacat atgtgttaga
     301  gaacacattg ttggatagac ccaccatgag acactacagg aattaatgtg
     351  ccattagttg gtctcaggta gtgttggtac aaagtcctta gacctgagat
     401  caccatggat tccaacatgt gtagtagcct actttgggac taccaaacgc
     451  tattccgtaa ctgggtagtt ataaaggtag ttttcgggtt tgctatgaaa
     501  catggagtgg gatgtgagtg atcaagatgg aatttgcccc tcctttggag
     551  agatatctct gggcccctcg aggtaatgga ttatggaaat gcatggccat
     601  gctaagttga ttgaggagtc aatcaacaga cagataatcc aatacggatc
     651  gagtgaatgg ttaagctatt gaagggatgg cacacatctt gcctatagct
     701  taactggtat cgtgaggcaa agggatttgt gtacacatta caggttcaga
     751  gccgatatta tattcttgta tactatggtg tcaatatgtg ctgctaggcg
     801  ccgctgttga caggtcggct gagttggact cgagccgacg atcggctaag
     851  ttagactata gccgaaaccg agtatacctg aacctacagg gtcgcacgct
     901  taaggggata agaacaggca ttcgagttgg actcggatac ggctcgatcg
     951  gatagggatc cgatcggata gagtcctatg ggcttacgac gtatgggcgg
    1001  ccaactctat gagatacgga tgagatccga ttcaaaatag agtcctgatc
    1051  ggccactagt cccagtcggc caactctata taaaggaggg aggggtggag
    1101  agctgcggta cgtcaattgc atcgcgtgca acagaagtcg ccgagatcaa
    1151  tctccacgga aaccctcccc actttcgagc caaaccctag ccttgcatcg
    1201  cgtgctagca cacgggtgtt cggcgttctg tccccggacg tgtggatacc
    1251  ggtagaggcg ctgctacgtt gcacgctgtt gatcggctgc ggatcggcta
    1301  cgacatccgc gaatcggctg attggtgaat cggttgtccc cgggtatcgg
    1351  ctgtcggctt gcgggaatcg gctacactgt tccggatcgg atagccctga
    1401  ttggctattt ccgctgcata ctccaatcgg taacgatcta tcggccctta
    1451  cttgcaagtg ttcctggttt gcgcggtaaa aagtttttgt tttcggctag
    1501  cgtagcttcc gcgtaaccct tcagtggtat cagagctaat cttgcttagt
    1551  tcgggtttta gattggatct ggaacaagtt ttgcgcacgg gttgattaga
    1601  tcaatcggtt ttacgggatt tgttgaggag agttgatggt aattgcgttc
    1651  cttgatagtg tatcggaacg acgtaatgtg attacgtcga agcaatatcc
    1701  atataactcg gtttgagttg ggttcgagat agcgttcctt gatagtgtat
    1751  cggaacggta ttatgtgatt ataccgaagc tatctcgtag tccgactttg
    1801  tttgctgcct cgcgcctacg caaaccatct cccttgatag tgtatcggga
    1851  cggcataata tgattatgcc gatatggttt ccgtaagcga tttcatagat
    1901  tggatctatg aattcgtgtg gtgttacggg cgccatgtgc gtaccccctt
    1951  ataggtcgtc tcggtgcccg atctctgcga taggtagaaa acgtttctac
    2001  gcctaacttg tttttgcaga caatgcctgc agatggtatg gccctattgt
    2051  atgtatgaga tgcatgtgat gcatgtaata ttctttcata tgtgcaatcc
    2101  ctgtaatatt ctttcatacg tgcaatgctt gtaatattcg ttcacgacct
    2151  gcgagtcgat gtaatcttca attagagctt ctatagtagt agcttagttt
    2201  gaagaaatca agcattcggc actcggaaga acggtgatga agatggagat
    2251  cctctacacc ggcatggaga tggagatcac catgtgaagg gggccatact
    2301  atttcactat atgctattct acttgctttc atatatgtga tatgtttgtt
    2351  tgataggata gcatccctcg caaaattaag tagtaatgat gcccttccaa
    2401  atgttgcacc cgtcaccgtt atgctcgtca atggtgggtc tgccaaagca
    2451  gagtgccatt ctctatcata acacgagggg gtatatgtca gacatataca
    2501  tgcaggagca tttggtttac ttaacaaggc tatcaaaaca gttttgggcc
    2551  ttggggcata ggttggggcc ggggcatgga gatgccacac caacaacaag
    2601  agtcacatag agatgtgatt agcaaggtgt tgcttaccga tgttattaat
    2651  cttctagcag tgatgccaaa gctcactaga aaattagtta acatgtggat
    2701  cttgacccag tacgtaacct agagggaagg tgcaaaatac gtaatgggag
    2751  tcttatgtta aattgtttaa gaatagacat agcatgaacg ttgtgctaaa
    2801  ctttgctttt tctgttgtag aataaatggc ggctaatact acacccagta
    2851  cctttaattt gcgatctatt ctcgaaaagg aaaaattgaa tggaacaaac
    2901  tttatggatt ggtatcgcaa cttgagaatt gttctcaagc aagagcgtaa
    2951  ggagtacgtt cttgaagtac cctatcctga ggagttgcct aataatgcca
    3001  ctgctactgc acggaggggt ttcgagaagc acactaatga tgccctagat
    3051  ataagctgtc tcatgctagc tacaatgtcc cctgagcttc agaagcaata
    3101  tgagagcagc gatgctcaca ctactattca gggactgcgt ggtatgtttg
    3151  agaaccaagc tcgggacgag aggttcaaca cctcaaagtc cttgtttgcg
    3201  tgcaggcttg ttgaggggaa tcccgtcagt ccgcatgtga taaagatgat
    3251  tggctatatt gagagtctgg aaaaactagg ttttcccctt agccaagagt
    3301  tggctactga tgtgattctc cagtcgctcc ctccgagctt cgagccattc
    3351  atattaaact atcacatgaa caatatggat agaaccttgg ctgaattaca
    3401  tgggatgcta aagacagttg aggagagtat ccagaaaaat ggtcatcatg
    3451  tgatgatgat gcagaatgct aagcgcaaac cacctgtcaa gaaactttgc
    3501  accaagagga agttaactcc cgatgagatc gcgagtgcct ctaatgcaaa
    3551  gaagggcaag aaggggtcgg cggcatcaga tgccgtttgc ttctattgca
    3601  aggagacagg ccactggaag aggaactgca agaagtacat ggaggatctc
    3651  aaaaagaagc aaagtacgac ttctgcttca ggtattaatg ttatagacat
    3701  taatcttgct acttcaccta ctgactcttg ggtatttgat accggatcag
    3751  tagctcatag ttgcaaatcg ttgcagggaa tgagaagaag tagaggattg
    3801  agaaggggcg aggtgaacct gcgcgtcggc aatggagcaa gcgttgctac
    3851  agttgctgtc ggcacagtac cacttcatct accttcagga ttagttttgg
    3901  aattgaataa ttgttattgt gttccaacac tatgtcaaaa cgttatttcc
    3951  gcttcatgtt tgcaagcgga aggatatgat tttagatcaa tgaacaatgg
    4001  ttgttcaata tatctcagag atatgttcta ttttcatgct ccattggtga
    4051  atggattata cgttttgaat cttgaagcgt ctcccatcta taacattaat
    4101  acagaaaggc aactctctaa tgatataaac cccacattta tctggcattg
    4151  tcgcttaggc catataaata agaaacgcat ggagaagctc cataaggatg
    4201  gattgcttca ctcttttgat tttgaatcat ttgagacatg tgagtcttgt
    4251  ttacttggta agatgacaaa ggcacctttc acgggacata gtgagagagc
    4301  aagtgactta ttggcactcg tacatactga tgtatgtgga ccaatgagct
    4351  caacggccag aggtggttat caatacttca ttacctttac cgatgacttt
    4401  agtagatatg gctatatcta cttaatgagg cataagtccg aatcctttga
    4451  aaagttcaaa gaattccaga atgaagtaca gaatcattta gggaaaacaa
    4501  tcaagtttct acgatcagat cgtggagggg aatacgtgag ccaagagttt
    4551  ggtaatcatc tgaaagattg tggaattgtt ccacagttga ctccgccagg
    4601  aactccacaa tggaacggag tgtccgaacg gagaaatcgc accttgttgg
    4651  acatggtgcg gtcgatgatg agccaaagtg atcttccgtt atccttctgg
    4701  ggatatgctc ttgaaacagc tgcgctcaca ctaaatagag ttccatctaa
    4751  gtcagttgaa aagacaccat atgagatatg gacagggcaa ccccctagtt
    4801  tgtcttttct caaaatttgg ggatgtgagg cttatgtaaa acgtttacaa
    4851  tctgacaagc tcacacccaa atctgacaag tgcttcgttg tgggatatcc
    4901  taaggaaact aagggatatt acttttataa tcgggaacaa gccaaagtgt
    4951  ttgtcgcccg acatggtgtc ttcttggaga aagagtttct ttcaagaagg
    5001  gttagtggga tcagggtgca tcttgaagaa gttcaagaaa caccagaaac
    5051  cgtttcagcg accacagaac cacaacagga ggaccaaagt gttgcgccac
    5101  cagttgtaga tacaccagcc ccacgaaggt ctgaaagatc acgtcgtgcg
    5151  cctgacaggt acacaggtgc ggaacaacgt gatatattgt tgttggacaa
    5201  cgatgaacct aagacctatg aggaagcgat ggtgggacac gattccaaca
    5251  agtggcttgg agccatgaaa tccgaaatag aatccatgta cgacaatcaa
    5301  gtttggaact tggttgatcc acctgatggt gtcaaaacca tcgagtgtaa
    5351  atggcttttt aagaaaaagg ccgatatgga tggaaatgtt cacatctata
    5401  aggcgcgatt ggtggcgaaa ggttttaaac agattcaagg agttgattat
    5451  gatgaaacct tctcgcccgt cgcaatgctt aaatctattc ggattatcct
    5501  agcgattgct gcatatttcg attatgagat atggcagatg gatgtcaaaa
    5551  cggctttcct aaatggaaac ctaagcgagg atgtatacat gatacaacct
    5601  cagggttttg tcgatccaga atcgcctgga aagatatgca agctacagaa
    5651  atccatttat ggattgaagc aagcatctcg gagttggaat attcgttttg
    5701  atgaagtaat caaagggttt ggtttcatca aaaacgaaga agaggcctgt
    5751  gtttacaaaa aggtcagtgg gagcgcaatt gtatttctaa tcttatatgt
    5801  ggatgacata ttactgattg gaaatgatat ccctatgcta gaatccgtca
    5851  agtcttcatt gaaaaatagt ttttccatga aagacttagg ggaggcagca
    5901  tacatattgg gcattcggat ctatagagat agatccaaga ggctaattgg
    5951  attaagccaa agtacataca ttgacaaggt gttgaaaagg ttcaacatgc
    6001  atgattccaa gaaaggtttc ttgcccatgt cacatggcat taatcttagc
    6051  aagaatcagt gccctcagac acatgatgag cggaataaga tgggtatggt
    6101  tccatatgct tcggcaattg gatccatcat gtatgccatg ctttgtacac
    6151  gcccagatgt ctcgtacgct ttgagtgcta cgagcagata ccagtcagat
    6201  ccaggtgaag gtcactggac tgccgtaaag aatatcctta agtacttgag
    6251  aagaactaag gatatgttcc tagtctatgg aggtgaagaa gatctcgttg
    6301  taagtggtta caccgatgct agcttccaaa cagacaagga tgattataga
    6351  tcgcaatctg ggttcgtgtt ctgcctgaac ggaggcgcag tcagctggaa
    6401  gagttccaag caggatactg ttgctgattc tacaacggag gccgagtaca
    6451  ttgctgcttc ggaagctgca aaggaggctg tttggatcaa gaaattcgtt
    6501  tctgagcttg gtgtgatgac tagtacgact ggtccaatgt ctctctattg
    6551  tgataatagt ggagccattg cgcaagccaa ggagccgagg tcacatcaga
    6601  agtccaaaca catacttcgg cgatatcatc tcatccgcga gatagtggac
    6651  agaggtgatg tcaagatatg caaagtgcac acggatctca acatagccga
    6701  tccgctgaca aaacctctcc ctcagccgaa gcatgaggcg cacacaagag
    6751  caatgggtat tagatactta catgattgac tctagtgcaa gtgggagatt
    6801  gttagtgata tgccctagag gcaacatgga gatgttggat atcacgtcaa
    6851  tgtttatgta taaatgaata aagtgttatt cagttccatg aaatagacat
    6901  cttgtattga tcaataagta cgtgacttat ttatgagact ctatatgtat
    6951  gaatctattt ctaaacgatc cctgatcata tgtcatattg ttggaacaaa
    7001  tatgattcta agatcggcat gtttattgat tgatgatcat gtctcataga
    7051  tcatgggtat ggagatacca aatcaataaa catggacata tgtgttagag
    7101  aacacattgt tggatagacc caccatgaga cactacagga attaatgtgc
    7151  cattagttgg tctcaggtag tgttggtaca aagtccttag acctgagatc
    7201  accatggatt ccaacatgtg tagtagccta ctttgggact accaaacgct
    7251  attccgtaac tgggtagtta taaaggtagt tttcgggttt gctatgaaac
    7301  atggagtggg atgtgagtga tcaagatgga atttgcccct cctttggaga
    7351  gatatctctg ggcccctcga ggtaatggat tatggaaatg catggccatg
    7401  ctaagttgat tgaggagtca atcaacagac agataatcca atacggatcg
    7451  agtgaatggt taagctattg aagggatggc acacatcttg cctatagctt
    7501  aactggtatc gtgaggcaaa gggatttgtg tacacattac aggttcagag
    7551  ccgatattat attcttgtat actatggtgt caatatgtgc tgctaggcgc
    7601  cgctgttgac aggtcggctg agttggactc gagccgacga tcggctaagt
    7651  tagactatag ccgaaaccga gtatacctga acctacaggg tcgcacgctt
    7701  aaggggataa gaacaggcat tcgagttgga ctcggatacg gctcgatcgg
    7751  atagggatcc gatcggatag agtcctatgg gcttacgacg tatgggcggc
    7801  caactctatg agatacggat gagatccgat tcaaaataga gtcctgatcg
    7851  gccactagtc ccagtcggcc aactctatat aaaggaggga ggggtggaga
    7901  gctgcggtac gtcaattgca tcgcgtgcaa cagaagtcgc cgagatcaat
    7951  ctccacggaa accctcccca ctttcgagcc aaaccctagc cttgcatcgc
    8001  gtgctagcac acgggtgttc ggcgttctgt ccccggacgt gtggataccg
    8051  gtagaggcgc tgctacgttg cacgctgttg atcggctgcg gatcggctac
    8101  gacatccgcg aatcggctga ttggtgaatc ggttgtcccc gggtatcggc
    8151  tgtcggcttg cgggaatcgg ctacactgtt ccggatcgga tagccctgat
    8201  tggctatttc cgctgcatac tccaatcggt aacgatctat cggcccttac
    8251  ttgcaagtgt tcctggtttg cgcggtaaaa agtttttgtt ttcggctagc
    8301  gtagcttccg cgtaaccctt ca