Sequence of DPV Oryza australiensis RIRE1 virus
Oryza australiensis retrotransposon RIRE1, complete sequence.
ACC No: D85597
Dated: 2002-03-01 | Length: 8322 | CRC: -541569150
!!NA_SEQUENCE 1.0 ID D85597 standard; genomic DNA; PLN; 8322 BP. XX AC D85597; XX SV D85597.1 XX DT 21-SEP-1997 (Rel. 52, Created) DT 01-MAR-2002 (Rel. 70, Last updated, Version 3) XX DE Oryza australiensis retrotransposon RIRE1, complete sequence. XX KW aspartic proteinase; gag protein; integrase; polyprotein; KW reverse transcriptase; RNase H. XX OS Oryza australiensis OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; Ehrhartoideae; OC Oryzeae; Oryza. XX RN [1] RP 1-8322 RA Ohtsubo E.; RT ; RL Submitted (27-MAY-1996) to the EMBL/GenBank/DDBJ databases. RL Eiichi Ohtsubo, Institute of Molecular and Cellular Biosciences, RL Univ.Tokyo; Yayoi 1-1-1, Bunkyo-ku, Tokyo 113, Japan RL (E-mail:eohtsubo@ims.u-tokyo.ac.jp, Tel:03-5684-3269, Fax:03-5684-3269) XX RN [2] RX DOI; 10.1266/ggs.72.131. RX MEDLINE; 97480925. RX PUBMED; 9339541. RA Noma K., Nakajima R., Ohtsubo H., Ohtsubo E.; RT "RIRE1, a retrotransposon from wild rice Oryza australiensis"; RL Genes Genet. Syst. 72(3):131-140(1997). XX FH Key Location/Qualifiers FH FT source 1. .8322 FT /db_xref="taxon:4532" FT /mol_type="genomic DNA" FT /organism="Oryza australiensis" FT /clone_lib="The pCRII vector was used for the genomic FT library." FT /strain="W1538" FT repeat_region 1. .8322 FT /transposon="retrotransposon RIRE1" FT LTR 1. .1523 FT /note="5' LTR" FT CAAT_signal 1053. .1057 FT TATA_signal 1079. .1083 FT misc_signal 1334. .1337 FT /note="termination signal" FT primer_bind 1525. .1542 FT /note="complementary to 3' end of initiator methionyl tRNA" FT /note="putative (-)-strand primer binding site" FT CDS 2826. .6779 FT /codon_start=1 FT /db_xref="GOA:O23864" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001878" FT /db_xref="UniProt/TrEMBL:O23864" FT /product="polyprotein" FT /protein_id="BAA22288.1" FT /translation="MAANTTPSTFNLRSILEKEKLNGTNFMDWYRNLRIVLKQERKEYV FT LEVPYPEELPNNATATARRGFEKHTNDALDISCLMLATMSPELQKQYESSDAHTTIQGL FT RGMFENQARDERFNTSKSLFACRLVEGNPVSPHVIKMIGYIESLEKLGFPLSQELATDV FT ILQSLPPSFEPFILNYHMNNMDRTLAELHGMLKTVEESIQKNGHHVMMMQNAKRKPPVK FT KLCTKRKLTPDEIASASNAKKGKKGSAASDAVCFYCKETGHWKRNCKKYMEDLKKKQST FT TSASGINVIDINLATSPTDSWVFDTGSVAHSCKSLQGMRRSRGLRRGEVNLRVGNGASV FT ATVAVGTVPLHLPSGLVLELNNCYCVPTLCQNVISASCLQAEGYDFRSMNNGCSIYLRD FT MFYFHAPLVNGLYVLNLEASPIYNINTERQLSNDINPTFIWHCRLGHINKKRMEKLHKD FT GLLHSFDFESFETCESCLLGKMTKAPFTGHSERASDLLALVHTDVCGPMSSTARGGYQY FT FITFTDDFSRYGYIYLMRHKSESFEKFKEFQNEVQNHLGKTIKFLRSDRGGEYVSQEFG FT NHLKDCGIVPQLTPPGTPQWNGVSERRNRTLLDMVRSMMSQSDLPLSFWGYALETAALT FT LNRVPSKSVEKTPYEIWTGQPPSLSFLKIWGCEAYVKRLQSDKLTPKSDKCFVVGYPKE FT TKGYYFYNREQAKVFVARHGVFLEKEFLSRRVSGIRVHLEEVQETPETVSATTEPQQED FT QSVAPPVVDTPAPRRSERSRRAPDRYTGAEQRDILLLDNDEPKTYEEAMVGHDSNKWLG FT AMKSEIESMYDNQVWNLVDPPDGVKTIECKWLFKKKADMDGNVHIYKARLVAKGFKQIQ FT GVDYDETFSPVAMLKSIRIILAIAAYFDYEIWQMDVKTAFLNGNLSEDVYMIQPQGFVD FT PESPGKICKLQKSIYGLKQASRSWNIRFDEVIKGFGFIKNEEEACVYKKVSGSAIVFLI FT LYVDDILLIGNDIPMLESVKSSLKNSFSMKDLGEAAYILGIRIYRDRSKRLIGLSQSTY FT IDKVLKRFNMHDSKKGFLPMSHGINLSKNQCPQTHDERNKMGMVPYASAIGSIMYAMLC FT TRPDVSYALSATSRYQSDPGEGHWTAVKNILKYLRRTKDMFLVYGGEEDLVVSGYTDAS FT FQTDKDDYRSQSGFVFCLNGGAVSWKSSKQDTVADSTTEAEYIAASEAAKEAVWIKKFV FT SELGVMTSTTGPMSLYCDNSGAIAQAKEPRSHQKSKHILRRYHLIREIVDRGDVKICKV FT HTDLNIADPLTKPLPQPKHEAHTRAMGIRYLHD" FT mat_peptide 2826. .3674 FT /note="putative" FT /gene="gag" FT /product="gag protein" FT misc_feature 3588. .3629 FT /note="RNA binding motif" FT mat_peptide 3675. .4121 FT /note="putative" FT /gene="pro" FT /product="aspartic proteinase" FT misc_feature 3738. .3746 FT /note="protease motif" FT mat_peptide 4122. .5021 FT /note="putative" FT /gene="int" FT /product="integrase" FT misc_feature 4146. .4251 FT /note="zinc finger motif" FT misc_feature 4329. .4628 FT /note="integrase motif" FT mat_peptide 5022. .6287 FT /note="putative" FT /gene="rt" FT /product="reverse transcriptase" FT misc_feature 5796. .5807 FT /note="reverse transcriptase motif" FT mat_peptide 6288. .6776 FT /note="putative" FT /gene="rh" FT /product="RNase H" FT misc_feature 6315. .6701 FT /note="RNase H motif" FT primer_bind 6789. .6798 FT /note="polypurine tract as putative (+)-strand priming FT site" FT /note="putatie (+)-strand primer binding site" FT LTR 6800. .8322 FT /note="3' LTR" XX SQ Sequence 8322 BP; 2323 A; 1638 C; 2072 G; 2289 T; 0 other; D85597 Length: 8322 May 9, 2005 17:26 Type: N Check: 5781 .. 1 tgttagtgat atgccctaga ggcaacatgg agatgttgga tatcacgtca 51 atgtttatgt ataaatgaat aaagtgttat tcagttccat gaaatagaca 101 tcttgtattg atcaataagt acgtgactta tttatgagac tctatatgta 151 tgaatctatt tctaaacgat ccctgatcat atgtcatatt gttggaacaa 201 atatgattct aagatcggca tgtttattga ttgatgatca tgtctcatag 251 atcatgggta tggagatacc aaatcaataa acatggacat atgtgttaga 301 gaacacattg ttggatagac ccaccatgag acactacagg aattaatgtg 351 ccattagttg gtctcaggta gtgttggtac aaagtcctta gacctgagat 401 caccatggat tccaacatgt gtagtagcct actttgggac taccaaacgc 451 tattccgtaa ctgggtagtt ataaaggtag ttttcgggtt tgctatgaaa 501 catggagtgg gatgtgagtg atcaagatgg aatttgcccc tcctttggag 551 agatatctct gggcccctcg aggtaatgga ttatggaaat gcatggccat 601 gctaagttga ttgaggagtc aatcaacaga cagataatcc aatacggatc 651 gagtgaatgg ttaagctatt gaagggatgg cacacatctt gcctatagct 701 taactggtat cgtgaggcaa agggatttgt gtacacatta caggttcaga 751 gccgatatta tattcttgta tactatggtg tcaatatgtg ctgctaggcg 801 ccgctgttga caggtcggct gagttggact cgagccgacg atcggctaag 851 ttagactata gccgaaaccg agtatacctg aacctacagg gtcgcacgct 901 taaggggata agaacaggca ttcgagttgg actcggatac ggctcgatcg 951 gatagggatc cgatcggata gagtcctatg ggcttacgac gtatgggcgg 1001 ccaactctat gagatacgga tgagatccga ttcaaaatag agtcctgatc 1051 ggccactagt cccagtcggc caactctata taaaggaggg aggggtggag 1101 agctgcggta cgtcaattgc atcgcgtgca acagaagtcg ccgagatcaa 1151 tctccacgga aaccctcccc actttcgagc caaaccctag ccttgcatcg 1201 cgtgctagca cacgggtgtt cggcgttctg tccccggacg tgtggatacc 1251 ggtagaggcg ctgctacgtt gcacgctgtt gatcggctgc ggatcggcta 1301 cgacatccgc gaatcggctg attggtgaat cggttgtccc cgggtatcgg 1351 ctgtcggctt gcgggaatcg gctacactgt tccggatcgg atagccctga 1401 ttggctattt ccgctgcata ctccaatcgg taacgatcta tcggccctta 1451 cttgcaagtg ttcctggttt gcgcggtaaa aagtttttgt tttcggctag 1501 cgtagcttcc gcgtaaccct tcagtggtat cagagctaat cttgcttagt 1551 tcgggtttta gattggatct ggaacaagtt ttgcgcacgg gttgattaga 1601 tcaatcggtt ttacgggatt tgttgaggag agttgatggt aattgcgttc 1651 cttgatagtg tatcggaacg acgtaatgtg attacgtcga agcaatatcc 1701 atataactcg gtttgagttg ggttcgagat agcgttcctt gatagtgtat 1751 cggaacggta ttatgtgatt ataccgaagc tatctcgtag tccgactttg 1801 tttgctgcct cgcgcctacg caaaccatct cccttgatag tgtatcggga 1851 cggcataata tgattatgcc gatatggttt ccgtaagcga tttcatagat 1901 tggatctatg aattcgtgtg gtgttacggg cgccatgtgc gtaccccctt 1951 ataggtcgtc tcggtgcccg atctctgcga taggtagaaa acgtttctac 2001 gcctaacttg tttttgcaga caatgcctgc agatggtatg gccctattgt 2051 atgtatgaga tgcatgtgat gcatgtaata ttctttcata tgtgcaatcc 2101 ctgtaatatt ctttcatacg tgcaatgctt gtaatattcg ttcacgacct 2151 gcgagtcgat gtaatcttca attagagctt ctatagtagt agcttagttt 2201 gaagaaatca agcattcggc actcggaaga acggtgatga agatggagat 2251 cctctacacc ggcatggaga tggagatcac catgtgaagg gggccatact 2301 atttcactat atgctattct acttgctttc atatatgtga tatgtttgtt 2351 tgataggata gcatccctcg caaaattaag tagtaatgat gcccttccaa 2401 atgttgcacc cgtcaccgtt atgctcgtca atggtgggtc tgccaaagca 2451 gagtgccatt ctctatcata acacgagggg gtatatgtca gacatataca 2501 tgcaggagca tttggtttac ttaacaaggc tatcaaaaca gttttgggcc 2551 ttggggcata ggttggggcc ggggcatgga gatgccacac caacaacaag 2601 agtcacatag agatgtgatt agcaaggtgt tgcttaccga tgttattaat 2651 cttctagcag tgatgccaaa gctcactaga aaattagtta acatgtggat 2701 cttgacccag tacgtaacct agagggaagg tgcaaaatac gtaatgggag 2751 tcttatgtta aattgtttaa gaatagacat agcatgaacg ttgtgctaaa 2801 ctttgctttt tctgttgtag aataaatggc ggctaatact acacccagta 2851 cctttaattt gcgatctatt ctcgaaaagg aaaaattgaa tggaacaaac 2901 tttatggatt ggtatcgcaa cttgagaatt gttctcaagc aagagcgtaa 2951 ggagtacgtt cttgaagtac cctatcctga ggagttgcct aataatgcca 3001 ctgctactgc acggaggggt ttcgagaagc acactaatga tgccctagat 3051 ataagctgtc tcatgctagc tacaatgtcc cctgagcttc agaagcaata 3101 tgagagcagc gatgctcaca ctactattca gggactgcgt ggtatgtttg 3151 agaaccaagc tcgggacgag aggttcaaca cctcaaagtc cttgtttgcg 3201 tgcaggcttg ttgaggggaa tcccgtcagt ccgcatgtga taaagatgat 3251 tggctatatt gagagtctgg aaaaactagg ttttcccctt agccaagagt 3301 tggctactga tgtgattctc cagtcgctcc ctccgagctt cgagccattc 3351 atattaaact atcacatgaa caatatggat agaaccttgg ctgaattaca 3401 tgggatgcta aagacagttg aggagagtat ccagaaaaat ggtcatcatg 3451 tgatgatgat gcagaatgct aagcgcaaac cacctgtcaa gaaactttgc 3501 accaagagga agttaactcc cgatgagatc gcgagtgcct ctaatgcaaa 3551 gaagggcaag aaggggtcgg cggcatcaga tgccgtttgc ttctattgca 3601 aggagacagg ccactggaag aggaactgca agaagtacat ggaggatctc 3651 aaaaagaagc aaagtacgac ttctgcttca ggtattaatg ttatagacat 3701 taatcttgct acttcaccta ctgactcttg ggtatttgat accggatcag 3751 tagctcatag ttgcaaatcg ttgcagggaa tgagaagaag tagaggattg 3801 agaaggggcg aggtgaacct gcgcgtcggc aatggagcaa gcgttgctac 3851 agttgctgtc ggcacagtac cacttcatct accttcagga ttagttttgg 3901 aattgaataa ttgttattgt gttccaacac tatgtcaaaa cgttatttcc 3951 gcttcatgtt tgcaagcgga aggatatgat tttagatcaa tgaacaatgg 4001 ttgttcaata tatctcagag atatgttcta ttttcatgct ccattggtga 4051 atggattata cgttttgaat cttgaagcgt ctcccatcta taacattaat 4101 acagaaaggc aactctctaa tgatataaac cccacattta tctggcattg 4151 tcgcttaggc catataaata agaaacgcat ggagaagctc cataaggatg 4201 gattgcttca ctcttttgat tttgaatcat ttgagacatg tgagtcttgt 4251 ttacttggta agatgacaaa ggcacctttc acgggacata gtgagagagc 4301 aagtgactta ttggcactcg tacatactga tgtatgtgga ccaatgagct 4351 caacggccag aggtggttat caatacttca ttacctttac cgatgacttt 4401 agtagatatg gctatatcta cttaatgagg cataagtccg aatcctttga 4451 aaagttcaaa gaattccaga atgaagtaca gaatcattta gggaaaacaa 4501 tcaagtttct acgatcagat cgtggagggg aatacgtgag ccaagagttt 4551 ggtaatcatc tgaaagattg tggaattgtt ccacagttga ctccgccagg 4601 aactccacaa tggaacggag tgtccgaacg gagaaatcgc accttgttgg 4651 acatggtgcg gtcgatgatg agccaaagtg atcttccgtt atccttctgg 4701 ggatatgctc ttgaaacagc tgcgctcaca ctaaatagag ttccatctaa 4751 gtcagttgaa aagacaccat atgagatatg gacagggcaa ccccctagtt 4801 tgtcttttct caaaatttgg ggatgtgagg cttatgtaaa acgtttacaa 4851 tctgacaagc tcacacccaa atctgacaag tgcttcgttg tgggatatcc 4901 taaggaaact aagggatatt acttttataa tcgggaacaa gccaaagtgt 4951 ttgtcgcccg acatggtgtc ttcttggaga aagagtttct ttcaagaagg 5001 gttagtggga tcagggtgca tcttgaagaa gttcaagaaa caccagaaac 5051 cgtttcagcg accacagaac cacaacagga ggaccaaagt gttgcgccac 5101 cagttgtaga tacaccagcc ccacgaaggt ctgaaagatc acgtcgtgcg 5151 cctgacaggt acacaggtgc ggaacaacgt gatatattgt tgttggacaa 5201 cgatgaacct aagacctatg aggaagcgat ggtgggacac gattccaaca 5251 agtggcttgg agccatgaaa tccgaaatag aatccatgta cgacaatcaa 5301 gtttggaact tggttgatcc acctgatggt gtcaaaacca tcgagtgtaa 5351 atggcttttt aagaaaaagg ccgatatgga tggaaatgtt cacatctata 5401 aggcgcgatt ggtggcgaaa ggttttaaac agattcaagg agttgattat 5451 gatgaaacct tctcgcccgt cgcaatgctt aaatctattc ggattatcct 5501 agcgattgct gcatatttcg attatgagat atggcagatg gatgtcaaaa 5551 cggctttcct aaatggaaac ctaagcgagg atgtatacat gatacaacct 5601 cagggttttg tcgatccaga atcgcctgga aagatatgca agctacagaa 5651 atccatttat ggattgaagc aagcatctcg gagttggaat attcgttttg 5701 atgaagtaat caaagggttt ggtttcatca aaaacgaaga agaggcctgt 5751 gtttacaaaa aggtcagtgg gagcgcaatt gtatttctaa tcttatatgt 5801 ggatgacata ttactgattg gaaatgatat ccctatgcta gaatccgtca 5851 agtcttcatt gaaaaatagt ttttccatga aagacttagg ggaggcagca 5901 tacatattgg gcattcggat ctatagagat agatccaaga ggctaattgg 5951 attaagccaa agtacataca ttgacaaggt gttgaaaagg ttcaacatgc 6001 atgattccaa gaaaggtttc ttgcccatgt cacatggcat taatcttagc 6051 aagaatcagt gccctcagac acatgatgag cggaataaga tgggtatggt 6101 tccatatgct tcggcaattg gatccatcat gtatgccatg ctttgtacac 6151 gcccagatgt ctcgtacgct ttgagtgcta cgagcagata ccagtcagat 6201 ccaggtgaag gtcactggac tgccgtaaag aatatcctta agtacttgag 6251 aagaactaag gatatgttcc tagtctatgg aggtgaagaa gatctcgttg 6301 taagtggtta caccgatgct agcttccaaa cagacaagga tgattataga 6351 tcgcaatctg ggttcgtgtt ctgcctgaac ggaggcgcag tcagctggaa 6401 gagttccaag caggatactg ttgctgattc tacaacggag gccgagtaca 6451 ttgctgcttc ggaagctgca aaggaggctg tttggatcaa gaaattcgtt 6501 tctgagcttg gtgtgatgac tagtacgact ggtccaatgt ctctctattg 6551 tgataatagt ggagccattg cgcaagccaa ggagccgagg tcacatcaga 6601 agtccaaaca catacttcgg cgatatcatc tcatccgcga gatagtggac 6651 agaggtgatg tcaagatatg caaagtgcac acggatctca acatagccga 6701 tccgctgaca aaacctctcc ctcagccgaa gcatgaggcg cacacaagag 6751 caatgggtat tagatactta catgattgac tctagtgcaa gtgggagatt 6801 gttagtgata tgccctagag gcaacatgga gatgttggat atcacgtcaa 6851 tgtttatgta taaatgaata aagtgttatt cagttccatg aaatagacat 6901 cttgtattga tcaataagta cgtgacttat ttatgagact ctatatgtat 6951 gaatctattt ctaaacgatc cctgatcata tgtcatattg ttggaacaaa 7001 tatgattcta agatcggcat gtttattgat tgatgatcat gtctcataga 7051 tcatgggtat ggagatacca aatcaataaa catggacata tgtgttagag 7101 aacacattgt tggatagacc caccatgaga cactacagga attaatgtgc 7151 cattagttgg tctcaggtag tgttggtaca aagtccttag acctgagatc 7201 accatggatt ccaacatgtg tagtagccta ctttgggact accaaacgct 7251 attccgtaac tgggtagtta taaaggtagt tttcgggttt gctatgaaac 7301 atggagtggg atgtgagtga tcaagatgga atttgcccct cctttggaga 7351 gatatctctg ggcccctcga ggtaatggat tatggaaatg catggccatg 7401 ctaagttgat tgaggagtca atcaacagac agataatcca atacggatcg 7451 agtgaatggt taagctattg aagggatggc acacatcttg cctatagctt 7501 aactggtatc gtgaggcaaa gggatttgtg tacacattac aggttcagag 7551 ccgatattat attcttgtat actatggtgt caatatgtgc tgctaggcgc 7601 cgctgttgac aggtcggctg agttggactc gagccgacga tcggctaagt 7651 tagactatag ccgaaaccga gtatacctga acctacaggg tcgcacgctt 7701 aaggggataa gaacaggcat tcgagttgga ctcggatacg gctcgatcgg 7751 atagggatcc gatcggatag agtcctatgg gcttacgacg tatgggcggc 7801 caactctatg agatacggat gagatccgat tcaaaataga gtcctgatcg 7851 gccactagtc ccagtcggcc aactctatat aaaggaggga ggggtggaga 7901 gctgcggtac gtcaattgca tcgcgtgcaa cagaagtcgc cgagatcaat 7951 ctccacggaa accctcccca ctttcgagcc aaaccctagc cttgcatcgc 8001 gtgctagcac acgggtgttc ggcgttctgt ccccggacgt gtggataccg 8051 gtagaggcgc tgctacgttg cacgctgttg atcggctgcg gatcggctac 8101 gacatccgcg aatcggctga ttggtgaatc ggttgtcccc gggtatcggc 8151 tgtcggcttg cgggaatcgg ctacactgtt ccggatcgga tagccctgat 8201 tggctatttc cgctgcatac tccaatcggt aacgatctat cggcccttac 8251 ttgcaagtgt tcctggtttg cgcggtaaaa agtttttgtt ttcggctagc 8301 gtagcttccg cgtaaccctt ca