Sequence of DPV Leishmania RNA virus 1-4
Leishmania RNA virus 1-4 major capsid protein gene, complete cds, and putative RNA-dependent RNA polymerase gene, partial cds.
ACC No: U01899
Dated: 2006-11-14 | Length: 5283 | CRC: -1777506768
ID U01899; SV 1; linear; genomic RNA; STD; VRL; 5283 BP. XX AC U01899; XX DT 25-JAN-1994 (Rel. 38, Created) DT 14-NOV-2006 (Rel. 89, Last updated, Version 11) XX DE Leishmania RNA virus 1-4 major capsid protein gene, complete cds, and DE putative RNA-dependent RNA polymerase gene, partial cds. XX KW . XX OS Leishmania RNA virus 1 - 4 OC Viruses; dsRNA viruses; Totiviridae; Leishmaniavirus. XX RN [1] RP 1-5283 RX PUBMED; 8122377. RA Scheffter S., Widmer G., Patterson J.L.; RT "Complete sequence of Leishmania RNA virus 1-4 and identification of RT conserved sequences"; RL Virology 199(2):479-483(1994). XX RN [2] RP 1-5283 RA Scheffter S.M.; RT ; RL Submitted (17-SEP-1993) to the EMBL/GenBank/DDBJ databases. RL Scott M. Scheffter, Children's Hospital of Boston, Division of Infectious RL Diseases, 300 Longwood Avenue, Boston, MA 02115, USA XX FH Key Location/Qualifiers FH FT source 1. .5283 FT /organism="Leishmania RNA virus 1 - 4" FT /lab_host="Leishmania braziliensis guyanensis" FT /mol_type="genomic RNA" FT /clone="pBSLRV-7, pBSGAP, pBS190, pCR-D5" FT /db_xref="taxon:12530" FT stem_loop 1. .37 FT /note="conserved in virus isolate LRV1-1" FT stem_loop 163. .189 FT /note="conserved in virus isolate LRV1-1" FT stem_loop 214. .231 FT /note="conserved in virus isolate LRV1-1" FT stem_loop 269. .284 FT /note="conserved in virus isolate LRV1-1" FT stem_loop 403. .443 FT /note="conserved in virus isolate LRV1-1" FT misc_signal 447. .453 FT /note="Kozak concensus site" FT CDS 450. .2678 FT /codon_start=1 FT /product="major capsid protein" FT /note="ORF2" FT /db_xref="InterPro:IPR008871" FT /db_xref="UniProtKB/TrEMBL:Q83100" FT /experiment="experimental evidence, no additional details FT recorded" FT /protein_id="AAB50027.1" FT /translation="MADIPNSDKIACGRKPMFCEIIKLANRKRLIFNTTDERVYDARLN FT YCSTADSTVQADCHIYWRLKLRRTDAVFEEYTGQGYSLDTAAYPQQYTDIIRGYYSKHV FT SSSLAANTQHCVNVVAMLRHAGACIAHYCMTGKIDCDIVSKKKHKNKEVVTLSNADSVS FT FVAHSALYLPSPLRASDPEIFNMLYLLGCACDASIAMDNISNTSGAAKYSMPHYNPLQL FT SHALHVTIFYMLSLMDSCGYGDDAVLALTSGLHSVTTVIAHSDEGGITRDALRELSYTQ FT PYGTMPVPIAGYFQHINVLFTTQPAWDQFAGIWDYVILATAALVHLSDPGMTVNDVTYP FT TTLTTKVATVDGRNSDLAAQMMHSATRFCDIFVENLSTFWGVVANPDGNASQALLHAFN FT IVACAVEPNRHLEMNVMAPWYWVESSALFCDYAPFRSPISSAGYGPQCVYGARLVLAAT FT NSLEFTGEPGDYSAYRFEWTTMRHNPLFNILNKRVGDGLANVDFRLRPFNEWLLEGQPS FT RRSCNSAGHGTPTATCSHKTPNHDTLDEYIWGSTSCDLFHPAELTSYTTVCVRFRNYLS FT GADGDVRILNTPTREVIEGNVVTRCDGIRCLDSNKRIQHVPEVARRYCMMARYLAQART FT FGALTIGDDIIRGFDKVEKIVKMHKSNNRLDQMPLIDVTGLCQPMIETSTVRASTPTRI FT DPNKLAAATARVELPLAPRCTSSLIPSSDTVPEPEPQVGETGDNGGCA" FT variation 727 FT /replace="t" FT variation 837 FT /replace="t" FT variation 1512 FT /replace="c" FT variation 1849 FT /replace="g" FT variation 2146 FT /replace="g" FT variation 2210 FT /replace="t" FT variation 2520 FT /replace="a" FT /note="clone pCRII-5'.F2" FT variation 2582 FT /replace="g" FT /note="clone pCRII-5'.F2" FT CDS <2605. .5241 FT /codon_start=1 FT /standard_name="RDRP" FT /product="RNA-dependent RNA polymerase" FT /note="translation probably initiates at the 5' end of FT major capsid protein coding sequence and results in the FT expression of a fusion protein via a translational FT frameshift" FT /db_xref="GOA:Q83101" FT /db_xref="InterPro:IPR001795" FT /db_xref="UniProtKB/TrEMBL:Q83101" FT /inference="non-experimental evidence, no additional FT details recorded" FT /protein_id="AAB50028.1" FT /translation="SHHLIQYPNQSLKLAKPVTTEAALDCFENVRRMISGFLNLSWLTV FT TSKERQHKSDSNYFYDYNLFFSQMPAIVFNQLKCCVKTQVDDAIKTILQKTKKVEPSKQ FT VERTLTLSVLNTFLGYLDLGRYVTQYTEQQSGPVAAKLLLTLLSSALIALVPAKSDPNL FT CQNKIPRHYYQLKAHVGAQKKVNLTAIEVIRGCQHECSVVYCEYMRYSAYFSGLYDDQV FT AAILLYATAAHGVQGFGARFSILWALTCVKAPDFADEINIYIKHRGMSGLLPQLVEMKC FT LLGRGVSEIDVELEARNRLNVKNLNMQKFNEDELRAAVRQVYSEEIRRSVSYPPICDFW FT SSRWLWAANGSHSRALEHAHPELATRKEGQAYRKAVMEQWQHNPMDRWDGTVYVTPSAK FT LEHGKTRLLLACDTLSYMWFEYALRPVERIWENSNVILDPGSIGNCGIAARVNKWRNSV FT RGQSFFAVDYDDFNSQHTLLSQKIVFEELFQHINCNMSWTRTLIDSFDSMELWVKGKRA FT GVVAGTLMSGHRATSFINSVLDRAYIICAGGHVPTSMHVGDDILMSCTFDHADNLIANL FT TENGIRLNASKQVFSKTSGEFLRVAHREHTSHGYLARVVSSAVSGNWVSDHTLNQQEAL FT MNATVCCRGILNRSLPGDKNPVVRVISRSVSKRTKVDERIIRLLLSGKACLRGGVVYGE FT QTNCIQVYKINCRVERLEEKLPPYKHATEDYLNNHLTGIEVMAVRQYGSDIADIMAQAS FT WKKSMSNESAEEISRLSLSRDKNLPCLYCITEDEVATLPVRYGLFTSYPILMMLKDRMP FT IKEALKLAITVGYRPQPNSDIELDLWGESKNSCAIEGILPYNEATSMAQKLPCGGVVIQ FT VIHNVYV" FT stem_loop 2623. .2659 FT /function="possible role in translational frameshifting" FT /note="possible pseudoknot formed with adjacent FT nucleotides" FT variation 2652 FT /replace="c" FT /note="clone pCR-G2.R3" FT variation 2654 FT /replace="g" FT /note="clone pCRII-G2.R3" FT variation 2966 FT /replace="c" FT /note="clone pCRII-E1.T12" FT variation 2967 FT /replace="t" FT /note="clone pCRII-E1.T12" FT variation 3054 FT /replace="a" FT /note="clone pCRII-E1.T12" FT misc_binding 3796. .3819 FT /bound_moiety="purines" FT /note="conserved in virus isolate LRV1-1" FT variation 3819 FT /replace="c" FT /note="clone pCRII-E1.T12" FT variation 4219 FT /replace="a" FT /note="clone pCRII-E3.3'" FT variation 4520 FT /replace="t" FT /note="clone pCRII-E3.3'" FT variation 5037 FT /replace="a" FT /note="clone pCRII-E3.3'" FT stem_loop 5206. .5245 FT /note="conserved in virus isolate LRV1-1" FT stem_loop 5255. .5280 FT /note="conserved in virus isolate LRV1-1" XX SQ Sequence 5283 BP; 1466 A; 1135 C; 1304 G; 1378 T; 0 other; u01899 Length: 5283 14-NOV-2006 Type: N Check: 8734 .. 1 gcgaattcaa acgagatgcc taagagtttg gattcgctag ctgtccggat 51 ggtagtgtta cctgtggtcc accacggtaa agcattaagg gctagcctta 101 acctcactgg aattgaatga aagtgggaga tcagtggcct ccaacggttg 151 gactgactgg acggggggta atcgagtggg agtcccccac atcctacatt 201 tatgtagttc ctcacgatcc acagcaatcg tctggttgta tccaggttac 251 tgccgcgagc gtaagggagt gttttggcag acacaatcca atatgctgac 301 tacggtcggt gtggaggatc cgaaacgtaa gcaagtttct tgttactatt 351 gatcaacagc tacatcctac aacgacagct atcataccag ccatctttaa 401 cccaagaatt ccacgtgaac ataccttgag tatacaattc ttgagcaaaa 451 tggctgatat accaaactct gataagattg cttgcggtcg caagcctatg 501 ttttgcgaga ttataaaact tgcaaacagg aagaggttga tctttaatac 551 aaccgacgag cgtgtgtatg atgcgcgatt gaactattgt tcaaccgcag 601 attctacagt gcaagcagat tgccacatct actggcgact caaactgcga 651 cgcactgatg cagtttttga ggaatacacc ggtcaagggt attcacttga 701 cactgctgct tacccacaac agtacactga tattatcaga gggtattaca 751 gtaagcacgt atcaagctct ctcgctgcaa acactcaaca ttgtgtgaac 801 gttgtggcta tgctacgcca tgctggtgca tgtatcgcac attattgcat 851 gactgggaaa attgattgtg acattgtgag taagaaaaaa cataaaaata 901 aagaggtggt gaccttgagc aatgctgaca gtgtgagttt tgtggctcac 951 tcggccttat atctgccttc tcctttacgt gccagcgatc ctgaaatctt 1001 taatatgtta tatttgcttg ggtgcgcttg tgacgctagt attgctatgg 1051 ataacatctc aaatacaagc ggggccgcta agtactcaat gccccattat 1101 aacccgctgc agctgtctca tgctctgcat gtgacgatat tttacatgtt 1151 aagcttgatg gacagctgcg ggtatggtga tgacgccgtc ctggcactaa 1201 cttctggctt acattccgtg accacagtta ttgcacacag tgatgaaggt 1251 ggtattacac gtgatgccct acgggaactg tcatacacgc aaccttatgg 1301 tactatgcct gtcccgattg ccggttactt ccagcacatt aacgtgctgt 1351 ttaccacaca gcccgcttgg gatcaatttg ctgggatttg ggattatgtt 1401 atcttggcga ctgctgctct ggtacatcta tctgatcctg gaatgactgt 1451 caacgatgtt acttacccta cgactctgac aactaaagtg gcaacggttg 1501 atgggcgtaa tagtgacttg gctgctcaga tgatgcacag cgcaacccga 1551 ttctgtgaca tattcgtgga gaatttaagc acattctggg gcgttgttgc 1601 taaccctgac gggaacgcaa gccaggctct actccacgct ttcaacatcg 1651 tggcttgtgc tgtagagccg aacagacatc tggagatgaa tgtgatggcc 1701 ccgtggtatt gggtagaaag ttctgctttg ttctgtgatt acgcgccatt 1751 tcgatcaccg atatcatccg caggctatgg ccctcaatgt gtctatggtg 1801 caaggctggt acttgctgct acgaactcgc tagagtttac aggtgaacca 1851 ggtgattatt ctgcataccg ttttgagtgg acaacaatgc gccacaaccc 1901 attatttaac atccttaaca aacgtgttgg tgatggtctt gccaatgttg 1951 atttcaggtt acgccccttc aatgaatggt tgttggaggg tcaaccaagt 2001 cgtcggagtt gtaactcagc tggccacggc acgcccacgg ccacatgttc 2051 tcacaaaacc ccaaaccacg acacgctaga tgagtacatc tggggcagta 2101 cgtcatgtga cctgtttcac ccggcggagt tgacttctta cacaaccgtg 2151 tgcgtcagat tccgcaacta cctatcgggt gctgacggtg atgttagaat 2201 actgaatacg ccgactaggg aagtaatcga gggaaacgtg gtcacaagat 2251 gtgacgggat aagatgtctg gacagtaata aacgaataca gcatgtccct 2301 gaagtggcaa ggcgctactg catgatggcg cggtacctcg cacaagcccg 2351 tacttttgga gcactaacta ttggggatga tattatacgt ggttttgaca 2401 aggttgaaaa aatcgtcaaa atgcataaga gtaataatag attggatcaa 2451 atgcctttaa ttgatgttac tggattatgt cagccaatga ttgaaacaag 2501 tacagtgcgc gcctcaacac cgacgcgtat tgatcctaac aaacttgcgg 2551 ctgctacggc ccgggttgag ttaccactag ccccacgttg tacatcttct 2601 ttaatcccat catctgatac agtacccgaa ccagagcctc aagttggcga 2651 aaccggtgac aacggaggct gcgcttgatt gttttgaaaa tgtgcgacgt 2701 atgattagtg gtttcttgaa tctgtcgtgg ctgacagtaa catctaaaga 2751 aaggcaacat aaaagtgaca gtaattattt ctatgattat aatctcttct 2801 tttcccagat gccagcgata gttttcaacc aactaaaatg ctgtgtaaag 2851 acacaagttg acgatgccat aaaaactatt ttgcaaaaaa cgaagaaagt 2901 tgaacctagt aaacaggttg aaagaacgct cactctctcc gtgctaaaca 2951 cgttcctggg gtacctggac ttgggtaggt atgtaacaca atacacagag 3001 cagcagtcag ggccagttgc ggcaaaactg ctcttgacac tcctctcgtc 3051 agcgctaatt gcattggtcc ctgcgaaatc agaccccaat ttatgtcaga 3101 ataagatacc acgccattat tatcagctta aggcgcatgt tggggctcag 3151 aaaaaagtaa acttaacggc aatcgaggtg atcaggggtt gccaacacga 3201 gtgcagtgtc gtttattgtg agtatatgcg ttacagtgct tacttttcgg 3251 gcctgtatga tgatcaggtc gctgccatcc tgttatatgc tacggcagca 3301 cacggtgtcc aaggatttgg ggctcgattt agtattctgt gggcgcttac 3351 gtgcgttaag gcacctgatt ttgcggacga aattaatatt tatatcaagc 3401 acagaggtat gtcaggacta ctacctcagc tagtagagat gaaatgcttg 3451 ctaggtcgtg gggtgagtga gattgatgtc gaacttgagg ctagaaacag 3501 actaaacgta aagaacctga atatgcagaa gttcaacgaa gacgaactac 3551 gagcagccgt tcgtcaagtt tactctgaag aaatcaggag gtctgtttcg 3601 tacccgccga tttgtgattt ctggtcgtca cgttggctgt gggctgcaaa 3651 tggctcgcac tcccgcgctc tggaacatgc acatcctgag cttgcaacta 3701 ggaaagaagg gcaggcctac agaaaagcgg taatggaaca gtggcagcat 3751 aatcccatgg accgctggga tggcacagtg tatgtcaccc ctagtgctaa 3801 gcttgagcat ggaaagacaa gactactgtt ggcatgtgac acactgtcgt 3851 atatgtggtt tgaatatgcc ttacgacccg ttgaaaggat ctgggagaac 3901 agtaatgtca tccttgaccc tgggagtatc ggcaactgcg gtattgcagc 3951 cagagttaat aagtggcgaa acagtgttcg tggacagagc ttctttgcgg 4001 tagattatga tgacttcaat tctcaacata cgttgctgtc acagaagatt 4051 gtgtttgagg aattgtttca gcatatcaac tgcaatatgt cctggacccg 4101 aacattaata gattcttttg acagcatgga gttatgggtt aaagggaaaa 4151 gagctggtgt ggtggcaggg acgctgatga gcggtcatag agctaccagc 4201 ttcataaact ctgttctcga cagggcatac attatatgtg caggcggcca 4251 cgtacccacg tctatgcatg ttggtgatga catcttgatg tcatgtacat 4301 ttgaccatgc cgataatcta atagctaatc ttacggagaa tgggattagg 4351 ttaaatgcta gtaaacaagt ctttagtaaa actagtgggg agttcctacg 4401 agtggcacac cgtgaacaca caagccacgg gtatcttgcg agagtagtga 4451 gcagtgcggt atcggggaat tgggtgagtg atcacaccct caatcagcaa 4501 gaggccctca tgaatgctac agtatgttgc agaggaatcc tcaataggag 4551 cttaccaggt gacaagaatc ctgtggtcag ggtaatatca cgcagtgtaa 4601 gcaagagaac taaagtagat gaacgaataa ttcggctgtt actgagtggg 4651 aaagcgtgtc ttagaggtgg tgtcgtgtac ggtgaacaaa ctaactgcat 4701 ccaagtatat aagataaatt gcagggttga gcgcctagag gagaagctac 4751 ccccatacaa acatgcaact gaagattact tgaacaatca tctgacaggt 4801 attgaagtaa tggcagtcag acaatatgga tcggacattg ctgatatcat 4851 ggcccaggct agctggaaga aatccatgag taatgagagt gccgaagaaa 4901 tatctcgtct gtcactgtca cgcgacaaaa acctgccatg cctgtattgt 4951 atcacagagg atgaggttgc aacactgccg gttcgctacg ggttgttcac 5001 gagctatcca atcctaatga tgttgaagga taggatgccc atcaaagaag 5051 ccctcaaact agccataact gtaggttaca gaccccaacc taacagcgac 5101 atagagctag atttgtgggg tgagagcaaa aattcatgtg caattgaagg 5151 aatacttcct tataatgaag cgacttcaat ggcccaaaaa cttccttgtg 5201 gtggagttgt tatacaagta atacataatg tatatgtata aggacgcacc 5251 attcggaata tggcaagagt gccataaact atc