Sequence of DPV Trichomonas vaginalis virus 1
Trichomonas vaginalis virus 1 strain TVV1-OC5, complete genome.
ACC No: HQ607523
Dated: 2011-05-08 | Length: 4680 | CRC: 1437201674
ID HQ607523; SV 1; linear; genomic RNA; STD; VRL; 4680 BP. XX AC HQ607523; XX DT 08-MAY-2011 (Rel. 108, Created) DT 08-MAY-2011 (Rel. 108, Last updated, Version 1) XX DE Trichomonas vaginalis virus 1 strain TVV1-OC5, complete genome. XX KW . XX OS Trichomonas vaginalis virus 1 OC Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus. XX RN [1] RP 1-4680 RX DOI; 10.1128/JVI.00220-11. RX PUBMED; 21345965. RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W., RA Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H., RA Singh B.N., Fichorova R.N., Nibert M.L.; RT "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by RT Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)"; RL J. Virol. 85(9):4258-4270(2011). XX RN [2] RP 1-4680 RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A., RA Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.; RT ; RL Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases. RL Department of Microbiology and Molecular Genetics, Harvard Medical School, RL Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA XX FH Key Location/Qualifiers FH FT source 1. .4680 FT /organism="Trichomonas vaginalis virus 1" FT /host="Trichomonas vaginalis" FT /strain="TVV1-OC5" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="Jan-2010" FT /db_xref="taxon:674953" FT gene 323. .4613 FT /gene="pol" FT CDS join(323. .2349,2351. .4613) FT /codon_start=1 FT /gene="pol" FT /product="RNA-dependent RNA polymerase" FT /note="translated via ribosomal frameshift" FT /protein_id="AED99820.1" FT /translation="MEASANGLSHDDNANKSQNVGPSTLPGSDKQGGENHENSFNSFSN FT DFFFNFLRTSTSTHISDSPGVSFVLKDGTPYTSATIQSAVGRLTHNVVASAVQLNITAN FT NTLEVDYGFGQDVSRATGTITIPVFDGEKYKEVARALSLIFSKKGTAIDTTSQTIQDTL FT KNSDLTIATVAAGYYTALAARHELTKAESTAAHRIPFATALSDTFSAAGDALRSSHVIS FT SCLRCPASNNAQRQVTVGTNMWTNVSVENIAVQGLIIPNPNDVSFFIPNKSLPPSWWCA FT IWLLNAFLHSFIAQTRIHIFITPGETYNLAPFTDADIYEAIPILLGMSKTSRPVPESVE FT SMLYAYGAQMVIQPHSLYTEGGIVRRMIFTVPHLPAHGYFIANTEYSRYMNIAVPNDPR FT TAKDYIIGVGTGLLQIILAYQAAFSCAGPIALHWHDNDAISNGMDTVAAAYLEGRYFTI FT PMAVNVATNIAQYTTRVRADPQYKHTLDRILPRIFGPSTDTVFNFIESAITSSWVSINA FT TKRNGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSNIAQEVEYLTNGGDLR FT NCPILRTLKAAEAEETVTFMCTGKIGSIFAVDGTMRTLKRYQTVDLADLGWTSHGKVMK FT PYAFRAPVIQGITVCKTAYTSTAIDIVTTVFGPLRLRVGTLLSKAVRCGPIIPSVRHHF FT NIRHIITVKRNGNEYVFIPGYGWVLQDDYLVNSVKMTGVDQLPPNQLPYGDDLLFIYAE FT ILLYNYISLFPKFRYKNPDLLNQETELQLFPLKTDSAARNKANFYARSLWNEAKTDKTA FT FRPGTYNDTVAGLLMWQQCALMWSLPRSVINRTISGVCDALTERTSLALLKRISDWLKQ FT LGLACSPIHRLFIELPTLLGRGAIPGDNVKDMKHRLKFDPSITVDVPKDELHALIYRLL FT SRNLNITKVDSFEHHLEERLLWSKSGSHYYPDDRINQLLPKQPTRKEFLDVVTVDYIKE FT CKPHVFIRQSRKLEHGKERFIYNCDTISYVYFDFILKLFEAGWQDSEAILSPGDYTSER FT LHTRISNYKYKAMLDYTDFNSQHTIQSMRLIFETMKELLPPETTFALDWCIASFDNMYT FT SDGHKWVSTLPSGHRATTFINTVLNWCYTQMVGLKFNSFMCAGDDVILLSQEPISLAPI FT LTSHFKFNPSKQSTGTRGEFLRKHYTTEGVFAYPTRAIASLVSGNWLSQSLRENTPILV FT PIQNGVDRLRSRAGLLGVPWILGLSELTEREAIPRDVSMALLNSHAAGPGLITRNYSSF FT TVTPKPPKLTSTLEYTATRFGVQDLSKHVPWEQLTLEERNKLGKQIKKMSHRHCSQAKI FT TYTCVHDFYKPSGLPTVLSGASQPSLSMAWWQAMLKEAMQDNFTKKLDAQMFASNACTD FT CVSGDAFLQASAKTAGVLFTSSILSSS" FT CDS 323. .2359 FT /codon_start=1 FT /gene="pol" FT /product="capsid protein" FT /protein_id="AED99819.1" FT /translation="MEASANGLSHDDNANKSQNVGPSTLPGSDKQGGENHENSFNSFSN FT DFFFNFLRTSTSTHISDSPGVSFVLKDGTPYTSATIQSAVGRLTHNVVASAVQLNITAN FT NTLEVDYGFGQDVSRATGTITIPVFDGEKYKEVARALSLIFSKKGTAIDTTSQTIQDTL FT KNSDLTIATVAAGYYTALAARHELTKAESTAAHRIPFATALSDTFSAAGDALRSSHVIS FT SCLRCPASNNAQRQVTVGTNMWTNVSVENIAVQGLIIPNPNDVSFFIPNKSLPPSWWCA FT IWLLNAFLHSFIAQTRIHIFITPGETYNLAPFTDADIYEAIPILLGMSKTSRPVPESVE FT SMLYAYGAQMVIQPHSLYTEGGIVRRMIFTVPHLPAHGYFIANTEYSRYMNIAVPNDPR FT TAKDYIIGVGTGLLQIILAYQAAFSCAGPIALHWHDNDAISNGMDTVAAAYLEGRYFTI FT PMAVNVATNIAQYTTRVRADPQYKHTLDRILPRIFGPSTDTVFNFIESAITSSWVSINA FT TKRNGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSNIAQEVEYLTNGGDLR FT NCPILRTLKAAEAEETVTFMCTGKIGSIFAVDGTMRTLKRYQTVDLADLGWTSHGKVMK FT PYAFRAPVIQGITVCKTAYTSTAIDIVTTVFGPLRLRVGTLFE" XX SQ Sequence 4680 BP; 1377 A; 1198 C; 928 G; 1177 T; 0 other; hq607523 Length: 4680 08-MAY-2011 Type: N Check: 6813 .. 1 gcaaaaagga gggagtagtc cgctcttctc ctttttgcac tcaacatttt 51 tactccatca tgacgaatcc atgacatgga catgtaacaa gcgttttgtc 101 ctcgatgatt gccatcctcg tgtgaactcc gggcaccgct tgcactgatg 151 atacctctta caaagctgga gagacacccg tcttgaagag ccgtaatgta 201 tcctctgcgc ctgggaccta atggtgattt tgctgtaggt actttttaag 251 ggaggaatta gggttgaaca tactagttcg ctagtatgcc ttttttctac 301 tttattaaga aattgaatac ccatggaggc ttctgctaat gggttatcac 351 atgatgataa tgcgaataaa tcgcaaaatg ttggaccttc tactcttccg 401 gggtcagata aacaaggagg agaaaaccac gaaaattctt ttaattcttt 451 ttcaaatgat ttctttttta attttttacg cacatctacg agtactcaca 501 tttcagacag tccaggagtt tcttttgttt tgaaggatgg aacaccatat 551 acatccgcta ccatccaatc cgctgtcggt cgtcttacac acaatgtcgt 601 cgcatcagca gtccaactca atattacagc aaacaatacg ttagaggtgg 651 actacggttt cggtcaggat gtttcaagag ctacaggaac catcacaatc 701 ccagtcttcg atggcgaaaa gtacaaagag gtagctcgcg ctttatcatt 751 aattttcagt aagaaaggta cggcgattga cactacgtct caaactattc 801 aagacaccct caaaaactcc gatctcacta ttgctaccgt cgctgctgga 851 tactacacag ccttagctgc tcgccatgaa cttaccaaag cagaaagcac 901 tgcagctcat cgcattccat tcgctacagc tttatcagat acattctcag 951 cagccggcga cgcgctgcgt tcaagccacg tcatctcttc ttgcttacgc 1001 tgccctgcct caaacaacgc acaacgacag gttacagtcg gaaccaacat 1051 gtggacgaac gtctccgtcg aaaacatcgc agtacaaggc ttgataattc 1101 caaatccaaa cgacgtatcg ttcttcattc cgaacaaatc tcttccacct 1151 tcctggtggt gcgcaatctg gcttctcaac gctttcctcc acagcttcat 1201 cgcacaaacc cgcatccaca tcttcatcac gccaggtgaa acttacaatc 1251 ttgcgccatt cacagatgcc gatatctacg aggctattcc tatcttactt 1301 ggaatgtcga aaacatcacg cccagttcca gaaagcgtcg aaagcatgct 1351 ctacgcatac ggcgcgcaga tggttatcca gccacactcg ctttacacag 1401 aaggcggtat agtcagaaga atgatcttta ccgtcccaca tcttccagca 1451 catggctact tcatagcaaa cacagaatac tcgagataca tgaacatcgc 1501 tgttccaaac gacccacgta cagctaagga ttacataatt ggtgtcggaa 1551 ctggcctctt acagatcata ctcgcctacc aagctgcttt tagctgcgct 1601 ggtccaatcg ctctccattg gcacgacaat gacgctatct caaatggcat 1651 ggatacggtt gcagctgctt acctcgaagg acggtacttc actatcccaa 1701 tggccgtcaa tgttgccaca aacatcgccc aatacactac aagagtcagg 1751 gctgacccac agtacaaaca cacactcgat cgaatcttac cacgcatctt 1801 cggtccatcg actgacacag tcttcaactt catcgaatcc gcaatcacat 1851 catcttgggt ctcgatcaac gcgacgaagc gtaacggtcg ggccagaaag 1901 ttcaggaccg ccttcatcaa tcgttttcat gatccagaat tcgcctacat 1951 gtttggcatc accggcaacg gtatcgagcg gatggaaggc aaggttacat 2001 ccaacatcgc acaagaagtc gaatacctca ccaacggcgg tgaccttcgc 2051 aactgcccta tccttcgcac cttaaaagct gcggaggcag aagagaccgt 2101 cacttttatg tgtacgggaa agatcggttc catcttcgcc gtcgatggta 2151 caatgcgcac gctcaagcgg taccaaacag tcgacctcgc cgacctcgga 2201 tggacatcgc atggcaaggt catgaaaccg tacgccttca gggccccagt 2251 catccaagga atcaccgtct gcaagacagc ttacacatcc acagctatcg 2301 acatcgttac aacagtcttc ggccccttac gcctccgcgt aggcaccctt 2351 tttgagtaag gctgtacgtt gtggccctat aataccatcc gtcaggcatc 2401 actttaacat aagacacatc ataacagtta aacgcaatgg taacgaatac 2451 gtatttatcc caggttacgg atgggtatta caggatgatt atttggtgaa 2501 ttccgtcaag atgactggtg tagatcaact acctcccaac cagttaccct 2551 atggcgatga tcttttattt atatatgcag aaattttact ttataactac 2601 atatctcttt ttcctaaatt cagatacaaa aatccagact tattaaatca 2651 agaaacagaa ttacagctct tcccacttaa aaccgactca gctgccagaa 2701 ataaagccaa cttttatgct agatcactat ggaacgaagc aaaaacagac 2751 aaaacagctt ttagaccggg aacatacaat gacacagtag ctggtctatt 2801 gatgtggcaa caatgtgctc tcatgtggtc actgcctcgc tcagttatca 2851 acagaacaat tagcggcgtt tgtgatgcat taaccgaaag gacttcactc 2901 gcgctattaa aacgtatctc cgattggttg aaacaactcg ggctggcttg 2951 ctcaccaatc catcgcttgt tcatagagct cccaacatta ctaggacgcg 3001 gagcaatccc aggcgataac gtgaaagaca tgaagcacag actcaaattc 3051 gatccatcta tcacagtaga cgttccaaaa gacgagttac atgccctaat 3101 ctacagacta ttatcaagaa atctcaacat aactaaagtt gacagcttcg 3151 aacaccacct agaagagcgt ttgctttggt ccaaatccgg cagtcattat 3201 tatccggacg acagaatcaa tcagctactt ccaaaacaac ccacaaggaa 3251 agaattctta gatgttgtaa ccgtagatta catcaaggaa tgcaagcctc 3301 acgtcttcat aaggcaatca cgtaagctag aacacggtaa ggaacgtttc 3351 atttataatt gcgatacgat ctcctatgtc tattttgatt ttatcctgaa 3401 gctcttcgaa gcgggatggc aagatagcga agcaatacta tcgccaggcg 3451 actatactag tgaacgctta catacaagaa tttcaaacta caaatataag 3501 gctatgctag attatacgga tttcaattca cagcatacaa tccaaagcat 3551 gagactgata ttcgaaacga tgaaggagtt actaccaccc gaaactactt 3601 tcgcactcga ttggtgtatt gcctcattcg ataacatgta cacatccgat 3651 ggtcataaat gggtctcgac tctcccaagc ggacatcgag ccacaacctt 3701 catcaacaca gtcctaaatt ggtgctacac acagatggta ggtcttaagt 3751 ttaacagttt tatgtgcgct ggtgatgatg tcattttatt gtctcaagag 3801 ccaatatcac tagccccaat tcttacatca cacttcaagt tcaatcccag 3851 caaacaaagt acaggtacaa gaggtgaatt cttacgcaag cattacacta 3901 cagaaggcgt gtttgcatac ccaacacgag caattgcaag cttagtaagc 3951 ggaaattggt taagtcaatc tttaagagag aacaccccaa tcttggtccc 4001 aatacaaaac ggagtcgaca gacttcgaag cagagcaggt ctactcggag 4051 tcccatggat tttgggcctc tcggagctca cggagcgaga ggccattcct 4101 agggatgtca gcatggctct gttgaattca cacgcagcag gacccggttt 4151 gatcactcgg aactacagtt ctttcacagt taccccgaaa ccacctaagc 4201 taactagcac actagagtac acagcaaccc gttttggtgt ccaggacctg 4251 tccaaacacg taccatggga acaacttaca ctggaagaac gtaataagtt 4301 agggaaacaa attaagaaaa tgagtcacag gcattgtagc caggcaaaga 4351 taacatacac ttgtgttcac gatttttaca aaccaagtgg cctccccacg 4401 gtgttatctg gtgccagcca gccatcgttg tcgatggcgt ggtggcaggc 4451 gatgcttaaa gaagcaatgc aggacaattt tactaagaag ttagatgcac 4501 aaatgttcgc ttcgaatgca tgtacagact gcgtcagcgg tgatgcattc 4551 ttgcaagcga gcgccaaaac tgctggtgta ttattcacta gctcgattct 4601 atcttcttca taacgtacag caaaaaagtc tctatagttg ctcaggacat 4651 atatgagcca gatggccccg ctataccttc