Sequence of DPV Trichomonas vaginalis virus 3
Trichomonas vaginalis virus 3 strain TVV3-UR1, complete genome.
ACC No: HQ607515
Dated: 2011-05-08 | Length: 4845 | CRC: 333657510
ID HQ607515; SV 1; linear; genomic RNA; STD; VRL; 4845 BP. XX AC HQ607515; XX DT 08-MAY-2011 (Rel. 108, Created) DT 08-MAY-2011 (Rel. 108, Last updated, Version 1) XX DE Trichomonas vaginalis virus 3 strain TVV3-UR1, complete genome. XX KW . XX OS Trichomonas vaginalis virus 3 OC Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus. XX RN [1] RP 1-4845 RX DOI; 10.1128/JVI.00220-11. RX PUBMED; 21345965. RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W., RA Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H., RA Singh B.N., Fichorova R.N., Nibert M.L.; RT "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by RT Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)"; RL J. Virol. 85(9):4258-4270(2011). XX RN [2] RP 1-4845 RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A., RA Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.; RT ; RL Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases. RL Department of Microbiology and Molecular Genetics, Harvard Medical School, RL Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA XX FH Key Location/Qualifiers FH FT source 1. .4845 FT /organism="Trichomonas vaginalis virus 3" FT /host="Trichomonas vaginalis" FT /strain="TVV3-UR1" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="Jun-1999" FT /db_xref="taxon:170965" FT gene 363. .4693 FT /gene="pol" FT CDS join(363. .2448,2448. .4693) FT /codon_start=1 FT /gene="pol" FT /product="RNA-dependent RNA polymerase" FT /note="translated via ribosomal frameshift" FT /protein_id="AED99800.1" FT /translation="MSAPEPLNTEVRSPNGVSEAKETQNLAITQSGVSNEKITDTQSDL FT QTLKKQLQPVSRSTDFETLYNYFYGLQVPASTDRIGNAIQRNIPVNDTNEVVSFPLTAS FT VSHTFSNTPVPAHIQPLQISVADDCXNYELDESGTLCPALDSSVHVQRATSLASALKVK FT LTGEVMHSASVRPIQTPQLIAYLYGVLLAVQDRINIHRNQPTNLWRSLCAPGRAAQAKP FT FFDEFANNKFRAGPLLAPPLPDAGFGPFPAEGLNQNSKLDFKSKGYIFYKQRTYNPDDM FT NRAFWFLWAIYNRMPGEFQQYYQLNITFCTSELPVQNPIPNADGISNEQCEKALLLLEK FT IILELFNNDRKLAYYYIFKGSQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA FT DQYANMISCAAHPGVIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV FT ADENYWGIAPGSNAEPVGMYYMDIIQRKSEHNLFVDTFMDIYGSTASIICANIETSLFT FT SGTNVLNERMQKDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLSGSFEV FT DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGLRPIQQKRKKMCYFRTLDGNFHEVT FT IRSETRDLQVWRDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLVSGFAIPGRAY FT QGPSLAGSNRGRPDLSDVPATGSLSNLIYLSKASRLPYRKLKEGVRAADYTVARELACA FT FRSSRLTRQMDHVTDIAYLNFLRWVLLPYNGQTLRPHPTMWRQTPYPEHVNLKFLSKET FT ELELFPLKKAPQADLKVNCYARNILASTELTDDILKQSLPIGLNNDSVCGIVIVLELLL FT IAGVPSKLLPVIGQAIANKDPFIKELSDFNKMIGATTSRIANILTECNTLIGRGVKSSD FT PSADLYHRVAPEGNRHEAKISRHILIEAINKIYKNEMTDMPPPGDFMLHLITSPLWCKA FT GSHHHPHFAKYDSRLEFVMDVPADKIAVEAPSVYITQAEKLEHGKTRYIYNCDTVSYLF FT FDYILHYVECVWSNESVLLNPAAMSVERFSILDYPQYCMIDYTDFNSQHSLESQKLVFE FT CLRPYLPSEMHPVLDWCIASFEHMEINGQHWLSTLPSGHRATTFINSVLNKAYLIPYIG FT DAVSFHCGDDVLLCGEYDYQTLIDTLPYELNKSKQSFGPNAEFLRLHRCGGDVIGYPSR FT AVSSLVSGNWLSKTSWEWQPSLISVTNQCNVIISRSQLNIRFIPAMQQELRNRYADKMS FT EPFDVSSNYYVMPGCPCYSDAATTIVPNVPQLERSDAPFSQAQKVFDAMRDFCPEFTTV FT GDVIDKVRARRSSSAVKNIMYDVCAPVAPRISIVVNPAHYQFLLRKKYYPREHIAPTGS FT DNTDRTKLVFATYDLAPSIAMKSCAVLTPAKIISGHGLRSG" FT gene 363. .2489 FT /gene="cap" FT CDS 363. .2489 FT /codon_start=1 FT /gene="cap" FT /product="capsid protein" FT /protein_id="AED99799.1" FT /translation="MSAPEPLNTEVRSPNGVSEAKETQNLAITQSGVSNEKITDTQSDL FT QTLKKQLQPVSRSTDFETLYNYFYGLQVPASTDRIGNAIQRNIPVNDTNEVVSFPLTAS FT VSHTFSNTPVPAHIQPLQISVADDCXNYELDESGTLCPALDSSVHVQRATSLASALKVK FT LTGEVMHSASVRPIQTPQLIAYLYGVLLAVQDRINIHRNQPTNLWRSLCAPGRAAQAKP FT FFDEFANNKFRAGPLLAPPLPDAGFGPFPAEGLNQNSKLDFKSKGYIFYKQRTYNPDDM FT NRAFWFLWAIYNRMPGEFQQYYQLNITFCTSELPVQNPIPNADGISNEQCEKALLLLEK FT IILELFNNDRKLAYYYIFKGSQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA FT DQYANMISCAAHPGVIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV FT ADENYWGIAPGSNAEPVGMYYMDIIQRKSEHNLFVDTFMDIYGSTASIICANIETSLFT FT SGTNVLNERMQKDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLSGSFEV FT DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGLRPIQQKRKKMCYFRTLDGNFHEVT FT IRSETRDLQVWRDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLVSGFAIPGRAY FT QGPRSQVATEAAQI" XX SQ Sequence 4845 BP; 1296 A; 1281 C; 1016 G; 1251 T; 1 other; hq607515 Length: 4845 08-MAY-2011 Type: N Check: 3649 .. 1 gcttaaaaag cgaagtccac tttttaagcc ggtttaactt caaccgtgaa 51 taccagggca aaattaatca acaccctcct ggaatcgccg gggtgttgcg 101 agccataaga gactggttct aaaggactga catagcgccg cgagggtagg 151 cggtcgatag cccgtttgag ggggtagtaa tactcctgat tctggtgaag 201 catcgactgg ggccccctag cgtgagctca gcacgttgga aaaacgaaaa 251 actgcatgtg cacagctttg cagtagcgtg agctcagggc accctaaaaa 301 gtgctccgtt tcacaacaac ctatgcgttg ttgtgagact ctagtgtatt 351 gcgtgcaacg gtatgtcagc tcccgagccc ttaaatactg aagtacgctc 401 acctaatggt gttagtgaag ccaaagaaac tcaaaacttg gctatcactc 451 aaagcggtgt gtcgaacgaa aaaataaccg acacacaaag tgatctgcaa 501 acactcaaaa aacagttaca accggtcagc agatccacag atttcgaaac 551 tctttataat tatttttatg gtttacaagt tcctgcttca acagatcgta 601 ttggcaatgc tattcagcgt aacatcccag tcaatgatac gaacgaagtc 651 gttagctttc cgcttacagc atcggtttca cacacatttt ccaatacgcc 701 ggtacctgcc catatacagc ctctccaaat ctcagttgcc gacgactgcr 751 tcaactacga gctagacgag agcggaacat tatgcccagc gttagatagc 801 tctgttcacg tccaaagagc tacctccctt gctagcgctc tcaaagtcaa 851 attaacaggc gaagttatgc attctgcctc agtcaggcca atccagacac 901 cacagttgat cgcttatttg tatggcgtcc tccttgccgt ccaagatcgt 951 atcaacatcc atcgcaacca acctactaac ttatggcgca gcttatgtgc 1001 acctggtcgc gctgctcaag caaagccttt cttcgatgaa ttcgcaaaca 1051 acaaattcag ggcaggtccc ctcttggcac ctcccctccc tgatgctggt 1101 ttcggtccat tcccggcaga aggcctcaat cagaattcca agctcgactt 1151 caaatccaaa ggatacatct tctacaaaca gcgcacttac aacccagatg 1201 atatgaatcg tgctttctgg ttcctttggg cgatctacaa ccgcatgcct 1251 ggagaattcc aacaatacta tcagttgaac atcactttct gcacttccga 1301 gttaccagta caaaatccga taccaaatgc cgatggcatc tcaaatgaac 1351 aatgtgaaaa agcacttctc ctcctcgaaa aaattatcct cgaacttttc 1401 aataacgatc gcaaacttgc ttactactac atcttcaagg gaagccaatt 1451 cgttatgcgt ccttgttcct gctaccaaga aggaggctta attcgcaagg 1501 cttcacgtaa cgttgctctc cgtgctttta ctggcatcta ctacttagcc 1551 ggattcgcag atcaatacgc taacatgatt tcatgtgctg cccatccagg 1601 tgtcattggt gctcttttcc aatatgttga cacaatggtc ttacaagccg 1651 tcttctcgct ttctggtcct aagcttgttc gtttcgctgc cccacccgaa 1701 tatcaaggtc gccacgcttg cccgttctca tttgtggctg acgaaaacta 1751 ctggggtatt gctccaggct caaacgccga gccagttggc atgtactaca 1801 tggatatcat tcaacgcaaa tccgaacata atttgtttgt cgacacattc 1851 atggacatct atggttctac agcttcaatc atttgcgcta atatcgaaac 1901 tagcttattt acttctggca ctaatgtttt gaacgaacgc atgcagaagg 1951 atttcgctcg tgatacaccc aaacctggaa ctcttcgtca ccaacatgct 2001 atcatcaacc gcttccacga accagaatac gcttaccgcc tcggcatcct 2051 tgcagatggc atcattccgc tcagtggctc attcgaagtc gatatcctca 2101 aagaagctga gcgcctcatt actggtgagg atatccgcaa cctcccaggt 2151 ttacgttgct tatgctctcg tggcctcgac gctatcctcg gtctccgtcc 2201 aattcaacag aagcgcaaga agatgtgtta cttccgcacc ctcgacggca 2251 atttccacga agtaacaatc agatcggaga ctcgcgatct acaggtctgg 2301 cgtgaccacg gctacctcgc ccgcccatac gcgtgccaca ttgtcgactc 2351 agatggcatc gaattctacg acaagtccaa tggtctctac aagggacgcg 2401 tcaacgtcct cgtttccgga tttgccattc caggacgcgc atatcagggc 2451 cctcgctcgc aggtagcaac agaggccgcc cagatctaag cgacgtcccg 2501 gcgacaggaa gtctgtccaa cctcatctac ctttctaagg caagtcggct 2551 accataccgt aagctgaagg aaggcgtgag agcggcagac tacaccgtcg 2601 cccgcgagtt agcttgcgct tttcgcagtt ctcgcctaac tcgccaaatg 2651 gatcatgtca cagatatagc ttaccttaat ttcttgagat gggtgttgtt 2701 accttacaac ggtcaaactt tacgaccaca ccccaccatg tggcgtcaaa 2751 caccctaccc cgaacatgtc aatttgaagt tcctaagtaa ggagacggag 2801 ctcgaacttt tcccactgaa gaaggcccca caagccgatc ttaaagtgaa 2851 ttgttacgcg cgaaatatcc ttgcttctac agagcttact gacgatatac 2901 tcaaacagag tttgcccatt ggtctcaata atgactcggt ttgcggaatc 2951 gttattgttt tagagctact tctaattgca ggtgttccga gtaagttact 3001 accagttatt ggtcaagcaa tcgccaataa agatccattt attaaggaac 3051 tgtccgactt caacaagatg ataggagcga ccacttcccg tatcgctaac 3101 attcttacag agtgtaatac attaataggt cgtggtgtta agtcatctga 3151 cccaagtgct gatttgtatc accgggtagc gcccgagggc aataggcacg 3201 aggcgaagat ttctcgacac atcctcatcg aagccatcaa caaaatttac 3251 aaaaacgaaa tgacagacat gcctccaccg ggtgacttca tgctccactt 3301 gataacgagc cctctatggt gtaaggctgg ctctcaccat catccacact 3351 tcgccaagta tgattcgcgc ttggaattcg ttatggatgt tccagcagac 3401 aaaatcgctg ttgaagcacc ctctgtatac attactcaag ccgagaaatt 3451 agaacatggt aaaactagat acatttataa ctgtgataca gtttcatact 3501 tgttctttga ttacatctta cactatgtcg aatgtgtgtg gtcaaatgag 3551 tcagttctac tcaacccagc tgctatgagt gtcgagcgct ttagtatctt 3601 ggattacccg caatattgca tgatcgatta cacagatttc aactctcaac 3651 acagtctcga atcacagaag ctagtgttcg agtgtttgag accatactta 3701 ccaagcgaaa tgcatccagt cttggattgg tgtattgcca gctttgagca 3751 catggaaatc aacggacaac attggttaag cacgttgcct tcaggacata 3801 gggccacaac attcatcaac tcggtcctca ataaagcata cctgatccca 3851 tacataggcg acgcggtttc cttccattgt ggtgacgacg tgttactatg 3901 tggtgagtat gattaccaaa cactcattga taccctaccc tatgaattaa 3951 acaagagcaa acagagcttc ggacctaatg ccgagttctt gcgcttgcat 4001 aggtgtggtg gtgacgttat aggctatcca tccagagctg tttcgagtct 4051 tgtatctgga aattggttaa gcaagacatc atgggagtgg cagccaagtc 4101 tcatttcggt tacaaatcaa tgcaatgtga tcatctcgcg ttcacaattg 4151 aacatcaggt tcatccccgc aatgcaacaa gaactacgca atcgttacgc 4201 agacaagatg agtgaacctt tcgatgtcag ctccaactac tacgtcatgc 4251 caggatgtcc atgctatagc gacgccgcga cgacaatagt accgaatgtt 4301 ccccaactgg aacgttcgga cgcaccgttt tcgcaggcac aaaaagtttt 4351 tgatgctatg cgcgacttct gtcctgagtt cactactgtt ggcgatgtca 4401 tcgataaggt tagagctcgc cgatcttcaa gtgcagtcaa gaacatcatg 4451 tacgacgtat gcgcgcctgt tgcaccacgt atcagtatcg tagtgaaccc 4501 ggcacactat cagttcctct tacgcaagaa gtactaccca cgtgaacaca 4551 ttgcgcctac tggctccgat aatacagatc gaaccaaact cgttttcgca 4601 acatacgatc tcgctccttc aatcgccatg aagtcgtgcg ctgttttgac 4651 cccggctaag ataataagtg gtcacggact acgcagtggt tgaataatct 4701 gccagtacca ggcaacgatt ggtaccggct tggccacgca cggtctgctg 4751 tcttcggacc ctccgcctat aggttaatag gaacacagtg ttactgttgt 4801 gtgtatcgct ctaggcacac gaacgtacta ccccacgttt agttc