Sequence of DPV Trichomonas vaginalis virus 3
Trichomonas vaginalis virus 3 strain TVV3-OC5, complete genome.
ACC No: HQ607525
Dated: 2011-05-08 | Length: 4842 | CRC: 1948420080
ID HQ607525; SV 1; linear; genomic RNA; STD; VRL; 4842 BP. XX AC HQ607525; XX DT 08-MAY-2011 (Rel. 108, Created) DT 08-MAY-2011 (Rel. 108, Last updated, Version 1) XX DE Trichomonas vaginalis virus 3 strain TVV3-OC5, complete genome. XX KW . XX OS Trichomonas vaginalis virus 3 OC Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus. XX RN [1] RP 1-4842 RX DOI; 10.1128/JVI.00220-11. RX PUBMED; 21345965. RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W., RA Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H., RA Singh B.N., Fichorova R.N., Nibert M.L.; RT "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by RT Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)"; RL J. Virol. 85(9):4258-4270(2011). XX RN [2] RP 1-4842 RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A., RA Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.; RT ; RL Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases. RL Department of Microbiology and Molecular Genetics, Harvard Medical School, RL Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA XX FH Key Location/Qualifiers FH FT source 1. .4842 FT /organism="Trichomonas vaginalis virus 3" FT /host="Trichomonas vaginalis" FT /strain="TVV3-OC5" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="Jan-2010" FT /db_xref="taxon:170965" FT gene 360. .4690 FT /gene="pol" FT CDS join(360. .2445,2445. .4690) FT /codon_start=1 FT /gene="pol" FT /product="RNA-dependent RNA polymerase" FT /note="translated via ribosomal frameshift" FT /protein_id="AED99804.1" FT /translation="MSAPEPLNTEVRSPDGVSEATETQNLAITQSRVSNEKITDTQSDL FT QTLKKQSQPVSRSTDFETLYNYFYGLDVSPSTDRIGNAITRNTPVTDTNEVVSFPLTAS FT VSHTFSNSPVPAHIQPLQISIADDCINYELDESGTLCPALDSSVHVQRATSLASALKVK FT LTGEIMHSSSVRPIQTPQLIAYLYGVLLAVKDRINIHRNQPTNLWRSLCAAGRAAQAKP FT FFDEIPNNKFRPGALVAPPLPEAGFGPFPAEGLNQNSKLDFKAKAYVFYKQRTYNPDDM FT NRAFWFIWAIYNRMPNDFQNSYPLNITFCTSELPVQSPMPTADGISAEQCDKALLLLDK FT IVLEFFNNDRKLAYYYVFKGSQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA FT EQYANMISCASHPGIIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV FT ADENYWGIAPGSNAEPVGMYYTEIIQRKTEHNLFTETFMDIYGSTASVICANIETSLFT FT SGTEVINQRMQNDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLSGSFEV FT DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGLRPIQQKRKKMCYFRTLDGNFHEVT FT IRSETRDLQVWRDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLISGFAIPGRAY FT QGPSLAGSNRGRPDLSDVPATGSLSNLISLSKASRLPYRKRQNGVRVSDYTVARELACA FT FRNSRLTRQMDHVTDIAYLNFLRWVLLPYNGQTVRPHPTEWGQTPYPEHVNLKFLSKEM FT ELELFPLKKAPQADLKVNCYARNILASTELTDDLLKRCLPVGLNNDSVCGIVIVLELLL FT IAGVPSKLLPVIGQAIANKDPFIKELSDFNKMIGATSSRIANILTECNTLIGRGVKSSD FT PSADLYHRVAPEGNRHEAKISRHILIEAINKIYKNEMTDMPPPGDFMLHLITSPLWCKA FT GSHHHPHFAKYGSRLEFVMDVPADKIAAEPPAVYITQAEKLEHGKTRYIYNCDTIAYLF FT FDYILHYVECVWSNESVLLNPAAMSVERFSVLDYPEYCMIDYTDFNSQHSLESQKLVFE FT CLRPYLPSEMHPILDWCITSMDHMEINGQHWLSTLPSGHRATTFINSVLNKAYLIPYIG FT DTVSFHCGDDVLLCGEYDYQTLIDTLPYELNKSKQSFGPNAEFLRLHRRGGDVIGYPSR FT AVSSLVSGNWLSKTSWEWQPSLISVTNQCNVIISRSQLNIRFIPAMQQELRNRYADKMS FT EPFDVSSDYYVMPGCPCYSDAATTIVPNVPKLEHSDVPFSQAQKLFDTMRDSCPEFTTV FT NDIIDKVRARRSSSAASNITYNVGSPVAPQVCVVVNPNHYQFLLRKRYYPREHIAPPGF FT DASNDSKLVFTTYDLAPSIAMKSCAVLAPAKIICGHGLRSG" FT CDS 360. .2486 FT /codon_start=1 FT /gene="pol" FT /product="capsid protein" FT /protein_id="AED99803.1" FT /translation="MSAPEPLNTEVRSPDGVSEATETQNLAITQSRVSNEKITDTQSDL FT QTLKKQSQPVSRSTDFETLYNYFYGLDVSPSTDRIGNAITRNTPVTDTNEVVSFPLTAS FT VSHTFSNSPVPAHIQPLQISIADDCINYELDESGTLCPALDSSVHVQRATSLASALKVK FT LTGEIMHSSSVRPIQTPQLIAYLYGVLLAVKDRINIHRNQPTNLWRSLCAAGRAAQAKP FT FFDEIPNNKFRPGALVAPPLPEAGFGPFPAEGLNQNSKLDFKAKAYVFYKQRTYNPDDM FT NRAFWFIWAIYNRMPNDFQNSYPLNITFCTSELPVQSPMPTADGISAEQCDKALLLLDK FT IVLEFFNNDRKLAYYYVFKGSQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA FT EQYANMISCASHPGIIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV FT ADENYWGIAPGSNAEPVGMYYTEIIQRKTEHNLFTETFMDIYGSTASVICANIETSLFT FT SGTEVINQRMQNDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLSGSFEV FT DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGLRPIQQKRKKMCYFRTLDGNFHEVT FT IRSETRDLQVWRDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLISGFAIPGRAY FT QGPRLQVATEAAQI" XX SQ Sequence 4842 BP; 1296 A; 1312 C; 997 G; 1237 T; 0 other; hq607525 Length: 4842 08-MAY-2011 Type: N Check: 2840 .. 1 gcttaaaagc ttagtccact tttaagccgg tcatacttca accgtgatac 51 cggggcaaaa ttaatcaaca ccctcctgga atcgccgggg tgttgcgagc 101 cataagagac tggttctaaa ggactgacat agcgccgcga gggtaggcgg 151 tcgatagccc gtttgaggga atagcaatat tcctgattct ggtgtagcat 201 cgactggggc cccctagcgt gagctcagca cgttgggaaa acgaaaaact 251 gcatgcgcac agccttgcag tagcgtgagc tcaaggcacc ctaaaaagtg 301 cctcgtttca tgacgaactt tatgtcgtta tgaaatacta gtgtattgcg 351 tgcaacggta tgtcagctcc cgagccctta aatactgaag tacgctcacc 401 tgatggtgtt agtgaagcca cagaaactca aaacttggct atcactcaaa 451 gccgtgtgtc aaacgaaaaa ataactgaca cacaaagtga tctgcaaaca 501 cttaaaaaac agtcacaacc ggtcagcaga tccacagatt ttgaaactct 551 ttataattat ttttatggtt tagatgtctc tccttcaaca gatcgcattg 601 gtaatgcaat tacccgcaat accccagtca ctgatacgaa tgaggttgtt 651 agttttccac tcactgcatc tgtttcacac acattttcga attcgccagt 701 tccagctcac atacagcctc tccaaatttc tattgctgat gactgcatca 751 actacgagtt agatgagagc ggaacgttat gcccagcgct tgatagttct 801 gttcacgtcc agagagccac ttctcttgct agcgctctca aggtcaagtt 851 aacaggcgaa attatgcatt cttcatcagt tagaccaatt caaactcctc 901 aattaattgc ttatttatac ggtgttctcc ttgctgtcaa agaccgcatt 951 aacattcatc gtaatcagcc tacgaattta tggcgtagct tatgtgcagc 1001 aggtcgcgca gcccaagcaa agccgttctt cgatgaaatt cccaacaaca 1051 agttcaggcc cggtgccctc gtcgcacccc ctcttcctga agcaggattc 1101 ggtcctttcc cagctgaggg ccttaaccaa aattctaagc tcgatttcaa 1151 agcaaaagca tacgtcttct acaagcaacg cacctacaat ccagatgaca 1201 tgaatcgcgc attctggttt atctgggcaa tttacaaccg tatgcccaat 1251 gacttccaaa attcgtaccc actcaacatc actttctgca cttccgagct 1301 accagtccaa agcccgatgc caacagctga tggaatttcc gccgaacaat 1351 gcgataaagc gctccttcta cttgacaaaa tcgttctcga attcttcaat 1401 aacgaccgca aactcgctta ctactatgtg ttcaaaggaa gccagttcgt 1451 tatgcgtcct tgttcatgtt atcaagaagg aggcttgatc cgcaaggcct 1501 cacgcaatgt cgctcttcgc gcttttactg gcatctacta tctcgccgga 1551 ttcgccgaac aatacgctaa catgatttca tgcgcctccc atccaggaat 1601 catcggcgcc cttttccaat acgtcgacac tatggtctta caggccgttt 1651 tctctctttc cggccccaag cttgttcgct tcgcggctcc acctgaatat 1701 cagggtcgtc acgcttgtcc attttccttc gtcgccgatg aaaactattg 1751 gggcattgct cccggctcaa atgccgaacc agtcggtatg tattacacgg 1801 aaattatcca acgcaaaacc gagcacaatc tgttcaccga aacattcatg 1851 gatatctacg gttcgactgc ctccgtcata tgcgcaaata tcgaaacaag 1901 cttgttcaca tccggcactg aagttataaa ccagcgcatg caaaacgatt 1951 tcgctcgcga cacgccaaag cctggaaccc ttcgccacca gcatgccatc 2001 atcaatcgct tccacgaacc cgaatatgct taccgccttg gcatcctcgc 2051 tgatggcatt attccgctta gcggctcttt cgaagtcgac atcctcaaag 2101 aagctgaacg cctcatcaca ggcgaagaca tccgcaatct cccaggttta 2151 cgttgcttat gctctcgcgg tctcgacgcc atcctcggtc tccgtccaat 2201 ccaacagaaa cgcaagaaga tgtgttactt ccgcacactc gatggcaact 2251 tccatgaagt aacaatcaga tcggagactc gcgatctaca ggtctggcgt 2301 gatcatggct acctcgctcg cccatacgcg tgccacatcg ttgattcaga 2351 tggcatcgaa ttctacgaca aatccaacgg tctctataag ggacgcgtca 2401 acgttctcat ttccggattt gccattccag gtcgcgcata tcagggccct 2451 cgcttgcagg tagcaacaga ggccgcccag atctaagcga cgtcccggcg 2501 acaggaagtt tgtccaacct catcagcctt tctaaagcaa gtcggctacc 2551 ataccgtaag cggcagaatg gcgtgagagt gtcagactac accgtcgccc 2601 gcgagttagc ttgcgctttt cgcaattctc gcctaactcg ccaaatggat 2651 cacgtcacag atatagctta tctcaatttc cttagatggg tgttgttacc 2701 ttacaacggt caaaccgtac gaccacaccc caccgagtgg ggtcaaacac 2751 cctaccccga acacgtcaat ttgaagttct taagcaagga aatggagctc 2801 gaacttttcc cactgaagaa ggccccacaa gccgatctta aagtgaattg 2851 ttacgcgcga aacatccttg cttccacaga gctaacagat gatctcctca 2901 aacggtgtct gccagtcgga ctcaacaatg attcagtttg cggaattgtc 2951 atcgttttag agctgcttct cattgcaggt gtcccaagta agttattacc 3001 agtcattggc caagccatcg ccaacaaaga tccatttatt aaagaattgt 3051 ccgatttcaa taagatgata ggagcgacct cctcacgtat cgccaatatt 3101 ctcacagaat gtaacacatt gataggtcgc ggagtcaagt catctgaccc 3151 aagtgctgat ttgtatcacc gggtagcgcc tgagggcaat aggcacgaag 3201 cgaagatttc tcgacacatc ctcatcgaag ccatcaacaa aatctacaaa 3251 aacgaaatga cagacatgcc tccaccaggt gatttcatgc tccacttaat 3301 aacgagccct ctatggtgta aggctggctc tcaccaccat ccacactttg 3351 caaagtacgg ttcacgctta gaattcgtca tggacgttcc agcagacaaa 3401 atcgctgctg agccgcccgc tgtttacatt actcaagcgg agaaactaga 3451 acacggtaag actaggtaca tttacaactg cgatacaatt gcatacctat 3501 tcttcgatta catcttgcac tatgtcgagt gtgtatggtc aaacgagtca 3551 gttttactca acccagctgc tatgagtgtt gagcgattca gtgtcttaga 3601 ttacccggag tactgcatga tcgattacac agacttcaac tctcaacaca 3651 gcttagaatc acagaagcta gtttttgagt gtttgagacc atacttacca 3701 agtgaaatgc acccaatcct cgattggtgt atcaccagca tggaccatat 3751 ggaaattaac ggccagcatt ggttaagtac gctaccctca ggacatagag 3801 ctacgacatt tatcaactcg gtcctgaata aagcttactt aatcccttac 3851 ataggtgaca ccgtttcctt ccattgtggt gacgacgtgt tactatgtgg 3901 tgagtacgat taccaaaccc tcattgatac cctgccctat gaattaaaca 3951 agagcaaaca gagctttgga cctaatgccg agttcttgcg cttgcatagg 4001 cgcggtggtg acgttatagg ttatccatca agagctgttt cgagtcttgt 4051 atctggaaat tggttaagca aaacgtcatg ggagtggcaa ccaagcctca 4101 tttcggtcac taatcaatgc aatgttatta tctcgcgttc acaattgaat 4151 atcagattta tccccgccat gcaacaagaa ctacgtaacc gctacgcgga 4201 caagatgagc gaaccattcg atgttagttc ggattactac gtcatgccag 4251 gttgtccctg ctatagtgac gccgcgacga caatcgtgcc gaatgtcccc 4301 aaattggaac attcagacgt accgttttcg caggcacaaa aactttttga 4351 tactatgcgc gactcctgtc ctgagttcac aactgttaac gacatcatcg 4401 acaaagttag agctcgccgg tcttccagtg ctgccagtaa catcacgtac 4451 aacgtcggct cacctgtcgc acctcaagtt tgcgtagtcg taaatccaaa 4501 tcattaccag ttccttttgc gcaagagata ctacccacga gagcatattg 4551 ctccaccagg cttcgacgca tccaacgact caaaactcgt tttcacgact 4601 tacgatctcg ctccttcaat cgctatgaaa tcgtgcgctg ttttggcccc 4651 ggcaaagata atatgcggcc acggactacg cagtggttga gtagttctgt 4701 cgtaccaagc cacacttggt accggatagg ccacgaacgg tctcctgtct 4751 tcggaccctt cgcctatagg ttaataggaa tacagtgtta ctgttgtgtg 4801 tatcgcttta ggcacacgaa cgtactaccc cacgtttagt tc