Sequence of DPV Trichomonas vaginalis virus 3
Trichomonas vaginalis virus 3 strain TVV3-OC3, complete genome.
ACC No: HQ607519
Dated: 2011-05-08 | Length: 4846 | CRC: -822401152
ID HQ607519; SV 1; linear; genomic RNA; STD; VRL; 4846 BP. XX AC HQ607519; XX DT 08-MAY-2011 (Rel. 108, Created) DT 08-MAY-2011 (Rel. 108, Last updated, Version 1) XX DE Trichomonas vaginalis virus 3 strain TVV3-OC3, complete genome. XX KW . XX OS Trichomonas vaginalis virus 3 OC Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus. XX RN [1] RP 1-4846 RX DOI; 10.1128/JVI.00220-11. RX PUBMED; 21345965. RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W., RA Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H., RA Singh B.N., Fichorova R.N., Nibert M.L.; RT "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by RT Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)"; RL J. Virol. 85(9):4258-4270(2011). XX RN [2] RP 1-4846 RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A., RA Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.; RT ; RL Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases. RL Department of Microbiology and Molecular Genetics, Harvard Medical School, RL Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA XX FH Key Location/Qualifiers FH FT source 1. .4846 FT /organism="Trichomonas vaginalis virus 3" FT /host="Trichomonas vaginalis" FT /strain="TVV3-OC3" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="Nov-2009" FT /db_xref="taxon:170965" FT gene 364. .4694 FT /gene="pol" FT CDS join(364. .2449,2449. .4694) FT /codon_start=1 FT /gene="pol" FT /product="RNA-dependent RNA polymerase" FT /note="translated via ribosomal frameshift" FT /protein_id="AED99802.1" FT /translation="MSAPEPLNTEVRSPNGVSEATETQNLAVTQSSVSNEKTIDTQSDL FT QTLKKQLQPVTRSTDFETLYNYFYALNVSPSTDRIGNAITRNTPVNDTNEVVSFPLTAS FT VSHTFSNTPVPAHIQPLQISIADDCVNYELDESGTLCPALDSSVHVQRATSLASALKVK FT LTGEVMHSASVRPIQTPQLIAYLYGVLLAVQDRLNIHRNQHTNLWRSLCAAGRAAQAKP FT FFDEIPNNKFRTGALLAPPLPDAGFGPFPAEGLNQNSKLDFKSKGYIFYKQRTYNPDDM FT NRAFWFIWAIYNRMPEDFQNSYPLNITFCTSELPVQSPMPAADGISAEQCDRALVLLDK FT VILEFFNNDRKLAYYYVFKGCQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA FT EQYANMISCATHPGIIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV FT ADENYWGIAPGSDAEPVGMYYMDIIQRKAEHDLFTETFMDIYGSTASIICANIETSLFT FT SGTNVINKRMQNDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLGGSFEV FT DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGIRPVQQKRKKMSYFRTLDGTFHEVT FT IRSDTHDLQVWNDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLISGFAIPGRAY FT RGPSLAGSNRGRPSLSDIPATGSLSNLIDLSKASRLPYRKLKEGLRASDYTVARELASA FT FRNSRLTRQMDHVTDIAYLNFLRWVLLPYNGQTLRPHPSKWSPTPYPEHVNLKFLTKEI FT ELELFPLKKAPQADLKVNCYARNILASTELTDDLLKQCLPVGLNNDSVCGIVIVLELLL FT IAGVPSKLLPIIGQAIANKDPFIKELSDFNKMIGATTSRIANILTECNTLIGRGVKSSD FT PSADLYHRVAPEGNRHEAKISRHILIEAIDKIYKNEMTSMPPPGDFMLHLITSPLWCKA FT GSHHHPHFAKYDSRLEFVMDVPADKIAAEPPSAYITQAEKLEHGKTRYIYNCDTVSYLF FT FDYILHYVECVWSNESVLLNPAAMSVERFSVLDYPEYCMIDYTDFNSQHSLESQKLVFE FT CLRPYLPREMHSVLDWCIASMDHMEINGQHWLSTLPSGHRATTFINSVLNKAYLIPYIG FT DTVSFHCGDDVLLCGEYDYQTLIDTLPYELNKSKQSFGPNAEFLRLHRRGGDVIGYPSR FT AVSSLVSGNWLSKTSWEWQPSLISVTNQCNVIISRSQLNIRFIPAMQQELRNRYADKMS FT EPFDVSSNYYVMPGCPCYSDAATTIVPNVPQLEHSDVPFSQAQKLFDTMRDYCPEFTTV FT NDVIDKVKARRSSSAVSNIMYNVCSPVAPQVCVVVNPNNYQFLLRKRYYPREHIAPSGF FT DESSDSKLVFTTYDLAPSIAMKSCAVLTPAKIICGHGLRSG" FT gene 364. .2490 FT /gene="cap" FT CDS 364. .2490 FT /codon_start=1 FT /gene="cap" FT /product="capsid protein" FT /protein_id="AED99801.1" FT /translation="MSAPEPLNTEVRSPNGVSEATETQNLAVTQSSVSNEKTIDTQSDL FT QTLKKQLQPVTRSTDFETLYNYFYALNVSPSTDRIGNAITRNTPVNDTNEVVSFPLTAS FT VSHTFSNTPVPAHIQPLQISIADDCVNYELDESGTLCPALDSSVHVQRATSLASALKVK FT LTGEVMHSASVRPIQTPQLIAYLYGVLLAVQDRLNIHRNQHTNLWRSLCAAGRAAQAKP FT FFDEIPNNKFRTGALLAPPLPDAGFGPFPAEGLNQNSKLDFKSKGYIFYKQRTYNPDDM FT NRAFWFIWAIYNRMPEDFQNSYPLNITFCTSELPVQSPMPAADGISAEQCDRALVLLDK FT VILEFFNNDRKLAYYYVFKGCQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA FT EQYANMISCATHPGIIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV FT ADENYWGIAPGSDAEPVGMYYMDIIQRKAEHDLFTETFMDIYGSTASIICANIETSLFT FT SGTNVINKRMQNDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLGGSFEV FT DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGIRPVQQKRKKMSYFRTLDGTFHEVT FT IRSDTHDLQVWNDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLISGFAIPGRAY FT RGPRLPVATEAAQV" XX SQ Sequence 4846 BP; 1274 A; 1334 C; 1008 G; 1230 T; 0 other; hq607519 Length: 4846 08-MAY-2011 Type: N Check: 4612 .. 1 gcttaaaagg tctagtccac tttttaagcc ggctatactt cgccgtagac 51 acttgggcaa aattaatcaa caccctcctg caatcaccgg ggtgttgcga 101 gccataagag actggttcta aaggactgat ataccgccgt gcgggtaggt 151 ggtcgatagc ccgtttgaag gagtagtaat acttccgatt ctggtgtagc 201 atcgactggg gccccctagc gtgagctcag cacgttgaga aaacgaaaaa 251 ctgcatgtgc acagccttcg cagtagcgtg agctcgaggc accctaaaaa 301 gtgccttttc cttgtgacaa cctacgtgtt atcacaaggt tctagagtat 351 tacgtgcaac ggtatgtcag ctcccgagcc cttaaatact gaagtacgtt 401 cacctaatgg tgttagtgaa gccactgaaa ctcaaaactt ggctgtcact 451 caaagcagtg tgtcaaacga aaaaacaatc gacacacaaa gtgatctgca 501 aacactcaaa aaacagttac aaccggtcac cagatccaca gattttgaaa 551 ctctttataa ttatttttat gctttaaatg tctctccgtc aacagatcgt 601 attggtaatg ctatcacacg caatactcca gttaatgata caaacgaagt 651 ggttagtttt ccgcttactg cttctgtctc gcacacattc tccaatacac 701 cagtacctgc ccacattcag cctctccaaa tttccattgc cgacgattgc 751 gttaactatg aactggatga gagcggaaca ttatgccctg ccctagatag 801 ctctgtccac gttcaaaggg ccacttctct cgctagcgct ctcaaggtca 851 aattaacagg cgaggtcatg cactcagctt cagtcagacc aatccaaact 901 ccacagttaa ttgcttatct atatggtgtt cttctcgccg tccaagatcg 951 tcttaatatt catcgtaatc aacacacaaa cttatggcgt agcttatgtg 1001 ccgcaggtcg tgcagctcaa gcgaaaccat tcttcgatga aatcccaaat 1051 aacaagttca gaaccggcgc gctcttggca ccccctctcc cagacgccgg 1101 ctttggtccc ttcccagctg agggcctcaa ccagaattcc aagctcgatt 1151 tcaaatcgaa gggatacatt ttctacaagc agcgcactta caatcccgat 1201 gatatgaatc gcgctttttg gttcatttgg gcgatctaca atcgtatgcc 1251 cgaagacttc cagaattcat atcctctgaa cattactttc tgcacttccg 1301 aattaccagt ccaaagcccg atgccagcgg ctgacggaat ttccgcagag 1351 cagtgtgata gggcactcgt tcttctcgac aaggtcattc tcgaattctt 1401 caacaacgat cgcaagcttg cttattacta cgtattcaag ggatgccagt 1451 tcgtcatgcg tccttgttcc tgttatcaag aaggtggctt aatccgcaag 1501 gcctcacgca acgttgctct tcgcgccttt actggcattt actacctcgc 1551 cggcttcgct gagcaatacg ctaacatgat ttcatgtgcc acacacccag 1601 gaattatcgg tgctctcttc caatacgtcg acactatggt cttacaggct 1651 gttttctctc tctctggccc taagctggtc cgatttgccg ccccacccga 1701 atatcaaggt cgtcacgctt gtccgttctc cttcgtagca gacgaaaatt 1751 attggggtat cgctccaggt tcagacgctg agcctgttgg catgtactat 1801 atggatatca tccaacgcaa agccgaacac gacttattca ccgaaacatt 1851 catggatatc tacggttcaa cagcttccat catttgcgca aacatcgaaa 1901 caagtttgtt cacttctggc acaaacgtca tcaacaaacg catgcagaat 1951 gatttcgcac gcgacactcc aaagcctgga actcttcgcc accaacatgc 2001 catcatcaac cgcttccacg aacccgaata tgcttaccgt ctcggtatcc 2051 tcgctgatgg catcattccg ctcggcggct ccttcgaagt cgacatcctc 2101 aaagaagctg agcgcctcat cacaggtgaa gacatccgca acctcccagg 2151 actacgttgc ctgtgctctc gcggtctcga cgcgattctc ggcatacgcc 2201 cagtccaaca aaaacgcaag aagatgagtt acttccgcac tctcgatggc 2251 acattccacg aagtaacgat caggtcagat actcacgatt tacaggtctg 2301 gaatgaccac ggctaccttg cccgcccata cgcatgtcac atcgtcgact 2351 cagacggcat cgagttctac gacaaatcca acggtctcta caagggacgc 2401 gtcaatgtcc tcatctctgg attcgccatc ccaggtcgcg cataccgggg 2451 ccctcgcttg ccggtagcaa cagaggccgc ccaagtctga gcgacattcc 2501 ggcgacagga agtctgtcca acctcatcga cctttcgaaa gcaagtcggc 2551 taccataccg taagcttaaa gaaggcttga gagcgtcaga ctacaccgtc 2601 gcccgcgagt tagctagcgc ttttcgcaat tctcgcctaa ctcgccaaat 2651 ggatcatgtt acagatatag cttaccttaa tttcctcaga tgggtgttgc 2701 taccttacaa cggtcaaaca ctacgaccac acccctccaa gtggagtcca 2751 acaccctacc ccgaacacgt caacctaaag ttcctaacca aggaaatcga 2801 gctcgaactt ttcccactga agaaggcccc acaagccgat cttaaagtga 2851 attgttacgc gcgaaatatc cttgcttcga cagagctaac tgacgatctc 2901 ctcaaacagt gtctgccagt cggactcaac aacgattcag tttgcggaat 2951 tgttatcgtc ttagagctgc ttttgattgc aggtgttcca agtaagctgc 3001 taccaattat tggccaagcc atcgcgaaca aagatccatt catcaaagaa 3051 ttgtccgatt tcaataagat gataggagcg accacttcac gcattgctaa 3101 cattctcaca gagtgcaaca cattaattgg tcgcggtgtt aagtcatctg 3151 acccaagtgc tgatttgtat caccgggtag cgcctgaggg caataggcac 3201 gaagcgaaga tttcccgaca catcctcatc gaagccatcg acaaaattta 3251 caaaaacgaa atgacaagca tgcctccacc gggcgacttc atgctccact 3301 taataacaag tcctctatgg tgtaaggctg gctctcatca ccatcctcac 3351 ttcgccaaat acgattcccg cttggaattc gtcatggatg ttccagcaga 3401 caaaatcgct gctgaaccac cctctgcata cattactcaa gcggagaaat 3451 tggaacacgg taagactagg tacatctata actgcgatac agtatcatac 3501 ctattcttcg attacatctt acattacgtc gagtgtgtgt ggtcaaatga 3551 gtcagtctta ctcaacccag ctgctatgag tgtcgagcgt tttagtgtct 3601 tggactaccc ggagtactgc atgatcgatt atacagattt caactctcaa 3651 cacagtctag aatcccagaa gctagtcttc gagtgtttga gaccgtactt 3701 gccacgcgaa atgcattcag tcttggattg gtgtatcgcg agcatggacc 3751 atatggaaat taacggccaa cattggttaa gcacgttgcc gtcaggacac 3801 agagcgacaa ctttcataaa ctcggttctg aacaaagcct acttgatccc 3851 ctacataggc gacaccgttt ctttccattg tggcgacgac gtgttattat 3901 gtggcgagta cgactatcaa acactcatcg ataccctgcc ctatgagcta 3951 aacaagagca aacagagctt tggacctaat gccgagttct tgcgcttgca 4001 taggcgtggt ggtgacgtca taggataccc atccagagct gtgtcgagtt 4051 tggtatctgg caattggtta agcaaaacat cttgggagtg gcaaccaagt 4101 ctcatttcgg tcacaaatca atgcaacgta atcatctcgc gttcacaatt 4151 gaatatcagg tttattcccg ctatgcaaca ggaactgcgc aaccgctacg 4201 cggacaagat gagtgaacct ttcgatgtca gctcgaatta ctacgtcatg 4251 ccaggttgtc cgtgctatag tgacgccgcg actacgatcg taccgaacgt 4301 tccccaactg gaacattcgg atgtaccgtt ctcgcaagca caaaaacttt 4351 ttgatactat gcgcgactac tgtcctgagt tcactaccgt caacgacgtc 4401 atcgacaagg ttaaagcccg tcgttcctcg agtgctgtca gcaatatcat 4451 gtacaatgta tgctcacctg tcgcacctca agtttgcgta gtcgtaaatc 4501 ccaacaacta ccagttcctt ttgcgcaagc ggtactaccc acgcgaacac 4551 attgccccat ccggctttga tgaatctagc gactccaagc tcgtttttac 4601 tacttacgat ctcgctcctt caatcgctat gaaatcgtgc gctgttttga 4651 ccccggcaaa gataatatgt ggtcacgggc tacgcagtgg ttgaataatc 4701 tgcctgtacc aggctatgat tggtaccgat tcagccacga acggcctctt 4751 gtcttcggac cctccgccta taggttaata ggagtacagt gttactgttg 4801 tgtgtatcgc tctaggcaca cgaacgtact accccacgtt tagttc