Sequence of DPV Trichomonas vaginalis virus 1
Trichomonas vaginalis virus 1 strain TVV1-UR1, complete genome.
ACC No: HQ607513
Dated: 2011-05-08 | Length: 4684 | CRC: -358378484
ID HQ607513; SV 1; linear; genomic RNA; STD; VRL; 4684 BP.
XX
AC HQ607513;
XX
DT 08-MAY-2011 (Rel. 108, Created)
DT 08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE Trichomonas vaginalis virus 1 strain TVV1-UR1, complete genome.
XX
KW .
XX
OS Trichomonas vaginalis virus 1
OC Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN [1]
RP 1-4684
RX DOI; 10.1128/JVI.00220-11.
RX PUBMED; 21345965.
RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA Singh B.N., Fichorova R.N., Nibert M.L.;
RT "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by
RT Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)";
RL J. Virol. 85(9):4258-4270(2011).
XX
RN [2]
RP 1-4684
RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT ;
RL Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases.
RL Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
FH Key Location/Qualifiers
FH
FT source 1..4684
FT /organism="Trichomonas vaginalis virus 1"
FT /host="Trichomonas vaginalis"
FT /strain="TVV1-UR1"
FT /mol_type="genomic RNA"
FT /country="USA"
FT /collection_date="Jun-1999"
FT /db_xref="taxon:674953"
FT gene 326..4616
FT /gene="pol"
FT CDS join(326..2352,2354..4616)
FT /codon_start=1
FT /gene="pol"
FT /product="RNA-dependent RNA polymerase"
FT /note="translated via ribosomal frameshift"
FT /protein_id="AED99812.1"
FT /translation="MEASANGLSHDDNANKSQNVGPSTLPGSDKQGGENHENSFNSFSN
FT DFFFNFLRTSTSTHISDSPGVSFVSKDGTPYTSATIQSAVGRLTHNVVASAVQLNITAN
FT NTLEVDYGFGQDVSRATGTITIPIFDGEKYKEVARALSLVFSKKGMALDVTSQTVQDTL
FT MNSDLTIATVAAGYYTALAARHELTKEASVAAHRIPFVTALSDTFTAADNAQRSSHVIS
FT SCLRCPASNNAQRQVTVGTNMWTNVSVENLAVQGAAIPNPNDVSFFIPNKALPSSWWCA
FT IWLLNAFLHSFVAQTRFHIFITPGETYNLAPFTDADIYEAIPVLLAMSKSSRPVPESVE
FT SMLYAYGTQMVIQPHSLYTEGGIIRKMIFTVPHLPAHGYFVTNAEYSRYMNIAVPNDPR
FT TAKDYIIGVGTGLLQVILAYQAAFSCGGPIALHWHANDAISHGMDTVAAAYLEGRYFTI
FT PMAINVATNIAQYTTGVRADPQYKHSLDRILPRIFGPSTDTVFNFIESAITSSWVSINA
FT TKRNGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSNIAQEVEYLTNGGDLR
FT NCPILRTLKAAEAEETVTFMCTGKIGSIFAIDGTMRTFKRYQTIDLAELGWTSHGKVMK
FT PYAFRAPVIQGITVCKTAYTSTAIDIVTTVFGPLRLRVGTLLSKAVRCGPIIPSVKHHF
FT NIRRIITVKRNGNEYVFIPGYGWVLQDDYLVNSVKMTGEDQLPPNQLPYGDDLLLIYSE
FT ILLYNYISLFPKFRYKNPDLLNQETELQLFPLKTDSAARNKANFYARSLWNEAKTDKTA
FT FKPGTYNDTVAGLLMWQQCALMWSLPRSVINRTISGVCDALTERTSLALLKRISDWLKQ
FT LGLACSPIHRLFIELPTLLGRGAIPGDSVKDMKHRLKFDPSITVDVPRDQLHDLIYRLL
FT SRNLHITNVESFDHHLEERLLWSKSGSHYYPDEEVNRLLPNQPTRKEFLDVVTVDYIKE
FT CKPQVFIRQSRKLEHGKERFIYNCDTVSYVYFDFILKLFEAGWQDSEAILSPGDYTGER
FT LHARISSYKYKAMLDYTDFNSQHTIRSMRLIFETMKELLPPETTFALDWCIASFDNMYT
FT SDGHKWVSTLPSGHRATTFINTVLNWCYTQMVGLKFNSFMCAGDDVILLSQEPISLVPI
FT LTSHFKFNPSKQSTGTRGEFLRKHYTSEGVFAYPARAIASLVSGNWLSQSLRENTPILV
FT PIQNGIDRLRSRAGLLGVPWILGLSELTEREAVPRDVSMALLNSHAAGPGLITRNYSSF
FT TVTPKPPTLTSTLEYTATRYGVQDLSKHVPWEQLTLEERNKLGKQIKKMSHRHCSQAKI
FT TYTCVHEVYKPSGLPKVLSGASQPSLSMVWWQAMLKEAMQDNSTKKIDAQMFASSACTD
FT RVSGDAFLQASAKAAGVLITSLIQSSS"
FT gene 326..2362
FT /gene="cap"
FT CDS 326..2362
FT /codon_start=1
FT /gene="cap"
FT /product="capsid protein"
FT /protein_id="AED99811.1"
FT /translation="MEASANGLSHDDNANKSQNVGPSTLPGSDKQGGENHENSFNSFSN
FT DFFFNFLRTSTSTHISDSPGVSFVSKDGTPYTSATIQSAVGRLTHNVVASAVQLNITAN
FT NTLEVDYGFGQDVSRATGTITIPIFDGEKYKEVARALSLVFSKKGMALDVTSQTVQDTL
FT MNSDLTIATVAAGYYTALAARHELTKEASVAAHRIPFVTALSDTFTAADNAQRSSHVIS
FT SCLRCPASNNAQRQVTVGTNMWTNVSVENLAVQGAAIPNPNDVSFFIPNKALPSSWWCA
FT IWLLNAFLHSFVAQTRFHIFITPGETYNLAPFTDADIYEAIPVLLAMSKSSRPVPESVE
FT SMLYAYGTQMVIQPHSLYTEGGIIRKMIFTVPHLPAHGYFVTNAEYSRYMNIAVPNDPR
FT TAKDYIIGVGTGLLQVILAYQAAFSCGGPIALHWHANDAISHGMDTVAAAYLEGRYFTI
FT PMAINVATNIAQYTTGVRADPQYKHSLDRILPRIFGPSTDTVFNFIESAITSSWVSINA
FT TKRNGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSNIAQEVEYLTNGGDLR
FT NCPILRTLKAAEAEETVTFMCTGKIGSIFAIDGTMRTFKRYQTIDLAELGWTSHGKVMK
FT PYAFRAPVIQGITVCKTAYTSTAIDIVTTVFGPLRLRVGTLFE"
XX
SQ Sequence 4684 BP; 1356 A; 1204 C; 943 G; 1181 T; 0 other;
hq607513 Length: 4684 08-MAY-2011 Type: N Check: 1601 ..
1 gcaaaaagag ggagtcaccc actttcctct ttttgcaccc caacattgtc
51 acctcatcat gacgaactca taacgcggac acataacaag cgtagtgtcc
101 tcgacgattg ccatcctcgt gtgaattccg ggctccgctt gcactgatgg
151 tacctcttac gaaacttgga gagacttcgg cctcgaagag cggtaatgta
201 ccctctgcgc ctgggaccta atggcgtttt tgctgtaggg atttttcagt
251 ggtagaaaga ttaggggtta aacatcctgg ttcgctaggt ttgtccttaa
301 ctttactcag ctgatgggaa tacccatgga ggcttctgct aatgggttat
351 cacatgatga taatgcgaat aaatcgcaaa atgttggacc ttctactctt
401 ccggggtcag ataaacaagg aggagaaaac cacgaaaatt cttttaattc
451 tttttcaaat gatttctttt ttaatttttt acgcacatct acgagtactc
501 atatttcaga cagtccagga gtttctttcg tctcaaagga tggaacacct
551 tacacatcag ccaccatcca gtccgcagtc ggtcgtctta cacataatgt
601 cgtcgcatca gcagtccagc tcaacattac agccaacaat acgttagagg
651 tggactacgg ttttggccaa gacgtctcaa gagctacagg aaccatcaca
701 atcccgatct ttgatggtga aaagtacaaa gaggtagctc gcgccttatc
751 attagttttc agcaagaagg gtatggcgct cgacgttaca tctcaaactg
801 ttcaagacac cctcatgaac tccgatctca caattgctac cgtcgctgct
851 ggatattaca cagctttagc cgctcgccat gaactcacaa aagaagcaag
901 cgtcgcagcc catcgtattc ctttcgttac agccttatca gacacgttca
951 cagctgcaga caacgcgcaa cgctcaagcc acgtcatctc ttcttgcttg
1001 cgctgccctg cctcgaataa cgctcaacgc caagtcacag tcggtacgaa
1051 tatgtggacg aatgtttccg tcgaaaatct tgcagtacaa ggcgctgcaa
1101 tcccaaatcc aaacgatgta tcattcttca ttccgaacaa agccctccca
1151 tcttcttggt ggtgcgcaat ctggcttctc aacgcttttc ttcacagctt
1201 cgtcgcgcag actcgtttcc acatcttcat tacaccaggt gaaacctaca
1251 atcttgcgcc gttcacagat gccgatatct acgaagcgat ccctgttttg
1301 ctcgcgatgt caaaatcatc gcgtcctgtt ccagagagcg tcgaaagtat
1351 gctctacgcc tacggcaccc agatggttat ccaaccacac tcgctctata
1401 cggagggcgg cataattagg aaaatgatct ttaccgtccc acaccttcca
1451 gcacacggct atttcgtaac aaacgctgaa tactcgagat acatgaacat
1501 cgctgttcca aacgacccac gtacagccaa agattacata attggagtcg
1551 gcaccggtct cttacaggtc atacttgctt accaagccgc cttcagctgt
1601 ggtggtccaa tcgctctcca ctggcacgcc aacgacgcta tctcacatgg
1651 tatggatact gttgcagctg cttacctcga aggaagatac ttcaccatcc
1701 caatggctat caacgttgcg acgaatattg cccaatatac tacaggagtt
1751 agggcagatc cacagtacaa acattcactc gaccggatct taccacgcat
1801 tttcggtccg tcgacagaca cagttttcaa cttcatcgaa tccgcgatca
1851 catcttcttg ggtctcaatc aatgctacga aacgcaacgg ccgtgccaga
1901 aagttcagga cggctttcat caaccgcttt catgatccag aattcgctta
1951 catgttcggc attactggca atggcatcga gcggatggaa ggtaaagtca
2001 cgtcgaacat cgcacaggaa gttgaatacc tcaccaacgg tggtgacctt
2051 cgcaactgcc caattcttcg caccttaaag gctgctgaag cagaagagac
2101 cgtcactttc atgtgtacgg gaaagatcgg ttccatcttc gcgatcgatg
2151 gtacaatgcg cacattcaaa cggtaccaaa cgatcgacct cgctgaactc
2201 ggatggacgt cacacggcaa ggtcatgaaa ccatacgctt tcagagcccc
2251 agtcatccaa ggaatcaccg tctgcaagac agcttacaca tccacagcta
2301 tcgacatcgt cacaacagtc ttcggcccct tacgccttcg cgtaggcacc
2351 ctttttgagt aaggctgtac gttgtggccc tataatacca tccgtcaagc
2401 atcacttcaa cataagacgc atcataacag ttaaacgtaa tggtaatgaa
2451 tacgtattta tcccaggtta cggatgggta ttacaggatg attatttggt
2501 gaattccgtc aagatgactg gtgaagatca actacctcct aaccagttac
2551 cttatggcga tgatctttta cttatatatt cagaaatttt actttataat
2601 tacatatctc tttttcccaa gttcagatac aagaatccag acttattaaa
2651 tcaagaaaca gaattacaac ttttcccact taaaaccgac tcagctgcca
2701 gaaataaagc caatttttat gctagatcac tatggaatga agcaaaaaca
2751 gacaaaacag ctttcaaacc aggaacctac aatgacacag tagcaggtct
2801 attgatgtgg caacaatgtg ctctcatgtg gtcactgcct cgctcagtta
2851 tcaacagaac aattagcggc gtttgtgatg cgttaaccga aaggacttca
2901 ctcgcgctat taaaacgtat ctcagattgg ttgaaacaac tcgggctggc
2951 ttgctcacca atccatcgct tattcataga actcccaaca ctattaggac
3001 gcggagcaat tccaggcgat agtgtaaagg atatgaagca cagactcaag
3051 ttcgacccat ctataacagt agatgtccca agagaccagt tacacgatct
3101 aatctacaga ctcttatcaa gaaatcttca cataaccaac gttgagagct
3151 tcgatcacca tctagaagag cgtctgcttt ggtctaaatc cggaagtcat
3201 tactatcccg acgaggaagt caatagatta cttcccaatc aacccacaag
3251 gaaagaattc ttagacgtcg taaccgtaga ctacatcaag gaatgcaagc
3301 ctcaggtttt cataagacaa tcacgtaagc tagaacacgg caaagaacgt
3351 ttcatctaca actgcgacac agtctcatac gtctattttg attttatcct
3401 gaaactcttt gaggcaggat ggcaagatag cgaagcaata ctgtcgccag
3451 gtgactacac tggtgaacgc ttacacgcaa gaatttctag ctacaaatac
3501 aaggctatgc tcgattacac ggatttcaat tctcagcata caatccgaag
3551 catgcgactg atattcgaaa ctatgaagga gttactacca cctgaaacca
3601 cctttgctct cgactggtgt atcgcctcat tcgataacat gtacacatcc
3651 gatggccaca aatgggtctc gactctccca agcggacatc gagctactac
3701 cttcatcaac acagtattaa attggtgcta cacacaaatg gtcggtctca
3751 agttcaacag ttttatgtgc gccggtgatg atgtcatttt attgtctcaa
3801 gagccaatat cactggtccc cattcttaca tcacatttca agttcaatcc
3851 cagtaaacaa agtacaggta ctagaggtga attcttacgc aagcattaca
3901 cctcagaagg cgtgtttgca tacccggcac gagcaattgc aagcttagta
3951 agcggaaatt ggttaagcca atctttaaga gagaacactc caattttggt
4001 cccaatacaa aacggaatcg acagacttcg aagcagagct ggtttactcg
4051 gagtcccttg gatcttaggc ctctcggagc tcacagagcg agaagccgtt
4101 cctagggatg tcagcatggc tctgttaaat tcacacgctg caggaccagg
4151 tttgatcaca cggaattaca gttctttcac cgttaccccg aaaccaccta
4201 cgctaactag tacactcgag tacacagcaa cccgttacgg tgtccaagac
4251 ctgtccaaac acgtaccatg ggaacaactt acattggaag aacgtaataa
4301 gttaggaaaa caaattaaga aaatgagtca caggcattgt agccaggcaa
4351 agataacata cacttgtgtt cacgaagttt acaaaccaag tggcctcccc
4401 aaggtgttat ctggtgccag ccaaccatcg ttgtcgatgg tgtggtggca
4451 ggcaatgctt aaggaagcaa tgcaagacaa ctctactaag aagatagatg
4501 cacaaatgtt cgcttcgagt gcatgtacag accgcgtcag cggtgatgca
4551 ttcttgcaag cgagcgcaaa agctgctggt gtactaatca ctagcttgat
4601 tcaatcttct tcataacgta cagcaaaaaa agtctctata gttgctcaag
4651 acatatatga gccagatggc cctgctatac cttc
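
The two CDS locations in the feature table can be cross-checked against the lengths of their /translation qualifiers. The sketch below is not part of the database record. It assumes simple forward-strand ranges, assumes that each CDS ends in a stop codon, and uses hypothetical helper names (location_length, expected_residues). The pol location skips base 2353, which is how the record encodes the ribosomal frameshift, so the joined length is still a whole number of codons.

```python
import re


def location_length(location: str) -> int:
    """Number of bases covered by an EMBL-style location string.

    Handles only simple forward-strand ranges such as '326..2362' or
    'join(326..2352,2354..4616)'; complement() and fuzzy ends are not handled.
    """
    compact = re.sub(r"\s+", "", location)  # tolerate stray spaces inside the location
    spans = re.findall(r"(\d+)\.\.(\d+)", compact)
    if not spans:
        raise ValueError(f"no start..end ranges found in {location!r}")
    # Ranges are 1-based and inclusive, hence the +1.
    return sum(int(end) - int(start) + 1 for start, end in spans)


def expected_residues(location: str) -> int:
    """Amino acids encoded, assuming the last codon is a stop codon."""
    n_bases = location_length(location)
    if n_bases % 3 != 0:
        raise ValueError("covered length is not a whole number of codons")
    return n_bases // 3 - 1


# Locations copied from the pol and cap CDS features of this record.
pol_residues = expected_residues("join(326..2352,2354..4616)")
cap_residues = expected_residues("326..2362")
print(pol_residues, cap_residues)
```

Comparing the two printed values with the residue counts of the pol and cap /translation strings is a quick way to spot a truncated or mis-copied feature.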