Sequence of DPV Trichomonas vaginalis virus 1

Trichomonas vaginalis virus 1 strain TVV1-UH9, complete genome.

ACC No: HQ607516

Dated: 2011-05-08 | Length: 4678 | CRC: 2045305673

                
ID   HQ607516; SV 1; linear; genomic RNA; STD; VRL; 4678 BP.
XX
AC   HQ607516;
XX
DT   08-MAY-2011 (Rel. 108, Created)
DT   08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE   Trichomonas vaginalis virus 1 strain TVV1-UH9, complete genome.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 1
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4678
RX   DOI; 10.1128/JVI.00220-11.
RX   PUBMED; 21345965.
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA   Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA   Singh B.N., Fichorova R.N., Nibert M.L.;
RT   "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by
RT   Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)";
RL   J. Virol. 85(9):4258-4270(2011).
XX
RN   [2]
RP   1-4678
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA   Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT   ;
RL   Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases.
RL   Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL   Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .4678
FT                   /organism="Trichomonas vaginalis virus 1"
FT                   /host="Trichomonas vaginalis"
FT                   /strain="TVV1-UH9"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="Sep-2002"
FT                   /db_xref="taxon:674953"
FT   gene            325. .4615
FT                   /gene="pol"
FT   CDS             join(325. .2351,2353. .4615)
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /protein_id="AED99814.1"
FT                   /translation="MEASANGLSHDENATRSQNVGPSTLPGSDKQGGEKHENSFNSFSN
FT                   DFFFNFLRISAQTHISDSPGVSFIGKDGTPYSSTTIPSAVGRLTHNVVASAVQLNVTAD
FT                   NVLEVDYGFGQDVSRSTGTITIPIFDGEKYKETARALATIFSKKGMAVDVTSQTVQETL
FT                   KNSDLTIATVAAGYYTALAARHELTKQVSVASHTIPFVTALSDTLSAAQGAQRSSHVIS
FT                   SCLRCPHSNNVQHDVGIGTDMWNNVSVESLSPQNMAVPNPNDVSFFIPNKALPPPWWCA
FT                   IWLLNAFIHSFVAPTRFHIFIAPGETYHLAPFTDADVYEAIPIMLAMSKAARPVPESVE
FT                   SMLYAYGTQMIIQPHSLYTEGGLIRRMIFTVPHLPAHGYFVTNSEYSRYMNIAVPNDPR
FT                   SAKDFIIGAGTGLLQITLAYQAAFSCAGPIALHWHDNDAISQGMDTIASTYLEGRYFTI
FT                   PIAVAVATNVAQYTTMVRADPQYRHTLDRILPRIFGPSTDTVFNFIESAISSSWVSIDA
FT                   RRRNGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSTISQEVDYLLHGGDLR
FT                   NCPVLRTLKAAERDETITFMCKEKAGTLFAMDGTMRWFKRFETIDLTQLGWTSHGKVMK
FT                   PYAFRAPIIQGITICNTAYTTTAIDIVTTVFGPLRQRVGSLLSKAVRCGPIIPTIKHHF
FT                   NFKHVITAKRNDNEYVFIPGYGWVLQDDYLLNAVKMTGEGDLPPDQLPYDDDLLLTYAK
FT                   ILLYDYITHFPKYRYNNPKILTQETEIQLFPLKEDSAARNKVNFYTRLLWNEATSDKKA
FT                   FKPGTYNDTVAGLLMWQQCALMWSIPQSIINRTISGVCDALTDRTSLTLLKRISDWLKQ
FT                   LGLAYSPIHRLFIELPTLLGRGAIPGDAAHDIKHRLTFDPSITVDVPTDQLHRLIYRLL
FT                   SRNLNITTVDSFEDHLEERLLWSKSGSHYYPDDEVNKLLPHRPTRKEFLDIVTVDYIKR
FT                   CKPQVFIRQSRKLEHGKERFIYNCDTISYVYFDYILKLFETGWQDSEAILSPGDYTNDR
FT                   LHTKISSYKYKAMLDYTDFNSQHTIESMRLIFETMKELLPSEATFALDWCIASFDNMKT
FT                   SKGHKWVATLPSGHRATTFINTVLNWCYTQMVGLKFDSFMCAGDDVILMSQEPISLAPI
FT                   LTSHFKFNPSKQSTGTRGEFLRKHYTAEGVFAYPCRAIASLVSGNWLSETLRDNTPMVV
FT                   PIQNGIDRLRSRAGLLGVPWRLGLSELIEREDIPKEVGMALLNSHAAGPGLITRDYSSF
FT                   TVTPKPPTITSTLEYTATRYGVQDLSKHVPWKQLTLQECNKLGQQIKKMSHRHCSQAKI
FT                   TYKCVYEVFKPNGLPTVLSEVSQPSLSMVWWQAMLKEAMQDYSVKKIDAQMFASNACTS
FT                   SVSGDAFLQATPKMAGVLMTSLIYSSS"
FT   gene            325. .2361
FT                   /gene="cap"
FT   CDS             325. .2361
FT                   /codon_start=1
FT                   /gene="cap"
FT                   /product="capsid protein"
FT                   /protein_id="AED99813.1"
FT                   /translation="MEASANGLSHDENATRSQNVGPSTLPGSDKQGGEKHENSFNSFSN
FT                   DFFFNFLRISAQTHISDSPGVSFIGKDGTPYSSTTIPSAVGRLTHNVVASAVQLNVTAD
FT                   NVLEVDYGFGQDVSRSTGTITIPIFDGEKYKETARALATIFSKKGMAVDVTSQTVQETL
FT                   KNSDLTIATVAAGYYTALAARHELTKQVSVASHTIPFVTALSDTLSAAQGAQRSSHVIS
FT                   SCLRCPHSNNVQHDVGIGTDMWNNVSVESLSPQNMAVPNPNDVSFFIPNKALPPPWWCA
FT                   IWLLNAFIHSFVAPTRFHIFIAPGETYHLAPFTDADVYEAIPIMLAMSKAARPVPESVE
FT                   SMLYAYGTQMIIQPHSLYTEGGLIRRMIFTVPHLPAHGYFVTNSEYSRYMNIAVPNDPR
FT                   SAKDFIIGAGTGLLQITLAYQAAFSCAGPIALHWHDNDAISQGMDTIASTYLEGRYFTI
FT                   PIAVAVATNVAQYTTMVRADPQYRHTLDRILPRIFGPSTDTVFNFIESAISSSWVSIDA
FT                   RRRNGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSTISQEVDYLLHGGDLR
FT                   NCPVLRTLKAAERDETITFMCKEKAGTLFAMDGTMRWFKRFETIDLTQLGWTSHGKVMK
FT                   PYAFRAPIIQGITICNTAYTTTAIDIVTTVFGPLRQRVGSLFE"
XX
SQ   Sequence 4678 BP; 1387 A; 1159 C; 936 G; 1195 T; 1 other;

hq607516 Length: 4678  08-MAY-2011  Type: N  Check: 2132  ..

       1  gcaaaaagag ggagtgatcc actttcctct ttttgcaccc aacattgtta
      51  catcatcatg acgaacccat aacgcggaca cataacaagc gtagtgtcct
     101  cgacgattgc catcctcgtg tgaattccgg gctccgcttg cactgatggt
     151  acctcttacg aaacttggag agacttcggc cttgaagagc ggtaatgtgc
     201  cctctgcgcc tgggacctaa tggtgttttt gctgtaggta cttcaatagt
     251  aggaaggagt agggttaaac atcctggttc gctaggtttg tcctcgcctt
     301  aatctctatt gataatgaat acccatggag gcttctgcta atgggttatc
     351  acatgatgaa aatgcgacaa gatcgcaaaa tgttggacct tctactcttc
     401  cggggtcaga taaacaagga ggagaaaaac acgaaaattc ttttaattct
     451  ttttcaaatg atttcttttt taatttttta cgtatatctg cacagacaca
     501  catttcagac agtccaggcg tctctttcat aggtaaagac ggcacgccct
     551  actcatcaac tacaattcca tcagctgtag gccgtcttac acacaatgta
     601  gttgcatcag ccgtccagct caacgtcaca gccgataatg tcttagaagt
     651  cgactacggt ttcggtcagg acgtctcaag atctacagga acgatcacaa
     701  tcccaatctt cgacggtgaa aagtacaaag aaacagctcg cgccttagcc
     751  acaatcttca gcaagaaagg tatggcagtc gacgtcacat cacagacggt
     801  ccaggaaact ctcaagaatt cagaccttac aattgccacg gtagcagctg
     851  gatattacac agccctagca gctcgtcacg aactcaccaa acaagtaagc
     901  gtcgcatccc ataccatacc gttcgtcacc gcgttatcag atacactctc
     951  ggccgcacaa ggtgcgcaac gttcaagcca tgttatctct tcttgcttac
    1001  gttgccctca ttccaacaat gttcaacatg acgtcggaat cggtacagac
    1051  atgtggaata acgtctccgt cgaaagtctt tcaccacaga acatggcagt
    1101  tcctaatcct aacgatgtat ccttcttcat tccgaacaag gctctcccac
    1151  ctccttggtg gtgcgctatt tggcttctca atgccttcat tcacagcttt
    1201  gtcgcgccga cacgtttcca tatcttcatt gctccaggcg aaacatacca
    1251  tcttgcacca tttacagatg ccgatgttta cgaagcgatc cctatcatgc
    1301  ttgcgatgtc aaaggcggca cgcccagttc ctgagagcgt tgagagtatg
    1351  ctttatgcat acggtacaca gatgattatc cagccacact cgctctacac
    1401  agaaggtgga ttaatcagaa ggatgatatt tacggttcca catctcccag
    1451  cacacggcta cttcgtcacg aattccgaat actcaagata catgaacatc
    1501  gctgttccaa acgaccctcg ttctgcaaaa gactttatta tcggtgctgg
    1551  aacgggtctc ttacaaatca cactcgccta ccaagctgct ttcagttgcg
    1601  caggccccat cgcacttcat tggcacgaca acgatgccat ctcccaaggt
    1651  atggatacaa ttgctagcac ctaccttgaa ggaagatact tcactattcc
    1701  tatagcggtt gctgtcgcta caaatgtagc tcaatataca acgatggtca
    1751  gagctgatcc ccaatacagg cacacactcg atcggatctt accacgcata
    1801  ttcggtccat caacagatac agtgttcaat ttcatcgaat ccgctatctc
    1851  atcatcatgg gtatcaatcg atgcccgcag gcgaaacggt cgcgcaagaa
    1901  aattcaggac ngccttcatc aatcgttttc atgatccaga attcgcatac
    1951  atgttcggta tcactggtaa cggtatcgag agaatggaag gcaaagtcac
    2001  ttccaccatc agccaagagg tcgactacct cttacacggc ggtgaccttc
    2051  gcaactgccc agtccttcgc accctcaaag cagcagagag agatgaaaca
    2101  atcacgttca tgtgcaagga gaaagctggc acgctcttcg ccatggacgg
    2151  aacaatgcgc tggtttaagc ggttcgagac aatcgatctc acccagctcg
    2201  gatggacatc acacggaaag gtcatgaagc cgtacgcatt cagagctcca
    2251  atcatccaag gaataacaat ctgcaacaca gcttacacaa caacagccat
    2301  cgacatcgtt acaactgtct ttggtccttt acgtcagagg gtaggttccc
    2351  tttttgagta aagctgtacg ttgtggccct ataataccaa ccatcaagca
    2401  tcatttcaat ttcaaacatg ttataacagc taagcgaaat gataacgaat
    2451  atgttttcat tcccggatat ggttgggtat tacaggatga ttatttgttg
    2501  aatgccgtaa agatgactgg tgaaggtgat ttaccccctg atcagttacc
    2551  ttacgatgat gatcttttac ttacatacgc aaaaatttta ctttatgatt
    2601  acataactca ttttcctaaa tacagatata acaatcctaa gatattaaca
    2651  caagaaacag aaatacaact tttcccactc aaagaagact cagctgctag
    2701  aaacaaagtc aacttctaca ctagactact atggaacgaa gcaacctcag
    2751  ataaaaaggc tttcaaacca ggaacttaca atgacacagt agcaggctta
    2801  ttgatgtggc aacaatgtgc tcttatgtgg tccatacctc agtcaattat
    2851  caacagaaca attagcggtg tttgtgatgc attaaccgat aggacttcac
    2901  tcacgctatt gaaacgtatc tcagactggc tgaaacagct tggactagct
    2951  tactcaccta tacatcgcct tttcatagag ctccccacat tattaggacg
    3001  tggagccata ccaggcgacg cagctcacga tattaagcac agactcacgt
    3051  tcgacccatc aattacagta gacgtgccaa cagaccaatt acataggtta
    3101  atttacagac tcttatctcg gaacctcaat ataactacgg tagatagttt
    3151  cgaagatcac ttagaggaac ggctactttg gtccaaatca ggaagtcact
    3201  attatcccga cgacgaagtg aataagttac ttcctcaccg ccctacaaga
    3251  aaggaattct tagacatagt aacagtagat tacattaaac gatgcaagcc
    3301  ccaagttttc atcagacaat cacgcaagct agaacacggc aaggagcgat
    3351  tcatctacaa ttgtgacacg atctcgtatg tctattttga ttacatcctg
    3401  aagctcttcg agacaggatg gcaagatagt gaagcaatat tgtcaccagg
    3451  tgattatact aatgatcgtc tccacaccaa gatctccagc tataagtaca
    3501  aagcaatgtt agattataca gattttaatt ctcaacacac gatcgaaagc
    3551  atgcgtctga ttttcgaaac catgaaagaa ctactcccgt cagaagcgac
    3601  cttcgcactt gactggtgta tagcatcctt cgacaacatg aaaacatcca
    3651  aaggtcacaa atgggttgcg acccttccta gcggacatcg tgctacaacg
    3701  ttcatcaata cagtattaaa ttggtgttac acgcagatgg tcggtctcaa
    3751  gtttgacagt tttatgtgcg ctggtgatga tgtcatctta atgtcccaag
    3801  aaccaatatc attagctcca attctcacat cgcacttcaa attcaatcct
    3851  agcaaacaga gtacaggcac tagaggtgag ttcttacgca agcattacac
    3901  tgcagaaggc gtgtttgcat acccatgtag agcgatcgca agtttagtaa
    3951  gcggaaattg gttgagcgaa acactaagag ataacacccc aatggtggtc
    4001  ccaatacaga atggaatcga tagattacgc agtagagcgg gtttactcgg
    4051  agtcccttgg cgtttaggcc tctcagagct cattgagaga gaggacatac
    4101  ccaaggaagt cggcatggct ttactcaatt cgcacgcagc gggaccgggt
    4151  ctcatcacac gtgattacag ttcatttaca gttacaccca aaccgcctac
    4201  aataactagt acacttgaat acaccgcgac tcgttatggt gtccaagacc
    4251  tgtccaaaca cgtaccttgg aaacaactca cattacaaga atgcaataaa
    4301  ttaggtcaac agattaagaa aatgagccac aggcattgta gccaggccaa
    4351  gataacttac aaatgcgtct acgaggtttt taaacctaat ggactcccta
    4401  cggtgttatc tgaggtcagc caaccatcgt tgtcgatggt gtggtggcag
    4451  gcaatgctta aggaagcaat gcaggactat tctgttaaga agatagatgc
    4501  tcaaatgttc gcgagtaacg catgtacaag ctccgttagc ggagatgcgt
    4551  ttttacaagc gacacctaaa atggctggcg ttctaatgac tagcctcatc
    4601  tattcttctt cataacgtac agcaaagtct ctatagttgc tcaagactta
    4651  atgagccaga tggtctcact ataccttc