Sequence of DPV Trichomonas vaginalis virus 1

Trichomonas vaginalis virus 1 strain TVV1-OC4, complete genome.

ACC No: HQ607521

Dated: 2011-05-08 | Length: 4680 | CRC: 248096124

                
ID   HQ607521; SV 1; linear; genomic RNA; STD; VRL; 4680 BP.
XX
AC   HQ607521;
XX
DT   08-MAY-2011 (Rel. 108, Created)
DT   08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE   Trichomonas vaginalis virus 1 strain TVV1-OC4, complete genome.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 1
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4680
RX   DOI; 10.1128/JVI.00220-11.
RX   PUBMED; 21345965.
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA   Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA   Singh B.N., Fichorova R.N., Nibert M.L.;
RT   "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by
RT   Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)";
RL   J. Virol. 85(9):4258-4270(2011).
XX
RN   [2]
RP   1-4680
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA   Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT   ;
RL   Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases.
RL   Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL   Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .4680
FT                   /organism="Trichomonas vaginalis virus 1"
FT                   /host="Trichomonas vaginalis"
FT                   /strain="TVV1-OC4"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="Dec-2009"
FT                   /db_xref="taxon:674953"
FT   gene            325. .4615
FT                   /gene="pol"
FT   CDS             join(325. .2351,2353. .4615)
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /protein_id="AED99818.1"
FT                   /translation="MEASANGLSHDENATRSQNVGPSTLPGSDKQGGEKHENSFNSFSN
FT                   DFFFNFLRMSAQTHISDNPGVSFIGKDGTPYSSATIPSAVGRLTHNVVASAVQLNVTAD
FT                   NVLEVDYGFGQDVSRSTGTITIPIFDGEKYKETARALAAIFNKKGMAVDVTSQTVQETL
FT                   KNSDLTIATVAAGYYTALAARHELTKQVSVAAHSIPFVTAISDTLAAAQDAQRSSHVIS
FT                   SCLRCPHCNNAQHDIGIGTNMWNNVSVESLSPQNMAVPNPNDISFFIPNKALPPPWWCA
FT                   IWLLNAFIHSFVAPTHIHIFITPGETYHLAPFTDADIYEAIPIMLAMSKAARPVPESVE
FT                   SMLYAYGTQMIIQPHSLYTEGGLIRKMIFTVPHLPAHGYFVTNSEYSRYMNIAVPNDPR
FT                   SAKDFIIGAGTGLLQIILAYQAAFSCAGPIALHWHANDAISQGMDTIAGTYLEGRYFTI
FT                   PMAVAVATNVAQYTTLIRTDPQYRHTLERILPRIFGPSTDTVYNFIESAISSSWVSIDA
FT                   RRRNGRTRKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSTISQEVDYLLNGGDLR
FT                   NCPVLRTLKAAERDETITFMCKEKAGTLMAMDGTIRWFKRFETIDLTHLGWTSRGKVMK
FT                   PYAFRAPIIQGITICNTAYTTTAIDIVTTVFGPLRQRVGSLLSKAVRCGPIIPTVKHHF
FT                   NFKHVITTKRNDNEYIFIPGYGWVLQDDYLLNAVKMTGEGDLPPDQLPYDDDLLLSYAK
FT                   ILLYDYITHFPKHRYNNPKILTQETELQLFPLKDDSAARTKVNFYARLLWNEATSDKTA
FT                   FKPGTYNDTVAGLLMWQQCALMWSVPQSIINRVISGVCDALTDRTSLALLKRISDWLKQ
FT                   LGLAYSPIHRLFIELPTLLGRGAIPGDAIHDIKHRLKFDPSITVDVPTDQLHRLIYRLL
FT                   SRNLKVTTLDSFEDHLEERLLWSKSGSHYYPDDEVNKLLPHRPTRKEFLDIVTVDYIKR
FT                   CKPQVFIRQSRKLEHGKERFIYNCDTISYVYFDYILKLFETGWQDSEAILSPGDYTNER
FT                   LHAKISSYKYKAMLDYTDFNSQHTIESMRLIFETMKELLPSETAFALDWCIASFDNMRT
FT                   SSGHKWVATLPSGHRATTFINTVLNWCYTQMVGLKFDSFMCAGDDVILMSQEPISLAPI
FT                   LTSHFKFNPSKQSTGTRGEFLRKHYTAEGVFAYPCRAIASLVSGNWLSETLRDNTPMVV
FT                   PIQNGIDRLRSRAGLLGVPWSLGLSELIEREGIPKEVGMALLNSHAAGPGLITRDYSSF
FT                   TVTPKPPTITSTLEYTATRYGVQDLSKHVPWKQLTTEESRKLGQQIKKMSHRHCSQAKI
FT                   TYKCIYEVFKPSGLPTVLSGVSQPSLSMVWWQAMLKEAMQNYSVKKIDAQMFASNACTS
FT                   SVSGDAFLQATPKMAGVLMTSLIYSSS"
FT   gene            325. .2361
FT                   /gene="cap"
FT   CDS             325. .2361
FT                   /codon_start=1
FT                   /gene="cap"
FT                   /product="capsid protein"
FT                   /protein_id="AED99817.1"
FT                   /translation="MEASANGLSHDENATRSQNVGPSTLPGSDKQGGEKHENSFNSFSN
FT                   DFFFNFLRMSAQTHISDNPGVSFIGKDGTPYSSATIPSAVGRLTHNVVASAVQLNVTAD
FT                   NVLEVDYGFGQDVSRSTGTITIPIFDGEKYKETARALAAIFNKKGMAVDVTSQTVQETL
FT                   KNSDLTIATVAAGYYTALAARHELTKQVSVAAHSIPFVTAISDTLAAAQDAQRSSHVIS
FT                   SCLRCPHCNNAQHDIGIGTNMWNNVSVESLSPQNMAVPNPNDISFFIPNKALPPPWWCA
FT                   IWLLNAFIHSFVAPTHIHIFITPGETYHLAPFTDADIYEAIPIMLAMSKAARPVPESVE
FT                   SMLYAYGTQMIIQPHSLYTEGGLIRKMIFTVPHLPAHGYFVTNSEYSRYMNIAVPNDPR
FT                   SAKDFIIGAGTGLLQIILAYQAAFSCAGPIALHWHANDAISQGMDTIAGTYLEGRYFTI
FT                   PMAVAVATNVAQYTTLIRTDPQYRHTLERILPRIFGPSTDTVYNFIESAISSSWVSIDA
FT                   RRRNGRTRKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSTISQEVDYLLNGGDLR
FT                   NCPVLRTLKAAERDETITFMCKEKAGTLMAMDGTIRWFKRFETIDLTHLGWTSRGKVMK
FT                   PYAFRAPIIQGITICNTAYTTTAIDIVTTVFGPLRQRVGSLFE"
XX
SQ   Sequence 4680 BP; 1385 A; 1175 C; 938 G; 1182 T; 0 other;

hq607521 Length: 4680  08-MAY-2011  Type: N  Check: 2344  ..

       1  gcaaaaagag ggagtgatcc actttcctct ttttgcaccc aacattgtta
      51  catcatcatg acgaatccat aacgcggaca cataacaagc gtagtgtcct
     101  cgacgattgc catcctcgtg tgaattccgg gctccgcttg cactgatggt
     151  acctcttacg aaacttggat agacttcggc cttgaagagc ggtaatgtgc
     201  cctctgcgcc tgggacctaa tggcgtttat gctgtaggta atttcagtag
     251  taggaaggag aggggtaaac atcctggttc gctaggtttg tcctcgcctt
     301  aatctctact gatagcgaat acccatggag gcttctgcta atgggttatc
     351  acatgatgaa aatgcgacaa gatcgcaaaa tgttggacct tctactcttc
     401  cggggtcaga taaacaagga ggagaaaaac atgaaaattc ttttaattct
     451  ttttcaaatg atttcttttt taatttttta cgtatgtctg cacaaacaca
     501  catttccgac aatccaggtg tttctttcat aggtaaagac ggcacacctt
     551  attcatcagc cacaattcct tcagctgtag gccgtcttac acacaacgta
     601  gttgcatcag ccgtccagct caacgtcaca gccgataatg ttttagaagt
     651  cgattacggc ttcggtcaag acgtttcaag gtctacagga acaattacaa
     701  tcccaatttt cgacggcgag aaatacaaag aaactgctcg tgctttagcc
     751  gcaattttca acaagaaagg catggcagtt gacgtcacat cacagacagt
     801  ccaagaaact ctcaagaatt cggatcttac aattgctaca gttgcagccg
     851  gatattacac agccttagct gctcgccacg aacttaccaa acaagtaagc
     901  gttgcagctc attccatacc attcgttaca gcgatatcag atacactcgc
     951  agccgcacaa gatgcgcaac gctcaagcca tgttatctct tcttgcttgc
    1001  gttgccctca ttgcaataac gcacagcacg acatcggaat tggtacaaac
    1051  atgtggaata acgtttccgt cgaaagtctc tcaccacaga atatggcagt
    1101  tccaaatccc aacgacatat ccttcttcat tccgaataag gctctcccac
    1151  ccccttggtg gtgcgctatt tggcttctca acgcgttcat ccatagcttc
    1201  gtcgcgccga cacatatcca catcttcatc actccaggtg agacatatca
    1251  tcttgcaccg ttcacggatg ctgatatcta cgaagccatc cctatcatgc
    1301  tcgcaatgtc aaaggccgca cgtccagttc cagaaagcgt cgaaagtatg
    1351  ctttatgcat acggcacaca gatgattatc cagccacact cgctctatac
    1401  agaaggtgga ttaatcagaa agatgatatt cacagttcca catctcccag
    1451  cacacggcta tttcgtcacg aactccgaat actcaaggta catgaacatc
    1501  gcggttccaa acgaccctcg ttccgcaaaa gacttcatta tcggtgcagg
    1551  aacgggtctc ttacagatca tactcgctta ccaagctgct ttcagttgcg
    1601  ctggccctat cgcgcttcat tggcacgcaa acgacgctat ctcccaaggt
    1651  atggatacaa tcgcaggcac ttaccttgaa ggaaggtact tcacaatccc
    1701  tatggcagtc gccgtcgcta cgaatgtagc tcaatacaca acactgatca
    1751  gaacagatcc tcaatacagg cacacactcg aacggatctt accacgcata
    1801  ttcggtccat cgacagatac ggtctacaac ttcatcgaat ccgctatctc
    1851  gtcatcctgg gtatcaatcg atgctcgcag acgcaacggt cgcacaagaa
    1901  agttcagaac agccttcatc aatcgtttcc atgatccaga attcgcatac
    1951  atgttcggca tcaccggcaa cggtatcgag cgaatggaag gtaaagtcac
    2001  ttccaccatc agccaagagg tcgattacct cttaaacggc ggcgaccttc
    2051  gcaattgccc agtcctccgt actctcaaag cggcagaaag agacgaaaca
    2101  atcacgttca tgtgcaagga gaaagctggt acacttatgg ccatggacgg
    2151  aacgattcgc tggttcaagc ggttcgagac aattgatctc acccatctcg
    2201  gatggacatc acgtggtaag gtcatgaaac catacgcatt cagagctcca
    2251  atcatccaag gaatcacaat ctgcaacaca gcatacacaa caacagccat
    2301  cgacatcgtt actacagtct ttggcccatt acgtcagagg gtaggttccc
    2351  tttttgagta aggctgtacg ttgtggccct ataataccaa ccgtcaagca
    2401  tcatttcaat ttcaaacatg ttataacaac taaacgaaat gataacgaat
    2451  atattttcat tcccggttac ggttgggtat tacaggatga ttatttgctg
    2501  aatgccgtaa agatgactgg tgaaggcgat ttaccccctg accagttacc
    2551  ttacgatgat gatcttttac tttcatacgc aaaaatttta ctttatgatt
    2601  acataactca ttttcctaaa cacagataca ataatccaaa aatattgaca
    2651  caagaaacag aactacaact tttcccactc aaagacgact cagctgctag
    2701  aacaaaagtc aacttctacg ctaggttact atggaacgaa gcaacctcag
    2751  acaaaacagc tttcaaacca ggaacttaca acgatacagt agcaggctta
    2801  ttgatgtggc aacaatgtgc tctcatgtgg tccgtacccc agtctattat
    2851  caacagagta attagcggtg tttgtgatgc attaaccgat aggacttcac
    2901  tcgcgctatt gaaacgtatc tcagactggt tgaaacaact cggactagct
    2951  tactcaccga tacatcgcct tttcatagag ctccccacat tattaggacg
    3001  tggagccatc ccaggcgacg caattcacga tatcaagcac agactcaagt
    3051  ttgacccatc aattacagtc gacgtaccaa cagaccagtt acacaggcta
    3101  atctacagac tcttgtctcg aaacctcaag gtcactacgc tagacagttt
    3151  tgaagatcac ttagaggaac gtctactttg gtccaaatca ggaagtcact
    3201  attatcctga cgacgaagtc aataagttac ttcctcaccg cccaacaaga
    3251  aaagaattcc tagatatagt aacagtggac tatatcaaac gatgcaagcc
    3301  tcaagttttt atccgacaat cacgcaagct ggagcatggc aaggaacgct
    3351  tcatctacaa ttgtgacacg atctcttatg tctattttga ttacatcctg
    3401  aagctcttcg agacaggatg gcaagatagt gaagcaatat tgtcaccagg
    3451  tgattacact aacgaacgcc tccacgccaa gatctctagc tacaagtata
    3501  aagcaatgtt agattataca gatttcaatt cccaacatac gatcgaaagc
    3551  atgcgtttga ttttcgaaac catgaaggaa ctactcccgt cagagacagc
    3601  ttttgcactc gactggtgta tagcatcttt cgataacatg agaacatcca
    3651  gtggtcacaa atgggttgca acccttccta gcggacatcg tgctaccacc
    3701  ttcatcaata cagtattgaa ttggtgttac acgcagatgg tcggtctcaa
    3751  gtttgacagt tttatgtgcg ctggtgatga cgtcatctta atgtctcagg
    3801  aaccaatatc actagcccca attcttacat cacattttaa attcaatcct
    3851  agcaaacaaa gtacaggcac tagaggtgag ttcttacgta agcattacac
    3901  tgcagaaggc gtgtttgcat acccatgtag agcgatcgcg agtttagtaa
    3951  gtggaaattg gttgagcgaa acactaagag ataacacccc aatggtggtc
    4001  ccaatacaga acggaatcga taggctacgc agtagagcgg gtttactcgg
    4051  agttccttgg agtttaggcc tctcagagct cattgagaga gagggcatac
    4101  ccaaggaagt cggcatggct ttactcaatt cacacgcagc gggaccaggt
    4151  ctcatcacac gtgattacag ttcattcaca gttacaccca aaccacctac
    4201  gataactagt acacttgaat acacagcgac tcgttatggc gtccaagacc
    4251  tgtccaaaca cgtaccttgg aaacaactca caacagagga aagtcgcaaa
    4301  ttaggtcaac agattaagaa aatgagtcac aggcattgta gccaggctaa
    4351  gataacttac aaatgcatct acgaggtttt taaacctagt ggactcccta
    4401  cggtgttatc tggagtcagc caaccatcgt tgtcgatggt gtggtggcag
    4451  gcaatgctta aggaagcaat gcagaactat tctgtcaaga agatagatgc
    4501  gcaaatgttc gcgagtaacg catgtacaag ctccgttagc ggggatgcgt
    4551  ttttacaagc gacacccaag atggctggcg tcctaatgac tagcctcatc
    4601  tattcttctt cataacgtac agcaaagtct ctatagttgc tcaagactta
    4651  taatgagcca gttggtctca ctataccttc