Sequence of DPV Trichomonas vaginalis virus 3

Trichomonas vaginalis virus 3 strain TVV3-OC5, complete genome.

ACC No: HQ607525

Dated: 2011-05-08 | Length: 4842 | CRC: 1948420080

ID   HQ607525; SV 1; linear; genomic RNA; STD; VRL; 4842 BP.
XX
AC   HQ607525;
XX
DT   08-MAY-2011 (Rel. 108, Created)
DT   08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE   Trichomonas vaginalis virus 3 strain TVV3-OC5, complete genome.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 3
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4842
RX   DOI; 10.1128/JVI.00220-11.
RX   PUBMED; 21345965.
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA   Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA   Singh B.N., Fichorova R.N., Nibert M.L.;
RT   "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by
RT   Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)";
RL   J. Virol. 85(9):4258-4270(2011).
XX
RN   [2]
RP   1-4842
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA   Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT   ;
RL   Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases.
RL   Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL   Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4842
FT                   /organism="Trichomonas vaginalis virus 3"
FT                   /host="Trichomonas vaginalis"
FT                   /strain="TVV3-OC5"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="Jan-2010"
FT                   /db_xref="taxon:170965"
FT   gene            360..4690
FT                   /gene="pol"
FT   CDS             join(360..2445,2445..4690)
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /protein_id="AED99804.1"
FT                   /translation="MSAPEPLNTEVRSPDGVSEATETQNLAITQSRVSNEKITDTQSDL
FT                   QTLKKQSQPVSRSTDFETLYNYFYGLDVSPSTDRIGNAITRNTPVTDTNEVVSFPLTAS
FT                   VSHTFSNSPVPAHIQPLQISIADDCINYELDESGTLCPALDSSVHVQRATSLASALKVK
FT                   LTGEIMHSSSVRPIQTPQLIAYLYGVLLAVKDRINIHRNQPTNLWRSLCAAGRAAQAKP
FT                   FFDEIPNNKFRPGALVAPPLPEAGFGPFPAEGLNQNSKLDFKAKAYVFYKQRTYNPDDM
FT                   NRAFWFIWAIYNRMPNDFQNSYPLNITFCTSELPVQSPMPTADGISAEQCDKALLLLDK
FT                   IVLEFFNNDRKLAYYYVFKGSQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA
FT                   EQYANMISCASHPGIIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV
FT                   ADENYWGIAPGSNAEPVGMYYTEIIQRKTEHNLFTETFMDIYGSTASVICANIETSLFT
FT                   SGTEVINQRMQNDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLSGSFEV
FT                   DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGLRPIQQKRKKMCYFRTLDGNFHEVT
FT                   IRSETRDLQVWRDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLISGFAIPGRAY
FT                   QGPSLAGSNRGRPDLSDVPATGSLSNLISLSKASRLPYRKRQNGVRVSDYTVARELACA
FT                   FRNSRLTRQMDHVTDIAYLNFLRWVLLPYNGQTVRPHPTEWGQTPYPEHVNLKFLSKEM
FT                   ELELFPLKKAPQADLKVNCYARNILASTELTDDLLKRCLPVGLNNDSVCGIVIVLELLL
FT                   IAGVPSKLLPVIGQAIANKDPFIKELSDFNKMIGATSSRIANILTECNTLIGRGVKSSD
FT                   PSADLYHRVAPEGNRHEAKISRHILIEAINKIYKNEMTDMPPPGDFMLHLITSPLWCKA
FT                   GSHHHPHFAKYGSRLEFVMDVPADKIAAEPPAVYITQAEKLEHGKTRYIYNCDTIAYLF
FT                   FDYILHYVECVWSNESVLLNPAAMSVERFSVLDYPEYCMIDYTDFNSQHSLESQKLVFE
FT                   CLRPYLPSEMHPILDWCITSMDHMEINGQHWLSTLPSGHRATTFINSVLNKAYLIPYIG
FT                   DTVSFHCGDDVLLCGEYDYQTLIDTLPYELNKSKQSFGPNAEFLRLHRRGGDVIGYPSR
FT                   AVSSLVSGNWLSKTSWEWQPSLISVTNQCNVIISRSQLNIRFIPAMQQELRNRYADKMS
FT                   EPFDVSSDYYVMPGCPCYSDAATTIVPNVPKLEHSDVPFSQAQKLFDTMRDSCPEFTTV
FT                   NDIIDKVRARRSSSAASNITYNVGSPVAPQVCVVVNPNHYQFLLRKRYYPREHIAPPGF
FT                   DASNDSKLVFTTYDLAPSIAMKSCAVLAPAKIICGHGLRSG"
FT   CDS             360..2486
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="capsid protein"
FT                   /protein_id="AED99803.1"
FT                   /translation="MSAPEPLNTEVRSPDGVSEATETQNLAITQSRVSNEKITDTQSDL
FT                   QTLKKQSQPVSRSTDFETLYNYFYGLDVSPSTDRIGNAITRNTPVTDTNEVVSFPLTAS
FT                   VSHTFSNSPVPAHIQPLQISIADDCINYELDESGTLCPALDSSVHVQRATSLASALKVK
FT                   LTGEIMHSSSVRPIQTPQLIAYLYGVLLAVKDRINIHRNQPTNLWRSLCAAGRAAQAKP
FT                   FFDEIPNNKFRPGALVAPPLPEAGFGPFPAEGLNQNSKLDFKAKAYVFYKQRTYNPDDM
FT                   NRAFWFIWAIYNRMPNDFQNSYPLNITFCTSELPVQSPMPTADGISAEQCDKALLLLDK
FT                   IVLEFFNNDRKLAYYYVFKGSQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA
FT                   EQYANMISCASHPGIIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV
FT                   ADENYWGIAPGSNAEPVGMYYTEIIQRKTEHNLFTETFMDIYGSTASVICANIETSLFT
FT                   SGTEVINQRMQNDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLSGSFEV
FT                   DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGLRPIQQKRKKMCYFRTLDGNFHEVT
FT                   IRSETRDLQVWRDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLISGFAIPGRAY
FT                   QGPRLQVATEAAQI"
XX
SQ   Sequence 4842 BP; 1296 A; 1312 C; 997 G; 1237 T; 0 other;

hq607525 Length: 4842  08-MAY-2011  Type: N  Check: 2840  ..

       1  gcttaaaagc ttagtccact tttaagccgg tcatacttca accgtgatac
      51  cggggcaaaa ttaatcaaca ccctcctgga atcgccgggg tgttgcgagc
     101  cataagagac tggttctaaa ggactgacat agcgccgcga gggtaggcgg
     151  tcgatagccc gtttgaggga atagcaatat tcctgattct ggtgtagcat
     201  cgactggggc cccctagcgt gagctcagca cgttgggaaa acgaaaaact
     251  gcatgcgcac agccttgcag tagcgtgagc tcaaggcacc ctaaaaagtg
     301  cctcgtttca tgacgaactt tatgtcgtta tgaaatacta gtgtattgcg
     351  tgcaacggta tgtcagctcc cgagccctta aatactgaag tacgctcacc
     401  tgatggtgtt agtgaagcca cagaaactca aaacttggct atcactcaaa
     451  gccgtgtgtc aaacgaaaaa ataactgaca cacaaagtga tctgcaaaca
     501  cttaaaaaac agtcacaacc ggtcagcaga tccacagatt ttgaaactct
     551  ttataattat ttttatggtt tagatgtctc tccttcaaca gatcgcattg
     601  gtaatgcaat tacccgcaat accccagtca ctgatacgaa tgaggttgtt
     651  agttttccac tcactgcatc tgtttcacac acattttcga attcgccagt
     701  tccagctcac atacagcctc tccaaatttc tattgctgat gactgcatca
     751  actacgagtt agatgagagc ggaacgttat gcccagcgct tgatagttct
     801  gttcacgtcc agagagccac ttctcttgct agcgctctca aggtcaagtt
     851  aacaggcgaa attatgcatt cttcatcagt tagaccaatt caaactcctc
     901  aattaattgc ttatttatac ggtgttctcc ttgctgtcaa agaccgcatt
     951  aacattcatc gtaatcagcc tacgaattta tggcgtagct tatgtgcagc
    1001  aggtcgcgca gcccaagcaa agccgttctt cgatgaaatt cccaacaaca
    1051  agttcaggcc cggtgccctc gtcgcacccc ctcttcctga agcaggattc
    1101  ggtcctttcc cagctgaggg ccttaaccaa aattctaagc tcgatttcaa
    1151  agcaaaagca tacgtcttct acaagcaacg cacctacaat ccagatgaca
    1201  tgaatcgcgc attctggttt atctgggcaa tttacaaccg tatgcccaat
    1251  gacttccaaa attcgtaccc actcaacatc actttctgca cttccgagct
    1301  accagtccaa agcccgatgc caacagctga tggaatttcc gccgaacaat
    1351  gcgataaagc gctccttcta cttgacaaaa tcgttctcga attcttcaat
    1401  aacgaccgca aactcgctta ctactatgtg ttcaaaggaa gccagttcgt
    1451  tatgcgtcct tgttcatgtt atcaagaagg aggcttgatc cgcaaggcct
    1501  cacgcaatgt cgctcttcgc gcttttactg gcatctacta tctcgccgga
    1551  ttcgccgaac aatacgctaa catgatttca tgcgcctccc atccaggaat
    1601  catcggcgcc cttttccaat acgtcgacac tatggtctta caggccgttt
    1651  tctctctttc cggccccaag cttgttcgct tcgcggctcc acctgaatat
    1701  cagggtcgtc acgcttgtcc attttccttc gtcgccgatg aaaactattg
    1751  gggcattgct cccggctcaa atgccgaacc agtcggtatg tattacacgg
    1801  aaattatcca acgcaaaacc gagcacaatc tgttcaccga aacattcatg
    1851  gatatctacg gttcgactgc ctccgtcata tgcgcaaata tcgaaacaag
    1901  cttgttcaca tccggcactg aagttataaa ccagcgcatg caaaacgatt
    1951  tcgctcgcga cacgccaaag cctggaaccc ttcgccacca gcatgccatc
    2001  atcaatcgct tccacgaacc cgaatatgct taccgccttg gcatcctcgc
    2051  tgatggcatt attccgctta gcggctcttt cgaagtcgac atcctcaaag
    2101  aagctgaacg cctcatcaca ggcgaagaca tccgcaatct cccaggttta
    2151  cgttgcttat gctctcgcgg tctcgacgcc atcctcggtc tccgtccaat
    2201  ccaacagaaa cgcaagaaga tgtgttactt ccgcacactc gatggcaact
    2251  tccatgaagt aacaatcaga tcggagactc gcgatctaca ggtctggcgt
    2301  gatcatggct acctcgctcg cccatacgcg tgccacatcg ttgattcaga
    2351  tggcatcgaa ttctacgaca aatccaacgg tctctataag ggacgcgtca
    2401  acgttctcat ttccggattt gccattccag gtcgcgcata tcagggccct
    2451  cgcttgcagg tagcaacaga ggccgcccag atctaagcga cgtcccggcg
    2501  acaggaagtt tgtccaacct catcagcctt tctaaagcaa gtcggctacc
    2551  ataccgtaag cggcagaatg gcgtgagagt gtcagactac accgtcgccc
    2601  gcgagttagc ttgcgctttt cgcaattctc gcctaactcg ccaaatggat
    2651  cacgtcacag atatagctta tctcaatttc cttagatggg tgttgttacc
    2701  ttacaacggt caaaccgtac gaccacaccc caccgagtgg ggtcaaacac
    2751  cctaccccga acacgtcaat ttgaagttct taagcaagga aatggagctc
    2801  gaacttttcc cactgaagaa ggccccacaa gccgatctta aagtgaattg
    2851  ttacgcgcga aacatccttg cttccacaga gctaacagat gatctcctca
    2901  aacggtgtct gccagtcgga ctcaacaatg attcagtttg cggaattgtc
    2951  atcgttttag agctgcttct cattgcaggt gtcccaagta agttattacc
    3001  agtcattggc caagccatcg ccaacaaaga tccatttatt aaagaattgt
    3051  ccgatttcaa taagatgata ggagcgacct cctcacgtat cgccaatatt
    3101  ctcacagaat gtaacacatt gataggtcgc ggagtcaagt catctgaccc
    3151  aagtgctgat ttgtatcacc gggtagcgcc tgagggcaat aggcacgaag
    3201  cgaagatttc tcgacacatc ctcatcgaag ccatcaacaa aatctacaaa
    3251  aacgaaatga cagacatgcc tccaccaggt gatttcatgc tccacttaat
    3301  aacgagccct ctatggtgta aggctggctc tcaccaccat ccacactttg
    3351  caaagtacgg ttcacgctta gaattcgtca tggacgttcc agcagacaaa
    3401  atcgctgctg agccgcccgc tgtttacatt actcaagcgg agaaactaga
    3451  acacggtaag actaggtaca tttacaactg cgatacaatt gcatacctat
    3501  tcttcgatta catcttgcac tatgtcgagt gtgtatggtc aaacgagtca
    3551  gttttactca acccagctgc tatgagtgtt gagcgattca gtgtcttaga
    3601  ttacccggag tactgcatga tcgattacac agacttcaac tctcaacaca
    3651  gcttagaatc acagaagcta gtttttgagt gtttgagacc atacttacca
    3701  agtgaaatgc acccaatcct cgattggtgt atcaccagca tggaccatat
    3751  ggaaattaac ggccagcatt ggttaagtac gctaccctca ggacatagag
    3801  ctacgacatt tatcaactcg gtcctgaata aagcttactt aatcccttac
    3851  ataggtgaca ccgtttcctt ccattgtggt gacgacgtgt tactatgtgg
    3901  tgagtacgat taccaaaccc tcattgatac cctgccctat gaattaaaca
    3951  agagcaaaca gagctttgga cctaatgccg agttcttgcg cttgcatagg
    4001  cgcggtggtg acgttatagg ttatccatca agagctgttt cgagtcttgt
    4051  atctggaaat tggttaagca aaacgtcatg ggagtggcaa ccaagcctca
    4101  tttcggtcac taatcaatgc aatgttatta tctcgcgttc acaattgaat
    4151  atcagattta tccccgccat gcaacaagaa ctacgtaacc gctacgcgga
    4201  caagatgagc gaaccattcg atgttagttc ggattactac gtcatgccag
    4251  gttgtccctg ctatagtgac gccgcgacga caatcgtgcc gaatgtcccc
    4301  aaattggaac attcagacgt accgttttcg caggcacaaa aactttttga
    4351  tactatgcgc gactcctgtc ctgagttcac aactgttaac gacatcatcg
    4401  acaaagttag agctcgccgg tcttccagtg ctgccagtaa catcacgtac
    4451  aacgtcggct cacctgtcgc acctcaagtt tgcgtagtcg taaatccaaa
    4501  tcattaccag ttccttttgc gcaagagata ctacccacga gagcatattg
    4551  ctccaccagg cttcgacgca tccaacgact caaaactcgt tttcacgact
    4601  tacgatctcg ctccttcaat cgctatgaaa tcgtgcgctg ttttggcccc
    4651  ggcaaagata atatgcggcc acggactacg cagtggttga gtagttctgt
    4701  cgtaccaagc cacacttggt accggatagg ccacgaacgg tctcctgtct
    4751  tcggaccctt cgcctatagg ttaataggaa tacagtgtta ctgttgtgtg
    4801  tatcgcttta ggcacacgaa cgtactaccc cacgtttagt tc
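
The two CDS locations in the feature table can be checked with a little arithmetic. The sketch below is an illustration only, not part of the database record: the helper names `location_length` and `codon_count` are hypothetical, and it assumes locations are simple `start..end` spans, optionally wrapped in `join(...)`, with 1-based inclusive coordinates. Because the second segment of the polymerase CDS starts at 2445, the same base that ends the first segment, that base is counted twice, which is how the record expresses the ribosomal frameshift noted in the feature table.

```python
import re

def location_length(location: str) -> int:
    """Nucleotides covered by a simple EMBL feature location.

    Handles 'start..end' and 'join(a..b,c..d)' only (1-based, inclusive).
    A stray space between the dots ('360. .2486') is tolerated.
    """
    spans = re.findall(r"(\d+)\.\s*\.(\d+)", location)
    if not spans:
        raise ValueError(f"no start..end span found in {location!r}")
    return sum(int(end) - int(start) + 1 for start, end in spans)

def codon_count(location: str) -> int:
    """Number of codons, assuming the location is read in one frame."""
    length = location_length(location)
    if length % 3 != 0:
        raise ValueError(f"{length} nt is not a whole number of codons")
    return length // 3

# Locations copied from the feature table of this record.
features = {
    "capsid protein": "360..2486",
    "RNA-dependent RNA polymerase": "join(360..2445,2445..4690)",
}

for product, location in features.items():
    nt = location_length(location)
    codons = codon_count(location)
    # The last codon of each CDS is a stop codon, so residues = codons - 1.
    print(f"{product}: {nt} nt, {codons} codons, {codons - 1} residues")
```

Under these assumptions the capsid CDS spans 2127 nt (709 codons) and the joined polymerase CDS spans 4332 nt (1444 codons). After subtracting the stop codon this gives 708 and 1443 residues, which matches the lengths of the two /translation strings above.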