Sequence of DPV Trichomonas vaginalis virus 3

Trichomonas vaginalis virus 3 strain TVV3-UR1, complete genome.

ACC No: HQ607515

Dated: 2011-05-08 | Length: 4845 | CRC: 333657510

                
ID   HQ607515; SV 1; linear; genomic RNA; STD; VRL; 4845 BP.
XX
AC   HQ607515;
XX
DT   08-MAY-2011 (Rel. 108, Created)
DT   08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE   Trichomonas vaginalis virus 3 strain TVV3-UR1, complete genome.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 3
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4845
RX   DOI; 10.1128/JVI.00220-11.
RX   PUBMED; 21345965.
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA   Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA   Singh B.N., Fichorova R.N., Nibert M.L.;
RT   "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by
RT   Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)";
RL   J. Virol. 85(9):4258-4270(2011).
XX
RN   [2]
RP   1-4845
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA   Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT   ;
RL   Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases.
RL   Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL   Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .4845
FT                   /organism="Trichomonas vaginalis virus 3"
FT                   /host="Trichomonas vaginalis"
FT                   /strain="TVV3-UR1"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="Jun-1999"
FT                   /db_xref="taxon:170965"
FT   gene            363. .4693
FT                   /gene="pol"
FT   CDS             join(363. .2448,2448. .4693)
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /protein_id="AED99800.1"
FT                   /translation="MSAPEPLNTEVRSPNGVSEAKETQNLAITQSGVSNEKITDTQSDL
FT                   QTLKKQLQPVSRSTDFETLYNYFYGLQVPASTDRIGNAIQRNIPVNDTNEVVSFPLTAS
FT                   VSHTFSNTPVPAHIQPLQISVADDCXNYELDESGTLCPALDSSVHVQRATSLASALKVK
FT                   LTGEVMHSASVRPIQTPQLIAYLYGVLLAVQDRINIHRNQPTNLWRSLCAPGRAAQAKP
FT                   FFDEFANNKFRAGPLLAPPLPDAGFGPFPAEGLNQNSKLDFKSKGYIFYKQRTYNPDDM
FT                   NRAFWFLWAIYNRMPGEFQQYYQLNITFCTSELPVQNPIPNADGISNEQCEKALLLLEK
FT                   IILELFNNDRKLAYYYIFKGSQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA
FT                   DQYANMISCAAHPGVIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV
FT                   ADENYWGIAPGSNAEPVGMYYMDIIQRKSEHNLFVDTFMDIYGSTASIICANIETSLFT
FT                   SGTNVLNERMQKDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLSGSFEV
FT                   DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGLRPIQQKRKKMCYFRTLDGNFHEVT
FT                   IRSETRDLQVWRDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLVSGFAIPGRAY
FT                   QGPSLAGSNRGRPDLSDVPATGSLSNLIYLSKASRLPYRKLKEGVRAADYTVARELACA
FT                   FRSSRLTRQMDHVTDIAYLNFLRWVLLPYNGQTLRPHPTMWRQTPYPEHVNLKFLSKET
FT                   ELELFPLKKAPQADLKVNCYARNILASTELTDDILKQSLPIGLNNDSVCGIVIVLELLL
FT                   IAGVPSKLLPVIGQAIANKDPFIKELSDFNKMIGATTSRIANILTECNTLIGRGVKSSD
FT                   PSADLYHRVAPEGNRHEAKISRHILIEAINKIYKNEMTDMPPPGDFMLHLITSPLWCKA
FT                   GSHHHPHFAKYDSRLEFVMDVPADKIAVEAPSVYITQAEKLEHGKTRYIYNCDTVSYLF
FT                   FDYILHYVECVWSNESVLLNPAAMSVERFSILDYPQYCMIDYTDFNSQHSLESQKLVFE
FT                   CLRPYLPSEMHPVLDWCIASFEHMEINGQHWLSTLPSGHRATTFINSVLNKAYLIPYIG
FT                   DAVSFHCGDDVLLCGEYDYQTLIDTLPYELNKSKQSFGPNAEFLRLHRCGGDVIGYPSR
FT                   AVSSLVSGNWLSKTSWEWQPSLISVTNQCNVIISRSQLNIRFIPAMQQELRNRYADKMS
FT                   EPFDVSSNYYVMPGCPCYSDAATTIVPNVPQLERSDAPFSQAQKVFDAMRDFCPEFTTV
FT                   GDVIDKVRARRSSSAVKNIMYDVCAPVAPRISIVVNPAHYQFLLRKKYYPREHIAPTGS
FT                   DNTDRTKLVFATYDLAPSIAMKSCAVLTPAKIISGHGLRSG"
FT   gene            363. .2489
FT                   /gene="cap"
FT   CDS             363. .2489
FT                   /codon_start=1
FT                   /gene="cap"
FT                   /product="capsid protein"
FT                   /protein_id="AED99799.1"
FT                   /translation="MSAPEPLNTEVRSPNGVSEAKETQNLAITQSGVSNEKITDTQSDL
FT                   QTLKKQLQPVSRSTDFETLYNYFYGLQVPASTDRIGNAIQRNIPVNDTNEVVSFPLTAS
FT                   VSHTFSNTPVPAHIQPLQISVADDCXNYELDESGTLCPALDSSVHVQRATSLASALKVK
FT                   LTGEVMHSASVRPIQTPQLIAYLYGVLLAVQDRINIHRNQPTNLWRSLCAPGRAAQAKP
FT                   FFDEFANNKFRAGPLLAPPLPDAGFGPFPAEGLNQNSKLDFKSKGYIFYKQRTYNPDDM
FT                   NRAFWFLWAIYNRMPGEFQQYYQLNITFCTSELPVQNPIPNADGISNEQCEKALLLLEK
FT                   IILELFNNDRKLAYYYIFKGSQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA
FT                   DQYANMISCAAHPGVIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV
FT                   ADENYWGIAPGSNAEPVGMYYMDIIQRKSEHNLFVDTFMDIYGSTASIICANIETSLFT
FT                   SGTNVLNERMQKDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLSGSFEV
FT                   DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGLRPIQQKRKKMCYFRTLDGNFHEVT
FT                   IRSETRDLQVWRDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLVSGFAIPGRAY
FT                   QGPRSQVATEAAQI"
XX
SQ   Sequence 4845 BP; 1296 A; 1281 C; 1016 G; 1251 T; 1 other;

hq607515 Length: 4845  08-MAY-2011  Type: N  Check: 3649  ..

       1  gcttaaaaag cgaagtccac tttttaagcc ggtttaactt caaccgtgaa
      51  taccagggca aaattaatca acaccctcct ggaatcgccg gggtgttgcg
     101  agccataaga gactggttct aaaggactga catagcgccg cgagggtagg
     151  cggtcgatag cccgtttgag ggggtagtaa tactcctgat tctggtgaag
     201  catcgactgg ggccccctag cgtgagctca gcacgttgga aaaacgaaaa
     251  actgcatgtg cacagctttg cagtagcgtg agctcagggc accctaaaaa
     301  gtgctccgtt tcacaacaac ctatgcgttg ttgtgagact ctagtgtatt
     351  gcgtgcaacg gtatgtcagc tcccgagccc ttaaatactg aagtacgctc
     401  acctaatggt gttagtgaag ccaaagaaac tcaaaacttg gctatcactc
     451  aaagcggtgt gtcgaacgaa aaaataaccg acacacaaag tgatctgcaa
     501  acactcaaaa aacagttaca accggtcagc agatccacag atttcgaaac
     551  tctttataat tatttttatg gtttacaagt tcctgcttca acagatcgta
     601  ttggcaatgc tattcagcgt aacatcccag tcaatgatac gaacgaagtc
     651  gttagctttc cgcttacagc atcggtttca cacacatttt ccaatacgcc
     701  ggtacctgcc catatacagc ctctccaaat ctcagttgcc gacgactgcr
     751  tcaactacga gctagacgag agcggaacat tatgcccagc gttagatagc
     801  tctgttcacg tccaaagagc tacctccctt gctagcgctc tcaaagtcaa
     851  attaacaggc gaagttatgc attctgcctc agtcaggcca atccagacac
     901  cacagttgat cgcttatttg tatggcgtcc tccttgccgt ccaagatcgt
     951  atcaacatcc atcgcaacca acctactaac ttatggcgca gcttatgtgc
    1001  acctggtcgc gctgctcaag caaagccttt cttcgatgaa ttcgcaaaca
    1051  acaaattcag ggcaggtccc ctcttggcac ctcccctccc tgatgctggt
    1101  ttcggtccat tcccggcaga aggcctcaat cagaattcca agctcgactt
    1151  caaatccaaa ggatacatct tctacaaaca gcgcacttac aacccagatg
    1201  atatgaatcg tgctttctgg ttcctttggg cgatctacaa ccgcatgcct
    1251  ggagaattcc aacaatacta tcagttgaac atcactttct gcacttccga
    1301  gttaccagta caaaatccga taccaaatgc cgatggcatc tcaaatgaac
    1351  aatgtgaaaa agcacttctc ctcctcgaaa aaattatcct cgaacttttc
    1401  aataacgatc gcaaacttgc ttactactac atcttcaagg gaagccaatt
    1451  cgttatgcgt ccttgttcct gctaccaaga aggaggctta attcgcaagg
    1501  cttcacgtaa cgttgctctc cgtgctttta ctggcatcta ctacttagcc
    1551  ggattcgcag atcaatacgc taacatgatt tcatgtgctg cccatccagg
    1601  tgtcattggt gctcttttcc aatatgttga cacaatggtc ttacaagccg
    1651  tcttctcgct ttctggtcct aagcttgttc gtttcgctgc cccacccgaa
    1701  tatcaaggtc gccacgcttg cccgttctca tttgtggctg acgaaaacta
    1751  ctggggtatt gctccaggct caaacgccga gccagttggc atgtactaca
    1801  tggatatcat tcaacgcaaa tccgaacata atttgtttgt cgacacattc
    1851  atggacatct atggttctac agcttcaatc atttgcgcta atatcgaaac
    1901  tagcttattt acttctggca ctaatgtttt gaacgaacgc atgcagaagg
    1951  atttcgctcg tgatacaccc aaacctggaa ctcttcgtca ccaacatgct
    2001  atcatcaacc gcttccacga accagaatac gcttaccgcc tcggcatcct
    2051  tgcagatggc atcattccgc tcagtggctc attcgaagtc gatatcctca
    2101  aagaagctga gcgcctcatt actggtgagg atatccgcaa cctcccaggt
    2151  ttacgttgct tatgctctcg tggcctcgac gctatcctcg gtctccgtcc
    2201  aattcaacag aagcgcaaga agatgtgtta cttccgcacc ctcgacggca
    2251  atttccacga agtaacaatc agatcggaga ctcgcgatct acaggtctgg
    2301  cgtgaccacg gctacctcgc ccgcccatac gcgtgccaca ttgtcgactc
    2351  agatggcatc gaattctacg acaagtccaa tggtctctac aagggacgcg
    2401  tcaacgtcct cgtttccgga tttgccattc caggacgcgc atatcagggc
    2451  cctcgctcgc aggtagcaac agaggccgcc cagatctaag cgacgtcccg
    2501  gcgacaggaa gtctgtccaa cctcatctac ctttctaagg caagtcggct
    2551  accataccgt aagctgaagg aaggcgtgag agcggcagac tacaccgtcg
    2601  cccgcgagtt agcttgcgct tttcgcagtt ctcgcctaac tcgccaaatg
    2651  gatcatgtca cagatatagc ttaccttaat ttcttgagat gggtgttgtt
    2701  accttacaac ggtcaaactt tacgaccaca ccccaccatg tggcgtcaaa
    2751  caccctaccc cgaacatgtc aatttgaagt tcctaagtaa ggagacggag
    2801  ctcgaacttt tcccactgaa gaaggcccca caagccgatc ttaaagtgaa
    2851  ttgttacgcg cgaaatatcc ttgcttctac agagcttact gacgatatac
    2901  tcaaacagag tttgcccatt ggtctcaata atgactcggt ttgcggaatc
    2951  gttattgttt tagagctact tctaattgca ggtgttccga gtaagttact
    3001  accagttatt ggtcaagcaa tcgccaataa agatccattt attaaggaac
    3051  tgtccgactt caacaagatg ataggagcga ccacttcccg tatcgctaac
    3101  attcttacag agtgtaatac attaataggt cgtggtgtta agtcatctga
    3151  cccaagtgct gatttgtatc accgggtagc gcccgagggc aataggcacg
    3201  aggcgaagat ttctcgacac atcctcatcg aagccatcaa caaaatttac
    3251  aaaaacgaaa tgacagacat gcctccaccg ggtgacttca tgctccactt
    3301  gataacgagc cctctatggt gtaaggctgg ctctcaccat catccacact
    3351  tcgccaagta tgattcgcgc ttggaattcg ttatggatgt tccagcagac
    3401  aaaatcgctg ttgaagcacc ctctgtatac attactcaag ccgagaaatt
    3451  agaacatggt aaaactagat acatttataa ctgtgataca gtttcatact
    3501  tgttctttga ttacatctta cactatgtcg aatgtgtgtg gtcaaatgag
    3551  tcagttctac tcaacccagc tgctatgagt gtcgagcgct ttagtatctt
    3601  ggattacccg caatattgca tgatcgatta cacagatttc aactctcaac
    3651  acagtctcga atcacagaag ctagtgttcg agtgtttgag accatactta
    3701  ccaagcgaaa tgcatccagt cttggattgg tgtattgcca gctttgagca
    3751  catggaaatc aacggacaac attggttaag cacgttgcct tcaggacata
    3801  gggccacaac attcatcaac tcggtcctca ataaagcata cctgatccca
    3851  tacataggcg acgcggtttc cttccattgt ggtgacgacg tgttactatg
    3901  tggtgagtat gattaccaaa cactcattga taccctaccc tatgaattaa
    3951  acaagagcaa acagagcttc ggacctaatg ccgagttctt gcgcttgcat
    4001  aggtgtggtg gtgacgttat aggctatcca tccagagctg tttcgagtct
    4051  tgtatctgga aattggttaa gcaagacatc atgggagtgg cagccaagtc
    4101  tcatttcggt tacaaatcaa tgcaatgtga tcatctcgcg ttcacaattg
    4151  aacatcaggt tcatccccgc aatgcaacaa gaactacgca atcgttacgc
    4201  agacaagatg agtgaacctt tcgatgtcag ctccaactac tacgtcatgc
    4251  caggatgtcc atgctatagc gacgccgcga cgacaatagt accgaatgtt
    4301  ccccaactgg aacgttcgga cgcaccgttt tcgcaggcac aaaaagtttt
    4351  tgatgctatg cgcgacttct gtcctgagtt cactactgtt ggcgatgtca
    4401  tcgataaggt tagagctcgc cgatcttcaa gtgcagtcaa gaacatcatg
    4451  tacgacgtat gcgcgcctgt tgcaccacgt atcagtatcg tagtgaaccc
    4501  ggcacactat cagttcctct tacgcaagaa gtactaccca cgtgaacaca
    4551  ttgcgcctac tggctccgat aatacagatc gaaccaaact cgttttcgca
    4601  acatacgatc tcgctccttc aatcgccatg aagtcgtgcg ctgttttgac
    4651  cccggctaag ataataagtg gtcacggact acgcagtggt tgaataatct
    4701  gccagtacca ggcaacgatt ggtaccggct tggccacgca cggtctgctg
    4751  tcttcggacc ctccgcctat aggttaatag gaacacagtg ttactgttgt
    4801  gtgtatcgct ctaggcacac gaacgtacta ccccacgttt agttc