Sequence of DPV Trichomonas vaginalis virus 3

Trichomonas vaginalis virus 3 strain TVV3-OC3, complete genome.

ACC No: HQ607519

Dated: 2011-05-08 | Length: 4846 | CRC: -822401152

                
ID   HQ607519; SV 1; linear; genomic RNA; STD; VRL; 4846 BP.
XX
AC   HQ607519;
XX
DT   08-MAY-2011 (Rel. 108, Created)
DT   08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE   Trichomonas vaginalis virus 3 strain TVV3-OC3, complete genome.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 3
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4846
RX   DOI; 10.1128/JVI.00220-11.
RX   PUBMED; 21345965.
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA   Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA   Singh B.N., Fichorova R.N., Nibert M.L.;
RT   "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by
RT   Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)";
RL   J. Virol. 85(9):4258-4270(2011).
XX
RN   [2]
RP   1-4846
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA   Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT   ;
RL   Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases.
RL   Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL   Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4846
FT                   /organism="Trichomonas vaginalis virus 3"
FT                   /host="Trichomonas vaginalis"
FT                   /strain="TVV3-OC3"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="Nov-2009"
FT                   /db_xref="taxon:170965"
FT   gene            364..4694
FT                   /gene="pol"
FT   CDS             join(364..2449,2449..4694)
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /protein_id="AED99802.1"
FT                   /translation="MSAPEPLNTEVRSPNGVSEATETQNLAVTQSSVSNEKTIDTQSDL
FT                   QTLKKQLQPVTRSTDFETLYNYFYALNVSPSTDRIGNAITRNTPVNDTNEVVSFPLTAS
FT                   VSHTFSNTPVPAHIQPLQISIADDCVNYELDESGTLCPALDSSVHVQRATSLASALKVK
FT                   LTGEVMHSASVRPIQTPQLIAYLYGVLLAVQDRLNIHRNQHTNLWRSLCAAGRAAQAKP
FT                   FFDEIPNNKFRTGALLAPPLPDAGFGPFPAEGLNQNSKLDFKSKGYIFYKQRTYNPDDM
FT                   NRAFWFIWAIYNRMPEDFQNSYPLNITFCTSELPVQSPMPAADGISAEQCDRALVLLDK
FT                   VILEFFNNDRKLAYYYVFKGCQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA
FT                   EQYANMISCATHPGIIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV
FT                   ADENYWGIAPGSDAEPVGMYYMDIIQRKAEHDLFTETFMDIYGSTASIICANIETSLFT
FT                   SGTNVINKRMQNDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLGGSFEV
FT                   DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGIRPVQQKRKKMSYFRTLDGTFHEVT
FT                   IRSDTHDLQVWNDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLISGFAIPGRAY
FT                   RGPSLAGSNRGRPSLSDIPATGSLSNLIDLSKASRLPYRKLKEGLRASDYTVARELASA
FT                   FRNSRLTRQMDHVTDIAYLNFLRWVLLPYNGQTLRPHPSKWSPTPYPEHVNLKFLTKEI
FT                   ELELFPLKKAPQADLKVNCYARNILASTELTDDLLKQCLPVGLNNDSVCGIVIVLELLL
FT                   IAGVPSKLLPIIGQAIANKDPFIKELSDFNKMIGATTSRIANILTECNTLIGRGVKSSD
FT                   PSADLYHRVAPEGNRHEAKISRHILIEAIDKIYKNEMTSMPPPGDFMLHLITSPLWCKA
FT                   GSHHHPHFAKYDSRLEFVMDVPADKIAAEPPSAYITQAEKLEHGKTRYIYNCDTVSYLF
FT                   FDYILHYVECVWSNESVLLNPAAMSVERFSVLDYPEYCMIDYTDFNSQHSLESQKLVFE
FT                   CLRPYLPREMHSVLDWCIASMDHMEINGQHWLSTLPSGHRATTFINSVLNKAYLIPYIG
FT                   DTVSFHCGDDVLLCGEYDYQTLIDTLPYELNKSKQSFGPNAEFLRLHRRGGDVIGYPSR
FT                   AVSSLVSGNWLSKTSWEWQPSLISVTNQCNVIISRSQLNIRFIPAMQQELRNRYADKMS
FT                   EPFDVSSNYYVMPGCPCYSDAATTIVPNVPQLEHSDVPFSQAQKLFDTMRDYCPEFTTV
FT                   NDVIDKVKARRSSSAVSNIMYNVCSPVAPQVCVVVNPNNYQFLLRKRYYPREHIAPSGF
FT                   DESSDSKLVFTTYDLAPSIAMKSCAVLTPAKIICGHGLRSG"
FT   gene            364..2490
FT                   /gene="cap"
FT   CDS             364..2490
FT                   /codon_start=1
FT                   /gene="cap"
FT                   /product="capsid protein"
FT                   /protein_id="AED99801.1"
FT                   /translation="MSAPEPLNTEVRSPNGVSEATETQNLAVTQSSVSNEKTIDTQSDL
FT                   QTLKKQLQPVTRSTDFETLYNYFYALNVSPSTDRIGNAITRNTPVNDTNEVVSFPLTAS
FT                   VSHTFSNTPVPAHIQPLQISIADDCVNYELDESGTLCPALDSSVHVQRATSLASALKVK
FT                   LTGEVMHSASVRPIQTPQLIAYLYGVLLAVQDRLNIHRNQHTNLWRSLCAAGRAAQAKP
FT                   FFDEIPNNKFRTGALLAPPLPDAGFGPFPAEGLNQNSKLDFKSKGYIFYKQRTYNPDDM
FT                   NRAFWFIWAIYNRMPEDFQNSYPLNITFCTSELPVQSPMPAADGISAEQCDRALVLLDK
FT                   VILEFFNNDRKLAYYYVFKGCQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA
FT                   EQYANMISCATHPGIIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV
FT                   ADENYWGIAPGSDAEPVGMYYMDIIQRKAEHDLFTETFMDIYGSTASIICANIETSLFT
FT                   SGTNVINKRMQNDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLGGSFEV
FT                   DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGIRPVQQKRKKMSYFRTLDGTFHEVT
FT                   IRSDTHDLQVWNDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLISGFAIPGRAY
FT                   RGPRLPVATEAAQV"
XX
SQ   Sequence 4846 BP; 1274 A; 1334 C; 1008 G; 1230 T; 0 other;

hq607519 Length: 4846  08-MAY-2011  Type: N  Check: 4612  ..

       1  gcttaaaagg tctagtccac tttttaagcc ggctatactt cgccgtagac
      51  acttgggcaa aattaatcaa caccctcctg caatcaccgg ggtgttgcga
     101  gccataagag actggttcta aaggactgat ataccgccgt gcgggtaggt
     151  ggtcgatagc ccgtttgaag gagtagtaat acttccgatt ctggtgtagc
     201  atcgactggg gccccctagc gtgagctcag cacgttgaga aaacgaaaaa
     251  ctgcatgtgc acagccttcg cagtagcgtg agctcgaggc accctaaaaa
     301  gtgccttttc cttgtgacaa cctacgtgtt atcacaaggt tctagagtat
     351  tacgtgcaac ggtatgtcag ctcccgagcc cttaaatact gaagtacgtt
     401  cacctaatgg tgttagtgaa gccactgaaa ctcaaaactt ggctgtcact
     451  caaagcagtg tgtcaaacga aaaaacaatc gacacacaaa gtgatctgca
     501  aacactcaaa aaacagttac aaccggtcac cagatccaca gattttgaaa
     551  ctctttataa ttatttttat gctttaaatg tctctccgtc aacagatcgt
     601  attggtaatg ctatcacacg caatactcca gttaatgata caaacgaagt
     651  ggttagtttt ccgcttactg cttctgtctc gcacacattc tccaatacac
     701  cagtacctgc ccacattcag cctctccaaa tttccattgc cgacgattgc
     751  gttaactatg aactggatga gagcggaaca ttatgccctg ccctagatag
     801  ctctgtccac gttcaaaggg ccacttctct cgctagcgct ctcaaggtca
     851  aattaacagg cgaggtcatg cactcagctt cagtcagacc aatccaaact
     901  ccacagttaa ttgcttatct atatggtgtt cttctcgccg tccaagatcg
     951  tcttaatatt catcgtaatc aacacacaaa cttatggcgt agcttatgtg
    1001  ccgcaggtcg tgcagctcaa gcgaaaccat tcttcgatga aatcccaaat
    1051  aacaagttca gaaccggcgc gctcttggca ccccctctcc cagacgccgg
    1101  ctttggtccc ttcccagctg agggcctcaa ccagaattcc aagctcgatt
    1151  tcaaatcgaa gggatacatt ttctacaagc agcgcactta caatcccgat
    1201  gatatgaatc gcgctttttg gttcatttgg gcgatctaca atcgtatgcc
    1251  cgaagacttc cagaattcat atcctctgaa cattactttc tgcacttccg
    1301  aattaccagt ccaaagcccg atgccagcgg ctgacggaat ttccgcagag
    1351  cagtgtgata gggcactcgt tcttctcgac aaggtcattc tcgaattctt
    1401  caacaacgat cgcaagcttg cttattacta cgtattcaag ggatgccagt
    1451  tcgtcatgcg tccttgttcc tgttatcaag aaggtggctt aatccgcaag
    1501  gcctcacgca acgttgctct tcgcgccttt actggcattt actacctcgc
    1551  cggcttcgct gagcaatacg ctaacatgat ttcatgtgcc acacacccag
    1601  gaattatcgg tgctctcttc caatacgtcg acactatggt cttacaggct
    1651  gttttctctc tctctggccc taagctggtc cgatttgccg ccccacccga
    1701  atatcaaggt cgtcacgctt gtccgttctc cttcgtagca gacgaaaatt
    1751  attggggtat cgctccaggt tcagacgctg agcctgttgg catgtactat
    1801  atggatatca tccaacgcaa agccgaacac gacttattca ccgaaacatt
    1851  catggatatc tacggttcaa cagcttccat catttgcgca aacatcgaaa
    1901  caagtttgtt cacttctggc acaaacgtca tcaacaaacg catgcagaat
    1951  gatttcgcac gcgacactcc aaagcctgga actcttcgcc accaacatgc
    2001  catcatcaac cgcttccacg aacccgaata tgcttaccgt ctcggtatcc
    2051  tcgctgatgg catcattccg ctcggcggct ccttcgaagt cgacatcctc
    2101  aaagaagctg agcgcctcat cacaggtgaa gacatccgca acctcccagg
    2151  actacgttgc ctgtgctctc gcggtctcga cgcgattctc ggcatacgcc
    2201  cagtccaaca aaaacgcaag aagatgagtt acttccgcac tctcgatggc
    2251  acattccacg aagtaacgat caggtcagat actcacgatt tacaggtctg
    2301  gaatgaccac ggctaccttg cccgcccata cgcatgtcac atcgtcgact
    2351  cagacggcat cgagttctac gacaaatcca acggtctcta caagggacgc
    2401  gtcaatgtcc tcatctctgg attcgccatc ccaggtcgcg cataccgggg
    2451  ccctcgcttg ccggtagcaa cagaggccgc ccaagtctga gcgacattcc
    2501  ggcgacagga agtctgtcca acctcatcga cctttcgaaa gcaagtcggc
    2551  taccataccg taagcttaaa gaaggcttga gagcgtcaga ctacaccgtc
    2601  gcccgcgagt tagctagcgc ttttcgcaat tctcgcctaa ctcgccaaat
    2651  ggatcatgtt acagatatag cttaccttaa tttcctcaga tgggtgttgc
    2701  taccttacaa cggtcaaaca ctacgaccac acccctccaa gtggagtcca
    2751  acaccctacc ccgaacacgt caacctaaag ttcctaacca aggaaatcga
    2801  gctcgaactt ttcccactga agaaggcccc acaagccgat cttaaagtga
    2851  attgttacgc gcgaaatatc cttgcttcga cagagctaac tgacgatctc
    2901  ctcaaacagt gtctgccagt cggactcaac aacgattcag tttgcggaat
    2951  tgttatcgtc ttagagctgc ttttgattgc aggtgttcca agtaagctgc
    3001  taccaattat tggccaagcc atcgcgaaca aagatccatt catcaaagaa
    3051  ttgtccgatt tcaataagat gataggagcg accacttcac gcattgctaa
    3101  cattctcaca gagtgcaaca cattaattgg tcgcggtgtt aagtcatctg
    3151  acccaagtgc tgatttgtat caccgggtag cgcctgaggg caataggcac
    3201  gaagcgaaga tttcccgaca catcctcatc gaagccatcg acaaaattta
    3251  caaaaacgaa atgacaagca tgcctccacc gggcgacttc atgctccact
    3301  taataacaag tcctctatgg tgtaaggctg gctctcatca ccatcctcac
    3351  ttcgccaaat acgattcccg cttggaattc gtcatggatg ttccagcaga
    3401  caaaatcgct gctgaaccac cctctgcata cattactcaa gcggagaaat
    3451  tggaacacgg taagactagg tacatctata actgcgatac agtatcatac
    3501  ctattcttcg attacatctt acattacgtc gagtgtgtgt ggtcaaatga
    3551  gtcagtctta ctcaacccag ctgctatgag tgtcgagcgt tttagtgtct
    3601  tggactaccc ggagtactgc atgatcgatt atacagattt caactctcaa
    3651  cacagtctag aatcccagaa gctagtcttc gagtgtttga gaccgtactt
    3701  gccacgcgaa atgcattcag tcttggattg gtgtatcgcg agcatggacc
    3751  atatggaaat taacggccaa cattggttaa gcacgttgcc gtcaggacac
    3801  agagcgacaa ctttcataaa ctcggttctg aacaaagcct acttgatccc
    3851  ctacataggc gacaccgttt ctttccattg tggcgacgac gtgttattat
    3901  gtggcgagta cgactatcaa acactcatcg ataccctgcc ctatgagcta
    3951  aacaagagca aacagagctt tggacctaat gccgagttct tgcgcttgca
    4001  taggcgtggt ggtgacgtca taggataccc atccagagct gtgtcgagtt
    4051  tggtatctgg caattggtta agcaaaacat cttgggagtg gcaaccaagt
    4101  ctcatttcgg tcacaaatca atgcaacgta atcatctcgc gttcacaatt
    4151  gaatatcagg tttattcccg ctatgcaaca ggaactgcgc aaccgctacg
    4201  cggacaagat gagtgaacct ttcgatgtca gctcgaatta ctacgtcatg
    4251  ccaggttgtc cgtgctatag tgacgccgcg actacgatcg taccgaacgt
    4301  tccccaactg gaacattcgg atgtaccgtt ctcgcaagca caaaaacttt
    4351  ttgatactat gcgcgactac tgtcctgagt tcactaccgt caacgacgtc
    4401  atcgacaagg ttaaagcccg tcgttcctcg agtgctgtca gcaatatcat
    4451  gtacaatgta tgctcacctg tcgcacctca agtttgcgta gtcgtaaatc
    4501  ccaacaacta ccagttcctt ttgcgcaagc ggtactaccc acgcgaacac
    4551  attgccccat ccggctttga tgaatctagc gactccaagc tcgtttttac
    4601  tacttacgat ctcgctcctt caatcgctat gaaatcgtgc gctgttttga
    4651  ccccggcaaa gataatatgt ggtcacgggc tacgcagtgg ttgaataatc
    4701  tgcctgtacc aggctatgat tggtaccgat tcagccacga acggcctctt
    4751  gtcttcggac cctccgccta taggttaata ggagtacagt gttactgttg
    4801  tgtgtatcgc tctaggcaca cgaacgtact accccacgtt tagttc