Sequence of DPV Trichomonas vaginalis virus 2

Trichomonas vaginalis virus 2 strain TVV2-OC3, complete genome.

ACC No: HQ607518

Dated: 2011-05-08 | Length: 4674 | CRC: -311108341

                
ID   HQ607518; SV 1; linear; genomic RNA; STD; VRL; 4674 BP.
XX
AC   HQ607518;
XX
DT   08-MAY-2011 (Rel. 108, Created)
DT   08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE   Trichomonas vaginalis virus 2 strain TVV2-OC3, complete genome.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 2
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4674
RX   DOI; 10.1128/JVI.00220-11.
RX   PUBMED; 21345965.
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA   Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA   Singh B.N., Fichorova R.N., Nibert M.L.;
RT   "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by
RT   Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)";
RL   J. Virol. 85(9):4258-4270(2011).
XX
RN   [2]
RP   1-4674
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA   Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT   ;
RL   Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases.
RL   Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL   Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .4674
FT                   /organism="Trichomonas vaginalis virus 2"
FT                   /host="Trichomonas vaginalis"
FT                   /strain="TVV2-OC3"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="Nov-2009"
FT                   /db_xref="taxon:674954"
FT   gene            298. .4607
FT                   /gene="pol"
FT   CDS             join(298. .2380,2380. .4607)
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /protein_id="AED99808.1"
FT                   /translation="MASTLISSDNSATSGTVGEVINNTDTSPPDTPPSDHSNPRLTKIL
FT                   DEMSKKPCVNINEIRKVIRNFQPQIIQPRNGNRPGAQPRTVNSFEWVVRIQSTVNTQLL
FT                   GATNTIPEQTLNLDISFTDDSTTITPASIPGSISMLDNSRHIPAIQSMIQNFKARYLGN
FT                   LQDTAQLNSPQYPQLLAYLFGQLIAIKDRLDLFRPSNPLSFADALFGFTLAQNARPRYD
FT                   DHIHAKACTGPIVIPAATNADCGPCGFVQINANQGLTLPLGACLFVNPDTVNDQSFQDF
FT                   LWLIFATHHRMPNQMQNNWPFALNIVSTCAAPGRQVPQAGQLTDARFHAALDTGHRILL
FT                   SMFNDDEETLRYYQRKGIETMFRPCCFYTEGGLLRKATRYVSMVPLSGLYYYNGASSYI
FT                   VTPIHTDAHPGITAAIESFVDIMVLQAVFSFTGPKVVAARADANQVDASSVFGPAVAEG
FT                   DGFVYDPRRPAPPLSAFYSEFIHRPAEQRIFQMAMSQIYGSHAPLIIANVINSIHNCKT
FT                   KIVNNKLRATFVRRPPGAPHLKADTAIINRFHDPELAYALGILADGIAPLDGTHEYNIL
FT                   DELDYLFNGGDIRNCFGLNALNTRGLGQIVHVRPKRDPGKKPRRGFYTTLDGQVHPITH
FT                   DAPLDEIYQWRDHGNLTRPYSCHILDSEGLEFADVSNGRSRGKLLVVVTTPLKTSAAYQ
FT                   GPQLRAKAGQRYVERINQCGGNVIYPRLSTARACSSSAIDSKTLTLANQLCYLYKSSDL
FT                   HQQLDFTIPQSYLTFLEWLLRMNPDNQKSSIRHFPNYDNQEVITCNLRNLTKEQEIELF
FT                   PIKDIIQANRRVNAYARNLLDASPLPDFALQQMLLPKTANDVVCGILLLGEVLWLLRCP
FT                   ISIIVGISRAICRNDSFIKDLSDFNKMLGLTKIPIANCLTELNSLQGRGVTSSDAKRDL
FT                   THRIDDVNPHEAKISREILKEAINHIYKEEINKTEVPDTFKQHVFSSPLWVKKGAHHHP
FT                   HFKSYDNRLEFVENVDLDRVLQSHPAVYITQASKLEHGKTRYIYNCDTVSYIYFDYILN
FT                   YVESVWSNKHVLLNPDYMNPVIFSSLNYDEYCMLDYTDFNSQHSIASMKLVFSCLMPFL
FT                   PYSMHSVLQWCLTSFDNMYINNVHWKSTLPSGHRATTFINSILNRAYLLPFLQVSNAFH
FT                   TGDDVLLCGKADYATLINTVPYELNKTKQSFGSSAEFLRLHKHNNQVSGYPARAISSLV
FT                   SGNWLSYDNPLWQPSLLSIMQQLYTISARSGLLPTLPVTMKLEVRRRYDLPTRLTNGLF
FT                   SGDIVPSGCPCYKSNAALLSAVIPDTVLKAQPKHYDLRTLDILKHTSPWINSESKYLDL
FT                   LDRRHMESNKKNVLYNIQYLPSKMLPMIDVDPSEALPPQKRYHPRSHIAHPLPRDAHLK
FT                   ELRFATCRVGPATAIRLGSLWPANRINLIRPVYV"
FT   CDS             298. .2427
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="capsid protein"
FT                   /protein_id="AED99807.1"
FT                   /translation="MASTLISSDNSATSGTVGEVINNTDTSPPDTPPSDHSNPRLTKIL
FT                   DEMSKKPCVNINEIRKVIRNFQPQIIQPRNGNRPGAQPRTVNSFEWVVRIQSTVNTQLL
FT                   GATNTIPEQTLNLDISFTDDSTTITPASIPGSISMLDNSRHIPAIQSMIQNFKARYLGN
FT                   LQDTAQLNSPQYPQLLAYLFGQLIAIKDRLDLFRPSNPLSFADALFGFTLAQNARPRYD
FT                   DHIHAKACTGPIVIPAATNADCGPCGFVQINANQGLTLPLGACLFVNPDTVNDQSFQDF
FT                   LWLIFATHHRMPNQMQNNWPFALNIVSTCAAPGRQVPQAGQLTDARFHAALDTGHRILL
FT                   SMFNDDEETLRYYQRKGIETMFRPCCFYTEGGLLRKATRYVSMVPLSGLYYYNGASSYI
FT                   VTPIHTDAHPGITAAIESFVDIMVLQAVFSFTGPKVVAARADANQVDASSVFGPAVAEG
FT                   DGFVYDPRRPAPPLSAFYSEFIHRPAEQRIFQMAMSQIYGSHAPLIIANVINSIHNCKT
FT                   KIVNNKLRATFVRRPPGAPHLKADTAIINRFHDPELAYALGILADGIAPLDGTHEYNIL
FT                   DELDYLFNGGDIRNCFGLNALNTRGLGQIVHVRPKRDPGKKPRRGFYTTLDGQVHPITH
FT                   DAPLDEIYQWRDHGNLTRPYSCHILDSEGLEFADVSNGRSRGKLLVVVTTPLKTSAAYQ
FT                   GPSFAPKPGSAMWNE"
XX
SQ   Sequence 4674 BP; 1341 A; 1237 C; 860 G; 1236 T; 0 other;

hq607518 Length: 4674  08-MAY-2011  Type: N  Check: 4126  ..

       1  gctttaaaag gagtgacgac ctttaaagcc caggcctaac cagcctggtc
      51  agaaactcct ggggtccatc aggagacgga tcgctaacgc gaactggata
     101  ttagagtcaa aagccttgtg cgccatggat acttggtaca cttcgcggga
     151  gtaggtgacc cgcaagccgt acggttttta gtggattgaa aattgtactg
     201  ttaactgcta gccttatgcg gttgctgtgt attgagaggg cttatatgta
     251  gcttttatgc ttccttattg tataattgtg tacatctaat tacgataatg
     301  gcttcgacgc taatatcgtc tgataattct gccacgtcag gcactgttgg
     351  tgaagttata aacaacacag atacttcacc tcccgacact ccacctagtg
     401  atcactcaaa tccacgttta acaaagattc tagatgaaat gtccaaaaaa
     451  ccttgtgtaa atattaatga aataagaaaa gttattagaa atttccaacc
     501  tcaaattatt caacctcgta atggtaatcg cccaggcgct caaccacgca
     551  cagtaaattc tttcgaatgg gttgtccgta ttcaaagcac tgtcaatact
     601  caactacttg gtgcaactaa tacgattcct gaacagactc tcaacctcga
     651  tatctcgttt acagatgatt ctactacaat tactccagct tccattccag
     701  gctctatttc gatgctcgac aactcacgcc atatccctgc gatccagagc
     751  atgatccaga atttcaaagc ccgttactta ggtaatctcc aagatacagc
     801  ccaactcaat tctccgcagt atcctcaact tcttgcctat ctattcggac
     851  aattaatcgc catcaaggac cgcctcgatc ttttccgacc atcgaaccca
     901  ctttcatttg ctgatgcttt atttggcttt actttagccc agaacgcacg
     951  ccctcgctac gatgaccaca tacatgctaa ggcatgtaca ggacctatcg
    1001  tcatcccagc agctacaaat gcagattgcg gtccttgcgg cttcgtccaa
    1051  atcaatgcta atcagggcct cactttaccc ctcggcgctt gcctttttgt
    1101  taatccagat acagttaacg atcaatcctt ccaagatttc ctttggctca
    1151  ttttcgcaac acaccatcgc atgccaaacc agatgcaaaa taactggcca
    1201  tttgctctca atatcgtctc gacatgtgcc gctccaggtc gtcaggttcc
    1251  tcaagccggc caactcaccg atgccaggtt tcacgctgcc ctcgatacag
    1301  gtcatcgcat cttactctca atgttcaacg atgatgaaga aacactccgc
    1351  tactatcaac gcaaaggaat agaaacgatg ttcagaccat gttgcttcta
    1401  cactgaaggc ggtttgctca gaaaagctac aagatatgtt tcaatggttc
    1451  cactcagcgg cttatattac tacaatggcg cctcttcata cattgttact
    1501  ccaatccaca ctgatgcaca cccaggaatc acagctgcaa tcgaatcctt
    1551  cgtcgatatc atggtcttac aagcggtctt ttctttcacg ggccctaaag
    1601  ttgtcgctgc tagagccgac gcaaaccaag ttgatgcttc ctcagtcttc
    1651  ggccctgctg tcgctgaagg agatggtttt gtctacgatc ctcgccgtcc
    1701  agcccctccg ctctccgcat tctacagtga attcatccac agaccagccg
    1751  aacaacgcat cttccagatg gcgatgagcc agatttacgg atcccatgcc
    1801  cccctcatca tcgctaacgt catcaattcc atccacaatt gcaagacaaa
    1851  gatcgtcaac aacaaattac gcgccacttt cgttcgtcgt ccacccggtg
    1901  ctccccatct caaggcggac accgcaatca tcaaccgctt ccatgatcca
    1951  gaactcgcct atgccctcgg cattctcgct gatggtatag ctccactcga
    2001  tggaacacac gaatacaaca tcctcgacga actcgattac ttgttcaacg
    2051  gtggtgacat ccgcaattgc ttcggcctca acgcgctcaa cactcgcgga
    2101  ttaggccaaa tcgtccacgt tcgtccgaag cgcgatccag gaaagaaacc
    2151  tcgccgcggt ttctacacca ccctcgatgg acaagtccac cccatcacac
    2201  acgatgctcc actcgatgag atttaccagt ggcgtgacca cggaaatctt
    2251  acacgtccat attcgtgcca catcctcgac agcgaaggat tggaattcgc
    2301  tgacgtctcc aacggacggt cacggggcaa gctcctcgtt gtcgtcacca
    2351  caccgctcaa gacaagcgct gcctaccagg gccccagctt cgcgccaaag
    2401  ccgggcagcg ctatgtggaa cgaataaacc aatgcggggg caacgtcatc
    2451  tatccgcgct tgagcactgc tcgcgcctgc agctcgtctg ccattgatag
    2501  taagacgttg accctcgcaa accaactgtg ctacctatac aagtcttctg
    2551  acttgcacca acagttggac ttcacgatac cacaatccta cttaacattt
    2601  cttgaatggc tactcagaat gaatcccgat aatcaaaaaa gttcaattcg
    2651  ccacttccca aattacgata atcaagaagt tattacatgc aatttacgaa
    2701  acctcacaaa agaacaagaa attgaactat ttccaataaa agacataatc
    2751  caagctaacc gccgcgtaaa tgcctacgcg cgaaatctcc ttgacgcctc
    2801  accacttcct gattttgctc tacagcagat gctcttacca aaaacagcta
    2851  atgatgtagt ttgcggaatc ctgttacttg gtgaagtatt atggttactt
    2901  cgctgtccca tctcgatcat cgtaggtatt tcccgtgcaa tatgtcgcaa
    2951  tgatagtttc ataaaagatt tatcggattt caacaaaatg ttaggcttaa
    3001  ctaagatccc aatagctaac tgtttgactg aattaaacag tctacagggt
    3051  agaggagtca cttcaagcga tgccaaaaga gatttgaccc accgaataga
    3101  cgatgtcaat cctcatgagg cgaaaataag tagagaaatc ctcaaagaag
    3151  caatcaatca tatatacaaa gaagaaatca acaaaacaga agtacccgat
    3201  acattcaaac aacatgtatt ctcatctcca ttatgggtta agaaaggcgc
    3251  acaccatcac cctcacttca aatcttacga taaccgacta gagtttgtcg
    3301  agaatgttga cttagacagg gttttacaat cacaccctgc tgtctacatt
    3351  acgcaagcct caaaacttga acacggtaag acacgataca tttacaattg
    3401  tgatacagtc agttacatat attttgatta tatcttgaac tacgtcgaga
    3451  gtgtttggtc caataaacat gtgcttctaa accctgatta catgaaccca
    3501  gtcattttta gtagtcttaa ttatgatgag tactgtatgt tggattacac
    3551  tgacttcaat tcccaacatt ctattgcaag tatgaagcta gtgttttcct
    3601  gtctcatgcc atttttacct tactcaatgc attcagtatt acaatggtgc
    3651  cttacatcat tcgataacat gtacatcaat aatgtccact ggaaatcgac
    3701  attaccatca ggacacagag caacaacatt tatcaattct atactaaaca
    3751  gggcatactt gttgccgttt ctgcaagttt caaatgcatt ccacacaggt
    3801  gatgacgttc tgctatgcgg aaaggctgac tatgcaacct taattaacac
    3851  ggtaccttat gaactcaata aaactaagca atcgttcggg tcttcagctg
    3901  aatttctacg tcttcataaa cataacaacc aggtctcggg ctacccagct
    3951  cgtgcaatta gtagtctcgt aagcggaaac tggttgtcat acgataatcc
    4001  actatggcaa ccttcactac tgtcaattat gcaacaattg tacaccatat
    4051  cagcgagatc aggcttgtta ccaactcttc ctgtcacgat gaagttagag
    4101  gttcggcgac ggtacgactt acctacacgg ctcacaaatg gattgttctc
    4151  tggtgatatt gtcccaagtg gttgcccatg ttataagtca aatgctgcct
    4201  tactaagcgc ggtcatccct gatacagtac ttaaggcaca accaaagcat
    4251  tacgacttac gtactttgga catcctaaaa catacttcac cttggatcaa
    4301  ttctgaatcc aaatatttgg atctcctgga tcgccgtcac atggaatcga
    4351  ataagaaaaa tgtattatat aatattcagt atttaccttc caagatgtta
    4401  ccaatgattg atgttgaccc atctgaggct cttcctccac aaaaacggta
    4451  tcatccacgt tcccacatcg cacacccact cccacgtgat gctcatctca
    4501  aggaattgag attcgccacg tgtagagtgg gcccggctac agcgataaga
    4551  ttaggatcgc tttggcctgc gaacagaata aacctaatca ggccagtgta
    4601  cgtctaagta caactgccaa aaatcttaca aaactactcg gctagagtag
    4651  gagagtaata tcaactctta tgtc