Sequence of DPV Trichomonas vaginalis virus 2

Trichomonas vaginalis virus 2 strain TVV2-UR1, complete genome.

ACC No: HQ607514

Dated: 2011-05-08 | Length: 4674 | CRC: 760150049

                
ID   HQ607514; SV 1; linear; genomic RNA; STD; VRL; 4674 BP.
XX
AC   HQ607514;
XX
DT   08-MAY-2011 (Rel. 108, Created)
DT   08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE   Trichomonas vaginalis virus 2 strain TVV2-UR1, complete genome.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 2
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4674
RX   DOI; 10.1128/JVI.00220-11.
RX   PUBMED; 21345965.
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA   Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA   Singh B.N., Fichorova R.N., Nibert M.L.;
RT   "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by
RT   Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)";
RL   J. Virol. 85(9):4258-4270(2011).
XX
RN   [2]
RP   1-4674
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA   Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT   ;
RL   Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases.
RL   Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL   Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .4674
FT                   /organism="Trichomonas vaginalis virus 2"
FT                   /host="Trichomonas vaginalis"
FT                   /strain="TVV2-UR1"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="Jun-1999"
FT                   /db_xref="taxon:674954"
FT   gene            297. .4606
FT                   /gene="pol"
FT   CDS             join(297. .2379,2379. .4606)
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /protein_id="AED99806.1"
FT                   /translation="MASTLISSDNSATLGKVNEVINNTDTSPPDTSPGDHSNPRLTKIL
FT                   DEMSKKPCVNINEIRKMIRNFQPQIIQPRNGNRPANQPRTVDSFEWVVRIQSTVNTQLL
FT                   GATNTIPEQTLNLDISFTDDSTTITPASIPGSISMLDNSRHIPAIQSMIQNFKARYLGN
FT                   LQDTAQLNSPQYPQLLAYLFGQLIAIKDRLDLFRPSNPLSFADALFGFTLAQHAHPRYD
FT                   DHRHAKACTGPIVIPAATNADCGPCGFVQINANQGLTLPLGACLFVNPETVNDQSFQDF
FT                   LWLIFATHHRMPNQMQNDWPFALNIVSTCAAPGRQAPQAGQFTEARVKLALDTGHRILL
FT                   SMFNDDEEALRYYQRKGIETMFRPCCFYTEGGLLRKATRYVSMVPLSGLYYYNGASSYV
FT                   VSPIHTDAHPGITAAVESFVDIMVLQAVFSFSGPKVVAAKVDANQIDASSVFGPAVAEG
FT                   DGFVYDPRRPAPPLSAFYTEFIHRPAEQRIFQMAMSQIYGSHAPLIIANVINSIHNCKT
FT                   KIVNSKLRTAFVRRPPGAPPLKADTAIINRFHDPELAYALGILADGIAPLDGTHEYNIL
FT                   DELDYLFNGGDIRNCFGLNALNTRGLGQIVHVRPKREPGKRPRRGYYTTLDGQVHSVTQ
FT                   DAPLDEIYHWRDHGNLTRPYSCHILDSEGLQFADVSNGRTRGKILVVVNTPLKTSAAYQ
FT                   GPQLRAKAGQRYVERINQCGGDVIYPRLSTARACSSSATDSTTSTLATQLCYLYKSSDL
FT                   HRQLDITIPQSYLTFIEWLLRMNPDNEKNSIRHFPNHDDHEVITCSLRNLTKEQEIELF
FT                   PIKDIIQANRRVNAYARNLLDASPLPDFALQQMLLPKTANDVVCGILLLGEVLWMLRCP
FT                   ISIIVGISRAICRNDSFLKDLSDFNKMLGLTKIPIANCLTELNSLQGRGVTSSDAKRDL
FT                   THRIDDVNPHEAKISRENLKQAINNIYREEISKKEVPDTFKQHVFSSPLWVKKGAHHHP
FT                   HFESYDNRLEFVENVNLDRVLQSHPAVYITQASKLEHGKTRFIYNCDTISYIYFDYILN
FT                   YIEGVWSNKHVLLNPDYMNPIIFSSLDYSEYCMLDFTDFNSQHSIENMKLVFSCLMPFL
FT                   PYSMHSVLQWCVTSFDNMYINNIHWKSTLPSGHRATTFINSVLNRAYLLPFLQVSNAFH
FT                   TGDDVLLCGKADYATLINTAPYELNKTKQSFGPSAEFLRLHKHNDQVSGYPARAISSLV
FT                   SGNWLSYDNPLWQPSLLSIMQQLYTISARSGLLPFVPVTMKAEVRRRYDLPTRLTNGLF
FT                   SGEIVPSGCPCYKSNAALLSAVIPDTVLKAKPKHYDLRTLDILKHTSPWINSESKYFDL
FT                   LDRRHMESSKKNVLYSIQYLPSKMLPIIDVDPTEAFPPQKRYHPRSHIAHPLPREAHLK
FT                   ELRFATCRVGPATAIRLGSLWPANRINLIRPVYV"
FT   gene            297. .2426
FT                   /gene="cap"
FT   CDS             297. .2426
FT                   /codon_start=1
FT                   /gene="cap"
FT                   /product="capsid protein"
FT                   /protein_id="AED99805.1"
FT                   /translation="MASTLISSDNSATLGKVNEVINNTDTSPPDTSPGDHSNPRLTKIL
FT                   DEMSKKPCVNINEIRKMIRNFQPQIIQPRNGNRPANQPRTVDSFEWVVRIQSTVNTQLL
FT                   GATNTIPEQTLNLDISFTDDSTTITPASIPGSISMLDNSRHIPAIQSMIQNFKARYLGN
FT                   LQDTAQLNSPQYPQLLAYLFGQLIAIKDRLDLFRPSNPLSFADALFGFTLAQHAHPRYD
FT                   DHRHAKACTGPIVIPAATNADCGPCGFVQINANQGLTLPLGACLFVNPETVNDQSFQDF
FT                   LWLIFATHHRMPNQMQNDWPFALNIVSTCAAPGRQAPQAGQFTEARVKLALDTGHRILL
FT                   SMFNDDEEALRYYQRKGIETMFRPCCFYTEGGLLRKATRYVSMVPLSGLYYYNGASSYV
FT                   VSPIHTDAHPGITAAVESFVDIMVLQAVFSFSGPKVVAAKVDANQIDASSVFGPAVAEG
FT                   DGFVYDPRRPAPPLSAFYTEFIHRPAEQRIFQMAMSQIYGSHAPLIIANVINSIHNCKT
FT                   KIVNSKLRTAFVRRPPGAPPLKADTAIINRFHDPELAYALGILADGIAPLDGTHEYNIL
FT                   DELDYLFNGGDIRNCFGLNALNTRGLGQIVHVRPKREPGKRPRRGYYTTLDGQVHSVTQ
FT                   DAPLDEIYHWRDHGNLTRPYSCHILDSEGLQFADVSNGRTRGKILVVVNTPLKTSAAYQ
FT                   GPSFAPKPGSAMWNE"
XX
SQ   Sequence 4674 BP; 1358 A; 1244 C; 861 G; 1211 T; 0 other;

hq607514 Length: 4674  08-MAY-2011  Type: N  Check: 1199  ..

       1  gctttgaagg agtgacgacc ttcagagtcc aggcttaatt agcttggtca
      51  gatactcctg gggtccatca ggaaacggat cgctaacgcg aactgggtat
     101  tagagtcaaa agccttgtgc gccatggata cttggtacac ttcgcgggtg
     151  taggtgaccc gcaaggtgga cagttttgtg tggattaaaa actgtactcc
     201  taactgctag ccttatgcgg ttgctgtgta ttgagagggc ttatatgtag
     251  ctttcatgct tcacaacagc atgattgtcc gcatctaatt acgataatgg
     301  cttcgacgct aatatcgtct gataattctg ccacgttagg caaagttaat
     351  gaagtaataa ataacacaga tacttcacca cccgacactt cacctggtga
     401  tcattcaaat ccacgattaa caaagatact agatgaaatg tccaaaaaac
     451  catgtgtaaa tattaatgaa ataagaaaaa tgattagaaa tttccagcct
     501  caaattattc aacctcgtaa cggcaaccgc ccagccaatc aaccacgtac
     551  agtagactct ttcgaatggg ttgttcgtat ccaaagtaca gttaatactc
     601  aattacttgg cgcaacaaac acgattcctg aacaaaccct taatcttgat
     651  atctcattta cagatgattc tactacaatt actccagcat ccattccagg
     701  ctccatttcc atgctcgaca attcacgtca tatccctgcg attcagagca
     751  tgatccagaa tttcaaagcc cgttatttag gcaacctcca ggatacagcc
     801  caactcaatt ctccgcagta tcctcagctt ctcgcctatt tattcggaca
     851  attgatcgca atcaaagatc gcctcgatct cttccgcccg tcaaatccac
     901  tctcatttgc tgatgctcta tttggcttta cattagctca acacgctcac
     951  cctcgctacg atgaccacag acacgctaaa gcttgcacag gacctattgt
    1001  cataccagct gccaccaatg ccgattgtgg tccttgcggt ttcgtccaga
    1051  tcaatgctaa ccaaggtctc accttacctc tcggcgcttg ccttttcgtc
    1101  aatcctgaaa cagttaacga ccagtccttc caagatttcc tttggcttat
    1151  tttcgcgaca caccaccgca tgccaaacca aatgcaaaac gattggccat
    1201  tcgctctcaa tatcgtttca acatgtgctg ctccaggtcg tcaggctcct
    1251  caagctggtc agttcactga ggccagagtc aagcttgccc tcgatacagg
    1301  tcatcgcatc ctactctcaa tgttcaacga tgatgaagaa gcactccgct
    1351  attatcaacg caaaggaatc gaaacaatgt tcagaccatg ttgcttctac
    1401  actgaaggcg gtttactcag aaaggccaca agatacgttt caatggttcc
    1451  actcagcggc ttatactact acaatggcgc ctcttcatac gtcgtctctc
    1501  caatccacac tgatgcacac ccaggaatca ctgcagcagt cgaatctttc
    1551  gtcgatatca tggtcctaca ggcggtcttc tctttctcag gccctaaagt
    1601  tgtcgccgct aaagttgacg caaatcaaat cgacgcctcc tcagtcttcg
    1651  gccctgctgt tgcagaggga gatggttttg tctacgatcc ccgccgtcca
    1701  gctcctccgc tttctgcgtt ctatactgaa ttcattcaca gaccagccga
    1751  acaacgcata ttccagatgg cgatgagcca gatttacgga tcccacgctc
    1801  ccctcatcat cgccaacgtc atcaattcca tccacaattg caagacaaaa
    1851  atcgttaaca gcaaattacg caccgccttt gttcgtcgcc cacccggagc
    1901  tcctcctctc aaagcagaca cagcaatcat caaccgcttc catgatccag
    1951  aactcgcata tgctctcggc attctcgcag atggcattgc ccctctcgat
    2001  ggaacacacg aatacaatat cctcgacgaa ctcgattact tgttcaacgg
    2051  tggcgacatt cgcaattgtt tcggcctcaa cgcactcaac actcgcggat
    2101  taggccagat cgtccacgtt cgtccaaaac gcgagccggg aaagagacct
    2151  cgccgcggtt actacaccac tctcgacgga caggtccact ccgtcacaca
    2201  agatgctcca ctcgatgaga tttaccactg gcgtgaccat ggaaatctca
    2251  cacgtccata ttcgtgccac atcctcgaca gcgaaggatt acaattcgct
    2301  gacgtctcca acggacggac acgaggtaaa atcctcgttg tcgtcaacac
    2351  accgctcaag acaagcgctg cctatcaggg ccccagcttc gcgccaaagc
    2401  cgggcagcgc tatgtggaac gaataaacca atgcgggggc gacgtcatct
    2451  atccgcgctt gagcactgct cgcgcctgca gctcgtctgc cactgatagt
    2501  acgacgtcga ccctcgcaac ccaactgtgc tacctataca agtcttctga
    2551  cttacaccga cagttggaca tcacgatacc acaatcctac ttaacattta
    2601  ttgaatggct actcagaatg aatcccgaca atgagaaaaa ttcaattcgt
    2651  catttcccaa atcacgatga ccacgaagtt attacatgca gtttacgaaa
    2701  tctcacaaaa gaacaagaaa ttgaactatt tccaataaag gacataatcc
    2751  aagctaatcg tcgtgtaaat gcttacgcgc gaaatcttct tgatgcctca
    2801  ccactccctg attttgctct acaacagatg cttttaccaa aaacagctaa
    2851  tgatgtagtt tgcggaatcc tgttacttgg tgaagtatta tggatgcttc
    2901  gctgcccaat ttctattatt gtaggcattt ctcgtgcaat atgtcgcaac
    2951  gacagtttct taaaagattt atccgatttt aacaagatgt taggcttaac
    3001  taagatccca atagctaact gcttgactga attgaacagt ttacagggtc
    3051  gaggagtcac ttcaagcgat gccaaaagag atttgaccca ccgaatagac
    3101  gatgtcaatc ctcatgaggc gaaaataagt agagaaaacc tcaaacaagc
    3151  aatcaataac atatacagag aagaaatcag caaaaaagaa gttcccgata
    3201  cattcaagca acacgtattc tcatctccat tatgggttaa gaaaggcgca
    3251  caccaccatc ctcacttcga atcttacgac aatagattgg agtttgtaga
    3301  aaatgtcaac ttagacaggg ttttacaatc acaccctgct gtttacatta
    3351  cacaggcttc aaagctcgaa cacggtaaga cacgattcat ctacaattgt
    3401  gatacaatta gttatatata tttcgactat atcttgaact acatcgaggg
    3451  tgtctggtcc aataaacatg tgcttctaaa ccccgattac atgaacccga
    3501  tcatcttcag cagtctcgat tatagtgagt actgcatgtt agatttcact
    3551  gatttcaatt ctcagcattc catcgaaaac atgaagctag tcttttcctg
    3601  tcttatgcct ttcttgcctt actcaatgca ttccgtgtta cagtggtgcg
    3651  ttacatcatt tgacaatatg tacattaaca atatccactg gaaatcaacg
    3701  ctaccatcag gccacagagc aacaacattc atcaactcag tactcaatag
    3751  agcgtatcta ctaccattcc tacaagtatc aaacgcattt cataccggtg
    3801  atgatgtact gctatgcggg aaggcagact atgccacttt aattaataca
    3851  gcaccctatg aacttaataa gactaagcaa tcattcggac cctcagccga
    3901  atttttacgc cttcacaaac acaacgacca ggtctcaggc tatcctgcgc
    3951  gtgctattag cagtctcgta agtggaaact ggttgtcata cgacaatcca
    4001  ttatggcaac cttcactact atcaatcatg caacaattgt acaccatatc
    4051  agcgagatca ggtctattac cgtttgttcc cgttacaatg aaggcagaag
    4101  ttcggcggcg atacgaccta cctacaagac tcacaaatgg attgttctct
    4151  ggtgaaattg tcccaagcgg ttgcccatgt tataagtcaa atgctgcctt
    4201  gctaagcgcg gttatcccag atacagtgct caaggcaaaa ccaaagcatt
    4251  atgacttacg tactttggac atcctaaaac acacttcacc ttggatcaat
    4301  tctgaatcca aatattttga tctcctagat cgccgacata tggaatcgag
    4351  taagaaaaac gttttataca gtatccaata tttaccatcc aaaatgttac
    4401  caattattga tgtcgacccc actgaggctt ttcctccaca aaaacggtat
    4451  catccacgtt cccacatcgc acacccactc ccacgtgaag cacatcttaa
    4501  ggaattaaga tttgcaacgt gcagagtggg cccggctaca gcgataagat
    4551  taggatcgct ttggcctgcg aatagaatca acctaatcag gccagtgtac
    4601  gtataagtac aactgccaaa aaaccttaca aaactactcg gctagagtag
    4651  gagagtaata tcaactctta tgtc