Sequence of DPV Trichomonas vaginalis virus 1

Trichomonas vaginalis virus 1 strain TVV1-OC5, complete genome.

ACC No: HQ607523

Dated: 2011-05-08 | Length: 4680 | CRC: 1437201674

ID   HQ607523; SV 1; linear; genomic RNA; STD; VRL; 4680 BP.
XX
AC   HQ607523;
XX
DT   08-MAY-2011 (Rel. 108, Created)
DT   08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE   Trichomonas vaginalis virus 1 strain TVV1-OC5, complete genome.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 1
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4680
RX   DOI; 10.1128/JVI.00220-11.
RX   PUBMED; 21345965.
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA   Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA   Singh B.N., Fichorova R.N., Nibert M.L.;
RT   "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by
RT   Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)";
RL   J. Virol. 85(9):4258-4270(2011).
XX
RN   [2]
RP   1-4680
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA   Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT   ;
RL   Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases.
RL   Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL   Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4680
FT                   /organism="Trichomonas vaginalis virus 1"
FT                   /host="Trichomonas vaginalis"
FT                   /strain="TVV1-OC5"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="Jan-2010"
FT                   /db_xref="taxon:674953"
FT   gene            323..4613
FT                   /gene="pol"
FT   CDS             join(323..2349,2351..4613)
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /protein_id="AED99820.1"
FT                   /translation="MEASANGLSHDDNANKSQNVGPSTLPGSDKQGGENHENSFNSFSN
FT                   DFFFNFLRTSTSTHISDSPGVSFVLKDGTPYTSATIQSAVGRLTHNVVASAVQLNITAN
FT                   NTLEVDYGFGQDVSRATGTITIPVFDGEKYKEVARALSLIFSKKGTAIDTTSQTIQDTL
FT                   KNSDLTIATVAAGYYTALAARHELTKAESTAAHRIPFATALSDTFSAAGDALRSSHVIS
FT                   SCLRCPASNNAQRQVTVGTNMWTNVSVENIAVQGLIIPNPNDVSFFIPNKSLPPSWWCA
FT                   IWLLNAFLHSFIAQTRIHIFITPGETYNLAPFTDADIYEAIPILLGMSKTSRPVPESVE
FT                   SMLYAYGAQMVIQPHSLYTEGGIVRRMIFTVPHLPAHGYFIANTEYSRYMNIAVPNDPR
FT                   TAKDYIIGVGTGLLQIILAYQAAFSCAGPIALHWHDNDAISNGMDTVAAAYLEGRYFTI
FT                   PMAVNVATNIAQYTTRVRADPQYKHTLDRILPRIFGPSTDTVFNFIESAITSSWVSINA
FT                   TKRNGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSNIAQEVEYLTNGGDLR
FT                   NCPILRTLKAAEAEETVTFMCTGKIGSIFAVDGTMRTLKRYQTVDLADLGWTSHGKVMK
FT                   PYAFRAPVIQGITVCKTAYTSTAIDIVTTVFGPLRLRVGTLLSKAVRCGPIIPSVRHHF
FT                   NIRHIITVKRNGNEYVFIPGYGWVLQDDYLVNSVKMTGVDQLPPNQLPYGDDLLFIYAE
FT                   ILLYNYISLFPKFRYKNPDLLNQETELQLFPLKTDSAARNKANFYARSLWNEAKTDKTA
FT                   FRPGTYNDTVAGLLMWQQCALMWSLPRSVINRTISGVCDALTERTSLALLKRISDWLKQ
FT                   LGLACSPIHRLFIELPTLLGRGAIPGDNVKDMKHRLKFDPSITVDVPKDELHALIYRLL
FT                   SRNLNITKVDSFEHHLEERLLWSKSGSHYYPDDRINQLLPKQPTRKEFLDVVTVDYIKE
FT                   CKPHVFIRQSRKLEHGKERFIYNCDTISYVYFDFILKLFEAGWQDSEAILSPGDYTSER
FT                   LHTRISNYKYKAMLDYTDFNSQHTIQSMRLIFETMKELLPPETTFALDWCIASFDNMYT
FT                   SDGHKWVSTLPSGHRATTFINTVLNWCYTQMVGLKFNSFMCAGDDVILLSQEPISLAPI
FT                   LTSHFKFNPSKQSTGTRGEFLRKHYTTEGVFAYPTRAIASLVSGNWLSQSLRENTPILV
FT                   PIQNGVDRLRSRAGLLGVPWILGLSELTEREAIPRDVSMALLNSHAAGPGLITRNYSSF
FT                   TVTPKPPKLTSTLEYTATRFGVQDLSKHVPWEQLTLEERNKLGKQIKKMSHRHCSQAKI
FT                   TYTCVHDFYKPSGLPTVLSGASQPSLSMAWWQAMLKEAMQDNFTKKLDAQMFASNACTD
FT                   CVSGDAFLQASAKTAGVLFTSSILSSS"
FT   CDS             323..2359
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="capsid protein"
FT                   /protein_id="AED99819.1"
FT                   /translation="MEASANGLSHDDNANKSQNVGPSTLPGSDKQGGENHENSFNSFSN
FT                   DFFFNFLRTSTSTHISDSPGVSFVLKDGTPYTSATIQSAVGRLTHNVVASAVQLNITAN
FT                   NTLEVDYGFGQDVSRATGTITIPVFDGEKYKEVARALSLIFSKKGTAIDTTSQTIQDTL
FT                   KNSDLTIATVAAGYYTALAARHELTKAESTAAHRIPFATALSDTFSAAGDALRSSHVIS
FT                   SCLRCPASNNAQRQVTVGTNMWTNVSVENIAVQGLIIPNPNDVSFFIPNKSLPPSWWCA
FT                   IWLLNAFLHSFIAQTRIHIFITPGETYNLAPFTDADIYEAIPILLGMSKTSRPVPESVE
FT                   SMLYAYGAQMVIQPHSLYTEGGIVRRMIFTVPHLPAHGYFIANTEYSRYMNIAVPNDPR
FT                   TAKDYIIGVGTGLLQIILAYQAAFSCAGPIALHWHDNDAISNGMDTVAAAYLEGRYFTI
FT                   PMAVNVATNIAQYTTRVRADPQYKHTLDRILPRIFGPSTDTVFNFIESAITSSWVSINA
FT                   TKRNGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSNIAQEVEYLTNGGDLR
FT                   NCPILRTLKAAEAEETVTFMCTGKIGSIFAVDGTMRTLKRYQTVDLADLGWTSHGKVMK
FT                   PYAFRAPVIQGITVCKTAYTSTAIDIVTTVFGPLRLRVGTLFE"
XX
SQ   Sequence 4680 BP; 1377 A; 1198 C; 928 G; 1177 T; 0 other;

hq607523 Length: 4680  08-MAY-2011  Type: N  Check: 6813  ..

       1  gcaaaaagga gggagtagtc cgctcttctc ctttttgcac tcaacatttt
      51  tactccatca tgacgaatcc atgacatgga catgtaacaa gcgttttgtc
     101  ctcgatgatt gccatcctcg tgtgaactcc gggcaccgct tgcactgatg
     151  atacctctta caaagctgga gagacacccg tcttgaagag ccgtaatgta
     201  tcctctgcgc ctgggaccta atggtgattt tgctgtaggt actttttaag
     251  ggaggaatta gggttgaaca tactagttcg ctagtatgcc ttttttctac
     301  tttattaaga aattgaatac ccatggaggc ttctgctaat gggttatcac
     351  atgatgataa tgcgaataaa tcgcaaaatg ttggaccttc tactcttccg
     401  gggtcagata aacaaggagg agaaaaccac gaaaattctt ttaattcttt
     451  ttcaaatgat ttctttttta attttttacg cacatctacg agtactcaca
     501  tttcagacag tccaggagtt tcttttgttt tgaaggatgg aacaccatat
     551  acatccgcta ccatccaatc cgctgtcggt cgtcttacac acaatgtcgt
     601  cgcatcagca gtccaactca atattacagc aaacaatacg ttagaggtgg
     651  actacggttt cggtcaggat gtttcaagag ctacaggaac catcacaatc
     701  ccagtcttcg atggcgaaaa gtacaaagag gtagctcgcg ctttatcatt
     751  aattttcagt aagaaaggta cggcgattga cactacgtct caaactattc
     801  aagacaccct caaaaactcc gatctcacta ttgctaccgt cgctgctgga
     851  tactacacag ccttagctgc tcgccatgaa cttaccaaag cagaaagcac
     901  tgcagctcat cgcattccat tcgctacagc tttatcagat acattctcag
     951  cagccggcga cgcgctgcgt tcaagccacg tcatctcttc ttgcttacgc
    1001  tgccctgcct caaacaacgc acaacgacag gttacagtcg gaaccaacat
    1051  gtggacgaac gtctccgtcg aaaacatcgc agtacaaggc ttgataattc
    1101  caaatccaaa cgacgtatcg ttcttcattc cgaacaaatc tcttccacct
    1151  tcctggtggt gcgcaatctg gcttctcaac gctttcctcc acagcttcat
    1201  cgcacaaacc cgcatccaca tcttcatcac gccaggtgaa acttacaatc
    1251  ttgcgccatt cacagatgcc gatatctacg aggctattcc tatcttactt
    1301  ggaatgtcga aaacatcacg cccagttcca gaaagcgtcg aaagcatgct
    1351  ctacgcatac ggcgcgcaga tggttatcca gccacactcg ctttacacag
    1401  aaggcggtat agtcagaaga atgatcttta ccgtcccaca tcttccagca
    1451  catggctact tcatagcaaa cacagaatac tcgagataca tgaacatcgc
    1501  tgttccaaac gacccacgta cagctaagga ttacataatt ggtgtcggaa
    1551  ctggcctctt acagatcata ctcgcctacc aagctgcttt tagctgcgct
    1601  ggtccaatcg ctctccattg gcacgacaat gacgctatct caaatggcat
    1651  ggatacggtt gcagctgctt acctcgaagg acggtacttc actatcccaa
    1701  tggccgtcaa tgttgccaca aacatcgccc aatacactac aagagtcagg
    1751  gctgacccac agtacaaaca cacactcgat cgaatcttac cacgcatctt
    1801  cggtccatcg actgacacag tcttcaactt catcgaatcc gcaatcacat
    1851  catcttgggt ctcgatcaac gcgacgaagc gtaacggtcg ggccagaaag
    1901  ttcaggaccg ccttcatcaa tcgttttcat gatccagaat tcgcctacat
    1951  gtttggcatc accggcaacg gtatcgagcg gatggaaggc aaggttacat
    2001  ccaacatcgc acaagaagtc gaatacctca ccaacggcgg tgaccttcgc
    2051  aactgcccta tccttcgcac cttaaaagct gcggaggcag aagagaccgt
    2101  cacttttatg tgtacgggaa agatcggttc catcttcgcc gtcgatggta
    2151  caatgcgcac gctcaagcgg taccaaacag tcgacctcgc cgacctcgga
    2201  tggacatcgc atggcaaggt catgaaaccg tacgccttca gggccccagt
    2251  catccaagga atcaccgtct gcaagacagc ttacacatcc acagctatcg
    2301  acatcgttac aacagtcttc ggccccttac gcctccgcgt aggcaccctt
    2351  tttgagtaag gctgtacgtt gtggccctat aataccatcc gtcaggcatc
    2401  actttaacat aagacacatc ataacagtta aacgcaatgg taacgaatac
    2451  gtatttatcc caggttacgg atgggtatta caggatgatt atttggtgaa
    2501  ttccgtcaag atgactggtg tagatcaact acctcccaac cagttaccct
    2551  atggcgatga tcttttattt atatatgcag aaattttact ttataactac
    2601  atatctcttt ttcctaaatt cagatacaaa aatccagact tattaaatca
    2651  agaaacagaa ttacagctct tcccacttaa aaccgactca gctgccagaa
    2701  ataaagccaa cttttatgct agatcactat ggaacgaagc aaaaacagac
    2751  aaaacagctt ttagaccggg aacatacaat gacacagtag ctggtctatt
    2801  gatgtggcaa caatgtgctc tcatgtggtc actgcctcgc tcagttatca
    2851  acagaacaat tagcggcgtt tgtgatgcat taaccgaaag gacttcactc
    2901  gcgctattaa aacgtatctc cgattggttg aaacaactcg ggctggcttg
    2951  ctcaccaatc catcgcttgt tcatagagct cccaacatta ctaggacgcg
    3001  gagcaatccc aggcgataac gtgaaagaca tgaagcacag actcaaattc
    3051  gatccatcta tcacagtaga cgttccaaaa gacgagttac atgccctaat
    3101  ctacagacta ttatcaagaa atctcaacat aactaaagtt gacagcttcg
    3151  aacaccacct agaagagcgt ttgctttggt ccaaatccgg cagtcattat
    3201  tatccggacg acagaatcaa tcagctactt ccaaaacaac ccacaaggaa
    3251  agaattctta gatgttgtaa ccgtagatta catcaaggaa tgcaagcctc
    3301  acgtcttcat aaggcaatca cgtaagctag aacacggtaa ggaacgtttc
    3351  atttataatt gcgatacgat ctcctatgtc tattttgatt ttatcctgaa
    3401  gctcttcgaa gcgggatggc aagatagcga agcaatacta tcgccaggcg
    3451  actatactag tgaacgctta catacaagaa tttcaaacta caaatataag
    3501  gctatgctag attatacgga tttcaattca cagcatacaa tccaaagcat
    3551  gagactgata ttcgaaacga tgaaggagtt actaccaccc gaaactactt
    3601  tcgcactcga ttggtgtatt gcctcattcg ataacatgta cacatccgat
    3651  ggtcataaat gggtctcgac tctcccaagc ggacatcgag ccacaacctt
    3701  catcaacaca gtcctaaatt ggtgctacac acagatggta ggtcttaagt
    3751  ttaacagttt tatgtgcgct ggtgatgatg tcattttatt gtctcaagag
    3801  ccaatatcac tagccccaat tcttacatca cacttcaagt tcaatcccag
    3851  caaacaaagt acaggtacaa gaggtgaatt cttacgcaag cattacacta
    3901  cagaaggcgt gtttgcatac ccaacacgag caattgcaag cttagtaagc
    3951  ggaaattggt taagtcaatc tttaagagag aacaccccaa tcttggtccc
    4001  aatacaaaac ggagtcgaca gacttcgaag cagagcaggt ctactcggag
    4051  tcccatggat tttgggcctc tcggagctca cggagcgaga ggccattcct
    4101  agggatgtca gcatggctct gttgaattca cacgcagcag gacccggttt
    4151  gatcactcgg aactacagtt ctttcacagt taccccgaaa ccacctaagc
    4201  taactagcac actagagtac acagcaaccc gttttggtgt ccaggacctg
    4251  tccaaacacg taccatggga acaacttaca ctggaagaac gtaataagtt
    4301  agggaaacaa attaagaaaa tgagtcacag gcattgtagc caggcaaaga
    4351  taacatacac ttgtgttcac gatttttaca aaccaagtgg cctccccacg
    4401  gtgttatctg gtgccagcca gccatcgttg tcgatggcgt ggtggcaggc
    4451  gatgcttaaa gaagcaatgc aggacaattt tactaagaag ttagatgcac
    4501  aaatgttcgc ttcgaatgca tgtacagact gcgtcagcgg tgatgcattc
    4551  ttgcaagcga gcgccaaaac tgctggtgta ttattcacta gctcgattct
    4601  atcttcttca taacgtacag caaaaaagtc tctatagttg ctcaggacat
    4651  atatgagcca gatggccccg ctataccttc