Sequence of DPV Trichomonas vaginalis virus 4

Trichomonas vaginalis virus 4 strain TVV4-OC5, complete genome.

ACC No: HQ607526

Dated: 2011-05-08 | Length: 4942 | CRC: 687494882

                
ID   HQ607526; SV 1; linear; genomic RNA; STD; VRL; 4942 BP.
XX
AC   HQ607526;
XX
DT   08-MAY-2011 (Rel. 108, Created)
DT   08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE   Trichomonas vaginalis virus 4 strain TVV4-OC5, complete genome.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 4
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4942
RX   DOI; 10.1128/JVI.00220-11.
RX   PUBMED; 21345965.
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA   Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA   Singh B.N., Fichorova R.N., Nibert M.L.;
RT   "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by
RT   Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)";
RL   J. Virol. 85(9):4258-4270(2011).
XX
RN   [2]
RP   1-4942
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA   Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT   ;
RL   Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases.
RL   Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL   Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .4942
FT                   /organism="Trichomonas vaginalis virus 4"
FT                   /host="Trichomonas vaginalis"
FT                   /strain="TVV4-OC5"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="Jan-2010"
FT                   /db_xref="taxon:1008292"
FT   gene            338. .4782
FT                   /gene="pol"
FT   CDS             join(338. .2534,2534. .4782)
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /protein_id="AED99798.1"
FT                   /translation="MSAIATTIASANLNDLSRSANAPQNEGVPALAPQQNNAKPNTGPP
FT                   DPGEGTKQQTTNSSLSANDFTKEHTPATTIQKDTTIPNTDLPSSEEGPRTHSTDFQTLY
FT                   EYFYSYPVPASQTRTGGAITRNGPVNDNNEVVSFTTETALVTSLTPRHIDTNIQPLQIS
FT                   IADDCVNYSCQYSGQTCPIFDGSQHVQSATALASSMKARLMCEVTQSLSARPVQQPQII
FT                   AYLYGALLALGDRLNIHYGNKVNLWNALLGHNLQRGAPINGENFNHHLLIDGPLAPPIL
FT                   PAAGLGPFPSTTLGPNTTVTFKARASIFVRPQTYDYALVDAAFWLIYAMYSRMPVAFRQ
FT                   SYSLNIDFFTVQPMAACVFPGHDGFTTPVIDQALGVLESMLVEMFNGDREIMYYYAFKG
FT                   GQVFMRPCSCYQEGGLIRKASRNVSLASFTGIYSLIGYCAPEARPLHAANHPGIIAALF
FT                   QYVDTMVLQAVLSYSGPKLIHFGAAPEFATKGSTPYNFIDPDNYWGIRAGVNAHPVGYY
FT                   YLDILMRQKEHQLLDETLSDIYGHVGSLAMSNIMASIASSGTEVLNQKMQKSFVRRGNQ
FT                   VRALRHSHAIINRFHEPEYAYRLGILADGIMPLSGTHKCNIIDEATRLLEGEDIRNLPG
FT                   LRCLRGRGLDAIVGIRPINKKRRAGFYTLDGNFHVVTNQSTSDILQVWNDHGYIARPYA
FT                   CHIVESINVEIYDKSNGAYEGWIQALVGGFGVPERCYMGPSSAGSRRRPLCPLKGSNCA
FT                   AALHVDGQLYRASRLPYRKLTTSHLNCSKHCARQLAVIYRYQTLSPQLTEVSDADYLAF
FT                   LRWVLLPYTGATNRPHPKRWPKPFYPREVNLKFLDKETELQLFPLKKVPQADLKVNCFA
FT                   RNLLYSSPLSDRILKQCIPVGTNNDTVCGLIILLELLFEAGVPLDLLPIISVAIAKNDP
FT                   FVKALSDFNKMTGATTSHIANLLTECATLLGRGVAGSEPNVDLYHRVAPEGNPHEAKIS
FT                   DDVLRSAIRTIYKQEIKDCPKPGDFRLHLLTSPFWCKSGSHHHPEFPSYRNRLEFVMNT
FT                   DPDNIIAVKPSVYITQAQKLEHGKTRYIYNCDTVSYLYFDYILNYIESIWANSHVLLNP
FT                   DALNAEKFATLEYPKYCMIDYTDFNSQHTLTSMKVVFEVLKEFLPSEMFPVLDWCITSF
FT                   DNMTIRDKKWRSTLPSGHRATTFINSVLNRAYLLPYIGTIVSYHCGDDVLLCGDYDYQN
FT                   LITRLPYELNPSKQSFGPHAEFLRLHRHGEKVIGYPTRAISSLVSGNWLSTTSWNWQPS
FT                   LLSVTNQINAIICRSQLSINRIRSLAQELRFRYCPLFDGYIDPATTSFVAAGCPSYQPT
FT                   ATMIIPDVPHLDAGEVEFAQLHQLAKYAINTYPWLNSVESVDQLVRNRMRRPAAQDIRY
FT                   SILGPAIPLVSYHHHCDPMVVPPARRYYPRDHLAPPITPQVLPPQPVFCDRDLSPIIAL
FT                   KIAPAGVAVKVTADRPIASA"
FT   CDS             338. .2578
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="capsid protein"
FT                   /protein_id="AED99797.1"
FT                   /translation="MSAIATTIASANLNDLSRSANAPQNEGVPALAPQQNNAKPNTGPP
FT                   DPGEGTKQQTTNSSLSANDFTKEHTPATTIQKDTTIPNTDLPSSEEGPRTHSTDFQTLY
FT                   EYFYSYPVPASQTRTGGAITRNGPVNDNNEVVSFTTETALVTSLTPRHIDTNIQPLQIS
FT                   IADDCVNYSCQYSGQTCPIFDGSQHVQSATALASSMKARLMCEVTQSLSARPVQQPQII
FT                   AYLYGALLALGDRLNIHYGNKVNLWNALLGHNLQRGAPINGENFNHHLLIDGPLAPPIL
FT                   PAAGLGPFPSTTLGPNTTVTFKARASIFVRPQTYDYALVDAAFWLIYAMYSRMPVAFRQ
FT                   SYSLNIDFFTVQPMAACVFPGHDGFTTPVIDQALGVLESMLVEMFNGDREIMYYYAFKG
FT                   GQVFMRPCSCYQEGGLIRKASRNVSLASFTGIYSLIGYCAPEARPLHAANHPGIIAALF
FT                   QYVDTMVLQAVLSYSGPKLIHFGAAPEFATKGSTPYNFIDPDNYWGIRAGVNAHPVGYY
FT                   YLDILMRQKEHQLLDETLSDIYGHVGSLAMSNIMASIASSGTEVLNQKMQKSFVRRGNQ
FT                   VRALRHSHAIINRFHEPEYAYRLGILADGIMPLSGTHKCNIIDEATRLLEGEDIRNLPG
FT                   LRCLRGRGLDAIVGIRPINKKRRAGFYTLDGNFHVVTNQSTSDILQVWNDHGYIARPYA
FT                   CHIVESINVEIYDKSNGAYEGWIQALVGGFGVPERCYMGPRLQVAGGAPSAL"
XX
SQ   Sequence 4942 BP; 1306 A; 1418 C; 991 G; 1227 T; 0 other;

hq607526 Length: 4942  08-MAY-2011  Type: N  Check: 9720  ..

       1  gcttaaagcc ccagtgagct ttaagcacca gaagtcgcag cattacacta
      51  tcatgcctta catgagctgc gaagtggtta gaattaaaca cgcgttttgg
     101  aagtcgtcag acgcgtggtg gtcacttcgg tgatcaagtt atactgacat
     151  ggcgccgggt ggttgacgac cctggccgcc tagttgtagc ctttggctct
     201  ttgccccaag cgtggtttga acctgccttc agtggattct attgagtgta
     251  atacctcaat agaggtgtca aaccggccta gatgtaatag tctaggttta
     301  tggacagtga tgtccacttg tgcttccgtg ctacgtcatg tcagctatag
     351  caaccactat tgcttctgct aatcttaatg atttatcacg ttctgcaaac
     401  gccccgcaaa atgaaggcgt ccctgctctt gcgcctcagc aaaataatgc
     451  taagccaaac actggtcctc ctgaccccgg tgagggaacg aaacaacaaa
     501  ctacaaattc ttcactctca gcaaacgatt tcacaaaaga acatactcct
     551  gctaccacga ttcagaaaga taccactatt ccgaacacag acctccccag
     601  ttccgaagaa ggaccaagaa cacattctac agattttcaa actctttatg
     651  aatattttta cagttatcca gttccagctt cacagaccag gaccggcggt
     701  gctatcacac gtaacggccc agttaatgac aacaacgaag ttgtctcttt
     751  cacaaccgaa acagcattag ttacttcact cacaccaaga catatcgaca
     801  caaacatcca gcctctccaa atctcaatcg ctgatgattg cgtcaactat
     851  tcgtgccaat acagcggcca aacctgcccg atattcgatg gttcacaaca
     901  cgtccagagc gccacagctc tcgccagttc catgaaagct cgcctcatgt
     951  gtgaagtcac acaatctttg tccgcacgtc ctgtccaaca accccaaatc
    1001  atcgcttacc tctatggtgc tttacttgca cttggagatc gcctcaacat
    1051  ccactacggc aataaagtca acctctggaa cgctttactc ggtcacaatt
    1101  tacaaagagg agccccgatc aacggtgaaa atttcaacca tcacttgctc
    1151  atcgatggtc ctctcgctcc tccaatactc ccagccgctg gattaggtcc
    1201  attcccatcg acgacattgg gaccaaacac cacggttaca ttcaaggctc
    1251  gcgcttccat cttcgttcgt ccacaaacct acgattacgc tcttgtcgat
    1301  gccgctttct ggcttattta cgctatgtat tcccgtatgc cagttgcttt
    1351  ccgtcaatcg tactccctca atattgattt cttcaccgtc caaccaatgg
    1401  ccgcttgcgt attcccagga cacgatggtt tcaccacacc agttatcgac
    1451  caagctctcg gcgttctcga atcaatgtta gtcgagatgt tcaatggtga
    1501  tcgagaaatc atgtactact acgcgttcaa aggtggtcag gttttcatgc
    1551  gcccttgctc ctgctaccaa gaaggaggcc tcatccgtaa agcctcacgc
    1601  aatgtctcac tcgcatcatt cacaggtatc tactcactca tcggttattg
    1651  cgcaccagaa gccagaccac tccatgcagc caatcatcca ggcatcatcg
    1701  ctgccctctt ccaatacgtc gatacaatgg tcttacaagc cgtcctttcg
    1751  tactccggcc ccaagctcat ccacttcgga gccgcaccag aattcgccac
    1801  gaaaggctct acaccataca atttcatcga ccctgataac tattggggaa
    1851  tcagggccgg tgtcaatgca cacccagtcg gttattacta cctcgacatc
    1901  ctcatgcgcc aaaaagaaca ccaactcctc gatgagacac tctccgatat
    1951  ctacggacac gtcggctcgc ttgccatgtc aaacataatg gcaagcattg
    2001  catcttccgg cactgaagtt ctcaatcaga agatgcagaa atccttcgtc
    2051  agacgtggca accaagtacg tgcattacgc cactcccacg ccattatcaa
    2101  tcgcttccac gaaccagaat acgcctaccg cctcggaatt ctcgcagatg
    2151  gtatcatgcc tttatcaggt acccacaaat gcaacatcat cgatgaggct
    2201  actcgcttac tcgagggaga agatatccgc aatctcccag gtcttcgttg
    2251  tttacgcggt cgcggcctcg acgccatcgt tggaatccgt ccaatcaaca
    2301  agaagcggcg cgcgggcttc tacaccctcg atggcaattt ccacgtcgtc
    2351  acaaaccaga gcacaagcga catcctccag gtttggaacg atcacggcta
    2401  catcgcacgc ccttacgctt gccacatcgt cgaatccatc aatgtcgaga
    2451  tctacgataa atcaaatggc gcctacgaag gatggatcca agcgctcgtc
    2501  ggcggcttcg gtgtccccga acgctgctac atgggccctc gtctgcaggt
    2551  agcaggaggc gccccctctg ccctttaaag ggcagcaatt gcgcagcagc
    2601  actgcatgta gatggacaac tatatagagc aagtcggcta ccataccgta
    2651  agctcacaac aagtcatctt aattgctcga agcactgcgc acgtcaactg
    2701  gcggttatat atagatacca gacactgagc ccccagttga ccgaggttag
    2751  tgatgcagat tacctcgctt tcctccgttg ggtcctgtta ccttacacag
    2801  gtgctacaaa tagaccacac cccaagcggt ggcctaaacc attttaccca
    2851  agggaggtga atctcaagtt tttagataaa gagaccgagt tacagttgtt
    2901  ccctctcaag aaggtcccac aagccgactt gaaagtcaac tgctttgcaa
    2951  gaaacctact ttattcgtcg cctctttccg accgaattct caagcaatgt
    3001  atcccagttg gaacgaataa cgatacggtt tgtggtctca ttatcttact
    3051  tgagctcctc ttcgaagcgg gggtcccact agacctctta cctattatca
    3101  gtgtcgctat cgcaaagaat gatccattcg ttaaagccct ttcggacttt
    3151  aacaagatga cgggcgcaac cacctcacac attgcaaacc tcttgactga
    3201  gtgcgcgacc ttgcttggcc gcggcgtcgc aggatctgag ccgaatgttg
    3251  atttgtatca ccgggtagct cctgagggca atccacatga agcgaagatc
    3301  agtgacgatg tgttgcgttc cgccattcga accatttaca aacaagaaat
    3351  caaagattgt ccaaagccgg gtgacttccg cttacacctc ctcacaagtc
    3401  cattttggtg caagtctggg tcacaccatc atccagaatt cccttcatat
    3451  cgtaaccgac tcgaattcgt aatgaacacc gatcctgata acattattgc
    3501  tgttaagcca tccgtatata ttacccaagc acaaaaactc gaacatggaa
    3551  agactcgata catatacaat tgcgacacag tttcttactt atacttcgac
    3601  tacatactga actacataga gagcatatgg gccaattctc atgtgctgct
    3651  caatcccgac gctcttaatg cggagaagtt cgccacactt gagtatccga
    3701  agtattgtat gatcgattac actgacttca attcacagca tactctcaca
    3751  tcaatgaaag tagtcttcga agtgctcaaa gagttcttac cctcagagat
    3801  gttcccagtc ctcgattggt gtataaccag cttcgacaac atgacaatca
    3851  gagataagaa atggagaagt acattgcctt ctggccacag agcgaccacc
    3901  tttatcaatt cggtactaaa tagagcctac ttgttacctt acatcggtac
    3951  tatcgtcagc tatcactgtg gtgatgacgt actcttatgt ggggactatg
    4001  attaccagaa cctaataacg cgcctgcctt atgagttgaa cccaagcaag
    4051  caaagtttcg gaccgcatgc cgaattctta cgcttgcaca ggcatggtga
    4101  aaaagttata ggatatccca ctcgcgcaat ttcatcacta gtttccggca
    4151  actggcttag tacgacaagt tggaattggc aaccatcctt gctatctgta
    4201  acgaatcaga taaacgccat catctgccgc tcgcaactct ccataaatag
    4251  aatacgttcg cttgcccaag aactacgctt ccgctactgt ccactgtttg
    4301  acggctacat tgacccggcc acaaccagct ttgtagctgc gggctgtccc
    4351  tcatatcaac caactgcgac aatgatcatc cctgacgtgc cccacctgga
    4401  cgccggagag gtcgaattcg ctcaactcca tcagttggct aaatatgcta
    4451  tcaatactta cccgtggctt aactccgtcg agtctgttga ccagctcgtt
    4501  agaaacagaa tgcgtagacc tgctgcacaa gacatccgtt atagcatact
    4551  tggcccggct atcccacttg tttcttacca ccatcactgt gatccaatgg
    4601  tcgtcccacc tgcgaggagg tattatccac gagaccactt agcgcctccg
    4651  attacaccac aagttttacc tcctcaacca gttttttgcg atagagattt
    4701  gtctccaata atagcactca aaatagcacc cgccggtgtg gcagtaaaag
    4751  ttactgcaga ccggccgata gccagtgctt aaataggtgg cccactgcag
    4801  tgggctaatg actgcagact tagttggcat taaacaccac cttttgccaa
    4851  ctcgcctacg taagtaggaa cagccgtatg gctaacaacc catacgtctc
    4901  agcacgcgct ttgtgctaga cctatgactc ccggtcttcc tc