Sequence of DPV Trichomonas vaginalis virus 4

Trichomonas vaginalis virus 4 strain TVV4-1, complete genome.

ACC No: HQ607522

Dated: 2011-05-08 | Length: 4943 | CRC: 487838294

                
ID   HQ607522; SV 1; linear; genomic RNA; STD; VRL; 4943 BP.
XX
AC   HQ607522;
XX
DT   08-MAY-2011 (Rel. 108, Created)
DT   08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE   Trichomonas vaginalis virus 4 strain TVV4-1, complete genome.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 4
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4943
RX   DOI; 10.1128/JVI.00220-11.
RX   PUBMED; 21345965.
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA   Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA   Singh B.N., Fichorova R.N., Nibert M.L.;
RT   "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by
RT   Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)";
RL   J. Virol. 85(9):4258-4270(2011).
XX
RN   [2]
RP   1-4943
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA   Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT   ;
RL   Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases.
RL   Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL   Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .4943
FT                   /organism="Trichomonas vaginalis virus 4"
FT                   /host="Trichomonas vaginalis"
FT                   /strain="TVV4-1"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="Dec-2009"
FT                   /db_xref="taxon:1008292"
FT   gene            338. .4782
FT                   /gene="pol"
FT   CDS             join(338. .2534,2534. .4782)
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /protein_id="AED99796.1"
FT                   /translation="MSAITATISSANLNDLSRSANAQQNNGVPALAPQQNIAKPNTGPP
FT                   DPGDGTRQQTIPSSPKTDDHTKEPVSAPTTQQNVTTSDKELPDSAEGPRTHSTDFQTLY
FT                   EYFYSYPVPASQTRTGGAIARNGPVNDNNEVVSFTTETQLVTSLTPRHIDANIQPLQIS
FT                   IADDCVNYSCEYSGQTCPVFDGSQHVQSATALASSMKARLMCEVTQSLSARPVQQPQVI
FT                   AYLYGALLAFGDRLNLHYGNKVNLWNALLGHNLQRATPVNGDAFNHHLLIDGALAPPIL
FT                   PAAGLGPFPSTTLGPNTTVTFKARASIFVRPQTYDYALVDAAFWLIYAMYSRMPVAFRQ
FT                   AYSLNVDFFTVQPMAACVFPGHDGFTTPVIDQALGVLESMLVEMFNGDRQIMYYYAFKG
FT                   GQLFMRPCSCYQEGGLIRKASRNVSLASFTGIYSLIGYCAPDARPLHAASHPGIIAALF
FT                   QYVDTMVLQAVLSYSGPKLVHFGAAPEFATKGSTPYDFIDPDNYWGIRAGVNAHPVGYY
FT                   YLDILMRQKEHQLLDETLSDIYGHVGSLAMSNIMASVASSGTEVLNQKMQKSFVRRGNQ
FT                   VRALRHSHAIINRFHEPEYAYRLGILADGIMPLAGTHKCDIIDEATRLLQGEDIRNLPG
FT                   LRCLRGRGLDAIVGIRPINKKRRAGFYTLDGNFHVVTNQCTSDVLQVWNDHGYIARPYA
FT                   CHIVESINVEIYDKSNGAYNGWIQALVGGFGVPERCYMGPSSAGSRRRPLCPLKGSNRA
FT                   VALHVDGQLDRASRVPYRKLAPCHLNCSKRCARQLAVIYRYQTLSRQLPEVSDEDYLAF
FT                   LRWVLLPYTGATNRPHPKRWPKPFYPREVNLKFLDKETELQLFPLKKVPQADLKVNCFA
FT                   RNLLYSSPLSDRVLKACIPVGTNNDTVCGLLVLLELLFEAGVPLDLLPTISVAIAKNDP
FT                   FVKALSDFNKMTGATTSNIANLLTECTTLLGRGVTASAPSADLYHRVAPEGNRHEAKVS
FT                   DDVLYSAIRTIYKQEIKDCPKPGDFGLHLLTSPFWCKSGSHHHPEFPSYRNRLEFVMNT
FT                   DPDSIAAVKPSVYITQAQKLEHGKTRYIYNCDTVSYLYFDYVLNYIESIWANSHVLLNP
FT                   DALNAEKFATLEYPEYCMIDYTDFNSQHALNSMKAVFKVLKEFLPSEMFPVLDWCISSF
FT                   DNMTIKNTKWRSTLPSGHRATTFINSVLNRAYLLPYIGTIVSYHCGDDVLLCGDYDYQN
FT                   LITRLPFELNPSKQSFGPHAEFLRLHRHGEKVVGYPTRAISSLVSGNWLSTTSWNWQPS
FT                   LLSITNQINAIICRSQLSIHTIRSLAQELRFRYCPLLDDYIDPATTSFVAAGCPSYRPT
FT                   ATMIVPDVPHLDAEEVEFTQLRRLAEYAINTYPWLNSTESVNQLVRNRTRKPAAKAIRY
FT                   NILGSAVPLVCYHHHCDSMIVPLARRYYPRDHLAPPVTPQVLPPQPVFCDRNLSPIIAL
FT                   KLAPAGVAVKVTADRPIASA"
FT   CDS             338. .2578
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="capsid protein"
FT                   /protein_id="AED99795.1"
FT                   /translation="MSAITATISSANLNDLSRSANAQQNNGVPALAPQQNIAKPNTGPP
FT                   DPGDGTRQQTIPSSPKTDDHTKEPVSAPTTQQNVTTSDKELPDSAEGPRTHSTDFQTLY
FT                   EYFYSYPVPASQTRTGGAIARNGPVNDNNEVVSFTTETQLVTSLTPRHIDANIQPLQIS
FT                   IADDCVNYSCEYSGQTCPVFDGSQHVQSATALASSMKARLMCEVTQSLSARPVQQPQVI
FT                   AYLYGALLAFGDRLNLHYGNKVNLWNALLGHNLQRATPVNGDAFNHHLLIDGALAPPIL
FT                   PAAGLGPFPSTTLGPNTTVTFKARASIFVRPQTYDYALVDAAFWLIYAMYSRMPVAFRQ
FT                   AYSLNVDFFTVQPMAACVFPGHDGFTTPVIDQALGVLESMLVEMFNGDRQIMYYYAFKG
FT                   GQLFMRPCSCYQEGGLIRKASRNVSLASFTGIYSLIGYCAPDARPLHAASHPGIIAALF
FT                   QYVDTMVLQAVLSYSGPKLVHFGAAPEFATKGSTPYDFIDPDNYWGIRAGVNAHPVGYY
FT                   YLDILMRQKEHQLLDETLSDIYGHVGSLAMSNIMASVASSGTEVLNQKMQKSFVRRGNQ
FT                   VRALRHSHAIINRFHEPEYAYRLGILADGIMPLAGTHKCDIIDEATRLLQGEDIRNLPG
FT                   LRCLRGRGLDAIVGIRPINKKRRAGFYTLDGNFHVVTNQCTSDVLQVWNDHGYIARPYA
FT                   CHIVESINVEIYDKSNGAYNGWIQALVGGFGVPERCYMGPRLQVAGGAPSAL"
XX
SQ   Sequence 4943 BP; 1286 A; 1428 C; 1008 G; 1221 T; 0 other;

hq607522 Length: 4943  08-MAY-2011  Type: N  Check: 7603  ..

       1  gcttaaagtc ccagtgagct ttaagcacca gaagtcgcag catagcacca
      51  caatatttca cattagctgc gaggtggtta gaattaaaca cgcgttttgg
     101  aagtcgtcag acgcgtggtg gtcactttgg tgatcaagtt atactgacat
     151  ggcgccgggt ggttgacgac cctggccgcc tagttgtagc ctttggctct
     201  ttgccccaag cgtggtttga acctgccttt agtggaatct gtcgagttag
     251  atctcccgat agaggtgtca aaccgactta gatgtaatag tctaagttta
     301  tggacagtga tgtccacttg tgcttccgtg ctacgtcatg tcagctataa
     351  cagccactat ttcttctgct aatcttaatg atttatcacg ttctgcaaac
     401  gcccagcaaa ataacggcgt ccctgctctt gcgcctcagc aaaatattgc
     451  taagccaaac actggtcctc ctgacccggg agatggaaca agacaacaaa
     501  ctataccctc ttcacctaaa acagacgatc atacaaaaga acctgtttcc
     551  gctcctacaa ctcaacaaaa cgttactacc tcagataaag aactccccga
     601  ttccgccgaa ggaccaagaa cacactctac agattttcaa actctttacg
     651  aatattttta cagttatcca gttccagcat cacagaccag gaccggcggt
     701  gctattgcac gcaatggtcc agttaacgac aacaatgaag ttgtctcatt
     751  cactactgaa acacaattag ttacatcact tacaccaaga catatcgatg
     801  caaatatcca gcctcttcag atctcgatcg ccgatgattg cgtcaactat
     851  tcatgcgaat acagcggtca aacctgccca gttttcgatg gttcacagca
     901  cgtccaaagc gccacagccc ttgctagttc catgaaggcg cgcctcatgt
     951  gcgaagttac acaatcatta tctgcacgcc ctgtccagca accacaagtc
    1001  attgcttatc tctatggtgc cttacttgca ttcggagacc gccttaacct
    1051  tcattatggt aacaaagtca atctctggaa cgctttactc ggccacaatt
    1101  tacaaagagc aacaccagtc aacggtgatg ccttcaatca ccacttgctc
    1151  atcgatggcg ctctcgctcc tccaatactt ccggcagctg gattaggtcc
    1201  attcccatca acaacattag gacccaatac caccgttaca ttcaaagctc
    1251  gcgcatctat tttcgtccgt ccacaaactt acgactacgc gctcgtcgat
    1301  gccgctttct ggcttatata cgccatgtac tctcgcatgc cagttgcttt
    1351  ccgccaagcg tactcactca atgttgattt ctttactgtc cagccaatgg
    1401  ccgcttgcgt atttccagga cacgatggtt tcacaacacc agttatcgac
    1451  caagcgctcg gtgtcctcga atcaatgttg gtcgaaatgt tcaacggaga
    1501  ccgccaaatc atgtactact acgctttcaa gggcggtcaa ctcttcatgc
    1551  gcccttgttc ttgctaccaa gaaggaggcc tcatccgcaa agcctcacgc
    1601  aatgtctcgc tcgcttcatt tacaggcatt tactcgctca tcggctactg
    1651  cgcgccagac gccagaccac tccatgcagc cagtcaccca ggtataatcg
    1701  ctgccctctt ccagtacgtc gatacaatgg tcttacaagc cgtcctctcg
    1751  tattccggcc ccaagctcgt ccacttcggc gccgcacccg aattcgctac
    1801  aaaaggctcc acaccatacg attttatcga ccctgataac tattggggaa
    1851  tcagggccgg cgtcaacgca catccagtcg gttactacta cctcgacatc
    1901  cttatgcggc aaaaagaaca ccaactcctc gacgaaacgc tttccgatat
    1951  ttacggccac gtaggttcgc tcgcaatgtc aaacataatg gcgagcgtcg
    2001  catcttctgg caccgaagtt ctcaatcaga agatgcagaa atccttcgtc
    2051  cgacgcggca accaagtacg cgcattacgc cactcccacg ccatcatcaa
    2101  ccgcttccac gaacccgaat acgcttaccg cctcggaatc ctcgcagatg
    2151  gcatcatgcc cttagcgggt acgcacaagt gcgatatcat cgacgaggcc
    2201  acacgcttac tccagggaga agacatccgc aatctcccag gcctccgttg
    2251  cttacgcggt cgcggactcg atgccatcgt cggcatccgc cctatcaata
    2301  agaagcggcg cgcaggcttc tacactctcg acggcaattt ccacgttgtt
    2351  acaaaccagt gcacaagcga cgtccttcag gtttggaacg atcacggcta
    2401  catcgcgcgc ccttacgctt gccacatcgt cgaatccatc aacgtcgaaa
    2451  tctacgataa gtcaaatggt gcttacaacg gatggattca ggcactcgtc
    2501  ggcggcttcg gtgttccgga gcgctgctac atgggccctc gtctgcaggt
    2551  agcaggcggc gccccctctg ccctttaaag ggcagtaatc gcgcagtagc
    2601  actgcatgta gatggacaac tagatagagc aagtcgggta ccataccgta
    2651  agctcgcccc atgtcatctt aattgctcga agcgctgcgc acgacagttg
    2701  gcggtaatat acagatacca gacactgagc cgccaattgc ccgaggtcag
    2751  tgatgaagat tatctcgctt tcctccgttg ggtcctgtta ccttacacag
    2801  gtgctacaaa tagaccacac cccaagcggt ggcctaaacc attttaccca
    2851  agggaggtga atctcaagtt tttggataag gagacagaat tgcagttgtt
    2901  ccctctcaag aaggtcccac aagccgactt gaaagtcaat tgcttcgcaa
    2951  gaaacctcct ttattcgtcg cctctttccg atcgtgttct caaggcttgc
    3001  attccagtag ggacgaataa tgatacggtt tgtggtctcc ttgtcttact
    3051  tgagctcctc ttcgaagcgg gagtcccact agacctcctt cctactatca
    3101  gtgtcgctat cgcaaagaat gatccattcg ttaaagccct ttccgacttt
    3151  aacaagatga cgggtgcaac cacctcaaac atcgcaaacc tcttgactga
    3201  gtgcacgacc ttacttggtc gtggcgttac tgcatctgcg ccaagtgccg
    3251  atttgtatca ccgggtagct cctgagggca atcgacacga agctaaggtt
    3301  agtgacgatg tgttgtattc cgccataaga accatttaca aacaagaaat
    3351  caaagattgt ccaaaaccag gtgacttcgg cttacacctc cttacaagtc
    3401  cattttggtg caagtctggg tcacaccacc atccagaatt cccttcatac
    3451  cgaaaccgac tcgaattcgt aatgaacacc gatcccgact ctattgccgc
    3501  cgttaaaccg tccgtgtaca ttactcaagc acagaaactc gaacatggta
    3551  aaactcgata catatataac tgcgatacgg tctcgtacct atacttcgac
    3601  tacgtactga actacataga gagcatatgg gccaactctc atgtgctact
    3651  caatccagac gctctcaatg cggaaaagtt cgccacactc gagtatccgg
    3701  agtattgtat gattgactac accgacttca attcgcaaca cgctctcaat
    3751  tctatgaaag cagttttcaa agtactgaaa gagttcctgc cttcagaaat
    3801  gttcccggtt ctagattggt gtatcagtag tttcgataac atgacaatca
    3851  aaaacacaaa gtggagaagc accttaccct caggccacag agcgaccaca
    3901  tttattaatt ctgtgttaaa cagggcctat ttgttgcctt acatcggtac
    3951  tatcgtcagc tatcattgcg gtgatgacgt actcttgtgt ggtgactacg
    4001  actaccaaaa cctaataaca cgcctgcctt ttgagttgaa tccaagcaag
    4051  cagagtttcg gaccacatgc cgaattctta cgtttgcaca ggcatggtga
    4101  gaaagtcgtt ggttacccta cgcgcgcaat ttcatcactg gtctccggca
    4151  actggcttag tacgacgagt tggaattggc aaccatcctt actatctata
    4201  accaatcaaa taaacgctat catctgccgc tcacagctct ccatacatac
    4251  gatacgttca ctcgcccaag aattacgctt ccgttactgt ccactactcg
    4301  acgactacat tgacccagct acaaccagct ttgtagccgc gggatgcccc
    4351  tcatatcgac caaccgcaac aatgatcgtc cccgacgtac cccacctgga
    4401  cgccgaagag gtcgaattta cccaactccg taggttggcc gaatatgcta
    4451  tcaatactta cccgtggctt aactcaactg agtccgttaa ccagctcgtc
    4501  agaaatagaa cgcgtaaacc tgcagctaaa gccatccgtt acaacatact
    4551  cggttcagct gtcccacttg tttgttacca tcatcactgc gactcaatga
    4601  ttgttccatt agcgaggagg tattatccac gagaccactt agcgcctccg
    4651  gttacacctc aagttttacc tcctcaacca gttttttgcg atagaaactt
    4701  gtccccaata atagcactca agctagcacc cgccggtgtg gcagtaaagg
    4751  ttactgcaga ccggccgata gccagtgctt aaataaatgg ccctctgcag
    4801  tgggcctatg actgcagatt cagttggcac ttaacaccag ttctttgcca
    4851  actcgcctac gtaagtagga atagccgtat ggcaaacaaa ccatacgtct
    4901  cagcaccagc ttagtgctag acctatgact cccggtcttc ctc