Sequence of DPV Trichomonas vaginalis virus 4

Trichomonas vaginalis virus 4 strain TVV4-OC3, complete genome.

ACC No: HQ607520

Dated: 2011-05-08 | Length: 4944 | CRC: -66071358

                
ID   HQ607520; SV 1; linear; genomic RNA; STD; VRL; 4944 BP.
XX
AC   HQ607520;
XX
DT   08-MAY-2011 (Rel. 108, Created)
DT   08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE   Trichomonas vaginalis virus 4 strain TVV4-OC3, complete genome.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 4
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4944
RX   DOI; 10.1128/JVI.00220-11.
RX   PUBMED; 21345965.
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA   Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA   Singh B.N., Fichorova R.N., Nibert M.L.;
RT   "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by
RT   Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)";
RL   J. Virol. 85(9):4258-4270(2011).
XX
RN   [2]
RP   1-4944
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA   Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT   ;
RL   Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases.
RL   Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL   Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .4944
FT                   /organism="Trichomonas vaginalis virus 4"
FT                   /host="Trichomonas vaginalis"
FT                   /strain="TVV4-OC3"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="Nov-2009"
FT                   /db_xref="taxon:1008292"
FT   gene            339. .4783
FT                   /gene="pol"
FT   CDS             join(339. .2535,2535. .4783)
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /protein_id="AED99794.1"
FT                   /translation="MSAIAATISSANLNDLSRLAGAQPKEGVPAPVLQQNIATPKTGPP
FT                   DPGEGTRKQTTDSPHSANPSTKEHTPAPTIQPDTPTPITDHSDSEEGPRTHSTDFQTLY
FT                   EYFYSYPVPASQTRTGGAIARAGPVNDNNEVVSFTTETALVTSLTPKHIDANIQPLQIS
FT                   IADDCVNYSCQYSGQTCPIFDGSQHVQSATALASSMKARLMCEVTQSLSARPVQQPQLI
FT                   AYLYGALLAFGDRLNIHYGNKVNLWNALLGHNLQRGTPINGDNFNHHLLIDGPLAPPIL
FT                   PAAGLGPFPSTTLGPNTTVTFKARASIFVRPQTYDYALVDAAFWLIYAMYSRMPVAFRQ
FT                   AHSLNIDFFTVQPMAACVFPGHDGFTTPVIDQALGVLESMLVEMFNGDREIMYYYAFKG
FT                   GQIFMRPCSCYQEGGLIRKASRNVSLASFTGIYSLIGYCAPEARPLHAANHPGIIAALF
FT                   QYVDTMVLQAVLSYSGPKLVHFGAAPEFCSKGSTPYDFIDPDNYWGIRAGVNAHPIGYY
FT                   YLDILMRPKEHQLLDETLSDIYGHVGSLAMANIMASIASSGTEVLNQKMQKSFVRRGNQ
FT                   VRALRHSHAIINRFHEPEYAYRLGILADGIIPLAGTHKCDIIDEATRLLQGEDIRNLPG
FT                   LRCLRGRGLDAIIGIRPINKKRRAGFYTLDGNFHVVTNQCTSDVLQVWNDHGYIARPYA
FT                   CHIVESINVEIYDRSNGAYNGWIQALVSGFGVPERCYMGPSSAGSRRRPLCPLKGSNCA
FT                   VALHVDGQLTRASRVPYRKLTPSHLNCSKRCARQLAVIYRYQTLSPQLTEVSDSDYLAF
FT                   LRWVLLPYTGATNRPHPKRWPKPFYPAEVSLKFLDKKTELQLFPLKKAPQADLKVNCFA
FT                   RNLLYSSPLSDRILKQCIPVGTNNDTVCGLVILLELLFEAGVPLDLLPTISVAIAKNDP
FT                   FVKALSDFNKMTGATTSHIANLLTECTTLLGRGVTASAPNADLYHRVAPEGNRHEAKIS
FT                   DDVLRSAIRTIYKQEIKDCPKPGDFGLHLLTSPFWCKSGSHHHPQFPRYRNRLEFVMNT
FT                   DPSAIMAVKPSVYITQAQKLEHGKTRYIYNCDTVSYLYFDYILNYVESIWANSHVLLNP
FT                   DALNAEKFATLEYSEYCMIDYTDFNSQHTLTSMKAVFEVLKEFLPSEMFPVLDWCISSF
FT                   DNMTIKDMKWRSTLPSGHRATTFINSVLNRAYLLPYIGTIVSYHCGDDVLLCGEHDYQH
FT                   LITRLPYELNPSKQSFGPHAEFLRLHRHGEKVIGYPTRAVSSLVSGNWLSTTSWNWQPS
FT                   LLSITNQINAIICRSQLSISRIRSLAQELRFRYCPLLDNYIDPATTSFVAAGCPSYQPT
FT                   ATMITPDVPHLDAEEVEFTQLHQLAEYAINTYPWLNSVESVNQLVRSRMRKPAARDIHY
FT                   SVLGPAIPLVSYHHHCDPMVVPLTRRYYPRDHLAPPITPQVLPPQPVFCDRDLSPIMAL
FT                   KIAPAGVAVKVTADRPIASA"
FT   gene            339. .2579
FT                   /gene="cap"
FT   CDS             339. .2579
FT                   /codon_start=1
FT                   /gene="cap"
FT                   /product="capsid protein"
FT                   /protein_id="AED99793.1"
FT                   /translation="MSAIAATISSANLNDLSRLAGAQPKEGVPAPVLQQNIATPKTGPP
FT                   DPGEGTRKQTTDSPHSANPSTKEHTPAPTIQPDTPTPITDHSDSEEGPRTHSTDFQTLY
FT                   EYFYSYPVPASQTRTGGAIARAGPVNDNNEVVSFTTETALVTSLTPKHIDANIQPLQIS
FT                   IADDCVNYSCQYSGQTCPIFDGSQHVQSATALASSMKARLMCEVTQSLSARPVQQPQLI
FT                   AYLYGALLAFGDRLNIHYGNKVNLWNALLGHNLQRGTPINGDNFNHHLLIDGPLAPPIL
FT                   PAAGLGPFPSTTLGPNTTVTFKARASIFVRPQTYDYALVDAAFWLIYAMYSRMPVAFRQ
FT                   AHSLNIDFFTVQPMAACVFPGHDGFTTPVIDQALGVLESMLVEMFNGDREIMYYYAFKG
FT                   GQIFMRPCSCYQEGGLIRKASRNVSLASFTGIYSLIGYCAPEARPLHAANHPGIIAALF
FT                   QYVDTMVLQAVLSYSGPKLVHFGAAPEFCSKGSTPYDFIDPDNYWGIRAGVNAHPIGYY
FT                   YLDILMRPKEHQLLDETLSDIYGHVGSLAMANIMASIASSGTEVLNQKMQKSFVRRGNQ
FT                   VRALRHSHAIINRFHEPEYAYRLGILADGIIPLAGTHKCDIIDEATRLLQGEDIRNLPG
FT                   LRCLRGRGLDAIIGIRPINKKRRAGFYTLDGNFHVVTNQCTSDVLQVWNDHGYIARPYA
FT                   CHIVESINVEIYDRSNGAYNGWIQALVSGFGVPERCYMGPRLQVAGGAPSAL"
XX
SQ   Sequence 4944 BP; 1278 A; 1431 C; 1008 G; 1227 T; 0 other;

hq607520 Length: 4944  08-MAY-2011  Type: N  Check: 6314  ..

       1  gcttaaagtc ccagtgagct ttaagcacca gaagtcgcag caacaaacag
      51  ttgtacttta catgagctgc gaggtggtta gaattaaaca cgcgttttgg
     101  aagtcgtcag acgcgtggtg gtcacttcgg tgatcaagtt atactgacat
     151  ggcgccgggt ggttgacgac ccgggccgcc tagttgtagc cctttggctc
     201  tttgccccaa gcgtggtttg aacctgcctt tagtggattc tgttgagtgt
     251  aatatctcaa tggaggtgtc agaccggcct agatgtaata gtctaggttt
     301  atggacagtg atgtctactt gtgcttccgt gctacgtcat gtcagctata
     351  gcagccacta tttcttctgc taatcttaat gatttatcac gtttagcagg
     401  cgcccagcca aaagaaggcg tccctgctcc tgtgcttcag caaaatattg
     451  ctacgccaaa aacaggtcct cctgaccctg gagaaggaac aagaaaacaa
     501  acaacagatt caccacattc agcaaacccc tcgacaaagg aacatactcc
     551  tgctcccaca atccagccag atactccgac tccaattaca gaccactccg
     601  attccgaaga aggaccaaga acacattcta ctgattttca aactctttat
     651  gaatattttt acagttaccc agttccagcc tcacagacca ggaccggcgg
     701  tgctatcgca cgcgctggcc cagtcaacga caacaatgaa gtcgtctcct
     751  tcacaacaga aacagcattg gttacatcac ttacaccaaa acatattgat
     801  gcaaatatcc aacctctcca gatctcaatc gcggatgact gcgtcaatta
     851  ttcgtgccaa tacagcggtc aaacctgccc gatattcgat ggttcacagc
     901  acgtccagag tgccacggct ctcgccagct ccatgaaggc gcgcctcatg
     951  tgtgaagtca cacaatcttt atccgcacgc cctgttcaac aacctcaact
    1001  cattgcttac ctttacggcg cgttactcgc attcggcgat cgcctcaaca
    1051  ttcattacgg taacaaagtc aacctctgga acgccttact tggccacaat
    1101  ttgcaaagag gtacgccgat caacggcgac aacttcaacc atcacttact
    1151  catcgatggt cctctcgctc ctccaatact cccagctgct ggattaggtc
    1201  cattcccatc gacgacattg ggacctaaca ccaccgtcac ctttaaggct
    1251  cgcgcatcca ttttcgtccg tccacagact tacgattacg ctcttgtcga
    1301  tgccgccttc tggcttatct acgccatgta ctctcgcatg ccagttgctt
    1351  tccgccaagc acattctctc aatatcgact tcttcaccgt ccagccaatg
    1401  gccgcctgtg tatttccagg acacgatggc ttcaccaccc cagttatcga
    1451  tcaagctctc ggtgttcttg aatcaatgtt ggttgagatg ttcaacggtg
    1501  atcgcgaaat catgtactac tacgctttca agggtggcca gatcttcatg
    1551  cgtccttgct cctgctacca ggaaggaggc ctcatccgca aagcctcacg
    1601  caatgtctca ctcgcttcgt ttacaggcat ctactcgctc attggttact
    1651  gtgcaccaga agccagacca ctccatgcag ccaatcaccc aggcattatt
    1701  gcagctcttt tccagtacgt cgacacaatg gtcttacagg ctgttctttc
    1751  ctactctggt cccaaactcg tccacttcgg agccgcacca gagttctgct
    1801  ccaaaggctc cacaccatac gatttcatcg accctgataa ctattgggga
    1851  atcagggctg gtgtcaatgc acatccaatc ggttattact atctcgacat
    1901  cctcatgcga ccaaaagaac accaactcct cgacgaaaca ctctccgata
    1951  tctacggaca cgtcggttca ctcgccatgg caaacataat ggcaagcatc
    2001  gcttcttctg gcaccgaagt tctcaatcag aagatgcaga aatccttcgt
    2051  cagacgcggc aaccaagtcc gcgcgttacg tcactcccac gccattatca
    2101  atcgcttcca cgaaccagaa tacgcctacc gcctcggaat ccttgcagat
    2151  ggtatcattc ctttagccgg tacgcacaag tgcgatatta tcgacgaagc
    2201  cactcgctta cttcagggag aagatatccg caatctccct ggtcttcgct
    2251  gtttacgcgg ccgcggcctc gacgccatca tcggtattcg tccgatcaac
    2301  aagaagcggc gtgcaggctt ctacacactc gacggcaatt tccacgtcgt
    2351  cacgaaccag tgcacaagtg acgttctaca ggtctggaac gatcacggct
    2401  acatcgcgcg tccttacgct tgccacatcg tcgaatccat caacgtcgaa
    2451  atctacgaca gatccaatgg cgcttacaac ggatggattc aggcgctcgt
    2501  cagcggcttc ggtgttccgg agcgctgcta catgggccct cgtctgcagg
    2551  tagcaggagg cgccccctct gccctttaaa gggcagcaat tgcgcagtag
    2601  cactgcatgt agatggacaa ctaactagag caagtcgggt accataccgt
    2651  aagctcacac caagtcatct taattgctcg aagcgctgcg cacgtcaact
    2701  ggcggttata tatagatacc agacactgag cccccagttg accgaggtta
    2751  gtgatagtga ttatctcgct ttcctccgtt gggtcctgtt accttataca
    2801  ggtgctacaa atagaccaca ccccaagcgg tggcctaaac cattttaccc
    2851  agcggaggtg agtctcaagt ttttagataa aaagaccgaa ttacagttgt
    2901  ttcctctcaa gaaggcccca caagccgacc tgaaagtcaa ctgcttcgca
    2951  agaaacctac tttattcgtc acctctttcc gaccgtattc tcaaacaatg
    3001  catcccagtt ggaacgaata acgatacggt ttgtggtctc gttatcttac
    3051  ttgagctcct cttcgaagcg ggagtcccac tagacctcct acctactatc
    3101  agtgtcgcta tcgcaaagaa cgatccattc gttaaagccc tttcggactt
    3151  taacaagatg acgggcgcaa ccacctcaca cattgcaaac ctcttgacag
    3201  agtgcacgac cttacttggc cgtggcgtca ccgcatccgc accgaatgcc
    3251  gatttgtatc accgggtagc tcctgagggc aatcgacatg aggccaagat
    3301  cagtgacgat gtgttgcgtt ccgccataag aaccatttac aaacaagaaa
    3351  tcaaagattg tccaaagccg ggtgactttg gcttacacct cctcacaagt
    3401  cccttttggt gcaagtccgg gtcacaccac catccacaat tccctcgata
    3451  ccgcaaccga cttgaattcg taatgaacac tgatcccagc gctattatgg
    3501  ctgttaagcc atccgtgtac attacccagg cacagaaact cgaacatggg
    3551  aaaactcgat acatatataa ttgtgacaca gtctcgtatt tgtatttcga
    3601  ctatatactg aactacgtag agagcatatg ggccaattct catgtgctac
    3651  tcaatccaga cgctcttaac gcggaaaagt tcgccacact tgaatattcg
    3701  gaatattgta tgattgacta cactgacttc aactcacaac acacccttac
    3751  atctatgaaa gcagtcttcg aagtacttaa agagtttcta ccttcagaaa
    3801  tgtttccagt tctcgattgg tgtatcagca gtttcgacaa catgacaatc
    3851  aaagatatga aatggagaag cacattaccc tcaggccaca gagcgacaac
    3901  attcatcaat tctgtactca atagagccta tttgttacct tacatcggta
    3951  ctatcgtcag ttaccactgc ggtgatgatg tactcttatg cggtgagcac
    4001  gattaccagc acttaataac gcgcctgcct tatgagttga acccgagcaa
    4051  acaaagtttc ggaccacatg ccgaattcct acgtttgcat aggcatggtg
    4101  aaaaagttat tggctaccct acacgtgcag tttcatcact agtctccggc
    4151  aattggctta gtacgacaag ttggaactgg caaccatccc tgctatctat
    4201  aacgaatcaa ataaacgcta tcatctgtcg ctcacagctc tccataagta
    4251  gaatacgttc acttgcccaa gaattacgct tccgttactg tccattgctt
    4301  gacaactaca tcgacccggc tacaaccagc tttgtagctg cgggctgccc
    4351  ctcatatcag ccaactgcaa caatgatcac tcccgacgta ccccacctgg
    4401  acgccgaaga agtcgaattc actcagctcc accagttggc cgaatatgct
    4451  atcaatactt acccgtggct taactctgtc gagtccgtca accagcttgt
    4501  cagaagcaga atgcgtaagc ctgcagcacg agacatccat tacagcgtac
    4551  ttggtccggc tatcccactt gtttcttacc accaccactg tgatccaatg
    4601  gtcgttcccc ttacgaggag gtattatcct cgagaccact tagcgcctcc
    4651  gattacacct caagttttac ctcctcaacc agttttttgc gatagagatt
    4701  tgtccccaat aatggcactc aaaatagctc ccgccggtgt ggcagtaaag
    4751  gttactgcag accggccgat agccagtgct taaataggtg gccctctgca
    4801  gtgggcctat aactgcagac tcagttggca ttaaacacca gttttttgcc
    4851  agcacgccta cgtaagtagg aacagccgta tggctaacaa cccatacgtc
    4901  tcagcaccag ctttgtgcta gacctatgac tcccggtctt cctc