Sequence of DPV Trichomonas vaginalis virus 2

Trichomonas vaginalis virus 2 strain TVV2-OC5, complete genome.

ACC No: HQ607524

Dated: 2011-05-08 | Length: 4671 | CRC: 784695631

                
ID   HQ607524; SV 1; linear; genomic RNA; STD; VRL; 4671 BP.
XX
AC   HQ607524;
XX
DT   08-MAY-2011 (Rel. 108, Created)
DT   08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE   Trichomonas vaginalis virus 2 strain TVV2-OC5, complete genome.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 2
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4671
RX   DOI; 10.1128/JVI.00220-11.
RX   PUBMED; 21345965.
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA   Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA   Singh B.N., Fichorova R.N., Nibert M.L.;
RT   "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by
RT   Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)";
RL   J. Virol. 85(9):4258-4270(2011).
XX
RN   [2]
RP   1-4671
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA   Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT   ;
RL   Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases.
RL   Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL   Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .4671
FT                   /organism="Trichomonas vaginalis virus 2"
FT                   /host="Trichomonas vaginalis"
FT                   /strain="TVV2-OC5"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="Jan-2010"
FT                   /db_xref="taxon:674954"
FT   gene            296. .4605
FT                   /gene="pol"
FT   CDS             join(296. .2378,2378. .4605)
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /protein_id="AED99810.1"
FT                   /translation="MASTLISSDNSATLGKDSEVINDTDTSPPDNPPSDHSNPQLTKIL
FT                   DEMSKKPCANINEIRKMIRNFQPQFIQLRNGNRPNAQPRTVDSFEWVVRIQSTVETQLL
FT                   GATNTVPQQTLNLDISFTDDSTTITPASIPGSISMLDNSRHIPAIQSMIQNFKARYLGS
FT                   LQDAAQLNSPQYPQLLAYLFGQLIAIKDRLDLFKPSNPLSLADAIFGFTLAQHAHPRYD
FT                   DHRHAKACTGPLVIPAATNSDCGPCGFVQINANQALTLPLGACLFVNPETVNDQSFQDF
FT                   LWLVFATHHRMPNQMQNNWPFALNIVSTCAAPGRQAPHAGELTDERVRLALDTGHRILL
FT                   AMFNDDEETLRYYQRKGIETMFRPCCFYTEGGLLRKATRYVSMVPLNGLYYYNGATSYI
FT                   VSPIHTDAHPGITAAIESFVDIMVLQAVFSFTGPKVVAAKVNANQLDPAMIFGPAIADG
FT                   DGFVYDPLRPAPPLSAFYSEFIHRPAEQRIFQMAMSQIYGSHAPLVIANVINSIHNCKT
FT                   KIVNSKLRATFVRRPAGAPHLKADTAIINRFHDPELAYALGILADGIAPLDGSHEYNIL
FT                   DELDYLFNGGDIRNCFGLNALNTRGLGQIVHIRPKREPGKRPRRGFYTTLDGQVHPVTQ
FT                   DAPLDEIYQWRDHGNLTRPYSCHILDSQGLEFADVSNGRSRGKILVVVNSPLKTCAAYQ
FT                   GPQLRANAGQRHVERINQCGGSVIYPRLSTARARSSSAIDTMTLTLANQLCYLYKSSDL
FT                   HRQLDIMIPQSYLTFLEWLLRISPNREINSIRHFPSQDNHEIITCTLRNLSKEQEIMLF
FT                   PIKDIIQANRRVNAYARNLLDASPLPDFALQQMLLPNTANDVVCAILLLGEVLWMLRCP
FT                   ISIIVNISRAICRNDSFLKDLSDFNKMLGLTKIPIANCLTELNTLQGRGVTSSDAKRDL
FT                   THRIADVNPHEAKISRENLKEAINQIYREEITKKEIPDTFRQHVFTSPLWVKKGAHHHP
FT                   HFKSYDNRLEFVENVDLDEVLQSRPAVYITQAPKLEHGKTRFIYNCDTVSYIYFDYILN
FT                   YVEGVWSNKHVLLNPDYMNPVIFSTLNYDEYCMLDYTDFNSQHSIESMKQVFSSLLPFL
FT                   PTSMHRILQWCVTSFDNMYINNTHWNSTLPSGHRATTFINSVLNRAYLLPFLQVSNAFH
FT                   TGDDVLLCGKADYGTLINTVPYELNKTKQSFGPSAEFLRLHKHNDQVSGYPARAISSLV
FT                   SGNWLSFANPLWQPSLLSIMQQLYTISARSGLLPYIPVTMKLEVQRRYDLRSRITNGLF
FT                   SGDIVPSGCPCYKSNAALLSAVVPDTVVKGSPNFYDLRTLDTLKQTSPWINSASKYMNL
FT                   LZRRHMESDNKNVLYSIQYLPSKMLPIIDVDPADALPLQKRYHPRSHIAHPLPRDAHLK
FT                   ELRFATCRVGPATAIRLGSLWPANRINLIKPVYV"
FT   CDS             296. .2425
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="capsid protein"
FT                   /protein_id="AED99809.1"
FT                   /translation="MASTLISSDNSATLGKDSEVINDTDTSPPDNPPSDHSNPQLTKIL
FT                   DEMSKKPCANINEIRKMIRNFQPQFIQLRNGNRPNAQPRTVDSFEWVVRIQSTVETQLL
FT                   GATNTVPQQTLNLDISFTDDSTTITPASIPGSISMLDNSRHIPAIQSMIQNFKARYLGS
FT                   LQDAAQLNSPQYPQLLAYLFGQLIAIKDRLDLFKPSNPLSLADAIFGFTLAQHAHPRYD
FT                   DHRHAKACTGPLVIPAATNSDCGPCGFVQINANQALTLPLGACLFVNPETVNDQSFQDF
FT                   LWLVFATHHRMPNQMQNNWPFALNIVSTCAAPGRQAPHAGELTDERVRLALDTGHRILL
FT                   AMFNDDEETLRYYQRKGIETMFRPCCFYTEGGLLRKATRYVSMVPLNGLYYYNGATSYI
FT                   VSPIHTDAHPGITAAIESFVDIMVLQAVFSFTGPKVVAAKVNANQLDPAMIFGPAIADG
FT                   DGFVYDPLRPAPPLSAFYSEFIHRPAEQRIFQMAMSQIYGSHAPLVIANVINSIHNCKT
FT                   KIVNSKLRATFVRRPAGAPHLKADTAIINRFHDPELAYALGILADGIAPLDGSHEYNIL
FT                   DELDYLFNGGDIRNCFGLNALNTRGLGQIVHIRPKREPGKRPRRGFYTTLDGQVHPVTQ
FT                   DAPLDEIYQWRDHGNLTRPYSCHILDSQGLEFADVSNGRSRGKILVVVNSPLKTCAAYQ
FT                   GPSFAPMPGSAMWNE"
XX
SQ   Sequence 4671 BP; 1338 A; 1249 C; 858 G; 1225 T; 1 other;

hq607524 Length: 4671  08-MAY-2011  Type: N  Check: 8018  ..

       1  gctttgaagg agtgacgacc ttctaagccc aggcctcgac agcctggtca
      51  gacactcctg gggtccatca ggagacgggt cgctaacgcg aactggatgt
     101  cagtgtcaaa agccttgtgc gccatggata cttggtacac ttctacggga
     151  gtaggtaacc cgcgaacctt gaacagtcat gaggatttaa aactgttctc
     201  cgaactgcta gccttatgca gttgacgtgt attgagaggg cttaattgtt
     251  gctttcatac taatattgta tgatcgtcag cttttaatta cgataatggc
     301  ttcgacgcta atatcgtctg ataattctgc cacgttaggc aaagatagtg
     351  aagttattaa cgacacagat acttcaccac ccgacaatcc acctagtgat
     401  cattcaaatc ctcagttaac aaagattcta gatgagatgt ccaaaaaacc
     451  atgtgcaaat attaatgaaa taagaaaaat gattagaaat ttccaacctc
     501  agtttattca attacgtaac ggcaaccgtc caaatgctca gccccggaca
     551  gtagattctt tcgaatgggt agttcgtatt caaagcactg tcgagaccca
     601  attacttggt gccacgaata ctgtccccca acaaactctt aacctcgata
     651  tctcctttac tgatgattct accactatta ccccagcttc cattccgggc
     701  tcaatttcaa tgcttgataa ttctcgtcac atcccagcga tccagagtat
     751  gatccaaaac ttcaaggctc gttatttagg ttcattacaa gatgccgccc
     801  agctcaattc cccacagtat ccacaactcc tcgcttactt attcggccag
     851  ctaatcgcta tcaaggaccg cctcgatctc ttcaagccat caaacccact
     901  ttctctcgct gatgccatat tcggtttcac gttagctcaa cacgctcacc
     951  cacgttacga tgaccacaga cacgccaaag cttgtacagg accactcgtt
    1001  attccagcag cgaccaacag cgactgtggc ccttgcggtt tcgtacagat
    1051  taatgcaaat caagccctca ctcttcctct tggtgcttgt cttttcgtca
    1101  acccagagac ggttaatgat caatctttcc aagattttct ctggctcgtc
    1151  ttcgcaacac accaccgcat gccaaatcaa atgcaaaaca attggccatt
    1201  tgctctcaac atcgtctcaa catgcgcggc cccaggtcgt caagctcctc
    1251  acgcaggcga actcactgat gagagggtcc ggctcgccct cgatacaggc
    1301  catcgcattc ttctcgcaat gttcaacgac gatgaagaaa ctctccgcta
    1351  ctaccagcgc aaaggaatcg aaacaatgtt cagaccatgc tgtttctaca
    1401  ctgaaggcgg tttactcaga aaagctacca gatacgtttc tatggtccca
    1451  ctcaacggct tatattacta caacggtgca acctcatata tcgtctcccc
    1501  gatccacact gatgctcatc ctggcattac tgcagcaatc gaatcattcg
    1551  ttgacattat ggtcttacaa gcagtattct ctttcacagg tcctaaagta
    1601  gttgctgcta aagttaatgc caaccaactc gatccagcca tgatcttcgg
    1651  ccctgcaatc gccgacggag atggtttcgt ttacgaccct ctccgcccag
    1701  cgcctccact ttccgcgttc tactccgaat tcatccacag accagccgaa
    1751  caacgcatct tccagatggc gatgagccaa atctacggtt cacatgctcc
    1801  tcttgtcatc gccaacgtca tcaactccat ccacaattgc aagacaaaga
    1851  ttgtcaacag caaattacgc gctaccttcg ttcgtcgtcc agccggcgct
    1901  cctcatctca aggccgacac agctatcatc aaccgcttcc atgatccaga
    1951  actcgcttat gctctcggaa ttctcgccga cggcatcgcc cctcttgatg
    2001  gctcacatga atacaacatt ctcgatgaac tcgattactt gttcaacggt
    2051  ggcgacatcc gcaattgctt cggcctcaac gccctcaaca ctcgtggttt
    2101  gggccaaatc gtccacatcc ggccaaaacg cgaaccagga aagagacccc
    2151  gccgcggttt ctacaccaca ctcgatggac aggttcaccc tgtcacacaa
    2201  gatgctccac tcgatgagat ttaccaatgg cgcgaccatg gaaatctcac
    2251  acgcccatat tcgtgccaca tcctcgacag tcaaggactc gaattcgccg
    2301  atgtttccaa cggacggtca cgtggaaaga tcctcgtggt cgtcaactca
    2351  ccactcaaaa catgcgctgc ctaccagggc cccagcttcg cgccaatgcc
    2401  gggcagcgcc atgtggaacg aataaaccaa tgcgggggca gcgtcatcta
    2451  tccgcgcttg agcactgctc gcgcccgcag ctcgtctgcc attgatacga
    2501  tgacgttgac cctcgcaaac caactgtgct acctatataa gtcctcggac
    2551  ttacaccgac agttggatat catgatacca caatcctact taacatttct
    2601  tgaatggcta ctcagaatca gtcccaacag agaaataaat tcaatccgtc
    2651  atttcccaag tcaagacaat cacgaaatta taacatgcac tctaagaaat
    2701  ctttccaagg aacaagagat catgctattt ccgatcaaag acataattca
    2751  agccaaccgt cgtgtaaatg cttacgcacg aaatctcctt gatgcctcac
    2801  cgcttcccga ttttgcccta caacagatgc ttctaccaaa tacagcaaat
    2851  gatgtagtgt gtgcaatctt actactcggt gaagtactat ggatgcttcg
    2901  gtgcccaatc tcaatcatcg taaacatttc acgcgcaatt tgtcgtaacg
    2951  atagcttttt gaaagattta tccgatttta ataagatgtt aggcttaact
    3001  aagattccta tagctaattg tttgacggaa ctgaacactt tacaaggtcg
    3051  aggagtaact tcaagcgatg ccaaaagaga tttgacccac cgaattgccg
    3101  atgtcaatcc tcatgaggcg aaaataagta gagagaatct caaagaagca
    3151  atcaatcaaa tatacagaga agaaatcact aagaaagaaa tccctgatac
    3201  atttagacaa catgtattta catctccatt atgggttaag aaaggagcac
    3251  accaccaccc acacttcaag tcttacgaca atcgattgga gtttgttgag
    3301  aatgtagatt tggacgaggt cttacaatca cgccccgctg tctacatcac
    3351  acaagccccg aaacttgaac atggtaagac tagattcatt tacaattgtg
    3401  acacagtcag ctacatttac tttgattaca ttttgaatta cgttgagggt
    3451  gtttggtcca ataaacatgt actcttaaac cctgattaca tgaatccagt
    3501  tatctttagt actcttaatt acgacgaata ctgcatgtta gattacactg
    3551  atttcaactc acagcattcc atcgaaagta tgaagcaagt cttttcaagt
    3601  cttttacctt ttctgccaac gtccatgcac cgtatattac agtggtgcgt
    3651  cacatcgttt gacaacatgt atatcaacaa tactcactgg aattccacac
    3701  tgccatcagg acatagagca actacattta taaactctgt cttgaacagg
    3751  gcatacttgt taccattcct acaagtctcc aatgcgtttc acacaggcga
    3801  tgatgtactt ttatgcggaa aagcagacta tggcacgctt atcaataccg
    3851  taccttatga actcaacaaa actaagcaat cattcggacc ttcagctgaa
    3901  ttcctgcgtc ttcataaaca caatgatcaa gtttccggtt atccagcacg
    3951  tgcaattagc agtctcgtaa gtggcaattg gttgtcattc gcaaatccac
    4001  tatggcaacc ttcgctactg tcaattatgc aacaattgta caccatatca
    4051  gcaagatcag gtctattacc atatattcct gtaacaatga agttagaagt
    4101  gcagcggcgt tatgatttgc gatcacgaat taccaatgga ttgttctctg
    4151  gtgacatagt cccaagcggt tgtccttgtt ataagtcaaa tgctgcttta
    4201  ctaagtgcgg tagttcccga cactgtagta aagggttccc ctaactttta
    4251  tgacttacgt acattggaca cgttaaaaca aacatcacct tggatcaatt
    4301  ctgcatccaa atacatgaac ctcttasaac ggcgccatat ggaatctgat
    4351  aacaaaaatg ttttatatag tatccaatat ttaccatcta aaatgttacc
    4401  aataattgat gttgaccctg cagatgctct tccattacaa aaacggtatc
    4451  acccacgttc tcacatcgca cacccactcc cacgagatgc tcatcttaag
    4501  gaattaagat ttgcaacgtg tcgagtgggc ccggctactg cgataagatt
    4551  aggatcgctt tggcctgcga acagaatcaa cctaatcaag ccagtctacg
    4601  tctaagtacg actgacacaa tcttacataa ctactcggct agagtaggag
    4651  agtaatatca actcttacgt c