Sequence of DPV Trichomonas vaginalis virus 1

Trichomonas vaginalis virus 1 strain TVV1-OC3, complete genome.

ACC No: HQ607517

Dated: 2011-05-08 | Length: 4684 | CRC: 745789571

                
ID   HQ607517; SV 1; linear; genomic RNA; STD; VRL; 4684 BP.
XX
AC   HQ607517;
XX
DT   08-MAY-2011 (Rel. 108, Created)
DT   08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE   Trichomonas vaginalis virus 1 strain TVV1-OC3, complete genome.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 1
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4684
RX   DOI; 10.1128/JVI.00220-11.
RX   PUBMED; 21345965.
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA   Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA   Singh B.N., Fichorova R.N., Nibert M.L.;
RT   "Clinical Isolates of Trichomonas vaginalis Concurrently Infected by
RT   Strains of Up to Four Trichomonasvirus Species (Family Totiviridae)";
RL   J. Virol. 85(9):4258-4270(2011).
XX
RN   [2]
RP   1-4684
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA   Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT   ;
RL   Submitted (12-NOV-2010) to the EMBL/GenBank/DDBJ databases.
RL   Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL   Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .4684
FT                   /organism="Trichomonas vaginalis virus 1"
FT                   /host="Trichomonas vaginalis"
FT                   /strain="TVV1-OC3"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="Nov-2009"
FT                   /db_xref="taxon:674953"
FT   gene            327. .4617
FT                   /gene="pol"
FT   CDS             join(327. .2353,2355. .4617)
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /protein_id="AED99816.1"
FT                   /translation="MEASANGLSHDDNATRSQNVGPSTLPGSDKQGGEKHENSFNSFSY
FT                   DFFFNFLRMSAHTHISDSPGVSFVGKDGTPYSTATIQSAVGRLTHNVIASAVQLNVTAD
FT                   NVLEVDYGFGQDVSRSAGTITIPIFDGEKYKETARALAAIFSKKGMAVDVTSQTVQETL
FT                   KNSDLTIATVAAGYYTALAARHELTKEVSMAAHTIPFVTALSDTFSAAPDAQRSSHVIS
FT                   SCLRCPHSNNIQHDIGIGTDIWNNVSVESLSPQNMAVPNPNDVSFFIPNKALPSSWWCA
FT                   IWLLNAFIHSFVAPTRFHIFIAPGETYHLAPFTDADIYEAIPIMLAMSKAARPVPESVE
FT                   SMLYAYGTQMIIQPHSLYTEGGLIRKMIFTVPHLPAHGYFVTNSEYSRYMNIAVPNDPR
FT                   SAKDFIIGAGTGLLQITLAYQAAFSCAGPIALHWHANDAISQGMDTIAETYLQGRYFTV
FT                   PMAVAVATNVAQYTTLVRADPQYRHTLDRILPRIFGPSTDTVFNFIESAISSSWVSIDA
FT                   RRRNGRTRKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSTISQEVDYLLNGGDLR
FT                   NCPVLRTLKAAERDETITFMCKEKAGTLIAMDGTVRFFKRFETIDLTQLGWTSHGKVMK
FT                   PYAFRAPLINGITICNTAYTTTAIDIVTTVFGPLRQRVGSLLSKAVRCGPVIPAVKHHF
FT                   NFKNVIVATRNNSEYTFIPGYGWVLQDDYLLNAVKMTGEGDIPPDQLPYDDDLLLSYAK
FT                   ILLYDYITHFPKYRYNNPKILTQTTELQLFPLKDDSAARNKVNFYARLLWNEATSDKKA
FT                   FKPGTYNDTVAGLLMWQQCALMWSVPQSIINRVISGVCDALTDRTSLALLKRISDWLKQ
FT                   LGLAYSPIHRLFIELPTLLGRGAIPGNAILDIKHRLTFDPSITVDVPTDRLHRLIYRLL
FT                   SRNLHITTASSFEDHLEERLLWSKTGSHYYPDDEVNKLLPHRPTRKEFLDIVTVDYIKR
FT                   CKPQVFIRQSRKLEHGKERFIYNCDTISYVYFDYILKLFETGWQDGEAILSPGDYTNDR
FT                   LHAKISSYKYKAMLDYTDFNSQHTIQSMRLIFETMKELLPPEMSFALDWCIASFDNMKT
FT                   SDGHKWVATLPSGHRATTFINTVLNWCYTQMVGLKFDSFMCAGDDVILMSQEPISLAPI
FT                   LTSNFKFNPSKQSTGTRGEFLRKHYTEEGVFAYPCRAIASLVSGNWLSDTLRDNTPMVV
FT                   PIQNGVDRLRSRAGLLGVPWILGLSELIEREDLPKEVGMALLNSHAAGPGLITRDYSSF
FT                   TVTPKPPTITSSLEYTATRHGVQDLSKHVPWKQLTTQECNRLGQQIKKMSHRHCSQAKI
FT                   TYKCVYEVFKPNRLPTVLSDVSQPSLSMAWWQAMLKEAMQDYTVKKIDAQMFASNACTN
FT                   SVSGDAFLRATPKMAGVLITSLISSSS"
FT   gene            327. .2363
FT                   /gene="cap"
FT   CDS             327. .2363
FT                   /codon_start=1
FT                   /gene="cap"
FT                   /product="capsid protein"
FT                   /protein_id="AED99815.1"
FT                   /translation="MEASANGLSHDDNATRSQNVGPSTLPGSDKQGGEKHENSFNSFSY
FT                   DFFFNFLRMSAHTHISDSPGVSFVGKDGTPYSTATIQSAVGRLTHNVIASAVQLNVTAD
FT                   NVLEVDYGFGQDVSRSAGTITIPIFDGEKYKETARALAAIFSKKGMAVDVTSQTVQETL
FT                   KNSDLTIATVAAGYYTALAARHELTKEVSMAAHTIPFVTALSDTFSAAPDAQRSSHVIS
FT                   SCLRCPHSNNIQHDIGIGTDIWNNVSVESLSPQNMAVPNPNDVSFFIPNKALPSSWWCA
FT                   IWLLNAFIHSFVAPTRFHIFIAPGETYHLAPFTDADIYEAIPIMLAMSKAARPVPESVE
FT                   SMLYAYGTQMIIQPHSLYTEGGLIRKMIFTVPHLPAHGYFVTNSEYSRYMNIAVPNDPR
FT                   SAKDFIIGAGTGLLQITLAYQAAFSCAGPIALHWHANDAISQGMDTIAETYLQGRYFTV
FT                   PMAVAVATNVAQYTTLVRADPQYRHTLDRILPRIFGPSTDTVFNFIESAISSSWVSIDA
FT                   RRRNGRTRKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSTISQEVDYLLNGGDLR
FT                   NCPVLRTLKAAERDETITFMCKEKAGTLIAMDGTVRFFKRFETIDLTQLGWTSHGKVMK
FT                   PYAFRAPLINGITICNTAYTTTAIDIVTTVFGPLRQRVGSLFE"
XX
SQ   Sequence 4684 BP; 1365 A; 1190 C; 939 G; 1190 T; 0 other;

hq607517 Length: 4684  08-MAY-2011  Type: N  Check: 9072  ..

       1  gcaaaaagag ggggtcatcc acttccctct ttttgcactc aacattttca
      51  cctcatcatg acgaatccgt gacgcggaca tgataacaag cgtactgtcc
     101  tcgacgattg ccatcctcgt gtgaattccg ggctccgctt gcactgatgg
     151  tacctcttac gaaacttgga gagacttcgg cctcaaagag cggtaatgtg
     201  ccctctgcgc ctgggaccta atggtgtttt ctgctgtagg tacttcagta
     251  gtaggaaggt gaagggttaa acatcctggt tcgctaggtt tgtccttgcc
     301  ttatctctgc tgataattga atacccatgg aggcttctgc taatgggtta
     351  tcacatgatg ataatgcgac aagatcgcaa aatgttggac cttctactct
     401  tccggggtca gataaacaag gaggagaaaa acacgaaaat tcttttaatt
     451  ctttttctta tgatttcttt tttaactttt tacgtatgtc agcacacact
     501  cacatttcag acagtccagg tgtttctttc gtaggtaaag acggcacacc
     551  ttattcgaca gctacaatcc aatctgctgt aggccgtctc acacataacg
     601  taatcgcatc agccgtccag ctcaacgtta cagccgacaa tgtcttggaa
     651  gtagattatg gttttggcca agacgtctca agatctgcag gaaccatcac
     701  catcccaatt tttgatggtg agaagtacaa agaaacagcg cgcgctttag
     751  ctgcgatctt cagcaagaaa ggtatggcag tcgatgtcac gtcacagaca
     801  gtccaagaaa ccctcaagaa ttccgatctc acaattgcta cggtagccgc
     851  aggatattac actgctttag ctgcccgtca tgaactcacg aaggaagtaa
     901  gcatggcggc tcacactatc ccatttgtta ccgcattgtc cgacacgttc
     951  tcagctgcac cagatgccca acgttcaagc catgttattt cctcttgctt
    1001  gcgttgccct cattcgaaca atatccaaca cgacatcgga atcggtacag
    1051  acatctggaa caacgtctct gtcgaaagtc tctcaccgca aaatatggca
    1101  gttccgaatc ccaacgacgt atcattcttc attccgaaca aagctctccc
    1151  atcctcttgg tggtgtgcca tctggcttct taatgccttc atccacagct
    1201  tcgtcgcgcc gacacgcttc catatcttca tcgcaccagg cgaaacatac
    1251  catcttgcac cattcacaga tgccgatatt tacgaggcta tcccaattat
    1301  gctcgcaatg tcgaaggcag ctcgcccagt tccagaaagt gtcgaaagca
    1351  tgctttacgc atatggcact cagatgatta tccagccaca ctcgctctac
    1401  acagaaggtg gactcatcag aaaaatgata ttcacagttc cacaccttcc
    1451  agcccacggc tattttgtta caaattccga atactcgaga tacatgaaca
    1501  tcgcagttcc taacgatcct cgctctgcaa aggacttcat catcggtgca
    1551  ggaacaggtc tcttacagat cacactcgct taccaggctg ctttcagctg
    1601  cgctggccct attgcacttc attggcacgc aaatgacgcc atctcccaag
    1651  gcatggatac gatcgcggaa acatacctcc aaggaaggta tttcacagtt
    1701  cctatggcag tcgcagttgc tacaaacgtt gctcaataca cgacgctggt
    1751  cagagccgat ccccaataca gacacacact cgaccggatc ttaccacgca
    1801  tattcggacc gtcaacagat acagtcttca atttcatcga gtccgcaatc
    1851  tcatcatctt gggtatcaat agacgcccgc cgacgcaacg gccgcacaag
    1901  aaagttcaga acagctttca tcaaccgctt ccacgatcca gaattcgctt
    1951  acatgttcgg tatcacaggc aacggtatcg aaagaatgga aggcaaagtc
    2001  acctccacga tcagccaaga ggtcgattac ctcttaaacg gcggtgacct
    2051  ccgcaattgc ccagtcctcc gcacactcaa ggcagcagaa agagacgaaa
    2101  caatcacgtt catgtgcaaa gaaaaagccg gtacactcat cgccatggac
    2151  ggaacagtcc gctttttcaa gcggttcgag acgatcgatc tcactcagct
    2201  cggatggaca tcccacggta aggtcatgaa accatacgca ttcagagctc
    2251  cacttatcaa cggaatcacg atctgcaaca cagcctacac aacgacagcc
    2301  atcgacatcg ttactacagt ctttggtcct ttacgtcaga gggtaggttc
    2351  cctttttgag taaggctgta cgttgtggcc ctgtaatacc agccgtcaag
    2401  catcatttca acttcaagaa cgttatagta gcaacacgaa ataattccga
    2451  atacacgttc attcccggtt acggttgggt attacaggat gactatttat
    2501  tgaatgccgt aaagatgact ggcgaaggtg atatacctcc tgatcagtta
    2551  ccttacgatg atgatctttt actttcatac gcaaaaattt tactttacga
    2601  ttacataact cattttccta aatacagata caacaatcca aaaatattga
    2651  cacagacaac agaactacaa ctttttccac tcaaagacga ctcagctgct
    2701  agaaataaag tcaacttcta cgctagatta ttatggaacg aagcaacctc
    2751  agacaagaaa gctttcaaac caggaactta caatgatact gtagcaggtt
    2801  tactgatgtg gcaacaatgt gctctcatgt ggtccgtacc tcagtccatt
    2851  atcaacagag taattagcgg tgtttgtgat gcattaaccg acaggacgtc
    2901  actcgcgcta ttgaaacgta tctcagattg gctgaagcaa cttggactag
    2951  cctactcacc gatacatcgc cttttcatag agctcccaac actactagga
    3001  cgtggagcta tcccaggcaa tgcaattctg gatattaagc acagactcac
    3051  attcgaccca tcaattacag tagacgtccc gacggaccgg ctacatagat
    3101  tgatttacag acttctatct cgcaatctcc atatcacgac ggccagtagt
    3151  ttcgaggatc acttagaaga aagactactc tggtctaaaa caggaagcca
    3201  ctattatccc gacgacgaag tcaataagtt actccctcat cgtcctacaa
    3251  gaaaagagtt cctagatata gttacagtag actacattaa gcgatgcaaa
    3301  ccccaagttt ttatcagaca atcacgcaag ttggaacacg gcaaggaacg
    3351  attcatctac aattgcgaca cgatttcata cgtctatttt gattacatcc
    3401  taaagctctt cgagacagga tggcaagatg gcgaggcaat actatctcca
    3451  ggtgattata ccaatgatcg tctccacgcc aaaatatcta gttacaaata
    3501  caaagcaatg ttagattaca cagatttcaa ttcacaacac acgatccaaa
    3551  gcatgcgctt aatttttgaa acaatgaaag agctactccc accagaaatg
    3601  tcctttgcac tagactggtg tatagcatcc tttgataaca tgaaaacgtc
    3651  cgacggtcac aaatgggttg caacccttcc tagcggacat cgtgctacaa
    3701  cattcattaa cacagtatta aattggtgtt acacacagat ggtcggtctt
    3751  aagttcgata gttttatgtg cgctggtgat gacgtcatct tgatgtccca
    3801  agaaccaatt tcactagccc caatacttac ctctaatttc aaatttaatc
    3851  ccagcaaaca aagcacaggt actagaggtg agtttctacg taaacattat
    3901  acggaagaag gtgtttttgc atatccatgt cgagcaattg ccagtttagt
    3951  aagtggaaat tggttgagcg atacactaag agataacacc ccaatggtgg
    4001  tccctataca gaatggagtc gatagattac gcagtagagc aggtttactc
    4051  ggagtccctt ggattttagg cctctcagag ctcattgaga gagaggactt
    4101  acccaaggag gtcggcatgg ctttactaaa ttcacacgca gcgggaccag
    4151  gtctcatcac acgcgattac agttccttca cagttacgcc gaaaccacct
    4201  acgataacta gttcacttga atacactgca actcgtcatg gtgtccagga
    4251  cttgtcaaaa cacgtaccat ggaaacagct tacaacacaa gaatgcaata
    4301  ggttaggtca acaaattaag aaaatgagtc acaggcattg tagccaggct
    4351  aagataactt acaaatgtgt ctatgaggtt ttcaaaccca ataggctccc
    4401  cacggtgtta tctgacgtca gccagccatc gttgtcgatg gcgtggtggc
    4451  aggcaatgct taaggaagca atgcaagatt acactgtcaa gaagatagat
    4501  gctcaaatgt tcgcgagtaa cgcatgtaca aactccgtta gcggggatgc
    4551  gtttttacga gcgacaccca agatggctgg cgtcttaatc actagcctca
    4601  tctcttcttc ttcataacgt acagcaaaaa gtctctgtag ttgctcaaga
    4651  cttataatga gccagttggt ctcagtatac cttc