Sequence of DPV Trichomonas vaginalis virus 1

Trichomonas vaginalis virus 1 strain TVV-T5 capsid protein (cap) gene, complete cds; and RNA-dependent RNA polymerase (pol) gene, partial cds.

ACC No: U57898

Dated: 2009-09-22 | Length: 4648 | CRC: 1143248652

                
ID   U57898; SV 1; linear; genomic RNA; STD; VRL; 4648 BP.
XX
AC   U57898;
XX
DT   14-SEP-1996 (Rel. 49, Created)
DT   22-SEP-2009 (Rel. 102, Last updated, Version 3)
XX
DE   Trichomonas vaginalis virus 1 strain TVV-T5 capsid protein (cap) gene,
DE   complete cds; and RNA-dependent RNA polymerase (pol) gene, partial cds.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 1
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4648
RX   DOI; 10.1006/viro.1996.0446.
RX   PUBMED; 8806533.
RA   Su H.M., Tai J.H.;
RT   "Genomic organization and sequence conservation in type I Trichomonas
RT   vaginalis viruses";
RL   Virology 222(2):470-473(1996).
XX
RN   [2]
RP   1-4648
RA   Tai J.-H.;
RT   ;
RL   Submitted (10-MAY-1996) to the EMBL/GenBank/DDBJ databases.
RL   Infectious Diseases, Institute of Biomedical Sciences, Academia Sinica,
RL   IBMS, Academia Sinica, Taipei, Taiwan, 11529, ROC
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .4648
FT                   /organism="Trichomonas vaginalis virus 1"
FT                   /host="Trichomonas vaginalis"
FT                   /lab_host="Trichomoans vaginalis"
FT                   /strain="TVV-T5"
FT                   /mol_type="genomic RNA"
FT                   /proviral
FT                   /db_xref="taxon:674953"
FT   5'UTR           1. .285
FT   gene            286. .2322
FT                   /gene="cap"
FT   CDS             286. .2322
FT                   /codon_start=1
FT                   /gene="cap"
FT                   /product="capsid protein"
FT                   /db_xref="UniProtKB/TrEMBL:Q98752"
FT                   /protein_id="AAC55468.1"
FT                   /translation="MEASANGLSHDDKANNSQNVGPSTLPGSDKQGGEKHENSFNSFSN
FT                   DFFFNFLRMSMNTHISDSPGVSFIGKDGAPYSSVTIQSAVGRLTHNVVASAVQLNVTAD
FT                   NVLEVDYGFGQDVSRATGTIPIPIFDGEKYKETARALAMIFSKKGMSVDVTSQTVQETL
FT                   KNSDLTIATVAAGYYTALAARHELTKAVSVAAHTIPFATALSDTFTAAPNAQRSSHVIS
FT                   SCLRCPASGNIQHDIGIGSTIWTNVIVESLSPQNMAVPNPDDISFFIPNKALPSSWWCA
FT                   IWLLNAFLHSFVAPTRIHIFITQGETYHLAPFTDSDVYEAVRFLLAMSKSSRPMPESVE
FT                   SMLYAYSTQMIIQPHSLYTEGGLIRRMIFTVPHLPAHGYFVTNSEFSRYMNIAVPDDPR
FT                   SAKVFVIGAGTGLLQIVLAYQAAFSCAGPIALHWHANDAISQGMDTVASTYLQGRYFTI
FT                   PMAVNVATNVARYTTTVRADPQYKRTLDRILPRIFGPSTDTIFEFIESAISSSWVSIDA
FT                   VRRSGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSNIAQEVDYLMNGGDLR
FT                   NCPVLRTLKAAERDETITFMCKEKVGSIFAIDGTVRVLKQYQTIDLSQLGWTSHGKVMK
FT                   PYAFRAPVIQGITICNTAYTTTAIDIVTTVFGPLRQRVGTLFE"
FT   gene            <2306. .4576
FT                   /gene="pol"
FT   CDS             <2306. .4576
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /db_xref="GOA:Q98753"
FT                   /db_xref="InterPro:IPR001795"
FT                   /db_xref="UniProtKB/TrEMBL:Q98753"
FT                   /protein_id="AAC55469.2"
FT                   /translation="VPFLSKAVRCGPVIPFVIHHFNFRRVTTTKRRRNKYVLVPGYGWV
FT                   LQDDYLVNSVKMTGENDLPPNQLPHDDDLLFTYAKILLYDYISYFPKFRHNNPDLLDHK
FT                   TELELFPLKADSAARNKANFYARTLWNDTITDKSAFKPGTYNDTVAGLLLWQQCALMWS
FT                   LPKSVINRTISGVCDALTNRTSLTLLKRISDWLKQLGLACSPIHRLFIELPTLLGRGAI
FT                   PGDADKDIKHRLAFDPSITVDVPKEQLHLLIYRLLSRNLNITKVNSFEHHLEERLLWSK
FT                   SGSHYYPDDKINELLPPQPTRKEFLDVVTTEYIKECKPQVFIRQSRKLEHGKERFIYNC
FT                   DTVSYVYFDFILKLFETGWQDSEAILSPGDYTSERLHAKISSYKYKAMLDYTDFNSQHT
FT                   IQSMRLIFETMKELLPPEATFALDWCIASFDNMQTSDGLKWMATLPSGHRATTFINTVL
FT                   NWCYTQMVGLKFDSFMCAGDDVILMSQQPISLAPILTSHFKFNPSKQSTGTRGEFLRKH
FT                   YSEAGVFAYPCRAIASLVSGNWLSQSLRENTPILVPIQNGIDRLRSRAGLLGVPWKLGL
FT                   SELIEREAIPKEVGMALLNSHAAGPGLITRDYSSFTVTPKPPKLSSTLEYTATRYGLQD
FT                   LSKHVPWKQLTTVESDKLSRQIKKISYRHCSQAKITYNCTYEVFKPRGLPTVLSGSSQP
FT                   SLSMLWWQAMLKQAIQDDSTKKIDARMFAANACTSSVSGDAFLRANASMAGVLITSLIT
FT                   SSS"
FT   3'UTR           4577. .4648
XX
SQ   Sequence 4648 BP; 1351 A; 1194 C; 898 G; 1205 T; 0 other;

u57898 Length: 4648  22-SEP-2009  Type: N  Check: 1231  ..

       1  actcaacatt ttcactccgt catgacgaac tcataacgcg gacatataac
      51  aagcgtaatg tcctcgacga ttgccatcct cgtgtgaact ccgggctccg
     101  cttcactgat gtacctctta ctaagctgga gagacttttt agtcttgaag
     151  agccgtaatg tgccctctgc gcctgggacc taatggcgct tttgctgtag
     201  gtactttata gaagaagaat gagggttctc aacatactag ttcgctggta
     251  tgtcctattc ctacgctata aagaaataaa tacccatgga ggcttctgct
     301  aatgggttat cacatgacga taaagcgaat aattcgcaaa atgttggacc
     351  ttctactctt ccggggtcag ataaacaagg aggagaaaaa catgaaaatt
     401  cttttaattc tttttctaat gatttctttt ttaacttttt acgtatgtcc
     451  atgaacactc acatttcaga cagtccaggc gtttctttca tcggaaaaga
     501  cggtgcacct tactcatcag taacaattca atcagccgta ggccgtctta
     551  cacataacgt agttgcatca gccgttcaac tcaatgtaac agcagacaac
     601  gttttagaag tcgattacgg tttcggtcag gatgtttcaa gagctaccgg
     651  aacaatccca attccaattt tcgacggcga gaaatacaag gaaactgctc
     701  gtgccttagc tatgatcttc agtaagaaag gcatgtcagt tgatgttaca
     751  tcccaaacag tacaagaaac acttaagaac tccgatctca ctatcgcgac
     801  agttgcagcc ggatattaca cagctttagc tgcacgccac gaactcacga
     851  aagctgttag tgttgcagcc cacacaattc ctttcgccac cgccttgtcc
     901  gacacattca cggcagctcc aaatgcacag cgttcaagcc acgttatttc
     951  ttcttgctta cgctgtccag cttcgggcaa tatccaacac gacatcggaa
    1001  tcggttctac catctggact aatgtcatcg tcgaaagtct ttcaccacag
    1051  aatatggcag ttccaaatcc agacgacata tcattcttca ttccgaacaa
    1101  agccctccca tcttcttggt ggtgtgcgat ttggctcctc aacgcatttc
    1151  ttcactcctt tgttgcgcca actcgtatcc acatcttcat tacacaagga
    1201  gaaacatacc acctcgctcc tttcaccgat tcggatgtct acgaggccgt
    1251  tcgtttcttg ctcgcaatgt caaagtcatc acgcccaatg ccagagagcg
    1301  tcgagagtat gttatatgca tacagcacac agatgatcat ccaaccacat
    1351  tcgctctaca cagagggagg cttgatcaga agaatgatct ttacagttcc
    1401  acaccttcca gctcatggtt acttcgtcac gaattccgaa ttctcgagat
    1451  acatgaatat cgctgttcca gacgacccgc gttctgcaaa agtcttcgtt
    1501  atcggtgcag gaacaggtct cttacaaatc gtactggctt accaagctgc
    1551  tttcagctgt gctggcccta ttgcacttca ctggcacgca aacgatgcca
    1601  tctcacaagg catggataca gttgcgagta cataccttca gggaagatac
    1651  ttcaccattc ctatggctgt caacgtcgcc acaaacgtcg ctcgatacac
    1701  tacgacagtt agagcagacc ctcaatacaa gcgtacactc gatcggatct
    1751  taccacgcat cttcggccca tcaactgaca caatattcga gttcatcgaa
    1801  tcggctatct cgtcatcttg ggtctccatc gacgctgtca gacgcagcgg
    1851  tcgcgctcga aagttcagaa cagctttcat caatcgcttt catgatccag
    1901  aattcgctta catgttcggt atcactggca acggcatcga gagaatggaa
    1951  ggtaaggtta cttcaaacat cgcccaagaa gtcgattacc tcatgaatgg
    2001  tggcgacctc cgcaactgcc ctgttctccg cacacttaag gcagcagaga
    2051  gagatgaaac tatcaccttc atgtgcaagg agaaagtcgg ttccattttc
    2101  gcgatcgatg gtactgtccg cgtactcaaa cagtatcaga ctatcgatct
    2151  ctcccaactc ggttggactt cccacggcaa ggtgatgaaa ccttacgctt
    2201  tcagagctcc agtcatccaa ggaattacca tctgcaacac agcttacaca
    2251  accacggcca tcgacattgt cacaacagtc tttggtccct tacgccaacg
    2301  tgtaggtacc ctttttgagt aaagctgtac gttgtggccc tgtaatacca
    2351  ttcgtcatac accatttcaa cttcagacgt gttacaacta ctaaacgacg
    2401  acgcaataaa tacgtacttg tccccggata tggatgggta ttacaggatg
    2451  actatttggt taattccgtc aaaatgactg gtgaaaacga tttaccccca
    2501  aaccagttac ctcatgacga tgatctttta tttacatacg caaaaatttt
    2551  actttacgac tacatatctt attttcctaa attcagacac aataatccag
    2601  acttactaga tcacaaaaca gaactagaac ttttcccact caaagctgat
    2651  tcagctgcta gaaataaagc aaacttctac gcaagaactt tatggaatga
    2701  tactatcaca gataaaagcg ctttcaaacc aggaacttat aatgatacag
    2751  ttgcaggtct gttattatgg caacagtgtg ctctcatgtg gtcattaccc
    2801  aagtcagtga tcaacagaac aattagcggt gtttgtgatg cactaaccaa
    2851  caggacttca ctcacgctat taaaacgtat ctcagattgg ctaaaacaac
    2901  ttggactggc ctgctcaccg atacatcgcc tattcatcga actccctaca
    2951  cttctaggac gcggtgcgat cccaggcgat gctgacaaag atataaagca
    3001  cagactcgct ttcgacccat caataacagt cgatgtccca aaagaacagt
    3051  tacatctact gatctacaga ctcttatcca gaaatctcaa tatcactaaa
    3101  gtcaatagtt ttgaacacca cctggaagag cgcttacttt ggtccaaatc
    3151  aggaagtcac tactaccccg acgacaagat caacgagtta cttcctccgc
    3201  aacctactag aaaggaattc ttggatgttg tcacgacaga atacattaag
    3251  gagtgcaagc ctcaagtctt catcagacag tctcgtaaac tcgaacacgg
    3301  taaggaacga ttcatctaca attgcgacac agtctcatac gtctattttg
    3351  attttatctt gaagctcttt gagacaggat ggcaagatag cgaagcaata
    3401  ttgtcgccag gcgactacac tagtgaacgt ctccacgcta agatttccag
    3451  ttataagtat aaagccatgt tagactacac agacttcaac tcacaacata
    3501  caatccaaag catgcggttg atcttcgaaa ccatgaaaga gttactccct
    3551  ccagaagcga cttttgctct cgattggtgt atcgcctcat ttgacaacat
    3601  gcaaacatca gacggtctca aatggatggc tactctccct agtggacacc
    3651  gtgccactac attcattaat actgtcctaa attggtgtta cactcagatg
    3701  gtcggtctca aattcgatag tttcatgtgc gctggtgatg atgttatcct
    3751  aatgtcccaa caacccatat cactagcacc aattcttaca tcacatttta
    3801  agttcaatcc aagcaagcaa agcacgggta ctagaggtga attcttacgc
    3851  aagcactata gcgaagcagg tgtcttcgca tacccatgtc gagcgatcgc
    3901  tagcttagtg agcggaaatt ggctaagtca atcactaaga gagaacaccc
    3951  caatcctggt ccctatacaa aacggaatcg atagattacg cagtagagca
    4001  ggtctactcg gagttccttg gaaactaggt ctctcagagc tcattgagag
    4051  agaggccatt cctaaggaag tcggcatggc tctattgaat tcacacgcag
    4101  cagggcccgg tctgattact cgagactaca gttctttcac agttacgccc
    4151  aaacccccca agttaagcag cacactcgaa tacaccgcaa cccgttacgg
    4201  tcttcaagat ttatccaaac acgtcccatg gaaacaactc acaacagttg
    4251  aatctgataa gttaagtcga caaattaaga aaataagtta caggcattgc
    4301  agccaggcga agataactta caattgtacc tacgaagttt ttaaaccacg
    4351  tgggctccct acagtgttat ccggttccag ccaaccatcg ttgtcgatgc
    4401  tatggtggca agcaatgctc aagcaagcaa tacaagatga ctctacgaag
    4451  aagatagatg cacgaatgtt tgctgcgaac gcatgtacta gctccgttag
    4501  cggagatgcg ttcttgcgag caaacgccag tatggctggt gtcctaatca
    4551  ctagcctaat cacttcttca tcataacgta cagctacgaa aaaagtctct
    4601  atagttgctc aagactacaa tgagccagat ggccccgcta taccttcg