Sequence of DPV Trichomonas vaginalis virus 1

Trichomonas vaginalis virus 1 strain T1 capsid protein gene, complete cds; and RNA-dependent RNA polymerase (pol) gene, partial cds.

ACC No: U08999

Dated: 2009-09-23 | Length: 4647 | CRC: 71970221

                
ID   U08999; SV 1; linear; genomic RNA; STD; VRL; 4647 BP.
XX
AC   U08999;
XX
DT   09-APR-1995 (Rel. 43, Created)
DT   23-SEP-2009 (Rel. 102, Last updated, Version 4)
XX
DE   Trichomonas vaginalis virus 1 strain T1 capsid protein gene, complete cds;
DE   and RNA-dependent RNA polymerase (pol) gene, partial cds.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 1
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4647
RX   DOI; 10.1016/S0042-6822(95)80008-5.
RX   PUBMED; 7831841.
RA   Tai J.H., Ip C.F.;
RT   "The cDNA sequence of Trichomonas vaginalis virus-T1 double-stranded RNA";
RL   Virology 206(1):773-776(1995).
XX
RN   [2]
RP   1-4647
RA   Tai J.-H.;
RT   ;
RL   Submitted (19-APR-1994) to the EMBL/GenBank/DDBJ databases.
RL   Jung-Hsiang Tai, Division of Infectious Diseases, Institute of Biomedical
RL   Sciences, Academia Sinica, Rm. 414, Taipei, Taiwan 115, ROC
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .4647
FT                   /organism="Trichomonas vaginalis virus 1"
FT                   /strain="T1"
FT                   /mol_type="genomic RNA"
FT                   /db_xref="taxon:674953"
FT   5'UTR           1. .287
FT   CDS             288. .2324
FT                   /codon_start=1
FT                   /product="capsid protein"
FT                   /db_xref="UniProtKB/TrEMBL:Q90154"
FT                   /protein_id="AAA62867.1"
FT                   /translation="MEASANGLSHDDNANKSQNVGPSTLPRSDKQGGEKHENSFNSFSN
FT                   DFFFNFLRMSTNTHISDSPGVSFVAKDGTPYSSLTIPSGVGRLTHNVVASAVQLNITAS
FT                   NTLEVDYGFGQDVSRTTGTIPIPIFDGEKYKETARALAAIFSKKGMSVDVTSQTVQETL
FT                   KNSDLTIATVAAGYYTALAARHELTKDVSEAAHTIPFVTALSDTFSAALNAQRTSHVIS
FT                   SCLRCPNSRNAQRDIVIGTVLWNNVFVESLSEHNMAVPNPNDISFFIPNKALSSSWWCA
FT                   IWLLNAFLHSFIAPTRIHIFITQGETYHLAPFTDSDVYEAVRFLLAMSKSSRPMPESVE
FT                   SMLYAYGTQMIIQPHSLYTEGGLIRRMIFTVPHLPAHGYFVTNSEFSRYMNIAVPDDPR
FT                   SAKDFVIGAGTGLLQIVLAYQAAFSCAGPIALHWHANDAISQGMDRVASIYLEGRYFTI
FT                   PMAVNVATNVAQYTTMVRADPEYRHTLDRILPRIFGPSTDTVFDFIESAITSSWVSIDA
FT                   RKRNGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSNIAQEVDYLMNGGDLR
FT                   NCPILRTLKAAEREETVTFMCKEKVGSLYAIDGTVRVFKRFETIDLAQLGWTSHGKVMK
FT                   PYAFRAPVIQGMTICSTAYTSTAIDIITTVFGPLRLRVGSLFE"
FT   misc_signal     2301. .2350
FT                   /function="ribosomal frameshifting"
FT   gene            <2308. .4578
FT                   /gene="pol"
FT   CDS             <2308. .4578
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /db_xref="GOA:Q90155"
FT                   /db_xref="InterPro:IPR001795"
FT                   /db_xref="UniProtKB/TrEMBL:Q90155"
FT                   /protein_id="AAA62868.1"
FT                   /translation="DPFLSKAVRCGPVIPSVKHHFNINYLSILKHNGNEYTFVPGYGWV
FT                   LQDDYLLNAVKMVGEGDLPPNQLPYDDDLLFTYAKILLYDYISHFPEFRHKNPRLLTSE
FT                   TELQLFPLKENSAARTKANFYARTLWNETTSDKSAFKPGTYNDTVAGLLMWQQCALMWS
FT                   LPKLIINKIISGVCDALTEKVSLTLLKRISDWLKQLGLAYSPIFRLFIELPTLLGRGAI
FT                   PGDAALDMKHRLTYNPLMTVDVPKTQLHDLIYRLLSRNYNNTKISSFEHHLEERLLWSR
FT                   SGSHYYPDEQIDQLLPPQPTRKEFLDIVTIDYIKQCKPQVFIRQSRKLEHGKERFIYNC
FT                   DTISYVYFDFVLKLFESGWQDSEAILSPGDYSSECLHAKISGYKYKAMLDYTDFNSQHT
FT                   IQSMRLIFETMKELLPPEASFALDWCIASFDNMQTSDGRKWTATLPSGHRATTFINTVL
FT                   NWCYTQMVGLKFDSFMCAGDDVILMSQEPISLAPILKSQFKFNPSKQSTGTRGEFLRKH
FT                   YTEAGVFAYPCRAIASLVSGNWLSESLRDNTPILVPIQNGIDRLRSRAGLLGVPWKLGL
FT                   SELIEREAIPKDVSMALLNSHAAGPGLITRDYSSFTVTPTPPKLHSSLEYTATRHGLQD
FT                   LCKHVPWKQLTANECNKLGQQIKKMSHRHCSQTKITYKCVYEVFKPSGLPTVLSEVSQS
FT                   ALSLVWWQAMLKEAMQDYSTKKKDAHMYACNACTSSVSGDAFLRATSKMAGVLITSLIS
FT                   SSS"
FT   3'UTR           4576. .4647
XX
SQ   Sequence 4647 BP; 1348 A; 1156 C; 926 G; 1217 T; 0 other;

u08999 Length: 4647  23-SEP-2009  Type: N  Check: 9662  ..

       1  actcaacatt gctacttcat catgacgaac ccgtgacgcg gacacgtaac
      51  aagcgtagtg tcctcgacga ttgccatcct cgtgtgaatt ccgggctccg
     101  cttgcactga tggtacctcc tacgaaactt ggagaggctt cggtctcgaa
     151  gagcggtaat gtaccctctg cgcctgggac ctaatcgcgt tttgctgtag
     201  gtaattcagt agatggaagg attaagggtc aaacatcttg gttcgccaag
     251  tatgtcctta ccagtactct actgagcttg aatacccatg gaggcttctg
     301  ctaatgggtt atcacatgat gataatgcga ataaatcgca aaatgttgga
     351  ccttctactc ttccgaggtc agataaacaa ggaggagaaa aacacgaaaa
     401  ttcttttaat tctttttcta atgatttctt ttttaacttc ttacgtatgt
     451  caaccaacac gcacatctca gacagtccag gcgtttcttt cgttgctaaa
     501  gatggtacac catacagttc actcacaatt ccttcaggtg tcggtcgtct
     551  tactcacaat gtagttgcat ctgccgtcca gctcaatatt acggccagta
     601  acacattgga agtagactac ggcttcggcc aagatgtttc aagaaccaca
     651  ggcactatcc caatccctat ctttgatggt gaaaaataca aagaaacggc
     701  ccgcgcatta gccgctatct tcagcaagaa aggcatgtca gtcgacgtca
     751  cctcacagac tgtccaagaa actctcaaga actcagacct caccatcgcc
     801  actgtcgcag ctggatatta cacagcctta gctgcgcgtc acgaacttac
     851  gaaagacgta agcgaggctg cccacaccat tccattcgtt accgctttat
     901  cggatacatt ctcagcagca ctaaatgcac aacgtacaag ccatgtcata
     951  tcttcttgct tacgttgtcc aaattccagg aatgctcaac gtgacatcgt
    1001  aatcggtacg gttttatgga ataacgtttt tgttgagagc ctctccgaac
    1051  acaacatggc ggttcccaat ccaaacgaca tatcattttt cattccgaac
    1101  aaagctctct catcttcttg gtggtgcgct atttggctcc tcaatgcatt
    1151  tcttcacagc tttatcgcac caactcgtat ccacatcttc attacacaag
    1201  gagaaacata ccacctcgct cctttcaccg attcggatgt ctacgaggcc
    1251  gttcgtttct tgctcgcaat gtcaaagtca tcacgcccaa tgccagagag
    1301  cgtcgagagt atgttatatg catacggcac acagatgatc atccaaccac
    1351  attcgctcta cacagaggga ggcttgatca gaagaatgat ctttacagtt
    1401  ccacaccttc cagctcatgg ttacttcgtc acgaattccg aattctcgag
    1451  atacatgaat atcgctgttc cagacgaccc gcgttctgca aaagacttcg
    1501  ttatcggtgc aggaacaggt ctcttacaaa tcgtactggc ttaccaagct
    1551  gctttcagct gtgctggacc tattgctctt cactggcatg caaacgatgc
    1601  tatctcccaa ggtatggaca gagtcgcaag catctacctt gaaggaagat
    1651  acttcaccat cccaatggca gttaacgttg ctactaatgt cgctcaatac
    1701  actacaatgg tcagagccga tcctgaatac cgtcacacac ttgaccggat
    1751  cttgcctcgc atattcggac catcgactga cacagtcttc gatttcatcg
    1801  aatcagcaat tacgtcatct tgggtatcca ttgatgcccg caaacgcaac
    1851  ggtcgtgcca gaaagttcag aacagctttc atcaaccgtt tccacgaccc
    1901  agaattcgca tacatgttcg gtatcaccgg caacggtatc gagagaatgg
    1951  aaggaaaagt tacttccaac atcgcccaag aagtcgatta tctcatgaac
    2001  ggcggcgacc ttcgcaattg cccaattctc cgcacactta aggcagcaga
    2051  gagagaagaa acagtcactt ttatgtgcaa ggaaaaggtc ggctcactct
    2101  acgccatcga cggaacagtt cgcgtattca aacggttcga aacaatcgat
    2151  cttgcccagc ttggctggac ttcacatggt aaggtgatga aaccttatgc
    2201  atttcgcgct ccagtcattc aaggaatgac catctgcagt acagcgtaca
    2251  catcaacggc catcgacatc atcacaacag tctttggtcc cttacgcctc
    2301  cgcgtaggat ccctttttga gtaaagctgt acgttgtggc cccgtaatac
    2351  catccgtcaa gcatcacttc aatatcaatt accttagtat tctcaaacac
    2401  aatggtaacg aatacacttt tgtcccagga tacggatggg tattacagga
    2451  tgattatttg ttgaatgccg tcaaaatggt tggagaaggt gatctacccc
    2501  ccaatcaatt accttatgac gatgatcttt tatttacata cgcaaaaatt
    2551  ttactttacg attacatatc tcattttcca gaattcagac acaagaatcc
    2601  acgcttatta acaagtgaaa cagaactaca actcttcccg ctcaaggaaa
    2651  actcagctgc caggactaaa gcaaatttct acgctaggac actatggaac
    2701  gaaacaactt cggacaagtc agctttcaaa ccaggaactt acaatgacac
    2751  agtcgcaggt ctattaatgt ggcaacaatg tgctttgatg tggtctctac
    2801  ccaagttaat tatcaacaag attattagcg gtgtttgtga tgcattaacc
    2851  gaaaaggtct cactcacgct attaaaacgg atttctgatt ggttaaaaca
    2901  actcgggtta gcctactcac ctatattccg tcttttcatt gaactcccta
    2951  ctctattagg acgcggagca atcccaggcg atgctgcact agacatgaag
    3001  cacagattaa cttacaaccc attgatgaca gtcgatgttc caaagacaca
    3051  actacacgac ttaatttaca gacttctatc acgtaactac aacaatacaa
    3101  aaattagcag tttcgagcac cacctagaag aacgtttact ttggtcaagg
    3151  tctggaagtc actactaccc tgatgaacaa atcgatcagt tacttccccc
    3201  acaacctacc agaaaagagt tcttagatat agtaacaata gactacatca
    3251  aacaatgcaa acctcaagtt ttcatcagac agtcacgcaa actagagcac
    3301  ggcaaggagc gattcattta caattgtgac acgatctcat atgtctattt
    3351  tgattttgtc ctgaagctct tcgagtcagg atggcaagat agtgaagcaa
    3401  tactgtcacc aggcgattac tcaagtgaat gtctccatgc taaaatctct
    3451  ggttataagt acaaagctat gttggactac acagatttca attcgcaaca
    3501  cacaatacaa agcatgcgtt tgatcttcga aaccatgaaa gagctactcc
    3551  cacctgaagc atcttttgct cttgactggt gtatcgcctc gtttgacaac
    3601  atgcaaacgt ctgatggtcg caagtggacc gccactctcc caagtggaca
    3651  ccgcgccacg acattcatta acaccgttct aaattggtgt tacacacaga
    3701  tggttggttt aaagtttgat agtttcatgt gcgctggtga tgacgtcatc
    3751  ctgatgtcgc aagagcctat atcactagcc ccaatcctaa aatcacagtt
    3801  caagttcaat cctagcaagc agagtactgg tacaagaggt gaattcttac
    3851  gtaaacacta taccgaagca ggtgtcttcg cgtatccatg tcgagcaatc
    3901  gcaagcttgg tgagtggaaa ttggttaagc gagtcattga gagacaacac
    3951  cccaatttta gtcccgatac agaacggaat cgatagatta cgtagtagag
    4001  caggtttact cggagttcct tggaagttag gcctctctga gctcattgag
    4051  agagaggcta ttcctaagga cgttagcatg gccctactaa attctcacgc
    4101  agcaggaccg ggactaatca ctcgtgacta cagttctttc acagttacac
    4151  cgactccgcc caagctacat agttcgttag aatacactgc gacccgacac
    4201  ggtctccaag atttatgtaa gcacgtgcca tggaaacaac tcacagcaaa
    4251  tgagtgcaat aagttaggac agcaaattaa gaaaatgagc cacaggcatt
    4301  gtagccagac aaagataacc tacaaatgtg tctacgaagt ttttaaacct
    4351  agtggacttc ctacggtgtt atccgaggtc agccagtcag cgttgtcgct
    4401  ggtgtggtgg caagcaatgc ttaaggaagc aatgcaggac tactctacaa
    4451  agaagaagga tgcacatatg tacgcttgta acgcatgtac aagctccgtt
    4501  agcggagatg cgtttttacg agcgacatca aaaatggctg gtgttttaat
    4551  cactagcttg atttcttctt cttcataacg tacagtagaa aagtctctag
    4601  agttgctcaa gacttataat gagccagttt ggtctcacta taccttc