Sequence of DPV Trichomonas vaginalis virus 1
Trichomonas vaginalis virus 1 strain T1 capsid protein gene, complete cds; and RNA-dependent RNA polymerase (pol) gene, partial cds.
ACC No: U08999
Dated: 2009-09-23 | Length: 4647 | CRC: 71970221
ID U08999; SV 1; linear; genomic RNA; STD; VRL; 4647 BP. XX AC U08999; XX DT 09-APR-1995 (Rel. 43, Created) DT 23-SEP-2009 (Rel. 102, Last updated, Version 4) XX DE Trichomonas vaginalis virus 1 strain T1 capsid protein gene, complete cds; DE and RNA-dependent RNA polymerase (pol) gene, partial cds. XX KW . XX OS Trichomonas vaginalis virus 1 OC Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus. XX RN [1] RP 1-4647 RX DOI; 10.1016/S0042-6822(95)80008-5. RX PUBMED; 7831841. RA Tai J.H., Ip C.F.; RT "The cDNA sequence of Trichomonas vaginalis virus-T1 double-stranded RNA"; RL Virology 206(1):773-776(1995). XX RN [2] RP 1-4647 RA Tai J.-H.; RT ; RL Submitted (19-APR-1994) to the EMBL/GenBank/DDBJ databases. RL Jung-Hsiang Tai, Division of Infectious Diseases, Institute of Biomedical RL Sciences, Academia Sinica, Rm. 414, Taipei, Taiwan 115, ROC XX FH Key Location/Qualifiers FH FT source 1. .4647 FT /organism="Trichomonas vaginalis virus 1" FT /strain="T1" FT /mol_type="genomic RNA" FT /db_xref="taxon:674953" FT 5'UTR 1. .287 FT CDS 288. .2324 FT /codon_start=1 FT /product="capsid protein" FT /db_xref="UniProtKB/TrEMBL:Q90154" FT /protein_id="AAA62867.1" FT /translation="MEASANGLSHDDNANKSQNVGPSTLPRSDKQGGEKHENSFNSFSN FT DFFFNFLRMSTNTHISDSPGVSFVAKDGTPYSSLTIPSGVGRLTHNVVASAVQLNITAS FT NTLEVDYGFGQDVSRTTGTIPIPIFDGEKYKETARALAAIFSKKGMSVDVTSQTVQETL FT KNSDLTIATVAAGYYTALAARHELTKDVSEAAHTIPFVTALSDTFSAALNAQRTSHVIS FT SCLRCPNSRNAQRDIVIGTVLWNNVFVESLSEHNMAVPNPNDISFFIPNKALSSSWWCA FT IWLLNAFLHSFIAPTRIHIFITQGETYHLAPFTDSDVYEAVRFLLAMSKSSRPMPESVE FT SMLYAYGTQMIIQPHSLYTEGGLIRRMIFTVPHLPAHGYFVTNSEFSRYMNIAVPDDPR FT SAKDFVIGAGTGLLQIVLAYQAAFSCAGPIALHWHANDAISQGMDRVASIYLEGRYFTI FT PMAVNVATNVAQYTTMVRADPEYRHTLDRILPRIFGPSTDTVFDFIESAITSSWVSIDA FT RKRNGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSNIAQEVDYLMNGGDLR FT NCPILRTLKAAEREETVTFMCKEKVGSLYAIDGTVRVFKRFETIDLAQLGWTSHGKVMK FT PYAFRAPVIQGMTICSTAYTSTAIDIITTVFGPLRLRVGSLFE" FT misc_signal 2301. .2350 FT /function="ribosomal frameshifting" FT gene <2308. .4578 FT /gene="pol" FT CDS <2308. .4578 FT /codon_start=1 FT /gene="pol" FT /product="RNA-dependent RNA polymerase" FT /db_xref="GOA:Q90155" FT /db_xref="InterPro:IPR001795" FT /db_xref="UniProtKB/TrEMBL:Q90155" FT /protein_id="AAA62868.1" FT /translation="DPFLSKAVRCGPVIPSVKHHFNINYLSILKHNGNEYTFVPGYGWV FT LQDDYLLNAVKMVGEGDLPPNQLPYDDDLLFTYAKILLYDYISHFPEFRHKNPRLLTSE FT TELQLFPLKENSAARTKANFYARTLWNETTSDKSAFKPGTYNDTVAGLLMWQQCALMWS FT LPKLIINKIISGVCDALTEKVSLTLLKRISDWLKQLGLAYSPIFRLFIELPTLLGRGAI FT PGDAALDMKHRLTYNPLMTVDVPKTQLHDLIYRLLSRNYNNTKISSFEHHLEERLLWSR FT SGSHYYPDEQIDQLLPPQPTRKEFLDIVTIDYIKQCKPQVFIRQSRKLEHGKERFIYNC FT DTISYVYFDFVLKLFESGWQDSEAILSPGDYSSECLHAKISGYKYKAMLDYTDFNSQHT FT IQSMRLIFETMKELLPPEASFALDWCIASFDNMQTSDGRKWTATLPSGHRATTFINTVL FT NWCYTQMVGLKFDSFMCAGDDVILMSQEPISLAPILKSQFKFNPSKQSTGTRGEFLRKH FT YTEAGVFAYPCRAIASLVSGNWLSESLRDNTPILVPIQNGIDRLRSRAGLLGVPWKLGL FT SELIEREAIPKDVSMALLNSHAAGPGLITRDYSSFTVTPTPPKLHSSLEYTATRHGLQD FT LCKHVPWKQLTANECNKLGQQIKKMSHRHCSQTKITYKCVYEVFKPSGLPTVLSEVSQS FT ALSLVWWQAMLKEAMQDYSTKKKDAHMYACNACTSSVSGDAFLRATSKMAGVLITSLIS FT SSS" FT 3'UTR 4576. .4647 XX SQ Sequence 4647 BP; 1348 A; 1156 C; 926 G; 1217 T; 0 other; u08999 Length: 4647 23-SEP-2009 Type: N Check: 9662 .. 1 actcaacatt gctacttcat catgacgaac ccgtgacgcg gacacgtaac 51 aagcgtagtg tcctcgacga ttgccatcct cgtgtgaatt ccgggctccg 101 cttgcactga tggtacctcc tacgaaactt ggagaggctt cggtctcgaa 151 gagcggtaat gtaccctctg cgcctgggac ctaatcgcgt tttgctgtag 201 gtaattcagt agatggaagg attaagggtc aaacatcttg gttcgccaag 251 tatgtcctta ccagtactct actgagcttg aatacccatg gaggcttctg 301 ctaatgggtt atcacatgat gataatgcga ataaatcgca aaatgttgga 351 ccttctactc ttccgaggtc agataaacaa ggaggagaaa aacacgaaaa 401 ttcttttaat tctttttcta atgatttctt ttttaacttc ttacgtatgt 451 caaccaacac gcacatctca gacagtccag gcgtttcttt cgttgctaaa 501 gatggtacac catacagttc actcacaatt ccttcaggtg tcggtcgtct 551 tactcacaat gtagttgcat ctgccgtcca gctcaatatt acggccagta 601 acacattgga agtagactac ggcttcggcc aagatgtttc aagaaccaca 651 ggcactatcc caatccctat ctttgatggt gaaaaataca aagaaacggc 701 ccgcgcatta gccgctatct tcagcaagaa aggcatgtca gtcgacgtca 751 cctcacagac tgtccaagaa actctcaaga actcagacct caccatcgcc 801 actgtcgcag ctggatatta cacagcctta gctgcgcgtc acgaacttac 851 gaaagacgta agcgaggctg cccacaccat tccattcgtt accgctttat 901 cggatacatt ctcagcagca ctaaatgcac aacgtacaag ccatgtcata 951 tcttcttgct tacgttgtcc aaattccagg aatgctcaac gtgacatcgt 1001 aatcggtacg gttttatgga ataacgtttt tgttgagagc ctctccgaac 1051 acaacatggc ggttcccaat ccaaacgaca tatcattttt cattccgaac 1101 aaagctctct catcttcttg gtggtgcgct atttggctcc tcaatgcatt 1151 tcttcacagc tttatcgcac caactcgtat ccacatcttc attacacaag 1201 gagaaacata ccacctcgct cctttcaccg attcggatgt ctacgaggcc 1251 gttcgtttct tgctcgcaat gtcaaagtca tcacgcccaa tgccagagag 1301 cgtcgagagt atgttatatg catacggcac acagatgatc atccaaccac 1351 attcgctcta cacagaggga ggcttgatca gaagaatgat ctttacagtt 1401 ccacaccttc cagctcatgg ttacttcgtc acgaattccg aattctcgag 1451 atacatgaat atcgctgttc cagacgaccc gcgttctgca aaagacttcg 1501 ttatcggtgc aggaacaggt ctcttacaaa tcgtactggc ttaccaagct 1551 gctttcagct gtgctggacc tattgctctt cactggcatg caaacgatgc 1601 tatctcccaa ggtatggaca gagtcgcaag catctacctt gaaggaagat 1651 acttcaccat cccaatggca gttaacgttg ctactaatgt cgctcaatac 1701 actacaatgg tcagagccga tcctgaatac cgtcacacac ttgaccggat 1751 cttgcctcgc atattcggac catcgactga cacagtcttc gatttcatcg 1801 aatcagcaat tacgtcatct tgggtatcca ttgatgcccg caaacgcaac 1851 ggtcgtgcca gaaagttcag aacagctttc atcaaccgtt tccacgaccc 1901 agaattcgca tacatgttcg gtatcaccgg caacggtatc gagagaatgg 1951 aaggaaaagt tacttccaac atcgcccaag aagtcgatta tctcatgaac 2001 ggcggcgacc ttcgcaattg cccaattctc cgcacactta aggcagcaga 2051 gagagaagaa acagtcactt ttatgtgcaa ggaaaaggtc ggctcactct 2101 acgccatcga cggaacagtt cgcgtattca aacggttcga aacaatcgat 2151 cttgcccagc ttggctggac ttcacatggt aaggtgatga aaccttatgc 2201 atttcgcgct ccagtcattc aaggaatgac catctgcagt acagcgtaca 2251 catcaacggc catcgacatc atcacaacag tctttggtcc cttacgcctc 2301 cgcgtaggat ccctttttga gtaaagctgt acgttgtggc cccgtaatac 2351 catccgtcaa gcatcacttc aatatcaatt accttagtat tctcaaacac 2401 aatggtaacg aatacacttt tgtcccagga tacggatggg tattacagga 2451 tgattatttg ttgaatgccg tcaaaatggt tggagaaggt gatctacccc 2501 ccaatcaatt accttatgac gatgatcttt tatttacata cgcaaaaatt 2551 ttactttacg attacatatc tcattttcca gaattcagac acaagaatcc 2601 acgcttatta acaagtgaaa cagaactaca actcttcccg ctcaaggaaa 2651 actcagctgc caggactaaa gcaaatttct acgctaggac actatggaac 2701 gaaacaactt cggacaagtc agctttcaaa ccaggaactt acaatgacac 2751 agtcgcaggt ctattaatgt ggcaacaatg tgctttgatg tggtctctac 2801 ccaagttaat tatcaacaag attattagcg gtgtttgtga tgcattaacc 2851 gaaaaggtct cactcacgct attaaaacgg atttctgatt ggttaaaaca 2901 actcgggtta gcctactcac ctatattccg tcttttcatt gaactcccta 2951 ctctattagg acgcggagca atcccaggcg atgctgcact agacatgaag 3001 cacagattaa cttacaaccc attgatgaca gtcgatgttc caaagacaca 3051 actacacgac ttaatttaca gacttctatc acgtaactac aacaatacaa 3101 aaattagcag tttcgagcac cacctagaag aacgtttact ttggtcaagg 3151 tctggaagtc actactaccc tgatgaacaa atcgatcagt tacttccccc 3201 acaacctacc agaaaagagt tcttagatat agtaacaata gactacatca 3251 aacaatgcaa acctcaagtt ttcatcagac agtcacgcaa actagagcac 3301 ggcaaggagc gattcattta caattgtgac acgatctcat atgtctattt 3351 tgattttgtc ctgaagctct tcgagtcagg atggcaagat agtgaagcaa 3401 tactgtcacc aggcgattac tcaagtgaat gtctccatgc taaaatctct 3451 ggttataagt acaaagctat gttggactac acagatttca attcgcaaca 3501 cacaatacaa agcatgcgtt tgatcttcga aaccatgaaa gagctactcc 3551 cacctgaagc atcttttgct cttgactggt gtatcgcctc gtttgacaac 3601 atgcaaacgt ctgatggtcg caagtggacc gccactctcc caagtggaca 3651 ccgcgccacg acattcatta acaccgttct aaattggtgt tacacacaga 3701 tggttggttt aaagtttgat agtttcatgt gcgctggtga tgacgtcatc 3751 ctgatgtcgc aagagcctat atcactagcc ccaatcctaa aatcacagtt 3801 caagttcaat cctagcaagc agagtactgg tacaagaggt gaattcttac 3851 gtaaacacta taccgaagca ggtgtcttcg cgtatccatg tcgagcaatc 3901 gcaagcttgg tgagtggaaa ttggttaagc gagtcattga gagacaacac 3951 cccaatttta gtcccgatac agaacggaat cgatagatta cgtagtagag 4001 caggtttact cggagttcct tggaagttag gcctctctga gctcattgag 4051 agagaggcta ttcctaagga cgttagcatg gccctactaa attctcacgc 4101 agcaggaccg ggactaatca ctcgtgacta cagttctttc acagttacac 4151 cgactccgcc caagctacat agttcgttag aatacactgc gacccgacac 4201 ggtctccaag atttatgtaa gcacgtgcca tggaaacaac tcacagcaaa 4251 tgagtgcaat aagttaggac agcaaattaa gaaaatgagc cacaggcatt 4301 gtagccagac aaagataacc tacaaatgtg tctacgaagt ttttaaacct 4351 agtggacttc ctacggtgtt atccgaggtc agccagtcag cgttgtcgct 4401 ggtgtggtgg caagcaatgc ttaaggaagc aatgcaggac tactctacaa 4451 agaagaagga tgcacatatg tacgcttgta acgcatgtac aagctccgtt 4501 agcggagatg cgtttttacg agcgacatca aaaatggctg gtgttttaat 4551 cactagcttg atttcttctt cttcataacg tacagtagaa aagtctctag 4601 agttgctcaa gacttataat gagccagttt ggtctcacta taccttc