Sequence of DPV Trichomonas vaginalis virus 1

Trichomonas vaginalis virus 1 isolate IH-2 capsid protein gene, complete cds; and dsRNA-dependent RNA polymerase gene, partial cds.

ACC No: DQ270032

Dated: 2009-09-22 | Length: 4647 | CRC: -1416186258

                
ID   DQ270032; SV 1; linear; genomic RNA; STD; VRL; 4647 BP.
XX
AC   DQ270032;
XX
DT   01-NOV-2006 (Rel. 89, Created)
DT   22-SEP-2009 (Rel. 102, Last updated, Version 2)
XX
DE   Trichomonas vaginalis virus 1 isolate IH-2 capsid protein gene, complete
DE   cds; and dsRNA-dependent RNA polymerase gene, partial cds.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 1
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4647
RA   Kim J.W., Chung P.R., Hwang M.K., Choi E.Y.;
RT   "Trichomonas vaginalis virus IH-2 isolated from Korea";
RL   Unpublished.
XX
RN   [2]
RP   1-4647
RA   Kim J.W., Chung P.R., Hwang M.K., Choi E.Y.;
RT   ;
RL   Submitted (28-OCT-2005) to the EMBL/GenBank/DDBJ databases.
RL   Biochemistry, Inha University, 253 Younghyun-Dong, Nam-Ku, Incheon 402-751,
RL   Korea
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .4647
FT                   /organism="Trichomonas vaginalis virus 1"
FT                   /isolate="IH-2"
FT                   /mol_type="genomic RNA"
FT                   /country="South Korea"
FT                   /db_xref="taxon:674953"
FT   CDS             288. .2324
FT                   /codon_start=1
FT                   /product="capsid protein"
FT                   /note="P75"
FT                   /db_xref="UniProtKB/TrEMBL:A0EQ15"
FT                   /protein_id="ABC86750.1"
FT                   /translation="MEASARGLSHDDNANKSQNVGPSTLPRSDKQGGEKHEISFNSFSN
FT                   DFFFNFLRMSTNTHISDSPGVSFVGKDGTPYSSATIPSAVSRLTHNVVAAAAQLNITSD
FT                   NVLEVDYGFGQDVSRSTGTITIPIFDGEKYKETARALAAIFSKKGSAVDVTSQTVQETL
FT                   KNSDLTIATVAAGYYTALAARHELTKEVSLAQHTIPFVTALSDTFTAARGAQRSSHVIS
FT                   SCLRCPASNNAQRDVAIGTNMWTNVFIESLSAQNMVVPNANDVSFFIPNKSLPPSWWCA
FT                   IWLLNAFLHSFVAPTRFHIFITPGETYHLAPFTDADVYEAIPILLAMSKSSRPVPESVE
FT                   SMLYAYGTQMIIQPHSLYTEGGLIRKMIFTVPHLPAHGYFVTNSEFSRYMNIAVPNDPR
FT                   SAKDYIIGAGTGLLQIVLAYQAALSCAGPIALHWHGNDAISQGMDTIATTYLQGRYFTV
FT                   PIAANVVNNVAQYTTLVRADPEYRHTLERILPRIFGPSIDTIYNFIESAISSSWTSIDA
FT                   RKRNGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSNIAQEVDYLLNGGDLR
FT                   NCPVLRTLKAAEREGTITFMCKEKVGSLFAIDGTVRVFKRYETIDLAQLGWTSHGKVMK
FT                   PYAFRAPIMQGMTICNTAYTSTDIDVVTTVFGPLRYHVGALFE"
FT   CDS             <2308. .4578
FT                   /codon_start=1
FT                   /product="dsRNA-dependent RNA polymerase"
FT                   /note="P86"
FT                   /db_xref="GOA:A0EQ16"
FT                   /db_xref="InterPro:IPR001795"
FT                   /db_xref="UniProtKB/TrEMBL:A0EQ16"
FT                   /protein_id="ABC86751.1"
FT                   /translation="VPFLSKAVRCGPVVPSIKHHFNFKHVIDIKHGGNNYTFIPGYGWV
FT                   LQDDYLLNAVKMTGEGDLPPDQLPYDDDLLLTYAKILLYDYITHFPKHRYKNPKILTPE
FT                   TELQLFPLKTDSAARNKVNFYARSLWNETTTDKSAFKPGTYNDTVAGLLMWQQCALMWS
FT                   LPHSVINKTISGVCDALTDRTSLALLKRISDWLKQLGLAFSPIHRLFIELPTLLGRGAI
FT                   PGNAIKDIKHRFKFDPSITVDVPKDLLHSLIYRLLSRNLDISKKNSFEHHLEERLLWSK
FT                   SGSHFYPDEMIDQLLPKQPTRKEFLDVVTADYIKQCAPRTYIRQSRKLEHGKERFIYNC
FT                   DTISYVYFDFILKLFETGWQDREAILSPGDYTSERLHTRISSYKYKAMLDYTDFNSQHT
FT                   IESMRLVFETMKELLPSETAFALDWCIASFDNMQTTDGKKWVATLPSGHRATTFINTVL
FT                   NWCYTQMVGLEFDSFMCAGDDVILMSHAPISLAPILTSPFKFNPSKQSTGTRGEFLRKH
FT                   YTNEGVFVYPTRAIASLVSGSWLSDSLRENTPILVPIQNGIDRLRSRAGLLGVPWISGL
FT                   SELIEREAIPKDVGMALLNSHAAGPGLITRDYSSFTVTPNPPQVSSTLEYTATRYGLQD
FT                   LCKHVPWNQLSTTECNRLGQQIKKMSHRHCSQAKITYHYTHEVFKPSGLPTVLSDASQP
FT                   SLSMVWWQAMLKEAMQDNSTKKIDAHMFACNACTGCVSGDAFLRANPKLAGVLITSLIT
FT                   SSS"
XX
SQ   Sequence 4647 BP; 1337 A; 1190 C; 924 G; 1196 T; 0 other;

dq270032 Length: 4647  22-SEP-2009  Type: N  Check: 3887  ..

       1  actcaacatt gctactccgt atgacgaatc cacaacgcgg acacgtaaca
      51  agcgtagtgt cctcgacgat tgccatcctc gtgtgaattc cgggctccgc
     101  ttgcactgat ggtacctctt acgaaacttg gagagacttc ggcctcgaag
     151  agcggtaatg tatcctctgc gcctgggacc taatggtgtt tttgctgtag
     201  gtaatttcag tggtaggaag attaggggtt aaacatcctg gttcgctagg
     251  tttgtcctta actttactca gctgatgatg aatacccatg gaggcttctg
     301  ctagggggtt atcacatgat gataatgcga ataaatcgca aaatgttgga
     351  ccttctactc ttccgaggtc agataaacaa ggaggagaaa aacacgaaat
     401  ttcttttaat tctttttcta atgatttctt ttttaatttt ttacgtatgt
     451  ctacgaacac tcacatctcc gacagtccag gcgtttcatt tgttggaaaa
     501  gacggcaccc cctattcatc tgctacaatc ccttccgcag tcagtcgtct
     551  cacacacaac gtagtagccg cggcagctca actgaacatc acatccgata
     601  atgttttaga agtagattac ggctttggcc aagacgtttc aagatctaca
     651  ggaaccatca caatccctat cttcgatggc gaaaagtaca aggaaacagc
     701  tcgcgcttta gcagcgatct tcagcaagaa aggctcagca gttgatgtca
     751  cttcccagac agtccaagag acactcaaga actcggacct cacaattgct
     801  actgtcgcag ccggatacta tacagcctta gccgctcgcc acgaactcac
     851  aaaagaagta agccttgccc agcacacgat cccattcgta acagctttat
     901  ccgacacctt taccgctgca cgaggtgcac aacgttcaag ccatgttatc
     951  tcttcctgct tacgttgccc agcttccaat aacgcgcagc gcgatgtcgc
    1001  aatcggcact aacatgtgga ccaacgtctt catcgaaagt ctctcagcac
    1051  agaatatggt agttccaaat gccaatgatg tatcattctt catcccaaac
    1101  aagagtcttc caccctcttg gtggtgcgct atttggctcc tcaatgcttt
    1151  ccttcacagc ttcgtcgctc caactcgttt ccacatcttc atcacacctg
    1201  gagaaacata tcaccttgct ccattcacgg atgccgacgt ttacgaagct
    1251  atccctatct tgctcgcaat gtcaaagtcg tctcgccctg ttccagaaag
    1301  tgttgaaagt atgctttacg cttacggcac ccagatgatc atccagccac
    1351  attcactcta caccgaaggt ggattgatca gaaagatgat cttcaccgtc
    1401  ccacatcttc cagcacacgg ttacttcgtt acgaattcag aattctcgag
    1451  atacatgaac atcgctgttc caaacgatcc tcgttccgcc aaagattata
    1501  ttataggtgc tggaacgggc ctcttacaga tcgtacttgc ctaccaggct
    1551  gccctcagct gtgctggccc catcgctctt cattggcacg gaaatgatgc
    1601  tatctcccaa ggcatggaca caattgcaac cacctatctt caaggaagat
    1651  acttcacagt tccaatcgca gccaatgtag tcaacaacgt cgcccagtac
    1701  actacactcg ttcgcgccga tcctgagtac agacacacac tcgaacgaat
    1751  cttgccacgc atattcggcc catcgatcga cacaatctac aacttcatcg
    1801  aatccgccat ttcgtcatcc tggacatcga tcgacgctcg caagcgcaat
    1851  ggccgcgcca gaaagttcag aacagctttc atcaatcgct tccatgatcc
    1901  agaattcgcc tacatgttcg gtatcactgg taatggtatc gagagaatgg
    1951  aaggaaaagt tacttccaac atcgcgcaag aagtcgacta cctcttaaac
    2001  ggaggtgacc ttcgcaactg cccagttctt cgtacgttaa aggcggcaga
    2051  aagggaaggc acgattactt tcatgtgcaa ggagaaagtc ggctcactct
    2101  tcgccatcga cggcacagtt cgcgtattca agagatacga aacgatcgac
    2151  ctcgcccagc tcggttggac ctcacacggc aaagtcatga aaccatacgc
    2201  cttccgggct ccgatcatgc aaggaatgac catctgcaac acggcttaca
    2251  catccacaga catcgacgtc gtcacaacag tcttcggtcc cttacgttac
    2301  cacgtaggtg ccctttttga gtaaggctgt acgttgtggc cctgtagtac
    2351  catccatcaa acatcacttc aacttcaaac atgttataga tatcaagcac
    2401  ggtggtaaca attacacatt catccccggc tacggatggg tattacagga
    2451  tgattactta ttgaatgccg taaagatgac tggcgaaggc gatctacctc
    2501  ccgaccagtt accttacgat gatgatcttt tacttacata cgcaaaaatt
    2551  ttactttacg attacataac tcattttcct aaacacagat acaagaaccc
    2601  aaagatacta actccagaaa cagaactaca gctcttccca cttaagacag
    2651  actcagcagc tagaaataaa gtaaacttct acgctagatc actatggaac
    2701  gaaacaacta cagataaaag tgctttcaaa ccaggaacat ataacgatac
    2751  agttgcaggc ttattgatgt ggcaacaatg tgcactcatg tggtcactgc
    2801  cccattcagt tatcaacaag acaattagcg gtgtttgtga tgcattaacc
    2851  gataggactt cactcgcgct attgaaacga atttcagatt ggttgaaaca
    2901  acttgggctg gccttttcac ctatccaccg cttattcatt gagcttccta
    2951  cattgttagg acgcggtgcg attccaggca atgccataaa ggatataaaa
    3001  catagattca aatttgatcc atcgataaca gttgatgttc caaaagattt
    3051  attacatagt ttgatctaca gactcttgtc tcgaaatctt gatatttcta
    3101  aaaagaacag cttcgagcac cacctagaag aaagattact ttggtcgaaa
    3151  tcaggaagcc atttttatcc agatgaaatg atagatcaat tgcttcctaa
    3201  gcagcctaca agaaaagaat tcctagatgt tgtaacagca gattacatta
    3251  aacagtgtgc accccgaacc tatatcagac agtcacgcaa gctggagcac
    3301  ggtaaggaac gtttcatcta caattgtgac acgatctcat atgtctattt
    3351  tgattttatc ctgaagctct tcgagacagg atggcaagat agagaagcaa
    3401  tattgtcacc aggcgactac accagtgaac gtctccatac cagaatctcc
    3451  agttacaaat acaaagccat gttagattac accgatttca actctcaaca
    3501  cacaatcgaa agcatgcgac tagtttttga gactatgaag gaactactcc
    3551  cttctgaaac agctttcgcc ctcgactggt gtattgcctc attcgataat
    3601  atgcagacaa cggatggcaa gaaatgggtg gccacccttc ccagcggaca
    3651  tcgtgctaca acttttatca atacagtctt aaattggtgc tatacacaaa
    3701  tggtcggtct tgaattcgac agtttcatgt gtgctggcga tgacgtcatc
    3751  ttgatgtccc acgccccaat ttcgctagct ccaattctca catcaccttt
    3801  caagtttaat cccagtaaac agagcacagg tacaagaggc gagttcttac
    3851  gcaagcatta cactaacgaa ggtgtttttg tttatcctac acgagctatt
    3901  gcaagtttgg tgagtggaag ctggttaagc gattcattaa gagagaacac
    3951  tccaatattg gtcccaatac agaatggaat cgacagatta cgtagtagag
    4001  caggtttact cggagtccct tggatttctg gcctctcaga actcattgag
    4051  agagaggcta ttcctaagga cgtcggcatg gctttactaa attcacacgc
    4101  agcaggacca ggtttaatca ctcgcgacta cagttctttc acagtcacac
    4151  caaatccacc ccaagttagc agtacactcg aatacacagc aacccgctat
    4201  ggtctccaag atttatgcaa acacgtacca tggaatcaac tctcaacgac
    4251  agaatgtaac agattaggac aacaaattaa gaaaatgagt cacaggcatt
    4301  gtagccaggc taagataact tatcactata ctcacgaggt cttcaaacct
    4351  agtgggctcc ccacggtgtt atccgacgcc agccaaccat cgttgtcgat
    4401  ggtgtggtgg caggcaatgc ttaaagaagc aatgcaggac aattctacga
    4451  agaagataga tgctcatatg tttgcttgta acgcatgtac aggctgcgtc
    4501  agcggcgatg cgtttttacg agcgaatcca aaactggctg gtgtcttgat
    4551  cactagtctg atcacttctt catcataacg tacagctaga aagtctctat
    4601  ggttgctcaa gacttataat gagccagatc ggcctcacta taccttc