Sequence of DPV Trichomonas vaginalis virus 1

Trichomonas vaginalis virus 1 isolate Changchun capsid protein gene, complete cds; and RNA-dependent RNA polymerase gene, partial cds.

ACC No: DQ528812

Dated: 2011-10-24 | Length: 4291 | CRC: 1416171226

                ID   DQ528812; SV 1; linear; genomic RNA; STD; VRL; 4291 BP.
XX
AC   DQ528812;
XX
DT   20-MAY-2006 (Rel. 87, Created)
DT   24-OCT-2011 (Rel. 110, Last updated, Version 3)
XX
DE   Trichomonas vaginalis virus 1 isolate Changchun capsid protein gene,
DE   complete cds; and RNA-dependent RNA polymerase gene, partial cds.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 1
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RC   Publication Status: Available-Online prior to print
RP   1-4291
RX   PUBMED; 21861063.
RA   Li W., Ding H., Zhang X., Cao L., Li J., Gong P., Li H., Zhang G., Li S.,
RA   Zhang X.;
RT   "The viral RNA-based transfection of enhanced green fluorescent protein
RT   (EGFP) in the parasitic protozoan Trichomonas vaginalis";
RL   Parasitol. Res. 0:0-0(2011).
XX
RN   [2]
RP   1-4291
RA   Zhao Y., Zhang X., Li J., Liu Q., Yin J., Gong P.;
RT   "Identification and Characterization of Trichomonas vaginalis dsRNA Virus
RT   in China";
RL   Unpublished.
XX
RN   [3]
RP   1-4291
RA   Zhao Y., Zhang X., Li J., Liu Q., Yin J., Gong P.;
RT   ;
RL   Submitted (03-MAY-2006) to the INSDC.
RL   College of Animal Science and Veterinary Medicine, Jilin University, 5333#
RL   Xian Road, Changchun, Jilin 130062, China
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .4291
FT                   /organism="Trichomonas vaginalis virus 1"
FT                   /host="Homo sapiens"
FT                   /isolate="Changchun"
FT                   /mol_type="genomic RNA"
FT                   /country="China"
FT                   /db_xref="taxon:674953"
FT   CDS             1. .2037
FT                   /codon_start=1
FT                   /product="capsid protein"
FT                   /db_xref="UniProtKB/TrEMBL:Q1G0T7"
FT                   /protein_id="ABF57712.1"
FT                   /translation="MEASANGLSHDDNAIKSHNVGPSTLPGSDKQGREKHINSFNSFSN
FT                   DFFFNFLRMSTNTHISDSPGVSFIGKDGTPYSSATIPSAVGRLTHEVIASAVQLNISTS
FT                   NTLEVDYGFGQDVSRTTGTIPIPIFDGEKYKETARALAAIFNKKGMSVDVTSQTVQETL
FT                   KESDLTIATVAAGYYTALAARHELTKQVSVATHTIPYVTALSDTFTAAQNAQRSSHIIS
FT                   SCLRCPNAHNVQRDIGIGTVMWTNVSVESLAAPNMVVPNPNDISFFIPNKSLPSSWWCA
FT                   IWLLNAFLQSFIAPTHIHIFITPGETYHLAPFTDTDVYEAVRLLLAMSKTSRPIPESVE
FT                   SMLYAYGTQMIIQPHSLYTEGGLIRKMIFTVPHLPAHGYFVTNSEFSRYMNIAVPDNPR
FT                   SAKDFVIGVGTGLLQIVLAYQAALSCAGPIALHWHGNDITSEGMDRIADTYLEGRYFTI
FT                   PMAANVATNVAQYTIRVRADPEYRHTLDRILPRIFGPSTDTVFDFIESAITSSWVSIDA
FT                   RKRNGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSNIAQEVDYLMNGGDLR
FT                   NCPVLRTLKAAEREETVTFMCTQKVGSLYAIDGTVRVFKRYQTIDLAQLGWTSHGKVMK
FT                   PYAFRAPLMQGMTICNTAYTSTDIDIVTTVFGPLRNRVGSLFE"
FT   CDS             <2021. .4291
FT                   /codon_start=1
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="RDRP"
FT                   /db_xref="GOA:Q1G0T6"
FT                   /db_xref="InterPro:IPR001795"
FT                   /db_xref="UniProtKB/TrEMBL:Q1G0T6"
FT                   /protein_id="ABF57713.1"
FT                   /translation="DPFLSKAVRCGPIVPSVKHHFNIVWYSILKRKGKEYTFIPGYGWV
FT                   LQDDYLLSAVKMAGEGVLPPNQLPNDDDISFTLTKILLYDYISHFPEFRYTNPKILTQE
FT                   TELQLFPLKIDSAARTKANFYARTLWNDLATDKSAFKPGTPNDTVAGLLMWQQCALMWS
FT                   LPKLIINKTISGVCDALTERTSLTLLKRISDWLKQLGLAYSPIFRLFIELPTLLGRGAI
FT                   PGDSTLDIKHRLRYNPSITVDVPKDQLHDLIYRLLSRNLNGMKADSFEHHLEERLLWSK
FT                   SGSHFYPDDNIDQLLPPRPTRKEFLDIVTTDYIKECKPQVFIRQSRKLEHGKERFIYNC
FT                   DTISYVYFDFVLKLFESGWQDSEAILSPGDYTSEQLHAKISRYKYKAMLDYTDFNSQHT
FT                   IQSMRLIFETMKELLPPEATFALDWCIASFDNMQTSDGHKWVATLPSGHRATTFINTVL
FT                   NWCYTQMVGLKFDTFMCAGDDVILLSQEPISLAPILTSQFKFNPSKQSTGTRGEFLRKH
FT                   YTEEGVFAYPCRAIVSLVSGNWLSDSLRENTPILVPIQNGIDRLRSRAGLLGVPWSLGL
FT                   SELIEREAIPKEVGMALLNSHAAGPGLITRDYSSFTVTPTPPKISSTLEYTATRHGLQD
FT                   LSKHVPWKQLTVEECNKLGQQIKKMSHRHCSQAKITYKCVYEAFKPRKLPTVLSEASQS
FT                   ALSLVWWQAMLKEAMQDYSTKKIDAHMFVCNACTSSVSGDAFLRATSKMAGVLITSLIS
FT                   SSS"
XX
SQ   Sequence 4291 BP; 1287 A; 1092 C; 825 G; 1087 T; 0 other;

dq528812 Length: 4291  24-OCT-2011  Type: N  Check: 6387  ..

       1  atggaggctt ctgctaatgg gttatcacat gatgataatg cgattaaatc
      51  gcacaatgtt ggaccttcta ctcttccggg gtcagataag caaggaagag
     101  aaaaacatat aaattctttt aattcttttt ctaatgattt cttttttaat
     151  tttttacgta tgtctacaaa cacgcacatc tcagacagtc caggcgtctc
     201  tttcatcgga aaagacggca caccttacag ctcagctacg atcccatcag
     251  ccgtaggccg tcttactcat gaagttatag catcagccgt ccagctcaac
     301  atctcaacaa gcaatacttt agaagtagat tacggttttg gtcaagatgt
     351  atcaagaact acaggaacca tcccaatccc tatctttgat ggtgaaaaat
     401  acaaagaaac agctcgcgcc ttagccgcaa tcttcaacaa gaaaggtatg
     451  tcagttgatg ttacttcaca gacagtccaa gaaaccttga aagaatcaga
     501  tcttactatt gctacagtcg ctgccggata ctacacagca ctggctgcac
     551  gccacgagct cacgaaacaa gtaagcgtcg ccacccacac aatcccatac
     601  gttactgcct tatccgatac attcacagca gcacaaaacg cacaacgttc
     651  aagccatatt atatcttcct gcttacgttg tcccaacgct cacaatgtcc
     701  agcgcgatat cggaattggt acagttatgt ggaccaatgt ttctgtcgag
     751  agccttgctg ctccaaacat ggtagttcca aatccaaatg acatatcatt
     801  cttcattcca aacaagagtc tcccttcttc ttggtggtgc gcgatctggc
     851  ttctcaatgc tttccttcaa agcttcatcg ctcccacaca catccacatc
     901  ttcatcacac caggagaaac ataccacctc gctccattca cggacactga
     951  tgtttacgaa gctgtccgcc tcttgctcgc tatgtcgaaa acgtcgcgcc
    1001  caataccaga gagcgtcgaa agcatgctct atgcttacgg tacacaaatg
    1051  attatccagc cacattcact ttacacagaa ggcggcttaa tcagaaagat
    1101  gatcttcaca gtcccacacc tcccagcaca tggttatttc gtcaccaatt
    1151  ccgaattctc gagatacatg aacatcgctg tcccagataa cccccgttcg
    1201  gcaaaagatt tcgtcatcgg tgttggcacc ggtcttttgc aaatcgtact
    1251  cgcctaccag gccgcactca gctgcgccgg acctattgca cttcactggc
    1301  atggtaacga catcacctct gaaggcatgg acagaattgc agatacttat
    1351  ctcgaaggaa gatacttcac cattccaatg gcagccaacg tcgccaccaa
    1401  cgtcgcccaa tacacaataa gggttcgcgc cgatcccgaa tatcggcata
    1451  ccctcgaccg aatcttacca cgcatattcg gtccctcaac ggacacagtc
    1501  ttcgatttca tcgaatcagc gatcacatca tcctgggtat caatcgatgc
    1551  ccgcaagcgc aatggccgtg caaggaagtt cagaacggca ttcatcaacc
    1601  gtttccacga tccagaattc gcatacatgt tcggtatcac aggtaacggt
    1651  atcgaaagaa tggaaggcaa agttacttcg aatatcgccc aagaagttga
    1701  ttacctcatg aacggtggcg atcttcgcaa ctgtccagtc cttcgtacgc
    1751  tcaaggcggc agaaagagaa gaaacagtca cattcatgtg cacacagaaa
    1801  gtcggctcgc tctacgccat cgatggcaca gtccgcgtat tcaagcgata
    1851  ccaaacaatc gacctcgccc aacttggttg gacatcacac ggcaaggtga
    1901  tgaaacctta cgctttccgc gcgccactca tgcaaggaat gaccatctgc
    1951  aacacagcct acacatccac agacatcgac attgtcacaa cagtttttgg
    2001  cccattacgc aatcgcgttg gatccctttt tgagtaaagc tgtacgttgt
    2051  ggccctatag taccatccgt caaacatcat ttcaatattg tttggtattc
    2101  tattctcaaa cgcaaaggta aagaatacac gtttatccca ggatacggat
    2151  gggtactaca ggatgattat ttgttgagtg ccgtcaaaat ggctggtgaa
    2201  ggtgttttac caccaaacca gttacctaat gacgatgata tttcatttac
    2251  attaacaaaa attttacttt acgattacat atctcatttt ccagaattca
    2301  gatacacaaa tccaaaaata ctaacacaag aaacagaatt acaacttttt
    2351  ccattaaaaa tagattcagc tgctagaaca aaagcaaact tttacgctag
    2401  aacactatgg aatgacttag ccaccgacaa atctgctttc aaaccaggaa
    2451  ctcctaatga tacagtcgcc ggtttactga tgtggcaaca atgcgcttta
    2501  atgtggtctc tacccaagtt aattatcaac aaaacaatta gcggtgtttg
    2551  tgatgcatta accgaaagga cttcactcac gctactcaag cgtatctcag
    2601  attggttgaa acaactcggg ttagcctact cacctatatt tcgcctcttc
    2651  atcgagctcc ctacattatt aggacgagga gcaatcccag gcgatagcac
    2701  actagatata aagcacagac tcagatataa tccatcaata accgtcgatg
    2751  tcccaaagga tcaattacat gatttgatct acagactctt atcacgcaat
    2801  ctcaatggta tgaaagctga cagtttcgaa caccacctag aagaacgctt
    2851  gctctggtcg aagtcaggaa gccactttta tcccgatgac aatatcgatc
    2901  aattacttcc cccacggccc accagaaagg agttcttaga cattgtaact
    2951  acagactata tcaaagaatg caaaccccaa gtttttatca gacaatcacg
    3001  taaactggag cacggcaagg aacggttcat ctacaattgc gatacgatct
    3051  catatgtcta ttttgatttt gtcctgaagc tcttcgagtc aggatggcaa
    3101  gatagcgaag caatactgtc gccaggagat tacacaagtg aacaactcca
    3151  cgccaagatc tcccgttata agtacaaagc tatgttagac tacacagatt
    3201  tcaactcgca acatacaatc cagagtatgc gattgatttt tgaaaccatg
    3251  aaagagctac tcccacctga agcaaccttc gctcttgatt ggtgcatcgc
    3301  ctcatttgat aacatgcaaa cgtctgatgg ccacaaatgg gttgcaaccc
    3351  tcccaagcgg acaccgtgct acaacattta tcaataccgt cttaaattgg
    3401  tgttacaccc aaatggttgg tctcaagttt gacactttca tgtgcgctgg
    3451  tgacgacgtc atccttttat ctcaggaacc aatatcacta gcccctatcc
    3501  ttacatcaca gttcaagttc aatcctagca aacaaagtac aggtacaaga
    3551  ggtgaattct tacgcaagca ctacactgaa gaaggtgtct tcgcgtatcc
    3601  gtgccgagca atagtaagct tagtaagtgg aaattggtta agcgattcat
    3651  tacgagagaa caccccaatt ttagtcccaa tacagaatgg aattgacaga
    3701  ctacgcagca gagcgggttt actcggagtt ccttggagtt taggtctctc
    3751  agagctcatt gagagagagg ctatcccaaa ggaagtcggc atggcactgc
    3801  taaattcaca cgcggcagga ccgggactaa tcactcgtga ctacagttct
    3851  tttacagtta caccgactcc acctaaaata agcagtacat tggaatacac
    3901  tgcgacacgt catggtcttc aagacttatc caaacatgta ccgtggaagc
    3951  agcttacagt cgaagaatgc aataaattag gacagcaaat taagaaaatg
    4001  agccacaggc attgtagcca ggcgaagata acctacaaat gtgtctacga
    4051  agcttttaag cctcgtaaac ttcctacggt gttatccgaa gccagccagt
    4101  cagcgttgtc gctggtgtgg tggcaagcaa tgcttaagga agcaatgcag
    4151  gactattcta caaagaagat agatgctcac atgtttgttt gtaacgcatg
    4201  tacaagctcc gttagcgggg atgcgttttt acgagcaaca tccaagatgg
    4251  ccggtgtctt aatcactagc ttgatttctt cttcttcata a