Sequence of DPV Trichomonas vaginalis virus 3

Trichomonas vaginalis virus 3 capsid protein (cap) and RNA-dependent RNA polymerase (pol) genes, complete cds.

ACC No: AF325840

Dated: 2011-03-04 | Length: 4844 | CRC: -1198835809

                ID   AF325840; SV 1; linear; genomic RNA; STD; VRL; 4844 BP.
XX
AC   AF325840;
XX
DT   05-DEC-2001 (Rel. 70, Created)
DT   04-MAR-2011 (Rel. 108, Last updated, Version 4)
XX
DE   Trichomonas vaginalis virus 3 capsid protein (cap) and RNA-dependent RNA
DE   polymerase (pol) genes, complete cds.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 3
OC   Viruses; dsRNA viruses; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4844
RX   PUBMED; 21110050.
RA   Bessarab I.N., Nakajima R., Liu H.W., Tai J.H.;
RT   "Identification and characterization of a type III Trichomonas vaginalis
RT   virus in the protozoan pathogen Trichomonas vaginalis";
RL   Arch. Virol. 156(2):285-294(2011).
XX
RN   [2]
RP   1-4844
RA   Bessarab I.N., Tai J.H.;
RT   "The complete DNA sequence of type III Trichomonas vaginalis virus";
RL   Unpublished.
XX
RN   [3]
RP   1-4844
RA   Bessarab I.N., Tai J.H.;
RT   ;
RL   Submitted (04-DEC-2000) to the EMBL/GenBank/DDBJ databases.
RL   Division of Infectious Diseases, Institute of Biomedical Sciences, Academia
RL   Sinica, Taipei, Taiwan 11529, Republic of China
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .4844
FT                   /organism="Trichomonas vaginalis virus 3"
FT                   /mol_type="genomic RNA"
FT                   /db_xref="taxon:170965"
FT   gene            360. .2486
FT                   /gene="cap"
FT   CDS             360. .2486
FT                   /codon_start=1
FT                   /gene="cap"
FT                   /product="capsid protein"
FT                   /db_xref="UniProtKB/TrEMBL:Q8V616"
FT                   /protein_id="AAL37369.1"
FT                   /translation="MSAPEPLNTEVRSPNGVSEAIETQNMAVTQSSVSNEIKNDTQSDL
FT                   QTLKKQLQPLYRSTDFNTLYNFFYGLDVPASTDRVGHAIQRNTSVNDTNEVVSFPLTAT
FT                   VSHTFSNTPVPANIQPLQISVADDSVNYELDESGTLCPTLDSSVHVQRATSLASALKVK
FT                   LTGEIMHSDSVRPVQTPQLIAYLFGVLLGVKDRVNIHRNQPTNLWRSLCSPGRAAQAKP
FT                   FFDEFPNNKFRTGALLAPPLPDAGFGPFPAEGLNQNSKLDFRSKGYVFYKQRTYNPDEM
FT                   NRAFWFLWAIYNRMPEDFQLSYPLNITFCTSELPVQNPMPGADAISNEQCEKALLLLEK
FT                   VVLEFFNNDRKLAYYYVFKGCQFVMRPCSCYQEGGLIRKASRNVALRGFTGIYYLAGFT
FT                   DQYANMISCAVHPGVVGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV
FT                   ADENYWGIAPGIEAEPVGMYYMDIIQRKAEHDLFIETFMDIYGSTASIICANIETSLFT
FT                   SGTNVLNHRMQKDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILANGVIPLSGSFEV
FT                   DILKEAELLVTGEDIRNLPGLRCLCTRGLDAILGLRPVQQKRKKMCYFRTLDGNFHEVT
FT                   IRSETRDLQVWRDHGYLARPYACHIIDSEGIEFYDKSNGLYKGRVNVLVSGFAIPGRAY
FT                   QGPRLQVATMAAQI"
FT   gene            2645. .4690
FT                   /gene="pol"
FT   CDS             2645. .4690
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /db_xref="GOA:Q8V615"
FT                   /db_xref="InterPro:IPR001795"
FT                   /db_xref="UniProtKB/TrEMBL:Q8V615"
FT                   /protein_id="AAL37370.1"
FT                   /translation="MDHVSDIAYLNFLRWVLLPYNGQTLRPHPSVWRQTPYPEHVNLKF
FT                   LNKEMELELFPLKKAPQADLKVNCYARNVLASTELTDDLLKQSLPIGLNNDSVCGIVIV
FT                   LELLRIAGVPSKLLPIIGQAIANKDPFIKELSDFNKMIGATTSRIANILTECNTLIGRG
FT                   VKSSDPSADLYHRVAPEGNRHEAKIPRHILIEAINKIYKNEMTDMPPPGDFKLHLITSP
FT                   LWCKAGSHHHPHFAKYSSRLDFVMDVPADKIAAVPPSVFITQAEKLEHGKTRYIYNCDT
FT                   XSYLFFDYILHYVECVWSNESVLLNPAAMSVERFSVLDYPEYCMIDYTDFNSQHSLESQ
FT                   KLVFECLRPYLPSEMHPILDWCIASMDHMEIGGQHWLSTLPSGHRATTFINSVLNKAYL
FT                   IPYIGDTTSFHCGDDVLLCGKYDYQTLIDTLPYELNKSKQSFGPNAEFLRLHRRAGDVI
FT                   GYPSRAVSSLVSGNWLSKTSWEWQPSLISVTNQCNVIISRSQLNIRFIPAMQQELRNRY
FT                   IDKMSEPFDVGSDYYVMPGCPCYSDAATTIVPNVPQLECSDVPFSQAQKVFDTMRDICP
FT                   EFTTVNDVIDRVLARRTSNAVKNITYNVCAPVAPQVCIAVNPAHYQFLLRKKYYPREHI
FT                   APPGFDDSTNSKLVFSTYDLAPSIAMKSCAVLTPAKIICGHGLRSG"
XX
SQ   Sequence 4844 BP; 1270 A; 1287 C; 1025 G; 1258 T; 4 other;

af325840 Length: 4844  04-MAR-2011  Type: N  Check: 1994  ..

       1  cttaaaaagc ctagtccact ttttaagccg gttagacttc accgtagata
      51  cttgggcaaa attaatcaac accctcctgg aatcgccggg gtgttgcgag
     101  ccgtaagaga ccggttctaa aggactgaca tagcgccgcg agggtaggtg
     151  gtcgatagcc cgtttgacgg agtagcgata ttcctgattc tggtgtagca
     201  tcgacggggg ccccctagcg tgagctcagc acgttgtgaa aacgaaaaac
     251  tgcatgtctg aggcttcgca gtagcgtgag ctcggagcac cctaaaaagt
     301  gctctgtttt gtaacagcca acgattgtta cgaaactcta gtgtattgcg
     351  tgcaacggta tgtcagctcc cgagccctta aatactgaag tacgctcacc
     401  taatggtgtt agtgaagcca tagaaactca aaacatggct gtcactcaaa
     451  gcagtgtgtc aaacgaaatt aaaaatgaca cacaaagtga tctgcaaaca
     501  cttaaaaaac agttacaacc gctctacaga tccacagatt ttaatactct
     551  ttataatttc ttttatggtt tagatgtccc agcttcaaca gatcgcgtcg
     601  gtcacgctat ccagcgtaac acctcagtca atgatacaaa tgaagttgtt
     651  agtttccctc ttacggcgac ggtatcacac acgttttcaa acacgccagt
     701  tccagccaac atacagcctc ttcaaatctc agtagccgac gattcagtca
     751  actatgagct cgatgaaagt ggaacattat gtccaacgtt agatagctcc
     801  gtccatgtcc aaagagccac ttccttggcc agcgctctca aggtcaaatt
     851  aacgggcgaa atcatgcact ctgactcagt cagaccagtc caaactccac
     901  aattgattgc ttatttattc ggcgtcctcc tcggtgtcaa agatcgcgtc
     951  aacattcacc gcaatcaacc cactaactta tggagaagtt tatgttcgcc
    1001  tggtcgtgct gctcaggcaa agcctttctt cgatgaattt ccaaacaaca
    1051  agttcaggac cggtgctctt ttggcacctc ctctccccga tgccggcttc
    1101  ggtcctttcc cagctgaagg ccttaaccaa aattccaagc ttgattttag
    1151  atcaaaggga tacgtcttct acaaacaacg cacttacaat ccagacgaaa
    1201  tgaaccgtgc gttctggttc ctttgggcga tctacaatcg tatgcctgaa
    1251  gatttccaac tatcataccc attaaacatt accttctgca cttccgaatt
    1301  gccagtccaa aacccgatgc caggggccga cgcaatctca aatgagcaat
    1351  gtgaaaaagc gctccttctc ctcgaaaaag tcgtcctcga attcttcaac
    1401  aatgatcgca aactcgctta ttattacgtt ttcaaaggat gccagttcgt
    1451  tatgcgccct tgctcctgct accaagaagg aggcttgatc cgcaaagcct
    1501  cacgcaacgt tgccctccgt ggcttcaccg gtatctatta tttggccggc
    1551  ttcacagatc aatacgccaa catgatttca tgtgctgttc acccaggtgt
    1601  tgttggcgct ctcttccaat acgtcgatac gatggtttta caagcggttt
    1651  tctccctctc tggcccaaag ttggtccgct tcgccgctcc gcctgaatat
    1701  caaggtcgcc acgcttgtcc tttctcattt gtcgccgacg aaaattactg
    1751  gggcattgct ccgggtattg aagccgaacc cgtcggcatg tattacatgg
    1801  acatcatcca acgcaaggcc gagcatgatt tgttcatcga aaccttcatg
    1851  gatatctacg gttcaacagc ttccatcatt tgcgccaata tcgaaacgag
    1901  cttattcacc tctggcacta acgtcttgaa ccatcgcatg caaaaagact
    1951  tcgctcgcga cactcctaag ccaggaaccc tccgtcatca acacgccatc
    2001  atcaaccgct tccacgaacc tgaatacgcc taccgccttg gcatccttgc
    2051  taatggtgtt atccctctca gcggctcatt cgaggtcgat atcctcaaag
    2101  aagctgagct tctcgtcaca ggtgaagata tccgcaatct ccctggttta
    2151  cgttgcttgt gcacacgcgg ccttgatgca atcctcggtc tccgcccagt
    2201  ccagcagaag cgcaagaaga tgtgttactt ccgcactctc gatggcaact
    2251  tccatgaagt aacaatcaga tcagagacgc gcgatttaca ggtctggcgt
    2301  gatcacggct accttgctcg tccatacgcg tgccatatta tcgattcaga
    2351  aggcatagaa ttctacgaca aatccaacgg tctctataag ggccgtgtca
    2401  atgtccttgt ttctggattt gccatcccag gtcgcgcgta tcagggccct
    2451  cgcttgcagg tagcaacaat ggccgcccag atctaagcga cgtcccggcg
    2501  acaggaagtc tgtccaacct cattaccctt tccaaagcaa gtcggctacc
    2551  ataccgtaag ctgagggaag gcgtgagagc gtcagactac accgtcgccc
    2601  gcgagttagc tagcgccttt cgcaattctc gcctaactcg ccaaatggat
    2651  catgtctcag atatagctta tcttaatttc ctgagatggg tgttgttacc
    2701  ttacaacggt caaacattac gaccacaccc cagcgtgtgg cgtcagacac
    2751  cctaccccga acatgtcaat ttgaagttcc tgaacaagga aatggagctc
    2801  gaactcttcc cacttaagaa ggccccacaa gccgatctta aagtgaactg
    2851  ttacgcgcga aacgtccttg cttccacaga gctaaccgac gacttactca
    2901  agcagagttt acccataggt ctcaacaatg actcagtttg tggaatcgtc
    2951  atcgttttag agctgcttcg gattgcaggt gttccaagta agttactacc
    3001  aattattggc caagctattg ccaataaaga tccattcatt aaggaattgt
    3051  ccgatttcaa caagatgata ggagcgacga cttcccgtat tgctaacatt
    3101  ctcacagagt gcaatacatt gataggtcgc ggtgttaagt catctgatcc
    3151  aagtgccgat ttgtatcacc gggtagcgcc tgagggcaat aggcacgagg
    3201  cgaagattcc tcgacacatc cttatcgaag ccataaacaa aatctacaaa
    3251  aacgaaatga cagacatgcc tccaccagga gatttcaagc tccacttaat
    3301  aacgagcccc ctatggtgta aggctggttc tcatcatcat ccacacttcg
    3351  ccaagtatag ttcacgcttg gatttcgtta tggatgttcc agcagacaaa
    3401  attgctgctg taccaccctc ggttttcatc actcaagcgg agaaattgga
    3451  acacggtaag actaggtaca tttataactg tgacacantt tcttacctgt
    3501  tcttcgatta catcctacat tatgtcgaat gtgtgtggtc aaacgagtca
    3551  gtcttgctca acccagctgc tatgagtgtt gagcggttta gcgtcctaga
    3601  ctatccggag tattgcatga tcgattacac agatttcaat tctcaacaca
    3651  gccttgaatc ccagaagctt gtattcgagt gtttgagacc gtacttgcca
    3701  agcgaaatgc atccaatctt agattggtgt atcgctagca tggaccacat
    3751  ggaaattggc gggcaacatt ggttaagcac gttgccttcg ggacatagag
    3801  ctactacgtt tatcaactca gtcctgaata aagcatacct gattccttac
    3851  ataggcgaca ccacctcttt ccattgcggt gacgacgtat tactatgtgg
    3901  yaaatacgac tatcagacac tcattgatac cctaccctat gaattaaaca
    3951  agagcaaaca gagctttgga cctaatgccg agttcttgcg cttgcatagg
    4001  cgtgcgggtg acgtgatagg ctacccatca agggctgttt cgagtctcgt
    4051  atctggaaat tggttaagta agacttcatg ggaatggcag ccaagtctca
    4101  tttcggtcac gaatcaatgc aatgtgatca tctcacgctc acagttaaac
    4151  atcaggttca tccccgctat gcagcaggaa ctacgcaatc gttacataga
    4201  taaaatgagc gagcccttcg atgtcggctc ggattactac gtcatgccag
    4251  gttgtccatg ttatagtgat gccgcgacaa caatcgttcc gaatgttccc
    4301  caactggaat gttccgacgt accgttttcg caggcacaaa aggtttttga
    4351  tactatgcgc gacatctgtc ctgagttcac cacagtcaac gacgtcatcg
    4401  acagagttct agctcgtcgg acttccaatg ctgtcaaaaa catcacgtac
    4451  aatgtttgtg ctcctgtagc acctcaagtt tgcatagccg taaacccagc
    4501  acattaccaa tttctgttac gtaagaagta ctacccacgt gaacacattg
    4551  cgccccccgg ctttgatgat tcaacgaact ctaagcttgt tttytcgact
    4601  tacgatctag ctccttcaat tgctatgaaa tcgtgcgctg ttttgacccc
    4651  ggcaaagata atatgcggtc acggactacg cagtggttga ttyagcacgc
    4701  tagtaccgcg cgacagtcgg taccgtctag gccacgtaca gtctattgtc
    4751  ctcggaccct ctgcctatag gttaatagga atacagtgtt actgttgtgt
    4801  gtatcgctct aggcacacga acgtgctacc ccacgtttag ttca