Sequence of DPV Soybean mosaic virus

Soybean mosaic virus partial gene for polyprotein, genomic RNA, strain P, isolate 2

ACC No: AJ511347

Dated: 2004-01-29 | Length: 1733 | CRC: -1522093201

                !!NA_SEQUENCE 1.0
ID   SMO511347  standard; mRNA; VRL; 1733 BP.
XX
AC   AJ511347;
XX
SV   AJ511347.1
XX
DT   29-JAN-2004 (Rel. 78, Created)
DT   29-JAN-2004 (Rel. 78, Last updated, Version 1)
XX
DE   Soybean mosaic virus partial gene for polyprotein, genomic RNA, strain P,
DE   isolate 2
XX
KW   "coat protein; NIb protein; polyprotein.
XX
OS   Soybean mosaic virus
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Potyviridae;
OC   Potyvirus.
XX
RN   [1]
RP   1-1733
RA   Adams M.J.;
RT   ;
RL   Submitted (09-OCT-2002) to the EMBL/GenBank/DDBJ databases.
RL   Adams M.J., Plant Pathogen Interactions Division, Rothamsted Research,
RL   Harpenden, Herts AL5 2JQ, UNITED KINGDOM.
XX
RN   [2]
RA   Chen J., Zheng H.Y., Lin L., Adams M.J., Antoniw J.F., Zhao M.F.,
RA   Shang Y.F., Chen J.P.;
RT   "A virus related to Soybean mosaic virus from Pinellia ternata in China and
RT   its comparison with local soybean SMV isolates";
RL   Arch. Virol. 149(2):349-363(2004).
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .1733
FT                   /country="China:Zhejiang"
FT                   /db_xref="taxon:12222"
FT                   /mol_type="mRNA"
FT                   /virion
FT                   /organism="Soybean mosaic virus"
FT                   /isolate="2"
FT                   /isolation_source="Hangzhou"
FT                   /strain="P"
FT                   /specific_host="Pinellia ternata"
FT   CDS             <1. .1482
FT                   /codon_start=1
FT                   /product="polyprotein"
FT                   /product="hypothetical protein"
FT                   /protein_id="CAD54081.1"
FT                   /translation="GNNSGQPSTVVDNTLMVVIAMYYSCCKQGWSEKEIQGRLVFFANG
FT                   DDIILAVNEEDVWLYDTLSTSFAELGLNYNSGERTRKREELWFMSHKAVLIDGVYIPKL
FT                   EPERIVSILEWDRSKELMHRTEAICAAMIEAWGYTELLQEIRKFYLWLLGKDEFKELAA
FT                   SGKAPYIAETALRKLYTDVNTQESELQRYLEVLDFNHADNCCESVSLQSGKETGEDLDA
FT                   GKETKKNTNNEKGDKSQNTQSTQNGKGTTNSGNKDKDINVGSKGKVVPRLQKITRKMNL
FT                   PMVGGKIILNLDHLLEYKPNQVDLFNTRATKTQFTAWYNAVKAEYGLEDEQMGVVMNGF
FT                   VVWCIDNGTSPDVNGMWVMMDGEEQIEYPLKPIVENAKPTLRQIMHHFSDAAEAYIEMR
FT                   NSEGPYMPRYGLLRNLRDRDLARYAFDFYEVTSKTPNRAREALAQMKAAALTGVNNKLF
FT                   GLDGNISTNAENTERHTARDVNQNMHTLLGMGPPQ"
FT   mat_peptide     <1. .633
FT                   /product="NIb protein"
FT   mat_peptide     634. .1479
FT                   /product="coat protein"
FT   3'UTR           1483. .1733
XX
SQ   Sequence 1733 BP; 553 A; 304 C; 436 G; 440 T; 0 other;

AJ511347  Length: 1733  February 3, 2004 09:02  Type: N  Check: 6830  ..

       1  ggaaacaata gcgggcagcc atctacagtt gtagacaaca ccttgatggt
      51  ggtcattgct atgtattact catgctgcaa acaaggatgg tcagagaaag
     101  aaatccaggg gagattggtg ttctttgcca atggggatga cataattcta
     151  gcagttaatg aggaggatgt atggctatat gacacgctta gcacctcgtt
     201  tgctgaactt ggtcttaatt ataattccgg tgagcgaacg aggaagagag
     251  aggaactatg gttcatgtca cataaagccg tactaattga tggagtttat
     301  atcccaaagc ttgagccgga acggatagtc tccatcttgg aatgggatag
     351  aagcaaggag cttatgcacc gaactgaagc aatttgtgct gcaatgattg
     401  aagcatgggg atacacagaa ctgctacagg aaattcgcaa gttctacttg
     451  tggctcctgg gcaaagatga atttaaagaa ctcgctgcat ctggaaaagc
     501  accgtatatt gcggaaacag ctttgagaaa gctttacaca gatgttaaca
     551  cacaggaaag cgaactgcaa agatacctcg aagttctgga tttcaaccat
     601  gcggataatt gttgtgaatc agtatccttg caatcaggaa aggagacagg
     651  cgaagatttg gatgcaggta aagaaacaaa gaagaatacc aacaatgaaa
     701  aaggagataa gtctcagaat acacaaagca ctcagaatgg caaaggaaca
     751  acaaattctg gaaataagga taaggatata aatgttggat caaaaggaaa
     801  ggttgttcca cgcctgcaaa agatcacaag gaaaatgaat ctcccgatgg
     851  ttggtgggaa aatcattctt aacctggatc atttgctcga gtacaaacct
     901  aatcaggttg atttattcaa tactcgggca acaaagacac agtttacagc
     951  atggtacaat gcagtcaagg ccgagtatgg actggaggat gagcagatgg
    1001  gtgtggttat gaatggcttc gtggtttggt gcatagataa tggcacatcc
    1051  ccagatgtta atgggatgtg ggtaatgatg gatggagaag aacaaattga
    1101  gtatccgctg aagcccattg ttgaaaatgc aaaaccaact ttaaggcaga
    1151  tcatgcatca tttttcagat gcagcggagg cttatattga gatgagaaac
    1201  tctgaaggtc cgtatatgcc tagatacgga ctgctgagaa atttgaggga
    1251  cagagatctg gcgcgatatg cttttgattt ctatgaggtg acttccaaga
    1301  caccaaatag agcaagagag gcactagcgc aaatgaaggc tgcagctctc
    1351  acgggagtta acaacaagtt gtttggactt gatggtaata tctcgaccaa
    1401  tgccgaaaat actgagaggc acactgcaag ggacgtgaac caaaatatgc
    1451  acactctttt gggaatgggt ccaccacagt aaaggctagg taaactggcc
    1501  acagttagca tttcgggtcg ctttatagtt ttctataata tcgtattgca
    1551  cttattttaa gtacagcgtg attgtatcac ctttattgta cttatgctta
    1601  gcgtggctta gccatcctag tgtgctttat attatagttt atgaatggca
    1651  gggagaacta ttgcaatgct ggagctattc gcagagtgat tccatcacga
    1701  gagtggccga agtacggcaa tatttgttgt cct