Sequence of DPV Sorghum mosaic virus

Sorghum mosaic virus partial polyprotein gene, genomic RNA, isolate YN13

ACC No: FM997900

Dated: 2010-05-19 | Length: 1867 | CRC: -1175789093

                
ID   FM997900; SV 1; linear; genomic RNA; STD; VRL; 1867 BP.
XX
AC   FM997900;
XX
DT   19-MAY-2010 (Rel. 104, Created)
DT   19-MAY-2010 (Rel. 104, Last updated, Version 1)
XX
DE   Sorghum mosaic virus partial polyprotein gene, genomic RNA, isolate YN13
XX
KW   .
XX
OS   Sorghum mosaic virus
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Potyviridae;
OC   Potyvirus.
XX
RN   [1]
RP   1-1867
RA   Adams M.J.;
RT   ;
RL   Submitted (10-FEB-2009) to the EMBL/GenBank/DDBJ databases.
RL   Adams M.J., Plant Pathology and Microbiology, Rothamsted Research, West
RL   Common, Herts., AL5 2JQ, UNITED KINGDOM.
XX
RN   [2]
RA   Wang J.G., Zheng H.Y., Chen H.R., Adams M.J., Chen J.P.;
RT   "Molecular Diversities of Sugarcane mosaic virus and Sorghum mosaic virus
RT   Isolates from Yunnan Province, China";
RL   J. Phytopathol. 158(6):427-432(2010).
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1867
FT                   /organism="Sorghum mosaic virus"
FT                   /host="Saccharum officinarum cv. Badila"
FT                   /isolate="YN13"
FT                   /mol_type="genomic RNA"
FT                   /country="China:Yunnan,Lancang"
FT                   /db_xref="taxon:32619"
FT   CDS             <1..1632
FT                   /codon_start=1
FT                   /gene="polyprotein"
FT                   /product="polyprotein"
FT                   /protein_id="CAX36849.1"
FT                   /translation="GNNSGQPSTVVDNTLMVIIAFNYTMLSCGIEEDMIDEICKMYANG
FT                   DDLLLAVRPDYEFLLDDFSKHFSNLGLNFDFTSRTRDKTELWFMSTRGIKVDGMYIPKL
FT                   EQERIVAILEWDRPLLPQYRLEAICAAMVEAWGYPQLLHEIRKFYAWILEMQPFATLAK
FT                   EGLAPYIAETALRNLYTGAGIKEEEINVYYTQFLKDLPEYVEDELIDVRHQAGGATVDA
FT                   GAAAAEATAQAQRDAAAKAQRDADAKKKADDEAAEKQRQDATAKKKADDDAKAKADADA
FT                   KAKADAEAKKKADDEAAKKTQNQKDRDVDAGTSGTVAVPKLKAMSKKMRLPQAKGKNIL
FT                   HLDFLLGYKPQQQDISNTRSTRDEFDRWYDALQKKYELDDTQMTVVASGLMVWAIENGC
FT                   SPNINGVWTMMDGEEQRKFPLKPVIEYASPTFRQIMHHFSDAAEAYIEYRNSTERYMPR
FT                   YGLQRNLTDYNLARYAFDFYEITSRTPARAREAHMQMKAAAVRGSNTRMFGLDGNVGES
FT                   QENTERHTAGDVSRNMHSLLGVQQHH"
FT   mat_peptide     <1..642
FT                   /product="NIb polymerase"
FT   mat_peptide     643..1629
FT                   /product="Coat protein"
FT   3'UTR           1633..1867
XX
SQ   Sequence 1867 BP; 610 A; 351 C; 457 G; 449 T; 0 other;

fm997900 Length: 1867  19-MAY-2010  Type: N  Check: 1610  ..

       1  gggaacaaca gcgggcaacc atcaaccgtt gttgacaaca cactaatggt
      51  gataatagca ttcaattaca caatgttgtc gtgtggaata gaagaggaca
     101  tgatcgatga aatatgcaaa atgtatgcga atggagacga tttattgtta
     151  gcggttcggc cagattatga gtttctgcta gatgactttt caaaacactt
     201  ttcaaatctt ggactaaatt ttgattttac atcacgaacc agagacaaga
     251  cagaattgtg gttcatgtca acaagaggta tcaaagtgga tggaatgtat
     301  attccaaagt tggagcaaga gaggatagtc gcaatacttg aatgggatag
     351  accgctgtta ccacagtaca ggttagaagc tatttgtgca gccatggtag
     401  aggcatgggg ttatccacaa ctcttacatg aaattaggaa gttttatgct
     451  tggattctcg aaatgcaacc ctttgccact ttagccaaag aaggccttgc
     501  cccatacata gcagaaacag ctctacgtaa tctttatact ggagcgggca
     551  ttaaagaaga ggaaataaat gtatattaca cgcaatttct taaggactta
     601  cctgaatatg ttgaagatga attaatcgat gtgcgtcacc aagcaggagg
     651  tgctacagtg gatgcaggag cagctgcagc tgaggcaact gcacaagcac
     701  agcgggatgc agcagcaaaa gctcagcgag atgctgatgc aaagaagaag
     751  gctgacgatg aagcagcaga aaagcaaaga caggatgcta ctgcgaagaa
     801  gaaagcggat gatgacgcca aggccaaagc tgatgccgat gccaaggcta
     851  aagctgatgc tgaagctaag aagaaagcag atgatgaggc agcaaagaaa
     901  acacaaaacc agaaggatag ggatgtcgat gctggaacat caggcacagt
     951  ggcagtgcca aagctcaaag ctatgtctaa gaaaatgaga ttaccacaag
    1001  ctaaaggaaa gaacattttg cacttagact tccttttagg ttacaagcca
    1051  cagcaacagg atatttcgaa cacaagatca actagggatg aattcgacag
    1101  atggtacgat gcattgcaga aaaaatatga gctggatgat acacagatga
    1151  cagtcgttgc aagcggactc atggtctggg ctatcgagaa tgggtgttcg
    1201  cctaatatta atggtgtttg gacaatgatg gatggagaag agcaaaggaa
    1251  atttcctttg aaacctgtca tagaatatgc ttctccaacg tttagacaga
    1301  taatgcacca ctttagtgat gcagctgaag cgtatattga gtatagaaac
    1351  tcaacggaac gttatatgcc aagatatgga cttcagcgaa acttaaccga
    1401  ctataaccta gcacgatacg catttgattt ttatgaaata acatcgcgta
    1451  caccagcaag agctagagag gcccacatgc agatgaaagc agcagcagtg
    1501  cgtggttcaa acacgcgcat gtttggcttg gatgggaatg tcggggaaag
    1551  tcaggagaat acagaacgtc acacagctgg cgacgtgagt cgcaacatgc
    1601  actctctcct tggggtgcag cagcatcact gatgtgctga aatcttcact
    1651  gcagtatttt aagtatttta tattttacta tttcagtgag ggtttccctc
    1701  cttagtatta tatatgtact ttagaaatgg tagtcaatct gcaggggaat
    1751  gaggtatcac ctctaaccct ttgattacta tttcctacta gcgtcgaact
    1801  acattacgga caccctgttg tgtggttcca ccacgagtca ggagctgcga
    1851  gtattgtagc aagagac