Sequence of DPV Sorghum mosaic virus

Sorghum mosaic virus partial polyprotein gene, genomic RNA, isolate YN19

ACC No: FM997906

Dated: 2010-05-19 | Length: 1840 | CRC: -823137493

                
ID   FM997906; SV 1; linear; genomic RNA; STD; VRL; 1840 BP.
XX
AC   FM997906;
XX
DT   19-MAY-2010 (Rel. 104, Created)
DT   19-MAY-2010 (Rel. 104, Last updated, Version 1)
XX
DE   Sorghum mosaic virus partial polyprotein gene, genomic RNA, isolate YN19
XX
KW   .
XX
OS   Sorghum mosaic virus
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Potyviridae;
OC   Potyvirus.
XX
RN   [1]
RP   1-1840
RA   Adams M.J.;
RT   ;
RL   Submitted (10-FEB-2009) to the EMBL/GenBank/DDBJ databases.
RL   Adams M.J., Plant Pathology and Microbiology, Rothamsted Research, West
RL   Common, Herts., AL5 2JQ, UNITED KINGDOM.
XX
RN   [2]
RA   Wang J.G., Zheng H.Y., Chen H.R., Adams M.J., Chen J.P.;
RT   "Molecular Diversities of Sugarcane mosaic virus and Sorghum mosaic virus
RT   Isolates from Yunnan Province, China";
RL   J. Phytopathol. 158(6):427-432(2010).
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .1840
FT                   /organism="Sorghum mosaic virus"
FT                   /host="Saccharum hybrid cv. Xintaitang 26"
FT                   /isolate="YN19"
FT                   /mol_type="genomic RNA"
FT                   /country="China:Yunnan,Shuangjiang"
FT                   /db_xref="taxon:32619"
FT   CDS             <1. .1608
FT                   /codon_start=1
FT                   /gene="polyprotein"
FT                   /product="polyprotein"
FT                   /protein_id="CAX36855.1"
FT                   /translation="GNNSGQPSTVVDNTLMVIIAFNYTMLSCGIEADMIDEICKMYANG
FT                   DDLLLAIRPDYEHLLDNFSKHFSDLGLNFDFTSRTRDRTELWFMSTRGIKIDNMYIPKL
FT                   EQERIVAILEWDRSLLPQYRLEAICAAMVEAWGYPQLLHEIRKFYAWILEMQPFATLAK
FT                   EGLAPYIAETALRNLYTGEGIKEGELDVYYTQFLKDLPEYIEDELIDVRHQAGGGTVDA
FT                   GAAATDATAQAQRDAAAKAQRDAEAAEKQRQDAAAKKKADDDAKAKADADAKAKSDADA
FT                   KKKADDEAASKAQNQKDKDVDAGTSGTVAVPKLKAMSKKMKLPQAKGKNILHLDFLLGY
FT                   KPQQQDISNTRATRDEFDRWYDALQKEYELDDTQMTVVASGLMVWVIENGCSPNINGVW
FT                   TMMDGDEQRKFPLKPVIEYASPTFRQIMHHFSDAAEAYIEYRNSTERYMPRYGLQRNLT
FT                   DYNLARYAFDFYEITSRTPARAREAHMQMKAAAVRGSNTRMFGLDGNVGESQENTERHT
FT                   AGDVSRNMHSLLGVQQHH"
FT   mat_peptide     <1. .642
FT                   /product="NIb polymerase"
FT   mat_peptide     643. .1605
FT                   /product="Coat protein"
FT   3'UTR           1609. .1840
XX
SQ   Sequence 1840 BP; 605 A; 345 C; 449 G; 441 T; 0 other;

fm997906 Length: 1840  19-MAY-2010  Type: N  Check: 8942  ..

       1  gggaataata gtgggcagcc gtcaactgtt gttgataaca cattaatggt
      51  gattatagcg tttaactata cgatgttgtc atgtggaatt gaagcagata
     101  tgatagatga aatatgcaaa atgtatgcaa acggggatga tcttttgttg
     151  gcaatacgac ctgattatga acatttattg gataattttt caaaacactt
     201  ttctgatcta ggtcttaact tcgattttac atcacgcaca agagatagaa
     251  cggagttgtg gtttatgtca acacgaggca ttaaaattga caatatgtac
     301  atcccaaaat tggaacagga aagaattgtc gccattttag aatgggatag
     351  atcattatta ccacaatata gactagaggc gatatgtgct gcaatggtgg
     401  aagcatgggg atatccacag ttattacatg agattaggaa attctacgct
     451  tggattcttg aaatgcagcc attcgctact ctagcgaaag aaggacttgc
     501  cccgtacata gcagaaacgg ctttgcgcaa tctttataca ggggaaggaa
     551  taaaagaagg ggaattagat gtttactaca cacaatttct caaagacttg
     601  ccggaataca tagaggatga attaattgac gtgcggcatc aggcaggagg
     651  cggtacagta gatgcaggag cagccgcaac agatgcaaca gcacaagcac
     701  agcgtgatgc agcagcgaaa gcccaacgag acgcagaagc ggcagagaag
     751  cagagacaag atgctgcagc taagaagaaa gctgatgatg atgcgaaagc
     801  taaagctgac gcggatgcca aagcaaaatc agatgctgac gcgaagaaga
     851  aagcagacga tgaagcagca agtaaagcac aaaatcaaaa agataaggat
     901  gtggatgccg gcacatccgg cacagtggca gtgcctaaac tcaaagcaat
     951  gtccaagaaa atgaagctac cacaagcaaa agggaaaaac attttacact
    1001  tggattttct tttgggatat aagccacaac aacaagacat ttcaaacacc
    1051  agagctacac gggatgagtt cgataggtgg tacgatgcat tgcagaaaga
    1101  atatgaacta gatgacacgc agatgacagt ggttgcaagc ggactcatgg
    1151  tttgggtcat agagaacgga tgctcaccta atattaatgg tgtttggaca
    1201  atgatggatg gagatgagca aaggaaattt ccactcaagc ccgttattga
    1251  gtatgcatct ccaacattta gacagataat gcaccacttt agtgatgcag
    1301  ctgaagcgta tatagagtat agaaactcga cagagcgtta catgccaaga
    1351  tacggacttc agcgaaactt aaccgactat aacctagccc ggtatgcatt
    1401  tgatttctat gaaataactt cgcgtacacc ggcgagagct agagaggccc
    1451  acatgcagat gaaagcagca gcagtgcgtg gttcaaacac gcgcatgttt
    1501  ggcttggatg ggaatgtcgg cgagagtcag gagaatacag aacgtcacac
    1551  agctggcgat gtgagtcgca atatgcactc ccttcttgga gtgcagcagc
    1601  accattgatg tactgagatc ttcattgcag tatcaagtat ttatatattt
    1651  actatttcag tgagggtctc cctccttagt attatatacg taccttagaa
    1701  atagtagtca ttctgcagag gagtgaggtt tacctccaac tctatggtta
    1751  ctatttccta ctagcgtcga actacattac ggacaccctg ttgtgtggtt
    1801  ctaccatgag tcaggagctg cgagtattgt agcaagagac