Sequence of DPV Sorghum mosaic virus

Sorghum mosaic virus partial polyprotein gene, genomic RNA, isolate YN16

ACC No: FM997903

Dated: 2010-05-19 | Length: 1845 | CRC: 530081192

                
ID   FM997903; SV 1; linear; genomic RNA; STD; VRL; 1845 BP.
XX
AC   FM997903;
XX
DT   19-MAY-2010 (Rel. 104, Created)
DT   19-MAY-2010 (Rel. 104, Last updated, Version 1)
XX
DE   Sorghum mosaic virus partial polyprotein gene, genomic RNA, isolate YN16
XX
KW   .
XX
OS   Sorghum mosaic virus
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Potyviridae;
OC   Potyvirus.
XX
RN   [1]
RP   1-1845
RA   Adams M.J.;
RT   ;
RL   Submitted (10-FEB-2009) to the EMBL/GenBank/DDBJ databases.
RL   Adams M.J., Plant Pathology and Microbiology, Rothamsted Research, West
RL   Common, Herts., AL5 2JQ, UNITED KINGDOM.
XX
RN   [2]
RA   Wang J.G., Zheng H.Y., Chen H.R., Adams M.J., Chen J.P.;
RT   "Molecular Diversities of Sugarcane mosaic virus and Sorghum mosaic virus
RT   Isolates from Yunnan Province, China";
RL   J. Phytopathol. 158(6):427-432(2010).
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1845
FT                   /organism="Sorghum mosaic virus"
FT                   /host="Saccharum hybrid cv. Xuan 3"
FT                   /isolate="YN16"
FT                   /mol_type="genomic RNA"
FT                   /country="China:Yunnan,Menglian"
FT                   /db_xref="taxon:32619"
FT   CDS             <1..1608
FT                   /codon_start=1
FT                   /gene="polyprotein"
FT                   /product="polyprotein"
FT                   /protein_id="CAX36852.1"
FT                   /translation="GNNSGQPSTVVDNTLMVIIAFNYTMLSCGIEADMIDEICKMYANG
FT                   DDLLLAIRPDYECLLDNFSRHFSDLGLNFDFTSRTRDREELWFMSTRGIKIDGMYIPKL
FT                   EQERIVAILEWDRSLLPQYRLEAICAAMVEAWGYPQLLYEIRKFYAWILEMQPFATLAK
FT                   EGLAPYIAETALRNLYTGAGIKEGELDIYYTQFIKDLPEYVEDELIDVFHQAGGGTVDA
FT                   GANTADATARAQQEAAAKAQQDADAKKRADDEAAEKQRQDAAAKKKADDDAKAKADADA
FT                   KKKAGDEAAQRAQNQKDKDVDAGTSGTVTVPKLKAMSKKMRLPQAKGKNILHLDFLLGY
FT                   KPQQQDISNTRATRDEFDRWYAAVQKEYEVDDTQMTVIMSGLMVWCIENGCSPNINGVW
FT                   TMMDGEEQRKFPLKPIIENASPTFRQIMHHFSDAAEAYIEYRNSTERYMPRYGLQRNLT
FT                   DYNLARYAFDFYEITSRTPARAKEAHMQMKAAAVRGSNTRMFGLDGNVGESRENTERHT
FT                   AGDVSRNMHTLLGVQQHH"
FT   mat_peptide     <1..642
FT                   /product="NIb polymerase"
FT   mat_peptide     643..1605
FT                   /product="Coat protein"
FT   3'UTR           1609..1845
XX
SQ   Sequence 1845 BP; 612 A; 353 C; 435 G; 445 T; 0 other;

fm997903 Length: 1845  19-MAY-2010  Type: N  Check: 9589  ..

       1  gggaataata gtgggcagcc atctaccgtt gttgacaaca cattaatggt
      51  gatcatagct ttcaattaca cgatgctatc ttgcggaatt gaagcagata
     101  tgatagatga gatatgcaaa atgtatgcga atggagacga tctcttatta
     151  gccatccgac cagattacga atgcctgctg gataacttct ctagacactt
     201  ctctgatcta ggattaaact ttgatttcac atcacgaact agagatagag
     251  aagaattatg gtttatgtca acacgaggca tcaaaatcga cggaatgtac
     301  atcccaaaat tggagcagga aagaatagtt gctatattag agtgggacag
     351  atcattgttg cctcaatata gattagaagc aatatgtgct gctatggtgg
     401  aagcttgggg atatccacaa ttattatatg agattagaaa attttacgcc
     451  tggattctcg aaatgcaacc atttgcaaca ctggcaaagg aaggactcgc
     501  accatacata gcagaaacag ctttacgcaa tctttacaca ggagcaggaa
     551  tcaaagaagg ggagttggat atttattata cacaatttat caaggattta
     601  cctgagtatg tggaagatga gttgattgat gtgtttcatc aagcaggagg
     651  cggcacagtg gatgcgggag ctaacacagc agatgcaaca gcacgagcgc
     701  aacaagaagc tgcagcaaaa gctcagcagg atgctgatgc gaaaaagagg
     751  gcagacgatg aagcagcaga gaaacagaga caagatgctg ctgcaaagaa
     801  gaaagctgac gacgatgcca aagcaaaagc agatgctgat gcgaaaaaga
     851  aagcaggtga tgaagcagcg caaagagcac aaaatcagaa agacaaagac
     901  gttgatgctg gaacgtcagg aacagtcaca gtaccaaagc ttaaagctat
     951  gtccaagaag atgcgcttgc cacaagcgaa aggaaagaac atcttacatc
    1001  ttgatttctt acttggatat aaaccacagc aacaagacat ttcaaacaca
    1051  cgagcaacac gagatgagtt tgatagatgg tatgcagccg tacaaaagga
    1101  atacgaagtc gatgacacac aaatgacagt tatcatgagt ggacttatgg
    1151  tatggtgcat agagaatggt tgctcaccga acatcaatgg tgtttggacc
    1201  atgatggatg gagaagaaca aaggaaattt cctttaaagc caataattga
    1251  aaatgcttct ccaactttta gacaaataat gcaccacttt agtgatgcag
    1301  ctgaagcgta tatagaatac cgtaattcga cggaacgata catgccaaga
    1351  tacggacttc agcgaaactt gaccgactac aatttagcac gatatgcttt
    1401  tgatttctat gaaattacat cacgcacacc tgctcgtgct aaggaggccc
    1451  acatgcagat gaaagccgca gcagttcgtg gttcaaacac ccgaatgttc
    1501  ggcttggacg gtaatgtcgg cgagtctcgg gagaatactg aacgtcatac
    1551  tgctggcgac gtgagtcgta acatgcacac ccttcttggg gtgcaacagc
    1601  accattgacg tgacggaaac cctgtttgca gtatctaagt attttatata
    1651  ttttctatga aagtgaggct atgcctcgtt agtataatat atatttactt
    1701  ttcgagtatt tactattctg caagggagtg aggactcatc ctccaacctt
    1751  ttagtattta ctttactagc ttcgaaccac tagacggacg atctgttgtg
    1801  tggccgtgct acgatgcaga tgccgcgagt cttgtggcaa gagac