Sequence of DPV Sorghum mosaic virus

Sorghum mosaic virus partial polyprotein gene, genomic RNA, isolate YN14

ACC No: FM997901

Dated: 2010-05-19 | Length: 1845 | CRC: 58631907

                
ID   FM997901; SV 1; linear; genomic RNA; STD; VRL; 1845 BP.
XX
AC   FM997901;
XX
DT   19-MAY-2010 (Rel. 104, Created)
DT   19-MAY-2010 (Rel. 104, Last updated, Version 1)
XX
DE   Sorghum mosaic virus partial polyprotein gene, genomic RNA, isolate YN14
XX
KW   .
XX
OS   Sorghum mosaic virus
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Potyviridae;
OC   Potyvirus.
XX
RN   [1]
RP   1-1845
RA   Adams M.J.;
RT   ;
RL   Submitted (10-FEB-2009) to the EMBL/GenBank/DDBJ databases.
RL   Adams M.J., Plant Pathology and Microbiology, Rothamsted Research, West
RL   Common, Herts., AL5 2JQ, UNITED KINGDOM.
XX
RN   [2]
RA   Wang J.G., Zheng H.Y., Chen H.R., Adams M.J., Chen J.P.;
RT   "Molecular Diversities of Sugarcane mosaic virus and Sorghum mosaic virus
RT   Isolates from Yunnan Province, China";
RL   J. Phytopathol. 158(6):427-432(2010).
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .1845
FT                   /organism="Sorghum mosaic virus"
FT                   /host="Saccharum hybrid cv. Yuetang 79/177"
FT                   /isolate="YN14"
FT                   /mol_type="genomic RNA"
FT                   /country="China:Yunnan,Menghai"
FT                   /db_xref="taxon:32619"
FT   CDS             <1. .1608
FT                   /codon_start=1
FT                   /gene="polyprotein"
FT                   /product="polyprotein"
FT                   /protein_id="CAX36850.1"
FT                   /translation="GNNSGQPPTVVDNTLMVIIAFNYTMLSCGIEADMIDEICKMYANG
FT                   DDLLLAIRPDYECLLDNFSRHFSDLGLNLDFTSRTRDREELWFMSTRGIKIDGMYIPKL
FT                   EQERIVAILEWDRSLLPQYRLEAICAAMVEAWGYPQLLHEIRKFYAWILEMQPFATLAK
FT                   EGLAPYIAETALRNLYTGAGVKEGELDIYYTQFIKDLPEYVEDELIDVFHQAGGDTVDA
FT                   GANTADATARAQQEAAAKAQQDADAKKRADDEAAEKQRQDAAAKKKADDDAKAKADADA
FT                   KKKADDEAAQRAQNQKDKDVDAGTSGTVTVPKLKAMSKKMRLPQAKGKNILHLDFLLGY
FT                   KPQQQDISNTRATRDEFDRWYAAVQKEYEVDDTLMTVVMSGLMVWCIENGCSPNINGVW
FT                   TMMDGEEQRKFPLKPIIENASPTFRQIMHHFSDAAEAYIEYRNSTERYMPRYGLQRNLT
FT                   DYNLARYAFDFYEITSRTPARAKEAHMQMKAAAVRGSNTRMFGLDGNVGESQENTERHT
FT                   AGDVSRSMHTLLGVQQHH"
FT   mat_peptide     <1. .642
FT                   /product="NIb polymerase"
FT   mat_peptide     643. .1605
FT                   /product="Coat protein"
FT   3'UTR           1609. .1845
XX
SQ   Sequence 1845 BP; 607 A; 353 C; 438 G; 447 T; 0 other;

fm997901 Length: 1845  19-MAY-2010  Type: N  Check: 1245  ..

       1  gggaataata gtgggcagcc acctaccgtt gttgacaaca cattaatggt
      51  gatcatagct ttcaattaca cgatgctatc ttgcggaatt gaagcagata
     101  tgatagatga gatatgcaaa atgtatgcaa atggagacga tctcttatta
     151  gccattcgac cagattacga atgcctgctg gataactttt ctagacactt
     201  ctctgatcta ggattaaacc ttgatttcac atcacgaact agagatagag
     251  aagaattatg gtttatgtca acacgaggca tcaaaatcga cggaatgtac
     301  atcccaaaat tggagcagga aagaatagtt gctatattag agtgggacag
     351  atcattgttg cctcaatata gattagaagc aatatgtgct gctatggtgg
     401  aagcttgggg atatccacaa ttattacatg agattagaaa attttacgcc
     451  tggattctcg aaatgcaacc atttgccaca ctggcaaagg aaggactcgc
     501  accatacata gcagaaacag ctttacgcaa tctttacaca ggagcaggag
     551  tcaaagaagg ggagttggat atttattata cacaattcat caaggattta
     601  cctgagtatg tggaagatga gttgattgat gtgttccatc aagcaggggg
     651  cgacacagtg gatgcgggag ctaacacagc agatgcaaca gcacgagcgc
     701  aacaagaagc tgcagcaaaa gctcagcagg atgctgatgc gaaaaagagg
     751  gcagatgatg aagcagcaga gaaacagaga caagatgctg ctgcaaagaa
     801  gaaagctgat gacgatgcca aagcaaaagc agatgctgat gcgaaaaaga
     851  aagcagatga tgaagcagcg caaagagcac aaaatcagaa agataaagac
     901  gttgatgctg gaacgtcagg gacagtcaca gtaccaaagc ttaaagctat
     951  gtccaagaag atgcgcttgc cacaagcgaa aggaaagaac atcttacatc
    1001  ttgatttctt acttggatat aaaccacagc aacaagacat ttcaaacaca
    1051  cgagcaacac gagatgagtt tgatagatgg tatgcagccg tacaaaagga
    1101  atacgaagtc gatgacacac taatgacagt tgtcatgagt ggacttatgg
    1151  tatggtgcat agagaatggt tgctcaccga acatcaatgg tgtttggacc
    1201  atgatggatg gagaagaaca aaggaaattt cctttaaagc caataattga
    1251  aaatgcttct ccaactttta gacaaataat gcaccacttt agtgatgcag
    1301  ctgaagcgta tatagaatac cgtaattcga cggaacgata catgccaaga
    1351  tacggacttc agcgaaactt gaccgactac aatttagcac ggtatgcttt
    1401  tgatttctat gaaattacat cacgcacacc tgctcgtgct aaggaggccc
    1451  acatgcagat gaaagccgca gcagttcggg gttcaaacac ccgaatgttc
    1501  ggcttggacg gtaatgtcgg cgagtctcag gagaatactg aacgtcatac
    1551  tgctggcgac gtgagtcgta gcatgcacac ccttcttggg gtgcaacagc
    1601  accattgacg tgacggaaac cctgtttgca gtatctaagt attttatata
    1651  ttttctatgt aagtgaggct atgcctcgtt agtataatat atatttactt
    1701  ttcgagtatt tactattctg caagggagtg aggactcatc ctccaacctt
    1751  ttagtattta ctttactagc ttcgaaccac tagacggacg atctgttgtg
    1801  tggccgtgct acgatgcaga tgctgcgagt cttgtggcaa gagac