Sequence of DPV Tobacco mosaic virus

Tobacco mosaic virus gene for 180K protein, complete cds.

ACC No: D78608

Dated: 2010-06-15 | Length: 4851 | CRC: 809568550

                ID   D78608; SV 1; linear; genomic RNA; STD; VRL; 4851 BP.
XX
AC   D78608;
XX
DT   16-DEC-1995 (Rel. 46, Created)
DT   15-JUN-2010 (Rel. 105, Last updated, Version 8)
XX
DE   Tobacco mosaic virus gene for 180K protein, complete cds.
XX
KW   .
XX
OS   Tobacco mosaic virus
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Virgaviridae;
OC   Tobamovirus.
XX
RN   [1]
RP   1-4851
RA   Watanabe T.;
RT   ;
RL   Submitted (04-DEC-1995) to the EMBL/GenBank/DDBJ databases.
RL   Contact:Takato Watanabe National Institute of Genetics, Department of
RL   Molecular Genetics; Yata 1111, Mishima, Shizuoka 411, Japan
XX
RN   [2]
RA   Meshi T., Ohno T., Okada Y.;
RT   "Nucleotide sequence and its character of cistron coding for the 30K
RT   protein of tobacco mosaic virus(om strain)";
RL   J. Biochem. 91:1441-1444(1982).
XX
RN   [3]
RA   Meshi T., Ishikawa M., Takamatsu N., Ohno T., Okada Y.;
RT   "The 5'-terminal sequence of TMV RNA. Question on the polymorphism found in
RT   vulgare strain";
RL   FEBS Lett. 162:282-285(1983).
XX
RN   [4]
RA   Watanabe T., Hibi T., Ishihama A.;
RT   "Nucleotide sequence of the coding region for 180K protein of tobacco
RT   mosaic virus common strain OM";
RL   Unpublished.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .4851
FT                   /organism="Tobacco mosaic virus"
FT                   /strain="OM"
FT                   /mol_type="genomic RNA"
FT                   /db_xref="taxon:12242"
FT   CDS             1. .4851
FT                   /codon_start=1
FT                   /transl_except=(pos:3349. .3351,aa:OTHER)
FT                   /product="180K protein"
FT                   /note="readthrough"
FT                   /db_xref="GOA:O93058"
FT                   /db_xref="InterPro:IPR000606"
FT                   /db_xref="InterPro:IPR001788"
FT                   /db_xref="InterPro:IPR002588"
FT                   /db_xref="InterPro:IPR007094"
FT                   /db_xref="UniProtKB/Swiss-Prot:O93058"
FT                   /protein_id="BAA11429.1"
FT                   /translation="MAYTQTATTSALLDTVRGNNSLVNDLAKRRLYDTAVEEFNARDRR
FT                   PKVNFSKVISEEQTLIATRAYPEFQITFYNTQNAVHSLAGGLRSLELEYLMMQIPYGSL
FT                   TYDIGGNFASHLFKGRAYVHCCMPNLDVRDIMRHEGQKDSIELYLSRLERGGKTVPNFQ
FT                   KEAFDRYAEIPEDAVCHNTFQTCEHQPMQQSGRVYAIALHSIYDIPADEFGAALLRKNV
FT                   HTCYAAFHFSENLLLEDSYVNLDEINACFSRDGDKLTFSFASESTLNYCHSYSNILKYV
FT                   CKTYFPASNREVYMKEFLVTRVNTWFCKFSRIDTFLLYKGVAHKSVDSEQFYTAMEDAW
FT                   HYKKTLAMCNSERILLEDSSSVNYWFPKMRDMVIVPLFDISLETSKRTRKEVLVSKDFV
FT                   FTVLNHIRTYQAKALTYANVLSFVESIRSRVIINGVTARSEWDVDKSLLQSLSMTFYLH
FT                   TKLAVLKDDLLISKFSLGSKTVCQHVWDEISLAFGNAFPSVKERLLNRKLIRVAGDALE
FT                   IRVPDLYVTFHDRLVTEYKASVDMPALDIRKKMEETEVMYNALSELSVLRESDKFDVDV
FT                   FSQMCQSLEVDPMTAAKVIVAVMSNESGLTLTFERPTEANVALALQDQEKASEGALVVT
FT                   SREVEEPSMKGSMARGELQLAGLAGDHPESSYSRNEEIESLEQFHMATADSLIRKQMSS
FT                   IVYTGPIKVQQMKNFIDSLVASLSAAVSNLVKILKDTAAIDLETRQKFGVLDVASRKWL
FT                   IKPTAKSHAWGVVETHARKYHVALLEYDEQGVVTCDDWRRVAVSSESVVYSDMAKLRTL
FT                   RRLLRNGEPHVSSAKVVLVDGVPGCGKTKEILSRVNFDEDLILVPGKQAAEMIRRRANS
FT                   SGIIVATKDNVKTVDSFMMNFGKSTRCQFKRLFIDEGLMLHTGCVNFLVAMSLCEIAYV
FT                   YGDTQQIPYINRVSGFPYPAHFAKLEVDEVETRRTTLRCPADVTHYLNRRYEGFVMSTS
FT                   SVKKSVSQEMVGGAAVINPISKPLHGKILTFTQSDKEALLSRGYSDVHTVHEVQGETYS
FT                   DVSLVRLTPTPVSIIAGDSPHVLVALSRHTCSLKYYTVVMDPLVSIIRDLEKLSSYLLD
FT                   MYKVDAGTQXQLQIDSVFKGSNLFVAAPKTGDISDMQFYYDKCLPGNSTMMNNFDAVTM
FT                   RLTDISLNVKDCILDMSKSVAAPKDQIKPLTPMVRTAAEMPRQTGLLENLVAMIKRNFN
FT                   APELSGIIDIENTASLVVDKFFDSYLLKEKRKPNKNVSLFSRESLNRWLEKQEQVTIGQ
FT                   LADFDFVDLPAVDQYRHMIKAQPKQKLDTSIQTEYPALQTIVYHSKKINAIFGPLFSEL
FT                   TRQLLDSVDSSRFLFFTRKTPAQIEDFFGDLDSHVPMDVLELDISKYDKSQNEFHCAVE
FT                   YEIWRRLGFEDFLGEVWKQGHRKTTLKDYTAGIKTCIWYQRKSGDVTTFIGNTVIIAAC
FT                   LASMLPMEKIIKGAFCGDDSLLYFPKGCEFPDVQHSANLMWNFEAKLFKKQYGYFCGRY
FT                   VIHHDRGCIVYYDPLKLISKLGAKHIKDWEHLEEFRRSLCDVAVSLNNCAYYTQLDDAV
FT                   REVHKTAPPGSFVYKSLVKFLSDKVLFRSLFIDGSSC"
FT   CDS             1. .3351
FT                   /codon_start=1
FT                   /product="130K protein"
FT                   /db_xref="GOA:O93058"
FT                   /db_xref="InterPro:IPR000606"
FT                   /db_xref="InterPro:IPR001788"
FT                   /db_xref="InterPro:IPR002588"
FT                   /db_xref="InterPro:IPR007094"
FT                   /db_xref="UniProtKB/Swiss-Prot:O93058"
FT                   /protein_id="BAA11430.1"
FT                   /translation="MAYTQTATTSALLDTVRGNNSLVNDLAKRRLYDTAVEEFNARDRR
FT                   PKVNFSKVISEEQTLIATRAYPEFQITFYNTQNAVHSLAGGLRSLELEYLMMQIPYGSL
FT                   TYDIGGNFASHLFKGRAYVHCCMPNLDVRDIMRHEGQKDSIELYLSRLERGGKTVPNFQ
FT                   KEAFDRYAEIPEDAVCHNTFQTCEHQPMQQSGRVYAIALHSIYDIPADEFGAALLRKNV
FT                   HTCYAAFHFSENLLLEDSYVNLDEINACFSRDGDKLTFSFASESTLNYCHSYSNILKYV
FT                   CKTYFPASNREVYMKEFLVTRVNTWFCKFSRIDTFLLYKGVAHKSVDSEQFYTAMEDAW
FT                   HYKKTLAMCNSERILLEDSSSVNYWFPKMRDMVIVPLFDISLETSKRTRKEVLVSKDFV
FT                   FTVLNHIRTYQAKALTYANVLSFVESIRSRVIINGVTARSEWDVDKSLLQSLSMTFYLH
FT                   TKLAVLKDDLLISKFSLGSKTVCQHVWDEISLAFGNAFPSVKERLLNRKLIRVAGDALE
FT                   IRVPDLYVTFHDRLVTEYKASVDMPALDIRKKMEETEVMYNALSELSVLRESDKFDVDV
FT                   FSQMCQSLEVDPMTAAKVIVAVMSNESGLTLTFERPTEANVALALQDQEKASEGALVVT
FT                   SREVEEPSMKGSMARGELQLAGLAGDHPESSYSRNEEIESLEQFHMATADSLIRKQMSS
FT                   IVYTGPIKVQQMKNFIDSLVASLSAAVSNLVKILKDTAAIDLETRQKFGVLDVASRKWL
FT                   IKPTAKSHAWGVVETHARKYHVALLEYDEQGVVTCDDWRRVAVSSESVVYSDMAKLRTL
FT                   RRLLRNGEPHVSSAKVVLVDGVPGCGKTKEILSRVNFDEDLILVPGKQAAEMIRRRANS
FT                   SGIIVATKDNVKTVDSFMMNFGKSTRCQFKRLFIDEGLMLHTGCVNFLVAMSLCEIAYV
FT                   YGDTQQIPYINRVSGFPYPAHFAKLEVDEVETRRTTLRCPADVTHYLNRRYEGFVMSTS
FT                   SVKKSVSQEMVGGAAVINPISKPLHGKILTFTQSDKEALLSRGYSDVHTVHEVQGETYS
FT                   DVSLVRLTPTPVSIIAGDSPHVLVALSRHTCSLKYYTVVMDPLVSIIRDLEKLSSYLLD
FT                   MYKVDAGTQ"
FT   misc_difference 57
FT                   /replace="c"
FT                   /note="conflict"
FT                   /citation=[3]
FT   misc_difference 195
FT                   /replace="g"
FT                   /note="conflict"
FT                   /citation=[3]
FT   misc_difference 4738
FT                   /replace="a"
FT                   /note="conflict"
FT                   /citation=[2]
FT   misc_difference 4796
FT                   /replace="t"
FT                   /note="conflict"
FT                   /citation=[2]
FT   misc_difference 4830
FT                   /replace="c"
FT                   /note="conflict"
FT                   /citation=[2]
XX
SQ   Sequence 4851 BP; 1385 A; 939 C; 1171 G; 1356 T; 0 other;

d78608 Length: 4851  15-JUN-2010  Type: N  Check: 4853  ..

       1  atggcataca cacagacagc taccacatca gctttgctgg acactgtccg
      51  aggaaacaac tccttggtca atgatctagc aaagcgtcgt ctttacgaca
     101  cagcggttga agagtttaac gctcgtgacc gcaggcccaa agtgaacttt
     151  tcaaaagtaa taagcgagga gcagacgctt attgctaccc gggcgtatcc
     201  agaattccaa attacatttt ataacacgca aaatgccgtg cattcgcttg
     251  caggtggatt gcgatcttta gaactggaat atctgatgat gcaaattccc
     301  tacggatcat tgacttatga cataggcggg aattttgcat cgcatctgtt
     351  caagggacga gcatatgtac actgctgcat gcccaacctg gacgttcgag
     401  acatcatgcg gcatgaaggc cagaaagaca gtattgaact atacctttct
     451  aggctagaga gagggggaaa aacagtcccc aacttccaaa aggaagcatt
     501  tgacagatac gcagaaattc ctgaagacgc tgtctgtcac aatactttcc
     551  agacatgcga acatcagccg atgcagcaat caggcagagt gtacgccatt
     601  gcgctacaca gcatatatga catacccgct gatgagttcg gggcagcact
     651  tttgaggaaa aatgtccata cgtgctatgc cgctttccac ttctctgaga
     701  acctgcttct tgaagattca tatgtcaatt tggacgaaat caacgcgtgt
     751  ttttcgcgcg atggagacaa gttgaccttt tcttttgcat cagagagtac
     801  tcttaattac tgtcatagtt attctaatat tcttaagtat gtgtgcaaaa
     851  cttacttccc ggcctctaat agagaggttt acatgaagga gtttttagtc
     901  accagggtta atacctggtt ttgtaagttt tctagaatag atacttttct
     951  tttgtacaaa ggtgtggccc ataaaagtgt agatagtgag cagttttata
    1001  ctgcaatgga agacgcatgg cattacaaaa agactcttgc aatgtgcaac
    1051  agcgagagaa tcctccttga ggattcatca tcagtcaatt actggtttcc
    1101  caaaatgagg gatatggtca tcgtaccatt attcgatatt tctttggaga
    1151  ctagtaagag gacgcgtaag gaagtcttag tgtccaagga tttcgtgttt
    1201  acagtgctta accacattcg aacataccag gcgaaagctc ttacatacgc
    1251  aaatgttttg tccttcgtcg aatcgattcg atcgagggta atcattaacg
    1301  gtgtgacagc gaggtctgaa tgggatgtgg acaaatcttt gttacaatcc
    1351  ttgtccatga cgttttacct gcatactaag cttgccgttc taaaggatga
    1401  cttactgatt agcaagttta gtctcggttc gaaaacggtg tgccagcatg
    1451  tgtgggatga gatttcgctg gcgtttggga acgcatttcc ctccgtgaaa
    1501  gagaggctct tgaacaggaa acttatcaga gtggcaggcg acgcactaga
    1551  gatcagggtg cctgatctat atgtgacctt ccacgaccga ttagtgactg
    1601  agtacaaggc ctctgtggac atgcctgcgc ttgacattag gaagaagatg
    1651  gaagaaacgg aagtgatgta caatgcactt tcagaattat cggtgttaag
    1701  ggagtctgac aaattcgatg ttgatgtttt ttcccagatg tgccaatctt
    1751  tggaagttga cccaatgacg gcagcgaagg ttatagtcgc ggtcatgagc
    1801  aatgagagcg gtctgactct cacatttgaa cgacctactg aggcgaatgt
    1851  tgcgctagct ttacaggatc aagagaaggc ttcagaaggt gctttggtag
    1901  ttacctcaag agaagttgaa gaaccgtcca tgaagggttc gatggccaga
    1951  ggagagttac aattagctgg tcttgctgga gatcatccgg agtcgtccta
    2001  ttctaggaac gaggagatag agtctttaga gcagtttcat atggcaacgg
    2051  cagattcgtt aattcgtaag cagatgagct cgattgtgta cacgggtccg
    2101  attaaagttc agcaaatgaa aaactttatc gatagcctgg tagcatcact
    2151  atctgctgcg gtgtcgaatc tcgtcaagat cctcaaagat acagctgcta
    2201  ttgaccttga aacccgtcaa aagtttggag tcttggatgt tgcatctagg
    2251  aagtggttaa tcaaaccaac ggccaagagt catgcatggg gtgttgttga
    2301  aacccacgcg aggaagtatc atgtggcgct tctggaatat gatgagcagg
    2351  gtgtggtgac atgcgatgat tggagaagag tagctgtcag ctctgagtct
    2401  gttgtttatt ccgacatggc gaaactcaga actctgcgca gactgcttcg
    2451  aaacggagaa ccgcatgtca gtagcgcaaa ggttgttctt gtggacggag
    2501  ttccgggctg tggaaaaacc aaagaaattc tttccagagt taattttgat
    2551  gaagatttaa ttttagtacc tgggaagcaa gccgctgaaa tgatcagaag
    2601  acgtgcgaat tcctcaggga ttattgtggc cacgaaggac aacgttaaaa
    2651  ccgttgattc tttcatgatg aattttggga aaagcacacg ctgtcagttc
    2701  aagaggttat tcattgatga agggttgatg ttgcatactg gttgtgttaa
    2751  ttttcttgtg gcgatgtcat tgtgcgaaat tgcatatgtt tacggagaca
    2801  cacagcaaat tccatacatc aatagagttt caggattccc gtaccccgcc
    2851  cattttgcca aattggaagt tgacgaggtg gagacacgca gaactactct
    2901  ccgttgtcca gccgatgtca cacattatct gaacaggaga tatgagggct
    2951  ttgtcatgag cacttcttcg gttaaaaagt ctgtttcgca ggagatggtc
    3001  ggcggagccg ccgtgatcaa tccgatctca aaacccttgc atggcaagat
    3051  cctgactttt acccaatcgg ataaagaagc tctgctttca agagggtatt
    3101  cagatgttca cactgtgcat gaagtgcaag gcgagacata ctctgatgtt
    3151  tcactagtta ggttaacccc tacaccggtc tccatcattg caggagacag
    3201  cccacatgtt ttggtcgcat tgtcaaggca cacctgttcg ctcaagtact
    3251  atactgttgt tatggatcct ttagttagta tcattagaga tctagagaaa
    3301  cttagctcgt acttgttaga tatgtataag gtcgatgcag gaacacaata
    3351  gcaattacag attgactcgg tgttcaaagg ttccaatctt tttgtggcag
    3401  cgccaaagac tggtgatatt tctgatatgc agttttacta tgataagtgt
    3451  ctcccaggca acagcaccat gatgaataat tttgatgctg ttaccatgag
    3501  gttgactgac atttcattga atgtcaaaga ttgcatattg gatatgtcta
    3551  agtctgttgc tgcgcctaag gatcaaatca aaccactaac acctatggta
    3601  cgaacggcgg cagaaatgcc acgccagact ggactattgg aaaatttagt
    3651  ggcgatgatt aaaaggaact ttaacgcacc cgagttgtct ggcatcattg
    3701  atattgaaaa tactgcatct ttagttgtag ataagttttt cgatagttat
    3751  ttgcttaaag aaaaaagaaa accaaataaa aatgtttctt tgttcagtag
    3801  agagtctctc aatagatggt tagaaaagca ggaacaggta acaataggtc
    3851  agcttgcaga ttttgatttt gtggatttgc cagcagttga tcagtacaga
    3901  cacatgatca aagcacaacc caagcaaaaa ttggacactt caatccaaac
    3951  ggagtacccg gctttgcaga cgattgtgta ccattcaaaa aagatcaatg
    4001  caatattcgg cccgttgttt agtgagctta ctaggcaatt actggacagt
    4051  gttgattcga gcagattttt gttttttaca agaaagacac cagcgcagat
    4101  tgaggatttc ttcggagatc ttgacagtca tgtgccgatg gatgtcttgg
    4151  agctggatat atcaaaatac gacaaatctc agaatgaatt ccactgtgca
    4201  gtagaatacg agatctggcg aagattgggt tttgaagact tcttgggaga
    4251  agtttggaaa caagggcata gaaagaccac cctcaaggat tataccgcag
    4301  gtatcaaaac ttgcatctgg tatcaaagaa agagtgggga cgtcacgacg
    4351  ttcattggaa acactgtgat cattgctgca tgtttggcct cgatgcttcc
    4401  gatggagaaa ataatcaaag gagccttttg tggtgacgat agtctgctgt
    4451  acttcccaaa gggttgtgag tttccggatg tgcaacactc cgcgaatctt
    4501  atgtggaatt ttgaagcaaa actgtttaaa aaacagtatg gatatttttg
    4551  cggaagatat gtaatacatc acgacagagg atgcattgtg tattacgatc
    4601  ccctaaagtt gatctcgaaa cttggtgcta aacacatcaa ggattgggaa
    4651  cacttggagg agttcagaag gtctctttgt gatgttgctg tttcgttgaa
    4701  caattgtgcg tattacacac agttggacga cgctgtaagg gaggttcata
    4751  agaccgcccc tccaggttcg tttgtttata aaagtctggt gaagtttttg
    4801  tctgataaag ttctttttag aagtttgttc atagatggct ctagttgtta
    4851  a