Sequence of DPV Tobacco mosaic virus
Tobacco mosaic virus gene for 180K protein, complete cds.
ACC No: D78608
Dated: 2010-06-15 | Length: 4851 | CRC: 809568550
ID D78608; SV 1; linear; genomic RNA; STD; VRL; 4851 BP.
XX
AC D78608;
XX
DT 16-DEC-1995 (Rel. 46, Created)
DT 15-JUN-2010 (Rel. 105, Last updated, Version 8)
XX
DE Tobacco mosaic virus gene for 180K protein, complete cds.
XX
KW .
XX
OS Tobacco mosaic virus
OC Viruses; ssRNA positive-strand viruses, no DNA stage; Virgaviridae;
OC Tobamovirus.
XX
RN [1]
RP 1-4851
RA Watanabe T.;
RT ;
RL Submitted (04-DEC-1995) to the EMBL/GenBank/DDBJ databases.
RL Contact:Takato Watanabe National Institute of Genetics, Department of
RL Molecular Genetics; Yata 1111, Mishima, Shizuoka 411, Japan
XX
RN [2]
RA Meshi T., Ohno T., Okada Y.;
RT "Nucleotide sequence and its character of cistron coding for the 30K
RT protein of tobacco mosaic virus(om strain)";
RL J. Biochem. 91:1441-1444(1982).
XX
RN [3]
RA Meshi T., Ishikawa M., Takamatsu N., Ohno T., Okada Y.;
RT "The 5'-terminal sequence of TMV RNA. Question on the polymorphism found in
RT vulgare strain";
RL FEBS Lett. 162:282-285(1983).
XX
RN [4]
RA Watanabe T., Hibi T., Ishihama A.;
RT "Nucleotide sequence of the coding region for 180K protein of tobacco
RT mosaic virus common strain OM";
RL Unpublished.
XX
FH Key Location/Qualifiers
FH
FT source 1. .4851
FT /organism="Tobacco mosaic virus"
FT /strain="OM"
FT /mol_type="genomic RNA"
FT /db_xref="taxon:12242"
FT CDS 1. .4851
FT /codon_start=1
FT /transl_except=(pos:3349. .3351,aa:OTHER)
FT /product="180K protein"
FT /note="readthrough"
FT /db_xref="GOA:O93058"
FT /db_xref="InterPro:IPR000606"
FT /db_xref="InterPro:IPR001788"
FT /db_xref="InterPro:IPR002588"
FT /db_xref="InterPro:IPR007094"
FT /db_xref="UniProtKB/Swiss-Prot:O93058"
FT /protein_id="BAA11429.1"
FT /translation="MAYTQTATTSALLDTVRGNNSLVNDLAKRRLYDTAVEEFNARDRR
FT PKVNFSKVISEEQTLIATRAYPEFQITFYNTQNAVHSLAGGLRSLELEYLMMQIPYGSL
FT TYDIGGNFASHLFKGRAYVHCCMPNLDVRDIMRHEGQKDSIELYLSRLERGGKTVPNFQ
FT KEAFDRYAEIPEDAVCHNTFQTCEHQPMQQSGRVYAIALHSIYDIPADEFGAALLRKNV
FT HTCYAAFHFSENLLLEDSYVNLDEINACFSRDGDKLTFSFASESTLNYCHSYSNILKYV
FT CKTYFPASNREVYMKEFLVTRVNTWFCKFSRIDTFLLYKGVAHKSVDSEQFYTAMEDAW
FT HYKKTLAMCNSERILLEDSSSVNYWFPKMRDMVIVPLFDISLETSKRTRKEVLVSKDFV
FT FTVLNHIRTYQAKALTYANVLSFVESIRSRVIINGVTARSEWDVDKSLLQSLSMTFYLH
FT TKLAVLKDDLLISKFSLGSKTVCQHVWDEISLAFGNAFPSVKERLLNRKLIRVAGDALE
FT IRVPDLYVTFHDRLVTEYKASVDMPALDIRKKMEETEVMYNALSELSVLRESDKFDVDV
FT FSQMCQSLEVDPMTAAKVIVAVMSNESGLTLTFERPTEANVALALQDQEKASEGALVVT
FT SREVEEPSMKGSMARGELQLAGLAGDHPESSYSRNEEIESLEQFHMATADSLIRKQMSS
FT IVYTGPIKVQQMKNFIDSLVASLSAAVSNLVKILKDTAAIDLETRQKFGVLDVASRKWL
FT IKPTAKSHAWGVVETHARKYHVALLEYDEQGVVTCDDWRRVAVSSESVVYSDMAKLRTL
FT RRLLRNGEPHVSSAKVVLVDGVPGCGKTKEILSRVNFDEDLILVPGKQAAEMIRRRANS
FT SGIIVATKDNVKTVDSFMMNFGKSTRCQFKRLFIDEGLMLHTGCVNFLVAMSLCEIAYV
FT YGDTQQIPYINRVSGFPYPAHFAKLEVDEVETRRTTLRCPADVTHYLNRRYEGFVMSTS
FT SVKKSVSQEMVGGAAVINPISKPLHGKILTFTQSDKEALLSRGYSDVHTVHEVQGETYS
FT DVSLVRLTPTPVSIIAGDSPHVLVALSRHTCSLKYYTVVMDPLVSIIRDLEKLSSYLLD
FT MYKVDAGTQXQLQIDSVFKGSNLFVAAPKTGDISDMQFYYDKCLPGNSTMMNNFDAVTM
FT RLTDISLNVKDCILDMSKSVAAPKDQIKPLTPMVRTAAEMPRQTGLLENLVAMIKRNFN
FT APELSGIIDIENTASLVVDKFFDSYLLKEKRKPNKNVSLFSRESLNRWLEKQEQVTIGQ
FT LADFDFVDLPAVDQYRHMIKAQPKQKLDTSIQTEYPALQTIVYHSKKINAIFGPLFSEL
FT TRQLLDSVDSSRFLFFTRKTPAQIEDFFGDLDSHVPMDVLELDISKYDKSQNEFHCAVE
FT YEIWRRLGFEDFLGEVWKQGHRKTTLKDYTAGIKTCIWYQRKSGDVTTFIGNTVIIAAC
FT LASMLPMEKIIKGAFCGDDSLLYFPKGCEFPDVQHSANLMWNFEAKLFKKQYGYFCGRY
FT VIHHDRGCIVYYDPLKLISKLGAKHIKDWEHLEEFRRSLCDVAVSLNNCAYYTQLDDAV
FT REVHKTAPPGSFVYKSLVKFLSDKVLFRSLFIDGSSC"
FT CDS 1. .3351
FT /codon_start=1
FT /product="130K protein"
FT /db_xref="GOA:O93058"
FT /db_xref="InterPro:IPR000606"
FT /db_xref="InterPro:IPR001788"
FT /db_xref="InterPro:IPR002588"
FT /db_xref="InterPro:IPR007094"
FT /db_xref="UniProtKB/Swiss-Prot:O93058"
FT /protein_id="BAA11430.1"
FT /translation="MAYTQTATTSALLDTVRGNNSLVNDLAKRRLYDTAVEEFNARDRR
FT PKVNFSKVISEEQTLIATRAYPEFQITFYNTQNAVHSLAGGLRSLELEYLMMQIPYGSL
FT TYDIGGNFASHLFKGRAYVHCCMPNLDVRDIMRHEGQKDSIELYLSRLERGGKTVPNFQ
FT KEAFDRYAEIPEDAVCHNTFQTCEHQPMQQSGRVYAIALHSIYDIPADEFGAALLRKNV
FT HTCYAAFHFSENLLLEDSYVNLDEINACFSRDGDKLTFSFASESTLNYCHSYSNILKYV
FT CKTYFPASNREVYMKEFLVTRVNTWFCKFSRIDTFLLYKGVAHKSVDSEQFYTAMEDAW
FT HYKKTLAMCNSERILLEDSSSVNYWFPKMRDMVIVPLFDISLETSKRTRKEVLVSKDFV
FT FTVLNHIRTYQAKALTYANVLSFVESIRSRVIINGVTARSEWDVDKSLLQSLSMTFYLH
FT TKLAVLKDDLLISKFSLGSKTVCQHVWDEISLAFGNAFPSVKERLLNRKLIRVAGDALE
FT IRVPDLYVTFHDRLVTEYKASVDMPALDIRKKMEETEVMYNALSELSVLRESDKFDVDV
FT FSQMCQSLEVDPMTAAKVIVAVMSNESGLTLTFERPTEANVALALQDQEKASEGALVVT
FT SREVEEPSMKGSMARGELQLAGLAGDHPESSYSRNEEIESLEQFHMATADSLIRKQMSS
FT IVYTGPIKVQQMKNFIDSLVASLSAAVSNLVKILKDTAAIDLETRQKFGVLDVASRKWL
FT IKPTAKSHAWGVVETHARKYHVALLEYDEQGVVTCDDWRRVAVSSESVVYSDMAKLRTL
FT RRLLRNGEPHVSSAKVVLVDGVPGCGKTKEILSRVNFDEDLILVPGKQAAEMIRRRANS
FT SGIIVATKDNVKTVDSFMMNFGKSTRCQFKRLFIDEGLMLHTGCVNFLVAMSLCEIAYV
FT YGDTQQIPYINRVSGFPYPAHFAKLEVDEVETRRTTLRCPADVTHYLNRRYEGFVMSTS
FT SVKKSVSQEMVGGAAVINPISKPLHGKILTFTQSDKEALLSRGYSDVHTVHEVQGETYS
FT DVSLVRLTPTPVSIIAGDSPHVLVALSRHTCSLKYYTVVMDPLVSIIRDLEKLSSYLLD
FT MYKVDAGTQ"
FT misc_difference 57
FT /replace="c"
FT /note="conflict"
FT /citation=[3]
FT misc_difference 195
FT /replace="g"
FT /note="conflict"
FT /citation=[3]
FT misc_difference 4738
FT /replace="a"
FT /note="conflict"
FT /citation=[2]
FT misc_difference 4796
FT /replace="t"
FT /note="conflict"
FT /citation=[2]
FT misc_difference 4830
FT /replace="c"
FT /note="conflict"
FT /citation=[2]
XX
SQ Sequence 4851 BP; 1385 A; 939 C; 1171 G; 1356 T; 0 other;
d78608 Length: 4851 15-JUN-2010 Type: N Check: 4853 ..
1 atggcataca cacagacagc taccacatca gctttgctgg acactgtccg
51 aggaaacaac tccttggtca atgatctagc aaagcgtcgt ctttacgaca
101 cagcggttga agagtttaac gctcgtgacc gcaggcccaa agtgaacttt
151 tcaaaagtaa taagcgagga gcagacgctt attgctaccc gggcgtatcc
201 agaattccaa attacatttt ataacacgca aaatgccgtg cattcgcttg
251 caggtggatt gcgatcttta gaactggaat atctgatgat gcaaattccc
301 tacggatcat tgacttatga cataggcggg aattttgcat cgcatctgtt
351 caagggacga gcatatgtac actgctgcat gcccaacctg gacgttcgag
401 acatcatgcg gcatgaaggc cagaaagaca gtattgaact atacctttct
451 aggctagaga gagggggaaa aacagtcccc aacttccaaa aggaagcatt
501 tgacagatac gcagaaattc ctgaagacgc tgtctgtcac aatactttcc
551 agacatgcga acatcagccg atgcagcaat caggcagagt gtacgccatt
601 gcgctacaca gcatatatga catacccgct gatgagttcg gggcagcact
651 tttgaggaaa aatgtccata cgtgctatgc cgctttccac ttctctgaga
701 acctgcttct tgaagattca tatgtcaatt tggacgaaat caacgcgtgt
751 ttttcgcgcg atggagacaa gttgaccttt tcttttgcat cagagagtac
801 tcttaattac tgtcatagtt attctaatat tcttaagtat gtgtgcaaaa
851 cttacttccc ggcctctaat agagaggttt acatgaagga gtttttagtc
901 accagggtta atacctggtt ttgtaagttt tctagaatag atacttttct
951 tttgtacaaa ggtgtggccc ataaaagtgt agatagtgag cagttttata
1001 ctgcaatgga agacgcatgg cattacaaaa agactcttgc aatgtgcaac
1051 agcgagagaa tcctccttga ggattcatca tcagtcaatt actggtttcc
1101 caaaatgagg gatatggtca tcgtaccatt attcgatatt tctttggaga
1151 ctagtaagag gacgcgtaag gaagtcttag tgtccaagga tttcgtgttt
1201 acagtgctta accacattcg aacataccag gcgaaagctc ttacatacgc
1251 aaatgttttg tccttcgtcg aatcgattcg atcgagggta atcattaacg
1301 gtgtgacagc gaggtctgaa tgggatgtgg acaaatcttt gttacaatcc
1351 ttgtccatga cgttttacct gcatactaag cttgccgttc taaaggatga
1401 cttactgatt agcaagttta gtctcggttc gaaaacggtg tgccagcatg
1451 tgtgggatga gatttcgctg gcgtttggga acgcatttcc ctccgtgaaa
1501 gagaggctct tgaacaggaa acttatcaga gtggcaggcg acgcactaga
1551 gatcagggtg cctgatctat atgtgacctt ccacgaccga ttagtgactg
1601 agtacaaggc ctctgtggac atgcctgcgc ttgacattag gaagaagatg
1651 gaagaaacgg aagtgatgta caatgcactt tcagaattat cggtgttaag
1701 ggagtctgac aaattcgatg ttgatgtttt ttcccagatg tgccaatctt
1751 tggaagttga cccaatgacg gcagcgaagg ttatagtcgc ggtcatgagc
1801 aatgagagcg gtctgactct cacatttgaa cgacctactg aggcgaatgt
1851 tgcgctagct ttacaggatc aagagaaggc ttcagaaggt gctttggtag
1901 ttacctcaag agaagttgaa gaaccgtcca tgaagggttc gatggccaga
1951 ggagagttac aattagctgg tcttgctgga gatcatccgg agtcgtccta
2001 ttctaggaac gaggagatag agtctttaga gcagtttcat atggcaacgg
2051 cagattcgtt aattcgtaag cagatgagct cgattgtgta cacgggtccg
2101 attaaagttc agcaaatgaa aaactttatc gatagcctgg tagcatcact
2151 atctgctgcg gtgtcgaatc tcgtcaagat cctcaaagat acagctgcta
2201 ttgaccttga aacccgtcaa aagtttggag tcttggatgt tgcatctagg
2251 aagtggttaa tcaaaccaac ggccaagagt catgcatggg gtgttgttga
2301 aacccacgcg aggaagtatc atgtggcgct tctggaatat gatgagcagg
2351 gtgtggtgac atgcgatgat tggagaagag tagctgtcag ctctgagtct
2401 gttgtttatt ccgacatggc gaaactcaga actctgcgca gactgcttcg
2451 aaacggagaa ccgcatgtca gtagcgcaaa ggttgttctt gtggacggag
2501 ttccgggctg tggaaaaacc aaagaaattc tttccagagt taattttgat
2551 gaagatttaa ttttagtacc tgggaagcaa gccgctgaaa tgatcagaag
2601 acgtgcgaat tcctcaggga ttattgtggc cacgaaggac aacgttaaaa
2651 ccgttgattc tttcatgatg aattttggga aaagcacacg ctgtcagttc
2701 aagaggttat tcattgatga agggttgatg ttgcatactg gttgtgttaa
2751 ttttcttgtg gcgatgtcat tgtgcgaaat tgcatatgtt tacggagaca
2801 cacagcaaat tccatacatc aatagagttt caggattccc gtaccccgcc
2851 cattttgcca aattggaagt tgacgaggtg gagacacgca gaactactct
2901 ccgttgtcca gccgatgtca cacattatct gaacaggaga tatgagggct
2951 ttgtcatgag cacttcttcg gttaaaaagt ctgtttcgca ggagatggtc
3001 ggcggagccg ccgtgatcaa tccgatctca aaacccttgc atggcaagat
3051 cctgactttt acccaatcgg ataaagaagc tctgctttca agagggtatt
3101 cagatgttca cactgtgcat gaagtgcaag gcgagacata ctctgatgtt
3151 tcactagtta ggttaacccc tacaccggtc tccatcattg caggagacag
3201 cccacatgtt ttggtcgcat tgtcaaggca cacctgttcg ctcaagtact
3251 atactgttgt tatggatcct ttagttagta tcattagaga tctagagaaa
3301 cttagctcgt acttgttaga tatgtataag gtcgatgcag gaacacaata
3351 gcaattacag attgactcgg tgttcaaagg ttccaatctt tttgtggcag
3401 cgccaaagac tggtgatatt tctgatatgc agttttacta tgataagtgt
3451 ctcccaggca acagcaccat gatgaataat tttgatgctg ttaccatgag
3501 gttgactgac atttcattga atgtcaaaga ttgcatattg gatatgtcta
3551 agtctgttgc tgcgcctaag gatcaaatca aaccactaac acctatggta
3601 cgaacggcgg cagaaatgcc acgccagact ggactattgg aaaatttagt
3651 ggcgatgatt aaaaggaact ttaacgcacc cgagttgtct ggcatcattg
3701 atattgaaaa tactgcatct ttagttgtag ataagttttt cgatagttat
3751 ttgcttaaag aaaaaagaaa accaaataaa aatgtttctt tgttcagtag
3801 agagtctctc aatagatggt tagaaaagca ggaacaggta acaataggtc
3851 agcttgcaga ttttgatttt gtggatttgc cagcagttga tcagtacaga
3901 cacatgatca aagcacaacc caagcaaaaa ttggacactt caatccaaac
3951 ggagtacccg gctttgcaga cgattgtgta ccattcaaaa aagatcaatg
4001 caatattcgg cccgttgttt agtgagctta ctaggcaatt actggacagt
4051 gttgattcga gcagattttt gttttttaca agaaagacac cagcgcagat
4101 tgaggatttc ttcggagatc ttgacagtca tgtgccgatg gatgtcttgg
4151 agctggatat atcaaaatac gacaaatctc agaatgaatt ccactgtgca
4201 gtagaatacg agatctggcg aagattgggt tttgaagact tcttgggaga
4251 agtttggaaa caagggcata gaaagaccac cctcaaggat tataccgcag
4301 gtatcaaaac ttgcatctgg tatcaaagaa agagtgggga cgtcacgacg
4351 ttcattggaa acactgtgat cattgctgca tgtttggcct cgatgcttcc
4401 gatggagaaa ataatcaaag gagccttttg tggtgacgat agtctgctgt
4451 acttcccaaa gggttgtgag tttccggatg tgcaacactc cgcgaatctt
4501 atgtggaatt ttgaagcaaa actgtttaaa aaacagtatg gatatttttg
4551 cggaagatat gtaatacatc acgacagagg atgcattgtg tattacgatc
4601 ccctaaagtt gatctcgaaa cttggtgcta aacacatcaa ggattgggaa
4651 cacttggagg agttcagaag gtctctttgt gatgttgctg tttcgttgaa
4701 caattgtgcg tattacacac agttggacga cgctgtaagg gaggttcata
4751 agaccgcccc tccaggttcg tttgtttata aaagtctggt gaagtttttg
4801 tctgataaag ttctttttag aagtttgttc atagatggct ctagttgtta
4851 a