Sequence of DPV Potato virus S
Potato virus S gene for Replicase, 25K protein, 12K protein, 42K protein, 7K protein, coat protein, 11K protein, complete and partial cds, 3'-terminal region.
ACC No: D00461
Dated: 2003-07-07 | Length: 3552 | CRC: 1116769277
!!NA_SEQUENCE 1.0
ID PVS standard; RNA; VRL; 3552 BP.
XX
AC D00461;
XX
SV D00461.1
XX
DT 11-APR-1990 (Rel. 23, Created)
DT 07-JUL-2003 (Rel. 76, Last updated, Version 4)
XX
DE Potato virus S gene for Replicase, 25K protein, 12K protein, 42K protein,
DE 7K protein, coat protein, 11K protein, complete and partial cds,
DE 3'-terminal region.
XX
KW coat protein; replicase.
XX
OS Potato virus S
OC Viruses; ssRNA positive-strand viruses, no DNA stage; Carlavirus.
XX
RN [1]
RP 1-3552
RX MEDLINE; 89279283.
RX PUBMED; 2732711.
RA Mackenzie D.J., Tremaine J.H., Stace-Smith R.;
RT "Organization and interviral homologies of the 3'-terminal portion of
RT potato virus S RNA";
RL J. Gen. Virol. 70:1053-1063(1989).
XX
DR GOA; P16650.
DR GOA; P16651.
DR GOA; P16652.
DR GOA; P16653.
DR GOA; P16654.
DR GOA; P22657.
DR GOA; Q9WK32.
DR SPTREMBL; Q9WK32; Q9WK32.
DR SWISS-PROT; P16650; VHEL_PVSP.
DR SWISS-PROT; P16651; VMEM_PVSP.
DR SWISS-PROT; P16652; V07K_PVSP.
DR SWISS-PROT; P16653; COAT_PVSP.
DR SWISS-PROT; P16654; VNBP_PVSP.
DR SWISS-PROT; P22657; RRPO_PVSP.
XX
CC Translation of the largest (42K) potential ORF (position 2045 to
CC 3172) from an internal ATG codon (position 2291) would yield a 33K
CC polypeptide, the sequence of which has been confirmed to be that of
CC the coat protein.
XX
FH Key Location/Qualifiers
FH
FT source 1. .3552
FT /db_xref="taxon:12169"
FT /mol_type="genomic RNA"
FT /note="27 bp upstream of EcoRI site."
FT /note="clones:pVS[49,30,20,92,52,59,91,44,58,94,45,64,57,
FT 41,66,65 and 61]"
FT /organism="Potato virus S"
FT /strain="Andean(Peruvian)"
FT CDS <1. .1064
FT /codon_start=3
FT /db_xref="GOA:P22657"
FT /db_xref="SWISS-PROT:P22657"
FT /product="Replicase"
FT /protein_id="BAA00350.1"
FT /translation="MYGPFLLKEFLNDVPLKPMHNTRMMAEAKFDFEEKKTQKSAATIE
FT NHSNRSCRDWLADMGMVFSKSQLCTKFDNRFRDAKAAQTIVCFQHSVLCRFAPYMRYIE
FT KKLNEVLPATFYIHSGKGLEELNKWVIESKFEGVCTESDYEAFDASQDQYIVAFELALM
FT RYLGLPNDLIEDYKYIKTHLGSKLGNFAIMRFSGEASTFLFNTMANMLFTFLRYKLKGD
FT ERICFAGDDMCANRALFIKDTHEGFLKKLKLKAKVDRTNRPSFCGWSLSSDGIYKKPQL
FT VFERLCIAKETANLANCIDNYAIEVSYAYKLGERIKERMSEEELEAFYNCVRVIIKHKH
FT LLKSEIRSVYEEV"
FT CDS 1101. .1784
FT /codon_start=1
FT /db_xref="GOA:P16650"
FT /db_xref="SWISS-PROT:P16650"
FT /product="25K protein"
FT /protein_id="BAA00351.1"
FT /translation="MDVFLQVLNKYKFERVSSTLNKPIVVHSVPGAGKSSAIRELLKLD
FT SRFECITRGRPDIPNLEGAFIKAERSGESKLLLVDEYIEGPIPEDAFAIFADPLQSTAV
FT SPHRAHFIKTLSHRFGKCTDSLLRDLGWDVQAEGQDSVQIADIFTVDPRETIVYFEPEV
FT GELLRSHGVEASCIGEVRGATFEHVTFVTSENSPLIDKASAFQCLTRHTKSLLILCPDA
FT TYTAA"
FT CDS 1762. .2088
FT /codon_start=1
FT /db_xref="GOA:P16651"
FT /db_xref="SWISS-PROT:P16651"
FT /product="12K protein"
FT /protein_id="BAA00352.1"
FT /translation="MPLTPPPNYTGLYIAAALGVSLAAVVALFTRSTLPIVGDSQHNLP
FT HGGRYRDGTKAIDYFKPTKLNSVEPGNYWYTQPWLLVILLVALICLSGRHAQCCPRCNR
FT VHSA"
FT CDS 2045. .3172
FT /codon_start=1
FT /db_xref="GOA:Q9WK32"
FT /db_xref="SPTREMBL:Q9WK32"
FT /note="containing coat protein ORF; see comment"
FT /product="42K protein"
FT /protein_id="BAA00353.1"
FT /translation="MLNAAQDATECTVLNSVLLGLRSRLVRSETRKYKLRFANHWGISQ
FT AGQLRAHKRSSGGRATKAVETPLGSQVKARIYSLTARMPPKPDPSSSGEAPQAMQPAPP
FT PRAEGHMYAQPEGPGQNEEAMLEQRLIRLIELMATKRHNSTLSNISFEIGRPSLEPTPE
FT MRRNPENPYSRFSIDELFKMEIRSVSNNMANTEQMAQITADIAGLGVPTEHVAGVILKV
FT VIMCASVSSSVYLDPAGTVEFPTGAVPLDSIIAIMKNRAGLRKVCRLYAPVVWNYMLVQ
FT NRPPSDWQAMGFQWNARFAAFDTFDYVTNGAAIQPVEGLIRRPTPEETIAHNAHKSMAI
FT DKSNRNERLANTNVEYTGGMLGAEIVRNHRNAINQ"
FT CDS 2052. .2249
FT /codon_start=1
FT /db_xref="GOA:P16652"
FT /db_xref="SWISS-PROT:P16652"
FT /product="7K protein"
FT /protein_id="BAA00354.1"
FT /translation="MLPKMQPSAQCLIVFSLAFVLGWYVLRPGNTSCVLLITGESVRLV
FT NCELTKDLVEAVLLRPLKHL"
FT CDS <2291. .3172
FT /codon_start=1
FT /db_xref="GOA:P16653"
FT /db_xref="SWISS-PROT:P16653"
FT /product="coat protein"
FT /protein_id="BAA00355.1"
FT /translation="MPPKPDPSSSGEAPQAMQPAPPPRAEGHMYAQPEGPGQNEEAMLE
FT QRLIRLIELMATKRHNSTLSNISFEIGRPSLEPTPEMRRNPENPYSRFSIDELFKMEIR
FT SVSNNMANTEQMAQITADIAGLGVPTEHVAGVILKVVIMCASVSSSVYLDPAGTVEFPT
FT GAVPLDSIIAIMKNRAGLRKVCRLYAPVVWNYMLVQNRPPSDWQAMGFQWNARFAAFDT
FT FDYVTNGAAIQPVEGLIRRPTPEETIAHNAHKSMAIDKSNRNERLANTNVEYTGGMLGA
FT EIVRNHRNAINQ"
FT CDS 3169. .3450
FT /codon_start=1
FT /db_xref="GOA:P16654"
FT /db_xref="SWISS-PROT:P16654"
FT /product="11K protein"
FT /protein_id="BAA00356.1"
FT /translation="MKAERLEMLLLCVYRLGYILPVDVCIKIISVAQVSVQGRSTYSCK
FT RRARSIGRCWRCYRVYPPVCNSKCDNRTCRPGISPNFKVVTFIRGWSN"
FT polyA_site 3552
FT /note="poly(A) site"
XX
SQ Sequence 3552 BP; 976 A; 747 C; 920 G; 909 T; 0 other;
D00461 Length: 3552 July 15, 2003 09:04 Type: N Check: 6345 ..
1 agatgtatgg gccatttcta cttaaagaat tcctcaacga tgtgccactt
51 aaacctatgc acaacacgcg catgatggct gaagcgaagt ttgacttcga
101 ggagaagaaa acccagaaaa gtgcagcaac cattgagaat catagtaata
151 ggtcttgtag ggattggctg gccgacatgg gcatggtgtt ttcaaagtct
201 caactctgca ccaagtttga caacaggttc agggatgcaa aggctgcgca
251 gaccatcgtt tgctttcagc acagcgttct gtgccgcttc gccccataca
301 tgaggtacat agagaagaag cttaatgagg tgttgcctgc cacattttac
351 atccactcag gcaagggctt ggaagagttg aacaaatggg tgatagaatc
401 caaatttgag ggagtgtgta cagagtctga ttatgaagct tttgatgcta
451 gccaagatca gtacattgtg gcgtttgaat tagcgctaat gaggtacttg
501 ggcctgccca atgatctcat agaggactac aagtacatca agacacatct
551 tggctctaaa ttgggaaatt ttgcaataat gcgcttctct ggtgaggcaa
601 gcacattctt attcaatacc atggccaaca tgctgttcac cttcttgagg
651 tacaagttga agggagatga aaggatttgc tttgctgggg atgatatgtg
701 tgcaaataga gctctgttca tcaaggatac gcatgagggc ttcctcaaga
751 agctcaagct gaaggccaag gtggatagaa caaacagacc gagcttctgc
801 gggtggagtt tgagctctga tgggatctac aaaaagccgc aacttgtctt
851 tgagaggctc tgtatagcaa aagagaccgc taatttagcc aattgcatag
901 ataattatgc gatcgaggtg tcctatgcct acaagcttgg ggagaggatc
951 aaagagcgta tgtcagagga ggaactagag gctttctaca actgcgtgag
1001 ggttatcatc aaacacaagc atctgcttaa gtctgaaatt cgcagtgtgt
1051 atgaggaggt ttgatagctt aggtaatcag cttagtagta ttgaatatat
1101 atggatgtgt ttttgcaagt tttgaataaa tataagtttg agcgtgttag
1151 tagtactcta aataaaccaa tagttgttca tagtgttcca ggtgctggta
1201 agagttccgc gatcagggaa ttactgaagt tagatagtag gtttgagtgc
1251 attacccgtg gccggccaga cattcccaat ctagagggag ctttcatcaa
1301 ggccgaacgt agtggtgaga gtaagctgtt actggtagat gagtacatag
1351 aagggcccat tccagaggac gcctttgcaa tcttcgcaga tccgcttcag
1401 agcacagccg tcagtccaca cagagcgcat ttcatcaaaa cactaagcca
1451 tcgctttggc aagtgtactg attcactctt gagagatttg ggttgggacg
1501 tgcaagctga aggtcaggat tcagttcaaa tcgctgatat cttcacggtc
1551 gaccccagag aaacaattgt ttactttgag ccggaagttg gtgagttgct
1601 gaggagtcac ggagtcgagg caagctgcat cggtgaggtg cgtggggcca
1651 cttttgagca cgtaacgttt gtcacatctg aaaatagccc attgattgat
1701 aaggcctctg catttcagtg cttaacgagg cacaccaaga gcttactcat
1751 attgtgccct gatgccactt acaccgccgc ctaattacac agggttatac
1801 attgcggcag cgcttggtgt atctcttgct gctgtagttg ccttattcac
1851 aagaagtact ttgccgattg taggggactc acagcacaac ctcccacacg
1901 gggggcggta ccgtgacggc acaaaggcca tagactactt caagcccaca
1951 aaattgaatt ctgtggagcc gggcaattac tggtacactc aaccttggtt
2001 gttggttata cttttggtag cgctcatctg tctatccggg cgtcatgctc
2051 aatgctgccc aagatgcaac cgagtgcaca gtgcttaata gtgttctcct
2101 tggccttcgt tctaggttgg tacgttctga gaccaggaaa tacaagttgc
2151 gttttgctaa tcactgggga atcagtcagg ctggtcaatt gcgagctcac
2201 aaaagatcta gtggaggccg tgctactaag gccgttgaaa cacctttagg
2251 ttcacaggta aaagctcgaa tatacagtct cacagcaaga atgccgccta
2301 aaccagatcc atctagctca ggggaagcac cacaagcgat gcaacctgca
2351 ccaccaccgc gcgcagaagg gcacatgtat gcgcaaccag aagggccagg
2401 gcaaaacgag gaagccatgc tggagcaaag actcatcagg ttgattgagc
2451 tcatggctac gaagaggcac aactcgacat tgagcaacat ctcctttgaa
2501 ataggtaggc ccagtctaga accgacccct gagatgagga ggaatccgga
2551 aaatccgtac tctcggttct caatcgacga gctgttcaag atggaaatcc
2601 gatcggtctc taacaatatg gccaacactg agcagatggc acagatcacc
2651 gcggatattg cagggctcgg cgtccccaca gagcatgtgg cgggggttat
2701 actgaaggtc gtaattatgt gcgcaagcgt gagcagttct gtctacttag
2751 accctgcagg gactgttgag ttccctactg gcgcagtgcc actggattcc
2801 ataatcgcaa tcatgaaaaa ccgtgctggg ttgaggaagg tgtgtaggtt
2851 gtatgctccg gtcgtttgga attatatgct tgttcagaat aggccacctt
2901 cagattggca ggccatgggg tttcaatgga atgcacgttt cgccgctttt
2951 gacacatttg attatgtgac taacggcgct gcaatccagc ctgttgaggg
3001 gctcatccgt aggccgacgc ctgaagagac aatagctcat aacgctcaca
3051 agagcatggc tattgataag tcgaacagaa atgaaaggtt ggctaacacc
3101 aacgttgagt atactggggg catgctcggt gctgagattg tgcgtaatca
3151 tcggaatgca ataaaccaat gaaagcggaa cgtttagaaa tgttactgtt
3201 gtgtgtttac aggctgggtt atattttacc agtcgatgtg tgtattaaaa
3251 taataagcgt agcgcaggtc agtgtccaag gtcgttcaac ctactcatgt
3301 aagcgaaggg cccgcagcat tggacgatgc tggcgttgct accgtgtcta
3351 tccaccagtt tgtaattcta agtgtgataa taggacatgc cgtccaggca
3401 ttagtcccaa ctttaaagta gtgactttta ttcggggttg gagtaactga
3451 ggtgatacca cccgggatga aaagtctgag tttcgcataa agcttaaata
3501 atatataagt gtgcaactat aaagaaaata tgtttttaaa atattttagc
3551 at