Sequence of DPV Norwalk virus

Norwalk calicivirus nonstructural polyprotein, 58 kd capsid protein and orf3 genes, complete cds.

ACC No: M87661

Dated: 2000-03-04 | Length: 7654 | CRC: -1707662437

                !!NA_SEQUENCE 1.0
ID   NWCRNA     standard; RNA; VRL; 7654 BP.
XX
AC   M87661;
XX
SV   M87661.1
XX
DT   29-AUG-1992 (Rel. 33, Created)
DT   04-MAR-2000 (Rel. 63, Last updated, Version 5)
XX
DE   Norwalk calicivirus nonstructural polyprotein, 58 kd capsid protein and
DE   orf3 genes, complete cds.
XX
KW   .
XX
OS   Norwalk virus
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Caliciviridae;
OC   Norwalk-like viruses.
XX
RN   [1]
RP   13-7654
RX   MEDLINE; 93303939.
RA   Jiang X., Wang M., Wang K., Estes M.K.;
RT   "Sequence and genomic organization of Norwalk virus";
RL   Virology 195(1):51-61(1993).
XX
RN   [2]
RP   1-12
RX   MEDLINE; 97037719.
RA   Hardy M.E., Estes M.K.;
RT   "Completion of the Norwalk virus genome sequence";
RL   Virus Genes 12(3):287-290(1996).
XX
RN   [3]
RP   13-7654
RA   Estes M.K.;
RT   ;
RL   Submitted (09-AUG-1993) to the EMBL/GenBank/DDBJ databases.
RL   M.K. Estes, Molecular Virology, Baylor College of Medicine, Houston, TX
RL   77006, USA
XX
RN   [4]
RP   1-12
RA   Hardy M.E.;
RT   ;
RL   Submitted (07-NOV-1995) to the EMBL/GenBank/DDBJ databases.
RL   Michele E. Hardy, Molecular Virology, Baylor College of Medicine, Houston,
RL   TX 77006, USA
XX
DR   SPTREMBL; Q83883; Q83883.
DR   SPTREMBL; Q83884; Q83884.
DR   SPTREMBL; Q83885; Q83885.
XX
CC   On Nov 13, 1995 this sequence version replaced gi:332545.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .7654
FT                   /db_xref="taxon:11983"
FT                   /organism="Norwalk virus"
FT   CDS             5. .5374
FT                   /codon_start=1
FT                   /db_xref="SPTREMBL:Q83883"
FT                   /note="orf1; sequence homologies to 2C helicase, 3C
FT                   protease, and 3D RNA-dependent RNA polymerase of
FT                   picornavirus"
FT                   /product="nonstructural polyprotein"
FT                   /protein_id="AAB50465.1"
FT                   /translation="MMMASKDVVPTAASSENANNNSSIKSRLLARLKGSGGATSPPNSI
FT                   KITNQDMALGLIGQVPAPKATSVDVPKQQRDRPPRTVAEVQQNLRWTERPQDQNVKTWD
FT                   ELDHTTKQQILDEHAEWFDAGGLGPSTLPTSHERYTHENDEGHQVKWSAREGVDLGISG
FT                   LTTVSGPEWNMCPLPPVDQRSTTPATEPTIGDMIEFYEGHIYHYAIYIGQGKTVGVHSP
FT                   QAAFSITRITIQPISAWWRVCYVPQPKQRLTYDQLKELENEPWPYAAVTNNCFEFCCQV
FT                   MCLEDTWLQRKLISSGRFYHPTQDWSRDTPEFQQDSKLEMVRDAVLAAINGLVSRPFKD
FT                   LLGKLKPLNVLNLLSNCDWTFMGVVEMVVLLLELFGIFWNPPDVSNFIASLLPDFHLQG
FT                   PEDLARDLVPIVLGGIGLAIGFTRDKVSKMMKNAVDGLRAATQLGQYGLEIFSLLKKYF
FT                   FGGDQTEKTLKDIESAVIDMEVLSSTSVTQLVRDKQSARAYMAILDNEEEKARKLSVRN
FT                   ADPHVVSSTNALISRISMARAALAKAQAEMTSRMRPVVIMMCGPPGIGKTKAAEHLAKR
FT                   LANEIRPGGKVGLVPREAVDHWDGYHGEEVMLWDDYGMTKIQEDCNKLQAIADSAPLTL
FT                   NCDRIENKGMQFVSDAIVITTNAPGPAPVDFVNLGPVCRRVDFLVYCTAPEVEHTRKVS
FT                   PGDTTALKDCFKPDFSHLKMELAPQGGFDNQGNTPFGKGVMKPTTINRLLIQAVALTME
FT                   RQDEFQLQGPTYDFDTDRVAAFTRMARANGLGLISMASLGKKLRSVTTIEGLKNALSGY
FT                   KISKCSIQWQSRVYIIESDGASVQIKEDKQALTPLQQTINTASLAITRLKAARAVAYAS
FT                   CFQSAITTILQMAGSALVINRAVKRMFGTRTAAMALEGPGKEHNCRVHKAKEAGKGPIG
FT                   HDDMVERFGLCETEEEESEDQIQMVPSDAVPEGKNKGKTKKGRGRKNNYNAFSRRGLSD
FT                   EEYEEYKKIREEKNGNYSIQEYLEDRQRYEEELAEVQAGGDGGIGETEMEIRHRVFYKS
FT                   KSKKHQQEQRRQLGLVTGSDIRKRKPIDWTPPKNEWADDDREVDYNEKINFEAPPTLWS
FT                   RVTKFGSGWGFWVSPTVFITTTHVVPTGVKEFFGEPLSSIAIHQAGEFTQFRFSKKMRP
FT                   DLTGMVLEEGCPEGTVCSVLIKRDSGELLPLAVRMGAIASMRIQGRLVHGQSGMLLTGA
FT                   NAKGMDLGTIPGDCGAPYVHKRGNDWVVCGVHAAATKSGNTVVCAVQAGEGETALEGGD
FT                   KGHYAGHEIVRYGSGPALSTKTKFWRSSPEPLPPGVYEPAYLGGKDPRVQNGPSLQQVL
FT                   RDQLKPFADPRGRMPEPGLLEAAVETVTSMLEQTMDTPSPWSYADACQSLDKTTSSGYP
FT                   HHKRKNDDWNGTTFVGELGEQAAHANNMYENAKHMKPIYTAALKDELVKPEKIYQKVKK
FT                   RLLWGADLGTVVRAARAFGPFCDAIKSHVIKLPIKVGMNTIEDGPLIYAEHAKYKNHFD
FT                   ADYTAWDSTQNRQIMTESFSIMSRLTASPELAEVVAQDLLAPSEMDVGDYVIRVKEGLP
FT                   SGFPCTSQVNSINHWIITLCALSEATGLSPDVVQSMSYFSFYGDDEIVSTDIDFDPARL
FT                   TQILKEYGLKPTRPDKTEGPIQVRKNVDGLVFLRRTISRDAAGFQGRLDRASIERQIFW
FT                   TRGPNHSDPSETLVPHTQRKIQLISLLGEASLHGEKFYRKISSKVIHEIKTGGLEMYVP
FT                   GWQAMFRWMRFHDLGLWTGDRDLLPEFVNDDGV"
FT   CDS             5358. .6950
FT                   /codon_start=1
FT                   /db_xref="SPTREMBL:Q83884"
FT                   /note="orf2"
FT                   /product="58 kd capsid protein"
FT                   /protein_id="AAB50466.1"
FT                   /translation="MMMASKDATSSVDGASGAGQLVPEVNASDPLAMDPVAGSSTAVAT
FT                   AGQVNPIDPWIINNFVQAPQGEFTISPNNTPGDVLFDLSLGPHLNPFLLHLSQMYNGWV
FT                   GNMRVRIMLAGNAFTAGKIIVSCIPPGFGSHNLTIAQATLFPHVIADVRTLDPIEVPLE
FT                   DVRNVLFHNNDRNQQTMRLVCMLYTPLRTGGGTGDSFVVAGRVMTCPSPDFNFLFLVPP
FT                   TVEQKTRPFTLPNLPLSSLSNSRAPLPISSIGISPDNVQSVQFQNGRCTLDGRLVGTTP
FT                   VSLSHVAKIRGTSNGTVINLTELDGTPFHPFEGPAPIGFPDLGGCDWHINMTQFGHSSQ
FT                   TQYDVDTTPDTFVPHLGSIQANGIGSGNYVGVLSWISPPSHPSGSQVDLWKIPNYGSSI
FT                   TEATHLAPSVYPPGFGEVLVFFMSKMPGPGAYNLPCLLPQEYISHLASEQAPTVGEAAL
FT                   LHYVDPDTGRNLGEFKAYPDGFLTCVPNGASSGPQQLPINGVFVFVSWVSRFYQLKPVG
FT                   TASSARGRLGLRR"
FT   CDS             6950. .7588
FT                   /codon_start=1
FT                   /db_xref="SPTREMBL:Q83885"
FT                   /note="orf3; encodes small basic protein of unknown
FT                   function"
FT                   /protein_id="AAB50467.1"
FT                   /translation="MAQAIIGAIAASTAGSALGAGIQVGGEAALQSQRYQQNLQLQENS
FT                   FKHDREMIGYQVEASNQLLAKNLATRYSLLRAGGLTSADAARSVAGAPVTRIVDWNGVR
FT                   VSAPESSATTLRSGGFMSVPIPFASKQKQVQSSGISNPNYSPSSISRTTSWVESQNSSR
FT                   FGNLSPYHAEALNTVWLTPPGSTASSTLSSVPRGYFNTDRLPLFANNRR"
XX
SQ   Sequence 7654 BP; 2108 A; 1792 C; 1883 G; 1871 T; 0 other;

   M87661  Length: 7654  May 20, 2002 10:15  Type: N  Check: 7252  ..

       1  gtgaatgatg atggcgtcaa aagacgtcgt tcctactgct gctagcagtg
      51  aaaatgctaa caacaatagt agtattaagt ctcgtctatt ggcgagactc
     101  aagggttcag gtggggctac gtccccaccc aactcgataa agataaccaa
     151  ccaagatatg gctctggggc tgattggaca ggtcccagcg ccaaaggcca
     201  catccgtcga tgtccctaaa caacagaggg atagaccacc acggactgtt
     251  gccgaagttc aacaaaattt gcgttggact gagagaccac aagaccagaa
     301  tgttaagacg tgggatgagc ttgaccacac aacaaaacaa cagatacttg
     351  atgaacacgc tgagtggttt gatgccggtg gcttaggtcc aagtacacta
     401  cccactagtc atgaacggta cacacatgag aatgatgaag gccaccaggt
     451  aaagtggtcg gctagggaag gtgtagacct tggcatatcc gggctcacga
     501  cggtgtctgg gcctgagtgg aatatgtgcc cgctaccacc agttgaccaa
     551  aggagcacga cacctgcaac tgagcccaca attggtgaca tgatcgaatt
     601  ctatgaaggg cacatctatc attatgctat atacataggt caaggcaaga
     651  cggtgggtgt acactcccct caagcagcct tctcaataac gaggatcacc
     701  atacagccca tatcagcttg gtggcgagtc tgttatgtcc cacaaccaaa
     751  acagaggctc acatacgacc aactcaaaga attagaaaat gaaccatggc
     801  cgtatgccgc agtcacgaac aactgcttcg aattttgttg ccaggtcatg
     851  tgcttggaag atacttggtt gcaaaggaag ctcatctcct ctggccggtt
     901  ttaccacccg acccaagatt ggtcccgaga cactccagaa ttccaacaag
     951  acagcaagtt agagatggtt agggatgcag tgctagccgc tataaatggg
    1001  ttggtgtcgc ggccatttaa agatcttctg ggtaagctca aacccttgaa
    1051  cgtgcttaac ttactttcaa actgtgattg gacgttcatg ggggtcgtgg
    1101  agatggtggt cctcctttta gaactctttg gaatcttttg gaacccacct
    1151  gatgtttcca actttatagc ttcactcctg ccagatttcc atctacaggg
    1201  ccccgaggac cttgccaggg atctcgtgcc aatagtattg ggggggatcg
    1251  gcttagccat aggattcacc agagacaagg taagtaagat gatgaagaat
    1301  gctgttgatg gacttcgtgc ggcaacccag ctcggtcaat atggcctaga
    1351  aatattctca ttactaaaga agtacttctt cggtggtgat caaacagaga
    1401  aaaccctaaa agatattgag tcagcagtta tagatatgga agtactatca
    1451  tctacatcag tgactcagct cgtgagggac aaacagtctg cacgggctta
    1501  tatggccatc ttagataatg aagaagaaaa ggcaaggaaa ttatctgtca
    1551  ggaatgccga cccacacgta gtatcctcta ccaatgctct catatcccgg
    1601  atctcaatgg ctagggctgc attggccaag gctcaagctg aaatgaccag
    1651  caggatgcgt cctgtggtca ttatgatgtg tgggccccct ggtataggta
    1701  aaaccaaggc agcagaacat ctggctaaac gcctagccaa tgagatacgg
    1751  cctggtggta aggttgggct ggtcccacgg gaggcagtgg atcattggga
    1801  tggatatcac ggagaggaag tgatgctgtg ggacgactat ggaatgacaa
    1851  agatacagga agactgtaat aaactgcaag ccatagccga ctcagccccc
    1901  ctaacactca attgtgaccg aatagaaaac aagggaatgc aatttgtgtc
    1951  tgatgctata gtcatcacca ccaatgctcc tggcccagcc ccagtggact
    2001  ttgtcaacct cgggcctgtt tgccgaaggg tggacttcct tgtgtattgc
    2051  acggcacctg aagttgaaca cacgaggaaa gtcagtcctg gggacacaac
    2101  tgcactgaaa gactgcttca agcccgattt ctcacatcta aaaatggagt
    2151  tggctcccca agggggcttt gataaccaag ggaatacccc gtttggtaag
    2201  ggtgtgatga agcccaccac cataaacagg ctgttaatcc aggctgtagc
    2251  cttgacgatg gagagacagg atgagttcca actccagggg cctacgtatg
    2301  actttgatac tgacagagta gctgcgttca cgaggatggc ccgagccaac
    2351  gggttgggtc tcatatccat ggcctcccta ggcaaaaagc tacgcagtgt
    2401  caccactatt gaaggattaa agaatgctct atcaggctat aaaatatcaa
    2451  aatgcagtat acaatggcag tcaagggtgt acattataga atcagatggt
    2501  gccagtgtac aaatcaaaga agacaagcaa gctttgaccc ctctgcagca
    2551  gacaattaac acggcctcac ttgccatcac tcgactcaaa gcagctaggg
    2601  ctgtggcata cgcttcatgt ttccagtccg ccataactac catactacaa
    2651  atggcgggat ctgcgctcgt tattaatcga gcggtcaagc gtatgtttgg
    2701  tacccgtaca gcagccatgg cattagaagg acctgggaaa gaacataatt
    2751  gcagggtcca taaggctaag gaagctggaa aggggcccat aggtcatgat
    2801  gacatggtag aaaggtttgg cctatgtgaa actgaagagg aggagagtga
    2851  ggaccaaatt caaatggtac caagtgatgc cgtcccagaa ggaaagaaca
    2901  aaggcaagac caaaaaggga cgtggtcgca aaaataacta taatgcattc
    2951  tctcgccgtg gtctgagtga tgaagaatat gaagagtaca aaaagatcag
    3001  agaagaaaag aatggcaatt atagtataca agaatacttg gaggaccgcc
    3051  aacgatatga ggaagaatta gcagaggtac aggcaggtgg tgatggtggc
    3101  ataggagaaa ctgaaatgga aatccgtcac agggtcttct ataaatccaa
    3151  gagtaagaaa caccaacaag agcaacggcg acaacttggt ctagtgactg
    3201  gatcagacat cagaaaacgt aagcccattg actggacccc gccaaagaat
    3251  gaatgggcag atgatgacag agaggtggat tataatgaaa agatcaattt
    3301  tgaagctccc ccgacactat ggagccgagt cacaaagttt ggatcaggat
    3351  ggggcttttg ggtcagcccg acagtgttca tcacaaccac acatgtagtg
    3401  ccaactggtg tgaaagaatt ctttggtgag cccctatcta gtatagcaat
    3451  ccaccaagca ggtgagttca cacaattcag gttctcaaag aaaatgcgcc
    3501  ctgacttgac aggtatggtc cttgaagaag gttgccctga agggacagtc
    3551  tgctcagtcc taattaaacg ggattcgggt gaactacttc cgctagccgt
    3601  ccgtatgggg gctattgcct ccatgaggat acagggtcgg cttgtccatg
    3651  gccaatcagg gatgttactg acaggggcca atgcaaaggg gatggatctt
    3701  ggcactatac caggagactg cggggcacca tacgtccaca agcgcgggaa
    3751  tgactgggtt gtgtgtggag tccacgctgc agccacaaag tcaggcaaca
    3801  ccgtggtctg cgctgtacag gctggagagg gcgaaaccgc actagaaggt
    3851  ggagacaagg ggcattatgc cggccacgag attgtgaggt atggaagtgg
    3901  cccagcactg tcaactaaaa caaaattctg gaggtcctcc ccagaaccac
    3951  tgccccccgg agtatatgag ccagcatacc tggggggcaa ggacccccgt
    4001  gtacagaatg gcccatccct acaacaggta ctacgtgacc aactgaaacc
    4051  ctttgcggac ccccgcggcc gcatgcctga gcctggccta ctggaggctg
    4101  cggttgagac tgtaacatcc atgttagaac agacaatgga taccccaagc
    4151  ccgtggtctt acgctgatgc ctgccaatct cttgacaaaa ctactagttc
    4201  ggggtaccct caccataaaa ggaagaatga tgattggaat ggcaccacct
    4251  tcgttggaga gctcggtgag caagctgcac acgccaacaa tatgtatgag
    4301  aatgctaaac atatgaaacc catttacact gcagccttaa aagatgaact
    4351  agtcaagcca gaaaagattt atcaaaaagt caagaagcgt ctactatggg
    4401  gcgccgatct cggaacagtg gtcagggccg cccgggcttt tggcccattt
    4451  tgtgacgcta taaaatcaca tgtcatcaaa ttgccaataa aagttggcat
    4501  gaacacaata gaagatggcc ccctcatcta tgctgagcat gctaaatata
    4551  agaatcattt tgatgcagat tatacagcat gggactcaac acaaaataga
    4601  caaattatga cagaatcctt ctccattatg tcgcgcctta cggcctcacc
    4651  agaattggcc gaggttgtgg cccaagattt gctagcacca tctgagatgg
    4701  atgtaggtga ttatgtcatc agggtcaaag aggggctgcc atctggattc
    4751  ccatgtactt cccaggtgaa cagcataaat cactggataa ttactctctg
    4801  tgcactgtct gaggccactg gtttatcacc tgatgtggtg caatccatgt
    4851  catatttctc attttatggt gatgatgaga ttgtgtcaac tgacatagat
    4901  tttgacccag cccgcctcac tcaaattctc aaggaatatg gcctcaaacc
    4951  aacaaggcct gacaaaacag aaggaccaat acaagtgagg aaaaatgtgg
    5001  atggactggt cttcttgcgg cgcaccattt cccgtgatgc ggcagggttc
    5051  caaggcaggt tagatagggc ttcgattgaa cgccaaatct tctggacccg
    5101  cgggcccaat cattcagatc catcagagac tctagtgcca cacactcaaa
    5151  gaaaaataca gttgatttca cttctagggg aagcttcact ccatggtgag
    5201  aaattttaca gaaagatttc cagcaaggtc atacatgaaa tcaagactgg
    5251  tggattggaa atgtatgtcc caggatggca ggccatgttc cgctggatgc
    5301  gcttccatga cctcggattg tggacaggag atcgcgatct tctgcccgaa
    5351  ttcgtaaatg atgatggcgt ctaaggacgc tacatcaagc gtggatggcg
    5401  ctagtggcgc tggtcagttg gtaccggagg ttaatgcttc tgaccctctt
    5451  gcaatggatc ctgtagcagg ttcttcgaca gcagtcgcga ctgctggaca
    5501  agttaatcct attgatccct ggataattaa taattttgtg caagcccccc
    5551  aaggtgaatt tactatttcc ccaaataata cccccggtga tgttttgttt
    5601  gatttgagtt tgggtcccca tcttaatcct ttcttgctcc atctatcaca
    5651  aatgtataat ggttgggttg gtaacatgag agtcaggatt atgctagctg
    5701  gtaatgcctt tactgcgggg aagataatag tttcctgcat accccctggt
    5751  tttggttcac ataatcttac tatagcacaa gcaactctct ttccacatgt
    5801  gattgctgat gttaggactc tagaccccat tgaggtgcct ttggaagatg
    5851  ttaggaatgt tctctttcat aataatgata gaaatcaaca aaccatgcgc
    5901  cttgtgtgca tgctgtacac ccccctccgc actggtggtg gtactggtga
    5951  ttcttttgta gttgcagggc gagttatgac ttgccccagt cctgatttta
    6001  atttcttgtt tttagtccct cctacggtgg agcagaaaac caggcccttc
    6051  acactcccaa atctgccatt gagttctctg tctaactcac gtgcccctct
    6101  cccaatcagt agtatcggca tttccccaga caatgtccag agtgtgcagt
    6151  tccaaaatgg tcggtgtact ctggatggcc gcctggttgg caccacccca
    6201  gtttcattgt cacatgttgc caagataaga gggacctcca atggcactgt
    6251  aatcaacctt actgaattgg atggcacacc ctttcaccct tttgagggcc
    6301  ctgcccccat tgggtttcca gacctcggtg gttgtgattg gcatatcaat
    6351  atgacacagt ttggccattc tagccagacc cagtatgatg tagacaccac
    6401  ccctgacact tttgtccccc atcttggttc aattcaggca aatggcattg
    6451  gcagtggtaa ttatgttggt gttcttagct ggatttcccc cccatcacac
    6501  ccgtctggct cccaagttga cctttggaag atccccaatt atgggtcaag
    6551  tattacggag gcaacacatc tagccccttc tgtatacccc cctggtttcg
    6601  gagaggtatt ggtctttttc atgtcaaaaa tgccaggtcc tggtgcttat
    6651  aatttgccct gtctattacc acaagagtac atttcacatc ttgctagtga
    6701  acaagcccct actgtaggtg aggctgccct gctccactat gttgaccctg
    6751  ataccggtcg gaatcttggg gaattcaaag cataccctga tggtttcctc
    6801  acttgtgtcc ccaatggggc tagctcgggt ccacaacagc tgccgatcaa
    6851  tggggtcttt gtctttgttt catgggtgtc cagattttat caattaaagc
    6901  ctgtgggaac tgccagctcg gcaagaggta ggcttggtct gcgccgataa
    6951  tggcccaagc cataattggt gcaattgctg cttccacagc aggtagtgct
    7001  ctgggagcgg gcatacaggt tggtggcgaa gcggccctcc aaagccaaag
    7051  gtatcaacaa aatttgcaac tgcaagaaaa ttcttttaaa catgacaggg
    7101  aaatgattgg gtatcaggtt gaagcttcaa atcaattatt ggctaaaaat
    7151  ttggcaacta gatattcact cctccgtgct gggggtttga ccagtgctga
    7201  tgcagcaaga tctgtggcag gagctccagt cacccgcatt gtagattgga
    7251  atggcgtgag agtgtctgct cccgagtcct ctgctaccac attgagatcc
    7301  ggtggcttca tgtcagttcc cataccattt gcctctaagc aaaaacaggt
    7351  tcaatcatct ggtattagta atccaaatta ttccccttca tccatttctc
    7401  gaaccactag ttgggtcgag tcacaaaact catcgagatt tggaaatctt
    7451  tctccatacc acgcggaggc tctcaataca gtgtggttga ctccacccgg
    7501  ttcaacagcc tcttctacac tgtcttctgt gccacgtggt tatttcaata
    7551  cagacaggtt gccattattc gcaaataata ggcgatgatg ttgtaatatg
    7601  aaatgtgggc atcatattca tttaattagg tttaattagg tttaatttga
    7651  tgtt