Sequence of DPV Hantaan virus

Hantaan virus, complete M RNA segment coding for G1 and G2 proteins, complete cds.

ACC No: M14627

Dated: 2000-03-04 | Length: 3616 | CRC: -999070269

                !!NA_SEQUENCE 1.0
ID   HANG1G2    standard; RNA; VRL; 3616 BP.
XX
AC   M14627;
XX
SV   M14627.1
XX
DT   19-SEP-1987 (Rel. 13, Created)
DT   04-MAR-2000 (Rel. 63, Last updated, Version 3)
XX
DE   Hantaan virus, complete M RNA segment coding for G1 and G2 proteins,
DE   complete cds.
XX
KW   envelope protein; glycoprotein; polyprotein.
XX
OS   Hantaan virus
OC   Viruses; ssRNA negative-strand viruses; Bunyaviridae; Hantavirus.
XX
RN   [1]
RP   1-3616
RX   MEDLINE; 87151118.
RA   Schmaljohn C.S., Schmaljohn A.L., Dalrymple J.M.;
RT   "Hantaan virus M RNA: Coding strategy, nucleotide sequence, and gene
RT   order";
RL   Virology 157:31-39(1987).
XX
DR   SWISS-PROT; P08668; VGLM_HANTV.
XX
CC   Draft entry and computer-readable sequence of [1] kindly provided
CC   by C.Schmaljohn, 02-MAR-1987.
CC   There are two possible initiation codons at nucleotide positions
CC   41-43 or 65-67, but 'the first codon has more favorable flanking
CC   sequences for initiation of protein synthesis'.
CC   It was not possible to precisely define the carboxy terminus of G1
CC   and G2 mature glycoproteins, but it was established that the
CC   carboxy terminus of G1 and G2 were between positions 1802-1882 and
CC   3419-3445 respectively.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .3616
FT                   /db_xref="taxon:11599"
FT                   /organism="Hantaan virus"
FT   CDS             41. .3448
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P08668"
FT                   /note="precursor structural polyprotein"
FT                   /protein_id="AAA43836.1"
FT                   /translation="MGIWKWLVMASLVWPVLTLRNVYDMKIECPHTVSFGENSVIGYVE
FT                   LPPVPLADTAQMVPESSCNMDNHQSLNTITKYTQVSWRGKADQSQSSQNSFETVSTEVD
FT                   LKGTCVLKHKMVEESYRSRKSVTCYDLSCNSTYCKPTLYMIVPIHACNMMKSCLIALGP
FT                   YRVQVVYERSYCMTGVLIEGKCFVPDQSVVSIIKHGIFDIASVHIVCFFVAVKGNTYKI
FT                   FEQVKKSFESTCNDTENKVQGYYICIVGGNSAPIYVPTLDDFRSMEAFTGIFRSPHGED
FT                   HDLAGEEIASYSIVGPANAKVPHSASSDTLSLIAYSGIPSYSSLSILTSSTEAKHVFSP
FT                   GLFPKLNHTNCDKSAIPLIWTGMIDLPGYYEAVHPCTVFCVLSGPGASCEAFSEGGIFN
FT                   ITSPMCLVSKQNRFRLTEQQVNFVCQRVDMDIVVYCNGQRKVILTKTLVIGQCIYTITS
FT                   LFSLLPGVAHSIAVELCVPGFHGWATAALLVTFCFGWVLIPAITFIILTVLKFIANIFH
FT                   TSNQENRLKSVLRKIKEEFEKTKGSMVCDVCKYECETYKELKAHGVSCPQSQCPYCFTH
FT                   CEPTEAAFQAHYKVCQVTHRFRDDLKKTVTPQNFTPGCYRTLNLFRYKSRCYIFTMWIF
FT                   LLVLESILWAASASETPLTPVWNDNAHGVGSVPMHTDLELDFSLTSSSKYTYRRKLTNP
FT                   LEEAQSIDLHIEIEEQTIGVDVHALGHWFDGRLNLKTSFHCYGACTKYEYPWHTAKCHY
FT                   ERDYQYETSWGCNPSDCPGVGTGCTACGLYLDQLKPVGSAYKIITIRYSRRVCVQFGEE
FT                   NLCKIIDMNDCFVSRHVKVCIIGTVSKFSQGDTLLFFGPLEGGGLIFKHWCTSTCQFGD
FT                   PGDIMSPRDKGFLCPEFPGSFRKKCNFATTPICEYDGNMVSGYKKVMATIDSFQSFNTS
FT                   TMHFTDERIEWKDPDGMLRDHINILVTKDIDFDNLGENPCKIGLQTSSIEGAWGSGVGF
FT                   TLTCLVSLTECPTFLTSIKACDKAICYGAESVTLTRGQNTVKVSGKGGHSGSTFRCCHG
FT                   EDCSQIGLHAAAPHLDKVNGISEIENSKVYDDGAPQCGIKCWFVKSGEWISGIFSGNWI
FT                   VLIVLCVFLLFSLVLLSILCPVRKHKKS"
FT   sig_peptide     41. .94
FT                   /note="precursor structural polyprotein, signal peptide"
FT   mat_peptide     95. .1801
FT                   /note="envelope glycoprotein G1 (see comment)"
FT                   /partial
FT   mat_peptide     1985. .3418
FT                   /note="envelope glycoprotein G2 (see comment)"
FT                   /partial
XX
SQ   Sequence 3616 BP; 1113 A; 648 C; 773 G; 1082 T; 0 other;

M14627  Length: 3616  December 21, 2001 15:54  Type: N  Check: 6321  ..

       1  tagtagtaga caccgcaaaa gaaagcagtc aatcagcaac atggggatat
      51  ggaagtggct agtgatggcc agtttagtat ggcctgtttt gacactgaga
     101  aatgtctatg acatgaaaat tgagtgcccc catacagtaa gttttgggga
     151  aaacagtgtg ataggttatg tagaattacc ccccgtgcca ttggccgaca
     201  cagcacagat ggtgcctgag agttcttgta acatggataa tcaccaatcg
     251  ttgaatacaa taacaaaata tacccaagta agttggagag gaaaggctga
     301  tcagtcacag tctagtcaaa attcatttga gacagtgtcc actgaagttg
     351  acttgaaagg aacatgtgtt ctaaaacaca aaatggtgga agaatcatac
     401  cgtagtagga aatcagtaac ctgttacgac ctgtcttgca atagcactta
     451  ctgcaagcca acactataca tgattgtacc aattcatgca tgcaatatga
     501  tgaaaagctg tttgattgca ttgggaccat acagagtaca ggtggtttat
     551  gagagaagtt actgtatgac aggagtcctg attgaaggga aatgctttgt
     601  cccagatcaa agtgtggtca gtattatcaa gcatgggatc tttgatattg
     651  caagtgttca tattgtatgt ttctttgttg cagttaaagg gaatacttat
     701  aaaatttttg aacaggttaa gaaatccttt gaatcaacat gcaatgatac
     751  agagaataaa gtgcaaggat attatatttg tattgtaggg ggaaactctg
     801  caccaatata tgttccaaca cttgatgatt tcagatccat ggaagcattt
     851  acaggaatct tcagatcacc acatggggaa gatcatgatc tggctggaga
     901  agaaattgca tcttattcta tagtcggacc tgccaatgca aaagttcctc
     951  atagtgctag ctcagataca ttgagcttga ttgcctattc aggtatacca
    1001  tcttattctt cccttagcat cctaacaagt tcaacagaag ctaagcatgt
    1051  attcagccct gggttgttcc caaaacttaa tcacacaaat tgtgataaaa
    1101  gtgccatacc actcatatgg actgggatga ttgatttacc tggatactac
    1151  gaagctgtcc acccttgtac agttttttgc gtattatcag gtcctggggc
    1201  atcatgtgaa gccttttctg aaggcgggat tttcaacata acctctccca
    1251  tgtgcttagt gtcaaaacaa aatcgattcc ggttaacaga acagcaagtg
    1301  aattttgtgt gtcagcgagt ggacatggac attgttgtgt actgcaacgg
    1351  gcagaggaaa gtaatattaa caaaaactct agttattgga cagtgtatat
    1401  atactataac aagcttattc tcattactac ctggagtagc acattctatt
    1451  gctgttgaat tgtgtgtacc tgggttccat ggttgggcca cagctgctct
    1501  gcttgttaca ttctgtttcg gatgggttct tataccagca attacattta
    1551  tcatactaac agtcctaaag ttcattgcta atatttttca cacaagtaat
    1601  caagagaata ggctaaaatc agtacttaga aagataaagg aagagtttga
    1651  aaaaacaaaa ggctcaatgg tatgtgatgt ctgcaagtat gagtgtgaaa
    1701  cctataaaga attaaaggca cacggggtat catgccccca atctcaatgt
    1751  ccttactgtt ttactcattg tgaacccaca gaagcagcat tccaagctca
    1801  ttacaaggta tgccaagtta ctcacagatt cagggatgat ctaaagaaaa
    1851  ctgttactcc tcaaaatttt acaccaggat gttaccggac actaaattta
    1901  tttagataca aaagcaggtg ctacatcttt acaatgtgga tatttcttct
    1951  tgtcttagaa tccatactgt gggctgcaag tgcatcagag acaccattaa
    2001  ctcctgtctg gaatgacaat gcccatgggg taggttctgt tcctatgcat
    2051  acagatttag agcttgattt ctctttaaca tccagttcca agtatacata
    2101  ccgtaggaag ttaacaaacc cacttgagga agcacaatcc attgacctac
    2151  atattgaaat agaagaacag acaattggtg ttgatgtgca tgctctagga
    2201  cactggtttg atggtcgtct taaccttaaa acatcctttc actgttatgg
    2251  tgcttgtaca aagtatgaat acccttggca tactgcaaag tgccattatg
    2301  aaagagatta ccaatatgag acgagctggg gttgtaatcc atcagattgt
    2351  cctggggtgg gcacaggctg tacagcatgt ggtttatacc tagatcaact
    2401  gaaaccagtt ggtagtgctt ataaaattat cacaataagg tacagcagga
    2451  gagtctgtgt tcagtttggg gaggaaaacc tttgtaagat aatagacatg
    2501  aatgattgtt ttgtatctag gcatgttaag gtctgcataa ttggtacagt
    2551  atctaaattc tctcagggtg ataccttatt gttttttgga ccgcttgaag
    2601  gtggtggtct aatatttaaa cactggtgta catccacatg tcaatttggt
    2651  gacccaggag atatcatgag tccaagagac aaaggttttt tatgccctga
    2701  gtttccaggt agtttcagga agaaatgcaa ctttgctact acccctattt
    2751  gtgagtatga tggaaatatg gtctcaggtt acaagaaagt gatggcgaca
    2801  attgattcct tccaatcttt taatacaagc actatgcact tcactgatga
    2851  aaggatagag tggaaagacc ctgatggaat gctaagggac catataaaca
    2901  ttttagtaac gaaggacatt gactttgata accttggtga aaatccttgc
    2951  aaaattggcc tacaaacatc ttctattgag ggggcctggg gttctggtgt
    3001  ggggttcaca ttaacatgtc tggtatcact aacagaatgt cctacctttt
    3051  tgacctcaat aaaggcttgt gataaggcta tctgttatgg tgcagagagt
    3101  gtaacattga caagaggaca aaatacagtc aaggtatcag ggaaaggtgg
    3151  ccatagtggt tcaacattta ggtgttgcca tggggaggac tgttcacaaa
    3201  ttggactcca tgctgctgca cctcaccttg acaaggtaaa tgggatttct
    3251  gagatagaaa atagtaaagt atatgatgat ggggcaccgc aatgtgggat
    3301  aaaatgttgg tttgttaaat caggggaatg gatttcaggg atattcagtg
    3351  gtaattggat tgtactcatt gtcctctgtg tatttctatt gttctccttg
    3401  gttttactaa gcattctctg tcccgtaagg aagcataaaa aatcatagct
    3451  aaattctgtg actatcctgt tcttatgtat agctttaaca tatatactaa
    3501  tttttatatt ccagtatact ctatctaaca cactaaaaaa aatagtagct
    3551  ttctaaccac aaaacttaga ttcttcttct gtatgatgtc ttaacatctt
    3601  gcggtgtcta ctacta