Sequence of DPV Hantaan virus
Hantaan virus, complete M RNA segment coding for G1 and G2 proteins, complete cds.
ACC No: M14627
Dated: 2000-03-04 | Length: 3616 | CRC: -999070269
!!NA_SEQUENCE 1.0
ID HANG1G2 standard; RNA; VRL; 3616 BP.
XX
AC M14627;
XX
SV M14627.1
XX
DT 19-SEP-1987 (Rel. 13, Created)
DT 04-MAR-2000 (Rel. 63, Last updated, Version 3)
XX
DE Hantaan virus, complete M RNA segment coding for G1 and G2 proteins,
DE complete cds.
XX
KW envelope protein; glycoprotein; polyprotein.
XX
OS Hantaan virus
OC Viruses; ssRNA negative-strand viruses; Bunyaviridae; Hantavirus.
XX
RN [1]
RP 1-3616
RX MEDLINE; 87151118.
RA Schmaljohn C.S., Schmaljohn A.L., Dalrymple J.M.;
RT "Hantaan virus M RNA: Coding strategy, nucleotide sequence, and gene
RT order";
RL Virology 157:31-39(1987).
XX
DR SWISS-PROT; P08668; VGLM_HANTV.
XX
CC Draft entry and computer-readable sequence of [1] kindly provided
CC by C.Schmaljohn, 02-MAR-1987.
CC There are two possible initiation codons at nucleotide positions
CC 41-43 or 65-67, but 'the first codon has more favorable flanking
CC sequences for initiation of protein synthesis'.
CC It was not possible to precisely define the carboxy terminus of G1
CC and G2 mature glycoproteins, but it was established that the
CC carboxy terminus of G1 and G2 were between positions 1802-1882 and
CC 3419-3445 respectively.
XX
FH Key Location/Qualifiers
FH
FT source 1. .3616
FT /db_xref="taxon:11599"
FT /organism="Hantaan virus"
FT CDS 41. .3448
FT /codon_start=1
FT /db_xref="SWISS-PROT:P08668"
FT /note="precursor structural polyprotein"
FT /protein_id="AAA43836.1"
FT /translation="MGIWKWLVMASLVWPVLTLRNVYDMKIECPHTVSFGENSVIGYVE
FT LPPVPLADTAQMVPESSCNMDNHQSLNTITKYTQVSWRGKADQSQSSQNSFETVSTEVD
FT LKGTCVLKHKMVEESYRSRKSVTCYDLSCNSTYCKPTLYMIVPIHACNMMKSCLIALGP
FT YRVQVVYERSYCMTGVLIEGKCFVPDQSVVSIIKHGIFDIASVHIVCFFVAVKGNTYKI
FT FEQVKKSFESTCNDTENKVQGYYICIVGGNSAPIYVPTLDDFRSMEAFTGIFRSPHGED
FT HDLAGEEIASYSIVGPANAKVPHSASSDTLSLIAYSGIPSYSSLSILTSSTEAKHVFSP
FT GLFPKLNHTNCDKSAIPLIWTGMIDLPGYYEAVHPCTVFCVLSGPGASCEAFSEGGIFN
FT ITSPMCLVSKQNRFRLTEQQVNFVCQRVDMDIVVYCNGQRKVILTKTLVIGQCIYTITS
FT LFSLLPGVAHSIAVELCVPGFHGWATAALLVTFCFGWVLIPAITFIILTVLKFIANIFH
FT TSNQENRLKSVLRKIKEEFEKTKGSMVCDVCKYECETYKELKAHGVSCPQSQCPYCFTH
FT CEPTEAAFQAHYKVCQVTHRFRDDLKKTVTPQNFTPGCYRTLNLFRYKSRCYIFTMWIF
FT LLVLESILWAASASETPLTPVWNDNAHGVGSVPMHTDLELDFSLTSSSKYTYRRKLTNP
FT LEEAQSIDLHIEIEEQTIGVDVHALGHWFDGRLNLKTSFHCYGACTKYEYPWHTAKCHY
FT ERDYQYETSWGCNPSDCPGVGTGCTACGLYLDQLKPVGSAYKIITIRYSRRVCVQFGEE
FT NLCKIIDMNDCFVSRHVKVCIIGTVSKFSQGDTLLFFGPLEGGGLIFKHWCTSTCQFGD
FT PGDIMSPRDKGFLCPEFPGSFRKKCNFATTPICEYDGNMVSGYKKVMATIDSFQSFNTS
FT TMHFTDERIEWKDPDGMLRDHINILVTKDIDFDNLGENPCKIGLQTSSIEGAWGSGVGF
FT TLTCLVSLTECPTFLTSIKACDKAICYGAESVTLTRGQNTVKVSGKGGHSGSTFRCCHG
FT EDCSQIGLHAAAPHLDKVNGISEIENSKVYDDGAPQCGIKCWFVKSGEWISGIFSGNWI
FT VLIVLCVFLLFSLVLLSILCPVRKHKKS"
FT sig_peptide 41. .94
FT /note="precursor structural polyprotein, signal peptide"
FT mat_peptide 95. .1801
FT /note="envelope glycoprotein G1 (see comment)"
FT /partial
FT mat_peptide 1985. .3418
FT /note="envelope glycoprotein G2 (see comment)"
FT /partial
XX
SQ Sequence 3616 BP; 1113 A; 648 C; 773 G; 1082 T; 0 other;
M14627 Length: 3616 December 21, 2001 15:54 Type: N Check: 6321 ..
1 tagtagtaga caccgcaaaa gaaagcagtc aatcagcaac atggggatat
51 ggaagtggct agtgatggcc agtttagtat ggcctgtttt gacactgaga
101 aatgtctatg acatgaaaat tgagtgcccc catacagtaa gttttgggga
151 aaacagtgtg ataggttatg tagaattacc ccccgtgcca ttggccgaca
201 cagcacagat ggtgcctgag agttcttgta acatggataa tcaccaatcg
251 ttgaatacaa taacaaaata tacccaagta agttggagag gaaaggctga
301 tcagtcacag tctagtcaaa attcatttga gacagtgtcc actgaagttg
351 acttgaaagg aacatgtgtt ctaaaacaca aaatggtgga agaatcatac
401 cgtagtagga aatcagtaac ctgttacgac ctgtcttgca atagcactta
451 ctgcaagcca acactataca tgattgtacc aattcatgca tgcaatatga
501 tgaaaagctg tttgattgca ttgggaccat acagagtaca ggtggtttat
551 gagagaagtt actgtatgac aggagtcctg attgaaggga aatgctttgt
601 cccagatcaa agtgtggtca gtattatcaa gcatgggatc tttgatattg
651 caagtgttca tattgtatgt ttctttgttg cagttaaagg gaatacttat
701 aaaatttttg aacaggttaa gaaatccttt gaatcaacat gcaatgatac
751 agagaataaa gtgcaaggat attatatttg tattgtaggg ggaaactctg
801 caccaatata tgttccaaca cttgatgatt tcagatccat ggaagcattt
851 acaggaatct tcagatcacc acatggggaa gatcatgatc tggctggaga
901 agaaattgca tcttattcta tagtcggacc tgccaatgca aaagttcctc
951 atagtgctag ctcagataca ttgagcttga ttgcctattc aggtatacca
1001 tcttattctt cccttagcat cctaacaagt tcaacagaag ctaagcatgt
1051 attcagccct gggttgttcc caaaacttaa tcacacaaat tgtgataaaa
1101 gtgccatacc actcatatgg actgggatga ttgatttacc tggatactac
1151 gaagctgtcc acccttgtac agttttttgc gtattatcag gtcctggggc
1201 atcatgtgaa gccttttctg aaggcgggat tttcaacata acctctccca
1251 tgtgcttagt gtcaaaacaa aatcgattcc ggttaacaga acagcaagtg
1301 aattttgtgt gtcagcgagt ggacatggac attgttgtgt actgcaacgg
1351 gcagaggaaa gtaatattaa caaaaactct agttattgga cagtgtatat
1401 atactataac aagcttattc tcattactac ctggagtagc acattctatt
1451 gctgttgaat tgtgtgtacc tgggttccat ggttgggcca cagctgctct
1501 gcttgttaca ttctgtttcg gatgggttct tataccagca attacattta
1551 tcatactaac agtcctaaag ttcattgcta atatttttca cacaagtaat
1601 caagagaata ggctaaaatc agtacttaga aagataaagg aagagtttga
1651 aaaaacaaaa ggctcaatgg tatgtgatgt ctgcaagtat gagtgtgaaa
1701 cctataaaga attaaaggca cacggggtat catgccccca atctcaatgt
1751 ccttactgtt ttactcattg tgaacccaca gaagcagcat tccaagctca
1801 ttacaaggta tgccaagtta ctcacagatt cagggatgat ctaaagaaaa
1851 ctgttactcc tcaaaatttt acaccaggat gttaccggac actaaattta
1901 tttagataca aaagcaggtg ctacatcttt acaatgtgga tatttcttct
1951 tgtcttagaa tccatactgt gggctgcaag tgcatcagag acaccattaa
2001 ctcctgtctg gaatgacaat gcccatgggg taggttctgt tcctatgcat
2051 acagatttag agcttgattt ctctttaaca tccagttcca agtatacata
2101 ccgtaggaag ttaacaaacc cacttgagga agcacaatcc attgacctac
2151 atattgaaat agaagaacag acaattggtg ttgatgtgca tgctctagga
2201 cactggtttg atggtcgtct taaccttaaa acatcctttc actgttatgg
2251 tgcttgtaca aagtatgaat acccttggca tactgcaaag tgccattatg
2301 aaagagatta ccaatatgag acgagctggg gttgtaatcc atcagattgt
2351 cctggggtgg gcacaggctg tacagcatgt ggtttatacc tagatcaact
2401 gaaaccagtt ggtagtgctt ataaaattat cacaataagg tacagcagga
2451 gagtctgtgt tcagtttggg gaggaaaacc tttgtaagat aatagacatg
2501 aatgattgtt ttgtatctag gcatgttaag gtctgcataa ttggtacagt
2551 atctaaattc tctcagggtg ataccttatt gttttttgga ccgcttgaag
2601 gtggtggtct aatatttaaa cactggtgta catccacatg tcaatttggt
2651 gacccaggag atatcatgag tccaagagac aaaggttttt tatgccctga
2701 gtttccaggt agtttcagga agaaatgcaa ctttgctact acccctattt
2751 gtgagtatga tggaaatatg gtctcaggtt acaagaaagt gatggcgaca
2801 attgattcct tccaatcttt taatacaagc actatgcact tcactgatga
2851 aaggatagag tggaaagacc ctgatggaat gctaagggac catataaaca
2901 ttttagtaac gaaggacatt gactttgata accttggtga aaatccttgc
2951 aaaattggcc tacaaacatc ttctattgag ggggcctggg gttctggtgt
3001 ggggttcaca ttaacatgtc tggtatcact aacagaatgt cctacctttt
3051 tgacctcaat aaaggcttgt gataaggcta tctgttatgg tgcagagagt
3101 gtaacattga caagaggaca aaatacagtc aaggtatcag ggaaaggtgg
3151 ccatagtggt tcaacattta ggtgttgcca tggggaggac tgttcacaaa
3201 ttggactcca tgctgctgca cctcaccttg acaaggtaaa tgggatttct
3251 gagatagaaa atagtaaagt atatgatgat ggggcaccgc aatgtgggat
3301 aaaatgttgg tttgttaaat caggggaatg gatttcaggg atattcagtg
3351 gtaattggat tgtactcatt gtcctctgtg tatttctatt gttctccttg
3401 gttttactaa gcattctctg tcccgtaagg aagcataaaa aatcatagct
3451 aaattctgtg actatcctgt tcttatgtat agctttaaca tatatactaa
3501 tttttatatt ccagtatact ctatctaaca cactaaaaaa aatagtagct
3551 ttctaaccac aaaacttaga ttcttcttct gtatgatgtc ttaacatctt
3601 gcggtgtcta ctacta