Sequence of DPV Parainfluenza virus 5

Simian parainfluenza virus 5 nucleocapsid protein (NP), V protein and phosphoprotein (V/P), membrane protein (M), fusion protein (F), small hydrophobic protein (SH), hemagglutinin-neuraminidase protein (HN), and L protein (L) genes, complete cds.

ACC No: AF052755

Dated: 2000-03-03 | Length: 15246 | CRC: 1356918567

                !!NA_SEQUENCE 1.0
ID   AF052755   standard; RNA; VRL; 15246 BP.
XX
AC   AF052755;
XX
SV   AF052755.1
XX
DT   23-MAR-1998 (Rel. 55, Created)
DT   03-MAR-2000 (Rel. 62, Last updated, Version 3)
XX
DE   Simian parainfluenza virus 5 nucleocapsid protein (NP), V protein and
DE   phosphoprotein (V/P), membrane protein (M), fusion protein (F), small
DE   hydrophobic protein (SH), hemagglutinin-neuraminidase protein (HN), and L
DE   protein (L) genes, complete cds.
XX
KW   .
XX
OS   Simian parainfluenza virus 5
OC   Viruses; ssRNA negative-strand viruses; Mononegavirales; Paramyxoviridae;
OC   Paramyxovirinae; Rubulavirus.
XX
RN   [1]
RP   4475-6209
RX   MEDLINE; 85038582.
RA   Paterson R.G., Harris T.J., Lamb R.A.;
RT   "Fusion protein of the paramyxovirus simian virus 5: nucleotide sequence of
RT   mRNA predicts a highly hydrophobic glycoprotein";
RL   Proc. Natl. Acad. Sci. U.S.A. 81(21):6706-6710(1984).
XX
RN   [2]
RP   6224-8387
RX   MEDLINE; 85135055.
RA   Hiebert S.W., Paterson R.G., Lamb R.A.;
RT   "Hemagglutinin-neuraminidase protein of the paramyxovirus simian virus 5:
RT   nucleotide sequence of the mRNA predicts an N-terminal membrane anchor";
RL   J. Virol. 54(1):1-6(1985).
XX
RN   [3]
RP   1793-3087
RX   MEDLINE; 88311091.
RA   Thomas S.M., Lamb R.A., Paterson R.G.;
RT   "Two mRNAs that differ by two nontemplated nucleotides encode the amino
RT   coterminal proteins P and V of the paramyxovirus SV5";
RL   Cell 54(6):891-902(1988).
XX
RN   [4]
RP   3093-4474
RX   MEDLINE; 90232733.
RA   Sheshberadaran H., Lamb R.A.;
RT   "Sequence characterization of the membrane protein gene of paramyxovirus
RT   simian virus 5";
RL   Virology 176(1):234-243(1990).
XX
RN   [5]
RP   1-1787, 8388-15246
RX   MEDLINE; 92327825.
RA   Parks G.D., Ward C.D., Lamb R.A.;
RT   "Molecular cloning of the NP and L genes of simian virus 5: identification
RT   of highly conserved domains in paramyxovirus NP and L proteins";
RL   Virus Res. 22(3):259-279(1992).
XX
RN   [6]
RP   1-15246
RX   MEDLINE; 98022952.
RA   He B., Paterson R.G., Ward C.D., Lamb R.A.;
RT   "Recovery of infectious SV5 from cloned DNA and expression of a foreign
RT   gene";
RL   Virology 237(2):249-260(1997).
XX
RN   [7]
RP   1-15246
RA   Lamb R.A.;
RT   ;
RL   Submitted (05-MAR-1998) to the EMBL/GenBank/DDBJ databases.
RL   Biochemistry, Molecular Biology and Cell Biology, Northwestern University
RL   and Howard Hughes Medical Institute, 2153 North Campus Drive, Evanston, IL
RL   60208-3500, USA
XX
DR   SWISS-PROT; P04849; VGLF_SV5.
DR   SWISS-PROT; P04850; HEMA_SV5.
DR   SWISS-PROT; P07577; VSH_SV5.
DR   SWISS-PROT; P11207; VV_SV5.
DR   SWISS-PROT; P11208; RRPP_SV5.
DR   SWISS-PROT; P16629; VMAT_SV5.
DR   SWISS-PROT; Q88434; RRPL_SV5.
DR   SWISS-PROT; Q88435; NCAP_SV5.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .15246
FT                   /db_xref="taxon:11207"
FT                   /note="infectious clone constructed from cDNAs in
FT                   references 1-5; infectious clone yields infectious virus in
FT                   rescue assays (see reference 6); the genome length is
FT                   divisible by six and fulfills 'rule of six' criteria as
FT                   described by Kolakofsky, et. al. (J. Virol. 72, 891-899,
FT                   1998)"
FT                   /organism="Simian parainfluenza virus 5"
FT                   /strain="W3A"
FT   misc_feature    1. .55
FT                   /note="SV5 leader sequence"
FT   misc_feature    56. .68
FT                   /note="NP transcription initiation site"
FT                   /gene="NP"
FT   mRNA            56. .1787
FT                   /gene="NP"
FT                   /product="nucleocapsid protein"
FT   CDS             152. .1681
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:Q88435"
FT                   /gene="NP"
FT                   /product="nucleocapsid protein"
FT                   /protein_id="AAC95511.1"
FT                   /translation="MSSVLKAYERFTLTQELQDQSEEGTIPPTTLKPVIRVFILTSNNP
FT                   ELRSRLLLFCLRIVLSNGARDSHRFGALLTMFSLPSATMLNHVKLADQSPEADIERVEI
FT                   DGFEEGSFRLIPNARSGMSRGEINAYAALAEDLPDTLNHATPFVDSEVEGTAWDEIETF
FT                   LDMCYSVLMQAWIVTCKCMTAPDQPAASIEKRLQKYRQQGRINPRYLLQPEARRIIQNV
FT                   IRKGMVVRHFLTFELQLARAQSLVSNRYYAMVGDVGKYIENCGMGGFFLTLKYALGTRW
FT                   PTLALAAFSGELTKLKSLMALYQTLGEQARYLALLESPHLMDFAAANYPLLYSYAMGIG
FT                   YVLDVNMRNYAFSRSYMNKTYFQLGMETARKQQGAVDMRMAEDLGLTQAERTEMANTLA
FT                   KLTTANRGADTRGGVNPFSSVTGTTQVPAAATGDTLESYMAADRLRQRYADAGTHDDEM
FT                   PPLEEEEEDDTSAGPRTGPTLEQVALDIQNAAVGAPIHTDDLNAALGDLDI"
FT   misc_feature    1774. .1787
FT                   /note="NP transcription termination site"
FT                   /gene="NP"
FT   mRNA            1789. .3092
FT                   /gene="V/P"
FT                   /product="V protein and phosphoprotein transcript"
FT   misc_feature    1789. .1802
FT                   /note="V/P transcription initiation site"
FT                   /gene="V/P"
FT   CDS             join(1850. .2340,2339. .3026)
FT                   /citation=[3]
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P11208"
FT                   /note="editing process yields phosphoprotein via a
FT                   stuttering mechanism, resulting in frameshift"
FT                   /gene="V/P"
FT                   /product="phosphoprotein"
FT                   /protein_id="AAC95513.1"
FT                   /translation="MDPTDLSFSPDEINKLIETGLNTVEYFTSQQVTGTSSLGKNTIPP
FT                   GVTGLLTNAAEAKIQESTNHQKGSVGGGAKPKKPRPKIAIVPADDKTVPGKPIPNPLLG
FT                   LDSTPSTQTVLDLSGKTLPSGSYKGVKLAKFGKENLMTRFIEEPRENPIATSSPIDFKR
FT                   GAGIPAGSIEGSTQSDGWEMKSRSLSGAIHPVLQSPLQQGDLNALVTSVQSLALNVNEI
FT                   LNTVRNLDSRMNQLETKVDRILSSQSLIQTIKNDIVGLKAGMATLEGMITTVKIMDPGV
FT                   PSNVTVEDVRKTLSNHAVVVPESFNDSFLTQSEDVISLDELARPTATSVKKIVRKVPPQ
FT                   KDLTGLKITLEQLAKDCISKPKMREEYLLKINQASSEAQLIDLKKAIIRSAI"
FT   CDS             1850. .2518
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P11207"
FT                   /gene="V/P"
FT                   /product="V protein"
FT                   /protein_id="AAC95512.1"
FT                   /translation="MDPTDLSFSPDEINKLIETGLNTVEYFTSQQVTGTSSLGKNTIPP
FT                   GVTGLLTNAAEAKIQESTNHQKGSVGGGAKPKKPRPKIAIVPADDKTVPGKPIPNPLLG
FT                   LDSTPSTQTVLDLSGKTLPSGSYKGVKLAKFGKENLMTRFIEEPRENPIATSSPIDFKR
FT                   GRDTGGFHRREYSIGWVGDEVKVTEWCNPSCSPITAAARRFECTCHQCPVTCSECERDT
FT                   "
FT   misc_feature    3081. .3092
FT                   /note="V/P transcription termination site"
FT                   /gene="V/P"
FT   mRNA            3109. .4478
FT                   /gene="M"
FT                   /product="membrane protein"
FT   misc_feature    3109. .3120
FT                   /note="M transcription initiation site"
FT                   /gene="M"
FT   CDS             3141. .4274
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P16629"
FT                   /gene="M"
FT                   /product="membrane protein"
FT                   /protein_id="AAC95514.1"
FT                   /translation="MPSISIPADPTNPRQSIKAFPIVINSDGGEKGRLVKQLRTTYLND
FT                   LDTHEPLVTFINTYGFIYEQDRGNTIVGEDQLGKKREAVTAAMVTLGCGPNLPSLGNVL
FT                   GQLREFQVTVRKTSSKAEEMVFEIVKYPRIFRGHTLIQKGLVCVSAEKFVKSPGKIQSG
FT                   MDYLFIPTFLSVTYCPAAIKFQVPGPMLKMRSRYTQSLQLELMIRILCKPDSPLMKVHT
FT                   PDKEGRGCLVSVWLHVCNIFKSGNKNGSEWQEYWMRKCANMQLEVSIADMWGPTIIIHA
FT                   RGHIPKSAKLFFGKGGWSCHPLHEVVPSVTKTLWSVGCEITKAKAIIQESSISLLVETT
FT                   DIISPKVKISSKHRRFVKSNWGLFKKTKSLPNLTELE"
FT   misc_feature    4468. .4478
FT                   /note="M transcription termination site"
FT                   /gene="M"
FT   misc_feature    4502. .4514
FT                   /note="F transcription initiation site"
FT                   /gene="F"
FT   mRNA            4502. .6219
FT                   /gene="F"
FT                   /product="fusion protein"
FT   CDS             4530. .6119
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P04849"
FT                   /gene="F"
FT                   /product="fusion protein"
FT                   /protein_id="AAC95515.1"
FT                   /translation="MGTIIQFLVVSCLLAGAGSLDPAALMQIGVIPTNVRQLMYYTEAS
FT                   SAFIVVKLMPTIDSPISGCNITSISSYNATVTKLLQPIGENLETIRNQLIPTRRRRRFA
FT                   GVVIGLAALGVATAAQVTAAVALVKANENAAAILNLKNAIQKTNAAVADVVQATQSLGT
FT                   AVQAVQDHINSVVSPAITAANCKAQDAIIGSILNLYLTELTTIFHNQITNPALSPITIQ
FT                   ALRILLGSTLPTVVEKSFNTQISAAELLSSGLLTGQIVGLDLTYMQMVIKIELPTLTVQ
FT                   PATQIIDLATISAFINNQEVMAQLPTRVMVTGSLIQAYPASQCTITPNTVYCRYNDAQV
FT                   LSDDTMACLQGNLTRCTFSPVVGSFLTRFVLFDGIVYANCRSMLCKCMQPAAVILQPSS
FT                   SPVTVIDMYKCVSLQLDNLRFTITQLANVTYNSTIKLESSQILSIDPLDISQNLAAVNK
FT                   SLSDALQHLAQSDTYLSAITSATTTSVLSIIAICLGSLGLILIILLSVVVWKLLTIVVA
FT                   NRNRMENFVYHK"
FT   misc_feature    6205. .6219
FT                   /note="F transcription termination site"
FT                   /gene="F"
FT   mRNA            6224. .6515
FT                   /gene="SH"
FT                   /product="small hydrophobic protein"
FT   misc_feature    6224. .6236
FT                   /note="SH transcription initiation site"
FT                   /gene="SH"
FT   CDS             6303. .6437
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P07577"
FT                   /gene="SH"
FT                   /product="small hydrophobic protein"
FT                   /protein_id="AAC95516.1"
FT                   /translation="MLPDPEDPESKKATRRAGNLIICFLFIFFLFVTFIVPTLRHLLS"
FT   misc_feature    6502. .6515
FT                   /note="SH transcription termination site"
FT                   /gene="SH"
FT   mRNA            6517. .8392
FT                   /gene="HN"
FT                   /product="hemagglutinin-neuraminidase protein"
FT   misc_feature    6517. .6529
FT                   /note="HN transcription initiation site"
FT                   /gene="HN"
FT   CDS             6584. .8281
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P04850"
FT                   /gene="HN"
FT                   /product="hemagglutinin-neuraminidase protein"
FT                   /protein_id="AAC95517.1"
FT                   /translation="MVAEDAPVRATCRVLFRTTTLIFLCTLLALSISILYESLITQKQI
FT                   MSQAGSTGSNSGLGSITDLLNNILSVANQIIYNSAVALPLQLDTLESTLLTAIKSLQTS
FT                   DKLEQNCSWSAALINDNRYINGINQFYFSIAEGRNLTLGPLLNMPSFIPTATTPEGCTR
FT                   IPSFSLTKTHWCYTHNVILNGCQDHVSSNQFVSMGIIEPTSAGFPFFRTLKTLYLSDGV
FT                   NRKSCSISTVPGGCMMYCFVSTQPERDDYFSAAPPEQRIIIMYYNDTIVERIINPPGVL
FT                   DVWATLNPGTGSGVYYLGWVLFPIYGGVIKGTSLWNNQANKYFIPQMVAALCSQNQATQ
FT                   VQNAKSSYYSSWFGNRMIQSGILACPLRQDLTNECLVLPFSNDQVLMGAEGRLYMYGDS
FT                   VYYYQRSNSWWPMTMLYKVTITFTNGQPSAISAQNVPTQQVPRPGTGDCSATNRCPGFC
FT                   LTGVYADAWLLTNPSSTSTFGSEATFTGSYLNTATQRINPTMYIANNTQIISSQQFGSS
FT                   GQEAAYGHTTCFRDTGSVMVYCIYIIELSSSLLGQFQIVPFIRQVTLS"
FT   misc_feature    8381. .8392
FT                   /note="HN transcription termination site"
FT                   /gene="HN"
FT   mRNA            8406. .15215
FT                   /gene="L"
FT                   /product="L protein"
FT   misc_feature    8406. .8416
FT                   /note="L transcription initiation site"
FT                   /gene="L"
FT   CDS             8414. .15181
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:Q88434"
FT                   /gene="L"
FT                   /product="L protein"
FT                   /protein_id="AAC95518.1"
FT                   /translation="MAGSREILLPEVHLNSPIVKHKLYYYILLGNLPNEIDLDDLGPLH
FT                   NQNWNQIAHEESNLAQRLVNVRNFLITHIPDLRKGHWQEYVNVILWPRILPLIPDFKIN
FT                   DQLPLLKNWDKLVKESCSVINAGTSQCIQNLSYGLTGRGNLFTRSRELSGDRRDIDLKT
FT                   VVAAWHDSDWKRISDFWIMIKFQMRQLIVRQTDHNDSDLITYIENREGIIIITPELVAL
FT                   FNTENHTLTYMTFEIVLMVSDMYEGRHNILSLCTVSTYLNPLKKRITYLLSLVDNLAFQ
FT                   IGDAVYNIIALLESFVYAQLQMSDPIPELRGQFHAFVCSEILDALRGTNSFTQDELRTV
FT                   TTNLISPFQDLTPDLTAELLCIMRLWGHPMLTASQAAGKVRESMCAGKVLDFPTIMKTL
FT                   AFFHTILINGYRRKHHGVWPPLNLPGNASKGLTELMNDNTEISYEFTLKHWKEVSLIKF
FT                   KKCFDADAGEELSIFMKDKAISAPKQDWMSVFRRSLIKQRHQHHQVPLPNPFNRRLLLN
FT                   FLGDDKFDPNVELQYVTSGEYLHDDTFCASYSLKEKEIKPDGRIFAKLTKRMRSCQVIA
FT                   ESLLANHAGKLMKENGVVMNQLSLTKSLLTMSQIGIISEKARKSTRDNINQPGFQNIQR
FT                   NKSHHSKQVNQRDPSDDFELAASFLTTDLKKYCLQWRYQTIIPFAQSLNRMYGYPHLFE
FT                   WIHLRLMRSTLYVGDPFNPPADTSQFDLDKVINGDIFIVSPRGGIEGLCQKAWTMISIA
FT                   VIILSATESGTRVMSMVQGDNQAIAVTTRVPRSLPTLEKKTIAFRSCNLFFERLKCNNF
FT                   GLGHHLKEQETIISSHFFVYSKRIFYQGRILTQALKNASKLCLTADVLGECTQSSCSNL
FT                   ATTVMRLTENGVEKDICFYLNIYMTIKQLSYDIIFPQVSIPGDQITLEYINNPHLVSRL
FT                   ALLPSQLGGLNYLSCSRLFNRNIGDPVVSAVADLKRLIKSGCMDYWILYNLLGRKPGNG
FT                   SWATLAADPYSINIEYQYPPTTALKRHTQQALMELSTNPMLRGIFSDNAQAEENNLARF
FT                   LLDREVIFPRVAHIIIEQTSVGRRKQIQGYLDSTRSIMRKSLEIKPLSNRKLNEILDYN
FT                   INYLAYNLALLKNAIEPPTYLKAMTLETCSIDIARNLRKLSWAPLLGGRNLEGLETPDP
FT                   IEITAGALIVGSGYCEQCAAGDNRFTWFFLPSGIEIGGDPRDNPPIRVPYIGSRTDERR
FT                   VASMAYIRGASSSLKAVLRLAGVYIWAFGDTLENWIDALDLSHTRVNITLEQLQSLTPL
FT                   PTSANLTHRLDDGTTTLKFTPASSYTFSSFTHISNDEQYLTINDKTADSNIIYQQLMIT
FT                   GLGILETWNNPPINRTFEESTLHLHTGASCCVRPVDSCILSEALTVKPHITVPYSNKFV
FT                   FDEDPLSEYETAKLESLSFQAQLGNIDAVDMTGKLTLLSQFTARQIINAITGLDESVSL
FT                   TNDAIVASDYVSNWISECMYTKLDELFMYCGWELLLELSYQMYYLRVVGWSNIVDYSYM
FT                   ILRRIPGAALNNLASTLSHPKLFRRAINLDIVAPLNAPHFASLDYIKMSVDAILWGCKR
FT                   VINVLSNGGDLELVVTSEDSLILSDRSMNLIARKLTLLSLIHHNGLELPKIKGFSPDEK
FT                   CFALTEFLRKVVNSGLSSIENLSNFMYNVENPRLAAFASNNYYLTRKLLNSIRDTESGQ
FT                   VAVTSYYESLEYIDSLKLTPHVPGTSCIEDDSLCTNDYIIWIIESNANLEKYPIPNSPE
FT                   DDSNFHNFKLNAPSHHTLRPLGLSSTAWYKGISCCRYLERLKLPQGDHLYIAEGSGASM
FT                   TIIEYLFPGRKIYYNSLFSSGDNPPQRNYAPMPTQFIESVPYKLWQAHTDQYPEIFEDF
FT                   IPLWNGNAAMTDIGMTACVEFIINRVGPRTCSLVHVDLESSASLNQQCLSKPIINAIIT
FT                   ATTVLCPHGVLILKYSWLPFTRFSTLITFLWCYFERITVLRSTYSDPANHEVYLICILA
FT                   NNFAFQTVSQATGMAMTLTDQGFTLISPERINQYWDGHLKQERIVAEAIDKVVLGENAL
FT                   FNSSDNELILKCGGTPNARNLIDIEPVATFIEFEQLICTMLTTHLKEIIDITRSGTQDY
FT                   ESLLLTPYNLGLLGKISTIVRLLTERILNHTIRNWLILPPSLRMIVKQDLEFGIFRITS
FT                   ILNSDRFLKLSPNRKYLIAQLTAGYIRKLIEGDCNIDLTRPIQKQIWKALGCVVYCHDP
FT                   MDQRESTEFIDININEEIDRGIDGEEI"
FT   misc_feature    15204. .15215
FT                   /note="L transcription termination site"
FT                   /gene="L"
FT   misc_feature    15216. .15246
FT                   /note="SV5 trailer sequence"
XX
SQ   Sequence 15246 BP; 4732 A; 3289 C; 3155 G; 4070 T; 0 other;

AF052755  Length: 15246  September 27, 2002 09:40  Type: N  Check: 8273  ..

       1  accaagggga aaatgaagtg gtgactcaaa tcatcgaaga ccctcgagat
      51  tacataggtc cggaacctat ggccttcgtg accgacctcg agtcagagta
     101  gttcaataag gacctatcaa gtttgggcaa tttttcgtcc ccgacacaaa
     151  aatgtcatcc gtgcttaaag catatgagcg attcacgctc actcaagaac
     201  tgcaagatca gagtgaggaa ggtacaatcc cacctacaac actaaaaccg
     251  gtaatcaggg tatttatact aacctctaat aacccagagc taagatcccg
     301  gcttcttcta ttctgcctac ggattgttct cagtaatggt gcaagggatt
     351  cccatcgctt tggagcatta ctcacaatgt tttcgctacc atcagccaca
     401  atgctcaatc atgtcaaatt agctgaccag tcaccagaag ctgatatcga
     451  aagggtagag atcgatggct ttgaggaggg atcattccgc ttaatcccca
     501  atgcacgttc aggtatgagc cgtggagaga tcaatgccta tgctgcactt
     551  gcagaagatc tacctgacac actaaaccat gcaacacctt tcgttgattc
     601  cgaagtcgag ggaactgcat gggatgagat tgagactttc ttagatatgt
     651  gttacagtgt cctaatgcag gcatggatag tgacttgcaa gtgcatgact
     701  gcgccagacc aacctgctgc ttctattgag aaacgcctgc aaaaatatcg
     751  tcagcaaggc aggatcaacc cgagatatct cctgcaaccg gaggctcgac
     801  gaataatcca gaatgtaatc cggaagggaa tggtggtcag acatttcctc
     851  acctttgaac tgcagcttgc ccgagcacaa agccttgtat caaataggta
     901  ttatgctatg gtaggggatg ttggaaagta tatagagaat tgtggaatgg
     951  gaggcttctt tttgacacta aaatatgcat taggaactag atggcccaca
    1001  cttgctttag ctgcattttc aggagagcta acaaagctaa agtccctcat
    1051  ggcattatac cagacccttg gtgagcaggc ccgatatttg gccctattgg
    1101  agtcaccaca tttgatggat tttgctgcag caaactaccc actgctatat
    1151  agctatgcta tgggaatagg ctatgtgtta gatgtcaaca tgaggaacta
    1201  cgctttctcc agatcataca tgaacaagac atatttccaa ttgggaatgg
    1251  aaactgcaag aaaacaacag ggtgcagttg acatgaggat ggcagaagat
    1301  ctcggtctaa ctcaagccga acgcaccgag atggcaaata cacttgccaa
    1351  attgaccaca gcaaatcgag gggcagacac caggggagga gtcaacccgt
    1401  tctcatctgt cactgggaca actcaggtgc ccgctgcagc aacaggtgac
    1451  acactcgaga gttacatggc agcggatcga ctgaggcaga gatatgctga
    1501  tgcaggcacc catgatgatg agatgccacc attggaagag gaggaagagg
    1551  acgacacatc tgcaggtcca cgcactggac caactcttga acaagtggcc
    1601  ttggacatcc agaacgcagc agttggagct cccatccata cagatgacct
    1651  gaatgccgca ctgggtgatc ttgacatcta gacaattcag atcccaatct
    1701  aaaattgaca tacctaattg attagttaga tggaactaca gtggattcca
    1751  taaggttcct gcctaccatc ggctttaaag aaaaaaatag gcccggacgg
    1801  gttagcaaca agcgactgcc ggtgccaaca gcgcaatcca caatctacaa
    1851  tggatcccac tgatctgagc ttctccccag atgagatcaa taagctcata
    1901  gagacaggcc tgaatactgt agagtatttt acttcccaac aagtcacagg
    1951  aacatcctct cttggaaaga atacaatacc accaggggtc acaggactac
    2001  taaccaatgc tgcagaggca aagatccaag agtcaactaa ccatcagaag
    2051  ggctcagttg gtgggggtgc aaaaccaaag aaaccgcgac caaaaattgc
    2101  cattgtgcca gcagatgaca aaacagtgcc cggaaagccg atcccaaacc
    2151  ctctattagg tctggactcc accccgagca cccaaactgt gcttgatcta
    2201  agtgggaaaa cattaccatc aggatcctat aagggggtta agcttgcgaa
    2251  atttggaaaa gaaaatctga tgacacggtt catcgaggaa cccagagaga
    2301  atcctatcgc aaccagttcc cccatcgatt ttaagagggg cagggatacc
    2351  ggcgggttcc atagaaggga gtactcaatc ggatgggtgg gagatgaagt
    2401  caaggtcact gagtggtgca atccatcctg ttctccaatc accgctgcag
    2451  caaggcgatt tgaatgcact tgtcaccagt gtccagtcac ttgctctgaa
    2501  tgtgaacgag atacttaata cagtgagaaa tttggactct cggatgaatc
    2551  aactggagac aaaagtagat cgcattctct catctcagtc tctaatccag
    2601  accatcaaga atgacatagt tggacttaaa gcagggatgg ctactttaga
    2651  aggaatgatt acaactgtga aaatcatgga cccgggagtt cccagtaatg
    2701  ttactgtgga agatgtacgc aagacactaa gtaaccatgc tgttgttgtg
    2751  ccagaatcat tcaatgatag tttcttgact caatctgaag atgtaatttc
    2801  acttgatgag ttggctcgac caactgcaac aagtgttaag aagattgtca
    2851  ggaaggttcc tcctcagaag gatctgactg gattgaagat tacactagag
    2901  caattggcaa aggattgcat cagcaaaccg aagatgaggg aagagtatct
    2951  cctcaaaatc aaccaggctt ccagtgaggc tcagctaatt gacctcaaga
    3001  aagcaatcat ccgcagtgca atttgatcaa gaaacaccca attacactac
    3051  actggtatga cactgtacta accctgaggg ttttagaaaa aacgattaac
    3101  gataaataag cccgaacact acacactacc tgaggcagcc atgccatcca
    3151  tcagcattcc cgcagacccc accaatccac gtcaatcaat aaaagcgttc
    3201  ccaattgtga tcaacagtga tgggggtgag aaaggccgct tggttaaaca
    3251  actacgcaca acctacttga atgacctaga tactcatgag ccactggtga
    3301  cattcataaa tacctatgga ttcatctacg aacaggatcg ggggaatacc
    3351  attgtcggag aggatcaact tgggaagaaa agagaggctg tgaccgctgc
    3401  aatggttacc cttggatgtg ggcctaatct accatcatta gggaatgtcc
    3451  tgggacaact gagggaattc caggtcactg ttaggaagac atccagcaaa
    3501  gcggaagaga tggtctttga aattgttaag tatccgagaa tatttcgggg
    3551  tcatacatta atccagaaag gactagtctg tgtctccgca gaaaaatttg
    3601  ttaagtcacc agggaaaata caatctggaa tggactatct cttcattccg
    3651  acatttctgt cagtgactta ctgtccagct gcaatcaaat ttcaggtacc
    3701  tggccccatg ttgaaaatga gatcaagata cactcagagc ttacaacttg
    3751  aactaatgat aagaatcctg tgtaagcccg attcgccact tatgaaggtc
    3801  catacccctg acaaggaggg aagaggatgt cttgtatcag tatggctgca
    3851  tgtatgcaac atcttcaaat caggaaacaa gaatggcagt gagtggcagg
    3901  aatactggat gagaaagtgt gctaacatgc aacttgaagt gtcgattgca
    3951  gatatgtggg gaccaactat cataattcat gccagaggtc acattcccaa
    4001  aagtgctaag ttgttttttg gaaagggtgg atggagctgc catccacttc
    4051  acgaagttgt tccaagtgtc actaaaacac tatggtccgt gggctgtgag
    4101  attacaaagg cgaaggcaat aatacaagag agtagcatct ctcttctcgt
    4151  ggagactact gacatcataa gtccaaaagt caaaatttca tctaagcatc
    4201  gccgctttgt gaaatcaaat tggggtctgt tcaagaaaac taaatcactg
    4251  cctaacctga cggagctgga atgactgacc tctaatcgag actacaccgc
    4301  cgcaaactat aggtgggtgg tacctcagtg attaatcttg taagcactga
    4351  tcgtaggcta caacacacta atattatcca gattagagag cttaattagc
    4401  tctgtattaa taataacact actattccaa taactggaat caccagcttg
    4451  atttatctcc aaaatgattc aaagaaaaca aatcatatta agactatcct
    4501  aagcacgaac ccatatcgtc cttcaaatca tgggtactat aattcaattt
    4551  ctggtggtct cctgtctatt ggcaggagca ggcagccttg atccagcagc
    4601  cctcatgcaa atcggtgtca ttccaacaaa tgtccggcaa cttatgtatt
    4651  atactgaggc ctcatcagca ttcattgttg tgaagttaat gcctacaatt
    4701  gactcgccga ttagtggatg taatataaca tcaatttcaa gctataatgc
    4751  aacagtgaca aaactcctac agccgatcgg tgagaatttg gagacgatta
    4801  ggaaccagtt gattccaact cggaggagac gccggtttgc aggggtggtg
    4851  attggattag ctgcattagg agtagctact gccgcacagg tcactgccgc
    4901  agtggcacta gtaaaggcaa atgaaaatgc tgcggctata ctcaatctca
    4951  aaaatgcaat ccaaaaaaca aatgcagcag ttgcagatgt ggtccaggcc
    5001  acacaatcac taggaacggc agttcaagca gttcaagatc acataaacag
    5051  tgtggtaagt ccagcaatta cagcagccaa ttgtaaggcc caagatgcta
    5101  tcattggctc aatcctcaat ctctatttga ccgagttgac aaccatcttc
    5151  cacaatcaaa ttacaaaccc tgcattgagt cccattacaa ttcaagcttt
    5201  aaggatccta ctggggagta ccttgccgac tgtggtcgaa aaatctttca
    5251  atacccagat aagtgcagct gagcttctct catcagggtt attgacaggc
    5301  cagattgtgg gattagattt gacctatatg cagatggtca taaaaattga
    5351  gctgccaact ttaactgtac aacctgcaac ccagatcata gatctggcca
    5401  ccatttctgc attcattaac aatcaagaag tcatggccca attaccaaca
    5451  cgtgttatgg tgactggcag cttgatccaa gcctatcccg catcgcaatg
    5501  caccattaca cccaacactg tgtactgtag gtataatgat gcccaagtac
    5551  tctcagatga tactatggct tgcctccaag gtaacttgac aagatgcacc
    5601  ttctctccag tggttgggag ctttctcact cgattcgtgc tgttcgatgg
    5651  aatagtttat gcaaattgca ggtcgatgtt gtgcaagtgc atgcaacctg
    5701  ctgctgtgat cctacagccg agttcatccc ctgtaactgt cattgacatg
    5751  tacaaatgtg tgagtctgca gcttgacaat ctcagattca ccatcactca
    5801  attggccaat gtaacctaca atagcaccat caagcttgaa tcatcccaga
    5851  tcttgtctat tgatccgttg gatatatccc aaaatctagc tgcggtgaat
    5901  aagagtctaa gtgatgcact acaacactta gcacaaagtg acacatatct
    5951  ttctgcaatc acatcagcta cgactacaag tgtattatcc ataatagcaa
    6001  tctgtcttgg atcgttaggt ttaatattaa taatcttgct cagtgtagtt
    6051  gtgtggaagt tattgaccat tgtcgttgct aatcgaaata gaatggagaa
    6101  ttttgtttat cataaataag cattccacca ctcacgatct gatctcagtg
    6151  agaaaaatca acctgcaact cttggaacaa gataagacag tcatccatta
    6201  gtaattttta agaaaaaaac gataggaccg aacctagtat tgaaagaacc
    6251  gtctcggtca atctaggtaa tcgagctgat accgtctcgg aaagctcaaa
    6301  tcatgctgcc tgatccggaa gatccggaaa gcaagaaagc tacaaggaga
    6351  gcaggaaacc taattatctg cttcctattc atcttctttc tgtttgtaac
    6401  cttcattgtt ccaactctaa gacacttgct gtcctaacac ctgctatagg
    6451  ctatccactg catcatctct cctgccatac ttcctactca catcatatct
    6501  attttaaaga aaaaataggc ccgaacacta atcgtgccgg cagtgccact
    6551  gcacacacaa cactacacat acaatacact acaatggttg cagaagatgc
    6601  ccctgttagg gccacttgcc gagtattatt tcgaacaaca actttaatct
    6651  ttctatgcac actactagca ttaagcatct ctatccttta tgagagttta
    6701  ataacccaaa agcaaatcat gagccaagca ggctcaactg gatctaattc
    6751  tggattagga agtatcactg atcttcttaa taatattctc tctgtcgcaa
    6801  atcagattat atataactct gcagtcgctc tacctctaca attggacact
    6851  cttgaatcaa cactccttac agccattaag tctcttcaaa ccagtgacaa
    6901  gctagaacag aactgctcgt ggagtgctgc actgattaat gataatagat
    6951  acattaatgg catcaatcag ttctattttt caattgctga gggtcgcaat
    7001  ctgacacttg gcccacttct taatatgcct agtttcattc caactgccac
    7051  gacaccagag ggctgcacca ggatcccatc attctcgctc actaagacac
    7101  actggtgtta tacacacaat gttatcctga atggatgcca ggatcatgta
    7151  tcctcaaatc aatttgtttc catgggaatc attgaaccca cttctgccgg
    7201  gtttccattc tttcgaaccc taaagactct atatctcagc gatggggtca
    7251  atcgtaagag ctgctctatc agtacagttc cggggggttg tatgatgtac
    7301  tgttttgttt ctactcaacc agagagggat gactactttt ctgccgctcc
    7351  tccagaacaa cgaattatta taatgtacta taatgataca atcgtggagc
    7401  gcataattaa tccacccggg gtactagatg tatgggcaac attgaaccca
    7451  ggaacaggaa gcggggtata ttatttaggt tgggtgctct ttccaatata
    7501  tggcggcgtg attaaaggta cgagtttatg gaataatcaa gcaaataaat
    7551  actttatccc ccagatggtt gctgctctct gctcacaaaa ccaggcaact
    7601  caagtccaaa atgctaagtc atcatactat agcagctggt ttggcaatcg
    7651  aatgattcag tctgggatcc tggcatgtcc tcttcgacag gatctaacca
    7701  atgagtgttt agttctgccc ttttctaatg atcaggtgct tatgggtgct
    7751  gaagggagat tatacatgta tggtgactcg gtgtattact atcaaagaag
    7801  caatagttgg tggcctatga ccatgctgta taaggtaacc ataacattca
    7851  ctaatggtca gccatctgct atatcagctc agaatgtgcc cacacagcag
    7901  gtccctagac ctgggacagg agactgctct gcaaccaata gatgtcccgg
    7951  tttttgcttg acaggagtgt atgccgatgc ctggttactg accaaccctt
    8001  cgtctaccag tacatttgga tcagaagcaa ccttcactgg ttcttatctc
    8051  aacacagcaa ctcagcgtat caatccgacg atgtatatcg cgaacaacac
    8101  acagatcata agctcacagc aatttggatc aagcggtcaa gaagcagcat
    8151  atggccacac aacctgtttt agggacacag gctctgttat ggtatactgt
    8201  atctatatta ttgaattgtc ctcatctctc ttaggacaat ttcagattgt
    8251  cccatttatc cgtcaggtga cactatccta aaggcagaag ccttcaggtc
    8301  tgacccagcc aatcaaagca ttataccaga ccatggaatg cataccaaac
    8351  attattgaca ctaatgacac acaaaattgg ttttaagaaa aaccaagaga
    8401  acaataggcc agaatggctg ggtctcggga gatattactc cctgaagtcc
    8451  atctcaattc accaattgta aagcataagc tatactatta cattctactt
    8501  ggaaacctcc caaatgagat cgaccttgac gatttaggtc cattacataa
    8551  tcaaaattgg aatcagatag cacatgaaga gtctaactta gctcaacgct
    8601  tggtaaatgt aagaaatttt ctaattaccc acatccctga tcttagaaag
    8651  ggccattggc aagagtatgt caatgtaata ctgtggccgc gaattcttcc
    8701  cttgatcccg gattttaaaa tcaatgacca attgcctctg ctcaaaaatt
    8751  gggacaagtt agttaaagaa tcatgttcag taatcaatgc aggtacttcc
    8801  cagtgcattc agaatctcag ctatggactg acaggtcgtg ggaacctctt
    8851  tacacgatca cgtgaactct ctggtgaccg cagggatatt gatcttaaga
    8901  cagttgtggc agcatggcat gactcagact ggaaaagaat aagtgatttt
    8951  tggattatga tcaaattcca gatgagacaa ttaattgtta ggcaaacaga
    9001  tcataatgat tctgatttaa tcacgtatat cgaaaataga gaaggcataa
    9051  tcatcataac ccctgaactg gtagcattat ttaacactga gaatcataca
    9101  ctaacataca tgacctttga aattgtactg atggtttcag atatgtacga
    9151  aggtcgtcac aacattttat cactatgcac agttagcact tacctgaatc
    9201  ctctgaagaa aagaataaca tatttattga gccttgtaga taacttagct
    9251  tttcagatag gtgatgctgt atataacata attgctttgc tagaatcctt
    9301  tgtatatgca cagttgcaaa tgtcagatcc catcccagaa ctcagaggac
    9351  aattccatgc attcgtatgt tctgagattc ttgatgcact aagaggaact
    9401  aatagtttca cccaggatga attaagaact gtgacaacta atttgatatc
    9451  cccattccaa gatctgaccc cagatcttac ggctgaattg ctctgtataa
    9501  tgaggctttg gggacacccc atgctcactg ccagtcaagc tgcaggaaag
    9551  gtacgcgagt ctatgtgtgc tggaaaagta ttagactttc ccaccattat
    9601  gaaaacacta gcctttttcc atactattct gatcaatgga tacaggagga
    9651  agcatcatgg agtatggcca cccttaaact taccgggtaa tgcttcaaag
    9701  ggtctcacgg aacttatgaa tgacaatact gaaataagct atgaattcac
    9751  acttaagcat tggaaggaag tctctcttat aaaattcaag aaatgttttg
    9801  atgcagacgc aggtgaggaa ctcagtatat ttatgaaaga taaggcaatt
    9851  agtgccccaa aacaagactg gatgagtgtg tttagaagaa gcctaatcaa
    9901  acagcgccat cagcatcatc aggtccccct accaaatcca ttcaatcgac
    9951  ggctgttgct aaactttctc ggagatgaca aattcgaccc gaatgtggag
   10001  ctacagtatg taacatcagg tgagtatcta catgatgaca cgttttgtgc
   10051  atcatattca ctaaaagaga aggaaattaa acctgatggt cgaatttttg
   10101  caaagttgac taagagaatg agatcatgtc aagttatagc agaatctctt
   10151  ttagcgaacc atgctgggaa gttaatgaaa gagaatggtg ttgtgatgaa
   10201  tcagctatca ttaacaaaat cactattaac aatgagtcag attggaataa
   10251  tatccgagaa agctagaaag tcaactcgag ataacataaa tcaacctggt
   10301  ttccagaata tccagagaaa taaatcacat cactccaagc aagtcaatca
   10351  gcgagatcca agtgatgact ttgaattggc agcatctttt ttaactactg
   10401  atctcaaaaa atattgttta caatggaggt accagacaat tatcccattt
   10451  gctcaatcat taaacagaat gtatggttat cctcatctct ttgagtggat
   10501  tcacttacgg ctaatgcgta gtacacttta cgtgggggat cccttcaacc
   10551  caccagcaga taccagtcaa tttgatctag ataaagtaat taatggagat
   10601  atcttcattg tatcacccag aggtggaatt gaagggctgt gtcaaaaggc
   10651  ttggacaatg atatctatcg ctgtgataat tctatctgcc acagagtctg
   10701  gcacacgagt aatgagtatg gtgcagggag ataatcaagc aattgctgtc
   10751  accacacgag taccaaggag cctgccgact cttgagaaaa agactattgc
   10801  ttttagatct tgtaatctat tctttgagag gttaaaatgt aataattttg
   10851  gattaggtca ccatttgaaa gaacaagaga ctatcattag ttctcacttc
   10901  tttgtttata gcaagagaat attctatcag gggaggattc taacgcaagc
   10951  cttaaaaaat gctagtaagc tctgcttgac agctgatgtc ctaggagaat
   11001  gcacccaatc atcatgttct aatcttgcaa ctactgtcat gaggttaact
   11051  gagaatggtg ttgaaaaaga tatctgtttc tacttgaata tctatatgac
   11101  catcaaacag ctctcctatg atatcatctt ccctcaagtg tcaattcctg
   11151  gagatcagat cacattagaa tacataaata atccacacct ggtatcacga
   11201  ttggctcttt tgccatccca gttaggaggt ctaaactacc tgtcatgcag
   11251  taggctgttc aatcgaaaca taggcgaccc ggtggtttcc gcagttgcag
   11301  atcttaagag attaattaaa tcaggatgta tggattactg gatcctttat
   11351  aacttattag ggagaaaacc gggaaacggc tcatgggcta ctttagcagc
   11401  tgacccgtac tcaatcaata tagagtatca ataccctcca actacagctc
   11451  ttaagaggca cacccaacaa gctctgatgg aactcagtac gaatccaatg
   11501  ttacgtggca tattctctga caatgcacag gcagaagaaa ataaccttgc
   11551  taggtttctc ctggataggg aggtgatctt tccgcgtgta gctcacatca
   11601  tcattgagca aaccagtgtc gggaggagaa aacagattca aggatatttg
   11651  gattcaacta gatcgataat gaggaaatca ctagaaatta agcccttatc
   11701  caataggaag cttaatgaaa tactggatta caacatcaat tacctagctt
   11751  acaatttggc attactcaag aatgctattg aacctccgac ttatttgaag
   11801  gcaatgacac ttgaaacatg tagcatcgac attgcaagga acctccggaa
   11851  gctctcctgg gccccactct tgggtgggag aaatcttgaa ggattagaga
   11901  cgccagatcc cattgaaatt actgcaggag cattaattgt tggatcgggc
   11951  tactgtgaac agtgtgctgc aggagacaat cgattcacat ggtttttctt
   12001  gccatctggt atcgagatag gaggggatcc ccgtgataat cctcctatcc
   12051  gtgtaccgta cattggctcc aggactgatg agaggagggt agcctcaatg
   12101  gcatacatca ggggtgcctc gagtagctta aaagcagttc ttagactggc
   12151  gggagtgtac atctgggcat tcggagatac tctggagaat tggatagatg
   12201  cactggattt gtctcacact agagttaaca tcacacttga acagctgcaa
   12251  tccctcaccc cacttccaac ctctgccaat ctaacccatc ggttggatga
   12301  tggcacaact accctaaagt ttactcctgc gagctcttac accttttcaa
   12351  gtttcactca tatatcaaat gatgagcaat acctgacaat taatgacaaa
   12401  actgcagatt caaatataat ctaccaacag ttaatgatca ctggactcgg
   12451  aatcttagaa acatggaata atcccccaat caatagaaca ttcgaagaat
   12501  ctaccctaca tttgcacact ggtgcatcat gttgtgtccg acctgtggac
   12551  tcctgcattc tctcagaagc attaacagtc aagccacata ttacagtacc
   12601  gtacagcaat aaatttgtat ttgatgagga cccgctatct gaatatgaaa
   12651  ctgcaaaact ggaatcgtta tcattccaag cccaattagg caacattgat
   12701  gctgtagata tgacaggtaa attaacatta ttgtcccaat tcactgcaag
   12751  gcagattatc aatgcaatca ctggactcga tgagtctgtc tctcttacta
   12801  atgatgccat tgttgcatca gactatgtct ccaattggat tagtgaatgc
   12851  atgtatacca aattagatga attatttatg tattgtgggt gggaactact
   12901  attggaacta tcctatcaaa tgtattatct gagggtagtt gggtggagta
   12951  atatagtgga ttattcttac atgatcttga gaagaatccc gggtgcagca
   13001  ttaaacaatc tggcatctac attaagtcat ccaaaacttt tccgacgagc
   13051  tatcaaccta gatatagttg cccccttaaa tgctcctcat tttgcatctc
   13101  tggactacat caagatgagt gtggatgcaa tactctgggg ctgtaaaaga
   13151  gtcatcaatg tgctctccaa tggaggggac ttagaattag ttgtgacatc
   13201  tgaagatagc cttattctca gtgaccgatc catgaatctc attgcaagga
   13251  aattaacttt attatcactg attcaccata atggtttgga actaccaaag
   13301  attaaggggt tctctcctga tgagaagtgt ttcgctttga cagaattttt
   13351  gaggaaagtg gtgaactcag ggttgagttc aatagagaac ctatcaaatt
   13401  ttatgtacaa tgtggagaac ccacggcttg cagcattcgc cagcaacaat
   13451  tactacctga ccagaaaatt attgaattca atacgagata ctgagtcggg
   13501  tcaagtagca gtcacctcat attatgaatc attagaatat attgatagtc
   13551  ttaagctaac cccacatgtg cctggcacct catgcattga ggatgatagt
   13601  ctatgtacaa atgattacat aatctggatc atagagtcta atgcaaactt
   13651  ggagaagtat ccaattccaa atagccctga ggatgattcc aatttccata
   13701  actttaagtt gaatgctcca tcgcaccata ccttacgccc attagggttg
   13751  tcatcaactg cttggtataa gggtataagc tgctgcaggt accttgagcg
   13801  attaaagcta ccacaaggtg atcatttata tattgcagaa ggtagtggtg
   13851  ccagtatgac aatcatagaa tacctattcc caggaagaaa gatatattac
   13901  aattctttat ttagtagtgg tgacaatccc ccacaaagaa attatgcacc
   13951  aatgcctact cagttcattg agagtgtccc atacaagctc tggcaagcac
   14001  acacagatca atatcccgag atttttgagg acttcatccc tctatggaac
   14051  ggaaacgccg ccatgactga cataggaatg acagcttgtg tagaattcat
   14101  catcaatcga gtcggcccaa ggacttgcag tttagtacat gtagatttgg
   14151  aatcaagtgc aagcttaaat caacaatgcc tgtcaaagcc gataattaat
   14201  gctatcatca ctgctacaac tgttttgtgc cctcatgggg tgcttattct
   14251  gaaatatagt tggttgccat ttactagatt tagtactttg atcactttct
   14301  tatggtgcta ctttgagaga atcactgttc ttaggagcac atattctgat
   14351  ccagctaatc atgaggttta tttaatttgt atccttgcca acaactttgc
   14401  attccagact gtctcgcagg caacaggaat ggcgatgact ttaactgatc
   14451  aagggtttac tttgatatca cctgaaagaa taaatcagta ttgggatggt
   14501  cacttgaagc aagaacgtat cgtagcagaa gcaattgata aggtggttct
   14551  aggagaaaat gctctattta attcgagtga taatgaatta attctcaaat
   14601  gtggagggac accaaatgca cggaatctca tcgatatcga gccagtcgca
   14651  actttcatag aatttgaaca attgatctgc acaatgttga caacccactt
   14701  gaaggaaata attgatataa caaggtctgg aacccaggat tatgaaagtt
   14751  tattactcac tccttacaat ttaggtcttc ttggtaaaat cagtacgata
   14801  gtgagattat taacagaaag gattctaaat catactatca ggaattggtt
   14851  gatcctccca ccttcgctcc ggatgatcgt gaagcaggac ttggaattcg
   14901  gcatattcag gattacttcc atcctcaatt ctgatcggtt cctgaagctt
   14951  tctccaaata ggaaatactt gattgcacaa ttaactgcag gctacattag
   15001  gaaattgatt gagggggatt gcaatatcga tctaaccaga cctatccaaa
   15051  agcaaatctg gaaagcatta ggttgtgtag tctattgtca cgatccaatg
   15101  gatcaaaggg agtcaacaga gtttattgat ataaatatta atgaagaaat
   15151  agaccgcggg atcgatggcg aggaaatcta aacatatcaa gaatcagaat
   15201  tagtttaaga aaaaagaaga ggattaatct tggttttccc cttggt