Sequence of DPV Woolly monkey sarcoma virus
Squirrel monkey retrovirus-H, complete genome.
ACC No: M23385
Dated: 2007-05-03 | Length: 8785 | CRC: -474381652
ID   M23385; SV 1; linear; genomic RNA; STD; VRL; 8785 BP.
XX
AC   M23385;
XX
DT   06-JUL-1989 (Rel. 20, Created)
DT   03-MAY-2007 (Rel. 91, Last updated, Version 8)
XX
DE   Squirrel monkey retrovirus-H, complete genome.
XX
KW   .
XX
OS   Squirrel monkey retrovirus-H
OC   Viruses; Retro-transcribing viruses; Retroviridae; Orthoretrovirinae;
OC   Betaretrovirus.
XX
RN   [1]
RP   1-8785
RX   PUBMED; 3201749.
RA   Oda T., Ikeda S., Watanabe S., Hatsushika M., Akiyama K., Mitsunobu F.;
RT   "Molecular cloning, complete nucleotide sequence, and gene structure of the
RT   provirus genome of a retrovirus produced in a human lymphoblastoid cell
RT   line";
RL   Virology 167(2):468-476(1988).
XX
RN   [2]
RP   1-8785
RA   Oda T., Ikeda S., Watanabe S., Hatsushika M., Akiyama K., Mitsunobu F.;
RT   ;
RL   Submitted (24-MAR-1989) to the EMBL/GenBank/DDBJ databases.
RL   Department of Biochemistry, Okayama University Medical School, Japan
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..8785
FT                   /organism="Squirrel monkey retrovirus-H"
FT                   /mol_type="genomic RNA"
FT                   /isolation_source="passed in human LMB cell line"
FT                   /note="SMRV-HLB; SMRV-H"
FT                   /db_xref="taxon:11857"
FT   polyA_signal    332..337
FT   CDS             645..2867
FT                   /codon_start=1
FT                   /product="gag protein"
FT                   /note="Base 645 is the position of the first start codon in
FT                   the ORF; putative"
FT                   /db_xref="GOA:P21411"
FT                   /db_xref="InterPro:IPR000721"
FT                   /db_xref="InterPro:IPR001878"
FT                   /db_xref="InterPro:IPR003322"
FT                   /db_xref="InterPro:IPR008916"
FT                   /db_xref="InterPro:IPR008919"
FT                   /db_xref="InterPro:IPR010999"
FT                   /db_xref="UniProtKB/Swiss-Prot:P21411"
FT                   /protein_id="AAA66451.1"
FT                   /translation="MGQASSHSENDLFISHLKESLKVRRIRVRKKDLVSFFSFIFKTCP
FT                   WFPQEGSIDSRVWGRVGDCLNDYYRVFGPETIPITTFNYYNLIRDVLTNQSDSPDIQRL
FT                   CKEGHKILISHSRPPSRQAPVTITTSEKASSRPPSRAPSTCPSVAIDIGSHDTGQSSLY
FT                   PNLATLTDPPIQSPHSRAHTPPQHLPLLANSKTLHNSGSQDDQLNPADQADLEEAAAQY
FT                   NNPDWPQLTNTPALPPFRPPSYVSTAVPPVAVAAPVLHAPTSGVPGSPTAPNLPGVALA
FT                   KPSGPIDETVSLLDGVKTLVTKLSDLALLPPAGVMAFPVTRSQGQVSSNTTGRASPHPD
FT                   THTIPEEEEADSGESDSEDDEEESSEPTEPTYTHSYKRLNLKTIEKIKTAVANYGPTAP
FT                   FTVALVESLSERWLTPSDWFFLSRAALSGGDNILWKSEYEDISKQFAERTRVRPPPKDG
FT                   PLKIPGASPYQNNDKQAQFPPGLLTQIQSAGLKAWKRLPQKGAATTSLAKIRQGPDESY
FT                   SDFVSRLQETADRLFGSGESESSFVKHLAYENANPACQSAIRPFRQKELSTMSPLLWYC
FT                   SAHAVGLAIGAALQNLAPAQLLEPRPAFAIIVTNPAIFQETAPKKIQPPTQLPTQPNAP
FT                   QASLIKNLGPTTKCPRCKKGFHWASECRSRLDINGQPIIKQGNLNRGQPQGPTTGMNSG
FT                   ASQFTPQYRQPTPALPVINHAATSQTSGEQQRAVQDWTSVPPPTQY"
FT   CDS             3368..3640
FT                   /codon_start=1
FT                   /product="aspartyl protease"
FT                   /note="Base 3368 is the position of the first start codon
FT                   in the ORF; putative"
FT                   /db_xref="GOA:P21407"
FT                   /db_xref="UniProtKB/Swiss-Prot:P21407"
FT                   /protein_id="AAA66452.1"
FT                   /translation="MLKWEDSEGNNGHITPYVLPNLPVNLWGRDILSQMKLVMCSPNDT
FT                   VMTQMLSQGYLPGQGLGKNNQGITQPITITPKKDKTGLGFHQNLP"
FT   CDS             3888..6284
FT                   /codon_start=1
FT                   /product="pol protein"
FT                   /note="Base 3888 is the position of the first start codon
FT                   in the ORF; putative"
FT                   /db_xref="GOA:P03364"
FT                   /db_xref="UniProtKB/Swiss-Prot:P03364"
FT                   /protein_id="AAA66453.1"
FT                   /translation="MVPMGALQPGLPSPVAIPLNYHKIVIDLKDCFFTIPLHPEDRPYF
FT                   AFSVPQINFQSPMPRYQWKVLPQGMANSPTLCQKFVAAAIAPVRSQWPEAYILHYMDDI
FT                   LLACDSAEAAKACYAHIISCLTSYGLKIAPDKVQVSEPFSYLGFELHHQQVFTPRVCLK
FT                   TDHLKTLNDFQKLLGDIQWLRPYLKLPTSALVPLNNILKGDPNPLSVRALTPEAKQSLA
FT                   LINKAIQNQSVQQISYNLPLVLLLLPTPHTPTAVFWQPNGTDPTKNGSPLLWLHLPASP
FT                   SKVLLTYPSLLAMLIIKGRYTGRQLFGRDPHSIIIPYTQDQLTWLLQTSDEWAIALSSF
FT                   TGDIDNHYPSDPVIQFAKLHQFIFPKITKCAPIPQATLVFTDGSSNGIAAYVIDNQPIS
FT                   IKSPYLSAQLVELYAILQVFTVLAHQPFNLYTDSAYIAQSVPLLETVPFIKSSTNATPL
FT                   FSKLQQLILNRQHPFFIGHLRAHLNLPGPLAEGNALADAATQIFPIISDPIHEATQAHT
FT                   LHHLNAHTLRLLYKITREQARDIVKACKQCVVATPVPHLGVNPRGLVPNAIWQMDVTHF
FT                   TPFGKQRFVHVTVDTFSGFILATPQTGEASKNVISHVIHCLATIGKPHTIKTDNGPGYT
FT                   GKNFQDFCQKLQIKHVTGIPYNPQGQGVVERAHQTLKNALNRLARSPLGFSMQQPRNLL
FT                   SHALFQLNFLQLDSQGRSAADRLWHPQTSQQHATVMWRDPLTSVWKGPDPVLIWGRGSA
FT                   CIYDQKEDGPRWLPERLIRHINNQTAPLCDRPSNPNTAPGPKGSP"
FT   CDS             6314..8041
FT                   /codon_start=1
FT                   /product="envelope glycoprotein"
FT                   /note="Base 6314 is the position of the first start codon
FT                   in the ORF; putative"
FT                   /db_xref="GOA:P21412"
FT                   /db_xref="InterPro:IPR002050"
FT                   /db_xref="UniProtKB/Swiss-Prot:P21412"
FT                   /protein_id="AAA66454.1"
FT                   /translation="MLCILILLLHPRLCPVTKGGLGKPSGDIYTALFGAPCDCKGGTQT
FT                   NNYATPTYTQVTDCGDKNAYLTYDTNWNGVSSPKWLCVRKPPSIPVINGRPGPCPSECT
FT                   NNIKSQMHSSCYSSFSQCTQGNNTYFTAILQRTKSTSETNPVTSGLQPHGVLQAGCDGT
FT                   VGKSVCWNQQAPIHVSDGGGPQDAVRELYVQKQIELVIQSQFPKLSYHPLARSKPRGPD
FT                   IDAQMLDILSATHQALNISNPSLAQNCWLCLNQGTSMPLAFPVNISSFNASQNNCTPSL
FT                   PFRVQPMPSQVYPCFFKGAQNNSFDIPVGVANFVNCSSSSNHSEALCPGPGQAFVCGNN
FT                   LAFTALPANWTGSCVLAALLPDIDIISGDDPVPIPTFDYIAGRQKRAVTLIPLLVGLGV
FT                   STAVATGTAGLGVAVQSYTKLSHQLINDVQALSSTINDLQDQLDSLAEVVLQNRRGLDL
FT                   LTAEQGGICLALQERCCFYANKSGIVRDKIKNLQEDLEKRRKALADNLFLTGLNGLLPY
FT                   LLPFLGPLFAIILFFSFAPWILRRVTALIRDQLNSLLGKPIQIHYHQLATRDLEYGRL"
FT   mat_peptide     6314..7471
FT                   /product="outer membrane protein"
FT   mat_peptide     7472..8038
FT                   /product="transmembrane protein"
FT   LTR             8661..8666
FT                   /note="3' long terminal repeat"
FT   polyA_signal    8661..8666
XX
SQ   Sequence 8785 BP; 2298 A; 2620 C; 1734 G; 2133 T; 0 other;
m23385  Length: 8785  03-MAY-2007  Type: N  Check: 4659  ..
1 tgttgggaac ccaggctaag ctgatgctat tggaacaaaa ggttgcatca 51 gccctccccg ccaaaagcat gccccgggcg tggtggcggg ccaccaatgg 101 aggacctgat cacgggcaag acatgcctca gggccaccaa tggaggacct 151 gatcacgggc aagacatgcc tcagggccac caataaagga cctgatcacg 201 cacagaacat gcatggctgc accaatgggg tagctgatca tgagctaaac 251 actcgtctcc cagcctatca gaactacttc ccctttcccc tctctacccg 301 ctcccctccc tatataagga acccattttg aaataaactt tgcagcttga 351 tcagaacttt tgtcttgctg ccattcttcg cgcctcttgt cccatccctt 401 tctcatccac gacaggcttg ctcgggttcc tgtttgttgc tctgcgggac 451 agagcaagtg gcgcccagga cgtggggctc gatgccggcc tccgtggacc 501 gccgtcccct gtaaccggtt ccctgcacgg ccgtcccgat taaccgattc 551 cccgcacgga gcaccgcgga ccacccgacc gcgagccgac tcctggagtt 601 cgttcctcat ttcgacggcg gcattactca agtaagaccc aatcatggga 651 caagcatctt cacacagtga aaatgatctc tttataagtc acttaaagga 701 atctctcaag gtgcgtagaa ttcgggttcg caagaaagac cttgtctcct 751 tttttagttt catttttaaa acatgtccat ggttccccca ggaagggtct 801 attgactccc gtgtttgggg acgtgttggt gattgtctga acgactatta 851 ccgtgttttt ggtcctgaga ctattccgat caccactttt aattattata 901 atttaataag ggacgtcctt actaatcaga gcgactcccc tgacattcaa 951 cgcctctgca aggagggtca caaaattctt attagccact cccgacctcc 1001 atctagacaa gcccctgtaa caattaccac ctctgaaaag gcctcctctc 1051 gccctccctc tagggcccct tctacctgcc cctcggttgc aattgacatt 1101 ggttcacatg atacagggca gtcttcactg taccccaacc ttgcaaccct 1151 tacggacccc cccattcaaa gtcctcattc tcgggcgcat actccgcccc 1201 aacatttgcc cttgcttgct aattctaaaa ccttgcataa ctcgggtagt 1251 caggatgatc aactaaatcc cgccgatcaa gcagatctgg aagaggcagc 1301 tgcccaatat aacaaccctg attggcccca attaactaac accccggcat 1351 tgccaccttt ccggccaccg tcctatgttt ctacagcagt gcccccagtg 1401 gcagttgcgg ctcctgtttt gcatgcccct acttcaggcg ttcctggttc 1451 ccccacggcc ccaaacttgc ccggtgtagc cctagccaaa ccctccggtc 1501 ccattgatga gactgtttct cttcttgatg gggttaaaac cttagtcaca 1551 aaactgtccg atttggccct tctacctccc gcgggagtta tggcttttcc 1601 cgttaccaga agtcagggac aggttagctc caataccacg ggccgagcgt 1651 ctcctcaccc tgacacacac accatccctg 
aggaggagga agcagactcc 1701 ggagaatctg actcagagga tgacgaggag gaaagctcag agcccaccga 1751 gcctacctac acccattcct ataagcggct aaatctaaag accatagaaa 1801 aaattaaaac tgctgttgct aactatggtc ctactgcccc ctttaccgtg 1851 gcccttgtag agagtcttag tgaaagatgg cttaccccta gtgattggtt 1901 tttcttgtct cgtgctgcgc tgagcggagg ggacaatatc ctttggaagt 1951 ctgagtatga ggatatttcc aaacagtttg cagagcgaac gcgcgtaagg 2001 cctcctccaa aggatggacc cttaaaaatt cctggcgcca gcccttatca 2051 gaacaatgac aaacaggccc aattcccccc agggctttta acccagattc 2101 agtccgcagg cctaaaagcc tggaagcgac tccctcaaaa gggagcggct 2151 actacttccc ttgcaaagat tagacaaggc cccgatgagt catacagtga 2201 ttttgtaagc cgcctccagg agacggcaga tcgccttttt ggctccgggg 2251 aaagtgagag ctcctttgta aaacacctag cctatgaaaa cgctaacccc 2301 gcttgccaaa gtgcaattcg gccttttagg cagaaggagc tttcgactat 2351 gtcgcctctg ctctggtatt gctctgccca tgctgttggc ctagccatag 2401 gagctgccct ccaaaatctt gcccccgcgc aactcctgga gcccaggccc 2451 gcctttgcta taattgtcac caacccggcc atctttcaag aaactgcccc 2501 caaaaaaata caaccaccta ctcaactccc aactcaacct aatgccccac 2551 aggctagcct tataaaaaat ttaggtccca caacaaaatg tcctcgctgc 2601 aaaaaaggat ttcactgggc ttcagaatgc cgttctcgat tagacattaa 2651 tggacaaccc attattaagc agggaaactt gaacaggggc cagccccagg 2701 gccccactac cgggatgaac tccggggctt cacagttcac cccccaatac 2751 cgccagccaa cccctgccct cccagtaatc aaccacgccg ctacgtcaca 2801 gacctctggc gagcaacagc gggcagtgca ggactggacc tctgtaccac 2851 caccgacaca atactaacca cccaaaatag ccctctgaca cttccagttg 2901 gaatatatgg acccttacca ccccagacat tcggcctcat attagcagag 2951 ccagctctac cctccaaggg gatccaagtt ctgcccggca tattagacaa 3001 tgattttgag ggagaaatcc atatcattct ctctacaact aaagatttag 3051 tcaccatccc aaagggcacc agactagctc aaatagtcat tctccccctc 3101 caacaaatta actccaattt ccataagccc taccgcgggg ctagtgcccc 3151 tgggtcttct gatgtctact gggttcaaca aatttctcaa cagcggccta 3201 ccctgaaact taaattaaat ggtaagctct tttctggcat tcttgataca 3251 ggggccgatg ccaccgttat atcttacact cactggccga ggaactggcc 3301 gttaacaacc gttgctactc acctgcgcgg tattggccag 
gccaccaacc 3351 cccaacaaag tgctcaaatg cttaagtggg aggactctga aggcaataat 3401 ggtcacatta ccccttatgt cctccccaat ctgccagtca atctctgggg 3451 aagggacatc ctctctcaaa tgaaacttgt catgtgcagt cccaacgata 3501 ctgtcatgac ccaaatgcta agccaggggt atctccccgg ccaagggttg 3551 ggaaaaaata atcaaggaat cacccagccc attactatta cccccaaaaa 3601 agacaaaaca ggcctaggat tccaccaaaa tttaccgtag tcgtgccatt 3651 gacattcctg taccccacgc tgacaaaatt tcctggaaaa ttacagaccc 3701 tgtgtgggtt gatcagtggc cacttacata tgagaaaacc ctcgctgcca 3751 ttgcgttagt acaggaacag ctcgcagcag gacatattga gcccacaaat 3801 tctccatgga atactcctat attcatcatt aagaaaaaat caggtagctg 3851 gcgtctttta caggatctaa gagccgttaa taaggtaatg gtccccatgg 3901 gagcccttca gcctggtctt ccctctcctg tagccatccc cctaaactat 3951 cacaaaattg ttattgacct taaggattgt ttctttacca tccccttaca 4001 ccctgaagac agaccttact ttgcctttag cgtccctcaa atcaacttcc 4051 aaagtcctat gcctcgttat cagtggaagg ttctgccaca gggcatggcc 4101 aacagtccca cactgtgcca aaaatttgtt gctgccgcca ttgccccagt 4151 aagatcccag tggccagagg cctatatcct ccattatatg gatgacatcc 4201 ttcttgcttg tgacagcgcc gaggcagcca aggcctgcta tgctcacatt 4251 atatcctgtc ttacctcata tggactaaaa attgctccag acaaggtaca 4301 agtgtctgag ccattttctt atttaggatt tgagttacac catcagcaag 4351 tatttactcc ccgagtctgc ttaaaaactg atcacttaaa aacccttaac 4401 gatttccaaa aattactcgg ggacattcag tggcttcgac cctatttaaa 4451 attgcccacc agtgcccttg ttccccttaa caatattcta aaaggcgatc 4501 caaatccttt atcggttcga gcactgaccc cagaggcaaa gcaatctcta 4551 gccctcatca acaaggctat ccaaaatcaa agtgttcaac aaatttcgta 4601 taaccttccc ctagtactcc tcttgctccc aactccccat acacccaccg 4651 cggtgttttg gcaaccaaac ggtacagacc ctacaaaaaa cggaagcccc 4701 ctcctttggc tccatctacc tgcctcccca tcaaaagtct tactcaccta 4751 cccctcgctc ctcgccatgt taattattaa gggtcggtac actggccgcc 4801 aactgtttgg cagggacccc cactctataa tcattccata cacccaggac 4851 caattaacct ggctcctgca aacctctgac gaatgggcca ttgcattatc 4901 ctccttcaca ggagacatag acaatcatta ccccagtgac cctgttatcc 4951 aatttgccaa gcttcaccag ttcatattcc ccaagatcac aaaatgtgcc 5001 
ccaattcctc aagccacgct agttttcact gatggatcct caaacggaat 5051 tgctgcatat gttattgata atcaacccat ctcaataaaa tccccctacc 5101 tgtcagctca acttgttgag ctctatgcta ttctccaggt gttcacagtt 5151 ctagctcacc aaccgtttaa cttgtacact gacagtgcgt atattgctca 5201 atcagtccct cttttggaga cagtcccctt tatcaaatcc tcaaccaatg 5251 ctaccccctt attttctaaa ctgcaacagc taattttaaa cagacaacac 5301 cctttcttta tcggacatct tcgggcccac ctaaatcttc caggacccct 5351 ggctgaaggc aatgccttag ctgatgctgc cacacagatt ttccccatta 5401 taagtgaccc aatacatgag gctactcaag ctcacaccct acatcacctc 5451 aatgcacaca ccctacgatt actctataaa attactagag aacaagccag 5501 agatattgta aaagcttgca aacagtgtgt cgtagccacc cctgtacccc 5551 atcttggcgt gaacccccgt ggtttagtcc ccaatgccat ttggcaaatg 5601 gatgtcactc attttactcc ttttggaaaa cagaggtttg ttcatgttac 5651 tgttgacaca tttagtggtt ttatcttagc cactccccaa acaggtgaag 5701 catcaaaaaa tgttatatct catgttatcc actgtcttgc taccatagga 5751 aaaccacaca ccattaaaac agacaatggc ccgggatata ctggaaaaaa 5801 cttccaagac ttttgccaaa aactccaaat caaacatgtt actggtatac 5851 cgtacaaccc ccagggtcaa ggagtagttg aacgagctca tcaaacatta 5901 aaaaatgccc taaatcgctt agcccgctcc ccccttgggt tttctatgca 5951 acaacccaga aaccttctta gtcatgccct atttcaacta aattttctac 6001 agcttgacag tcaagggcgc tcggcagctg accgtctatg gcatccccaa 6051 acttctcagc agcatgctac ggttatgtgg cgtgaccctc tcaccagtgt 6101 ttggaagggc cctgaccctg tcctcatatg ggggcgaggc tcagcctgca 6151 tatacgatca aaaggaggat ggcccccgct ggctccctga gcgactaatt 6201 agacacatca ataatcagac agcccccttg tgtgacaggc caagtaaccc 6251 aaatacagcc ccagggccaa aaggctcgcc ctgaggagct ccttttctct 6301 tcttccagga agaatgctct gcatcctcat cctcctactg cacccacgcc 6351 tctgcccagt cacaaaggga ggacttggaa agccatccgg agacatttac 6401 actgccctct ttggagcgcc atgtgactgt aaagggggga ctcagaccaa 6451 taattacgcc accccaactt acactcaggt aacagattgt ggggacaaaa 6501 atgcctatct tacctatgac accaattgga atggagtatc ttcacctaag 6551 tggctttgtg tgcgcaagcc tcctagtata ccggtcatta atggccgccc 6601 aggcccgtgc ccaagcgagt gcacaaacaa cattaaatcc cagatgcact 6651 cctcctgcta 
ttctagtttc tcacagtgta ctcaaggcaa taatacttat 6701 tttactgcca ttctacaaag aacaaagagc acctcagaaa ccaatcctgt 6751 caccagcggc ctacaacctc atggggtcct ccaggccgga tgcgatggca 6801 cggttggaaa atcggtttgt tggaatcagc aagcccctat tcacgtctcc 6851 gacggtggcg gaccccaaga tgctgtgaga gagctttatg tacaaaaaca 6901 aatagagctt gttattcaaa gccaattccc taagttatcc taccaccccc 6951 tagctcgctc aaaaccaaga ggacctgaca ttgatgcaca aatgcttgat 7001 attctgtcag ccacccacca ggccctcaat atctccaacc ccagcctagc 7051 ccaaaattgc tggttatgct taaatcaagg tacctccatg cccctagcct 7101 tccctgtcaa tatatctagt tttaatgcct cacaaaataa ttgcaccccc 7151 agcttaccct ttagagtcca gcccatgcct tcccaagtat acccttgctt 7201 ctttaaaggt gcacaaaaca acagctttga tattccggtt ggcgttgcca 7251 actttgtaaa ctgctccagt agttccaacc acagtgaggc cctttgccct 7301 ggcccaggcc aagcttttgt ttgcggcaac aacctcgcct ttactgctct 7351 gcctgcaaac tggacagggt catgtgtgtt agccgccctc ctgccagata 7401 tagacattat ttctggtgat gaccctgtcc ctatccctac ctttgactat 7451 attgcagggc ggcagaaacg agccgttaca ctgattcccc tgctagtagg 7501 attgggtgtc tctacagcag tcgctaccgg tacagcagga ctcggggtgg 7551 ctgttcaatc ttacacaaaa ctttcccatc aacttattaa cgacgtccaa 7601 gccttgtcta gcaccattaa tgacttacag gaccaactag attccctagc 7651 cgaagtagtc ctccaaaaca gaagaggctt agacctactc actgcagaac 7701 agggaggtat ctgtttggct ctacaggaac gttgctgctt ttatgccaac 7751 aagtcaggaa ttgtccgaga taaaataaaa aatctacaag aagacctcga 7801 aaaaagacgc aaggcacttg cagacaatct cttcctcacc ggcctcaatg 7851 gacttctccc ttacctcctc cccttccttg gacccttatt cgctatcatc 7901 ctgttcttct cttttgcccc ttggatccta agacgagtaa cagcgttaat 7951 cagggatcag ctcaattccc tactgggaaa gcccatacaa atccactatc 8001 accaactagc aacgcgtgat ctagaatatg gcagactgta gccggttccc 8051 ctcctacggg agcagcatac cgctcgacac tatgctttac gaaggtaatg 8101 gacaccgcta ggtgcaaggc aaggcactgc aaggagaggc cttactaagg 8151 ctactgtcga gtctcctgag aggtaagctg gcttgcatag aggttggtac 8201 tcgaaaaatc ctctcctccc aaaaaggtac ctgtaagcct gaaaattaag 8251 gctcaggagg agcacagcct ctacctcccc tagctggtta aggtccgcct 8301 cctctttttt taaagaaaaa 
gggaggagat gttgggaacc caggctaagc 8351 tgatgctatt ggaacaaaag gttgcatcag ccctccccgc caaaagcatg 8401 ccccgggcgt ggtggcgggc caccaatgga ggacctgatc acgggcaaga 8451 catgcctcag ggccaccaat ggaggacctg atcacgggca agacatgcct 8501 cagggccacc aataaaggac ctgatcacgc acagaacatg catggctgca 8551 ccaatggggt agctgatcat gagctaaaca ctcgtctccc agcctatcag 8601 aactacttcc cctttcccct ctctacccgc tcccctccct atataaggaa 8651 cccattttga aataaacttt gcagcttgat cagaactttt gtcttgctgc 8701 cattcttcgc gcctcttgtc ccatcccttt ctcatccacg acaggcttgc 8751 tcgggttcct gtttgttgct ctgcgggaca gagca
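
The coordinates and base counts quoted in this record can be sanity-checked with a few lines of arithmetic. The following is a minimal Python sketch and is not part of the database entry: the names `CDS_FEATURES`, `BASE_COUNTS`, `span_length` and `codon_summary` are illustrative, and the numbers are copied by hand from the FT and SQ lines of the record (locations are 1-based and inclusive).

```python
# Hypothetical helper names; values are hand-copied from the FT/SQ lines above.
CDS_FEATURES = {
    "gag protein": (645, 2867),
    "aspartyl protease": (3368, 3640),
    "pol protein": (3888, 6284),
    "envelope glycoprotein": (6314, 8041),
}
BASE_COUNTS = {"A": 2298, "C": 2620, "G": 1734, "T": 2133}
RECORD_LENGTH = 8785


def span_length(start, end):
    """Length of a 1-based, inclusive location such as 645..2867."""
    return end - start + 1


def codon_summary(features):
    """For each CDS, report its length, whole codons, and whether it is in frame."""
    summary = {}
    for name, (start, end) in features.items():
        length = span_length(start, end)
        codons, remainder = divmod(length, 3)
        summary[name] = {
            "length": length,
            "codons": codons,          # includes the terminal stop codon, if any
            "in_frame": remainder == 0,
        }
    return summary


summary = codon_summary(CDS_FEATURES)
total_bases = sum(BASE_COUNTS.values())

for name, info in summary.items():
    print(f"{name}: {info['length']} nt, {info['codons']} codons, "
          f"in frame: {info['in_frame']}")
print("base counts sum to record length:", total_bases == RECORD_LENGTH)
```

Under these assumptions every CDS span is a whole number of codons. The gag span, for example, is 2223 nt (741 codons), which is consistent with a 740-residue translation followed by a stop codon.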