Sequence of DPV Gibbon ape leukemia virus
Gibbon ape leukemia virus gag, pol, and env genes, complete cds.
ACC No: M26927
Dated: 2000-03-04 | Length: 8088 | CRC: -1404907696
!!NA_SEQUENCE 1.0
ID   PCGGPE     standard; genomic RNA; VRL; 8088 BP.
XX
AC   M26927;
XX
SV   M26927.1
XX
DT   25-APR-1990 (Rel. 23, Created)
DT   04-MAR-2000 (Rel. 63, Last updated, Version 3)
XX
DE   Gibbon ape leukemia virus gag, pol, and env genes, complete cds.
XX
KW   env protein; envelope-associated protein; gag protein; integrase;
KW   nucleocapsid protein; pol protein; protease; reverse transcriptase.
XX
OS   Gibbon ape leukemia virus (GALV)
OC   Viruses; Retroid viruses; Retroviridae; Gammaretrovirus.
XX
RN   [1]
RP   1-8088
RX   MEDLINE; 90051069.
RX   PUBMED; 2683360.
RA   Delassus S., Sonigo P., Wain-Hobson S.;
RT   "Genetic organization of gibbon ape leukemia virus";
RL   Virology 173(1):205-213(1989).
XX
CC   Draft entry and computer-readable copy of sequence [1] kindly
CC   submitted by S.Wain-Hobson, 08-AUG-1989.
CC   Bases 4518 to 4926 in [1] are from the SF strain of GaLV.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..8088
FT                   /db_xref="taxon:11840"
FT                   /mol_type="genomic RNA"
FT                   /organism="Gibbon ape leukemia virus"
FT   LTR             <1..125
FT                   /note="5' long terminal repeat"
FT   repeat_region   <1..69
FT                   /note="LTR R region"
FT   primer_bind     126..142
FT                   /note="primer (Pro-tRNA) binding site"
FT   CDS             631..2193
FT                   /codon_start=1
FT                   /db_xref="GOA:P21416"
FT                   /db_xref="HSSP:Q9WJP4"
FT                   /db_xref="UniProt/Swiss-Prot:P21416"
FT                   /note="gag polyprotein"
FT                   /protein_id="AAA46809.1"
FT                   /translation="MGQDNSTPISLTLNHWRDVRTRAHNLSVEIKKGKWQTFCSSEWPT
FT                   FGVGWPPEGTFNLSVIFAVKKIVFQENGGHPDQVPYIVVWQDLAQNPPPWVPASAKVAV
FT                   VSDTRRPVAGRPSAPPRPPIYPATDDLLLLSEPTPPPYPAALPPPLAPQAIGPPSGQMP
FT                   DSSDPEGPAAGTRSRRARSPADNSGPDSTVILPLRAIGPPAEPNGLVPLQYWPFSSADL
FT                   YNWKSNHPSFSENPAGLTGLLESLMFSHQPTWDDCQQLLQILFTTEERERILLEARKNV
FT                   LGDNGAPTQLENLINEAFPLNRPHWDYNTAAGRERLLVYRRTLVAGLKGAARRPTNLAK
FT                   VREVLQGPAEPPSVFLERLMEAYRRYTPFDPSSEGQQAAVAMAFIGQSAPDIKKKLQRL
FT                   EGLQDYSLQDLVKEAEKVYHKRETEEERQEREKKEAEEKERRRDRPKKKNLTKILAAVV
FT                   SREGSTGRQTGNLSNQAKKTPRDGRPPLDKDQCAYCKEKGHWARECPRKKHVREAKVLA
FT                   LDN"
FT   CDS             2194..5691
FT                   /codon_start=1
FT                   /db_xref="GOA:P21414"
FT                   /db_xref="HSSP:P03355"
FT                   /db_xref="UniProt/Swiss-Prot:P21414"
FT                   /note="NH2 terminal uncertain"
FT                   /partial
FT                   /product="pol polyprotein"
FT                   /protein_id="AAA46810.1"
FT                   /translation="GSQGSDPLPEPRVTLTVEGTPIEFLVDTGAEHSVLTQPMGKVGSR
FT                   RTVVEGATGSKVYPWTTKRLLKIGHKQVTHSFLVIPECPAPLLGRDLLTKLKAQIQFSA
FT                   EGPQVTWGERPTMCLVLNLEEEYRLHEKPVPSSIDPSWLQLFPTVWAERAGMGLANQVP
FT                   PVVVELRSGASPVAVRQYPMSKEAREGIRPHIQKFLDLGVLVPCRSPWNTPLLPVKKPG
FT                   TNDYRPVQDLREINKRVQDIHPTVPNPYNLLSSLPPSYTWYSVLDLKDAFFCLRLHPNS
FT                   QPLFAFEWKDPEKGNTGQLTWTRLPQGFKNSPTLFDEALHRDLAPFRALNPQVVLLQYV
FT                   DDLLVAAPTYEDCKKGTQKLLQELSKLGYRVSAKKAQLCQREVTYLGYLLKEGKRWLTP
FT                   ARKATVMKIPVPTTPRQVREFLGTAGFCRLWIPGFASLAAPLYPLTKESIPFIWTEEHQ
FT                   QAFDHIKKALLSAPALALPDLTKPFTLYIDERAGVARGVLTQTLGPWRRPVAYLSKKLD
FT                   PVASGWPTCLKAVAAVALLLKDADKLTLGQNVTVIASHSLESIVRQPPDRWMTNARMTH
FT                   YQSLLLNERVSFAPPAVLNPATLLPVESEATPVHRCSEILAEETGTRRDLEDQPLPGVP
FT                   TWYTDGSSFITEGKRRAGAPIVDGKRTVWASSLPEGTSAQKAELVALTQALRLAEGKNI
FT                   NIYTDSRYAFATAHIHGAIYKQRGLLTSAGKDIKNKEEILALLEAIHLPRRVAIIHCPG
FT                   HQRGSNPVATGNRRADEAAKQAALSTRVLAGTTKPQEPIEPAQEKTRPRELTPDRGKEF
FT                   IKRLHQLTHLGPEKLLQLVNRTSLLIPNLQSAVREVTSQCQACAMTNAVTTYRETGKRQ
FT                   RGDRPGVYWEVDFTEIKPGRYGNKYLLVFIDTFSGWVEAFPTKTETALIVCKKILEEIL
FT                   PRFGIPKVLGSDNGPAFVAQVSQGLATQLGINWKLHCAYRPQSSGQVERMNRTIKETLT
FT                   KLALETGGKDWVTLLPLALLRARNTPGRFGLTPYEILYGGPPPILESGETLGPDDRFLP
FT                   VLFTHLKALEIVRTQIWDQIKEVYKPGTVTIPHPFQVGDQVLVRRHRPSSLEPRWKGPY
FT                   LVLLTTPTAVKVDGIAAWVHASHLKPAPPSAPDESWELEKTDHPLKLRIRRRRDESAK"
FT   CDS             5552..7555
FT                   /codon_start=1
FT                   /db_xref="GOA:P21415"
FT                   /db_xref="HSSP:P03385"
FT                   /db_xref="UniProt/Swiss-Prot:P21415"
FT                   /note="env protein"
FT                   /protein_id="AAA46811.1"
FT                   /translation="MVLLPGSMLLTSNLHHLRHQMSPGSWKRLIILLSCVFGGGGTSLQ
FT                   NKNPHQPMTLTWQVLSQTGDVVWDTKAVQPPWTWWPTLKPDVCALAASLESWDIPGTDV
FT                   SSSKRVRPPDSDYTAAYKQITWGAIGCSYPRARTRMASSTFYVCPRDGRTLSEARRCGG
FT                   LESLYCKEWDCETTGTGYWLSKSSKDLITVKWDQNSEWTQKFQQCHQTGWCNPLKIDFT
FT                   DKGKLSKDWITGKTWGLRFYVSGHPGVQFTIRLKITNMPAVAVGPDLVLVEQGPPRTSL
FT                   ALPPPLPPREAPPPSLPDSNSTALATSAQTPTVRKTIVTLNTPPPTTGDRLFDLVQGAF
FT                   LTLNATNPGATESCWLCLAMGPPYYEAIASSGEVAYSTDLDRCRWGTQGKLTLTEVSGH
FT                   GLCIGKVPFTHQHLCNQTLSINSSGDHQYLLPSNHSWWACSTGLTPCLSTSVFNQTRDF
FT                   CIQVQLIPRIYYYPEEVLLQAYDNSHPRTKREAVSLTLAVLLGLGITAGIGTGSTALIK
FT                   GPIDLQQGLTSLQIAIDADLRALQDSVSKLEDSLTSLSEVVLQNRRGLDLLFLKEGGLC
FT                   AALKEECCFYIDHSGAVRDSMKKLKEKLDKRQLERQKSQNWYEGWFNNSPWFTTLLSTI
FT                   AGPLLLLLLLLILGPCIINKLVQFINDRISAC"
FT   LTR             7651..>8088
FT                   /note="3' long terminal repeat"
FT   repeat_region   8020..>8088
FT                   /note="LTR R region"
XX
SQ   Sequence 8088 BP; 2103 A; 2251 C; 1982 G; 1752 T; 0 other;
M26927  Length: 8088  January 7, 2005 14:55  Type: N  Check: 5631 ..
1 gcgccagtcc ttagagagac tgagccgccc gggtacccgt gtgtccaata 51 aaacctcttg ctgattgcat ccggagccgt ggtctcgttg ttccttggga 101 gggtttctcc taactattga ccgcccactt cgggggtctc acatttgggg 151 gctcgtccgg gatcggaaac cccacccagg gaccaccgac ccaccaacgg 201 gaggtaagct ggccagcgac cgttgtgtgt ctcgcttctg tgtctaagtc 251 cgtaattctg actgtccttg tgtgtctcgc ttctgtgtct gagaccgtaa 301 ctctgactgc ccttgtaagt gcgcgcattt ttttggtttc agtctgttcc 351 gggtgaatca ctctgcgagt gacgtgtgag tagcgaacag acgtgttcgg 401 ggctcaccgc ctggtaatcc agggagacgt cccaggatca ggggaggacc 451 agggacgcct ggtggacccc tcggtaacgg gtcgttgtga cccgatttca 501 tcgcccgtct ggtaagacgc gctctgaatc tgattctctc tctcggtcgc 551 ctcgccgccg tctctggttt ctttttgttt cgtttctgga aagcctctgt 601 gtcacagtct ttctctccca aatcatcaat atgggacaag ataattctac 651 ccctatctcc ctcactctaa atcactggag agatgtgaga acaagggctc 701 acaatctatc cgtggaaatc aaaaagggaa aatggcagac tttctgttcc 751 tccgagtggc ccacattcgg cgtggggtgg ccaccggagg gaacttttaa 801 tctctctgtc atttttgcag ttaaaaagat tgtctttcag gagaacgggg 851 gacatccgga ccaagttcca tatatcgtgg tatggcagga cctcgcccag 901 aatcccccac catgggtgcc agcctccgcc aaggtcgctg ttgtctctga 951 tacccgaaga ccagttgcgg ggaggccatc agctcctccc cgacccccca 1001 tctacccggc aacagacgac ttactcctcc tctctgaacc cacgcccccg 1051 ccctatccgg cggcactgcc accccctctg gcccctcagg cgatcggacc 1101 gccgtcaggc cagatgcccg atagtagcga tcctgagggg ccagccgctg 1151 ggaccaggag tcgccgtgcc cgcagtccag cagacaactc gggtcctgac 1201 tccactgtga ttttgcccct ccgagccata ggacccccgg ccgagcccaa 1251 tggcctggtc cctctacaat attggccttt ttcctcagca gatctttata 1301 attggaaatc taatcatccc tctttttctg aaaacccagc aggtctcacg 1351 gggctccttg agtctcttat gttctcccat cagcccactt gggacgattg 1401 ccaacagctc ctacagattc ttttcaccac tgaggaacgg gaaagaattc 1451 tcctggaggc ccgcaaaaat gtccttgggg acaatggggc ccctacacag 1501 ctcgagaacc tcattaatga ggccttcccc ctcaatcgac ctcactggga 1551 ttacaacaca gccgcaggta gggagcgtct tctggtctac cgccggactc 1601 tagtggcagg tctcaaaggg gcagctcggc gtcctaccaa tttggctaag 1651 gtaagagagg tcttgcaggg accggcagaa 
cccccttcgg ttttcttaga 1701 acgcctgatg gaggcctata ggagatacac tccgtttgat ccctcttctg 1751 agggacaaca ggctgcggtc gccatggcct ttatcggaca gtcagcccca 1801 gatatcaaga aaaagttaca gaggctagag gggctccagg actattcctt 1851 acaagattta gtaaaagagg cagaaaaggt gtaccataag agagagacag 1901 aagaagaaag acaagaaaga gaaaaaaagg aggcagaaga aaaggagagg 1951 cggcgcgata ggccgaagaa aaaaaacttg actaaaattc tggccgcagt 2001 agtaagtaga gaagggtcca caggtaggca gacagggaac ctgagcaacc 2051 aggcaaagaa gacacctagg gatggaagac ctccactaga caaagaccag 2101 tgcgcatact gtaaagagaa gggccattgg gcaagagaat gtccccgaaa 2151 aaaacacgtc agagaagcca aggttctagc cctagataac taggggagtc 2201 agggttcgga ccccctcccc gaacctaggg taacactgac tgtggagggg 2251 acccccattg agttcctggt cgacaccgga gctgaacatt cagtattgac 2301 ccaacccatg ggaaaagtag ggtccagacg gacggtcgtg gaaggagcga 2351 caggcagcaa ggtctacccc tggaccacaa aaagactttt aaaaattgga 2401 cataaacaag tgacccactc cttcctggtc atacccgagt gccctgctcc 2451 tctgttgggc agggacctcc taaccaaact aaaggcccag atccagtttt 2501 ccgctgaggg cccacaggta acatggggag aacgccctac tatgtgcctg 2551 gtcctaaacc tggaagaaga ataccgacta catgaaaagc cagtaccctc 2601 ctctatcgac ccatcctggc tccagctttt ccccactgta tgggcagaaa 2651 gagccggcat gggactagcc aatcaagtcc caccagtggt agtagagcta 2701 agatcaggtg cctcaccagt ggctgttcga caatatccaa tgagcaaaga 2751 agctcgggaa ggtatcagac cccacatcca gaagttccta gacctagggg 2801 tcttggtgcc ctgtcggtcg ccctggaata cccctctgct acctgtaaaa 2851 aagccaggga ccaatgacta tcggccagtt caagacctga gagaaattaa 2901 taaaagggta caggatattc atcccacagt cccaaaccct tacaatcttc 2951 tgagttccct tccgcctagc tatacttggt actcagtctt agatctcaag 3001 gatgcctttt tctgcctcag gctacatccc aacagccagc cgctgttcgc 3051 gttcgagtgg aaagacccag aaaaaggtaa cacaggtcag ctgacctgga 3101 cgcggctacc acaagggttc aagaactctc ccactctctt cgacgaggcc 3151 ctccaccgag atttggctcc ctttagggcc ctcaaccccc aggtggtgtt 3201 actccaatat gtggacgacc tcttggtggc cgcccccaca tatgaagact 3251 gcaaaaaagg aacacagaag ctcttacagg agttaagtaa gttggggtac 3301 cgggtatcgg ctaagaaggc ccagctctgc cagagagaag 
tcacctatct 3351 ggggtaccta ctcaaggaag gaaaaagatg gctaacccca gcccgaaagg 3401 ctactgttat gaaaatccct gttcctacga cccccagaca ggtccgtgaa 3451 tttctaggca ctgccggatt ctgcaggctc tggatccctg ggtttgcttc 3501 cctggctgca cccttgtacc ccctaacaaa agagagcatc ccttttattt 3551 ggactgagga acatcagcag gcttttgacc acataaaaaa agccttgctg 3601 tcagcccctg cattggccct cccagacctc accaagccat tcactctata 3651 tatagatgag agagccggcg tggcccgggg agtgctcact cagactttag 3701 gaccctggcg gcggccagta gcatatctat caaaaaaact ggatccggtg 3751 gccagcgggt ggccaacctg cctgaaagcg gttgcagcag tagcactcct 3801 tctcaaagac gctgataagt taaccttggg acaaaatgtg actgtgattg 3851 cttcccatag cctcgaaagc atcgtgcggc aaccccccga ccggtggatg 3901 accaatgcca gaatgactca ttaccagagc ctgctgttaa atgaaagggt 3951 atcgtttgcg ccccctgctg tcctaaaccc agctacccta cttccagtcg 4001 agtcggaagc caccccagtg cacaggtgct cagaaatcct cgccgaagaa 4051 actggaactc gacgagacct agaagaccaa ccattgcccg gggtgccaac 4101 ctggtataca gacggtagca gtttcatcac ggaaggtaaa cggagagcag 4151 gggccccgat cgtagatggc aagcggacgg tatgggctag cagcctgcca 4201 gaaggtacgt cagcccagaa ggctgaacta gtagccttga cgcaggcatt 4251 acgcctggcc gaaggaaaaa acatcaacat ctacacggac agcaggtatg 4301 cttttgccac tgctcatatt catggggcaa tatataagca gagggggctg 4351 ctcacttctg ctggaaaaga tatcaaaaac aaagaggaaa ttttggccct 4401 gctagaggcc atccatctcc ctaggcgggt cgccattatc cactgtcctg 4451 gccaccagag gggaagtaac cctgtggcca ctgggaaccg gagggccgac 4501 gaggctgcaa agcaagccgc cctgtcgacc agagtgctgg caggaactac 4551 aaaacctcaa gagccaatcg agcccgctca agaaaagacc aggccgaggg 4601 agctcacccc tgaccgggga aaagaattca ttaagcggtt acatcagtta 4651 actcacttag gaccagaaaa gcttctccaa ctagtgaacc gtaccagcct 4701 cctcatcccg aacctccaat ctgcagttcg cgaagtcacc agtcagtgtc 4751 aggcttgtgc catgactaat gcggtcacca cctacagaga gaccggaaaa 4801 aggcaacgag gagatcgacc cggcgtgtac tgggaggtag acttcacaga 4851 aataaagcct ggtcggtatg gaaacaagta tctgttagta ttcatagata 4901 ctttctccgg atgggtagaa gcttttccta ccaaaactga aacggcccta 4951 atcgtctgta aaaaaatatt agaagaaatt ctaccccgct tcgggatccc 5001 
taaggtactc gggtcagaca atggcccggc ctttgttgct caggtaagtc 5051 agggactggc cactcaactg gggataaatt ggaagttaca ttgtgcgtat 5101 agaccccaga gctcaggtca ggtagaaaga atgaacagaa caattaaaga 5151 gaccttgacc aaattagcct tagagaccgg tggaaaagac tgggtgaccc 5201 tccttccctt agcgctgctt agggccagga atacccctgg ccggtttggt 5251 ttaactcctt atgaaattct ctatggagga ccacccccca tacttgagtc 5301 tggagaaact ttgggtcccg atgatagatt tctccctgtc ttatttactc 5351 acttaaaggc tttagaaatt gtaaggaccc aaatctggga ccagatcaaa 5401 gaggtgtata agcctggtac cgtaacaatc cctcacccgt tccaggtcgg 5451 ggatcaagtg cttgtcagac gccatcgacc cagcagcctt gagcctcggt 5501 ggaaaggccc atacctggtg ttgctgacta ccccgaccgc ggtaaaagtc 5551 gatggtattg ctgcctgggt ccatgcttct cacctcaaac ctgcaccacc 5601 ttcggcacca gatgagtcct gggagctgga aaagactgat catcctctta 5651 agctgcgtat tcggcggcgg cgggacgagt ctgcaaaata agaaccccca 5701 ccagcccatg accctcactt ggcaggtact gtcccaaact ggagacgttg 5751 tctgggatac aaaggcagtc cagccccctt ggacttggtg gcccacactt 5801 aaacctgatg tatgtgcctt ggcggctagt cttgagtcct gggatatccc 5851 gggaaccgat gtctcgtcct ctaaacgagt cagacctccg gactcagact 5901 atactgccgc ttataagcaa atcacctggg gagccatagg gtgcagctac 5951 cctcgggcta ggactagaat ggcaagctct accttctacg tatgtccccg 6001 ggatggccgg accctttcag aagctagaag gtgcgggggg ctagaatccc 6051 tatactgtaa agaatgggat tgtgagacca cggggaccgg ttattggcta 6101 tctaaatcct caaaagacct cataactgta aaatgggacc aaaatagcga 6151 atggactcaa aaatttcaac agtgtcacca gaccggctgg tgtaaccccc 6201 ttaaaataga tttcacagac aaaggaaaat tatccaagga ctggataacg 6251 ggaaaaacct ggggattaag attctatgtg tctggacatc caggcgtaca 6301 gttcaccatt cgcttaaaaa tcaccaacat gccagctgtg gcagtaggtc 6351 ctgacctcgt ccttgtggaa caaggacctc ctagaacgtc cctcgctctc 6401 ccacctcctc ttcccccaag ggaagcgcca ccgccatctc tccccgactc 6451 taactccaca gccctggcga ctagtgcaca aactcccacg gtgagaaaaa 6501 caattgttac cctaaacact ccgcctccca ccacaggcga cagacttttt 6551 gatcttgtgc agggggcctt cctaacctta aatgctacca acccaggggc 6601 cactgagtct tgctggcttt gtttggccat gggcccccct tattatgaag 6651 caatagcctc 
atcaggagag gtcgcctact ccaccgacct tgaccggtgc 6701 cgctggggga cccaaggaaa gctcaccctc actgaggtct caggacacgg 6751 gttgtgcata ggaaaggtgc cctttaccca tcagcatctc tgcaatcaga 6801 ccctatccat caattcctcc ggagaccatc agtatctgct cccctccaac 6851 catagctggt gggcttgcag cactggcctc accccttgcc tctccacctc 6901 agtttttaat cagactagag atttctgtat ccaggtccag ctgattcctc 6951 gcatctatta ctatcctgaa gaagttttgt tacaggccta tgacaattct 7001 caccccagga ctaaaagaga ggctgtctca cttaccctag ctgttttact 7051 ggggttggga atcacggcgg gaataggtac tggttcaact gccttaatta 7101 aaggacctat agacctccag caaggcctga caagcctcca gatcgccata 7151 gatgctgacc tccgggccct ccaagactca gtcagcaagt tagaggactc 7201 actgacttcc ctgtccgagg tagtgctcca aaataggaga ggccttgact 7251 tgctgtttct aaaagaaggt ggcctctgtg cggccctaaa ggaagagtgc 7301 tgtttttaca tagaccactc aggtgcagta cgggactcca tgaaaaaact 7351 caaagaaaaa ctggataaaa gacagttaga gcgccagaaa agccaaaact 7401 ggtatgaagg atggttcaat aactcccctt ggttcactac cctgctatca 7451 accatcgctg ggcccctatt actcctcctt ctgttgctca tcctcgggcc 7501 atgcatcatc aataagttag ttcaattcat caatgatagg ataagtgcat 7551 gttaaaattc tggtccttag acaaaatatc aggccctaga gaacgaaggt 7601 aacctttaat tttgctctaa gattagagct attcacaaga gaaatggggg 7651 aatgaaagaa gtgttttttt ttagccaact gcagtaacgc cattttgcta 7701 ggcacaccta aaggatagga aaaatacagc taagaacagg gccaaacagg 7751 atatctgtgg tcatgcacct gggccccggc ccaggccaag gacagagggt 7801 tcccagaaat agatgagtca acagcagttt ccagcaagga cagagggttc 7851 ccagaaatag atgagtcaac agcagtttcc agggtgcccc tcaaccgttt 7901 caaggactcc catgaccggg aattcacccc tggccttatt tgaactaacc 7951 aattaccttg cctctcgctt ctgtacccgc gctttttgct ataaaataag 8001 ctcagaaact ccacccggag cgccagtcct tagagagact gagccgcccg 8051 ggtacccgtg tgtccaataa aacctcttgc tgattgca
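
The sequence block above interleaves position numbers (1, 51, 101, ...) with 10-base groups, so it cannot be used directly as a contiguous sequence. The following is a minimal Python sketch, not part of the original record, showing one way to strip the position numbers and whitespace and to tally the bases for comparison with the counts on the SQ line. The function names `parse_gcg_sequence` and `base_composition` are hypothetical, chosen for illustration only.

```python
import re
from collections import Counter


def parse_gcg_sequence(text: str) -> str:
    """Return the bare sequence from a numbered, space-grouped block.

    Assumption: everything that is not a letter (position numbers,
    spaces, line breaks) is layout and can be discarded.
    """
    return re.sub(r"[^A-Za-z]", "", text).lower()


def base_composition(seq: str) -> Counter:
    """Count how many times each base letter occurs in the sequence."""
    return Counter(seq)


if __name__ == "__main__":
    # First 60 bases of the record, in the same layout as the block above.
    sample = (
        "1 gcgccagtcc ttagagagac tgagccgccc gggtacccgt gtgtccaata "
        "51 aaacctcttg"
    )
    seq = parse_gcg_sequence(sample)
    print(len(seq))
    print(base_composition(seq))
```

Run over the full sequence block, the resulting length and per-base counts can be checked against the "8088 BP; 2103 A; 2251 C; 1982 G; 1752 T" figures given on the SQ line.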