Sequence of DPV Gibbon ape leukemia virus

Gibbon ape leukemia virus gag, pol, and env genes, complete cds.

ACC No: M26927

Dated: 2000-03-04 | Length: 8088 | CRC: -1404907696

                !!NA_SEQUENCE 1.0
ID   PCGGPE     standard; genomic RNA; VRL; 8088 BP.
XX
AC   M26927;
XX
SV   M26927.1
XX
DT   25-APR-1990 (Rel. 23, Created)
DT   04-MAR-2000 (Rel. 63, Last updated, Version 3)
XX
DE   Gibbon ape leukemia virus gag, pol, and env genes, complete cds.
XX
KW   env protein; envelope-associated protein; gag protein; integrase;
KW   nucleocapsid protein; pol protein; protease; reverse transcriptase.
XX
OS   Gibbon ape leukemia virus (GALV)
OC   Viruses; Retroid viruses; Retroviridae; Gammaretrovirus.
XX
RN   [1]
RP   1-8088
RX   MEDLINE; 90051069.
RX   PUBMED; 2683360.
RA   Delassus S., Sonigo P., Wain-Hobson S.;
RT   "Genetic organization of gibbon ape leukemia virus";
RL   Virology 173(1):205-213(1989).
XX
CC   Draft entry and computer-readable copy of sequence [1] kindly
CC   submitted by S.Wain-Hobson, 08-AUG-1989.
CC   Bases 4518 to 4926 in [1] are from the SF strain of GaLV.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .8088
FT                   /db_xref="taxon:11840"
FT                   /mol_type="genomic RNA"
FT                   /organism="Gibbon ape leukemia virus"
FT   LTR             <1. .125
FT                   /note="5' long terminal repeat"
FT   repeat_region   <1. .69
FT                   /note="LTR R region"
FT   primer_bind     126. .142
FT                   /note="primer (Pro-tRNA) binding site"
FT   CDS             631. .2193
FT                   /codon_start=1
FT                   /db_xref="GOA:P21416"
FT                   /db_xref="HSSP:Q9WJP4"
FT                   /db_xref="UniProt/Swiss-Prot:P21416"
FT                   /note="gag polyprotein"
FT                   /protein_id="AAA46809.1"
FT                   /translation="MGQDNSTPISLTLNHWRDVRTRAHNLSVEIKKGKWQTFCSSEWPT
FT                   FGVGWPPEGTFNLSVIFAVKKIVFQENGGHPDQVPYIVVWQDLAQNPPPWVPASAKVAV
FT                   VSDTRRPVAGRPSAPPRPPIYPATDDLLLLSEPTPPPYPAALPPPLAPQAIGPPSGQMP
FT                   DSSDPEGPAAGTRSRRARSPADNSGPDSTVILPLRAIGPPAEPNGLVPLQYWPFSSADL
FT                   YNWKSNHPSFSENPAGLTGLLESLMFSHQPTWDDCQQLLQILFTTEERERILLEARKNV
FT                   LGDNGAPTQLENLINEAFPLNRPHWDYNTAAGRERLLVYRRTLVAGLKGAARRPTNLAK
FT                   VREVLQGPAEPPSVFLERLMEAYRRYTPFDPSSEGQQAAVAMAFIGQSAPDIKKKLQRL
FT                   EGLQDYSLQDLVKEAEKVYHKRETEEERQEREKKEAEEKERRRDRPKKKNLTKILAAVV
FT                   SREGSTGRQTGNLSNQAKKTPRDGRPPLDKDQCAYCKEKGHWARECPRKKHVREAKVLA
FT                   LDN"
FT   CDS             2194. .5691
FT                   /codon_start=1
FT                   /db_xref="GOA:P21414"
FT                   /db_xref="HSSP:P03355"
FT                   /db_xref="UniProt/Swiss-Prot:P21414"
FT                   /note="NH2 terminal uncertain"
FT                   /partial
FT                   /product="pol polyprotein"
FT                   /protein_id="AAA46810.1"
FT                   /translation="GSQGSDPLPEPRVTLTVEGTPIEFLVDTGAEHSVLTQPMGKVGSR
FT                   RTVVEGATGSKVYPWTTKRLLKIGHKQVTHSFLVIPECPAPLLGRDLLTKLKAQIQFSA
FT                   EGPQVTWGERPTMCLVLNLEEEYRLHEKPVPSSIDPSWLQLFPTVWAERAGMGLANQVP
FT                   PVVVELRSGASPVAVRQYPMSKEAREGIRPHIQKFLDLGVLVPCRSPWNTPLLPVKKPG
FT                   TNDYRPVQDLREINKRVQDIHPTVPNPYNLLSSLPPSYTWYSVLDLKDAFFCLRLHPNS
FT                   QPLFAFEWKDPEKGNTGQLTWTRLPQGFKNSPTLFDEALHRDLAPFRALNPQVVLLQYV
FT                   DDLLVAAPTYEDCKKGTQKLLQELSKLGYRVSAKKAQLCQREVTYLGYLLKEGKRWLTP
FT                   ARKATVMKIPVPTTPRQVREFLGTAGFCRLWIPGFASLAAPLYPLTKESIPFIWTEEHQ
FT                   QAFDHIKKALLSAPALALPDLTKPFTLYIDERAGVARGVLTQTLGPWRRPVAYLSKKLD
FT                   PVASGWPTCLKAVAAVALLLKDADKLTLGQNVTVIASHSLESIVRQPPDRWMTNARMTH
FT                   YQSLLLNERVSFAPPAVLNPATLLPVESEATPVHRCSEILAEETGTRRDLEDQPLPGVP
FT                   TWYTDGSSFITEGKRRAGAPIVDGKRTVWASSLPEGTSAQKAELVALTQALRLAEGKNI
FT                   NIYTDSRYAFATAHIHGAIYKQRGLLTSAGKDIKNKEEILALLEAIHLPRRVAIIHCPG
FT                   HQRGSNPVATGNRRADEAAKQAALSTRVLAGTTKPQEPIEPAQEKTRPRELTPDRGKEF
FT                   IKRLHQLTHLGPEKLLQLVNRTSLLIPNLQSAVREVTSQCQACAMTNAVTTYRETGKRQ
FT                   RGDRPGVYWEVDFTEIKPGRYGNKYLLVFIDTFSGWVEAFPTKTETALIVCKKILEEIL
FT                   PRFGIPKVLGSDNGPAFVAQVSQGLATQLGINWKLHCAYRPQSSGQVERMNRTIKETLT
FT                   KLALETGGKDWVTLLPLALLRARNTPGRFGLTPYEILYGGPPPILESGETLGPDDRFLP
FT                   VLFTHLKALEIVRTQIWDQIKEVYKPGTVTIPHPFQVGDQVLVRRHRPSSLEPRWKGPY
FT                   LVLLTTPTAVKVDGIAAWVHASHLKPAPPSAPDESWELEKTDHPLKLRIRRRRDESAK"
FT   CDS             5552. .7555
FT                   /codon_start=1
FT                   /db_xref="GOA:P21415"
FT                   /db_xref="HSSP:P03385"
FT                   /db_xref="UniProt/Swiss-Prot:P21415"
FT                   /note="env protein"
FT                   /protein_id="AAA46811.1"
FT                   /translation="MVLLPGSMLLTSNLHHLRHQMSPGSWKRLIILLSCVFGGGGTSLQ
FT                   NKNPHQPMTLTWQVLSQTGDVVWDTKAVQPPWTWWPTLKPDVCALAASLESWDIPGTDV
FT                   SSSKRVRPPDSDYTAAYKQITWGAIGCSYPRARTRMASSTFYVCPRDGRTLSEARRCGG
FT                   LESLYCKEWDCETTGTGYWLSKSSKDLITVKWDQNSEWTQKFQQCHQTGWCNPLKIDFT
FT                   DKGKLSKDWITGKTWGLRFYVSGHPGVQFTIRLKITNMPAVAVGPDLVLVEQGPPRTSL
FT                   ALPPPLPPREAPPPSLPDSNSTALATSAQTPTVRKTIVTLNTPPPTTGDRLFDLVQGAF
FT                   LTLNATNPGATESCWLCLAMGPPYYEAIASSGEVAYSTDLDRCRWGTQGKLTLTEVSGH
FT                   GLCIGKVPFTHQHLCNQTLSINSSGDHQYLLPSNHSWWACSTGLTPCLSTSVFNQTRDF
FT                   CIQVQLIPRIYYYPEEVLLQAYDNSHPRTKREAVSLTLAVLLGLGITAGIGTGSTALIK
FT                   GPIDLQQGLTSLQIAIDADLRALQDSVSKLEDSLTSLSEVVLQNRRGLDLLFLKEGGLC
FT                   AALKEECCFYIDHSGAVRDSMKKLKEKLDKRQLERQKSQNWYEGWFNNSPWFTTLLSTI
FT                   AGPLLLLLLLLILGPCIINKLVQFINDRISAC"
FT   LTR             7651. .>8088
FT                   /note="3' long terminal repeat"
FT   repeat_region   8020. .>8088
FT                   /note="LTR R region"
XX
SQ   Sequence 8088 BP; 2103 A; 2251 C; 1982 G; 1752 T; 0 other;

 M26927  Length: 8088  January 7, 2005 14:55  Type: N  Check: 5631  ..

       1  gcgccagtcc ttagagagac tgagccgccc gggtacccgt gtgtccaata
      51  aaacctcttg ctgattgcat ccggagccgt ggtctcgttg ttccttggga
     101  gggtttctcc taactattga ccgcccactt cgggggtctc acatttgggg
     151  gctcgtccgg gatcggaaac cccacccagg gaccaccgac ccaccaacgg
     201  gaggtaagct ggccagcgac cgttgtgtgt ctcgcttctg tgtctaagtc
     251  cgtaattctg actgtccttg tgtgtctcgc ttctgtgtct gagaccgtaa
     301  ctctgactgc ccttgtaagt gcgcgcattt ttttggtttc agtctgttcc
     351  gggtgaatca ctctgcgagt gacgtgtgag tagcgaacag acgtgttcgg
     401  ggctcaccgc ctggtaatcc agggagacgt cccaggatca ggggaggacc
     451  agggacgcct ggtggacccc tcggtaacgg gtcgttgtga cccgatttca
     501  tcgcccgtct ggtaagacgc gctctgaatc tgattctctc tctcggtcgc
     551  ctcgccgccg tctctggttt ctttttgttt cgtttctgga aagcctctgt
     601  gtcacagtct ttctctccca aatcatcaat atgggacaag ataattctac
     651  ccctatctcc ctcactctaa atcactggag agatgtgaga acaagggctc
     701  acaatctatc cgtggaaatc aaaaagggaa aatggcagac tttctgttcc
     751  tccgagtggc ccacattcgg cgtggggtgg ccaccggagg gaacttttaa
     801  tctctctgtc atttttgcag ttaaaaagat tgtctttcag gagaacgggg
     851  gacatccgga ccaagttcca tatatcgtgg tatggcagga cctcgcccag
     901  aatcccccac catgggtgcc agcctccgcc aaggtcgctg ttgtctctga
     951  tacccgaaga ccagttgcgg ggaggccatc agctcctccc cgacccccca
    1001  tctacccggc aacagacgac ttactcctcc tctctgaacc cacgcccccg
    1051  ccctatccgg cggcactgcc accccctctg gcccctcagg cgatcggacc
    1101  gccgtcaggc cagatgcccg atagtagcga tcctgagggg ccagccgctg
    1151  ggaccaggag tcgccgtgcc cgcagtccag cagacaactc gggtcctgac
    1201  tccactgtga ttttgcccct ccgagccata ggacccccgg ccgagcccaa
    1251  tggcctggtc cctctacaat attggccttt ttcctcagca gatctttata
    1301  attggaaatc taatcatccc tctttttctg aaaacccagc aggtctcacg
    1351  gggctccttg agtctcttat gttctcccat cagcccactt gggacgattg
    1401  ccaacagctc ctacagattc ttttcaccac tgaggaacgg gaaagaattc
    1451  tcctggaggc ccgcaaaaat gtccttgggg acaatggggc ccctacacag
    1501  ctcgagaacc tcattaatga ggccttcccc ctcaatcgac ctcactggga
    1551  ttacaacaca gccgcaggta gggagcgtct tctggtctac cgccggactc
    1601  tagtggcagg tctcaaaggg gcagctcggc gtcctaccaa tttggctaag
    1651  gtaagagagg tcttgcaggg accggcagaa cccccttcgg ttttcttaga
    1701  acgcctgatg gaggcctata ggagatacac tccgtttgat ccctcttctg
    1751  agggacaaca ggctgcggtc gccatggcct ttatcggaca gtcagcccca
    1801  gatatcaaga aaaagttaca gaggctagag gggctccagg actattcctt
    1851  acaagattta gtaaaagagg cagaaaaggt gtaccataag agagagacag
    1901  aagaagaaag acaagaaaga gaaaaaaagg aggcagaaga aaaggagagg
    1951  cggcgcgata ggccgaagaa aaaaaacttg actaaaattc tggccgcagt
    2001  agtaagtaga gaagggtcca caggtaggca gacagggaac ctgagcaacc
    2051  aggcaaagaa gacacctagg gatggaagac ctccactaga caaagaccag
    2101  tgcgcatact gtaaagagaa gggccattgg gcaagagaat gtccccgaaa
    2151  aaaacacgtc agagaagcca aggttctagc cctagataac taggggagtc
    2201  agggttcgga ccccctcccc gaacctaggg taacactgac tgtggagggg
    2251  acccccattg agttcctggt cgacaccgga gctgaacatt cagtattgac
    2301  ccaacccatg ggaaaagtag ggtccagacg gacggtcgtg gaaggagcga
    2351  caggcagcaa ggtctacccc tggaccacaa aaagactttt aaaaattgga
    2401  cataaacaag tgacccactc cttcctggtc atacccgagt gccctgctcc
    2451  tctgttgggc agggacctcc taaccaaact aaaggcccag atccagtttt
    2501  ccgctgaggg cccacaggta acatggggag aacgccctac tatgtgcctg
    2551  gtcctaaacc tggaagaaga ataccgacta catgaaaagc cagtaccctc
    2601  ctctatcgac ccatcctggc tccagctttt ccccactgta tgggcagaaa
    2651  gagccggcat gggactagcc aatcaagtcc caccagtggt agtagagcta
    2701  agatcaggtg cctcaccagt ggctgttcga caatatccaa tgagcaaaga
    2751  agctcgggaa ggtatcagac cccacatcca gaagttccta gacctagggg
    2801  tcttggtgcc ctgtcggtcg ccctggaata cccctctgct acctgtaaaa
    2851  aagccaggga ccaatgacta tcggccagtt caagacctga gagaaattaa
    2901  taaaagggta caggatattc atcccacagt cccaaaccct tacaatcttc
    2951  tgagttccct tccgcctagc tatacttggt actcagtctt agatctcaag
    3001  gatgcctttt tctgcctcag gctacatccc aacagccagc cgctgttcgc
    3051  gttcgagtgg aaagacccag aaaaaggtaa cacaggtcag ctgacctgga
    3101  cgcggctacc acaagggttc aagaactctc ccactctctt cgacgaggcc
    3151  ctccaccgag atttggctcc ctttagggcc ctcaaccccc aggtggtgtt
    3201  actccaatat gtggacgacc tcttggtggc cgcccccaca tatgaagact
    3251  gcaaaaaagg aacacagaag ctcttacagg agttaagtaa gttggggtac
    3301  cgggtatcgg ctaagaaggc ccagctctgc cagagagaag tcacctatct
    3351  ggggtaccta ctcaaggaag gaaaaagatg gctaacccca gcccgaaagg
    3401  ctactgttat gaaaatccct gttcctacga cccccagaca ggtccgtgaa
    3451  tttctaggca ctgccggatt ctgcaggctc tggatccctg ggtttgcttc
    3501  cctggctgca cccttgtacc ccctaacaaa agagagcatc ccttttattt
    3551  ggactgagga acatcagcag gcttttgacc acataaaaaa agccttgctg
    3601  tcagcccctg cattggccct cccagacctc accaagccat tcactctata
    3651  tatagatgag agagccggcg tggcccgggg agtgctcact cagactttag
    3701  gaccctggcg gcggccagta gcatatctat caaaaaaact ggatccggtg
    3751  gccagcgggt ggccaacctg cctgaaagcg gttgcagcag tagcactcct
    3801  tctcaaagac gctgataagt taaccttggg acaaaatgtg actgtgattg
    3851  cttcccatag cctcgaaagc atcgtgcggc aaccccccga ccggtggatg
    3901  accaatgcca gaatgactca ttaccagagc ctgctgttaa atgaaagggt
    3951  atcgtttgcg ccccctgctg tcctaaaccc agctacccta cttccagtcg
    4001  agtcggaagc caccccagtg cacaggtgct cagaaatcct cgccgaagaa
    4051  actggaactc gacgagacct agaagaccaa ccattgcccg gggtgccaac
    4101  ctggtataca gacggtagca gtttcatcac ggaaggtaaa cggagagcag
    4151  gggccccgat cgtagatggc aagcggacgg tatgggctag cagcctgcca
    4201  gaaggtacgt cagcccagaa ggctgaacta gtagccttga cgcaggcatt
    4251  acgcctggcc gaaggaaaaa acatcaacat ctacacggac agcaggtatg
    4301  cttttgccac tgctcatatt catggggcaa tatataagca gagggggctg
    4351  ctcacttctg ctggaaaaga tatcaaaaac aaagaggaaa ttttggccct
    4401  gctagaggcc atccatctcc ctaggcgggt cgccattatc cactgtcctg
    4451  gccaccagag gggaagtaac cctgtggcca ctgggaaccg gagggccgac
    4501  gaggctgcaa agcaagccgc cctgtcgacc agagtgctgg caggaactac
    4551  aaaacctcaa gagccaatcg agcccgctca agaaaagacc aggccgaggg
    4601  agctcacccc tgaccgggga aaagaattca ttaagcggtt acatcagtta
    4651  actcacttag gaccagaaaa gcttctccaa ctagtgaacc gtaccagcct
    4701  cctcatcccg aacctccaat ctgcagttcg cgaagtcacc agtcagtgtc
    4751  aggcttgtgc catgactaat gcggtcacca cctacagaga gaccggaaaa
    4801  aggcaacgag gagatcgacc cggcgtgtac tgggaggtag acttcacaga
    4851  aataaagcct ggtcggtatg gaaacaagta tctgttagta ttcatagata
    4901  ctttctccgg atgggtagaa gcttttccta ccaaaactga aacggcccta
    4951  atcgtctgta aaaaaatatt agaagaaatt ctaccccgct tcgggatccc
    5001  taaggtactc gggtcagaca atggcccggc ctttgttgct caggtaagtc
    5051  agggactggc cactcaactg gggataaatt ggaagttaca ttgtgcgtat
    5101  agaccccaga gctcaggtca ggtagaaaga atgaacagaa caattaaaga
    5151  gaccttgacc aaattagcct tagagaccgg tggaaaagac tgggtgaccc
    5201  tccttccctt agcgctgctt agggccagga atacccctgg ccggtttggt
    5251  ttaactcctt atgaaattct ctatggagga ccacccccca tacttgagtc
    5301  tggagaaact ttgggtcccg atgatagatt tctccctgtc ttatttactc
    5351  acttaaaggc tttagaaatt gtaaggaccc aaatctggga ccagatcaaa
    5401  gaggtgtata agcctggtac cgtaacaatc cctcacccgt tccaggtcgg
    5451  ggatcaagtg cttgtcagac gccatcgacc cagcagcctt gagcctcggt
    5501  ggaaaggccc atacctggtg ttgctgacta ccccgaccgc ggtaaaagtc
    5551  gatggtattg ctgcctgggt ccatgcttct cacctcaaac ctgcaccacc
    5601  ttcggcacca gatgagtcct gggagctgga aaagactgat catcctctta
    5651  agctgcgtat tcggcggcgg cgggacgagt ctgcaaaata agaaccccca
    5701  ccagcccatg accctcactt ggcaggtact gtcccaaact ggagacgttg
    5751  tctgggatac aaaggcagtc cagccccctt ggacttggtg gcccacactt
    5801  aaacctgatg tatgtgcctt ggcggctagt cttgagtcct gggatatccc
    5851  gggaaccgat gtctcgtcct ctaaacgagt cagacctccg gactcagact
    5901  atactgccgc ttataagcaa atcacctggg gagccatagg gtgcagctac
    5951  cctcgggcta ggactagaat ggcaagctct accttctacg tatgtccccg
    6001  ggatggccgg accctttcag aagctagaag gtgcgggggg ctagaatccc
    6051  tatactgtaa agaatgggat tgtgagacca cggggaccgg ttattggcta
    6101  tctaaatcct caaaagacct cataactgta aaatgggacc aaaatagcga
    6151  atggactcaa aaatttcaac agtgtcacca gaccggctgg tgtaaccccc
    6201  ttaaaataga tttcacagac aaaggaaaat tatccaagga ctggataacg
    6251  ggaaaaacct ggggattaag attctatgtg tctggacatc caggcgtaca
    6301  gttcaccatt cgcttaaaaa tcaccaacat gccagctgtg gcagtaggtc
    6351  ctgacctcgt ccttgtggaa caaggacctc ctagaacgtc cctcgctctc
    6401  ccacctcctc ttcccccaag ggaagcgcca ccgccatctc tccccgactc
    6451  taactccaca gccctggcga ctagtgcaca aactcccacg gtgagaaaaa
    6501  caattgttac cctaaacact ccgcctccca ccacaggcga cagacttttt
    6551  gatcttgtgc agggggcctt cctaacctta aatgctacca acccaggggc
    6601  cactgagtct tgctggcttt gtttggccat gggcccccct tattatgaag
    6651  caatagcctc atcaggagag gtcgcctact ccaccgacct tgaccggtgc
    6701  cgctggggga cccaaggaaa gctcaccctc actgaggtct caggacacgg
    6751  gttgtgcata ggaaaggtgc cctttaccca tcagcatctc tgcaatcaga
    6801  ccctatccat caattcctcc ggagaccatc agtatctgct cccctccaac
    6851  catagctggt gggcttgcag cactggcctc accccttgcc tctccacctc
    6901  agtttttaat cagactagag atttctgtat ccaggtccag ctgattcctc
    6951  gcatctatta ctatcctgaa gaagttttgt tacaggccta tgacaattct
    7001  caccccagga ctaaaagaga ggctgtctca cttaccctag ctgttttact
    7051  ggggttggga atcacggcgg gaataggtac tggttcaact gccttaatta
    7101  aaggacctat agacctccag caaggcctga caagcctcca gatcgccata
    7151  gatgctgacc tccgggccct ccaagactca gtcagcaagt tagaggactc
    7201  actgacttcc ctgtccgagg tagtgctcca aaataggaga ggccttgact
    7251  tgctgtttct aaaagaaggt ggcctctgtg cggccctaaa ggaagagtgc
    7301  tgtttttaca tagaccactc aggtgcagta cgggactcca tgaaaaaact
    7351  caaagaaaaa ctggataaaa gacagttaga gcgccagaaa agccaaaact
    7401  ggtatgaagg atggttcaat aactcccctt ggttcactac cctgctatca
    7451  accatcgctg ggcccctatt actcctcctt ctgttgctca tcctcgggcc
    7501  atgcatcatc aataagttag ttcaattcat caatgatagg ataagtgcat
    7551  gttaaaattc tggtccttag acaaaatatc aggccctaga gaacgaaggt
    7601  aacctttaat tttgctctaa gattagagct attcacaaga gaaatggggg
    7651  aatgaaagaa gtgttttttt ttagccaact gcagtaacgc cattttgcta
    7701  ggcacaccta aaggatagga aaaatacagc taagaacagg gccaaacagg
    7751  atatctgtgg tcatgcacct gggccccggc ccaggccaag gacagagggt
    7801  tcccagaaat agatgagtca acagcagttt ccagcaagga cagagggttc
    7851  ccagaaatag atgagtcaac agcagtttcc agggtgcccc tcaaccgttt
    7901  caaggactcc catgaccggg aattcacccc tggccttatt tgaactaacc
    7951  aattaccttg cctctcgctt ctgtacccgc gctttttgct ataaaataag
    8001  ctcagaaact ccacccggag cgccagtcct tagagagact gagccgcccg
    8051  ggtacccgtg tgtccaataa aacctcttgc tgattgca