WO1990006369A1 - Use of hiv protease cleavage site to express mature proteins - Google Patents
Use of hiv protease cleavage site to express mature proteins Download PDFInfo
- Publication number
- WO1990006369A1 WO1990006369A1 PCT/US1989/005009 US8905009W WO9006369A1 WO 1990006369 A1 WO1990006369 A1 WO 1990006369A1 US 8905009 W US8905009 W US 8905009W WO 9006369 A1 WO9006369 A1 WO 9006369A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- protease
- cleavage site
- nucleotide sequence
- fusion protein
- sequence
- Prior art date
Links
- 108091005804 Peptidases Proteins 0.000 title claims abstract description 170
- 239000004365 Protease Substances 0.000 title claims abstract description 168
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 title claims abstract description 161
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 134
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 106
- 238000003776 cleavage reaction Methods 0.000 title claims abstract description 102
- 230000007017 scission Effects 0.000 title claims abstract description 100
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 86
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 85
- 239000002773 nucleotide Substances 0.000 claims abstract description 70
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 70
- 238000000034 method Methods 0.000 claims abstract description 65
- 230000014509 gene expression Effects 0.000 claims abstract description 48
- 239000013604 expression vector Substances 0.000 claims abstract description 45
- 108020004511 Recombinant DNA Proteins 0.000 claims abstract description 43
- 239000012634 fragment Substances 0.000 claims abstract description 43
- 150000001413 amino acids Chemical class 0.000 claims abstract description 29
- 108091026890 Coding region Proteins 0.000 claims abstract description 23
- 238000004519 manufacturing process Methods 0.000 claims abstract description 19
- 238000012258 culturing Methods 0.000 claims abstract 4
- 235000018102 proteins Nutrition 0.000 claims description 97
- 241000588724 Escherichia coli Species 0.000 claims description 50
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 32
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 28
- 229940024606 amino acid Drugs 0.000 claims description 25
- 235000001014 amino acid Nutrition 0.000 claims description 25
- 241000700605 Viruses Species 0.000 claims description 24
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims description 22
- 239000013612 plasmid Substances 0.000 claims description 22
- 102000053602 DNA Human genes 0.000 claims description 21
- 108010016183 Human immunodeficiency virus 1 p16 protease Proteins 0.000 claims description 21
- 150000007523 nucleic acids Chemical group 0.000 claims description 21
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 claims description 20
- 241000713772 Human immunodeficiency virus 1 Species 0.000 claims description 17
- 206010061598 Immunodeficiency Diseases 0.000 claims description 17
- 208000029462 Immunodeficiency disease Diseases 0.000 claims description 17
- 230000007813 immunodeficiency Effects 0.000 claims description 17
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 16
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 16
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 16
- 230000001177 retroviral effect Effects 0.000 claims description 16
- 108020004705 Codon Proteins 0.000 claims description 14
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims description 13
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims description 12
- 108700023157 Galactokinases Proteins 0.000 claims description 11
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 claims description 11
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 11
- 229960000310 isoleucine Drugs 0.000 claims description 11
- 230000002797 proteolythic effect Effects 0.000 claims description 11
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 claims description 10
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 claims description 10
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 claims description 10
- 239000004471 Glycine Substances 0.000 claims description 9
- 229960001230 asparagine Drugs 0.000 claims description 9
- 230000000295 complement effect Effects 0.000 claims description 9
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims description 8
- 239000004473 Threonine Substances 0.000 claims description 8
- 229930182817 methionine Natural products 0.000 claims description 8
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 claims description 8
- 241001430294 unidentified retrovirus Species 0.000 claims description 8
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 claims description 7
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 7
- 238000007792 addition Methods 0.000 claims description 7
- 230000035772 mutation Effects 0.000 claims description 7
- 238000012217 deletion Methods 0.000 claims description 6
- 230000037430 deletion Effects 0.000 claims description 6
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 claims description 5
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims description 3
- 235000004279 alanine Nutrition 0.000 claims description 3
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 claims 8
- 235000009582 asparagine Nutrition 0.000 claims 8
- 101150105088 Dele1 gene Proteins 0.000 claims 1
- 201000010099 disease Diseases 0.000 claims 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims 1
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 claims 1
- 210000004027 cell Anatomy 0.000 description 55
- 238000012545 processing Methods 0.000 description 19
- 239000013598 vector Substances 0.000 description 17
- 101100137549 Arabidopsis thaliana PRF5 gene Proteins 0.000 description 15
- 101150054426 PRO4 gene Proteins 0.000 description 15
- 239000002243 precursor Substances 0.000 description 15
- 108010010369 HIV Protease Proteins 0.000 description 13
- 241000725303 Human immunodeficiency virus Species 0.000 description 13
- 101150045500 galK gene Proteins 0.000 description 12
- 238000013519 translation Methods 0.000 description 12
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 11
- 108020004414 DNA Proteins 0.000 description 10
- 230000006698 induction Effects 0.000 description 10
- 102000035195 Peptidases Human genes 0.000 description 9
- 102100036352 Protein disulfide-isomerase Human genes 0.000 description 8
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 8
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 8
- 230000001580 bacterial effect Effects 0.000 description 8
- 238000010276 construction Methods 0.000 description 8
- 230000004927 fusion Effects 0.000 description 8
- 238000003780 insertion Methods 0.000 description 8
- 230000037431 insertion Effects 0.000 description 8
- 108091008146 restriction endonucleases Proteins 0.000 description 8
- 101100137548 Arabidopsis thaliana PRF4 gene Proteins 0.000 description 7
- 101710147327 Calcineurin B homologous protein 1 Proteins 0.000 description 7
- 101710205625 Capsid protein p24 Proteins 0.000 description 7
- 101710177291 Gag polyprotein Proteins 0.000 description 7
- 101150066369 PRO3 gene Proteins 0.000 description 7
- 101710177166 Phosphoprotein Proteins 0.000 description 7
- 101710149279 Small delta antigen Proteins 0.000 description 7
- 102100022563 Tubulin polymerization-promoting protein Human genes 0.000 description 7
- 229920001184 polypeptide Polymers 0.000 description 7
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 6
- 102100021519 Hemoglobin subunit beta Human genes 0.000 description 6
- 108091005904 Hemoglobin subunit beta Proteins 0.000 description 6
- 102100034347 Integrase Human genes 0.000 description 6
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 6
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 6
- 238000007796 conventional method Methods 0.000 description 6
- 230000029087 digestion Effects 0.000 description 6
- 238000011144 upstream manufacturing Methods 0.000 description 6
- 101710125418 Major capsid protein Proteins 0.000 description 5
- 108091081024 Start codon Proteins 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- -1 e.g. Proteins 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 239000000284 extract Substances 0.000 description 5
- 230000001939 inductive effect Effects 0.000 description 5
- 102000039446 nucleic acids Human genes 0.000 description 5
- 108020004707 nucleic acids Proteins 0.000 description 5
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 5
- 230000017854 proteolysis Effects 0.000 description 5
- 241000894006 Bacteria Species 0.000 description 4
- 101710198774 Envelope protein US9 Proteins 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 101100518977 Mus musculus Pask gene Proteins 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 101100258052 Rattus norvegicus Stk39 gene Proteins 0.000 description 4
- 241000187747 Streptomyces Species 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 229960005091 chloramphenicol Drugs 0.000 description 4
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 4
- 229940088598 enzyme Drugs 0.000 description 4
- KWGKDLIKAYFUFQ-UHFFFAOYSA-M lithium chloride Chemical compound [Li+].[Cl-] KWGKDLIKAYFUFQ-UHFFFAOYSA-M 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 101100137547 Arabidopsis thaliana PRF3 gene Proteins 0.000 description 3
- 102100026189 Beta-galactosidase Human genes 0.000 description 3
- 102000029816 Collagenase Human genes 0.000 description 3
- 108060005980 Collagenase Proteins 0.000 description 3
- 102100031780 Endonuclease Human genes 0.000 description 3
- 108010042407 Endonucleases Proteins 0.000 description 3
- 241000701959 Escherichia virus Lambda Species 0.000 description 3
- 108090000631 Trypsin Proteins 0.000 description 3
- 102000004142 Trypsin Human genes 0.000 description 3
- 101100137778 Zea mays PRO5 gene Proteins 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 108010005774 beta-Galactosidase Proteins 0.000 description 3
- 229960002424 collagenase Drugs 0.000 description 3
- 238000003119 immunoblot Methods 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- SLCVBVWXLSEKPL-UHFFFAOYSA-N neopentyl glycol Chemical compound OCC(C)(C)CO SLCVBVWXLSEKPL-UHFFFAOYSA-N 0.000 description 3
- 230000006337 proteolytic cleavage Effects 0.000 description 3
- 239000000523 sample Substances 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 230000014621 translational initiation Effects 0.000 description 3
- 239000012588 trypsin Substances 0.000 description 3
- 230000003612 virological effect Effects 0.000 description 3
- PGOHTUIFYSHAQG-LJSDBVFPSA-N (2S)-6-amino-2-[[(2S)-5-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-4-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-5-amino-2-[[(2S)-5-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S,3R)-2-[[(2S)-5-amino-2-[[(2S)-2-[[(2S)-2-[[(2S,3R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-5-amino-2-[[(2S)-1-[(2S,3R)-2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-1-[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-amino-4-methylsulfanylbutanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-5-carbamimidamidopentanoyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]-3-methylbutanoyl]amino]-4-methylpentanoyl]amino]-4-methylpentanoyl]amino]acetyl]amino]-3-hydroxypropanoyl]amino]-4-methylpentanoyl]amino]-3-sulfanylpropanoyl]amino]-4-methylsulfanylbutanoyl]amino]-5-carbamimidamidopentanoyl]amino]-3-hydroxybutanoyl]pyrrolidine-2-carbonyl]amino]-5-oxopentanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-(1H-imidazol-5-yl)propanoyl]amino]-4-methylpentanoyl]amino]-3-hydroxybutanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-5-carbamimidamidopentanoyl]amino]-5-oxopentanoyl]amino]-3-hydroxybutanoyl]amino]-3-hydroxypropanoyl]amino]-3-carboxypropanoyl]amino]-3-hydroxypropanoyl]amino]-5-oxopentanoyl]amino]-5-oxopentanoyl]amino]-3-phenylpropanoyl]amino]-5-carbamimidamidopentanoyl]amino]-3-methylbutanoyl]amino]-4-methylpentanoyl]amino]-4-oxobutanoyl]amino]-5-carbamimidamidopentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-4-carboxybutanoyl]amino]-5-oxopentanoyl]amino]hexanoic acid Chemical compound CSCC[C@H](N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O PGOHTUIFYSHAQG-LJSDBVFPSA-N 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 2
- 208000030507 AIDS Diseases 0.000 description 2
- 101000609447 Beet necrotic yellow vein virus (isolate Japan/S) Protein P25 Proteins 0.000 description 2
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 102000048120 Galactokinases Human genes 0.000 description 2
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 2
- 101710159910 Movement protein Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 239000002202 Polyethylene glycol Substances 0.000 description 2
- 241000712907 Retroviridae Species 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- 102000019197 Superoxide Dismutase Human genes 0.000 description 2
- 108010012715 Superoxide dismutase Proteins 0.000 description 2
- 108020005038 Terminator Codon Proteins 0.000 description 2
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 208000036142 Viral infection Diseases 0.000 description 2
- 229910001628 calcium chloride Inorganic materials 0.000 description 2
- 239000001110 calcium chloride Substances 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 239000013611 chromosomal DNA Substances 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000012411 cloning technique Methods 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 239000013613 expression plasmid Substances 0.000 description 2
- 108010027225 gag-pol Fusion Proteins Proteins 0.000 description 2
- 206010022000 influenza Diseases 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 230000035800 maturation Effects 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 239000002207 metabolite Substances 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 230000001323 posttranslational effect Effects 0.000 description 2
- 238000004007 reversed phase HPLC Methods 0.000 description 2
- 231100000331 toxic Toxicity 0.000 description 2
- 230000002588 toxic effect Effects 0.000 description 2
- 230000005026 transcription initiation Effects 0.000 description 2
- 230000005030 transcription termination Effects 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 230000009385 viral infection Effects 0.000 description 2
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- OIXLLKLZKCBCPS-RZVRUWJTSA-N (2s)-2-azanyl-5-[bis(azanyl)methylideneamino]pentanoic acid Chemical compound OC(=O)[C@@H](N)CCCNC(N)=N.OC(=O)[C@@H](N)CCCNC(N)=N OIXLLKLZKCBCPS-RZVRUWJTSA-N 0.000 description 1
- YVOOPGWEIRIUOX-UHFFFAOYSA-N 2-azanyl-3-sulfanyl-propanoic acid Chemical compound SCC(N)C(O)=O.SCC(N)C(O)=O YVOOPGWEIRIUOX-UHFFFAOYSA-N 0.000 description 1
- 101710112984 20 kDa protein Proteins 0.000 description 1
- ZZGXRPGQPAPARK-UWVGGRQHSA-N 3-[(5r,6r)-1-azabicyclo[3.2.1]octan-6-yl]-4-propylsulfanyl-1,2,5-thiadiazole Chemical compound C1([C@H]2CN3C[C@@]2(CCC3)[H])=NSN=C1SCCC ZZGXRPGQPAPARK-UWVGGRQHSA-N 0.000 description 1
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 1
- QFVHZQCOUORWEI-UHFFFAOYSA-N 4-[(4-anilino-5-sulfonaphthalen-1-yl)diazenyl]-5-hydroxynaphthalene-2,7-disulfonic acid Chemical compound C=12C(O)=CC(S(O)(=O)=O)=CC2=CC(S(O)(=O)=O)=CC=1N=NC(C1=CC=CC(=C11)S(O)(=O)=O)=CC=C1NC1=CC=CC=C1 QFVHZQCOUORWEI-UHFFFAOYSA-N 0.000 description 1
- 241000714195 Aids-associated retrovirus Species 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 101710140962 Capsid scaffolding protein Proteins 0.000 description 1
- 108090000317 Chymotrypsin Proteins 0.000 description 1
- 102000008186 Collagen Human genes 0.000 description 1
- 108010035532 Collagen Proteins 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 102000004594 DNA Polymerase I Human genes 0.000 description 1
- 108010017826 DNA Polymerase I Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 101710170658 Endogenous retrovirus group K member 10 Gag polyprotein Proteins 0.000 description 1
- 101710186314 Endogenous retrovirus group K member 21 Gag polyprotein Proteins 0.000 description 1
- 101710162093 Endogenous retrovirus group K member 24 Gag polyprotein Proteins 0.000 description 1
- 101710094596 Endogenous retrovirus group K member 8 Gag polyprotein Proteins 0.000 description 1
- 101710177443 Endogenous retrovirus group K member 9 Gag polyprotein Proteins 0.000 description 1
- 101710168592 Gag-Pol polyprotein Proteins 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 108010054147 Hemoglobins Proteins 0.000 description 1
- 102000001554 Hemoglobins Human genes 0.000 description 1
- 101000899111 Homo sapiens Hemoglobin subunit beta Proteins 0.000 description 1
- 101000633984 Homo sapiens Influenza virus NS1A-binding protein Proteins 0.000 description 1
- 241000598436 Human T-cell lymphotropic virus Species 0.000 description 1
- 241000713340 Human immunodeficiency virus 2 Species 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 102100029241 Influenza virus NS1A-binding protein Human genes 0.000 description 1
- 101710203526 Integrase Proteins 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- 208000008771 Lymphadenopathy Diseases 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 241000713869 Moloney murine leukemia virus Species 0.000 description 1
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 108090001074 Nucleocapsid Proteins Proteins 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 108090000284 Pepsin A Proteins 0.000 description 1
- 102000057297 Pepsin A Human genes 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 108010076039 Polyproteins Proteins 0.000 description 1
- OCYROESYHWUPBP-CIUDSAMLSA-N Pro-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 OCYROESYHWUPBP-CIUDSAMLSA-N 0.000 description 1
- AWJGUZSYVIVZGP-YUMQZZPRSA-N Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 AWJGUZSYVIVZGP-YUMQZZPRSA-N 0.000 description 1
- 102100024147 Protein phosphatase 1 regulatory subunit 14A Human genes 0.000 description 1
- 108010094028 Prothrombin Proteins 0.000 description 1
- 102100027378 Prothrombin Human genes 0.000 description 1
- 206010039491 Sarcoma Diseases 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- 101710090398 Viral interleukin-10 homolog Proteins 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 238000003277 amino acid sequence analysis Methods 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 238000005349 anion exchange Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 230000000721 bacterilogical effect Effects 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 239000003114 blood coagulation factor Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 229960002376 chymotrypsin Drugs 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 229920001436 collagen Polymers 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000000432 density-gradient centrifugation Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 231100000676 disease causative agent Toxicity 0.000 description 1
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 235000020774 essential nutrients Nutrition 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 208000012116 immunodeficiency-related disease Diseases 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 208000032839 leukemia Diseases 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 208000018555 lymphatic system disease Diseases 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- MHWLWQUZZRMNGJ-UHFFFAOYSA-N nalidixic acid Chemical compound C1=C(C)N=C2N(CC)C=C(C(O)=O)C(=O)C2=C1 MHWLWQUZZRMNGJ-UHFFFAOYSA-N 0.000 description 1
- 125000002347 octyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 238000012261 overproduction Methods 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 229940111202 pepsin Drugs 0.000 description 1
- 239000002831 pharmacologic agent Substances 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 229940039716 prothrombin Drugs 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- FLNVBBPBGKOJHN-KKAOYSRWSA-N sivmac Chemical compound O=C([C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O FLNVBBPBGKOJHN-KKAOYSRWSA-N 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 230000010473 stable expression Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 239000002821 viper venom Substances 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/795—Porphyrin- or corrin-ring-containing peptides
- C07K14/805—Haemoglobins; Myoglobins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/50—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
- C12N9/503—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from viruses
- C12N9/506—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from viruses derived from RNA viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/40—Fusion polypeptide containing a tag for immunodetection, or an epitope for immunisation
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/50—Fusion polypeptide containing protease site
Definitions
- This invention relates to the field of recombinant DNA techniques for production of proteins and polypeptides.
- Fusion proteins can be cleaved by chemical means, such as cyanogen bromide, or more specific cleavage of hybrid proteins can be achieved by including a nucleotide sequence encoding a protease cleavage site or other selectively cleavable site into the nucleotide sequence encoding the hybrid protein between the sequences encoding the respective proteins.
- chemical means such as cyanogen bromide
- cleavage of hybrid proteins can be achieved by including a nucleotide sequence encoding a protease cleavage site or other selectively cleavable site into the nucleotide sequence encoding the hybrid protein between the sequences encoding the respective proteins.
- Ile-Glu-Gly-Arg is the sequence that immediately precedes the two factor X a cleavage sites in prothrombin.
- the purified hybrid protein was harvested and subsequently digested with blood coagulation factor X a , which had been activated with
- Russel's viper venom Russel's viper venom.
- Germino and Bastia Proc. Nat'l Acad. Sci., USA, 81: 4692-4696 (1984) described a similar system using an initiator sequence, a sequence encoding 60 amino acids of chicken pro-alpha-2 collagen and a marker gene.
- the resulting hybrid protein was purified and subsequently cleaved by collagenase.
- This system had the drawback of apparently non-specific proteolysis when high concentration of collagenase were used to cleave the hybrid protein, and it was necessary to carefully titrate each batch of collagenase to ensure specific cleavage and to guard against this apparently non-specific proteolysis.
- Patent 4,751,180 issued June 14, 1988 to Cousens et al discloses methods and compositions for producing heterologous proteins in a host organism such as yeast.
- a DNA sequence coding for superoxide dismutase or other highly expressed protein is joined to a DNA sequen coding for a polypeptide of interest by a selectively cleavable linkage.
- the fusion protein thus produced can then be purified and subsequently cleaved by chemical or enzymatic means to produce the polypeptide of interest.
- proteases e.g., trypsin or cyanogen bromide is relatively non-specific. As a result, functional proteins are difficult to produce with these methods. In the case of site-specific proteases, cleavage requires an extra processing step following production of a fusion protein. Poor solubility of the fusion proteins in the cells that produce them is also often a drawback of these methods.
- the invention provides methods of preparing a protein product by expressing, in the same cell, a fusion protein and a protease capable of cleaving the protease cleavage site, contained in the fusion protein, to
- HIV-1 protease cleavage of a fusion protein having and HIV-1 protease cleavage site to produce the selected protein product is more selective than when cleavage is done with chemical reagents or enzymes such as trypsin.
- the amino acid sequence recognized by these latter methods may appear several times in the fusion protein, and the use of these methods will result in several, usually non-fictional, protein fragments.
- the retroviral cleavage site is not commonly found in proteins, thus, the likelihood of inadvertently cleaving the protein product while cleaving the deliberate retroviral protease cleavage site is correspondingly significantly reduced.
- the invention provides the efficient production of protein products in quantities sufficient for
- the invention eliminates the need to first purify the fusion protein, release the protein product and then isolate the protein product in a second, separate purification process.
- the protein product is present in cells in a mature form. There should be little or no cost for large quantities of purified protease to cleave the fusion protein, since the organism is producing the appropriate enzyme at the same time it is producing the fusion protein.
- the methods of the invention will result in more efficient, less
- a first recombinant DNA molecule comprising a fusion protein nucleotide sequence coding for a fusion protein is provided.
- the fusion protean nucleotide sequence is provided.
- the first portion preferably codes for a protein that is
- the protease cleavage site portion codes for amino acids recognized by the protease selected for use in the invention.
- the protein product portion codes for a selected protein product.
- the amino acids of the cleavage site act as substrate for the proteolytic enzyme. Cleavage of the protease recognition site releases the protein product from its association with the first portion of the fusion protein.
- a second recombinant DNA molecule comprising a coding sequence for a protease or an active fragment or derivative thereof, capable of cleaving amino acids encoded by the protease cleavage site portion is also provided.
- This nucleotide sequence codes for a functional protease that enzymatically releases the protein product from the fusion protein.
- the first and second recombinant DNA molecules are then introduced into a cell for expression of the nucleotide sequences and production of the protein
- the cells are cultured under conditions selected to express the fusion protein and the protease, whereby the protease cleaves the fusion protein at the protease cleavage site to thereby produce the mature protein product.
- the mature protein product may then be isolated by conventional methods.
- the fusion protein nucleotide sequence and the protease nucleotide sequence may be incorporated into at least one expression vector and then introduced into the cell, or the two sequences may be incorporated into the genome of the cell for expression of the fusion protein and protease.
- the first and second recombinant DNA molecules are
- the protease and cleavage site are preferably derived from HIV-1.
- the first and second recombinant DNA molecules may further comprise at least one expression control sequence operatively linked with the fusion protein nucleotide sequence and the protease nucleotide sequence, respectively.
- the invention also provides fusion protein nucleotide sequences as described above, nucleotide sequences coding for HIV-1 protease or proteolytically active fragments or derivatives thereof, recombinant
- the invention further provides cells having incorporated therein two recombinant DNA molecules, a first recombinant DNA molecule comprising a nucleotide sequence coding for a fusion protein nucleotide sequence and a second recombinant DNA molecule comprising a nucleotide sequence coding for a protease capable of cleaving the fusion protein at the protease cleavage site.
- the fusion protein nucleotide sequences are those described herein for the production of fusion proteins.
- the cells of the invention preferably have two expression vectors, one of which comprises the first recombinant DNA molecule, and the other of which comprises the second recombinant DNA molecule.
- the invention provides nucleotide sequences, peptides, cells and methods for producing protein
- a retroviral protease is coupled with a retroviral protease cleavage site for production of mature protein products.
- Preferred retroviruses are the immunodeficiency viruses; more preferably, Human Immunodeficiency Virus-1 (HIV-1).
- Retroviruses that is, viruses within the family Retroviridae, are a large family of enveloped, icosohedral viruses of about 150 nm having a coiled nucleocapsid within the core structure and having RNA as the genetic material.
- the family comprises the oncoviruses, such as the sarcoma and leukemia viruses, the immunodeficiency viruses and the lentiviruses.
- HIV causative agent of acquired immunodeficiency syndrome and related disorders
- Retroviridae family There exist several isolates of HIV, including human T-lymphotropic virus type-Ill (HTLV-III). the lymphadenopathy virus (LAV) and the AIDS-associated retrovirus (ARV) which have been grouped in type 1.
- HTLV-III human T-lymphotropic virus type-Ill
- LAV lymphadenopathy virus
- ARV AIDS-associated retrovirus
- HIV type 2 which has been shown to be associated with AIDS in West Africa.
- Immunodeficiency Virus such as SIV mac -BK28 [see
- HIV genome Molecular characterization of the HIV genome has demonstrated that the virus exhibits the same overall gag-pol-env genetic organization as other retroviruses.
- HIV also contains a highly specific protease responsible for post-translational processing of HIV precursor
- gag-pol region is initially translated int a polyprotein precursor of about 55 kDa that is then processed into the mature p17, p24, and p15 gag structural proteins. Similarly, gag-pol region is believed to be translated into a larger precursor through a translational frameshift between the overlapping gag and pol reading frames. This gag-pol precursor is post-translationally processed as well, to yield the mature gag proteins and the products of the pol region , including protease ,
- gag and gag-pol polyproteins maturation of the gag and gag-pol polyproteins has been shown to be effected at least, in part, by a
- the retrovirus protease is essential to the retroviral life cycle as indicated by the roduction of noninfectious, replication-deficient virions by Moloney murine leukemia virus variants mutated in the protease coding region.
- HIV protease codin ion is believed to be positioned between the p15 gag and reverse transcriptase genes and to reside entirely with the pol reading frame. Determination of the amino-terminal sequences of HIV gag p24 and reverse transcriptase proteins and knowledge of the surrounding sequences have revealed a common sequence [Asn-(Phe or Tyr)*Pro-Ile] at the cleavage site, with cleavage occurring at the asterisk between (phenylalanine or tyrosine) and proline. A similar sequence,
- the first and second recombinant DNA sequences are generally introduced into a cell for production of the protein product either by incorporation into at least one expression vector that is then introduced into the cell, or. by integration into the genome of the host cell.
- nucleic acid sequences of the invention will be prepared as a single entity, it should be appreciated that they may be prepared as various fragments, these fragments joined in expression cassettes and expression vectors to various untranslated regions providing for particular functions , and ultimately the coding sequences brought together at a subsequent stage.
- the discussion will be directed primarily to the situation where the coding sequence is prepared as a single entity and then transferred to an expression vector.
- the fusion protein nucleic acid sequences of the invention preferably comprise a first portion, a protease cleavage site portion and a protein product portion.
- the parts of the fusion protein nucleic acid sequence are preferably arranged in an order wherein the first portion is upstream of the protease cleavage site portion and the protein product portion, respectively.
- the protein product portion may be upstream of the protease cleavage site portion and the first portion.
- the first portion of the nucleotide sequences of the invention are preferably chosen from genes for
- nucleic acid sequences encoding proteins produced in amounts of about 5% or greater of the total protein produced by the cell are considered to be
- nucleic acid sequence for use in the invention may be identical to the gene encoding the well-expressed protein portion, or it may be a mutant of the gene or have one or more codons substituted.
- the entire gene or any portion of the gene which provides for the desired high yield of protein in the host may be employed.
- nucleotide sequence when the nucleotide sequence is to be expressed in E. coli, several genes, such as the galactokinase gene, Rosenberg et al, Meth. Enzymol.,
- a preferred first portion is an E. coli galactokinase gene, more preferably the first 56 codons of the E. coli galactokinase gene. This gene and gene fragment result in high levels of expression in E. coli.
- the choice of protein product portion will depend upon the identity of the selected protein product.
- the protein products of the invention are mature proteins or polypeptides and do not comprise generally amino acids derived from the first portion.
- the protein products may however, comprise one or more amino acids derived from the protease cleavage site portion, depending on the length of the protease cleavage site portion and the exact position of cleavage within this portion.
- the protein product portion can code for an entire protein, part of a protein polypeptide, or oligopeptide. The size and choice of protein product will depend on such factors as its use or desired characteristics. Protein products of interest include enzymes, hormones, blood clotting factors,
- immunoglobulins etc., as well as fragments or parts of such molecules.
- the protease cleavage site portion comprises nucleotides coding for the amino acids forming the
- protease cleavage site portion should be chosen to provide an enzyme-substrate pair with the protease nucleotide sequence. For example, if the protease cleaves an Ala-Leu bond, the protease cleavage site portion should include at least nucleotides coding these amino acids. Suitable protease cleavage sites for use in the invention include those which serve as substrates for proteases from
- protease cleavage site preferably
- Suitable proteases for use in the invention include those that are capable of exhibiting proteolytic activity within the environment of the cell in which the expression system is introduced and that are not
- proteases substantially toxic for the host cell upon induction of the expression system.
- Proteolytically active fragments and derivatives of such proteases are also suitable for use in the invention.
- Proteases, such as trypsin, having cleavage site which commonly, and frequently, appear in protein products are not suitable for use in the invention.
- these proteases may, nevertheless, be suitable for use in the invention in situation where the protein product is functional after cleavage of the fusion protein with the protease.
- Preferred protease cleavage sites include those of retroviruses, preferably immunodeficiency virus
- cleavage sites such as those of HIV and Simian
- Preferred cleavage sites are selected from the group consisting of R 1 .tyrosine.proline.R 2 ,
- R 2 and R 1 . leucine. phenylalanine. R 2 wherein R 1 is
- X 1 , X 2 , X 3 , X 4 and X 5 are independently an
- n,o,p,r and s are independently 0 or 1.
- X 1 is preferably selected from the group consisting of serine and threonine.
- X 1 , X 2 , X 3 , X 4 and X 5 may
- X 4 and X 5 are preferably hydrophobic amino
- acids such as isoleucine, valine, glycine or serine.
- cleavage sites are:
- amino acids are those conventionally given to amino acids found in
- proteins i.e., ala-alanine, arg-arginine, asp-aspartic acid, asn-asparagine, cys-cysteine, gly-glycine,
- glu-glutamic acid gln-glycine, his-histadine,
- the protease cleavage site portion may also include additional nucleotides flanking the nucleotides coding for the cleavage site, providing these additional nucleotides do not encode amino acids that substantially interfere with the activity of the protease or the
- nucleotide sequences of the invention are those coding for further amino acids near the protease cleavage site, those derived from the genetic material of the organism from which the protease cleavage site was isolated, linker sequences, or sequences tending to promote proper folding of the fusion protein produced from the nucleotide sequences of the invention.
- a nucleotide sequence coding for HIV-1 protease may be obtained by restriction endonuclease digestion of clone of HIV-1, such as the BH10 clone of the HTLVIIIB isolate of HIV, by isolating HIV-1 from persons infected with the virus, and synthesis of single-stranded DNA from the HIV RNA with reverse transcriptase using conventiona techniques. The single-stranded DNA complementary may then be used as the template for preparing a second strand to provide double-stranded cDNA containing the coding region for the HIV-1 protease. Once DNA encoding of the protease region is obtained, fragments containing the nucleotide sequence coding for protease are obtained by digestion of the DNA with appropriate restriction
- fragment containing the nucleotide sequence coding for protease is isolated in accordance with conventional techniques. Fragments, mutations, additions and deletions from the nucleotide sequence coding for HIV-1 protease that have proteolytic activity are also within the scope of the invention and are suitable for use in the methods of the invention.
- the DNA sequences coding for the first portion, the protease cleavage site portion, the protein product portion, and the DNA sequence coding for the protease may be obtained in a variety of ways.
- the sequences encoding the portions may be derived from natural sources, where the messenger RNA or chromosomal DNA may be identified with appropriate probes, which are complementary to a portion of the coding or non-coding sequence. From
- messenger RNA, single-stranded (ss) DNA may be prepared employing reverse transcriptase in accordance with
- the ss DNA complementary strand may then be used as the template for preparing a second strand to provide double-stranded (ds) cDNA containing the coding region for the portion.
- ds double-stranded
- the region containing the coding region may be detected employing probes, restriction mapped, and by appropriate techniques isolated, substantially free of untranslated 5' and 3' regions.
- the remaining portions may be provided by synthesis of adapters which can be ligated to the coding portions and provide for convenient termini for ligation to other sequences providing
- the first portion, the protein product portion, the protease cleavage site portion, and the DNA sequence coding for the protease may also be prepared synthetically using conventional methods and reagents for synthesizing nucleic acids.
- the first and second recombinant DNA molecules may further comprise expression control elements
- the fusion protein nucleotide sequence is preferably flanked by expression control elements that provide control of transcription and translation. These may be included with the fusion protein nucleotide
- the fusion protein nucleic acid sequence in the first recombinant DNA molecule to form an expression cassette can be moved intact into a selected expression vector.
- the fusion protein nucleic acid sequence, without transcription control elements can be moved to selected expression vectors already having transcription control elements incorporated therein.
- An expression cassette containing the nucleic acid sequences of the invention preferably comprises, in an upstream position, a promoter, followed by a
- translation initiation signal comprising a ribosome binding site, and an initiation codon, and, in a
- transcription termination signal downstream position, a transcription termination signal.
- additional transcription control elements such as capping sequences and enhancers may also be used when expression is done in eukaryotic hosts.
- the transcription and translation control elements may be ligated in any functional combination or order.
- the transcription and translation control elements used in any particular embodiment of the invention will be chosen with reference to the type of cell into which the expression vector will be introduced, so that an expression system will be created.
- Nucleotide sequences coding for the protease also preferably form parts of an expression cassette containing the elements described above for the nucleotide sequences coding for the fusion protein, or they may be inserted into expression vectors already containing transcription control elements. If the nucleotide sequences coding fo r the fusion protein and the protease are directed by different sets of transcription control elements, it is preferable that both sequences are expressed in comparable amounts to avoid a relative overabundance of the fusion protein or protease.
- inducible promoters are preferred for use in the invention.
- suitable inducible promoters are those where induction may be as a result of a physical change, e.g., temperature; or chemical change, e.g., change in nutrient or metabolite concentration, such as glucose or tryptophan; or change in pH or ionic strength.
- Suitable promoters in E. coli include promoters, such as the Lambda P L and P R promoters, and the E. coli
- MMTV mouse mammalian tumor virus
- the transcription initiation elements can be separated from the transcription terminator sequence by a polylinker, which has a plurality of unique restriction sites for insertion of the fusion protein nucleic acid sequence.
- the polylinker will be followed by the
- the polylinker will be cleaved and the open reading frame nucleotide sequence coding for the fusion protein may be inserted.
- the expression vector will be selected so as to have an appropriate copy number, as well as providing for stable extrachromosomal maintenance.
- the expression vector will usually have a marker that allows for
- Suitable markers include those which provide the host with an enhanced ability to utilize an essential nutrient or metabolite in short supply, or provide resistance to drugs or
- antibiotics such as ampicillin or chloramphenicol.
- Cells suitable for use in the invention include prokaryotic and eukaryotic cells that can be transformed to stably contain and express both the fusion protein and the protease. Suitable types of cells include bacterial, yeast and mammalian cells. E. coli cells are preferred. Introduction of at least one expression vector
- incorporating the fusion protein nucleotide sequence and the protease nucleotide sequence can be performed in a variety of ways, such as calcium chloride or lithium chloride treatment, calcium-polyethylene glycol-treated DNA with spheroplasts, use of liposomes, mating,
- the plasmid When one plasmid is employed for expression of both the fusion protein and the protease, the plasmid may be constructed to have two separate sets of transcription control elements, one for the fusion protein nucleic acid sequence and one for the protease nucleotide sequence; or both the fusion protein and the protease may be under the control of the same transcription control elements.
- the transcription control elements When two plasmids are used, the transcription control elements may be selected to be identical in both plasmids, or the may be selected to be different in one or more
- both are inserted into the same cell. Care should be taken that the expression vectors are chosen to be compatible and stable in the same cell.
- E. coli cell are chosen for production of the protein product, and two plasmids are used (one for the fusion protein, one for the protease), a suitable compatible pair of plasmids is one plasmid carrying the colEl origin of replication and the ampicillin-resistance marker and the other plasmid
- chloramphenicol-resistance marker Other suitable compounds
- Saccharomyces cerevisiae (micron-based and CEN-based expression vectors), Gorman et al, Gene, 48:13-22 (1986). Stable expression of both the fusion protein and the protease using two expression vectors should also be possible with some mammalian cells.
- Cells, expression vectors and transcription control elements are selected to be complementary to one another to form an expression system.
- Suitable expressio systems in E. coli utilize, for example, influenza NSl, Young et al, Proc. Nat'l Acad. Sci., USA, 80:6105 (1983), phage Lambda O protein, Rosenberg et al, Meth. Enzymol., 101:123 (1983), the E. coli galactokinase protein,
- yeast expression can be achieved with E. coli galK, Rosenberg et al, in
- At least one expression cassette comprising the first and second recombinant DNA molecules can be
- the expression cassette can be incorporated into the genome of the host cell by site-directed, e.g., by homologous recombination, or random integration. Introduction of the expression cassette into the host cell can be accomplished in. a variety of ways, such as co-precipitation, calcium chloride or lithium chloride treatment, calcium
- nucleotide sequence coding for the fusion protein and the nucleotide sequence coding for the protease may be under the control of the same or different transcription control elements. Additionally, the nucleotide sequence coding for the fusion protein and the nucleotide sequence coding for the protease may be incorporated into the same cell in different manners, i.e., the first recombinant DNA molecule coding for the fusion protein may be incorporated into the genome of the host cell, and the second
- sequence coding for the protease may be incorporated into an expression vector, or vice versa.
- Host cells containing the first and second recombinant DNA molecules are then grown in an appropriate medium for the host.
- an inducible promoter employed, the host cell may be grown to high density and the promoter turned on for expression of the fusion protein and protease. Where the promoter is not
- the cells may be grown until there is no further increase in product formation or the ratio of nutrients consumed to product formation falls below a predetermine level, at which time, the cells may be harvested, lysed and the protein product obtained and purified in
- E. coli The bacterium E. coli is the organism of choice for the production of the protein products. Cloning and expression can be obtained rapidly in E. coli; and to date, in some cases, much higher levels of gene expression have been achieved in E. coli as compared to other
- E. coli organisms, e.g., mammalian systems (Chinese hamster ovary cells) or yeast. Production in E. coli is readily
- E. coli strains are defective lysogen strains. Some such strains can be used in combination with the inducible Lambda phage P L promoter for heat induction of the promoter, Devare et al,
- Nucleotide sequences coding for HIV-1 protease or a proteolytically active fragment or derivative thereof may be obtained by restriction endonuclease digestion of a clone of HIV-1, using the appropriate restriction
- Proteolytically active fragments, derivatives, additions and deletions of nucleotide sequences coding for HIV-1 protease are also within the scope of the invention.
- Suitable nucleotide sequences coding for peptides having proteolytic activity towards HIV-1 cleavage sites include, or the complementary sequences thereof:
- the recombinant E. coli protease product is produced by recombinant DNA techniques from nucleotide sequences coding for nucleotide sequences coding for peptides having proteolytic activity toward HIV-1 cleava sites.
- the recombinant E. coli protease has proteolytic activity towards HIV-1 cleavage sites and is useful for cleaving fusion proteins as disclosed herein. Suitable methods and nucleotide sequences for preparing the
- E. coli protease product include those
- the protease product may be produced in cells with or without production of a fusion protein.
- Suitable peptides include those selected from the group of peptides consisting of-
- Proteolytically active fragments, mutations, additions or deletions of the recombinant E. coli protease product are also within the scope of the invention.
- the E. coli strain used for expression was AR120, as described in Mott et al, Proc. Nat'l Acad. Sci., USA, 82: 88-92 (1985). All DNA manipulations were carried out as described in Maniatis et al. Molecular Cloning: A
- E. coli strain AR120 carrying the expression plasmids were induced, and total cell extracts were prepared and analyze by NaDodSO 4 /PAGE as described in Aldovini, Debouck et al, Proc. Nat'l Acad. Sci., USA, 83:6672-6676 (1886). HIV Protease and Activity
- HIV-1 protease Vectors for expression of HIV-1 protease were constructe by inserting the appropriate restriction fragment into the expression vector pOTSKF33 All HIV-1 restriction fragments were isolated from the BH10 clone of the HTLVIIIB isolate of HIV (Shaw et al, Science, 226:1165-1171 (1984).
- pOTSKF33 was constructed in several steps:
- Step 1 Insertion of the polylinker region of pUC19 into the pOTS34 vector: pOTS34, a derivative of pASl (Devare et al, Cell, 36:43 (1984), was digested with the Bglll restriction endonuclease, treated with DNA polymerase I (Klenow) to create blunt ends and redigested with the Sphl restriction endonuclease.
- the large 5251 base pair (bp) Bglll-Sphl fragment was purified and ligated to the 135 bp Bglll-Sp restriction fragment isolated and purified from pUC19 (Yanisch-Perron et al, Gene, 33:103 (1985).
- Step 2 Insertion of the partial galK expression unit from pASK into pOTS-UC: pOTS-UC was digested with Xmal, treated with Klenow to create blunt ends and redigested with Hindlll. The large fragment was purified and ligated to a 2007 bp Bell filled-Hindlll fragment isolated and purified from pASK. The resulting plasmid was called pOTSKF45.
- the pASK plasmid vector is a pASl derivative in which the entire galK coding sequence is inserted after the ATG initiation codon. This galK sequence differs from the authentic sequence (Debouck et al, Nucleic Acids Res., 13:1841 (1985)) at the 5' end:
- the pASK sequence contains a BamHI site (GGATCC) at the ATG whereas the authentic sequence does not.
- pOTSKF45 was digested with Xmal and Sail and then treated with Klenow to create blunt ends.
- 5'-GTA.GGC.CTA.GTT.AAC.TAG-3' was then introduced between the Xmal filled and Sail filled sites by ligation with T4 DNA ligase to yield the vector: pOTSKF33.
- NlalV-Hae III fragment was isolated from BH10 using standard cloning techniques and inserted at the Stul restriction site of pOTSKF33 in the correct orientation.
- the NlalV-Haelll restriction fragment has the following nucleotide sequence and corresponding translation for the reading frame of interest:
- a 516-bp Hae III fragment was isolated from BH 10, using standard cloning techniques and inserted at the XmaI filled-in restriction site of pOTSKF33 in the correct orientation.
- the Hae III restriction fragment has the following nucleotide sequence and corresponding
- E. coli overlapping fragments, encompassing the protease coding region, were inserted into the pOTSKF33 bacterial expression vector, as described above in I.
- the fragments contained the two domains (I and II) that are highly conserved among all known retroviral protease, as well as the region between these domains.
- the fragments differed only in the amount of viral sequence information they carried upstream and downstream from the conserved
- the PRO3 and PRO4 fragments contained the proteolytic cleavage site, (indicated by the asterisk) Thr-Leu-Asn-Phe*Pro, positioned downstream of domain II, from which the amino terminus of reverse transcriptase derives ( Figure 1B) and another protease cleavage site, Ser-Phe-Asn-Phe*Pro, positioned approximately 20 codons upstream of conserved domain I, from which the amino terminus of HIV-1 protease derives. Proteolytic cleavage at these two sites yields a 10-kDa polypeptide which correspond to the mature protease.
- the protein products expressed by the PRO4 constructs were examined initially by NaDodSO./PAGE by the method described in Aldovini, Debouck et al, supra.
- the PRO3 and PRO44 constructs did not produce proteins with the size expected for their encoded products (20 kDa and 25 kDa, respectively). Instead, both
- constructs gave rise to a protein of approximately 10 kDa that was visualized by immunoblot analysis using a
- Plasmid PRO4-BX was derived from the PRO4 expression vector by digestion with Bel I, treatment with DNA polymerase (Klenow) and ligation with the 8-mer oligonucleotide 5' CCTCGAGG 3'. This treatment resulted in the insertion of four codons (encoding Pro-Ser-Arg-Asp) in the protease region between the conserved domains.
- the pOTSKF33 bacterial expression vector As described above in I, was used. A translational fusion was created between the HIV-1 gag reading frame and the first 56 amino acids of galactokinase sequence on the expression vector. A 1286-base pair (bp) Cla I-Bg II filled-in restriction endonuclease fragment, including most of the pl7, all of the p24, and half of the p15 gag coding sequence, was inserted at the Stul site of the pOTSKF33 fusion vector for expression of p55 gag. This construct is known as pOTSKF33-p55. The Clal-Bglll restriction fragment was isolated from the BH10 clone of the HTLVIIIB isolate of HIV [Shaw et al, Science,
- Plasmid pDPT287 is a derivative of the chloramphenicol-resistant pDPT270 plasmid of the incFII incompatibility group, which can co-exist with the
- pDPT287 differs from pDPT270 by two deletions, one removing a small HindiII fragment between the streptomycin-resistance gene and the incFII origin of replication and the other removing a Hindi fragment between the chloramphenicol and
- Plasmid pDPT287-p55 was constructed by excising the entire P L -p55 transcriptional cassette from
- pOTSKF33-p55 described in IV above, on a Bgllll-Sall filled-in restriction endonuclease fragment and inserting it between the unique BamHI and HindiII filled-in sites of pDPT287. This deletes the streptomycin-resistance gene of the vector. The orientation of the fragment with respect to the plasmid markers did not influence the expression of the galK-p55 product.
- bacteria were co-transformed with the p55-expressing vector and each of the PRO constructs separately. Both expression plasmids were induced simultaneously, and the extracts were analyzed by
- PRO4-BX contained the four-codon insertion that eliminated protease
- Plasmid PRO4-BS was derived from the PRO4 expression vector by digestion with Bel I, treatment with DNA polymerase (Klenow), and ligation with the 12-mer oligonucleotide 5" CATGTTAACTAG 3", which introduces stop codons in all three reading frames. This treatment resulted in the interruption of the protease coding sequence between the conserved domains.
- constructs are proteolytically active in both their precursor and mature forms and can carry out the
- the processed 24-kDa gag protein was purified from induced cells, and its amino-terminal sequence was determined.
- E. coli cells co-expressing the PRO4 and pDPT287-p55 constructs were disrupted by sonication in
- the p24 protein eluted in the range of 48-54% (vol/vol) acetonitrile and was approximately 90% pure as judged by Coomassie blue-stained NaDodSO 4 /polyacrylamide gel.
- a plasmid expressing HIV-1 protease not requiring auto-processing for proteolytic activity was constructed as follows:
- the PRO5 construct was derived from the PRO4 construct, described in Example 1, by introduction of a translation initiation codon immediately before the first codon of the mature protease (Proline 1) and a translation termination codon immediately after the last codon of the mature protease (Phenylalanine 99). This was done by site-specific mutagenesis using oligonucleotides with the following sequences:
- the PRO5 insert a 375 bp Ndel-Haelll fragment, has the following nucleotide sequence and corresponding translation for the reading frame of interest:
- the bacterial expression vector, pSKF-Pr-bglo, which produces a fusion protein consisting of the first 56 amino acids of galK, followed by a cleavage sequence for the HIV protease and ending with the full-length coding sequence for the beta-chain of human hemoglobin was constructed as follows:
- pSKF-bglo was linearized with Neol and an oligonucleotide with Neol overends
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Virology (AREA)
- Medicinal Chemistry (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The invention discloses methods for producing mature protein products. A host cell which has been co-transformed with two recombinant DNA molecules, the first of said recombinant DNA molecules comprises a fusion protein nucleotide sequence, wherein the fusion protein encoded by said coding sequence comprises a first portion, a protein product portion and a protease cleavage site portion therebetween, and the second of said recombinant DNA molecules comprising a coding sequence for a protease or an active fragment or derivative thereof, capable of cleaving amino acid encoded by the protease cleavage site portion is cultured, whereby upon culturing the cell, both the fusion protein and the protease are expressed. The protease cleaves the fusion protein at the protease cleavage site to thereby produce the protein product. In preferred embodiments of the invention, the fusion protein nucleotide sequence and the protease nucleotide sequence are incorporated into two expression vectors that are then inserted into the same cell for expression and production of the mature protein product.
Description
USE OF HIV PROTEASE CLEAVAGE SITE TO EXPRESS MATURE PROTEINS
Field of Invention
This invention relates to the field of recombinant DNA techniques for production of proteins and polypeptides.
Background of Invention
There are numerous gene products of biological interest that cannot be obtained from natural sources in quantities sufficient for detailed biochemical and
physical analysis. Moreover, the limited bioavailability of these molecules has made it impossible to consider their potential utilization as either pharmacological agents or targets. One solution to this problem has been the development of recombinant vector systems that are designed to achieve efficient expression of cloned genes in bacteria or other recombinant hosts. The rationale used in the design of these systems involves insertion of the gene of interest into a multicopy vector system
(usually a plasmid), such that the gene is efficiently transcribed and translated in a host cell and the product of. the gene of interest is produced in quantity.
Nevertheless, high-level expression of many genes in host cells has proved difficult to achieve even when placed under the control of a strong promoter. To
overcome this problem, many proteins have been efficiently produced as hybrids after fusion of the gene coding for the protein with a sequence coding for a whole or part of a highly expressible protein. However, such hybrid proteins are not usually suitable for functional studies
or pharmaceutical or other commercial use, unless the authentic protein sequence can be released by cleavage of the fusion protein portion from the protein portion of interest.
Fusion proteins can be cleaved by chemical means, such as cyanogen bromide, or more specific cleavage of hybrid proteins can be achieved by including a nucleotide sequence encoding a protease cleavage site or other selectively cleavable site into the nucleotide sequence encoding the hybrid protein between the sequences encoding the respective proteins. For example, Nagai and
Thogersen, Nature, 309:810-812 (1984) reported an
expression/cleavage system employing the recognition site for blood coagulation factor Xa. Plasmids were
constructed comprising nucleic acid sequences encoding the 31 amino terminal residues of Lambda ell protein, a sequence encoding the tetrapeptide: Ile-Glu-Gly-Arg, and the sequence encoding beta-globin. Ile-Glu-Gly-Arg is the sequence that immediately precedes the two factor Xa cleavage sites in prothrombin. The purified hybrid protein was harvested and subsequently digested with blood coagulation factor Xa, which had been activated with
Russel's viper venom.
Germino and Bastia, Proc. Nat'l Acad. Sci., USA, 81: 4692-4696 (1984) described a similar system using an initiator sequence, a sequence encoding 60 amino acids of chicken pro-alpha-2 collagen and a marker gene. The resulting hybrid protein was purified and subsequently cleaved by collagenase. This system, however, had the drawback of apparently non-specific proteolysis when high concentration of collagenase were used to cleave the hybrid protein, and it was necessary to carefully titrate each batch of collagenase to ensure specific cleavage and to guard against this apparently non-specific proteolysis.
U.S. Patent 4,751,180, issued June 14, 1988 to Cousens et al discloses methods and compositions for producing heterologous proteins in a host organism such as yeast. A DNA sequence coding for superoxide dismutase or other highly expressed protein is joined to a DNA sequen coding for a polypeptide of interest by a selectively cleavable linkage. The fusion protein thus produced can then be purified and subsequently cleaved by chemical or enzymatic means to produce the polypeptide of interest.
These methods of producing proteins can be time-consuming and costly, especially when the production of large quantities for commercial use is considered.
Cleavage of fusion proteins with certain types of
proteases, e.g., trypsin or cyanogen bromide is relatively non-specific. As a result, functional proteins are difficult to produce with these methods. In the case of site-specific proteases, cleavage requires an extra processing step following production of a fusion protein. Poor solubility of the fusion proteins in the cells that produce them is also often a drawback of these methods.
It is an object of this invention to provide methods for producing mature protein products which are less time-consuming, but which can also provide protein products in mature form in large quantities.
Summary of Invention
The invention provides methods of preparing a protein product by expressing, in the same cell, a fusion protein and a protease capable of cleaving the protease cleavage site, contained in the fusion protein, to
generate the mature protein product.
Complete processing of the fusion protein within the same cell by methods of the invention has several advantages over the prior methods. HIV-1 protease
cleavage of a fusion protein having and HIV-1 protease cleavage site to produce the selected protein product is more selective than when cleavage is done with chemical reagents or enzymes such as trypsin. The amino acid sequence recognized by these latter methods may appear several times in the fusion protein, and the use of these methods will result in several, usually non-fictional, protein fragments. The retroviral cleavage site is not commonly found in proteins, thus, the likelihood of inadvertently cleaving the protein product while cleaving the deliberate retroviral protease cleavage site is correspondingly significantly reduced.
The invention provides the efficient production of protein products in quantities sufficient for
commercial or research purposes. Because fusion proteins are synthesized and processed in the same cell, the invention eliminates the need to first purify the fusion protein, release the protein product and then isolate the protein product in a second, separate purification process. With the methods of the invention, the protein product is present in cells in a mature form. There should be little or no cost for large quantities of purified protease to cleave the fusion protein, since the organism is producing the appropriate enzyme at the same time it is producing the fusion protein. The methods of the invention will result in more efficient, less
time-consuming and more economical production of protein products.
In the methods of the invention, a first recombinant DNA molecule comprising a fusion protein nucleotide sequence coding for a fusion protein is provided. The fusion protean nucleotide sequence
comprises a first portion, a protein product portion and protease cleavage site portion therebetween. The first portion preferably codes for a protein that is
well-expressed in the host cell, to ensure high levels of
production of the fusion protein. The protease cleavage site portion codes for amino acids recognized by the protease selected for use in the invention. The protein product portion codes for a selected protein product. The amino acids of the cleavage site act as substrate for the proteolytic enzyme. Cleavage of the protease recognition site releases the protein product from its association with the first portion of the fusion protein.
A second recombinant DNA molecule comprising a coding sequence for a protease or an active fragment or derivative thereof, capable of cleaving amino acids encoded by the protease cleavage site portion is also provided. This nucleotide sequence codes for a functional protease that enzymatically releases the protein product from the fusion protein.
The first and second recombinant DNA molecules are then introduced into a cell for expression of the nucleotide sequences and production of the protein
product. Next, the cells are cultured under conditions selected to express the fusion protein and the protease, whereby the protease cleaves the fusion protein at the protease cleavage site to thereby produce the mature protein product. The mature protein product may then be isolated by conventional methods.
The fusion protein nucleotide sequence and the protease nucleotide sequence may be incorporated into at least one expression vector and then introduced into the cell, or the two sequences may be incorporated into the genome of the cell for expression of the fusion protein and protease. In preferred embodiments of the invention, the first and second recombinant DNA molecules are
incorporated into two expression vectors, the first of which comprises the first recombinant DNA molecule and the second of which comprises the second recombinant DNA
molecule. If two expression vectors are employed, it is desirable that the two expression vectors are compatible in the same cell. The protease and cleavage site are preferably derived from HIV-1. The first and second recombinant DNA molecules may further comprise at least one expression control sequence operatively linked with the fusion protein nucleotide sequence and the protease nucleotide sequence, respectively.
The invention also provides fusion protein nucleotide sequences as described above, nucleotide sequences coding for HIV-1 protease or proteolytically active fragments or derivatives thereof, recombinant
E. coli protease products and proteases specific for a cleavage site.
The invention further provides cells having incorporated therein two recombinant DNA molecules, a first recombinant DNA molecule comprising a nucleotide sequence coding for a fusion protein nucleotide sequence and a second recombinant DNA molecule comprising a nucleotide sequence coding for a protease capable of cleaving the fusion protein at the protease cleavage site. The fusion protein nucleotide sequences are those described herein for the production of fusion proteins. The cells of the invention preferably have two expression vectors, one of which comprises the first recombinant DNA molecule, and the other of which comprises the second recombinant DNA molecule.
Detailed Description of the Invention
The invention provides nucleotide sequences, peptides, cells and methods for producing protein
products. In preferred embodiments of the invention, a retroviral protease is coupled with a retroviral protease cleavage site for production of mature protein products.
Preferred retroviruses are the immunodeficiency viruses; more preferably, Human Immunodeficiency Virus-1 (HIV-1).
Retroviruses, that is, viruses within the family Retroviridae, are a large family of enveloped, icosohedral viruses of about 150 nm having a coiled nucleocapsid within the core structure and having RNA as the genetic material. The family comprises the oncoviruses, such as the sarcoma and leukemia viruses, the immunodeficiency viruses and the lentiviruses.
HIV, causative agent of acquired immunodeficiency syndrome and related disorders, is a member of the
Retroviridae family. There exist several isolates of HIV, including human T-lymphotropic virus type-Ill (HTLV-III). the lymphadenopathy virus (LAV) and the AIDS-associated retrovirus (ARV) which have been grouped in type 1.
Related Immunodeficiency Viruses include HIV type 2, which has been shown to be associated with AIDS in West Africa.
Other Immunodeficiency Viruses include the Simian
Immunodeficiency Virus (SIV), such as SIVmac -BK28 [see
Hirsch et al, Cell, 49:307 (1987)].
Molecular characterization of the HIV genome has demonstrated that the virus exhibits the same overall gag-pol-env genetic organization as other retroviruses.
HIV also contains a highly specific protease responsible for post-translational processing of HIV precursor
proteins. The HIV gag region is initially translated int a polyprotein precursor of about 55 kDa that is then processed into the mature p17, p24, and p15 gag structural proteins. Similarly, gag-pol region is believed to be translated into a larger precursor through a translational frameshift between the overlapping gag and pol reading frames. This gag-pol precursor is post-translationally processed as well, to yield the mature gag proteins and the products of the pol region , including protease ,
reverse transcriptase and endonuclease.
In several retroviruses, the proteolytic
maturation of the gag and gag-pol polyproteins has been shown to be effected at least, in part, by a
highly-specific protease that is virally encoded between the gag and pol regions. The retrovirus protease is essential to the retroviral life cycle as indicated by the roduction of noninfectious, replication-deficient virions by Moloney murine leukemia virus variants mutated in the protease coding region.
The HIV protease codin ion is believed to be positioned between the p15 gag and reverse transcriptase genes and to reside entirely with the pol reading frame. Determination of the amino-terminal sequences of HIV gag p24 and reverse transcriptase proteins and knowledge of the surrounding sequences have revealed a common sequence [Asn-(Phe or Tyr)*Pro-Ile] at the cleavage site, with cleavage occurring at the asterisk between (phenylalanine or tyrosine) and proline. A similar sequence,
(Asn-Phe-Pro-Gln-Ile) occurs at the amino terminus of the protease, Debouck et al, Proc. Nat'l Acad. Sci.,
84:8903-8906, December, 1987 which is specifically
incorporated as if fully set forth herein.
The first and second recombinant DNA sequences are generally introduced into a cell for production of the protein product either by incorporation into at least one expression vector that is then introduced into the cell, or. by integration into the genome of the host cell.
While, for the most part, the nucleic acid sequences of the invention will be prepared as a single entity, it should be appreciated that they may be prepared as various fragments, these fragments joined in expression cassettes and expression vectors to various untranslated regions providing for particular functions , and ultimately the coding sequences brought together at a subsequent stage. For clarity, however, the discussion will be directed
primarily to the situation where the coding sequence is prepared as a single entity and then transferred to an expression vector.
The fusion protein nucleic acid sequences of the invention preferably comprise a first portion, a protease cleavage site portion and a protein product portion. The parts of the fusion protein nucleic acid sequence are preferably arranged in an order wherein the first portion is upstream of the protease cleavage site portion and the protein product portion, respectively. However, in some cases, the protein product portion may be upstream of the protease cleavage site portion and the first portion. The various sequences comprising the parts of the fusion protein nucleic acid sequence, as well as any other nucleic acid manipulations, are performed using
conventional techniques for manipulating nucleic acids.
The first portion of the nucleotide sequences of the invention are preferably chosen from genes for
portions of genes that are well-expressed in the
particular cell into which the nucleotide sequence will be introduced. Nucleic acid sequences encoding proteins produced in amounts of about 5% or greater of the total protein produced by the cell are considered to be
well-expressed and suitable for use in the invention. The nucleic acid sequence for use in the invention may be identical to the gene encoding the well-expressed protein portion, or it may be a mutant of the gene or have one or more codons substituted. The entire gene or any portion of the gene which provides for the desired high yield of protein in the host may be employed.
For example, when the nucleotide sequence is to be expressed in E. coli, several genes, such as the galactokinase gene, Rosenberg et al, Meth. Enzymol.,
101:123 (1983); and the first 56 codons of this gene; the phage Lambda O protein, Rosenberg et al, Meth. Enzymol.,
101:123 (1983); E. coli beta-galactosidase protein, Bartus et al, Antim. Ag. Ch., 25:622 (1984); and the influenza NS-1 protein, Young et al, Proc. Nat'l Acad. Sci, USA,
80:6105 (1983) may be used.
A preferred first portion is an E. coli galactokinase gene, more preferably the first 56 codons of the E. coli galactokinase gene. This gene and gene fragment result in high levels of expression in E. coli.
The choice of protein product portion will depend upon the identity of the selected protein product. The protein products of the invention are mature proteins or polypeptides and do not comprise generally amino acids derived from the first portion. The protein products may however, comprise one or more amino acids derived from the protease cleavage site portion, depending on the length of the protease cleavage site portion and the exact position of cleavage within this portion. The protein product portion can code for an entire protein, part of a protein polypeptide, or oligopeptide. The size and choice of protein product will depend on such factors as its use or desired characteristics. Protein products of interest include enzymes, hormones, blood clotting factors,
immunoglobulins, etc., as well as fragments or parts of such molecules.
The protease cleavage site portion comprises nucleotides coding for the amino acids forming the
cleavage site of the protease used in the invention. The protease cleavage site portion should be chosen to provide an enzyme-substrate pair with the protease nucleotide sequence. For example, if the protease cleaves an Ala-Leu bond, the protease cleavage site portion should include at least nucleotides coding these amino acids. Suitable protease cleavage sites for use in the invention include those which serve as substrates for proteases from
eukaryotic and prokaryotic organisms, such as chymotrypsin
and pepsin. The protease cleavage site preferably
comprises nucleotides coding for amino acid combinations absent from the protein product or occurring in parts of the protein product where function of the product would not be adversely affected by cleavage of the product.
Suitable proteases for use in the invention include those that are capable of exhibiting proteolytic activity within the environment of the cell in which the expression system is introduced and that are not
substantially toxic for the host cell upon induction of the expression system. Proteolytically active fragments and derivatives of such proteases are also suitable for use in the invention. Proteases, such as trypsin, having cleavage site which commonly, and frequently, appear in protein products are not suitable for use in the invention. However, these proteases may, nevertheless, be suitable for use in the invention in situation where the protein product is functional after cleavage of the fusion protein with the protease.
Preferred protease cleavage sites include those of retroviruses, preferably immunodeficiency virus
cleavage sites, such as those of HIV and Simian
Immunodeficiency Virus (SIV).
Preferred cleavage sites are selected from the group consisting of R1.tyrosine.proline.R2,
R1 leucine.alanine.R2, R1.methionine.methionine.R2,
R1.phenylalanine.leucine.R2,R1.phenylalanine.proline.
R2, and R1. leucine. phenylalanine. R2 wherein R1 is
(X1)n(X2)O(X3)p and R2 is (X4)r(X5)s
and X1, X2, X3, X4 and X5 are independently an
amino acid and n,o,p,r and s are independently 0 or 1.
X1 is preferably selected from the group consisting of serine and threonine. X1, X2, X3, X4 and X5 may
be any amino acid generally found in proteins or any
non-protein amino acid, providing that X1, X2, X3,
X4 or X5 do not substantially interfere with the
ability of the protease to cleave the protease cleavage site. X4 and X5 are preferably hydrophobic amino
acids, such as isoleucine, valine, glycine or serine.
Examples of suitable cleavage sites are:
Ser. Gly.Asn. Tyr.Pro. lie.Val;
Ser .Phe.Asn.Phe.Pro.Gly. lie; and
Thr .Leu.Asn. Phe. Pro. lie. Ser.
As used herein, abbreviations of amino acids are those conventionally given to amino acids found in
proteins, i.e., ala-alanine, arg-arginine, asp-aspartic acid, asn-asparagine, cys-cysteine, gly-glycine,
glu-glutamic acid, gln-glycine, his-histadine,
ile-isoleucine, lys-lysine, met-methionine,
phe-phenylalanine, pro-proline, ser-serine, thr-threonine, trp-tryptophan, tyr-tyrosine, val-valine.
The protease cleavage site portion may also include additional nucleotides flanking the nucleotides coding for the cleavage site, providing these additional nucleotides do not encode amino acids that substantially interfere with the activity of the protease or the
functioning of the well-expressed gene portion or protein product portion. Examples of additional nucleotides are those coding for further amino acids near the protease cleavage site, those derived from the genetic material of the organism from which the protease cleavage site was isolated, linker sequences, or sequences tending to promote proper folding of the fusion protein produced from the nucleotide sequences of the invention.
A nucleotide sequence coding for HIV-1 protease may be obtained by restriction endonuclease digestion of clone of HIV-1, such as the BH10 clone of the HTLVIIIB isolate of HIV, by isolating HIV-1 from persons infected with the virus, and synthesis of single-stranded DNA from
the HIV RNA with reverse transcriptase using conventiona techniques. The single-stranded DNA complementary may then be used as the template for preparing a second strand to provide double-stranded cDNA containing the coding region for the HIV-1 protease. Once DNA encoding of the protease region is obtained, fragments containing the nucleotide sequence coding for protease are obtained by digestion of the DNA with appropriate restriction
endonuclease, such as Haelll, and NlalV and Haelll, and the fragment containing the nucleotide sequence coding for protease is isolated in accordance with conventional techniques. Fragments, mutations, additions and deletions from the nucleotide sequence coding for HIV-1 protease that have proteolytic activity are also within the scope of the invention and are suitable for use in the methods of the invention.
The DNA sequences coding for the first portion, the protease cleavage site portion, the protein product portion, and the DNA sequence coding for the protease may be obtained in a variety of ways. The sequences encoding the portions may be derived from natural sources, where the messenger RNA or chromosomal DNA may be identified with appropriate probes, which are complementary to a portion of the coding or non-coding sequence. From
messenger RNA, single-stranded (ss) DNA may be prepared employing reverse transcriptase in accordance with
conventional techniques. The ss DNA complementary strand may then be used as the template for preparing a second strand to provide double-stranded (ds) cDNA containing the coding region for the portion. Where chromosomal DNA is employed, the region containing the coding region may be detected employing probes, restriction mapped, and by appropriate techniques isolated, substantially free of untranslated 5' and 3' regions. Where only portions of the coding sequence are obtained, the remaining portions
may be provided by synthesis of adapters which can be ligated to the coding portions and provide for convenient termini for ligation to other sequences providing
particular functions or properties.
The first portion, the protein product portion, the protease cleavage site portion, and the DNA sequence coding for the protease may also be prepared synthetically using conventional methods and reagents for synthesizing nucleic acids.
The first and second recombinant DNA molecules may further comprise expression control elements
operatively linked with the fusion protein nucleotide sequence and the protease nucleotide sequence,
respectively. The fusion protein nucleotide sequence is preferably flanked by expression control elements that provide control of transcription and translation. These may be included with the fusion protein nucleotide
sequence in the first recombinant DNA molecule to form an expression cassette, which can be moved intact into a selected expression vector. Alternatively, the fusion protein nucleic acid sequence, without transcription control elements, can be moved to selected expression vectors already having transcription control elements incorporated therein.
An expression cassette containing the nucleic acid sequences of the invention preferably comprises, in an upstream position, a promoter, followed by a
translation initiation signal comprising a ribosome binding site, and an initiation codon, and, in a
downstream position, a transcription termination signal. However, additional transcription control elements, such as capping sequences and enhancers may also be used when expression is done in eukaryotic hosts. The transcription and translation control elements may be ligated in any functional combination or order. The transcription and
translation control elements used in any particular embodiment of the invention will be chosen with reference to the type of cell into which the expression vector will be introduced, so that an expression system will be created.
Nucleotide sequences coding for the protease also preferably form parts of an expression cassette containing the elements described above for the nucleotide sequences coding for the fusion protein, or they may be inserted into expression vectors already containing transcription control elements. If the nucleotide sequences coding fo r the fusion protein and the protease are directed by different sets of transcription control elements, it is preferable that both sequences are expressed in comparable amounts to avoid a relative overabundance of the fusion protein or protease.
Inducible promoters are preferred for use in the invention. Examples of suitable inducible promoters are those where induction may be as a result of a physical change, e.g., temperature; or chemical change, e.g., change in nutrient or metabolite concentration, such as glucose or tryptophan; or change in pH or ionic strength. Suitable promoters in E. coli include promoters, such as the Lambda PL and PR promoters, and the E. coli
PLAC'PTRP and PTAC Promoters; in yeast, the CUP 1
metallothionein promoter; and in mammalian systems, the mouse mammalian tumor virus (MMTV).
The transcription initiation elements can be separated from the transcription terminator sequence by a polylinker, which has a plurality of unique restriction sites for insertion of the fusion protein nucleic acid sequence. The polylinker will be followed by the
terminator region, which may be obtained from the same wild-type gene from which the promoter region was obtained or a different wild-type gene, so long as efficient
transcription initiation and termination is achieved when the two regions are used. By digestion of the expression vector with the appropriate restriction enzymes, the polylinker will be cleaved and the open reading frame nucleotide sequence coding for the fusion protein may be inserted.
The expression vector will be selected so as to have an appropriate copy number, as well as providing for stable extrachromosomal maintenance. The expression vector will usually have a marker that allows for
selection in the expression host. Suitable markers include those which provide the host with an enhanced ability to utilize an essential nutrient or metabolite in short supply, or provide resistance to drugs or
antibiotics, such as ampicillin or chloramphenicol.
Cells suitable for use in the invention include prokaryotic and eukaryotic cells that can be transformed to stably contain and express both the fusion protein and the protease. Suitable types of cells include bacterial, yeast and mammalian cells. E. coli cells are preferred. Introduction of at least one expression vector
incorporating the fusion protein nucleotide sequence and the protease nucleotide sequence can be performed in a variety of ways, such as calcium chloride or lithium chloride treatment, calcium-polyethylene glycol-treated DNA with spheroplasts, use of liposomes, mating,
electroporation.
When one plasmid is employed for expression of both the fusion protein and the protease, the plasmid may be constructed to have two separate sets of transcription control elements, one for the fusion protein nucleic acid sequence and one for the protease nucleotide sequence; or both the fusion protein and the protease may be under the control of the same transcription control elements. When two plasmids are used, the transcription control elements
may be selected to be identical in both plasmids, or the may be selected to be different in one or more
transcription control elements.
When two expression vectors are used, both are inserted into the same cell. Care should be taken that the expression vectors are chosen to be compatible and stable in the same cell. For example, when E. coli cell are chosen for production of the protein product, and two plasmids are used (one for the fusion protein, one for the protease), a suitable compatible pair of plasmids is one plasmid carrying the colEl origin of replication and the ampicillin-resistance marker and the other plasmid
carrying the incFII origin of replication and the
chloramphenicol-resistance marker. Other suitable
pairings of plasmids for use in E. coli can be determined by reference to Novick et al, Bacteriological Reviews, 40:168-189 (1976). Examples of other types of host cells for which two expression vectors can be used are
Streptomyces and Bacillus species, and the yeast
Saccharomyces cerevisiae (micron-based and CEN-based expression vectors), Gorman et al, Gene, 48:13-22 (1986). Stable expression of both the fusion protein and the protease using two expression vectors should also be possible with some mammalian cells.
Cells, expression vectors and transcription control elements are selected to be complementary to one another to form an expression system. Suitable expressio systems in E. coli utilize, for example, influenza NSl, Young et al, Proc. Nat'l Acad. Sci., USA, 80:6105 (1983), phage Lambda O protein, Rosenberg et al, Meth. Enzymol., 101:123 (1983), the E. coli galactokinase protein,
Rosenberg et al, Meth. Enzymol., 101: 123 (1983) and the E. coli beta-galactosidase protein, Bartus et al, Antim. Ag. Ch., 25:622 (1984). Expression in Streptomyces
includes use of E. coli galK, Brawner et al, Gene, 40:191
(1985), the genes from the Streptomyces gal operon,
Fornwald et al, Proc. Nat'l Acad. Sci., USA, 84: 2130 (1987), and the Streptomyces beta-galactosidase, Burnett et al, in Microbiology 1985, ed. Levin, Amer. Soc.
Microbiol., Washington, D.C. (1985). In yeast expression, can be achieved with E. coli galK, Rosenberg et al, in
Genetic Engineering, vol. 8, ed. Setlow, Plenum Press (1986), 3-phosphoglycerate kinase (PKG), Chen et al,
Nucleic Acid Res., 12:8951 (1984) and human superoxide dismutase, Cousens et al, U.S. Patent 4,751,180, issued June 14, 1988. In mammalian cells, Schumperli et al,
Proc. Nat'l Acad. Sci., USA, 79:257 (1982) has expressed E. coli galK; such system may also be useful as an
expression system for the invention.
As an alternative to the use of at least one expression vector inserted into the host cell for
expression of both the fusion protein and the protease in one cell, at least one expression cassette comprising the first and second recombinant DNA molecules can be
integrated into the genome of the host cell. This may be preferable if a eukaryotic expression system, such as a yeast or mammalian expression system, is employed. The expression cassette can be incorporated into the genome of the host cell by site-directed, e.g., by homologous recombination, or random integration. Introduction of the expression cassette into the host cell can be accomplished in. a variety of ways, such as co-precipitation, calcium chloride or lithium chloride treatment, calcium
polyethylene glycol treated with DNA or electroporation. The nucleotide sequence coding for the fusion protein and the nucleotide sequence coding for the protease may be under the control of the same or different transcription control elements. Additionally, the nucleotide sequence coding for the fusion protein and the nucleotide sequence coding for the protease may be incorporated into the same
cell in different manners, i.e., the first recombinant DNA molecule coding for the fusion protein may be incorporated into the genome of the host cell, and the second
recombinant DNA molecule comprising the nucleotide
sequence coding for the protease may be incorporated into an expression vector, or vice versa.
Host cells containing the first and second recombinant DNA molecules are then grown in an appropriate medium for the host. Where an inducible promoter is employed, the host cell may be grown to high density and the promoter turned on for expression of the fusion protein and protease. Where the promoter is not
inducible, then constitutive production of the fusion protein and protease will occur. Constitutive production of the protease is preferable only in expression systems where the protease is not substantially toxic to the host cell. The cells may be grown until there is no further increase in product formation or the ratio of nutrients consumed to product formation falls below a predetermine level, at which time, the cells may be harvested, lysed and the protein product obtained and purified in
accordance with conventional techniques. These techniques include chromatography, electrophoresis, extraction, density gradient centrifugation, or the like.
The bacterium E. coli is the organism of choice for the production of the protein products. Cloning and expression can be obtained rapidly in E. coli; and to date, in some cases, much higher levels of gene expression have been achieved in E. coli as compared to other
organisms, e.g., mammalian systems (Chinese hamster ovary cells) or yeast. Production in E. coli is readily
amenable to cost-effective, large-scale fermentation and protein purification. Preferred E. coli strains are defective lysogen strains. Some such strains can be used in combination with the inducible Lambda phage PL
promoter for heat induction of the promoter, Devare et al,
Cell, 36:43-49 (1984), or chemical induction using
nalidixic, Mott et al, Proc. Nat'l Acad. Sci., 82:88-92
(1985). A more detailed description of an E. coli system for expression of the fusion protein and the protease is found in the Examples herein.
Nucleotide sequences coding for HIV-1 protease or a proteolytically active fragment or derivative thereof may be obtained by restriction endonuclease digestion of a clone of HIV-1, using the appropriate restriction
endonuclease, or from persons infected with HIV-1, as described herein supra. The examples herein disclose the methods of obtaining nucleotide sequences having
proteolytic activity towards HIV-1 cleavage sites.
Proteolytically active fragments, derivatives, additions and deletions of nucleotide sequences coding for HIV-1 protease are also within the scope of the invention.
Suitable nucleotide sequences coding for peptides having proteolytic activity towards HIV-1 cleavage sites include, or the complementary sequences thereof:
CCTCAGATCACTCTTTGGCAACGACCCCTCGTCACAATAAAGATAGGGGGGCAA
----+----------------------------------+----------------------+--------------------+-------------------+
GGAGTCTAGTGAGAAACCGTTGCTGGGGAGCAGTGTTATTTCTATCCCCCCGTT
CTAAAGGAAGCTCTATTAGATACAGGAGCAGATGATACAGTATTAGAAGAAATGAGTTTG
------------------+------------------+----------------------+-----------------+----------------------------------+
GATTTCCTTCGAGATAATCTATGTCCTCGTCTACTATGTCATAATCTTCTTTACTCAAAC CCAGGAAGATGAAAACCAAAAATGATAGGGGGAATTGGAGGTTTTATCAAAGTAAGACAG
------------------+------------------+----------------------+-----------------+----------------------------------+
GGTCCTTCTACCTTTGGTTTTTACTATCCCCCTTAACCTCCAAA ATAGTTTCATTCTGTC
TATGATCAGATACTCATAGAAATCTGTGGACATAAAGCTATAGGTACAGTATTAGTAGGA ----------------------------+------------------+----------------------+-----------------+--------------------------+ ATACTAGTCTATGAGTATCTTTAGACACCT GTATTTCGATATCCA GTCATAATCATCCT
CCTACACCTGTCMCATAATTGGAAGAAATCTGTTGACTCAGATTGGTTGCACTTTAAAT
----------------------------+------------------+----------------------+-----------------+--------------------------+
GGATGTGGACAGTTGTATTAACCTTCTTTAGACAACTGAGTCTAACCAACGTGAAATTTA TTT AAA;
NlalV
GCCGATAGACAAGGAACTGTATCCTTTAACTTCCCTCAGATCACTCTTTGGCAACGACCC
---------------------------+------------------+----------------------+-----------------+-------------------------+
CGGCTATCTGTTCCTTGACATAGGAAATTGAAGGGAGTCTAGTGAGAAACCGTTGCTGGG CTCGTCACAATAAAGATAGGGGGGCAACTAAAGGAAGCTCTATTAGATACAGGAGCAGAT
------------------------------+------------------+----------------------+-----------------+----------------------+
GAGCAGTGTTATTTCTATCCCCCCGTTGATTTCCTTCGAGATAATCTATGTCCTCGTCTA
GATACAGTATTAGAAGAAATGAGTTTGCCAGGAAGATGGAAACCAAAAATGATAGGGGGA
-----------------------------+------------------+----------------------+-----------------+------------------------+
CTATGTCATAATCTTCTTTACTCAAACGGTCCTTCTACCTTTGGTTTTTACTATCCCCCT ATTGGAGGTTTTATCAAAGTAAGACAGTATGATCAGATACTCATAGAAATCTGTGGACAT
---------------------------+------------------+----------------------+-----------------+-----------------------+
TAACCTCCAAAATAGTTTCATTCTGTCATACTAGTCTATGAGTATCTTTAGACACCTGTA
AAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGTCAACATAATTGGAAGAAATCTG
----------------------+ -------------+------------------+----------------------+-----------------+------------------+
TTTCGATATCCATGTCATAATCATCCTGGATGTGGACAGTTGTATTAACCTTCTTTAAAC
TTGACTCAGATTGGTTGCACTTTAAATTTTCCCATTAGCCCTATTGAGACTGTACCAGTA ------------------+----------------------------------+------------------+----------------------+-----------------+
AACTGAGTCTAACCAACGTGAAATTTAAAAGGGTAATCGGGATAACTCTGACATGGTCAT
Hae III
AAATTAAAGCCAGGAATGGATGGCCCAAAAGTTAAACAATGG
------------------+----------------------------------+------------------+----
TTTAATTTCGGTCCTTACCTACCGGGTTTTCAATTTGTTACC;
Hae III
GCCGATAGACAAGGAACTGTATCCTTTAACTTCCCTCAGATCACTCTTTGGCAACGACCC
------------------+----------------------------------+------------------+----------------------+-----------------+
CGGCTATCTGTTCCTTGACATAGGAAATTGAAGGGAGTCTAGTGAGAAACCGTTGCTGGG
CTCGTCACAATAAAGATAGGGGGGCAACTAAAGGAAGCTCTATTAGATACAGGAGCAGAT
------------------+----------------------------------+------------------+----------------------+-----------------+
GAGCAGTGTTATTTCTATCCCCCCGTTGATTTCCTTCGAGATAATCTATGTCCTCGTCTA
GATACAGTATTAGAAGAAATGAGTTTGCCAGGAAGATGGAAACCAAAAATGATAGGGGGA
------------------+----------------------------------+------------------+----------------------+-----------------+
CTATGTCATAATCTTCTTTACTCAAACGGTCCTTCTACCTTTGGTTTTTACTATCCCCCT
ATTGGAGGTTTTATCAAAGTAAGACAGTATGATCAGATACTCATAGAAATCTGTGGACAT
------------------------+----------------------------------+------------------+----------------------+---------+
TAACCTCCAAAATAGTTTCATTCTGTCATACTAGTCTATGAGTATCTTTAGACACCTGTA
AAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGTCAACATAATTGGAAGAAATCTG
------------------+----------------------------------+------------------+----------------------+-----------------+
TTTCGATATCCATGTCATAATCATCCTGGATGTGGACAGTTGTATTAACCTTCTTTAGAC
TTGACTCAGATTGGTTGCACTTTAAATTTTCCCATTAGCCCTATTGAGACTGTACCAGTA ------------------+----------------------------------+------------------+----------------------+-----------------+
AACTGAGTCTAACCAACGTGAAATTTAAAAGGGTAATCGGGATAACTCTGACATGGTCAT
Hae III
AAATTAAAGCCAGGAATGGATGGCCCAAAAGTTAAACAATGG
------------------+----------------------------------+-------------------------+
TTTAATTTCGGTCCTTACCTACCGGGTTTTCAATTTGTTACC;
Nde I
CATATGCCTCAGATCACTCTTTGGCAACGACCCCTCGTCACAATAAAGATAGGGGGGCAA
------------------+----------------------------------+------------------+----------------------+-----------------+
GTATACGGAGTCTAGTGAGAAACCGTTGCTGGGGAGCAGTGTTATTTCTATCCCCCCGTT
CTAAAGGAAGCTCTATTAGATACAGGAGCAGATGATACAGTATTAGAAGAAATGAGTTTG
------------------+----------------------------------+------------------+----------------------+-----------------+
GATTTCCTTCGAGATAATCTATGTCCTCGTCTACTATGTCATAATCTTCTTTACTCAAAC
CCAGGAAGATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTTATCAAAGTAAGACAG
------------------+----------------------------------+------------------+----------------------+-----------------+
GGTCCTTCTACCTTTGGTTTTTACTATCCCCCTTAACCTCCAAAATAGTTTCATTCTGTC
TATGATCAGATACTCATAGAAATCTGTGGACATAAAGCTATAGGTACAGTATTAGTAGGA
------------------+----------------------------------+------------------+----------------------+-----------------+
ATACTAGTCTATGAGTATCTTTAGACACCTGTATTTCGATATCCATGTCATAATCATCCT
CCTACACCTGTCAACATAATTGGAAGAAATCTGTTGACTCAGATTGGTTGCACTTTAAAT
------------------+----------------------------------+------------------+----------------------+-----------------+
GGATGTGGACAGTTGTATTAACCTTCTTTAGACAACTGAGTCTAACCAACGTGAAATTTA
TTTTAGATTAGCCCTATTGAGACTGTACCAGTAAAATTAAAGCCAGGAATGGATGGCCCA
------------------+----------------------------------+------------------+----------------------+-----------------+
AAAATCTAATCGGGATAACTCTGACATGGTCATTTTAATTTCGGTCCTTACCTACCGGGT
Hae III
AAAGTTAAACAATGG
------------------+-------
TTTCAATTTGTTACC.
The recombinant E. coli protease product is produced by recombinant DNA techniques from nucleotide sequences coding for nucleotide sequences coding for peptides having proteolytic activity toward HIV-1 cleava sites. The recombinant E. coli protease has proteolytic activity towards HIV-1 cleavage sites and is useful for cleaving fusion proteins as disclosed herein. Suitable methods and nucleotide sequences for preparing the
recombinant E. coli protease product include those
disclosed herein for preparing the protease in E. coli. The protease product may be produced in cells with or without production of a fusion protein.
Suitable peptides include those selected from the group of peptides consisting of-
ProGlnlleThrLeuTrpGlnArgProLeuValThrlleLysIleGlyGlyGln
LeuLysGluAlaLeuLeuAspThrGlyAlaAspAspThrValLeuGluGluMetSerLeu
ProGlyArgTrpLysProLysMetlleGlyGlylleGlyGlyPhelleLysValArgGln
TyrAspGlnlleLeuIleGluIleCysGlyHisLysAlaIleGlyThrValLeuValGly
ProThrProValAsnllelleGlyArgAsnLeuLeuThrGlnlleGlyCysThrLeuAsn
Phe,
AlaAspArgGlnGlyThrValSerPheAsnPheProGlnlleThrLeuTrpGlnArgPro
LeuValThrlleLysIleGlyGlyGlnLeuLysGluAlaLeuLeuAspThrGlyAlaAsp
AsPThrValLeuGluGluMetSerLeuProGlyArgTrpLysProLysMetIleGlyGly
IleGlyGlyPheUeLysValArgGlnTyrAspGlnlleLeuIleGlylleCysGlyHis
LysAlalleGlyThrValLeuValGlyProThrProValAsnllelleGlyArgAsnLeu
LeuThrGlnlleGlyCysThrLeuAsniPheProIleSerProIleGluTiirValProVal
LysLeuLysProGlyMetAspGlyProLysValLysGlnTrp,
AlaAspArgGlnGlyThrValSerPheAsnPheProGlnlleThrLeuTrpGlnArgPro
LeuValThrlleLysIleGlyGlyGlnLeuLysGluAlaLeuLeuAspThrGlyAlaAsp
AspThrValLeuGluGluMetSerLeuProGlyArgTrpLysProLysMetlleGlyGly
ileGlyGlyPhelleLysValArgGlnTyrAspGlnlleLeuIleGlylleCysGlyHis
LysAlalleGlyThrValLeuValGlyProThrProValAsnllelleGlyArgAsnLeu
LeuThrGlnlleGlyCysThrLeuAsnPhePcoIleSerProIleGluThrValProVal
LysLeuLysProGlyMetAspGlyProLysValLysGlnTrp, and MetProGlnlleThrLeuTrpGlnArgProLeuValThrlleLyslleGlyGlyGln
LeuLysGluAlaLeuLeuAspThrGlyAlaAspAspThrValLeuGluGluMetSerLeu
ProGlyArgTrpLysProLysMetlleGlyGlylleGlyGlyPhelleLysValArgGln
TyrAspGlnlleLeuIleGluIleCysGlyHisLysAlalleGlyThrValLeuValGly
ProThrProValAsnllelleGlyArgAsnLeuLeuThrGlnlleGlyCysThrLeuAsn
PheEnd.
Proteolytically active fragments, mutations, additions or deletions of the recombinant E. coli protease product are also within the scope of the invention.
Example 1.
The E. coli strain used for expression was AR120, as described in Mott et al, Proc. Nat'l Acad. Sci., USA, 82: 88-92 (1985). All DNA manipulations were carried out as described in Maniatis et al. Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1982. E. coli strain AR120 carrying the expression plasmids were induced, and total cell extracts were prepared and analyze by NaDodSO4/PAGE as described in Aldovini, Debouck et al, Proc. Nat'l Acad. Sci., USA, 83:6672-6676 (1886).
HIV Protease and Activity
I. Construction of vectors for expression of HIV-1
protease in E. coli
Vectors for expression of HIV-1 protease were constructe by inserting the appropriate restriction fragment into the expression vector pOTSKF33 All HIV-1 restriction fragments were isolated from the BH10 clone of the HTLVIIIB isolate of HIV (Shaw et al, Science, 226:1165-1171 (1984).
A. Construction of the galK translation fusion vector pOTSKF33 : pOTSKF33 derives from the pASl
expression vector of Rosenberg as disclosed in U.S. Patent 4,578,355, issued March 25, 1986, and comprises a portion of the E. coli galK gene (56 codons) flanked by a
polylinker with restriction sites for fusion in any of the three translation frames (phase 1 - Xmal filled-in;
phase 2 - Smal; phase 3 - Stul), stop codons for each phase and some additional cloning sites. pOTSKF33 was constructed in several steps:
Step 1 : Insertion of the polylinker region of pUC19 into the pOTS34 vector: pOTS34, a derivative of pASl (Devare et al, Cell, 36:43 (1984), was digested with the Bglll restriction endonuclease, treated with DNA polymerase I (Klenow) to create blunt ends and redigested with the Sphl restriction endonuclease. The large 5251 base pair (bp) Bglll-Sphl fragment was purified and ligated to the 135 bp Bglll-Sp restriction fragment isolated and purified from pUC19 (Yanisch-Perron et al, Gene, 33:103 (1985). The resulting plasmid was called pOTS-UC.
Step 2: Insertion of the partial galK expression unit from pASK into pOTS-UC: pOTS-UC was digested with Xmal, treated with Klenow to create blunt ends and redigested with Hindlll. The large fragment was purified and ligated to a 2007 bp Bell filled-Hindlll fragment isolated and purified from pASK. The resulting plasmid was called pOTSKF45.
The pASK plasmid vector is a pASl derivative in which the entire galK coding sequence is inserted after the ATG initiation codon. This galK sequence differs from the authentic sequence (Debouck et al, Nucleic Acids Res., 13:1841 (1985)) at the 5' end:
-pASK sequence: ATG.GAT.CCG.GAA.TTC.CAA.GAA.AAA
-authentic seq: ATG.AGT.CTG.AAA.GAA.AAA...
The pASK sequence contains a BamHI site (GGATCC) at the ATG whereas the authentic sequence does not.
Step 3: Introduction of a linker with
translation stops into pOTSKF45: pOTSKF45 was digested with Xmal and Sail and then treated with Klenow to create blunt ends. The linker
5'-GTA.GGC.CTA.GTT.AAC.TAG-3' was then introduced between the Xmal filled and Sail filled sites by ligation with T4 DNA ligase to yield the vector: pOTSKF33.
B. Insertion of Sequence Coding for HIV-1 Protease into pOTSKF33
1. Construction of PRO3
A 382-bp NlalV-Hae III fragment was isolated from BH10 using standard cloning techniques and
inserted at the Stul restriction site of pOTSKF33 in the correct orientation. The NlalV-Haelll restriction fragment has the following nucleotide sequence and corresponding translation for the reading frame of interest:
NlaIv
GCCGATAGACAA∞AACTGTATCCTTTAACTTCCCTCAGATCACTCTTTGGCAACGACCC
2253 ------------------+----------------------------------+------------------+-------------------------------+ 2312
CGGCTATCTGTTCCTTGACATAGGAAATTGAAGGGAGTCTAGTGAGAAACCGTTGCTGGG
AlaAspArgGInGIyThrValSerPheAsπPheProGInileThrLeuTrpGlnArgPro -
CTCGTCACAATAAAGATAGGGGGGCAACTAAAGGAAGCTCTATTAGATACAGGAGCAGAT
2313 ---------------- -----------------+----------------------------------+------------------+----------------------+ 2432
GAGCAGTGTTATTTCTATCCCCCCGTTGATTTCCTTCGAGATAATCTATGTCCTCGTCTA
LeuValThrIleLysIleGlyGlyGlnLeuLysGluAlaLeuLeuAspThrGlyAlaAsp -
GATACAGTATTAGAAGAAATGAGTTTGCCAGGAAGATGGAAACCAAAAATGATAGGGGGA
2373 -------------- --------------+----------------------------------+------------------+--------------------------------+2432
CTATGTCATAATCTTCTTTACTCAAACGGTCCTTCTACCTTTGGTTTTTACTATCCCCCT
AspThrValLeuGluGluMetSerLeuProGlyArgTrpLysProLysMetlileGIyGly -
ATTGGAGGTTTTATCAAAGTAAGACAGTATGATCAGATACTCATAGAAATCTGTGGACAT
2433 ---------------- -----------------+----------------------------------+------------------+------------------------+ 2492
TAACCTCCAAAATAGTTTCATTCTGTCATACTAGTCTATGAGTATCTTTAGACACCTGTA
IleGlyGlyPhelleLysValArgGlnTyrAspGlnlleLeuIleGlylleCysGlyHis -
AAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGTCAACATAATTGGAAGAAATCTG
2493 ---------------- -----------------+----------------------------------+------------------+------------------------+ 2552
TTTCGATATCCATGTCATAATCATCCTGGATGTGGACAGTTGTATTAACCπCTTTAGAC
LysAIalleGIyThrVal LeuValG1yProThrProValAsnllell eGlyArgAsnLeu -
TTGACTCAGATTGGTTGCACTTTAAATTTTCCCATTAGCCCTATTGAGACTGTACCAGTA
2553 ---------------- -----------------+----------------------------------+------------------+------------------------+ 2612
AACTGAGTCTAACCAACGTGAAATTTAAAAGGGTAATCGGGATAACTCTGACATGGTCAT
LeuThrGInil eGI yCysThrLeuAsnPheProII eSerProII eGI uThrVal ProVal -
Hae III
AAATTAAAGCCAGGAATGGATGGCCCAAAAGTTAAACAATGG
2613 --------------------------------- -----------------+---------------------------+ 2654
TTTAATTTCGGTCCTTACCTACCGGGTTTTCAATTTGTTACC
LysLeuLysProGlyMetAspGlyProLysValLysGlnTrp -
(Note: The numbering corresponds to the numbering of Ratner et al, Nature, 313:277 (1985) plus 453 base pairs.)
2. Construction of PRO4
A 516-bp Hae III fragment was isolated from BH 10, using standard cloning techniques and inserted at the XmaI filled-in restriction site of pOTSKF33 in the correct orientation. The Hae III restriction fragment has the following nucleotide sequence and corresponding
translation for the reading frame of interest:
CCAGGGGAATTTTCTTCAGAGCAGACCAGAGCCAACAGCCCCACCATTTCTTCAGAGCAGA
21 13 ------------------+----------------------------------+------------------+-------------------------------+
GGTCCCTTAAAAGAAGTCTCGTCTGGTCTCGGTTGTCGGGGTGGTAAAGAAGTCTCGTCT
ArgGIuPheSerSerGIuGlnThrArgAlaAsnSerProThrlleSerSerGluGInThr -
CCAGAGCCAACAGCCCCACCAGAAGAGAGCTTCAGGTCTGGGGTAGAGACAACAACTCCC
2179 ------------------+----------------------------------+------------------+-------------------------------+ 2238
GGTCTCGGTTGTCGGGGTGGTCTTCTCTCGAAGTCCAGACCCCATCTCTGTTGTTGAGGG
ArgAlaAsnSerProThrArgArgGluLeuGlnValTrpGlyArgAspAsnAsnSerPro -
CCTCAGAAGCAGGAGCCGATAGACAAGGAACTGTATCCTTTAACTTCCCTCAGATCACTC
2239 ------------------+----------------------------------+------------------+-------------------------------+ 2298
GGAGTCTTCGTCCTCGGCTATCTGTTCCTTGACATAGGAAATTGAAGGGAGTCTAGTGAG
SerGluAlaGlyAlaAspArgGlnGlyThrValSerPheAsnPheProGlnileThrLeu -
TTTGGCAACGACCCCTCGTCACAATAAAGATAGGGGGGCAACTAAAGGAAGCTCTATTAG
2299 ------------------+----------------------------------+------------------+-------------------------------+ 2358
AAACCGTTGCTGGGGAGCAGTGTTATTTCTATCCCCCCGTTGATTTCCTTCGAGATAATC
TrpGlnArgProLeuValThrlleLysIIeGIyGlyGlnLeuLysGluAl aLeuLeuAsp -
ATACAGGAGCAGATGATACAGTATTAGAAGAAATGAGTTTGCCAGGAAGATGGAAACCAA
2359 ------------------+----------------------------------+------------------+-------------------------------+ 2418
TATGTCCTCGTCTACTATGTCATAATCTTCTTTACTCAAACGGTCCTTCTACCTTTGGTT
ThrGl yAl aAspAspThrVal LeuGl uGl uMetSerLeuProGl yArgTrpLysProLys -
AAATGATAGGGGGAATTGGAGGTTTTATCAAAGTAAGACAGTATGATCAGATACTCATAG
2419 ------------------+----------------------------------+------------------+-------------------------------+ 2478
TTTACTATCCCCCTTAACCTCCAAAATAGTTTCATTCTGTCATACTAGTCTATGAGTATC
MetlleGlyGlylleGIyGIyPhelleLysValArgGInTyrAspGlnileLeuIIeGIu -
AAATCTGTGGACATAAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGTCAACATAA
2479 ----------- ------------+----------------------------------+------------------+-------------------------------+ 2538
TTTAGACACCTGTATTTCGATATCCATGTCATAATCATCCTGGATGTGGACAGTTGTATT
IleCysGlyHisLysAlalleGlyThrValLeuValGlyProThrProValAsnllelle -
TTGGAAGAAATCTGTTGACTCAGATTGGTTGCACTTTAAATTTTCCCATTAGCCCTATTG
2539 --------------------------+----------------------------------+------------------+-----------------------------+ 2598
AACCTTCTTTAGACAACTGAGTCTAACCAACGTGAAATTTAAAAGGGTAATCGGGATAAC
GlyArgAsnLeuLeuThrGl nIl eGlyCysThrLeuAsnPheProII eSerProII eGI u
Hae III AGACTGTACCAGTAAAATTAAAGCCAGGAATGGATGGCCCAAAAGTTAAACAATGG
2599 --------------------------+----------------------------------+------------------+-------------+---------- 2654
TCTGACATGGTCATTTTAATTTCGGTCCTTACCTACCGGGTTTTCAATTTGTTACC
ThrValProValLysLeuLysProGlyMetAspGlyProLysValLysGInTrp -
(Note: The numbering corresponds to the numbering of Ratner et al,
Nature. 313:277 (1985) plus 453 base pairs.)
II. Expression of the HIV Protease in E. coli
To express the HIV protease gene product in
E. coli, overlapping fragments, encompassing the protease coding region, were inserted into the pOTSKF33 bacterial expression vector, as described above in I. The fragments contained the two domains (I and II) that are highly conserved among all known retroviral protease, as well as the region between these domains. The fragments differed only in the amount of viral sequence information they carried upstream and downstream from the conserved
domains. The PRO3 and PRO4 fragments contained the proteolytic cleavage site, (indicated by the asterisk) Thr-Leu-Asn-Phe*Pro, positioned downstream of domain II, from which the amino terminus of reverse transcriptase derives (Figure 1B) and another protease cleavage site, Ser-Phe-Asn-Phe*Pro, positioned approximately 20 codons upstream of conserved domain I, from which the amino terminus of HIV-1 protease derives. Proteolytic cleavage at these two sites yields a 10-kDa polypeptide which correspond to the mature protease.
The protein products expressed by the PRO4 constructs were examined initially by NaDodSO./PAGE by the method described in Aldovini, Debouck et al, supra.
The PRO3 and PRO44 constructs did not produce proteins with the size expected for their encoded products (20 kDa and 25 kDa, respectively). Instead, both
constructs gave rise to a protein of approximately 10 kDa that was visualized by immunoblot analysis using a
polyclonal antibody specific for the protease region, produced according to the method of Aldovini, Debouck et al, supra. The 20-kDa and 25-kDa proteins expected from the PR03 and PR04 constructs could be observed for a brief period immediately after induction, but rapidly
disappeared to give rise to the 10-kDa protein. This
apparent conversion from precursor to mature 10-kDa form occurred in the presence of chloramphenicol, a strong translation inhibitor, indicating that the 10-kDa protein results from the post-translational processing of the larger precursors.
III. Characterization of HIV-1 Protease Expressed in
E. coli
A. Auto-processing of the HIV Protease
The generation of a discrete 10-kDa protein by both PRO3 and PRO4 constructs is consistent with specific processing occurring at the two protease cleavage sites positioned on either side of the conserved domains. If the HIV protease product itself is responsible for this processing, then it suggests that this protease has an autocatalytic capability. To discriminate between
bacterial proteolysis and HIV protease auto-processing, smaal alteration was positioned.within the protease coding region far from the cleavage sites to inactivate the protease without affecting the susceptibility of the precursor protein to bacterial proteolysis.
Plasmid PRO4-BX was derived from the PRO4 expression vector by digestion with Bel I, treatment with DNA polymerase (Klenow) and ligation with the 8-mer oligonucleotide 5' CCTCGAGG 3'. This treatment resulted in the insertion of four codons (encoding Pro-Ser-Arg-Asp) in the protease region between the conserved domains.
Induction of this mutant derivative gave rise to a 25-kDa protein that is the size expected from the entire PR04 coding region. No 10-kDa protein was ooserved in the extract. This result suggested that the small insertion destroyed the protease activity and that the processing observed truly, results from the expression and
autocatalytic processing of the HIV protease itself.
B. Amino Acid Sequencing of HIV-1 Protease
To demonstrate directly that the 10-kDa product from induced PRO3 and PRO44 extracts indeed corresponds to the protease protein, this product was isolated and subjected to standard amino acid sequence analysis. The results indicated that the amino terminal sequence
precisely matched the sequence predicted from the proposed cleavage site located upstream of conserved domain I, Pro-Gln-Ile-Thr-Leu.
IV. Catalytic Activity of HIV-1 Protease Expressed in
E. coli
A. Construction of Vectors Expressing Fusion
Protein Having HIV Protease Cleavage Site
For the expression of HIV-1 precursor protein gag in E. coli, the pOTSKF33 bacterial expression vector, as described above in I, was used. A translational fusion was created between the HIV-1 gag reading frame and the first 56 amino acids of galactokinase sequence on the expression vector. A 1286-base pair (bp) Cla I-Bg II filled-in restriction endonuclease fragment, including most of the pl7, all of the p24, and half of the p15 gag coding sequence, was inserted at the Stul site of the pOTSKF33 fusion vector for expression of p55 gag. This construct is known as pOTSKF33-p55. The Clal-Bglll restriction fragment was isolated from the BH10 clone of the HTLVIIIB isolate of HIV [Shaw et al, Science,
226:1165-1171 (1984)].
B. Construction of pDPT287
Plasmid pDPT287 is a derivative of the chloramphenicol-resistant pDPT270 plasmid of the incFII incompatibility group, which can co-exist with the
ColEI-like pOTSKF33 derivatives in bacteria [Taylor et al, J. Bacteriol., 137:92-104 (1979)]. pDPT287 differs from pDPT270 by two deletions, one removing a small HindiII fragment between the streptomycin-resistance gene and the incFII origin of replication and the other removing a Hindi fragment between the chloramphenicol and
streptomycin-resistance gene.
Plasmid pDPT287-p55 was constructed by excising the entire PL-p55 transcriptional cassette from
pOTSKF33-p55, described in IV above, on a Bgllll-Sall filled-in restriction endonuclease fragment and inserting it between the unique BamHI and HindiII filled-in sites of pDPT287. This deletes the streptomycin-resistance gene of the vector. The orientation of the fragment with respect to the plasmid markers did not influence the expression of the galK-p55 product.
Induction of E. coli cells carrying the gag-expressing vector pDPT287-p55 alone gave rise to a 50-kDa protein, which is the size expected for the p55 gag protein encoded by this construct. Using the more
sensitive immunoblot analysis with an antiserum specific for the gag region, the same 50-kDa major protein was observed, as well as some smaller, minor polypeptides that result presumably from bacterial proteolysis or internal translation initiation.
C. Activity of HIV-1 Protease
To examine the proteolytic effect of the protein products from each of the PR03 and PRO4 constructs, on the
50-kDa gag precursor, bacteria were co-transformed with the p55-expressing vector and each of the PRO constructs separately. Both expression plasmids were induced simultaneously, and the extracts were analyzed by
immunoblot with the gag-specific antiserum. Polyclonal antibodies were raised against the p55 gag and PROl products according to the method described in Aldovini, Debouck et al, supra. Co-induction of the pDPT287-p55 and either the PRO3 or PRO4 construct resulted in the
disappearance of the large 50-kDa gag precursor and the concomitant appearance of two new proteins, one of about 24 kDa and the other of about 17 kDa. The 24-kDa product corresponds to the mature p24 gag protein, whereas the 17-kDa protein contains most of the pl7 gag region fused to the first 56 amino acids of galK. Clearly, induction of both vectors within the same cell resulted in the processing of the p55 gag precursor into products with sizes consistent with cleavage at the known viral
processing sites. Proper processing of purified 50-kDa gag precursor protein was also observed in vitro upon incubation with an induced PRO4 bacterial extract that only contained the mature 10-kDa protease product.
To confirm that this effect was specific for the products expressed by the PR03 and PRO4 constructs, the co-induction experiment was repeated with two inactive PR04 derivatives. PRO4-BX (described in IIIA) contained the four-codon insertion that eliminated protease
auto-processing, and the other, PRO4-BS (described
immediately below) contained a translation stop codon in the middle of the protease coding region.
Plasmid PRO4-BS was derived from the PRO4 expression vector by digestion with Bel I, treatment with DNA polymerase (Klenow), and ligation with the 12-mer oligonucleotide 5" CATGTTAACTAG 3", which introduces stop codons in all three reading frames. This treatment
resulted in the interruption of the protease coding sequence between the conserved domains.
No p55 gag processing was observed with either of these vector constructs. These results clearly
demonstrate that the products of the PRO3 and PRO4
constructs are proteolytically active in both their precursor and mature forms and can carry out the
maturation of the gag polyprotein precursor in addition to auto-processing.
D. Protease from PRO4 Cleaves Protein at Sites Used in Viral Infection
To examine whether the PRO4-mediated p55 gag cleavage reaction observed in E. coli occurs precisely at the sites used in viral infection, the processed 24-kDa gag protein was purified from induced cells, and its amino-terminal sequence was determined.
E. coli cells co-expressing the PRO4 and pDPT287-p55 constructs were disrupted by sonication in
50 mM Tris-HCl, pH 7.5/1 mM EDTA/1 mM dithiothreitol/1 mM phenylmethylsulfonyl fluoride. The soluble fraction was applied to a Mono Q high-performance anion-exchange column (Pharmacia) equilibrated in 10 mM Tris-HCl, pH 7.5. The putative p24 protein was found in the nonadsorbed fraction and subjected to reverse-phase HPLC [Brownlee RP-300 octyl column, 4.6 x 250 mm, equilibrated with 30% (vol/vol) acetonitrile in 0.05% trifluoroacetic acid over 45
minutes. The p24 protein eluted in the range of 48-54% (vol/vol) acetonitrile and was approximately 90% pure as judged by Coomassie blue-stained NaDodSO4/polyacrylamide gel.
A sample of purified protein was subjected to 10 cycles of automated Edman degradation in a Beckman 890M protein sequencer. Released phenylthiohydantoin-amino-
acid derivatives were analyzed by reverse-phase HPLC on a Beckman Ultrasphere ODS column (2 x 250 mm). The first 10 amino acid residues precisely matched the amino-terminal sequence of the mature p24 gag protein isolated from viral particles, as reported in Casey et al, J. Virol.,
55:417-423 (1985). This demonstrates the authenticity of the p55 gag proteolytic cleavage carried out in bacteria by the 10-kDa recombinant HIV protease. The
bacterially-produced protease seems to be extremely active as relatively small amounts of the enzyme efficiently cleave high levels of the gag precursor.
Example 2.
Expression of the HIV-1 Protease Requiring No Auto- processing
A plasmid expressing HIV-1 protease not requiring auto-processing for proteolytic activity was constructed as follows:
The PRO5 construct was derived from the PRO4 construct, described in Example 1, by introduction of a translation initiation codon immediately before the first codon of the mature protease (Proline 1) and a translation termination codon immediately after the last codon of the mature protease (Phenylalanine 99). This was done by site-specific mutagenesis using oligonucleotides with the following sequences:
* ** *
5' CCTTTCCATATGCCTCAGATC 3' for the initiation codon
***
5' GCACTTTAAATTTTTAGATTAGCCC 3' for the termination codon
* indicates the nucleotides that differ from the
wild-type sequence
The mutation that created the initiation ATG also introduced a convenient Ndel restriction endonuclease site that was used to connect the ATG initiation codon directly to the ribosome binding site of pMG, a pAS derivative with a unique Ndel site at the ATG [M. Gross et al, Mol. Cell Biol., 5:1015-1024 (1985)].
Induction of PRO5, using the method described in Example 1, results in the direct synthesis of the
full-length mature protease without a protease precursor intermediate.
The PRO5 insert, a 375 bp Ndel-Haelll fragment, has the following nucleotide sequence and corresponding translation for the reading frame of interest:
CATATGCCTCAGATCACTCTTTGGCAACGACCCCTCGRCACAATAAAGATAGGGGGGCAA 1 ----------- ------------+----------------------------------+------------------+-------------------------------+60
GTATACGGAGTCTAGTGAGAAACCGTTGCTGGGGAGCAGTGTTATTTCTATCCCCCCGTT
MetProGlnIleThrLeuTrpGlnArgProLeuValThrlleLysIIeGIyGIyGIn -
CTAAAGGAAGCTCTATTAGATACAGGAGCAGATGATACAGTATTAGAAGAAATGAGTTTG
61 ----------- ------------+----------------------------------+------------------+-------------------------------+120
GATTTCCTTCGAGATAATCTATGTCCTCGTCTACTATGTCATAATCTTCTTTACTCAAAC
LeuLysGluAlaLeuLeuAspThrGlyAlaAspAspThrValLeuGluGluMetSerLeu -
CCAGGAAGATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTTATCAAAGTAAGACAG
121 ----------- ------------+----------------------------------+------------------+------------------------------+180
GGTCCTTCTACCTTTGGTTTITACTATCCCCCTTAACCTCCAAAATAGTTTCATTCTGTC
ProGlyArgTrpLysProLysMetlleGlyGlylleGlyGlyPhelleLysValArgGIn -
TATGATCAGATACTCATAGAAATCTGTGGACATAAAGCTATAGGTACAGTATTAGTAGGA
181 ----------- ------------+----------------------------------+------------------+----------------------------+240
ATACTAGTCTATGAGTATCTTTAGACACCTGTATTTCGATATCCATGRCATAATCATCCT
TyrAspGlnIleLeuIIeGIulleCysGlyHisLysAlalleGIyThrValLeuValGly -
CCTACACCTGTCAACATAATTGGAAGAAATCTGTTGACTCAGATTGGTTGCACTTTAAAT
241 ----------- ------------+----------------------------------+------------------+-------------------------------+ 300
GGATGRGGACAGTTGTATTAACCTTCTTTAGACAACTGAGTCTAACCAACGTGAAATTTA
ProThrProValAsnllelleGIyArgAsnLeuLeuThrGlnileGIyCysThrLeuAsn -
TTTTAGATTAGCCCTATTGAGACTGTACCAGTAAAATTAAAGCCAGGAATGGATGGCCCA
301 ----------- ------------+----------------------------------+------------------+-------------------------------+ 360
AAAATCTAATCGGGATAACTCTGACATGGTCATTTTAATTTCGGTCCTTACCTACCGGGT
PheEnd
Hae III
AAAGTTAAACAATGG
361 ----------- + 375
TTTCAATTTGTTACC
Example 3 .
Production of Mature Beta-Globin
I. Construction of Vector-expressing Beta-Globin Fusion Protein
The bacterial expression vector, pSKF-Pr-bglo, which produces a fusion protein consisting of the first 56 amino acids of galK, followed by a cleavage sequence for the HIV protease and ending with the full-length coding sequence for the beta-chain of human hemoglobin was constructed as follows:
A. The full-length human beta-globin coding sequence was inserted between the Smal and Xbal
restriction sites of the p0TSKF33 expression vector, described in Example 1, to create a translational fusion between the first 56 codons of galK from the vector and the beta-globin coding sequence. This construction, pSKF-bglo, contained a unique Neol restriction site at the junction between galK and beta-globin.
B. pSKF-bglo was linearized with Neol and an oligonucleotide with Neol overends,
5' CATGCAGGTCAGCCAAAATTACCCGATCGTGGC 3' encoding an HIV protease cleavage site, was inserted in the vector to create pSKF-Pr-bglo.
When pSKF-Pr-bglo was introduced in E. coli cell together with a plasmid encoding the HIV protease, as described in Example 1, complete processing of the fusion protein at the galK-cleavage site junction was observed upon induction of the promoters, resulting in the
over-production of mature beta-globin.
The above description and examples fully disclose the invention, including preferred embodiments thereof. Modifications of the methods described that are obvious to those skilled in the art to which the invention pertains are intended to be within the scope of the following claims:
Claims
WHAT IS CLAIMED IS:
1. A method for producing a protein product in a
recombinant host cell comprising:
• culturing a host cell which has been
co-transformed with two recombinant DNA molecules, • the first of said recombinant DNA molecules
comprises a fusion protein nucleotide sequence, wherein the fusion protein encoded by said coding sequence comprises a first portion, a protein product portion and a protease cleavage site portion therebetween, and • the second of said recombinant DNA molecules
comprising a coding sequence for a protease or an active fragment or derivative thereof, capable of cleaving amino acids encoded by the protease cleavage site portion,
• whereby upon culturing said cell, both said fusion protein and said protease are expressed, and said protease cleaves said fusion protein at said protease cleavage site to thereby produce said protein product.
2. The method of claim 1 further comprising the step of isolating the protein product from the culture.
3. The method of claim 1 wherein said first recombinant DNA molecule further comprises at least one
expression control sequence operatively linked with said fusion protein nucleotide sequence.
4. The method of claim l wherein second recombinant DNA molecule further comprises at least one expression control sequence operatively linked with said protease nucleotide sequence.
5. The method of claim 1 where in said two recombinant DNA molecules are incorporated into at least one expression vector.
6. The method of claim 5 wherein said at least one
expression vector is two expression vectors, the first of which comprises said first recombinant DNA molecule and the second of which comprises said second recombinant DNA molecule.
7. The method of claim 6 wherein said two expression vectors are compatible in the same cell.
8. The method of claim 1 wherein said two recombinant DNA molecules are incorporated into the genome of said cell.
9. The method of claim 1 wherein said protease cleavage site is a retroviral protease cleavage site.
10. The method of claim 9 wherein said retroviral
protease cleavage site is an immunodeficiency virus protease cleavage site.
11. The method of claim 10 wherein said immunodeficiency virus protease cleavage site is an HIV-1 protease cleavage site.
12. The method of claim 11 wherein said cleavage site is selected from the group consisting of
R1.tyrosine.proline.R2,
R1leucine.alanine.R2,
R1.methionine.methionine.R2,
R1. phenylalanine.leucine.R2,
R1.phenylalanine.proline.R2, and
R1.leucine.phenylalanine.R2
wherein R1 is (X1)n(X2)o(X3)p and R2
is (X4)r(X5)s and X1 , X2, X3, X4 and
X5 are independently an amino acid and n,o,p,r and s are independently 0 or 1.
13. The method of claim 12 wherein X1 is selected from the group consisting of serine and threonine.
14. The method of claim 9 wherein said cleavage site
comprises a nucleic acid sequence coding for the peptides selected from the group consisting of
Serine.Glycine.Asparagine.Tyrosine.Proline.Isoleucine.
Valine,
Serine.Phenylalanine.Asparagine.Phenylalanine.Proline.
Glycine.Isoleucine; and
Threonine.Leucine.Asparagine.Phenylalanine.Proline.
Isoleucine.Serine.
16. The method of claim 1 wherein said protease is a
retroviral disease.
17. The method of claim 16 wherein said retrovirus is an immunodeficiency virus.
18. The method of claim 17 wherein said immunodeficiency virus is HIV-1.
19. The method of claim 1 wherein said first portion is an E. coli galactokinase gene segment.
20. The method of claim 19 wherein said E. coli
galactokinase gene segment is the first 56 codons of the E. coli galactokinase gene.
21. The method of claim 1 wherein said cell is an E. coli cell.
22. A method of preparing a protein product, comprising the steps of : a. providing a first recombinant DNA molecule
comprising a fusion protein nucleotide sequence, wherein the fusion protein encoded by said coding sequence comprises a first portion, a protein product portion and a protease cleavage site portion therebetween; b. providing a second recombinant DNA molecule
comprising a coding sequence for a protease or an active fragment or derivative thereof, capable of cleaving amino acids encoded by the protease cleavage site portion; c. incorporating said first and second recombinant DNA molecules into the same cell, said cell being capable of expressing said fusion protein and said protease; d. culturing said cell under conditions selected to express said fusion protein and said protease, whereby said protease cleaves said fusion protein at said protease cleavage site to thereby produce said protein product.
23. The method of claim 20 wherein said incorporating step comprises incorporating said first and second recombinant DNA molecules into at least one
expression vector and incorporating said at least one expression vector into a cell capable of expressing said fusion protein and said protease.
24. The method of claim 23 wherein said at least one
expression vector is two expression vectors, the first of which comprises said first recombinant DNA molecule and the second of which comprises said second recombinant DNA molecule.
25. The method of claim 24 wherein said two expression vectors are compatible in the same cell.
26. The method of claim 22 wherein said incorporating
comprises incorporating said first and second
recombinant DNA molecules into the genome of the same cell.
27. The method of claim 22 wherein said protease cleavage site is a retroviral protease cleavage site.
28. The method of claim 27 wherein said retroviral
protease cleavage site is an immunodeficiency virus protease cleavage site.
29. The method of claim 28 wherein said immunodeficiency virus cleavage site is an HIV-1 protease cleavage site.
30. The method of claim 27 wherein said retroviral cleavage site is selected from the group consisting of
R1.tyrosine.proline.R2,
R1leucine.alanme.R2,
R1.methionine.methionine.R2,
R1. phenylalanine.leucine.R2,
R1.phenylalanine.proline.R2, and
R1.leucine.phenylalanine.R2
wherein R1 is (X1)n(X2)o(X3)pand R2
is (X4)r(X5)s and X1, X2, X3, X4 and
X5 are independently an amino acid and n,o,p,r and s are independently 0 or 1
31. The method of claim 30 wherein X is selected from the group consisting of serine and threonine.
32. The method of claim 27 wherein said cleavage site
comprises a nucleic acid sequence coding for the peptides selected from the group consisting of
Serine.Glycine.Asparagine.Tyrosine.Proline.Isoleucine.
Valine,
Serine.Phenylalanine.Asparagine.Phenylalanine.Proline.
Glycine. Isoleucine; and
Threonine.Leucine.Asparagine.Phenylalanine.Proline.
Isoleucine.Serine.
33. The method of claim 22 wherein said protease is a
retroviral protease.
34. The method of claim 33 wherein said retrovirus is an immunodeficiency virus.
35. The method of claim 34 wherein said immunodeficiency virus is HIV-1.
36. A fusion protein nucleotide sequence comprising a first portion, a protein product portion and a retroviral protease cleavage site portion
therebetween.
37. The nucleotide sequence of claim 36 wherein said
first portion is an E. coli galactokinase gene segment.
38. The method of claim 37 wherein said E. coli
galactokinase gene is the first 56 codons the E. coli galactokinase gene.
39. The nucleotide sequence of claim 36 wherein said
retroviral protease cleavage site is an
immunodeficiency virus protease cleavage site.
40. The method of claim 39 wherein said immunodeficiency virus protease cleavage site is an HIV-1 protease cleavage site.
41. The nucleotide sequence of claim 36 wherein said
retroviral protease cleavage site comprises
nucleotides coding for the amino acid sequence selected from the group consisting of
R1.tyrosine.proline.R2,
R1leucine.alanine.R2,
R1.methionine.methionine.R2,
R1.phenylalanine.leucine.R2,
R1.phenylalanine.proline.R2, and
R1.leucine.phenylalanine.R2
wherein R1 is (x1 ) n (x2 ) o (X3 ) p and R2
is (X4)r(X5)s and X1, X2, X3, X4 and
X5 are independently an amino acid and n,o,p,r and s are independently 0 or 1.
42. The method of claim 41 wherein X1 is selected from thr group consisting of serine and threonine.
43. The method of claim 36 wherein said cleavage site
comprises a nucleic acid sequence coding for the peptides selected from the group consisting of
Serine. Glycine.Asparagine.Tyrosine.Proline.Isoleucine.
Valine,
Serine.Phenylalanine .Asparagine. Phenylalanine .Proline.
Glycine. Isoleucine; and
Threonine. Leucine .Asparagine.Phenylalanine.Proline.
Isoleucine. Serine.
44. The nucleotide sequence of claim 36 incorporated into a plasmid.
45. The nucleotide sequence of claim 36 incorporated into a cell.
46. A nucleotide sequence coding for HIV-1 protease or a proteolytically active fragment or derivative thereof.
47. The nucleotide sequence of claim 46 wherein said
nucleotide sequence comprises the sequence or the complement of said sequence
CCTCAGATCACTCTTTGGCAACGACCCCTCGTCACAATAAAGATAGGGGGGCAA
--------+---- -------------+----------------------------------+------------------+--------------------------+
GGAGTCTAGTGAGAAACCGTTGCTGGGGAGCAGTGTTATTTCTATCCCCCCGTT
CTAAAGGAAGCTCTATTAGATACAGGAGCAGATGATACAGTATTAGAAGAAATGAGTTTG
---------+-- ------------+----------------------------------+------------------+-------------------------------+
GATTTCCTTCGAGATAATCTATGTCCTCGTCTACTATGTCATAATCTTCTTTACTCAAAC
CCAGGAAGATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTTATCAAAGTAAGACAG ----------- -------+-----+----------------------------------+------------------+-------------------------------+
GGTCCTTCTACCTTTGGTTTTTACTATCCCCCTTAACCTCCAAAATAGTTTCATTCTGTC TATGATCAGATACTCATAGAAATCTGTGGACATAAAGCTATAGGTACAGTATTAGTAGGA
----------- -----+-------+----------------------------------+------------------+-------------------------------+
ATACTAGTCTATGAGTATCTTTAGACACCTGTATTTCGATATCCATGTCATAATCATCCT
CCTACACCTGTCAACATAATTGGAAGAAATCTGTTGACTCAGATTGGTTGCACTTTAAAT
----------- ----+--------+----------------------------------+------------------+-------------------------------+
GGATGTGGACAGTTGTATTAACCTTCTTTAGACAACTGAGTCTAACCAACGTGAAATTTA
TTT
---
AAA or proteolytic fragments, mutations, additions or dele thereof.
48. The nucleotide sequence of claim 46 wherein said
sequence or the complement of said sequence is
GCCGATAGACAAGGAACTGTATCCTTTAACTTCCCTCAGATCACTCTTTGGCAACGACCC
----------- --+----------+----------------------------------+------------------+-------------------------------+
CGGCTATCTGTTCCTTGACATAGGAAATTGAAGGGAGTCTAGTGAGAAACCGTTGCTGGG
CTCGTCACAATAAAGATAGGGGGGCAACTAAAGGAAGCTCTATTAGATACAGGAGCAGAT
----------- -+-----------+----------------------------------+------------------+-------------------------------+
GAGCAGTGTTATTTCTATCCCCCCGTTGATTTCCTTCGAGATAATCTATGTCCTCGTCTA
GATACAGTATTAGAAGAAATGAGTTTGCCAGGAAGATGGAAACCAAAAATGATAGGGGGA
-------- ---------+---+----------------------------------+------------------+-------------------------------+
CTATGTCATAATCTTCTTTACTCAAACGGTCCTTCTACCTTTGGTTTTTACTATCCCCCT
ATTGGAGGTTTTATCAAAGTAAGACAGTATGATCAGATACTCATAGAAATCTGTGGACAT ----------- ------------+----------------------------------+------------------+-------------------------------+
TAACCTCCAAAATAGTTTCATTCTGTCATACTAGTCTATGAGTATCTTTAGACACCTGTA
AAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGTCAACATAATTGGAAGAAATCTG
----------- ------------+----------------------------------+------------------+-------------------------------+
TTTCGATATCCATGTCATAATCATCCTGGATGTGGACAGTTGTATTAACCTTCTTTAGAC
TTGACTCAGATTGGTTGCACTTTAAATTTTCCCATTAGCCCTATTGAGACTGTACCAGTA
----------- ------------+----------------------------------+------------------+-------------------------------+
AACTGAGTCTAACCAACGTGAAATTTAAAAGGGTAATCGGGATAACTCTGACATGGTCAT
AAATTAAAGCCAGGAATGGATGGCCCAAAAGTTAAACAATGG
----------- ------------+----------------------------------+------------------+
TTTAATTTCGGTCCTTACCTACCGGGTTTTCAATTTGTTACC
49. The nucleic acid sequence of claim 46 wherein said sequence or the complement of said sequence is
GCCGATAGACAAGGAACTGTATCCTTTAACTTCCCTCAGATCACTCTTTGGCAACGACCC
----------- ------------+----------------------------------+------------------+--------------------------------------+
CGGCTATCTGTTCCTTGACATAGGAAATTGAAGGGAGTCTAGTGAGAAACCGTTGCTGGG
CTCGTCACAATAAAGATAGGGGGGCAACTAAAGGAAGCTCTATTAGATACAGGAGCAGAT
----------- ------------+----------------------------------+------------------+------------------------------------+
GAGCAGTGTTATTTCTATCCCCCCGTTGATTTCCTTCGAGATAATCTATGTCCTCGTCTA
GATACAGTATTAGAAGAAATGAGTTTGCCAGGAAGATGGAAACCAAAAATGATAGGGGGA
----------- ------------+----------------------------------+------------------+-------------------------------------+
CTATGTCATAATCTTCTTTACTCAAACGGTCCTTCTACCTTTGGTTTTTACTATCCCCCT ATTGGAGGTTTTATCAAAGTAAGACAGTATGATCAGATACTCATAGAAATCTGTGGACAT
----------- ------------+----------------------------------+------------------+---------------------------------+
TAACCTCCAAAATAGTTTCATTCTGTCATACTAGTCTATGAGTATCTTTAGACACCTGTA
AAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGTCAACATAATTGGAAGAAATCTG
----------- ------------+----------------------------------+------------------+-------------------------------+
TTTCGATATCCATGTCATAATCATCCTGGATGTGGACAGTTGTATTAACCTTCTTTAGAC TTGACTCAGATTGGTTGCACTTTAAATTTTCCCATTAGCCCTATTGAGACTGTACCAGTA
----------- ------------+----------------------------------+------------------+-------------------------------+
AACTGAGTCTAACCAACGTGAAATTTAAAAGGGTAATCGGGATAACTCTGACATGGTCAT
AAATTAAAGCCAGGAATGGATGGCCCAAAAGTTAAACAATGG
----------- ------------+-----------------------------+-----+------------------+----
TTTAATTTCGGTCCTTACCTACCGGGTTTTCAATTTGTTACC or proteolytic fragments, mutations, additions or deletiol thereof.
50. The nucleic acid sequence of claim 46 wherein said
sequence or the complement of said sequence is
CATATGCCT-CAGATCACTCTTTGGCAACGACCCCTCGTCACAATAAAGATAGGGGGGCAA
------ ------------+----------------------------------+------------------+------------------------------------+-------+
GTATACGGAGTCTAGTGAGAAACCGTTGCTGGGGAGCAGTGTTATTTCTATCCCCCCGTT
CTAAAGGAAGCTCTATTAGATACAGGAGCAGATGATACAGTATTAGAAGAAATGAGTTTG
----------- ------------+--------------------------+--------+------------------+-------------------------------+------+
GATTTCCTTCGAGATAATCTATGTCCTCGTCTACTATGTCATAATCTTCTTTACTCAAAC
CCAGGAAGATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTTATCAAAGTAAGACAG
------ ------------+------------------------------+----+------------------+-----------------------------------+----+
GGTCCTTCTACCTTTGGTTTTTACTATCCCCCTTAACCTCCAAAATAGTTTCATTCTGTC
TATGATCAGATACTCATAGAAATCTGTGGACATAAAGCTATAGGTACAGTATTAGTAGGA
----------- ------------+----------------------------------+------------------+----------------------------------+
ATACTAGTCTATGAGTATCTTTAGACACCTGTATTTCGATATCCATGTCATAATCATCCT
CCTACACCTGTCAACATAATTGGAAGAAATCTGTTGACTCAGATTGGTTGCACTTTAAAT ----------- -------+-----+----------------------------------+------------------+--------------+------------------------+
GGATGTGGACAGTTGTATTAACCTTCTTTAGACAACTGAGTCTAACCAACGTGAAATTTA
TTTTAGATTAGCCCTATTGAGACTGTACCAGTAAAATTAAAGCCAGGAATGGATGGCCCA
----------- ---------+---+----------------------------+------+------------------+-------------------------------------+
AAAATCTAATCGGGATAACTCTGACATGGTCATTTTAATTTCGGTCCTTACCTACCGGGT
AAAGTTAAACAATGG
--------------------+----------
TTTCAATTTGTTACC
51. A recombinant E. coli protease product selected from the group of peptides consisting of
ProGlnlleThrLeuTrpGlnArgProLeuValThrIleLysIleGlyGlyGln
LeuLysGluAlaLeuLeuAspThrGlyAlaAspAspThrValLeuGluGluMetSerLeu
ProGlyArgTrpLysProLysMetlleGlyGlylleGlyGlyPhelleLysValArgGln
TyrAspGlnlleLeuIleGluIleCysGlyHisLysAlalleGlyThrValLeuValGly
ProThrProValAsnllelleGlyArgAsnLeuLeuThrGlnlleGlyCysThrLeuAsn
Phe,
AlaAspArgGlnGlyThrValSerPheAsnPheProGlnlleThrLeuTrpGlnArgPro
LeuValThrlleLysIleGlyGlyGlnLeuLysGluAlaLeuLeuAspThrGlyAlaAsp
AspThrValLeuGluGluJϊetSerLexiProGlyArgTrpLysProLysMetlleGlyGly
IleGlyGlyPhelleLysValArgGlnTyrAspGlnlleLeuIleGlylleCysGlyHis
LysAlalleGlyThrValLeuValGlyProThrProValAsnllelleGlyArgAsnLeu
LeuTlirGlnlleGlyCysThrLeiiAsnPheProIleSerProIleGluThrValProVal
LysLeuLysProGlyMetAspGlyProLysValLysGlnTrp,
AlaAspArgGlnGlyThrValSerPheAsnPheProGlnlleThrLeuTrpGlnArgPro
LeuValThrlleLysIleGlyGlyGlnLeuLysGluAlaLeuLeuAspThrGlyAlaAsp
AspThrValLeuGluGluMetSerLeuProGlyArgTrpLysProLysMetlleGlyGly
IleGlyGlyPhelleLysValArgGlnTyrAspGlnlleLeuIleGlylleCysGlyHis
LysAlalleGlyThrValLeuValGlyProThrProValAsnllelleGlyArgAsnLeu
LeuThrGlnlleGlyCysThrLeuAsnPheProIleSerProIleGluThrValProVal
LysLeuLysProGlyMetAspGlyProLysValLysGlnTrp, and
MetProGlnlleThrLeuTrpGlnArgProLeuValThrlleLysIleGlyGlyGln
LeuLysGluAlaLeuLeuAspThrGlyAlaAspAspThrValLeuGluGluMetSerLeu
ProGlyArgTrpLysProLysMetlleGlyGlylleGlyGlyPhelleLysValArgGln
TyrAspGlnlleLeuIleGluIleCysGlyHisLysAlalleGlyThrValLeuValGly
ProThrProValAsnllelleGlyArgAsnLeuLeuThrGlnlleGlyCysThrLeuAsn
PheEnd
52. The protease product of claim 51 wherein said
protease product is a proteolytically active fragment, mutation, addition or deletion of said protease product.
53. A protease specific for a cleavage site comprising the amino acid sequence selected from the group comprising
ProGlnIleThrLeuTrpGlArgProLeuValThrIleLysIleGlyGlyGln
LeuLysGluAlaLeuLeuAspϊhrGlyAlaAspAspThrValLeuGluGluMetSerLeu
ProGlyArgTrpLysProLysMetlleGlyGlylleGlyGlyPhelleLysValArgGln
TyrAspGlnlleLeuIleGluIleCysGlyHisLysAlalleGlyThrValLeuValGly
ProThrProValAsnlleileGlyArgArnLeuLeuThrGlnlleGlyCysThrLeuAsn
Phe,
AlaAspArgGlnGlyThrValSerPheAsnPheProGlnIleThrLeuTrpGlnArgPro
LeuValThrlleLysIleGlyGlyGlnLeuLysGluAlaLeuLeuAspThrGlyAlaAsp
AspThrValLeuGluGluMetSerLeuProGlyArgTrpLysProLysMetlleGlyGly
IleGlyGlyPhelleLysValArgGΪnTyrAspGlnlleLeuIleGlylleCysGlyHis
LysAlalleGlyThrValLeuValGlyProThrProValAsnlleileGlyArgAsnLeu
LeuThrGlnlleGlyCysThrLeuAsnPheProIleSerProIleGluThrValProVal
LysLeuLysProGlyMetAcpGlyProLysValLysGlnTrp,
AlaAspArgGlnGlyThrValSerPheAsnPheProGlnlleThrLeuTrpGlnArgPro
LeuValThrIleLysIleGlyGlyGlnLeuLysGluAlaLeuLeuAspThrGlyAlaAsp
AspThrValLeuGluGluMetSerLeuProGlyArgTrpLysProLysMetlleGlyGly
IleGlyGlyPhelleLysValArgGlnTyrAspGlnlleLeuIleGlylleCysGlyHis
LysAlalleGlyThrValLeuValGlyProThrProValAsnllelleGlyArgAsnLeu
LeuThrGlnlleGlyCysThrLeuAsnPheProIleSerProIleGluThrValProVal
LysLeuLysProGlyMetAspGlyProLysValLysGlnTrp, and
MetProGlnlleThrLeuTrpGlnArgProLeuValThrlleLysIleGlyGlyGln
LeuLysGluAlaLeuLeuAspThrGlyAlaAspAspThrValLeuGluGluMetSerLeu
ProGlyArgTrpLysProLysMetlleGlyGlylleGlyGlyPhelleLysValArgGln
TyrAspGlnlleLeuIleGluIleCysGlyHisLysAlalleGlyThrValLeuValGly
ProThrProValAsnllelleGlyArgAsnLeuLeuThrGlnlleGlyCysThrLeuAsn.
PheEnd
54. The protease of claim 53 wherein said protease is a proteolytically active fragment, mutation, addition or deletion of said protease.
55. A cell having incorporated therein two recombinant DNA molecules, one of said recombinant molecules comprising a fusion protein nucleotide sequence and the second of said recombinant DNA molecules
comprising a nucleotide sequence coding for a protease capable of cleaving the fusion protein at said protease cleavage site; said fusion protein nucleotide sequence encoding a first portion, a protein product portion and a protease cleavage site portion therebetween.
56. The cell of claim 55 wherein said fusion protein
nucleotide sequence and said protease nucleotide sequence are incorporated into at least one
expression vector within said cell.
57. The cells of claim 55 wherein said cells have two expression vectors, one of which comprises said fusion protein nucleotide sequence, and the other of which comprises said protease nucleotide sequence.
58. The cells of claim 55 wherein said cells are E. coli cells.
59. The cells of claim 55 wherein said two recombinant DNA molecules further comprise at least one
expression control sequence operatively linked with said fusion protein nucleotide sequence and said protease nucleotide sequence, respectively.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US28242188A | 1988-12-09 | 1988-12-09 | |
US282,421 | 1988-12-09 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1990006369A1 true WO1990006369A1 (en) | 1990-06-14 |
Family
ID=23081449
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1989/005009 WO1990006369A1 (en) | 1988-12-09 | 1989-11-07 | Use of hiv protease cleavage site to express mature proteins |
Country Status (2)
Country | Link |
---|---|
AU (1) | AU4838690A (en) |
WO (1) | WO1990006369A1 (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1991016436A1 (en) * | 1990-04-13 | 1991-10-31 | Schering Corporation | Novel protease assays |
US5599906A (en) * | 1990-04-13 | 1997-02-04 | Schering Corporation | Protease assays |
US6077694A (en) * | 1990-09-21 | 2000-06-20 | The United States Of America As Represented By The Department Of Health And Human Services | Method for over-expression and rapid purification of biosynthetic proteins |
WO2004074488A1 (en) * | 2003-02-21 | 2004-09-02 | Biotecnol S.A. | Use of caspase enzymes for maturation of engineered recombinant polypeptide fusions |
US7169603B2 (en) * | 2000-07-14 | 2007-01-30 | Mgi Pharma Biologics, Inc. | α-MSH related compounds and methods of use |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4758355A (en) * | 1987-01-02 | 1988-07-19 | Atlantic Richfield Company | Oil spill recovery method |
-
1989
- 1989-11-07 AU AU48386/90A patent/AU4838690A/en not_active Abandoned
- 1989-11-07 WO PCT/US1989/005009 patent/WO1990006369A1/en unknown
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4758355A (en) * | 1987-01-02 | 1988-07-19 | Atlantic Richfield Company | Oil spill recovery method |
Non-Patent Citations (6)
Title |
---|
CELL, Volume 40, published January 1985, S. WAIN HOBSON et al., "Nucleotide sequence of the AIDS virus, LAV", pp. 9-17, see Figure 1 and Results section entitled "pol". * |
DNA, Volume 3, Number 5, published 1984, M. HEUSTERSPREUTE et al., "Expression of galactokinase as a fusion protein in Escherichia coli and Saccharomyces cerevisae", pp. 377-386, see entire article. * |
J. BIOL. CHEM., Volume 253, Number 10, published 25 May 1978, T. YOSHIMOTO et al., "Post-proline cleaving enzyme and post-proline dipeptidyl aminopeptidase", pp. 3708-3716, see abstract, Tables I, II, III. * |
PROC. NATL. ACAD. SCI. USA, Volume 84, published December 1987, C. DEBOUCK et al., "Human immunodeficiency virus protease expressed in Escherichia coli exhibits autoprocessing and specific maturation of the gag precursor", pp. 8903-8906, see entire article. * |
PROC. NATL. ACAD. SCI. USA, Volume 84, published July 1987, V.K. CHAUDHARY et al., "Activity of a recombinant fusion protein between transforming growth factor type alpha and Pseudomonas toxin", pp. 4538-4542. See Results section. * |
VIROLOGY, Volume 161, published September 1987, O.C. RICHARDS et al., "Formation of poliovirus RNA polymerase 3D in Escherichia coli by cleavage of fusion proteins expressed from cloned viral cDNA", pp. 348-356, see entire article. * |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1991016436A1 (en) * | 1990-04-13 | 1991-10-31 | Schering Corporation | Novel protease assays |
US5599906A (en) * | 1990-04-13 | 1997-02-04 | Schering Corporation | Protease assays |
US5721133A (en) * | 1990-04-13 | 1998-02-24 | Schering Corporation | Protease assays |
US5891635A (en) * | 1990-04-13 | 1999-04-06 | Schering Corporation | Protease assays |
US6077694A (en) * | 1990-09-21 | 2000-06-20 | The United States Of America As Represented By The Department Of Health And Human Services | Method for over-expression and rapid purification of biosynthetic proteins |
US7169603B2 (en) * | 2000-07-14 | 2007-01-30 | Mgi Pharma Biologics, Inc. | α-MSH related compounds and methods of use |
WO2004074488A1 (en) * | 2003-02-21 | 2004-09-02 | Biotecnol S.A. | Use of caspase enzymes for maturation of engineered recombinant polypeptide fusions |
Also Published As
Publication number | Publication date |
---|---|
AU4838690A (en) | 1990-06-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Larder et al. | AIDS virus reverse transcriptase defined by high level expression in Escherichia coli. | |
Zybarth et al. | Domains upstream of the protease (PR) in human immunodeficiency virus type 1 Gag-Pol influence PR autoprocessing | |
Falgout et al. | Deletion analysis of dengue virus type 4 nonstructural protein NS2B: identification of a domain required for NS2B-NS3 protease activity | |
Wu et al. | Localization of the Vpx packaging signal within the C terminus of the human immunodeficiency virus type 2 Gag precursor protein | |
US6140059A (en) | Methods for the obtention of human immunodeficiency virsus Type 1 envelope glycoproteins in native and oligomeric form employing recombinant chimeric antigens containing collagenase recognition sites. | |
Nam et al. | Characterization of ribosomal frameshifting for expression of pol gene products of human T-cell leukemia virus type I | |
JP2002503477A5 (en) | ||
WO1991015575A1 (en) | Hepatitis c virus protease | |
JPH04507346A (en) | Alkaline proteolytic enzyme and its production method | |
EP0331961A2 (en) | Method for protein synthesis using CKS fusion proteins | |
CA2059541C (en) | Method of preparing a plasmid having both the ability to express a retroviral gene and the ability to process expression products after translation, the resultant plasmid and expression products thereof | |
Cheng et al. | High-level synthesis of recombinant HIV-1 protease and the recovery of active enzyme from inclusion bodies | |
Baum et al. | Isolation of mutants of human immunodeficiency virus protease based on the toxicity of the enzyme in Escherichia coli. | |
Liu et al. | Proteolytic processing of the coronavirus infectious bronchitis virus 1a polyprotein: identification of a 10-kilodalton polypeptide and determination of its cleavage sites | |
AU609725B2 (en) | Recombinant HTLV-III proteins and uses thereof | |
WO1990006369A1 (en) | Use of hiv protease cleavage site to express mature proteins | |
NO175153B (en) | polypeptides | |
US5312737A (en) | CKS method of HCV protein synthesis | |
Yasumasa et al. | Processing of hepatitis C viral polyprotein in Escherichia coli | |
US5310876A (en) | Expression of HIV1 and HIV2 polypeptides and their use | |
Hughes et al. | Morphogenic capabilities of human immunodeficiency virus type 1 gag and gag-pol proteins in insect cells | |
Stebbins et al. | [1] Expression systems for retroviral proteases | |
Rumlová et al. | Comparison of classical and affinity purification techniques of Mason-Pfizer monkey virus capsid protein: the alteration of the product by an affinity tag | |
JPH05508324A (en) | Novel soluble, non-cleavable gp160 mutants in hybrid form | |
CA2004646C (en) | Protease, reverse transcriptase and endonuclease of retrovirus, and method for producing these enzymes |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AU DK FI JP NO |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH DE FR GB IT LU NL SE |