WO2001066693A1 - Compositions and methods for producing high yields of heterologous polypeptides in a pichia cell - Google Patents
Compositions and methods for producing high yields of heterologous polypeptides in a pichia cell Download PDFInfo
- Publication number
- WO2001066693A1 WO2001066693A1 PCT/DK2001/000154 DK0100154W WO0166693A1 WO 2001066693 A1 WO2001066693 A1 WO 2001066693A1 DK 0100154 W DK0100154 W DK 0100154W WO 0166693 A1 WO0166693 A1 WO 0166693A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- cell
- sequence
- nucleotide sequence
- methanolica
- polypeptide
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 54
- 241000235648 Pichia Species 0.000 title claims abstract description 29
- 229920001184 polypeptide Polymers 0.000 title claims description 87
- 108090000765 processed proteins & peptides Proteins 0.000 title claims description 87
- 102000004196 processed proteins & peptides Human genes 0.000 title claims description 87
- 239000000203 mixture Substances 0.000 title description 7
- 239000002773 nucleotide Substances 0.000 claims abstract description 42
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 42
- 108700010070 Codon Usage Proteins 0.000 claims abstract description 38
- 241001452677 Ogataea methanolica Species 0.000 claims abstract description 29
- 238000004519 manufacturing process Methods 0.000 claims abstract description 17
- 210000004027 cell Anatomy 0.000 claims description 120
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 50
- 108090000623 proteins and genes Proteins 0.000 claims description 45
- 108010011619 6-Phytase Proteins 0.000 claims description 40
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 claims description 33
- 102000004190 Enzymes Human genes 0.000 claims description 28
- 108090000790 Enzymes Proteins 0.000 claims description 28
- 229940085127 phytase Drugs 0.000 claims description 28
- 239000013612 plasmid Substances 0.000 claims description 27
- 229940088598 enzyme Drugs 0.000 claims description 25
- 108020004414 DNA Proteins 0.000 claims description 23
- 101710121765 Endo-1,4-beta-xylanase Proteins 0.000 claims description 23
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 15
- 230000028327 secretion Effects 0.000 claims description 15
- 230000000694 effects Effects 0.000 claims description 14
- 108010025188 Alcohol oxidase Proteins 0.000 claims description 8
- 108090000698 Formate Dehydrogenases Proteins 0.000 claims description 7
- 102000016938 Catalase Human genes 0.000 claims description 6
- 108010053835 Catalase Proteins 0.000 claims description 6
- 108010067193 Formaldehyde transketolase Proteins 0.000 claims description 6
- 210000000349 chromosome Anatomy 0.000 claims description 4
- 230000012010 growth Effects 0.000 claims description 4
- 108091092566 Extrachromosomal DNA Proteins 0.000 claims description 2
- 150000007523 nucleic acids Chemical group 0.000 description 60
- 239000013598 vector Substances 0.000 description 36
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 24
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 24
- 238000003752 polymerase chain reaction Methods 0.000 description 24
- 239000012634 fragment Substances 0.000 description 22
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 19
- 239000008103 glucose Substances 0.000 description 19
- 239000002299 complementary DNA Substances 0.000 description 17
- 108020004705 Codon Proteins 0.000 description 16
- 238000006243 chemical reaction Methods 0.000 description 11
- 238000000855 fermentation Methods 0.000 description 11
- 230000004151 fermentation Effects 0.000 description 11
- 230000009466 transformation Effects 0.000 description 11
- 108091026890 Coding region Proteins 0.000 description 10
- 239000002253 acid Substances 0.000 description 10
- 239000002609 medium Substances 0.000 description 10
- 108020004707 nucleic acids Proteins 0.000 description 10
- 102000039446 nucleic acids Human genes 0.000 description 10
- 239000000872 buffer Substances 0.000 description 9
- 230000010076 replication Effects 0.000 description 9
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 8
- 238000005273 aeration Methods 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 7
- 238000010276 construction Methods 0.000 description 7
- 239000013613 expression plasmid Substances 0.000 description 7
- 239000013604 expression vector Substances 0.000 description 7
- 239000011541 reaction mixture Substances 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- 241001225321 Aspergillus fumigatus Species 0.000 description 6
- 230000010354 integration Effects 0.000 description 6
- 241000223218 Fusarium Species 0.000 description 5
- 241000223257 Thermomyces Species 0.000 description 5
- 241000223258 Thermomyces lanuginosus Species 0.000 description 5
- 230000001965 increasing effect Effects 0.000 description 5
- 238000011081 inoculation Methods 0.000 description 5
- 244000005700 microbiome Species 0.000 description 5
- 230000035772 mutation Effects 0.000 description 5
- 235000018102 proteins Nutrition 0.000 description 5
- 102000004169 proteins and genes Human genes 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 239000007222 ypd medium Substances 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 4
- UHPMCKVQTMMPCG-UHFFFAOYSA-N 5,8-dihydroxy-2-methoxy-6-methyl-7-(2-oxopropyl)naphthalene-1,4-dione Chemical compound CC1=C(CC(C)=O)C(O)=C2C(=O)C(OC)=CC(=O)C2=C1O UHPMCKVQTMMPCG-UHFFFAOYSA-N 0.000 description 4
- 101100437118 Arabidopsis thaliana AUG1 gene Proteins 0.000 description 4
- 241000193830 Bacillus <bacterium> Species 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 150000007513 acids Chemical class 0.000 description 4
- 239000011543 agarose gel Substances 0.000 description 4
- 238000013019 agitation Methods 0.000 description 4
- 229940041514 candida albicans extract Drugs 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 238000000605 extraction Methods 0.000 description 4
- 229910052757 nitrogen Inorganic materials 0.000 description 4
- 235000015097 nutrients Nutrition 0.000 description 4
- 239000000047 product Substances 0.000 description 4
- 238000003259 recombinant expression Methods 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 239000012138 yeast extract Substances 0.000 description 4
- 229920001817 Agar Polymers 0.000 description 3
- 241000228212 Aspergillus Species 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 241000235058 Komagataella pastoris Species 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 239000008272 agar Substances 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 229940091771 aspergillus fumigatus Drugs 0.000 description 3
- 230000002538 fungal effect Effects 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- 230000006801 homologous recombination Effects 0.000 description 3
- 238000002744 homologous recombination Methods 0.000 description 3
- 238000009776 industrial production Methods 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 230000014616 translation Effects 0.000 description 3
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 2
- 101100163849 Arabidopsis thaliana ARS1 gene Proteins 0.000 description 2
- 241000228245 Aspergillus niger Species 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 241000567163 Fusarium cerealis Species 0.000 description 2
- 241000146406 Fusarium heterosporum Species 0.000 description 2
- 241000221779 Fusarium sambucinum Species 0.000 description 2
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 description 2
- 102100022624 Glucoamylase Human genes 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- -1 LEU2 Proteins 0.000 description 2
- 241000320412 Ogataea angusta Species 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 241000235070 Saccharomyces Species 0.000 description 2
- 101100097319 Schizosaccharomyces pombe (strain 972 / ATCC 24843) ala1 gene Proteins 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 241000187747 Streptomyces Species 0.000 description 2
- 108010006785 Taq Polymerase Proteins 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- IXKSXJFAGXLQOQ-XISFHERQSA-N WHWLQLKPGQPMY Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 IXKSXJFAGXLQOQ-XISFHERQSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 235000011114 ammonium hydroxide Nutrition 0.000 description 2
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 2
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 238000001816 cooling Methods 0.000 description 2
- 229910000396 dipotassium phosphate Inorganic materials 0.000 description 2
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 125000001475 halogen functional group Chemical group 0.000 description 2
- 229910000359 iron(II) sulfate Inorganic materials 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 238000007834 ligase chain reaction Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- 210000001938 protoplast Anatomy 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 239000001509 sodium citrate Substances 0.000 description 2
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 229910021654 trace metal Inorganic materials 0.000 description 2
- 239000011782 vitamin Substances 0.000 description 2
- 229940088594 vitamin Drugs 0.000 description 2
- 235000013343 vitamin Nutrition 0.000 description 2
- 229930003231 vitamin Natural products 0.000 description 2
- 150000003722 vitamin derivatives Chemical class 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- 101710163881 5,6-dihydroxyindole-2-carboxylic acid oxidase Proteins 0.000 description 1
- 101150096273 ADE2 gene Proteins 0.000 description 1
- 241001019659 Acremonium <Plectosphaerellaceae> Species 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000228215 Aspergillus aculeatus Species 0.000 description 1
- 241001513093 Aspergillus awamori Species 0.000 description 1
- 241000892910 Aspergillus foetidus Species 0.000 description 1
- 241001480052 Aspergillus japonicus Species 0.000 description 1
- 241000351920 Aspergillus nidulans Species 0.000 description 1
- 240000006439 Aspergillus oryzae Species 0.000 description 1
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 1
- 241000223651 Aureobasidium Species 0.000 description 1
- 241000193744 Bacillus amyloliquefaciens Species 0.000 description 1
- 241000193752 Bacillus circulans Species 0.000 description 1
- 241000193749 Bacillus coagulans Species 0.000 description 1
- 241000193422 Bacillus lentus Species 0.000 description 1
- 241000194108 Bacillus licheniformis Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 108010023063 Bacto-peptone Proteins 0.000 description 1
- 102100030981 Beta-alanine-activating enzyme Human genes 0.000 description 1
- 241000193764 Brevibacillus brevis Species 0.000 description 1
- 210000003771 C cell Anatomy 0.000 description 1
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 1
- 102100037633 Centrin-3 Human genes 0.000 description 1
- 241001337994 Cryptococcus <scale insect> Species 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 101710088194 Dehydrogenase Proteins 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 241000223194 Fusarium culmorum Species 0.000 description 1
- 241000223195 Fusarium graminearum Species 0.000 description 1
- 241000223221 Fusarium oxysporum Species 0.000 description 1
- 241001112697 Fusarium reticulatum Species 0.000 description 1
- 241001014439 Fusarium sarcochroum Species 0.000 description 1
- 241001465753 Fusarium torulosum Species 0.000 description 1
- 241000567178 Fusarium venenatum Species 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 1
- 102000008214 Glutamate decarboxylase Human genes 0.000 description 1
- 108091022930 Glutamate decarboxylase Proteins 0.000 description 1
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 101150009006 HIS3 gene Proteins 0.000 description 1
- 101100246753 Halobacterium salinarum (strain ATCC 700922 / JCM 11081 / NRC-1) pyrF gene Proteins 0.000 description 1
- 244000286779 Hansenula anomala Species 0.000 description 1
- 235000014683 Hansenula anomala Nutrition 0.000 description 1
- 101000773364 Homo sapiens Beta-alanine-activating enzyme Proteins 0.000 description 1
- 101000880522 Homo sapiens Centrin-3 Proteins 0.000 description 1
- 102000008100 Human Serum Albumin Human genes 0.000 description 1
- 108091006905 Human Serum Albumin Proteins 0.000 description 1
- 241000223198 Humicola Species 0.000 description 1
- 241001480714 Humicola insolens Species 0.000 description 1
- IMQLKJBTEOYOSI-GPIVLXJGSA-N Inositol-hexakisphosphate Chemical compound OP(O)(=O)O[C@H]1[C@H](OP(O)(O)=O)[C@@H](OP(O)(O)=O)[C@H](OP(O)(O)=O)[C@H](OP(O)(O)=O)[C@@H]1OP(O)(O)=O IMQLKJBTEOYOSI-GPIVLXJGSA-N 0.000 description 1
- 241000235649 Kluyveromyces Species 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 101500023488 Lithobates catesbeianus GnRH-associated peptide 1 Proteins 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 101150068888 MET3 gene Proteins 0.000 description 1
- 241001344133 Magnaporthe Species 0.000 description 1
- 241000235395 Mucor Species 0.000 description 1
- 241000226677 Myceliophthora Species 0.000 description 1
- 241000233892 Neocallimastix Species 0.000 description 1
- 241000221960 Neurospora Species 0.000 description 1
- 241000221961 Neurospora crassa Species 0.000 description 1
- 101100022915 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cys-11 gene Proteins 0.000 description 1
- 101100494726 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) pep-4 gene Proteins 0.000 description 1
- 241001236817 Paecilomyces <Clavicipitaceae> Species 0.000 description 1
- 241000194109 Paenibacillus lautus Species 0.000 description 1
- 206010034133 Pathogen resistance Diseases 0.000 description 1
- 241000228143 Penicillium Species 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 1
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 1
- IMQLKJBTEOYOSI-UHFFFAOYSA-N Phytic acid Natural products OP(O)(=O)OC1C(OP(O)(O)=O)C(OP(O)(O)=O)C(OP(O)(O)=O)C(OP(O)(O)=O)C1OP(O)(O)=O IMQLKJBTEOYOSI-UHFFFAOYSA-N 0.000 description 1
- 241000235379 Piromyces Species 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 101710130181 Protochlorophyllide reductase A, chloroplastic Proteins 0.000 description 1
- 241000589774 Pseudomonas sp. Species 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 241000235403 Rhizomucor miehei Species 0.000 description 1
- 240000005384 Rhizopus oryzae Species 0.000 description 1
- 235000013752 Rhizopus oryzae Nutrition 0.000 description 1
- 101100394989 Rhodopseudomonas palustris (strain ATCC BAA-98 / CGA009) hisI gene Proteins 0.000 description 1
- 235000003534 Saccharomyces carlsbergensis Nutrition 0.000 description 1
- 235000001006 Saccharomyces cerevisiae var diastaticus Nutrition 0.000 description 1
- 244000206963 Saccharomyces cerevisiae var. diastaticus Species 0.000 description 1
- 241000204893 Saccharomyces douglasii Species 0.000 description 1
- 241001407717 Saccharomyces norbensis Species 0.000 description 1
- 241001123227 Saccharomyces pastorianus Species 0.000 description 1
- 241000235060 Scheffersomyces stipitis Species 0.000 description 1
- 241000222480 Schizophyllum Species 0.000 description 1
- 241000235346 Schizosaccharomyces Species 0.000 description 1
- 101100022918 Schizosaccharomyces pombe (strain 972 / ATCC 24843) sua1 gene Proteins 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 241000187398 Streptomyces lividans Species 0.000 description 1
- KDYFGRWQOYBRFD-UHFFFAOYSA-N Succinic acid Natural products OC(=O)CCC(O)=O KDYFGRWQOYBRFD-UHFFFAOYSA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 241000228341 Talaromyces Species 0.000 description 1
- 241001540751 Talaromyces ruber Species 0.000 description 1
- 241000228178 Thermoascus Species 0.000 description 1
- 241001313536 Thermothelomyces thermophila Species 0.000 description 1
- 241001494489 Thielavia Species 0.000 description 1
- 241001149964 Tolypocladium Species 0.000 description 1
- 102000005924 Triose-Phosphate Isomerase Human genes 0.000 description 1
- 108700015934 Triose-phosphate isomerases Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 101150050575 URA3 gene Proteins 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 241000235013 Yarrowia Species 0.000 description 1
- 241000222124 [Candida] boidinii Species 0.000 description 1
- FENRSEGZMITUEF-ATTCVCFYSA-E [Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].OP(=O)([O-])O[C@@H]1[C@@H](OP(=O)([O-])[O-])[C@H](OP(=O)(O)[O-])[C@H](OP(=O)([O-])[O-])[C@H](OP(=O)(O)[O-])[C@H]1OP(=O)([O-])[O-] Chemical compound [Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].OP(=O)([O-])O[C@@H]1[C@@H](OP(=O)([O-])[O-])[C@H](OP(=O)(O)[O-])[C@H](OP(=O)([O-])[O-])[C@H](OP(=O)(O)[O-])[C@H]1OP(=O)([O-])[O-] FENRSEGZMITUEF-ATTCVCFYSA-E 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 238000012870 ammonium sulfate precipitation Methods 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 229940054340 bacillus coagulans Drugs 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 108010051210 beta-Fructofuranosidase Proteins 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 239000003139 biocide Substances 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- KDYFGRWQOYBRFD-NUQCWPJISA-N butanedioic acid Chemical compound O[14C](=O)CC[14C](O)=O KDYFGRWQOYBRFD-NUQCWPJISA-N 0.000 description 1
- FAPWYRCQGJNNSJ-UBKPKTQASA-L calcium D-pantothenic acid Chemical compound [Ca+2].OCC(C)(C)[C@@H](O)C(=O)NCCC([O-])=O.OCC(C)(C)[C@@H](O)C(=O)NCCC([O-])=O FAPWYRCQGJNNSJ-UBKPKTQASA-L 0.000 description 1
- 229960002079 calcium pantothenate Drugs 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000011098 chromatofocusing Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000002361 compost Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 229910000366 copper(II) sulfate Inorganic materials 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 230000005684 electric field Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000001952 enzyme assay Methods 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 230000008020 evaporation Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 1
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 1
- 230000002414 glycolytic effect Effects 0.000 description 1
- 229910001385 heavy metal Inorganic materials 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 239000003262 industrial enzyme Substances 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 239000001573 invertase Substances 0.000 description 1
- 235000011073 invertase Nutrition 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 229910000357 manganese(II) sulfate Inorganic materials 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- MEFBJEMVZONFCJ-UHFFFAOYSA-N molybdate Chemical compound [O-][Mo]([O-])(=O)=O MEFBJEMVZONFCJ-UHFFFAOYSA-N 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 239000000825 pharmaceutical preparation Substances 0.000 description 1
- 229940127557 pharmaceutical product Drugs 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 235000002949 phytic acid Nutrition 0.000 description 1
- 239000000467 phytic acid Substances 0.000 description 1
- 229940068041 phytic acid Drugs 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 230000008092 positive effect Effects 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 239000008057 potassium phosphate buffer Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 235000019833 protease Nutrition 0.000 description 1
- 235000019419 proteases Nutrition 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 102220082204 rs753215899 Human genes 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000007974 sodium acetate buffer Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 229940083982 sodium phytate Drugs 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 238000010563 solid-state fermentation Methods 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 238000001694 spray drying Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 229920001221 xylan Polymers 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
- C12N15/815—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts for yeasts other than Saccharomyces
Definitions
- compositions and methods for producing high yields of heterologous polypeptides in a Pichia cell Compositions and methods for producing high yields of heterologous polypeptides in a Pichia cell
- Yeast cells are potential candidates for host organisms for heterologous protein production.
- Saccharomyces cerevisiae for the production of e.g. pharmaceutical products or industrial enzymes is well known. It has been suggested that methylotrophic yeasts that can utilize methanol as a sole carbon and energy source are more attractive host strains for heterologous protein production.
- High protein productivity has been achieved with strongly inducible promoters, such as the one from alcohol oxidase or formate dehydrogenase, by performing high cell density fermentations in which the dried cell mass exceeds 100 g/kg. Examples of high expression of heterologous protein are: 2.5 g/l of S. cere visiae invertase (Tschopp et. al.
- Pichia methanolica is another methylotrophic yeast that has been shown recently to produce human glutamate decarboxylase at the level of 0.5 g/l intracellulariy (Yeast, 14, 11-23, 1998). It has been stated that P. methanolica could be used for production of any heterologous protein, but no example of productivity or yield over 1 gram per litre has been shown yet (US 5,716,808; WO 97/17450).
- the problem to be solved by this invention is to provide expression systems and methods for producing a polypeptide of interest in high yields.
- the solution is based on that the present inventors have identified compositions and methods for high yield production of a heterologous polypeptide of interest in a Pichia cell, specifically Pichia methanolica.
- a heterologous polypeptide of interest By adjusting the codon usage of a gene encoding a polypeptide of interest to the codon usage of one or more gene(s) from the wild type P. methanolica, the inventors have shown that protein productivity surprisingly can be increased with as much as a factor of 10 or more.
- heterologous polypeptides were expressed from codon-optimized genes with original secretion signal sequences and secreted from P. methanolica.
- a gene encoding a polypeptide of interest is most often present within the production host cell in multiple copies which are all expressed in order to achieve an optimally high yield of the polypeptide.
- Gene analysis by Southern blotting of a production cell of the invention revealed that the high polypeptide yield was achieved with a very low gene dosage of only approx. 2 copies pr. cell, which suggests a potential for increasing the yield of the polypeptide of interest even further by increasing the copy number.
- the present invention relates to a Pichia cell comprising at least one copy of a heterologous nucleotide sequence encoding a polypeptide of interest, wherein the codon usage of said sequence has been adjusted to match the preferred codon usage of P methanolica, as defined herein
- the preferred codon usage of P methanolica as relates to this invention is presented in Example 2, Table 1 , e g as the codon-ratios used in the optimized phytase gene sequence for each ammo acid More specifically the preferred codons for Lysine are TTA or TTG, for Isoleucme they are ATT or ATC, for Valine they are GTT or less preferred GTC, for Se ⁇ ne they are TCT, TCC or TCA, for Proline they are CCA or less preferred CCT, for Threonine they are ACT or less preferred ACC, for Alanine they are GCT or GCC, for Tyrosme the more preferred is TAC; for Glutamine it is CAA, for Asparagine it is AAC, for Glutamic acid it is GAA, for Cysteine it is TGT, for Arginine it is AGA, and finally for Glycine it is GGT
- the codon usage of a DNA sequence encoding a polypeptide is adjusted to match the preferred codon usage of P methanolica when at least 50% of the codons for specific ammo acids in the polypeptide sequence have been adjusted to a preferred codon as defined above, or preferably at least 60%, more preferably at least 70%, even more preferably 80%, yet more preferably 90%, and most preferably 95% of the codons for specific ammo acids in the polypeptide sequence have been adjusted to a preferred codon as defined above
- the present invention relates to an isolated DNA construct comprising at least one copy of a nucleotide sequence encoding a polypeptide heterologous to Pichia methanolica, wherein the codon usage of said sequence has been adjusted to match the preferred codon usage of P methanolica, as defined herein
- the present invention relates to a method of producing a polypeptide of interest in a Pichia cell, where the polypeptide is encoded by a nucleotide sequence heterologous to P methanolica, said method comprising the steps of a) adjusting the codon usage of the sequence to match the preferred codon usage of P methanolica, as defined herein, and b) cultivating a Pichia cell comprising at least one copy of the codon usage adjusted sequence of step a) under appropriate growth conditions to express the sequence and achieve production of the polypeptide of interest
- the entire sequence of the synthetic codon optimized A fumigatus phytase gene (SEQ ID No 19) is comprised in the strain E coli NN049526, which was deposited with "Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH” (DSMZ) 01-March-2000 with the accession number DSM 13352
- isolated nucleic acid sequence refers to a nucleic acid sequence which is essentially free of other nucleic acid sequences, e g , at least about 20% pure, preferably at least about 40% pure, more preferably at least about 60% pure, even more preferably at least about 80% pure, and most preferably at least about 90% pure as determined by agarose electrophoresis
- an isolated nucleic acid sequence can be obtained by standard cloning procedures used in genetic engineering to relocate the nucleic acid sequence from its natural location to a different site where it will be reproduced The cloning procedures may involve excision and isolation of a desired nucleic acid fragment comprising the nucleic acid sequence encoding the polypeptide, insertion of the fragment into a vector molecule, and incorporation of the recombinant vector into a host cell where multiple copies or clones of the nucleic acid sequence will be replicated
- nucleic acid sequences of the present invention may be of genomic, cDNA, RNA, semisynthetic, synthetic origin, or any combinations thereof and may be obtained from microorganisms of any genus
- the term "obtained from as used herein in connection with a given source shall mean that the polypeptide encoded by the nucleic acid sequence is produced by the source or by a cell in which the nucleic acid sequence from the source has been inserted
- the nucleic acid sequences may be obtained from a bacterial source
- these polypeptides may be obtained from a gram positive bacterium such as a Bacillus strain, e g , Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus coagulans, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megate ⁇ um, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thunngiensis, or a Streptomyces strain, e g , Streptomyces lividans or Streptomyces munnus, or from a gram negative bacterium, e g , E coli or Pseudomonas sp
- the nucleic acid sequences may be obtained from a fungal source, and more preferably from a yeast strain such as a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia strain, or more preferably from a filamentous fungal strain such as an Acremonium, Aspergillus Aureobasidium, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Piromyces, Schizophyllum Talaromyces, Thermoascus, Thielavia Tolypocladium, or Tnchoderma strain
- a yeast strain such as a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia strain
- the nucleic acid sequences may thus be obtained from Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyven, Saccharomyces norbensis or Saccharomyces oviformis, or indeed from from an Aspergillus aculeatus, Aspergillus awamori, Aspergillus foetidus, Aspergillus japonicus Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Fusarium bactndioides Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum Fus
- the nucleic acid sequence is the sequence encoding a phytase comprised in Esche ⁇ chia coll DSM13352
- the nucleic acid sequence is SEQ ID NO 3 or SEQ ID NO 19
- the invention encompasses both the perfect and imperfect states, and other taxonomic equivalents, e g , anamorphs, regardless of the species name by which they are known Those skilled in the art will readily recognize the identity of appropriate equivalents Strains of these species are readily accessible to the public in a number of culture collections such as the American Type Culture Collection (ATCC), Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSMZ), Centraalbureau Voor Schimmelcultures (CBS), and Agricultural Research Service Patent Culture Collection, Northern Regional Research Center (NRRL)
- ATCC American Type Culture Collection
- DSMZ Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH
- CBS Centraalbureau Voor Schimmelcultures
- NRRL Northern Regional Research Center
- nucleic acid sequences may be identified and obtained from other sources including microorganisms isolated from nature (e g , soil, composts water, etc ) using the above-mentioned probes Techniques for isolating microorganisms from natural habitats are well known in the art
- the nucleic acid sequence may then be derived by similarly screening a genomic or cDNA library of another microorganism Once a nucleic acid sequence encoding a polypeptide has been detected with the probe(s), the sequence may be isolated or cloned by utilizing techniques which are known to those of ordinary skill in the art (see, e g , Sambrook et al , 1989, supra)
- the techniques used to isolate or clone a nucleic acid sequence encoding a polypeptide include isolation from genomic DNA, preparation from cDNA, or a combination thereof
- the cloning of the nucleic acid sequences of the present invention from such genomic DNA can be effected, e g , by using the well-known polymerase chain reaction (PCR) or antibody screening of expression libraries to detect cloned DNA fragments with shared structural features
- PCR polymerase chain reaction
- LCR ligase chain reaction
- LAT ligated activated transcription
- NASBA nucleic acid sequence-based amplification
- a nucleic acid sequence of the present invention may also encode fused polypeptides or cleavable fusion polypeptides in which another polypeptide is fused at the N-terminus or the C-terminus of the polypeptide or fragment thereof
- a fused polypeptide is produced by fusing a nucleic acid sequence (or a portion thereof) encoding another polypeptide to a nucleic acid sequence (or a portion thereof) of the present invention
- Techniques for producing fusion polypeptides are known in the art, and include hgating the coding sequences encoding the polypeptides so that they are in frame and that expression of the fused polypeptide is under control of the same promoter(s) and terminator
- the present invention also relates to nucleic acid constructs comprising a nucleic acid sequence of the present invention operably linked to one or more control sequences which direct the expression of the coding sequence in a suitable host cell under conditions compatible with the control sequences Expression will be understood to include any step involved in the production
- nucleic acid construct or “DNA construct” is defined herein as a nucleic acid molecule, either single- or double-stranded, which is isolated from a naturally occurring gene or which has been modified to contain segments of nucleic acid which are combined and juxtaposed in a manner which would not otherwise exist in nature
- nucleic acid construct is synonymous with the term expression cassette when the nucleic acid construct contains all the control sequences required for expression of a coding sequence of the present invention
- coding sequence is defined herein as a portion of a nucleic acid sequence which directly specifies the ammo acid sequence of its protein product The boundaries of the coding sequence are generally determined by a ribosome binding site (prokaryotes) or by the ATG start codon (eukaryotes) located just upstream of the open reading frame at the 5' end of the mRNA and a transcription terminator sequence located just downstream of the open reading frame at the 3' end of the mRNA
- a coding sequence can include, but is not limited to, DNA, cDNA
- control sequences is defined herein to include all components which are necessary or advantageous for the expression of a polypeptide of the present invention
- Each control sequence may be native or foreign to the nucleic acid sequence encoding the polypeptide
- Such control sequences include, but are not limited to, a leader, polyadenylation sequence, propeptide sequence, promoter, signal peptide sequence, and transcription terminator
- the control sequences include a promoter, and transcnptional and translational stop signals
- the control sequences may be provided with linkers for the purpose of introducing specific restriction sites facilitating ligation of the control sequences with the coding region of the nucleic acid sequence encoding a polypeptide
- Operably linked is defined herein as a configuration in which a control sequence is appropriately placed at a position relative to the coding sequence of the DNA sequence such that the control sequence directs the expression of a polypeptide
- the control sequence may be an appropriate promoter sequence, a nucleic acid sequence which is recognized by a host cell for expression of
- Useful signal peptides for yeast host cells are obtained from the genes for Saccharomyces cerevisiae alpha-factor and Saccharomyces cerevisiae mvertase Other useful signal peptide coding regions are described by Romanos et al , 1992, Yeast 8 423-488
- regulatory sequences which allow the regulation of the expression of the polypeptide relative to the growth of the host cell
- regulatory systems are those which cause the expression of the gene to be turned on or off in response to a chemical or physical stimulus, including the presence of a regulatory compound
- the present invention also relates to recombinant expression vectors comprising a nucleic acid sequence of the present invention, a promoter, and transcnptional and translational stop signals
- a recombinant expression vector which may include one or more convenient restriction sites to allow for insertion or substitution of the nucleic acid sequence encoding the polypeptide at such sites
- the nucleic acid sequence of the present invention may be expressed by inserting the nucleic acid sequence or a nucleic acid construct comprising the sequence into an appropriate vector for expression
- the coding sequence is located in the vector so that the coding sequence is operably linked with the appropriate control sequences for expression
- the recombinant expression vector may be any vector (e g , a plasmid or virus) which can be conveniently subjected to recombinant DNA procedures and can bring about the expression of the nucleic acid sequence
- the choice of the vector will typically depend on the compatibility of the vector with the host cell into which the vector is to be introduced
- the vectors may be linear or closed circular plasmids
- the vector may be an autonomously replicating vector, / e , a vector which exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication, e g , a plasmid, an extrachromosomal element, a mini-chromosome, or an artificial chromosome
- the vector may contain any means for assuring self-replication
- the vector may be one which, when introduced into the host cell, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated
- the vectors of the present invention preferably contain an element(s) that permits stable integration of the vector into the host cell genome or autonomous replication of the vector in the cell independent of the genome of the cell
- the vector may rely on the nucleic acid sequence encoding the polypeptide or any other element of the vector for stable integration of the vector into the genome by homologous or nonhomologous recombination
- the vector may contain additional nucleic acid sequences for directing integration by homologous recombination into the genome of the host cell
- the additional nucleic acid sequences enable the vector to be integrated into the host cell genome at a precise locat ⁇ on(s) in the chromosome(s)
- the mtegrational elements should preferably contain a sufficient number of nucleic acids, such as 100 to 1 ,500 base pairs, preferably 400 to 1 ,500 base pairs, and most preferably 800 to 1 ,500 base pairs, which are highly homologous with the corresponding target sequence
- the integrational elements may be any sequence that is homologous with the target sequence in the genome of the host cell. Furthermore, the integrational elements may be non-encoding or encoding nucleic acid sequences. On the other hand, the vector may be integrated into the genome of the host cell by non-homologous recombination.
- the vector may further comprise an origin of replication enabling the vector to replicate autonomously in the host cell in question.
- origins of replication for use in a yeast host cell are the two micron origins of replication, ARS1 , ARS4, the combination of ARS1 and CEN3, and the combination of ARS4 and CEN6.
- the origin of replication may be one having a mutation which makes its functioning temperature-sensitive in the host cell (see, e.g., Ehrlich, 1978, Proceedings of the National Academy of Sciences USA 75: 1433).
- More than one copy of a nucleic acid sequence of the present invention may be inserted into the host cell to increase production of the gene product.
- An increase in the copy number of the nucleic acid sequence can be obtained by integrating at least one additional copy of the sequence into the host cell genome or by including an amplifiable selectable marker gene with the nucleic acid sequence where cells containing amplified copies of the selectable marker gene, and thereby additional copies of the nucleic acid sequence, can be selected for by cultivating the cells in the presence of the appropriate selectable agent.
- the procedures used to ligate the elements described above to construct the recombinant expression vectors of the present invention are well known to one skilled in the art (see, e.g., Sambrook et al., 1989, supra).
- the present invention also relates to recombinant host cells, comprising a nucleic acid sequence of the invention, which are advantageously used in the recombinant production of the polypeptides.
- a vector comprising a nucleic acid sequence of the present invention is introduced into a host cell so that the vector is maintained as a chromosomal integrant or as a self-replicating extra-chromosomal vector as described earlier.
- host cell encompasses any progeny of a parent cell that is not identical to the parent cell due to mutations that occur during replication.
- the choice of a host cell will to a large extent depend upon the gene encoding the polypeptide and its source.
- Fungal cells may be transformed by a process involving protoplast formation, transformation of the protoplasts, and regeneration of the cell wall in a manner known per se. Suitable procedures for transformation of Aspergillus host cells are described in EP 238 023 and Yelton et al., 1984, Proceedings of the National Academy of Sciences USA 81 : 1470-1474. Suitable methods for transforming Fusarium species are described by Malardier et al., 1989, Gene 78: 147-156 and WO 96/00787.
- Yeast may be transformed using the procedures described by Becker and Guarente, In Abelson, J.N. and Simon, M.I., editors, Guide to Yeast Genetics and Molecular Biology, Methods in Enzymology, Volume 194, pp 182-187, Academic Press, Inc., New York; Ito et al., 1983, Journal of Bacteriology 53: 163; and Hinnen et ai, 1978, Proceedings of the National Academy of Sciences USA 75: 1920.
- the present invention also relates to methods for producing a polypeptide comprising cultivating a host cell under conditions suitable for production of the polypeptide.
- the cells are cultivated in a nutrient medium suitable for production of the polypeptide using methods known in the art.
- the cell may be cultivated by shake flask cultivation, small-scale or large-scale fermentation (including continuous, batch, fed-batch, or solid state fermentations) in laboratory or industrial fermentors performed in a suitable medium and under conditions allowing the polypeptide to be expressed and/or isolated
- the cultivation takes place in a suitable nutrient medium comprising carbon and nitrogen sources and inorganic salts, using procedures known in the art Suitable media are available from commercial suppliers or may be prepared according to published compositions (e g , in catalogues of the American Type Culture Collection)
- the polypeptide is secreted into the nutrient medium, the polypeptide can be recovered directly from the medium If the polypeptide is not secreted, it can be recovered from cell lysates
- the polypeptides may be detected using methods known in the art that are specific for the polypeptides These detection methods may include use of specific antibodies, formation of an enzyme product, or disappearance of an enzyme substrate
- an enzyme assay may be used to determine the activity of the polypeptide as described herein
- the resulting polypeptide may be recovered by methods known in the art
- the polypeptide may be recovered from the nutrient medium by conventional procedures including, but not limited to, cent fugation, filtration, extraction, spray-drying, evaporation, or precipitation
- the polypeptides may be purified by a variety of procedures known in the art including, but not limited to, chromatography (e g , ion exchange, affinity, hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures (e g , preparative isoelectnc focusing), differential solubility (e , ammonium sulfate precipitation), SDS-PAGE, or extraction (see, e g , Protein Purification, J -C Janson and Lars Ryden, editors, VCH Publishers, New York, 1989) Detailed description of the invention
- a Pichia cell comprising at least one copy of a heterologous nucleotide sequence encoding a polypeptide of interest, wherein the codon usage of said sequence has been adjusted to match the preferred codon usage of P methanolica, as defined herein
- the specific Pichia cell may be any Pichia cell, such as a Pichia methanolica, Pichia pastoris, Pichia anomala, Pichia stipitis, Pichia angusta or Pichia guilliermondn cell
- the specific Pichia cell is a Pichia methanolica cell
- a preferred embodiment of the invention relates to a Pichia cell of the first aspect of the invention, wherein the nucleotide sequence encodes an enzyme, preferably the nucleotide sequence encodes an enzyme having phytase or xylanase activity, or even more preferably the nucleotide sequence comprises a DNA sequence at least 90% identical, or preferably at least 93% identical, or more preferably at least 95% identical, or most preferably at least 97% identical to the sequences shown in SEQ ID No 3, SEQ ID No 19, or to the phytase encoding sequence comprised in a cell of DSM 13352
- a further embodiment relates to a Pichia cell of the first aspect, wherein the nucleotide sequence is present on an extrachromosomal DNA construct, preferably a plasmid, or more preferably the nucleotide sequence is integrated in the cell's chromosome, preferably in multiple copies
- a preferred embodiment of the invention relates to a Pichia cell of the first aspect, wherein the nucleotide sequence is transcribed from a promoter of a methanol-inducible P methanolica gene, preferably the promoter is a native promoter of a P methanolica gene encoding an enzyme selected from the group consisting of alcohol oxidase, dihydroxyacetone synthase, formate dehydrogenase, and catalase
- polypeptides For industrial production of polypeptides it is of considerable interest to have the polypeptides secreted from the producing host cells in order to minimize costs of isolating the polypeptide, ways of promoting efficient secretion are well known in the art
- a polypeptide may be efficiently secreted if it carries an effective secretion signal peptide recognized by the host cell's secretion apparatus
- a number of secretion signal peptides have been described to work well in yeast host cells as described elsewhere herein The present inventors found that even without the addition of a well-characterized yeast secretion signal peptide, efficient secretion could be achieved from a host cell of a polypeptide carrying its native original secretion signal sequence
- a preferred embodiment of the invention relates to a Pichia cell of the first aspect, wherein the polypeptide comprises its native original secretion signal sequence
- the present invention relies on adjusting the codon usage of an isolated DNA sequence to match the preferred codon usage of Pichia methanolica
- An isolated DNA construct comprising at least one copy of a nucleotide sequence encoding a polypeptide heterologous to Pichia methanolica, wherein the codon usage of said sequence has been adjusted to match the preferred codon usage of P methanolica, as defined herein
- a preferred embodiment of the invention relates to an isolated DNA construct of the second aspect of the invention, wherein the nucleotide sequence encodes an enzyme, preferably the nucleotide sequence encodes an enzyme having phytase or xylanase activity, and more preferably the nucleotide sequence comprises a DNA sequence at least 90% identical, or preferably at least 93% identical, or more preferably at least 95% identical, or most preferably at least 97% identical to the sequences shown in SEQ ID No 3, SEQ ID No 19, or to the phytase encoding sequence comprised in a cell of DSM 13352
- Another preferred embodiment of the invention relates to an isolated DNA construct of the second aspect of the invention, wherein the nucleotide sequence is transcribed from a promoter of a methanol-inducible P methanolica gene, preferably the promoter is a native promoter of a P methanolica gene encoding an enzyme selected from the group consisting of alcohol oxidase, dihydroxyacetone synthase, formate dehydrogenase, and catalase
- Other promoters generally recognized as working well in Pichia cells are those of the glycolytic enzymes, such as glyceraldehyde-3-phosphate dehydrogenase, triose phosphate isomerase, enolase etc
- a preferred embodiment of the invention relates to an isolated DNA construct of the second aspect of the invention, wherein the nucleotide sequence encodes a polypeptide comprising its native original secretion signal sequence
- the invention relates to industrial production, l e a method of producing a polypeptide of interest in a Pichia cell, where the polypeptide is encoded by a nucleotide sequence heterologous to P methanolica, said method comprising the steps of a) adjusting the codon usage of the sequence to match the preferred codon usage of P methanolica, as defined herein, and b) cultivating a Pichia cell comprising at least one copy of the codon usage adjusted sequence of step a) under appropriate growth conditions to express the sequence and achieve production of the polypeptide of interest
- a preferred embodiment of the invention relates to a method of the third aspect, wherein the Pichia cell is a Pichia methanolica cell
- the nucleotide sequence encodes an enzyme, preferably the nucleotide sequence encodes an enzyme having phytase or xylanase activity, or more preferably the nucleotide sequence comprises a DNA sequence at least 90% identical, or preferably at least 93% identical, or more preferably at least 95% identical, or most preferably at least 97% identical to the sequences shown in SEQ ID No 3, SEQ ID No 19, or to the phytase encoding sequence comprised in a cell of DSM 13352
- a preferred embodiment relates to a method of the third aspect, wherein the polypeptide is produced in a yield at least three times higher than what is obtained from an otherwise identical cell comprising a sequence not adjusted to the preferred codon usage of P methanolica, as defined herein, preferably at least five times higher, more preferably seven, even more preferably eight, yet more preferably nine, and most preferably ten times higher than what is obtained from an otherwise identical cell comprising a sequence not adjusted to the preferred codon usage of P methanolica, as defined herein
- preferred embodiments relate to methods of the third aspect, wherein the nucleotide sequence is transcribed from a promoter of a methanol-inducible P methanolica gene, preferably the promoter is a native promoter of a P methanolica gene encoding an enzyme selected from the group consisting of alcohol oxidase, dihydroxyacetone synthase, formate dehydrogenase, and catalase, and preferably methanol is added to the cell during the cultivating step, whereby the methanol-inducible promoter is induced, it is also preferred that the nucleotide sequence encodes a polypeptide comprising its native original secretion signal sequence Material and methods
- E coli DH12S (Gibco BRL) and E coii JM109 (Toyobo) were used for plasmid construction and amplification
- Amplified plasmid was recovered with Qiagen ® Plasmid Kit (Qiagen) Ligation was done with DNA ligation kit (Takara) or T4 DNA ligase (Boehnnger Mannheim) Polymerase Chain Reaction (PCR) was carried out either with Taq DNA polymerase (Boehnnger Mannheim) or with Expand TM PCR system especially ExpandTM Long Template (Boehnnger Mannheim) QIAquickTM Gel Extraction Kit (Qiagen) was used for the purification of PCR fragments and extraction of DNA fragment from agarose gel A plasmid pT7blue T-vector (Novagen) was used for the cloning of PCR fragments
- the expression host strain is a Pichia methanolica PMAD16 (ade2- pep4 ⁇ prbl ⁇ ) described in WO 99/14347 It is a vacuolar protease deficient mutant originally generated from P methanolica CBS 6515 Two major vacuolar proteinases, l e Protemase A encoded by the pep4 gene and Protemase B encoded by the prbl gene, are inactivated by disrupting these genes (WO 99/14347)
- the expression cassette plasmid pCZR134 is described in WO 99/14347 It comprises the promoter and terminator of the alcohol oxidase gene AUG1 from P methanolica (WO 99/14347) and the ADE2 gene encoding phospho ⁇ bosyl-5-am ⁇ no ⁇ m ⁇ dazole carboxylase (WO 99/14347) from P methanolica
- Another expression cassette plasmid is pGAP, which is identical to pCZR134 but contains the promoter of the gene encoding the glyceraldehyde-3-phophate dehydrogenase GAP1 from P methanolica (described in WO00/56903, SEQ ID No therein) instead of the AUG1 promoter as described above
- Transformation of P methanolica Transformation of Pichia methanolica can be done using the general methods for yeast transformation As the most preferred method, the procedure of electro transformation is described below
- the P methanolica host strain is grown in non-selective YPD medium (20g/l of Pepton, Difco, 10g/I of yeast extract, Difco and 20g/l of glucose) at 37°C for overnight with vigorous aeration, and the culture is diluted 100-fold in 200ml fresh, pre-warmed YPD This is further cultivated at 37°C until the optical density at 663n ⁇ m becomes 1 2-1 5, which corresponds to 9x10 7 cells/ml Cells are collected by centnfugation at 3000 g for 10 minutes, and re-suspended in 40ml of 50mM potassium phosphate buffer pH 7.5 containing 25mM of dithiothreitol (DTT), then incubated at 37°C for 15 minutes.
- DTT dithiothreitol
- the cells are washed twice with ice-cold electroporation buffer STM (270mM sucrose, 10mM Tris pH7 5 and 1 mM CaCI2), at first with 200 ml (the original culture volume), then with 100 ml (50% original volume) Finally the cells are re-suspended in 1 ml (0 5% of original volume) ice-cold STM buffer to a concentration of approximately 2x10 10 electrocompetent cells/ml
- STM 270mM sucrose, 10mM Tris pH7 5 and 1 mM CaCI2
- 0 1-10 ⁇ g of DNA (volume ⁇ 10 ⁇ l) is added to 100 ⁇ l of electrocompetent cells and placed into an ice-cold electroporation cuvette with an electrode gap of 2mm
- a pulsed electric field is provided at 3 75 kV/cm for 20 millisecond, using a BioRad Gene PulserTM electroporator on which resistance is set to >600 ⁇ or "mfitine' and capacitance is set to 25 ⁇ F
- 1 ml of YPD is added to the cell/DNA mixture and incubated for 1 hour at 30°C Cells are harvested by centnfugation at 3000 g for 5 minutes, then washed with
- yeast nitrogen base 6 7 g/l yeast nitrogen base without ammo acids, Difco
- minimal selective media e g SC glucose Host cells having an ade2 mutation appear as pink colonies on minimal medium, whereas transformants that acquired a stable integration (Ade+) can grow rapidly as white colonies on minimal medium
- the cDNA sequence of the Thermomyces lanuginosus DSM4109 xylanase gene was published (WO 96/32472)
- DSM 10133 A Saccharomyces cerevisiae strain, DSM 10133, containing a plasmid comprising the full-length xylanase cDNA of the Thermomyces lanuginosus DSM4109
- Thermomyces xylanase cDNA was amplified by PCR using primers xyN and xyC to introduce desirable restriction enzyme cleavage sites at both ends of the PCR fragment xyN (SEQ ID 1 ) CCGGAATTCATGGTCGGCTTTACCCCCGTT XyC (SEQ ID 2) AAGGAAAAAAGCGGCCGCACTAGTTTAGCCCA CGTCAGCAAC
- the PCR reaction mixture comprised 50 ng of template cDNA, 0 05mM of dNTPs, 100 pmol of each primer and 1 un ⁇ t of Taq polymerase in 50 ⁇ l of Taq buffer with MgCI2 (Boehnnger Mannheim)
- the reaction was initiated at 94°C for 3 5 minutes followed by 30 cycles of 94°C for 1 5 minutes, 50°C for 1 5 minutes and 72°C for 3 minutes, with the final extension at 72°C for 10 minutes
- the amplified 0 7kb fragment was digested with EcoRI and Spel, and purified on 1 5% agarose gel Obtained fragment was subjected to the ligation with pCZR134 linearized by EcoRI and Spel
- the resulting plasmid was termed pCaXM .
- Themomyces xylanase gene was synthesized to change the codon usage to have A or T as the preferred third letter of each codon. This is not a full optimization to the preferred codon usage of Pichia methanolica, as defined herein, but rather a first attempt to see if there is any advantage in optimizing the codons further.
- Two fragments XyC12 and XyC37 were synthesized by two-step PCR. The resulting designed xylanase encoding sequence is shown as SEQ ID 3.
- Oliqonucleotides used for the first PCR reaction were as follows:
- xyC1 (SEQ ID 4): atggttggttttactccagttgctttagctgctttagctgctactggtgctttagctttc ccagcaggtaatgctactgaattagaaaaagacaaacaactccaaattctgaaggttgg.
- xyC2r (SEQ ID 5): cataggtaccaccttctaaattagtataagtagcttgagcaccaccatca gaccaccaccaagaataataataaccatcatgccaaccttcagaattggggaga.
- XyC37 xyC3 (SEQ ID 6): ggtacctatgaaatttcttggggtgatggtggtaatttagttggtggtaaaggttggaat ccaggtttaaatgcaagagctattcattttgaaggtgtttatcaaccaaatggtaattct.
- xyC4r (SEQ ID 7): ctagtagttttacctaatctataaatagaaccatcacattcaacagtacctaaatcag tagcaccagaagatggatcataagtaccaaaatttttcaacaatataatattcaactaatgga.
- xyC5 (SEQ ID 8): ttctggtgctactgatttaggtactgtttgaatgtgatggttctatttatagattaggtaa aactactagagttaatgcaccatctattgatggtactcaaactttcgatcaatattggtc.
- xyC6r (SEQ ID 9): taataatgatcaccattaacattcaaaccagctctagcccaagcatcgaaatgac aaccagtttgaacagtaccagaagttcttttatcttgtctaacagaccaatattgatcgaaagtt.
- xyC7 (SEQ ID 10) ttaatggtgatcattattatcaaattgttgcaactgaaggttatttctcttctggttatg ctagaattactgttgctgatgttggttaa
- Oliqonucleotides used for the second PCR reaction were as follows For the synthesis of XyC 12 xyc12N (SEQ ID 11 ) ataagaatgcggccgcatggttggtttactccagt xyc12C (SEQ ID 12) tcataggtaccaccttctaaattagtataa
- the first PCR reaction mixture contained 1 pmol of each oligonucleotide, 0 35mM dNTP, 2 6 units of ExpandTM polymerase in 50 ⁇ l of provided buffer Reaction started at 94°C for 1 minute followed by 6 cycles of 94°C for 0 5 minutes, 50°C for 5 minutes and 72°C for 1 minute with the final extension at 72°C for 10 minutes
- Five ⁇ l of resulting first PCR reaction solution was used for the second PCR with 0 35mM dNTP, 100 pmol of each oligonucleotide and 2 6 units of ExpandTM polymerase in 50 ⁇ l of provided buffer
- the reaction was carried out at 94°C for 3 5 minutes first followed by 30 cycles of 94°C for 1 5 minutes, 50°C for 1 5 minutes and 72°C for 3 minutes with the final extension at 72°C for 10 minutes
- the amplified fragments were purified on 4% agarose gel then ligated into pT7blue T-vector resulting p
- the synthesized Thermomyces xylanase gene was amplified by PCR using the following primers;
- the reaction mixture comprised 10 ng of synthesized xylanase gene, 0.35 mM of dNTP, 100 pmol of each primer and 2.6 units of ExpandTM polymerase in 50 ⁇ l of provided buffer with MgCI2.
- the PCR reaction was initiated at 94°C for 3.5 minutes followed by 30 cycles of 94°C for 1.5 minutes, 55°C for 1.5 minutes and 68°C for 3 minutes with the final extension at 72°C for 10 minutes.
- the amplified gene was digested with EcoRI and Spel, and ligated into pCZR134 cut by EcoRI and Spel beforehand and the resulting plasmid was termed pCaXS3
- the expression plasmids pCaXM and pCaXS3 were digested with Notl and
- 6.2kb fragments were used for the transformation of P. methanolica PMAD16 to achieve a Ade+ phenotype.
- Transformed cells were grown on SC glucose (2% glucose, 0.01 % tryptophan, 0.02% threonine, 0.5% casamino acid, in basal salt without ammo acids (per litre, 6.68g yeast nitrogen base w/o ammo acid (Difco), 10g succinic acid, 6g NaOH, and
- Xylanase producing cells were inoculated in YPD medium (20g/l of Pepton, Difco, 10g/I of yeast extract, Difco and 20g/l of glucose) in a 500 ml shaking flask, and cultivated under vigorous aeration overnight at 30°C
- YPD medium 20g/l of Pepton, Difco, 10g/I of yeast extract, Difco and 20g/l of glucose
- a transformant carrying the synthesized Thermomyces xylanase gene was inoculated in two flasks with 110ml of YEPD ( 10g/I Bacto yeast extract, 20g/l Bacto peptone and 20g/l glucose) in 500ml shaking flasks then cultivated under vigorous aeration condition at 30°C for 16-18 hours Most of the cells were used for the subsequent inoculation of the fermentor
- the fermentation vessel was filled with 2 liters of media containing 43g (NH4)2S04, 5 2g K2HP04, 19g KH2P04, 0 8g FeS04 7H20, 2 3g sodium citrate with 2aq After autoclavmg and cooling down to 30°C, 94ml of 60% glucose, 40ml of 1 M MgS04 7H20, 4ml of 1 M CaCI2, 40ml of trace metal solution (containing, per litter, 27 8g Fe S04 7H20, 0 5g CuS04 5H20, 1 09
- the reaction mixture comprising 10 ng of cDNA clone of A fumigatus phytase, 0 35 mM dNTPs, 100 pmol of each primers and 2 6 units of ExpandTM polymerase in 50 ⁇ l of provided buffer with MgCI2, was incubated at 94°C for 2 minutes followed by 10 cycles of 94°C for 10 seconds, 55°C for 30 seconds, 68°C for 45 seconds After that another 20 cycles were performed as 94°C for 10 seconds, 55°C for 30 seconds, 68°C for 45 plus n x 20 seconds (n cycle number)
- Seventy oligonucleotides (SEQ ID's 20-89) were prepared for the construction of the codon optimized phytase gene, and subjected to PCR using Expand TM Long Template (Boehnnger Mannheim) for assembly.
- the reaction mixture comprised 125 pmol of each of the seventy oligonucleotides shown in SEQ ID's 20-89, 0.2 mM dNTPs and 2.6 units of Expand TM polymerase in 50 ⁇ l of provided buffer.
- the reaction was employed as 55 cycles of 94°C for 30 seconds, 52°C for 30 seconds and 68°C for 30 seconds.
- the second PCR was carried out using the resulting first reaction mixture as template and the following set of primers:
- reaction mixtures comprised 0.5 ⁇ l, 1 ⁇ l or 2 ⁇ l of the first PCR mixture, 30 pmol of each primer, 0 2 mM dNTPs and 2.6 units of Expand TM polymerase in 50 ⁇ l of provided buffer
- the reaction was employed as 25 cycles of 94°C for 30 seconds, 50°C for 30 seconds and 68°C for 60 + 10 x n seconds (n cycle number)
- An amplified 1 3kb fragment was purified with QIAquick and cloned into pT7blue T-vector Two independent piasmids pTPhy ⁇ and pTPhy ⁇ were selected for further construction, they had a couple of PCR generated sequence errors within the Hindlll— Spel region of pTPhy ⁇ and the EcoRI-Hi
- plasmid pTPhy58 which comprises the entire sequence of the synthetic A fumigatus phytase gene
- the plasmid pTPhy ⁇ was digested with EcoRI and Spel, and a 1 3kb fragment was isolated and ligated into the expression vector pCZR134 The resulting phytase expression plasmid was designated pCPS40
- the entire sequence of the synthetic codon optimized A fumigatus phytase gene (SEQ ID No 19) is comprised in the strain E coll NN049526, which was deposited with DSMZ 01-March-2000 with the accession number DSM 13352 The deposit was made by Novo Nordisk A/S and was later assigned to Novozymes A/S
- piasmids pCaPhy3 and pCPS40 were digested with Notl and
- Transformed cells were grown on minimal medium SC glucose containing 1 2 M sorbitol White colonies growing rapidly were isolated as stable transformants The obtained transformants were subsequently grown on YPM agar containing 0 1 % phytic acid (Sigma P3810) at 30°C overnight, then 0.1 M CaCI2 was poured onto the grown agar plate to detect phytase production. Positive transformants were detected as colonies surrounded by a clear halo.
- Phytase producing strains were inoculated in 50ml of YPD medium in 500ml of shaking flasks and cultivated at 30°C with vigorous aeration. After overnight cultivation, 1 ml of methanol was added to each shaking flask and the cultivation was continued one more day. Cells were removed by centnfugation and phytase activity in the supernatant was measured using 0.15% sodium phytate in 100mM sodium acetate buffer pH 5.5 as the substrate. The reaction was carried out at room temperature for 15 minutes, and liberated phosphate ions were quantified by colorization with 1 % ammonum molybdate, 0.26M FeS04 in 3.2% H2S04 and absorbance was measured at 650nm. The amount of enzyme protein was calculated from the activity based on the known specific activity per mg enzyme protein. The results are shown in Table 2.
- the yield of A. fumigatus phytase from a transformant generated with pCaPhy3, carrying non codon optimized cDNA of the phytase gene was calculated to be 310 mg/l with 120 g/kg of dried cells at 170 hours of fermentation, whereas the yield from the transformant with a codon-optimised gene was estimated to be 3.5 g/l with 115 g/kg of dried cells at 190 hours of fermentation.
- a synthesized A. fumigatus phytase gene was obtained from pCPS40 as described in Example 2 by digesting with the restriction enzymes EcoRI and Spel. The isolated 1.3kb fragment was ligated into the pGAP vector, which was initially pre-digested with the same enzymes, the resulting plasmid was termed pGAPphy. The plasmid pGAPphy was then digested with Notl and the 6.9kb fragment was used for the transformation of P.methanolica PMAD16. Transformants were isolated as described in Example 2.
- Obtained transformants were cultivated in 5ml of YPD medium at 30°C overnight with vigorous aeration, then 1 ml of grown culture was transferred to fresh 100ml of YPD medium in a shaking flask. After one-day cultivation at 30°C, 2ml of 20% glucose solution was added to each flask and cultivation was continued for one more day. Enzyme activity of the culture broth, from which the grown cells were removed, was measured to estimate the enzyme productivity. The estimated phytase productivity was 47mg/l.
- a transformant carrying a synthesized Aspergillus fumigatus phytase gene was fermented in 5L jar-fermenter.
- the fermentation vessel was filled with 2 liters of medium containing 43g (NH4)2S04, 5.2g K2HP04, 19g KH2P04, 0.8g Fe S04.7H20, 2.3g sodium citrate with 2aq.
- the pH of fermentation media was adjusted to 5.0 and during the fermentation it was controlled automatically above 5.0 with 25% ammonia water.
- Aeration was provided as compressed air at a flow rate of 3 litres per minute initially and agitation at the rate of 100 rpm. After inoculation the agitation was controlled automatically from 100 rpm to 1000 rpm to keep the dissolved oxygen level at minimum 30% of saturation. Around 11 hours after inoculation, 1100ml of 60% glucose was supplemented. The yield of Aspergillus fumigatus phytase at 50 hours was calculated to 1000 mg/l with the 65 g/kg of dried cells.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- Organic Chemistry (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Mycology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The present invention relates to a method of heterologous production in Pichia, where the codon usage of a heterologous nucleotide sequence is adjusted to the preferred codon usage of Pichia methanolica.
Description
Compositions and methods for producing high yields of heterologous polypeptides in a Pichia cell
Field of invention The high yield production of certain polypeptides is of industrial importance, thus compositions and methods for producing high yields of heterologous polypeptides in a Pichia cell such as Pichia methanolica are of interest.
Background of the invention Yeast cells are potential candidates for host organisms for heterologous protein production. To use Saccharomyces cerevisiae for the production of e.g. pharmaceutical products or industrial enzymes is well known. It has been suggested that methylotrophic yeasts that can utilize methanol as a sole carbon and energy source are more attractive host strains for heterologous protein production. High protein productivity has been achieved with strongly inducible promoters, such as the one from alcohol oxidase or formate dehydrogenase, by performing high cell density fermentations in which the dried cell mass exceeds 100 g/kg. Examples of high expression of heterologous protein are: 2.5 g/l of S. cere visiae invertase (Tschopp et. al. Bio/Technology, vol 5, 1305-1308, 1987); 3.4 g/l of human serum albumin in Pichia pastoris (EP 510693); 3.4g/l of glucoamylase from Rhizopus oryzae in Candida boidinii (Sakai et.al, Biochem. Biophys. Acta, 1308, 81-87,1996); 1.4g/l of Schawanniomyces occidentalis glucoamylase (Gelissen et.al, Bio/Technology, vol 9, 291-295, 1991 ); and 7.2g/l of Aspergillus phytase in Hansenula polymorpha (EP911416). In WO 99/67398 an Aspergillus niger phytase is expressed in and secreted from Pichia pastoris by using a fusion with the signal peptide of yeast α-factor.
Pichia methanolica is another methylotrophic yeast that has been shown
recently to produce human glutamate decarboxylase at the level of 0.5 g/l intracellulariy (Yeast, 14, 11-23, 1998). It has been stated that P. methanolica could be used for production of any heterologous protein, but no example of productivity or yield over 1 gram per litre has been shown yet (US 5,716,808; WO 97/17450).
Summary of the invention
The problem to be solved by this invention is to provide expression systems and methods for producing a polypeptide of interest in high yields.
The solution is based on that the present inventors have identified compositions and methods for high yield production of a heterologous polypeptide of interest in a Pichia cell, specifically Pichia methanolica. By adjusting the codon usage of a gene encoding a polypeptide of interest to the codon usage of one or more gene(s) from the wild type P. methanolica, the inventors have shown that protein productivity surprisingly can be increased with as much as a factor of 10 or more. In non-limiting example herein heterologous polypeptides were expressed from codon-optimized genes with original secretion signal sequences and secreted from P. methanolica.
In the industrial production of polypeptides, a gene encoding a polypeptide of interest is most often present within the production host cell in multiple copies which are all expressed in order to achieve an optimally high yield of the polypeptide. Gene analysis by Southern blotting of a production cell of the invention revealed that the high polypeptide yield was achieved with a very low gene dosage of only approx. 2 copies pr. cell, which suggests a potential for increasing the yield of the polypeptide of interest even further by increasing the copy number. Accordingly in a first aspect, the present invention relates to a Pichia cell comprising at least one copy of a heterologous nucleotide sequence encoding a
polypeptide of interest, wherein the codon usage of said sequence has been adjusted to match the preferred codon usage of P methanolica, as defined herein
The preferred codon usage of P methanolica as relates to this invention is presented in Example 2, Table 1 , e g as the codon-ratios used in the optimized phytase gene sequence for each ammo acid More specifically the preferred codons for Lysine are TTA or TTG, for Isoleucme they are ATT or ATC, for Valine they are GTT or less preferred GTC, for Seπne they are TCT, TCC or TCA, for Proline they are CCA or less preferred CCT, for Threonine they are ACT or less preferred ACC, for Alanine they are GCT or GCC, for Tyrosme the more preferred is TAC; for Glutamine it is CAA, for Asparagine it is AAC, for Glutamic acid it is GAA, for Cysteine it is TGT, for Arginine it is AGA, and finally for Glycine it is GGT
The codon usage of a DNA sequence encoding a polypeptide is adjusted to match the preferred codon usage of P methanolica when at least 50% of the codons for specific ammo acids in the polypeptide sequence have been adjusted to a preferred codon as defined above, or preferably at least 60%, more preferably at least 70%, even more preferably 80%, yet more preferably 90%, and most preferably 95% of the codons for specific ammo acids in the polypeptide sequence have been adjusted to a preferred codon as defined above
Further in a second aspect, the present invention relates to an isolated DNA construct comprising at least one copy of a nucleotide sequence encoding a polypeptide heterologous to Pichia methanolica, wherein the codon usage of said sequence has been adjusted to match the preferred codon usage of P methanolica, as defined herein
Finally in a third aspect the present invention relates to a method of producing a polypeptide of interest in a Pichia cell, where the polypeptide is encoded by a nucleotide sequence heterologous to P methanolica, said method comprising the steps of a) adjusting the codon usage of the sequence to match the preferred codon usage of P
methanolica, as defined herein, and b) cultivating a Pichia cell comprising at least one copy of the codon usage adjusted sequence of step a) under appropriate growth conditions to express the sequence and achieve production of the polypeptide of interest
Deposited microorganisms
The entire sequence of the synthetic codon optimized A fumigatus phytase gene (SEQ ID No 19) is comprised in the strain E coli NN049526, which was deposited with "Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH" (DSMZ) 01-March-2000 with the accession number DSM 13352
The cDNA sequence of the Thermomyces lanuginosus DSM4109 xylanase gene was published in WO 96/32472 For this study we obtained a Saccharomyces cerevisiae strain from DSMZ, accession number DSM 10133, deposited on
19-July-1995, which contains a plasmid comprising the full-length xylanase cDNA of the Thermomyces lanuginosus DSM4109
The deposits were made by Novo Nordisk A/S and were later assigned to Novozymes A S
Definitions The term "isolated nucleic acid sequence" as used herein refers to a nucleic acid sequence which is essentially free of other nucleic acid sequences, e g , at least about 20% pure, preferably at least about 40% pure, more preferably at least about 60% pure, even more preferably at least about 80% pure, and most preferably at least about 90% pure as determined by agarose electrophoresis For example, an isolated nucleic acid sequence can be obtained by standard cloning procedures used in genetic engineering to relocate the nucleic acid sequence
from its natural location to a different site where it will be reproduced The cloning procedures may involve excision and isolation of a desired nucleic acid fragment comprising the nucleic acid sequence encoding the polypeptide, insertion of the fragment into a vector molecule, and incorporation of the recombinant vector into a host cell where multiple copies or clones of the nucleic acid sequence will be replicated
The nucleic acid sequences of the present invention may be of genomic, cDNA, RNA, semisynthetic, synthetic origin, or any combinations thereof and may be obtained from microorganisms of any genus For purposes of the present invention, the term "obtained from as used herein in connection with a given source shall mean that the polypeptide encoded by the nucleic acid sequence is produced by the source or by a cell in which the nucleic acid sequence from the source has been inserted
The nucleic acid sequences may be obtained from a bacterial source For example, these polypeptides may be obtained from a gram positive bacterium such as a Bacillus strain, e g , Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus coagulans, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megateπum, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thunngiensis, or a Streptomyces strain, e g , Streptomyces lividans or Streptomyces munnus, or from a gram negative bacterium, e g , E coli or Pseudomonas sp
The nucleic acid sequences may be obtained from a fungal source, and more preferably from a yeast strain such as a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia strain, or more preferably from a filamentous fungal strain such as an Acremonium, Aspergillus Aureobasidium, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Piromyces,
Schizophyllum Talaromyces, Thermoascus, Thielavia Tolypocladium, or Tnchoderma strain
The nucleic acid sequences may thus be obtained from Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyven, Saccharomyces norbensis or Saccharomyces oviformis, or indeed from from an Aspergillus aculeatus, Aspergillus awamori, Aspergillus foetidus, Aspergillus japonicus Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Fusarium bactndioides Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum Fusarium roseum Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotnchioides, Fusarium sulphureum, Fusarium torulosum, Fusarium tnchothecioides, Fusarium venenatum, Humicola insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Penicillium purpurogenum, Tnchoderma harzianum, Tnchoderma koningii, Tnchoderma longibrachiatum, Tnchoderma reesei, or Tnchoderma vinde strain
In a preferred embodiment, the nucleic acid sequence is the sequence encoding a phytase comprised in Escheπchia coll DSM13352 In another preferred embodiment, the nucleic acid sequence is SEQ ID NO 3 or SEQ ID NO 19 It will be understood that for the aforementioned species, the invention encompasses both the perfect and imperfect states, and other taxonomic equivalents, e g , anamorphs, regardless of the species name by which they are known Those skilled in the art will readily recognize the identity of appropriate equivalents Strains of these species are readily accessible to the public in a number of culture collections such as the American Type Culture Collection (ATCC), Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSMZ), Centraalbureau Voor
Schimmelcultures (CBS), and Agricultural Research Service Patent Culture Collection, Northern Regional Research Center (NRRL)
Furthermore, such nucleic acid sequences may be identified and obtained from other sources including microorganisms isolated from nature (e g , soil, composts water, etc ) using the above-mentioned probes Techniques for isolating microorganisms from natural habitats are well known in the art The nucleic acid sequence may then be derived by similarly screening a genomic or cDNA library of another microorganism Once a nucleic acid sequence encoding a polypeptide has been detected with the probe(s), the sequence may be isolated or cloned by utilizing techniques which are known to those of ordinary skill in the art (see, e g , Sambrook et al , 1989, supra)
The techniques used to isolate or clone a nucleic acid sequence encoding a polypeptide are known in the art and include isolation from genomic DNA, preparation from cDNA, or a combination thereof The cloning of the nucleic acid sequences of the present invention from such genomic DNA can be effected, e g , by using the well-known polymerase chain reaction (PCR) or antibody screening of expression libraries to detect cloned DNA fragments with shared structural features See, e g , Innis et al , 1990, PCR A Guide to Methods and Application, Academic Press, New York Other nucleic acid amplification procedures such as ligase chain reaction (LCR), ligated activated transcription (LAT) and nucleic acid sequence-based amplification (NASBA) may be used
A nucleic acid sequence of the present invention may also encode fused polypeptides or cleavable fusion polypeptides in which another polypeptide is fused at the N-terminus or the C-terminus of the polypeptide or fragment thereof A fused polypeptide is produced by fusing a nucleic acid sequence (or a portion thereof) encoding another polypeptide to a nucleic acid sequence (or a portion thereof) of the
present invention Techniques for producing fusion polypeptides are known in the art, and include hgating the coding sequences encoding the polypeptides so that they are in frame and that expression of the fused polypeptide is under control of the same promoter(s) and terminator The present invention also relates to nucleic acid constructs comprising a nucleic acid sequence of the present invention operably linked to one or more control sequences which direct the expression of the coding sequence in a suitable host cell under conditions compatible with the control sequences Expression will be understood to include any step involved in the production of the polypeptide including, but not limited to, transcription, post-transcπptional modification, translation, post-translational modification, and secretion
"Nucleic acid construct" or "DNA construct" is defined herein as a nucleic acid molecule, either single- or double-stranded, which is isolated from a naturally occurring gene or which has been modified to contain segments of nucleic acid which are combined and juxtaposed in a manner which would not otherwise exist in nature The term nucleic acid construct is synonymous with the term expression cassette when the nucleic acid construct contains all the control sequences required for expression of a coding sequence of the present invention The term "coding sequence" is defined herein as a portion of a nucleic acid sequence which directly specifies the ammo acid sequence of its protein product The boundaries of the coding sequence are generally determined by a ribosome binding site (prokaryotes) or by the ATG start codon (eukaryotes) located just upstream of the open reading frame at the 5' end of the mRNA and a transcription terminator sequence located just downstream of the open reading frame at the 3' end of the mRNA A coding sequence can include, but is not limited to, DNA, cDNA, and recombinant nucleic acid sequences
An isolated nucleic acid sequence encoding a polypeptide of the present invention may be manipulated in a variety of ways to provide for expression of the polypeptide Manipulation of the nucleic acid sequence prior to its insertion into a vector may be desirable or necessary depending on the expression vector The techniques for modifying nucleic acid sequences utilizing recombinant DNA methods are well known in the art
The term "control sequences' is defined herein to include all components which are necessary or advantageous for the expression of a polypeptide of the present invention Each control sequence may be native or foreign to the nucleic acid sequence encoding the polypeptide Such control sequences include, but are not limited to, a leader, polyadenylation sequence, propeptide sequence, promoter, signal peptide sequence, and transcription terminator At a minimum, the control sequences include a promoter, and transcnptional and translational stop signals The control sequences may be provided with linkers for the purpose of introducing specific restriction sites facilitating ligation of the control sequences with the coding region of the nucleic acid sequence encoding a polypeptide The term Operably linked" is defined herein as a configuration in which a control sequence is appropriately placed at a position relative to the coding sequence of the DNA sequence such that the control sequence directs the expression of a polypeptide The control sequence may be an appropriate promoter sequence, a nucleic acid sequence which is recognized by a host cell for expression of the nucleic acid sequence The promoter sequence contains transcnptional control sequences which mediate the expression of the polypeptide The promoter may be any nucleic acid sequence which shows transcnptional activity in the host cell of choice including mutant, truncated, and hybrid promoters, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the
host cell
Useful signal peptides for yeast host cells are obtained from the genes for Saccharomyces cerevisiae alpha-factor and Saccharomyces cerevisiae mvertase Other useful signal peptide coding regions are described by Romanos et al , 1992, Yeast 8 423-488
It may also be desirable to add regulatory sequences which allow the regulation of the expression of the polypeptide relative to the growth of the host cell Examples of regulatory systems are those which cause the expression of the gene to be turned on or off in response to a chemical or physical stimulus, including the presence of a regulatory compound
The present invention also relates to recombinant expression vectors comprising a nucleic acid sequence of the present invention, a promoter, and transcnptional and translational stop signals The various nucleic acid and control sequences described above may be joined together to produce a recombinant expression vector which may include one or more convenient restriction sites to allow for insertion or substitution of the nucleic acid sequence encoding the polypeptide at such sites Alternatively, the nucleic acid sequence of the present invention may be expressed by inserting the nucleic acid sequence or a nucleic acid construct comprising the sequence into an appropriate vector for expression In creating the expression vector, the coding sequence is located in the vector so that the coding sequence is operably linked with the appropriate control sequences for expression
The recombinant expression vector may be any vector (e g , a plasmid or virus) which can be conveniently subjected to recombinant DNA procedures and can bring about the expression of the nucleic acid sequence The choice of the vector will typically depend on the compatibility of the vector with the host cell into which the vector is to be introduced The vectors may be linear or closed circular plasmids
The vector may be an autonomously replicating vector, / e , a vector which exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication, e g , a plasmid, an extrachromosomal element, a mini-chromosome, or an artificial chromosome The vector may contain any means for assuring self-replication Alternatively, the vector may be one which, when introduced into the host cell, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated Furthermore, a single vector or plasmid or two or more vectors or plasmids which together contain the total DNA to be introduced into the genome of the host cell, or a transposon may be used The vectors of the present invention preferably contain one or more selectable markers which permit easy selection of transformed cells A selectable marker is a gene the product of which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to auxotrophs, and the like Suitable markers for yeast host cells are ADE1 , ADE2, HIS3, LEU2, LYS2, MET3, TRP1 , and URA3
The vectors of the present invention preferably contain an element(s) that permits stable integration of the vector into the host cell genome or autonomous replication of the vector in the cell independent of the genome of the cell For integration into the host cell genome, the vector may rely on the nucleic acid sequence encoding the polypeptide or any other element of the vector for stable integration of the vector into the genome by homologous or nonhomologous recombination Alternatively, the vector may contain additional nucleic acid sequences for directing integration by homologous recombination into the genome of the host cell The additional nucleic acid sequences enable the vector to be integrated into the host cell genome at a precise locatιon(s) in the chromosome(s) To increase the likelihood of integration at a precise location, the mtegrational elements should preferably contain a
sufficient number of nucleic acids, such as 100 to 1 ,500 base pairs, preferably 400 to 1 ,500 base pairs, and most preferably 800 to 1 ,500 base pairs, which are highly homologous with the corresponding target sequence to enhance the probability of homologous recombination. The integrational elements may be any sequence that is homologous with the target sequence in the genome of the host cell. Furthermore, the integrational elements may be non-encoding or encoding nucleic acid sequences. On the other hand, the vector may be integrated into the genome of the host cell by non-homologous recombination.
For autonomous replication, the vector may further comprise an origin of replication enabling the vector to replicate autonomously in the host cell in question. Examples of origins of replication for use in a yeast host cell are the two micron origins of replication, ARS1 , ARS4, the combination of ARS1 and CEN3, and the combination of ARS4 and CEN6. The origin of replication may be one having a mutation which makes its functioning temperature-sensitive in the host cell (see, e.g., Ehrlich, 1978, Proceedings of the National Academy of Sciences USA 75: 1433).
More than one copy of a nucleic acid sequence of the present invention may be inserted into the host cell to increase production of the gene product. An increase in the copy number of the nucleic acid sequence can be obtained by integrating at least one additional copy of the sequence into the host cell genome or by including an amplifiable selectable marker gene with the nucleic acid sequence where cells containing amplified copies of the selectable marker gene, and thereby additional copies of the nucleic acid sequence, can be selected for by cultivating the cells in the presence of the appropriate selectable agent.
The procedures used to ligate the elements described above to construct the recombinant expression vectors of the present invention are well known to one skilled in the art (see, e.g., Sambrook et al., 1989, supra).
The present invention also relates to recombinant host cells, comprising a nucleic acid sequence of the invention, which are advantageously used in the recombinant production of the polypeptides. A vector comprising a nucleic acid sequence of the present invention is introduced into a host cell so that the vector is maintained as a chromosomal integrant or as a self-replicating extra-chromosomal vector as described earlier. The term "host cell" encompasses any progeny of a parent cell that is not identical to the parent cell due to mutations that occur during replication. The choice of a host cell will to a large extent depend upon the gene encoding the polypeptide and its source. Fungal cells may be transformed by a process involving protoplast formation, transformation of the protoplasts, and regeneration of the cell wall in a manner known per se. Suitable procedures for transformation of Aspergillus host cells are described in EP 238 023 and Yelton et al., 1984, Proceedings of the National Academy of Sciences USA 81 : 1470-1474. Suitable methods for transforming Fusarium species are described by Malardier et al., 1989, Gene 78: 147-156 and WO 96/00787. Yeast may be transformed using the procedures described by Becker and Guarente, In Abelson, J.N. and Simon, M.I., editors, Guide to Yeast Genetics and Molecular Biology, Methods in Enzymology, Volume 194, pp 182-187, Academic Press, Inc., New York; Ito et al., 1983, Journal of Bacteriology 53: 163; and Hinnen et ai, 1978, Proceedings of the National Academy of Sciences USA 75: 1920.
The present invention also relates to methods for producing a polypeptide comprising cultivating a host cell under conditions suitable for production of the polypeptide. In the production methods of the present invention, the cells are cultivated in a nutrient medium suitable for production of the polypeptide using methods known in the art. For example, the cell may be cultivated by shake flask cultivation, small-scale or large-scale fermentation (including continuous, batch,
fed-batch, or solid state fermentations) in laboratory or industrial fermentors performed in a suitable medium and under conditions allowing the polypeptide to be expressed and/or isolated The cultivation takes place in a suitable nutrient medium comprising carbon and nitrogen sources and inorganic salts, using procedures known in the art Suitable media are available from commercial suppliers or may be prepared according to published compositions (e g , in catalogues of the American Type Culture Collection) If the polypeptide is secreted into the nutrient medium, the polypeptide can be recovered directly from the medium If the polypeptide is not secreted, it can be recovered from cell lysates The polypeptides may be detected using methods known in the art that are specific for the polypeptides These detection methods may include use of specific antibodies, formation of an enzyme product, or disappearance of an enzyme substrate For example, an enzyme assay may be used to determine the activity of the polypeptide as described herein
The resulting polypeptide may be recovered by methods known in the art For example, the polypeptide may be recovered from the nutrient medium by conventional procedures including, but not limited to, cent fugation, filtration, extraction, spray-drying, evaporation, or precipitation The polypeptides may be purified by a variety of procedures known in the art including, but not limited to, chromatography (e g , ion exchange, affinity, hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures (e g , preparative isoelectnc focusing), differential solubility (e , ammonium sulfate precipitation), SDS-PAGE, or extraction (see, e g , Protein Purification, J -C Janson and Lars Ryden, editors, VCH Publishers, New York, 1989)
Detailed description of the invention
A Pichia cell comprising at least one copy of a heterologous nucleotide sequence encoding a polypeptide of interest, wherein the codon usage of said sequence has been adjusted to match the preferred codon usage of P methanolica, as defined herein
The specific Pichia cell may be any Pichia cell, such as a Pichia methanolica, Pichia pastoris, Pichia anomala, Pichia stipitis, Pichia angusta or Pichia guilliermondn cell Preferably the specific Pichia cell is a Pichia methanolica cell
A preferred embodiment of the invention relates to a Pichia cell of the first aspect of the invention, wherein the nucleotide sequence encodes an enzyme, preferably the nucleotide sequence encodes an enzyme having phytase or xylanase activity, or even more preferably the nucleotide sequence comprises a DNA sequence at least 90% identical, or preferably at least 93% identical, or more preferably at least 95% identical, or most preferably at least 97% identical to the sequences shown in SEQ ID No 3, SEQ ID No 19, or to the phytase encoding sequence comprised in a cell of DSM 13352
As is discussed elsewhere herein, there are a number of ways to introduce a heterologous DNA sequence into a host cell known in the art
A further embodiment relates to a Pichia cell of the first aspect, wherein the nucleotide sequence is present on an extrachromosomal DNA construct, preferably a plasmid, or more preferably the nucleotide sequence is integrated in the cell's chromosome, preferably in multiple copies
Also well known in the art are ways of expressing heterologous DNA sequences encoding polypeptides of interest in host cells Several promoters have been described to work well in Pichia cells as described above Accordingly a preferred embodiment of the invention relates to a Pichia cell of the first aspect, wherein the nucleotide sequence is transcribed from a promoter of a
methanol-inducible P methanolica gene, preferably the promoter is a native promoter of a P methanolica gene encoding an enzyme selected from the group consisting of alcohol oxidase, dihydroxyacetone synthase, formate dehydrogenase, and catalase
For industrial production of polypeptides it is of considerable interest to have the polypeptides secreted from the producing host cells in order to minimize costs of isolating the polypeptide, ways of promoting efficient secretion are well known in the art A polypeptide may be efficiently secreted if it carries an effective secretion signal peptide recognized by the host cell's secretion apparatus A number of secretion signal peptides have been described to work well in yeast host cells as described elsewhere herein The present inventors found that even without the addition of a well-characterized yeast secretion signal peptide, efficient secretion could be achieved from a host cell of a polypeptide carrying its native original secretion signal sequence
A preferred embodiment of the invention relates to a Pichia cell of the first aspect, wherein the polypeptide comprises its native original secretion signal sequence The present invention relies on adjusting the codon usage of an isolated DNA sequence to match the preferred codon usage of Pichia methanolica An isolated DNA construct comprising at least one copy of a nucleotide sequence encoding a polypeptide heterologous to Pichia methanolica, wherein the codon usage of said sequence has been adjusted to match the preferred codon usage of P methanolica, as defined herein
A preferred embodiment of the invention relates to an isolated DNA construct of the second aspect of the invention, wherein the nucleotide sequence encodes an enzyme, preferably the nucleotide sequence encodes an enzyme having phytase or xylanase activity, and more preferably the nucleotide sequence comprises a DNA sequence at least 90% identical, or preferably at least 93% identical, or more preferably at least 95% identical, or most preferably at least 97% identical to the sequences shown
in SEQ ID No 3, SEQ ID No 19, or to the phytase encoding sequence comprised in a cell of DSM 13352
Another preferred embodiment of the invention relates to an isolated DNA construct of the second aspect of the invention, wherein the nucleotide sequence is transcribed from a promoter of a methanol-inducible P methanolica gene, preferably the promoter is a native promoter of a P methanolica gene encoding an enzyme selected from the group consisting of alcohol oxidase, dihydroxyacetone synthase, formate dehydrogenase, and catalase Other promoters generally recognized as working well in Pichia cells are those of the glycolytic enzymes, such as glyceraldehyde-3-phosphate dehydrogenase, triose phosphate isomerase, enolase etc
Further a preferred embodiment of the invention relates to an isolated DNA construct of the second aspect of the invention, wherein the nucleotide sequence encodes a polypeptide comprising its native original secretion signal sequence
As mentioned previously the invention relates to industrial production, l e a method of producing a polypeptide of interest in a Pichia cell, where the polypeptide is encoded by a nucleotide sequence heterologous to P methanolica, said method comprising the steps of a) adjusting the codon usage of the sequence to match the preferred codon usage of P methanolica, as defined herein, and b) cultivating a Pichia cell comprising at least one copy of the codon usage adjusted sequence of step a) under appropriate growth conditions to express the sequence and achieve production of the polypeptide of interest
A preferred embodiment of the invention relates to a method of the third aspect, wherein the Pichia cell is a Pichia methanolica cell Yet another preferred embodiment of the invention relates to a method of the third aspect, wherein the nucleotide sequence encodes an enzyme, preferably the
nucleotide sequence encodes an enzyme having phytase or xylanase activity, or more preferably the nucleotide sequence comprises a DNA sequence at least 90% identical, or preferably at least 93% identical, or more preferably at least 95% identical, or most preferably at least 97% identical to the sequences shown in SEQ ID No 3, SEQ ID No 19, or to the phytase encoding sequence comprised in a cell of DSM 13352
The present inventors succeeded in obtaining much improved yields of polypeptides of interest by using a method of the third aspect of the invention, in one non-limiting example they even obtained a yield that exceeded 10 times the yield obtained with the conventional method Consequently a preferred embodiment relates to a method of the third aspect, wherein the polypeptide is produced in a yield at least three times higher than what is obtained from an otherwise identical cell comprising a sequence not adjusted to the preferred codon usage of P methanolica, as defined herein, preferably at least five times higher, more preferably seven, even more preferably eight, yet more preferably nine, and most preferably ten times higher than what is obtained from an otherwise identical cell comprising a sequence not adjusted to the preferred codon usage of P methanolica, as defined herein
In parallel with the two first aspects, preferred embodiments relate to methods of the third aspect, wherein the nucleotide sequence is transcribed from a promoter of a methanol-inducible P methanolica gene, preferably the promoter is a native promoter of a P methanolica gene encoding an enzyme selected from the group consisting of alcohol oxidase, dihydroxyacetone synthase, formate dehydrogenase, and catalase, and preferably methanol is added to the cell during the cultivating step, whereby the methanol-inducible promoter is induced, it is also preferred that the nucleotide sequence encodes a polypeptide comprising its native original secretion signal sequence
Material and methods
Purchased material (E coli, plasmid and kits)
E coli DH12S (Gibco BRL) and E coii JM109 (Toyobo) were used for plasmid construction and amplification Amplified plasmid was recovered with Qiagen® Plasmid Kit (Qiagen) Ligation was done with DNA ligation kit (Takara) or T4 DNA ligase (Boehnnger Mannheim) Polymerase Chain Reaction (PCR) was carried out either with Taq DNA polymerase (Boehnnger Mannheim) or with Expand ™ PCR system especially Expand™ Long Template (Boehnnger Mannheim) QIAquick™ Gel Extraction Kit (Qiagen) was used for the purification of PCR fragments and extraction of DNA fragment from agarose gel A plasmid pT7blue T-vector (Novagen) was used for the cloning of PCR fragments
Expression host strain
The expression host strain is a Pichia methanolica PMAD16 (ade2- pep4ΔprblΔ) described in WO 99/14347 It is a vacuolar protease deficient mutant originally generated from P methanolica CBS 6515 Two major vacuolar proteinases, l e Protemase A encoded by the pep4 gene and Protemase B encoded by the prbl gene, are inactivated by disrupting these genes (WO 99/14347)
Expression plasmid
The expression cassette plasmid pCZR134 is described in WO 99/14347 It comprises the promoter and terminator of the alcohol oxidase gene AUG1 from P methanolica (WO 99/14347) and the ADE2 gene encoding phosphoπbosyl-5-amιnoιmιdazole carboxylase (WO 99/14347) from P methanolica Another expression cassette plasmid is pGAP, which is identical to pCZR134 but contains the promoter of the gene encoding the glyceraldehyde-3-phophate
dehydrogenase GAP1 from P methanolica (described in WO00/56903, SEQ ID No therein) instead of the AUG1 promoter as described above
Transformation of P methanolica Transformation of Pichia methanolica can be done using the general methods for yeast transformation As the most preferred method, the procedure of electro transformation is described below The P methanolica host strain is grown in non-selective YPD medium (20g/l of Pepton, Difco, 10g/I of yeast extract, Difco and 20g/l of glucose) at 37°C for overnight with vigorous aeration, and the culture is diluted 100-fold in 200ml fresh, pre-warmed YPD This is further cultivated at 37°C until the optical density at 663nιm becomes 1 2-1 5, which corresponds to 9x107 cells/ml Cells are collected by centnfugation at 3000 g for 10 minutes, and re-suspended in 40ml of 50mM potassium phosphate buffer pH 7.5 containing 25mM of dithiothreitol (DTT), then incubated at 37°C for 15 minutes. The cells are washed twice with ice-cold electroporation buffer STM (270mM sucrose, 10mM Tris pH7 5 and 1 mM CaCI2), at first with 200 ml (the original culture volume), then with 100 ml (50% original volume) Finally the cells are re-suspended in 1 ml (0 5% of original volume) ice-cold STM buffer to a concentration of approximately 2x1010 electrocompetent cells/ml
0 1-10 μg of DNA (volume < 10μl) is added to 100μl of electrocompetent cells and placed into an ice-cold electroporation cuvette with an electrode gap of 2mm A pulsed electric field is provided at 3 75 kV/cm for 20 millisecond, using a BioRad Gene Pulser™ electroporator on which resistance is set to >600 Ω or "mfitine' and capacitance is set to 25 μF After delivering the pulse, 1 ml of YPD is added to the cell/DNA mixture and incubated for 1 hour at 30°C Cells are harvested by centnfugation at 3000 g for 5 minutes, then washed with
1 ml of yeast nitrogen base (6 7 g/l yeast nitrogen base without ammo acids, Difco), and
plated on minimal selective media, e g SC glucose Host cells having an ade2 mutation appear as pink colonies on minimal medium, whereas transformants that acquired a stable integration (Ade+) can grow rapidly as white colonies on minimal medium
Example 1
A cDNA clone of the Thermomyces xylanase gene
The cDNA sequence of the Thermomyces lanuginosus DSM4109 xylanase gene was published (WO 96/32472) For this study we obtained A Saccharomyces cerevisiae strain, DSM 10133, containing a plasmid comprising the full-length xylanase cDNA of the Thermomyces lanuginosus DSM4109
Construction of expression plasmid pCaXH
The Thermomyces xylanase cDNA was amplified by PCR using primers xyN and xyC to introduce desirable restriction enzyme cleavage sites at both ends of the PCR fragment xyN (SEQ ID 1 ) CCGGAATTCATGGTCGGCTTTACCCCCGTT XyC (SEQ ID 2) AAGGAAAAAAGCGGCCGCACTAGTTTAGCCCA CGTCAGCAAC
The PCR reaction mixture comprised 50 ng of template cDNA, 0 05mM of dNTPs, 100 pmol of each primer and 1 unιt of Taq polymerase in 50μl of Taq buffer with MgCI2 (Boehnnger Mannheim) The reaction was initiated at 94°C for 3 5 minutes followed by 30 cycles of 94°C for 1 5 minutes, 50°C for 1 5 minutes and 72°C for 3 minutes, with the final extension at 72°C for 10 minutes The amplified 0 7kb fragment was digested with EcoRI and Spel, and purified on 1 5% agarose gel Obtained fragment was subjected to the ligation with pCZR134 linearized by EcoRI and Spel The resulting
plasmid was termed pCaXM .
Gene synthesis of Themomyces xylanase gene
The Themomyces xylanase gene was synthesized to change the codon usage to have A or T as the preferred third letter of each codon. This is not a full optimization to the preferred codon usage of Pichia methanolica, as defined herein, but rather a first attempt to see if there is any advantage in optimizing the codons further. Two fragments XyC12 and XyC37 were synthesized by two-step PCR. The resulting designed xylanase encoding sequence is shown as SEQ ID 3.
Oliqonucleotides used for the first PCR reaction were as follows:
For the synthesis of XyC12: xyC1 (SEQ ID 4): atggttggttttactccagttgctttagctgctttagctgctactggtgctttagctttc ccagcaggtaatgctactgaattagaaaaaagacaaacaactccaaattctgaaggttgg. xyC2r (SEQ ID 5): cataggtaccaccttctaaattagtataagtagcttgagcaccaccatca gaccaccaagaataataataaccatcatgccaaccttcagaatttggaga.
For the synthesis of XyC37: xyC3 (SEQ ID 6): ggtacctatgaaatttcttggggtgatggtggtaatttagttggtggtaaaggttggaat ccaggtttaaatgcaagagctattcattttgaaggtgtttatcaaccaaatggtaattct. xyC4r (SEQ ID 7): ctagtagttttacctaatctataaatagaaccatcacattcaacagtacctaaatcag tagcaccagaagatggatcataagtaccaaaattttcaacaatataatattcaactaatgga. xyC5 (SEQ ID 8): ttctggtgctactgatttaggtactgttgaatgtgatggttctatttatagattaggtaa aactactagagttaatgcaccatctattgatggtactcaaactttcgatcaatattggtc. xyC6r (SEQ ID 9): taataatgatcaccattaacattcaaaccagctctagcccaagcatcgaaatgac aaccagtttgaacagtaccagaagttcttttatcttgtctaacagaccaatattgatcgaaagtt.
xyC7 (SEQ ID 10) ttaatggtgatcattattatcaaattgttgcaactgaaggttatttctcttctggttatg ctagaattactgttgctgatgttggttaa
Oliqonucleotides used for the second PCR reaction were as follows For the synthesis of XyC 12 xyc12N (SEQ ID 11 ) ataagaatgcggccgcatggttggttttactccagt xyc12C (SEQ ID 12) tcataggtaccaccttctaaattagtataa
For the synthesis of XyC37 xyc37N (SEQ ID 13) ttagaaggtggtacctatgaaatttcttggggtgat xyc37C (SEQ ID 14) ttttccttttgcggccgcttaaccaacatcagcaac
The first PCR reaction mixture contained 1 pmol of each oligonucleotide, 0 35mM dNTP, 2 6 units of Expand™ polymerase in 50μl of provided buffer Reaction started at 94°C for 1 minute followed by 6 cycles of 94°C for 0 5 minutes, 50°C for 5 minutes and 72°C for 1 minute with the final extension at 72°C for 10 minutes Five μl of resulting first PCR reaction solution was used for the second PCR with 0 35mM dNTP, 100 pmol of each oligonucleotide and 2 6 units of Expand™ polymerase in 50μl of provided buffer The reaction was carried out at 94°C for 3 5 minutes first followed by 30 cycles of 94°C for 1 5 minutes, 50°C for 1 5 minutes and 72°C for 3 minutes with the final extension at 72°C for 10 minutes The amplified fragments were purified on 4% agarose gel then ligated into pT7blue T-vector resulting pTXyC12-7 and pTXyC37-715 respectively They were confirmed to have no PCR error by sequencing A Kpnl fragment of around 200bp was isolated from pTXyC12-7 then ligated with a 3 7kb of Kpnl fragment from pTXyC37-715 The resulting plasmid pTXyC17-1 comprises the sequence of the codon-optimized synthesized Themomyces xylanase gene (SEQ ID 3)
Construction of expression plasmid pCaXS3
The synthesized Thermomyces xylanase gene was amplified by PCR using the following primers;
a) (SEQ ID 15): CCGGAATTCATGGTTGGTTTTACTCCAGTTGCT b) (SEQ ID 16): GGACTAGTTTAACCAACATCAGCAACAGTAAT
The reaction mixture comprised 10 ng of synthesized xylanase gene, 0.35 mM of dNTP, 100 pmol of each primer and 2.6 units of Expand™ polymerase in 50μl of provided buffer with MgCI2. The PCR reaction was initiated at 94°C for 3.5 minutes followed by 30 cycles of 94°C for 1.5 minutes, 55°C for 1.5 minutes and 68°C for 3 minutes with the final extension at 72°C for 10 minutes. The amplified gene was digested with EcoRI and Spel, and ligated into pCZR134 cut by EcoRI and Spel beforehand and the resulting plasmid was termed pCaXS3
Transformation of P. methanolica with pCaXH and pCaXS3
The expression plasmids pCaXM and pCaXS3 were digested with Notl and
6.2kb fragments were used for the transformation of P. methanolica PMAD16 to achieve a Ade+ phenotype. Transformed cells were grown on SC glucose (2% glucose, 0.01 % tryptophan, 0.02% threonine, 0.5% casamino acid, in basal salt without ammo acids (per litre, 6.68g yeast nitrogen base w/o ammo acid (Difco), 10g succinic acid, 6g NaOH, and
2% Agar Noble (Difco) containing 1 2Msorbιtol) Rapidly growing white colonies were isolated as stable transformants. Obtained transformants were grown on YPM agar (20g/l of Pepton, Difco, 10g/I of yeast extract, Difco, 20g/l of Agar Noble, Difco and 20ml/l of methanol) containing
0 1 % AZCL™-xylan (Megazyme™) for 3-5 days at 30°C Positive transformants were identified as the colonies surrounded by a blue halo
Xylanase producing cells were inoculated in YPD medium (20g/l of Pepton, Difco, 10g/I of yeast extract, Difco and 20g/l of glucose) in a 500 ml shaking flask, and cultivated under vigorous aeration overnight at 30°C One ml of methanol was added to each shaking flask and it was cultivated one more day Cells were removed by centnfugation and xylanase activity in the supernatant was measured using 0 5% Azo-Wheat Arabmoxylan (Megazyme™) as substrate The measurement reaction was employed at 30°C, pH 6 for 30 minutes and released colour was measured as the absorbance at 585nm The amount of enzyme protein was estimated from the activity The results are shown below, our conclusion to this first attempt is that the codon optimization did indeed have a significant positive effect on the obtainable yield from a Pichia cell However, since we did not fully optimize the codons of the xylanase encoding sequence to match the preferred codon usage of P methanolica, it may be possible to obtain even greater yields by adjusting the sequence thoroughly
Strain Plasmid Gene Yield in shaking flask
PXI242 pCaXM intact gene 12 mg/l
PXS5 pCaXS3 synthesized gene 20 mg/l
A transformant carrying the synthesized Thermomyces xylanase gene was inoculated in two flasks with 110ml of YEPD ( 10g/I Bacto yeast extract, 20g/l Bacto peptone and 20g/l glucose) in 500ml shaking flasks then cultivated under vigorous aeration condition at 30°C for 16-18 hours Most of the cells were used for the subsequent inoculation of the fermentor The fermentation vessel was filled with 2 liters of media containing 43g (NH4)2S04, 5 2g K2HP04, 19g KH2P04, 0 8g FeS04 7H20, 2 3g sodium citrate with 2aq After autoclavmg and cooling down to 30°C, 94ml of 60%
glucose, 40ml of 1 M MgS04 7H20, 4ml of 1 M CaCI2, 40ml of trace metal solution (containing, per litter, 27 8g Fe S04 7H20, 0 5g CuS04 5H20, 1 09g ZnCI2, 1 35g MnS04 H20 0 48g CoCI2 6H20, 0 24g Na2Mo04 2 H20, 0 5g H3B03, 0 08g Kl, 0 26g NιS04 6 H20, 0 56g thiamme hydrochlonde, 5mg biotin, 1 ml H2S04) and 4ml of vitamin solution (containing, per litter, 47g mositol, 25g calcium pantothenate, 1 3g pyridoxme hydrochlonde, 5 6g thiamme hydrochlonde, 0 1g biotin) were added The pH of the fermentation media was adjusted to 5 0 and during the fermentation pH was controlled automatically with 25% ammonia water and 10% H3P0 Aeration was provided as compressed air at a flow rate of 3 litters per minute initially and agitation at the rate of 100 rpm After inoculation the agitation was controlled automatically from 100 rpm to 1000 rpm to keep the dissolved oxygen level at minimum 30% of saturation Following exhaustion of glucose after inoculation, a glucose feed was initiated at 12 hours at the rate of 0 27% glucose/hour The feed was increased stepwise to 0 38% glucose/hour at 18 hours and to 0 5% glucose/hour at 30 hours After 40 hours, methanol induction of the AUG1 promoter was started by methanol feed at the rate of 0 3% methanol/hour, in parallel the glucose feed rate was reduced to 0 38% glucose/hour Finally mixed feeding (0 6% methanol/hour, 0 27% glucose/hour) was run for the rest of fermentation Increasing methanol feeding elevated the enzyme productivity The yield of Thermomyces xylanase was calculated to be 750 mg/l with the 110g/kg of dried cells at 170 hours
Example 2 cDNA clone of A fumiqatus phytase gene
The sequence of the phytase gene from Aspergillus fumigatus ATCC 13073 was published previously (EP 0897010) This phytase contains an ammo acid mutation Q27L, and the phytase of the invention comprises a further mutation at the second ammo acid,
V2I, due to the introduction of a restriction enzyme cleavage site around the translation initiation codon
Construction of expression plasmid pCaPhy3 A cDNA of A fumigatus phytase gene was amplified by PCR using the primers shown below,
A1 ) (SEQ ID 17) CCGGAATTCATGATTACTCTGACTTTCCTGCTTTCGGCGG CGTATCTGCTTTCTGGTAGAGTGTCTGCGGCACCTAGTTCTGCTGGCTC B1 ) (SEQ ID 8) CTAGACTAGTCTAGTCAACTAAAGCACTCTCCCCA
The reaction mixture comprising 10 ng of cDNA clone of A fumigatus phytase, 0 35 mM dNTPs, 100 pmol of each primers and 2 6 units of Expand™ polymerase in 50μl of provided buffer with MgCI2, was incubated at 94°C for 2 minutes followed by 10 cycles of 94°C for 10 seconds, 55°C for 30 seconds, 68°C for 45 seconds After that another 20 cycles were performed as 94°C for 10 seconds, 55°C for 30 seconds, 68°C for 45 plus n x 20 seconds (n cycle number)
The final extension was at 68°C for 7 minutes Amplified 1 3kb fragment was purified on 1 2% agarose gel and digestion with EcoRI and Spel It was used for the ligation with pCZR134 digested with the same restriction enzymes and the obtained plasmid was termed pCaPhy3
Construction of the expression plasmid pCPS40
DNA encoding the A fumigatus phytase was synthesized, the sequence was designed based on the codon usage of two genes from Pichia methanolica, i e ADE1 and AUG1 Codon usage of designed sequence is shown in table 1
This designed codon optimized phytase DNA sequence is shown in (SEQ ID 19).
Table 1.
Seventy oligonucleotides (SEQ ID's 20-89) were prepared for the construction of the codon optimized phytase gene, and subjected to PCR using Expand ™ Long Template (Boehnnger Mannheim) for assembly. The reaction mixture comprised 125 pmol of each of the seventy oligonucleotides shown in SEQ ID's 20-89, 0.2 mM dNTPs and 2.6 units of Expand ™ polymerase in 50μl of provided buffer. The reaction was employed as 55 cycles of 94°C for 30 seconds, 52°C for 30 seconds and 68°C for 30 seconds. The second PCR was carried out using the resulting first reaction mixture as template and the following set of primers:
c) (SEQ ID 90): CGGAATTCATGATTACTTTAACTTTCTTATTATCAGCTTATTTA. d) (SEQ ID 91 ): GGACTAGTTCAAGAAAAACATTCACCCC.
The reaction mixtures comprised 0.5μl, 1 μl or 2μl of the first PCR mixture, 30 pmol of each primer, 0 2 mM dNTPs and 2.6 units of Expand ™ polymerase in 50μl of provided buffer The reaction was employed as 25 cycles of 94°C for 30 seconds, 50°C for 30 seconds and 68°C for 60 + 10 x n seconds (n cycle number) An amplified 1 3kb fragment was purified with QIAquick and cloned into pT7blue T-vector Two independent piasmids pTPhyδ and pTPhyδ were selected for further construction, they had a couple of PCR generated sequence errors within the Hindlll— Spel region of pTPhyδ and the EcoRI-Hindlll region of pTPhyδ
Each plasmid was digested with Hindlll, and a 3 4kg fragment from pTPhyδ and a 0 7kb fragment from pTPhyδ were isolated and ligated together, resulting in the plasmid pTPhy58, which comprises the entire sequence of the synthetic A fumigatus phytase gene
The plasmid pTPhyδδ was digested with EcoRI and Spel, and a 1 3kb fragment was isolated and ligated into the expression vector pCZR134 The resulting phytase expression plasmid was designated pCPS40
The entire sequence of the synthetic codon optimized A fumigatus phytase gene (SEQ ID No 19) is comprised in the strain E coll NN049526, which was deposited with DSMZ 01-March-2000 with the accession number DSM 13352 The deposit was made by Novo Nordisk A/S and was later assigned to Novozymes A/S
Transformation of P methanolica with pCaPhy3 and pCPS40
The expression piasmids pCaPhy3 and pCPS40 were digested with Notl and
6 7kb fragments were used for the transformation of P methanolica PMAD16
Transformed cells were grown on minimal medium SC glucose containing 1 2 M sorbitol White colonies growing rapidly were isolated as stable transformants The obtained transformants were subsequently grown on YPM agar containing 0 1 % phytic acid
(Sigma P3810) at 30°C overnight, then 0.1 M CaCI2 was poured onto the grown agar plate to detect phytase production. Positive transformants were detected as colonies surrounded by a clear halo.
Phytase producing strains were inoculated in 50ml of YPD medium in 500ml of shaking flasks and cultivated at 30°C with vigorous aeration. After overnight cultivation, 1 ml of methanol was added to each shaking flask and the cultivation was continued one more day. Cells were removed by centnfugation and phytase activity in the supernatant was measured using 0.15% sodium phytate in 100mM sodium acetate buffer pH 5.5 as the substrate. The reaction was carried out at room temperature for 15 minutes, and liberated phosphate ions were quantified by colorization with 1 % ammonum molybdate, 0.26M FeS04 in 3.2% H2S04 and absorbance was measured at 650nm. The amount of enzyme protein was calculated from the activity based on the known specific activity per mg enzyme protein. The results are shown in Table 2.
Table 2.
Strain Plasmid Used gene Yield in shaking flask
PPi 51 pCaPhy3 cDNA 1 mg/l
PPs237 PCPS40 codon optimized 20 mg/l
The transformants mentioned above were cultivated as described in example 1.
The yield of A. fumigatus phytase from a transformant generated with pCaPhy3, carrying non codon optimized cDNA of the phytase gene, was calculated to be 310 mg/l with 120 g/kg of dried cells at 170 hours of fermentation, whereas the yield from the transformant with a codon-optimised gene was estimated to be 3.5 g/l with 115 g/kg of dried cells at 190 hours of fermentation.
Thus the yield from the codon optimized transformant was approximately 10 times higher than what could be obtained from the transformant carrying the original
cDNA phytase gene.
Example 3
Expression of the A. fumigatus phytase from the GAP-promoter A synthesized A. fumigatus phytase gene was obtained from pCPS40 as described in Example 2 by digesting with the restriction enzymes EcoRI and Spel. The isolated 1.3kb fragment was ligated into the pGAP vector, which was initially pre-digested with the same enzymes, the resulting plasmid was termed pGAPphy. The plasmid pGAPphy was then digested with Notl and the 6.9kb fragment was used for the transformation of P.methanolica PMAD16. Transformants were isolated as described in Example 2.
Obtained transformants were cultivated in 5ml of YPD medium at 30°C overnight with vigorous aeration, then 1 ml of grown culture was transferred to fresh 100ml of YPD medium in a shaking flask. After one-day cultivation at 30°C, 2ml of 20% glucose solution was added to each flask and cultivation was continued for one more day. Enzyme activity of the culture broth, from which the grown cells were removed, was measured to estimate the enzyme productivity. The estimated phytase productivity was 47mg/l.
A transformant carrying a synthesized Aspergillus fumigatus phytase gene was fermented in 5L jar-fermenter. The fermentation vessel was filled with 2 liters of medium containing 43g (NH4)2S04, 5.2g K2HP04, 19g KH2P04, 0.8g Fe S04.7H20, 2.3g sodium citrate with 2aq. After autoclaving and cooling down to 30°C, 94ml of 60% glucose, 40ml of 1 M MgS04.7H20, 4ml of 1 M CaCI2, 40ml of trace metal solution and 4ml of vitamin solution were added. The pH of fermentation media was adjusted to 5.0 and during the fermentation it was controlled automatically above 5.0 with 25% ammonia water. Aeration was provided as compressed air at a flow rate of 3 litres per minute initially and agitation at the rate of 100 rpm. After inoculation the agitation was controlled automatically from 100 rpm to 1000 rpm to keep the dissolved oxygen level at minimum 30% of saturation. Around 11 hours after inoculation, 1100ml of 60% glucose was supplemented. The yield of Aspergillus fumigatus phytase at 50 hours was calculated to 1000 mg/l with the 65 g/kg of dried cells.
Claims
1 A Pichia cell comprising at least one copy of a heterologous nucleotide sequence encoding a polypeptide of interest, wherein the codon usage of said sequence has been adjusted to match the preferred codon usage of P methanolica, as defined herein
2 The cell of claim 1 , which is a Pichia methanolica cell
3 The cell of claim 1 or 2, wherein the nucleotide sequence encodes an enzyme
4 The cell of claim 1 - 3, wherein the nucleotide sequence encodes an enzyme having phytase or xylanase activity
5 The cell of claim 1 , wherein the nucleotide sequence comprises a DNA sequence at least 90% identical to the sequences shown in SEQ ID No. 3, SEQ ID No 19, or to the phytase encoding sequence comprised in a cell of DSM 13352
6 The cell of claims 1 - 5, wherein the nucleotide sequence is present on an extrachromosomal DNA construct, preferably a plasmid
7 The cell of claims 1 - 5, wherein the nucleotide sequence is integrated in the cell's chromosome
δ The cell of claims 1 - 7, wherein the nucleotide sequence is transcribed from a promoter of a methanol-inducible P methanolica gene
9 The cell of claim δ, wherein the promoter is a native promoter of a P methanolica gene encoding an enzyme selected from the group consisting of alcohol oxidase, dihydroxyacetone synthase, formate dehydrogenase, and catalase.
10. The cell of claims 1 - 9, wherein the polypeptide comprises its native original secretion signal sequence.
11. An isolated DNA construct comprising at least one copy of a nucleotide sequence encoding a polypeptide heterologous to Pichia methanolica, wherein the codon usage of said sequence has been adjusted to match the preferred codon usage of P. methanolica, as defined herein.
12. The DNA construct of claim 11 , wherein the nucleotide sequence encodes an enzyme.
13. The DNA construct of claim 11 , wherein the nucleotide sequence encodes an enzyme having phytase or xylanase activity.
14. The DNA construct of claim 11 , wherein the nucleotide sequence comprises a DNA sequence at least 90% identical to the sequences shown in SEQ ID No. 3, SEQ ID No. 19, or to the phytase encoding sequence comprised in a cell of DSM 13352.
15. The DNA construct of claims 11 - 14, wherein the nucleotide sequence is transcribed from a promoter of a methanol-inducible P. methanolica gene.
16. The DNA construct of claim 15, wherein the promoter is a native promoter of a P methanolica gene encoding an enzyme selected from the group consisting of alcohol oxidase, dihydroxyacetone synthase, formate dehydrogenase, and catalase.
17. The DNA construct of claims 11 - 16, wherein the nucleotide sequence encodes a polypeptide comprising its native original secretion signal sequence.
18. A method of producing a polypeptide of interest in a Pichia cell, where the polypeptide is encoded by a nucleotide sequence heterologous to P. methanolica, said method comprising the steps of: a) adjusting the codon usage of the sequence to match the preferred codon usage of P. methanolica, as defined herein; and b) cultivating a Pichia cell comprising at least one copy of the codon usage adjusted sequence of step a) under appropriate growth conditions to express the sequence and achieve production of the polypeptide of interest.
19. The method of claim 18, wherein the Pichia cell is a Pichia methanolica cell.
20. The method of claim 18 or 19, wherein the nucleotide sequence encodes an enzyme.
21. The method of claim 18 or 19, wherein the nucleotide sequence encodes an enzyme having phytase or xylanase activity.
22. The method of claim 18 or 19, wherein the nucleotide sequence comprises a DNA sequence at least 90% identical to the sequences shown in SEQ ID No. 3, SEQ ID No. 19, or to the phytase encoding sequence comprised in a cell of DSM 13352.
23. The method of claims 18 - 22, wherein the polypeptide is produced in a yield at least three times higher than what is obtained from an otherwise identical cell comprising a sequence not adjusted to the preferred codon usage of P. methanolica, as defined herein. (Preferably at least five times higher, preferably seven, more preferably eight, even more preferably nine, and most preferably ten times higher.)
24. The method of claims 18 — 23, wherein the nucleotide sequence is transcribed from a promoter of a methanol-inducible P. methanolica gene.
25. The method of claim 24, wherein the promoter is a native promoter of a P methanolica gene encoding an enzyme selected from the group consisting of alcohol oxidase, dihydroxyacetone synthase, formate dehydrogenase, and catalase.
26. The method of claim 24 or 25, wherein methanol is added to the cell during the cultivating step, whereby the methanol-inducible promoter is induced.
27. The method of claims 18 - 26, wherein the nucleotide sequence encodes a polypeptide comprising its native original secretion signal sequence.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2001239205A AU2001239205A1 (en) | 2000-03-10 | 2001-03-09 | Compositions and methods for producing high yields of heterologous polypeptides in a pichia cell |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DKPA200000392 | 2000-03-10 | ||
DKPA200000392 | 2000-03-10 | ||
DKPA200000419 | 2000-03-15 | ||
DKPA200000419 | 2000-03-15 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2001066693A1 true WO2001066693A1 (en) | 2001-09-13 |
Family
ID=26068787
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/DK2001/000154 WO2001066693A1 (en) | 2000-03-10 | 2001-03-09 | Compositions and methods for producing high yields of heterologous polypeptides in a pichia cell |
Country Status (2)
Country | Link |
---|---|
AU (1) | AU2001239205A1 (en) |
WO (1) | WO2001066693A1 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1176205A2 (en) * | 2000-07-25 | 2002-01-30 | Roche Diagnostics GmbH | Expression of alkaline phosphatase in yeast |
WO2004035771A1 (en) * | 2002-10-16 | 2004-04-29 | Biopsytec Analytik Gmbh | Mutated nucleic acid of a cel i-endonuclease and method for producing the recombinant, full-length cel i-protein |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1996032472A1 (en) * | 1995-04-11 | 1996-10-17 | Novo Nordisk A/S | Bread-improving additive comprising a xylanolytic enzyme |
WO1997017450A2 (en) * | 1995-11-09 | 1997-05-15 | Zymogenetics, Inc. | Compositions and methods for producing heterologous polypeptides in pichia methanolica |
EP0911404A2 (en) * | 1997-09-24 | 1999-04-28 | Rhone Poulenc Agro | Recoding DNA sequences for expression in yeast and transformed yeasts obtained |
WO1999049022A1 (en) * | 1998-03-23 | 1999-09-30 | Novo Nordisk A/S | Phytase variants |
WO1999066040A1 (en) * | 1998-06-18 | 1999-12-23 | Zymogenetics, Inc. | Immunomodulator polypeptide, zsig57 |
-
2001
- 2001-03-09 WO PCT/DK2001/000154 patent/WO2001066693A1/en active Application Filing
- 2001-03-09 AU AU2001239205A patent/AU2001239205A1/en not_active Abandoned
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1996032472A1 (en) * | 1995-04-11 | 1996-10-17 | Novo Nordisk A/S | Bread-improving additive comprising a xylanolytic enzyme |
WO1997017450A2 (en) * | 1995-11-09 | 1997-05-15 | Zymogenetics, Inc. | Compositions and methods for producing heterologous polypeptides in pichia methanolica |
EP0911404A2 (en) * | 1997-09-24 | 1999-04-28 | Rhone Poulenc Agro | Recoding DNA sequences for expression in yeast and transformed yeasts obtained |
WO1999049022A1 (en) * | 1998-03-23 | 1999-09-30 | Novo Nordisk A/S | Phytase variants |
WO1999066040A1 (en) * | 1998-06-18 | 1999-12-23 | Zymogenetics, Inc. | Immunomodulator polypeptide, zsig57 |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1176205A2 (en) * | 2000-07-25 | 2002-01-30 | Roche Diagnostics GmbH | Expression of alkaline phosphatase in yeast |
EP1176205A3 (en) * | 2000-07-25 | 2002-05-22 | Roche Diagnostics GmbH | Expression of alkaline phosphatase in yeast |
US6884602B2 (en) | 2000-07-25 | 2005-04-26 | Roche Diagnostics Operations, Inc. | Expression of alkaline phosphatase in yeast |
EP1792982A3 (en) * | 2000-07-25 | 2007-06-13 | Roche Diagnostics GmbH | Codon-optimised alkaline phosphatase and its expression in yeast |
WO2004035771A1 (en) * | 2002-10-16 | 2004-04-29 | Biopsytec Analytik Gmbh | Mutated nucleic acid of a cel i-endonuclease and method for producing the recombinant, full-length cel i-protein |
Also Published As
Publication number | Publication date |
---|---|
AU2001239205A1 (en) | 2001-09-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4620253B2 (en) | Promoter for gene expression in fungal cells | |
EP1266011B1 (en) | Fungal transcriptional activator useful in methods for producing polypeptides | |
EP2681308B1 (en) | Protein expression | |
EP2044203B1 (en) | Methods for increasing expression of genes in a fungal cell | |
US7527947B2 (en) | Signal peptide for producing a polypeptide | |
EP1625219B1 (en) | Methods for producing secreted polypeptides | |
JP2011101651A (en) | Oxaloacetate hydrolase deficient fungal host cell | |
NO179078B (en) | Replicable expression vector, its use and transformed host | |
EP1250444A1 (en) | Methods for producing a polypeptide using a consensus translational initiator sequence | |
EP1078080A1 (en) | Methods for producing polypeptides in filamentous fungal mutant cells | |
US9701970B2 (en) | Promoters for expressing genes in a fungal cell | |
WO2003070957A2 (en) | Plant polypeptide production | |
US5770371A (en) | Modification of cryptic splice sites in heterologous genes expressed in fungi | |
US20030022280A1 (en) | Compositions and methods for producing high yields of heterologous polypeptides in a pichia cell | |
WO2001066693A1 (en) | Compositions and methods for producing high yields of heterologous polypeptides in a pichia cell | |
EP1756273A1 (en) | Signal peptide for producing a polypeptide | |
JP2004538004A (en) | Vectors for site-specific integration of heterologous DNA sequences into rDNA sites of methylotrophic yeast | |
CN101067127B (en) | Methods for producing polypeptides in cyclohexadepsipeptide-deficient cells | |
DK2646558T3 (en) | Promoters for expression of genes in a fungal cell | |
KR20220098155A (en) | Nonviral transcriptional activation domains and related methods and uses |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
122 | Ep: pct application non-entry in european phase | ||
NENP | Non-entry into the national phase |
Ref country code: JP |