WO1992011378A1 - A method of constructing synthetic leader sequences - Google Patents
- Publication number
- WO1992011378A1 (PCT/DK1991/000396)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sequence encoding
- dna sequence
- yeast
- signal peptide
- arg
- 238000000034 method Methods 0.000 title claims description 64
- 108010076504 Protein Sorting Signals Proteins 0.000 claims abstract description 73
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims abstract description 73
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 73
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 62
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 57
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 54
- 108020004414 DNA Proteins 0.000 claims abstract description 52
- 150000001413 amino acids Chemical class 0.000 claims abstract description 52
- 229920001184 polypeptide Polymers 0.000 claims abstract description 51
- 239000012634 fragment Substances 0.000 claims abstract description 45
- 230000028327 secretion Effects 0.000 claims abstract description 28
- 238000012545 processing Methods 0.000 claims abstract description 18
- 239000013598 vector Substances 0.000 claims abstract description 18
- 210000005253 yeast cell Anatomy 0.000 claims abstract description 9
- 238000012216 screening Methods 0.000 claims abstract description 8
- 108091008146 restriction endonucleases Proteins 0.000 claims abstract description 7
- 238000012258 culturing Methods 0.000 claims abstract description 6
- 239000013599 cloning vector Substances 0.000 claims abstract description 4
- 238000001400 expression cloning Methods 0.000 claims abstract description 3
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 claims description 26
- 239000013604 expression vector Substances 0.000 claims description 16
- 210000004027 cell Anatomy 0.000 claims description 14
- 102000004877 Insulin Human genes 0.000 claims description 13
- 108090001061 Insulin Proteins 0.000 claims description 13
- IXKSXJFAGXLQOQ-XISFHERQSA-N WHWLQLKPGQPMY Chemical group C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 IXKSXJFAGXLQOQ-XISFHERQSA-N 0.000 claims description 13
- 229940125396 insulin Drugs 0.000 claims description 13
- 239000002243 precursor Substances 0.000 claims description 12
- 230000008569 process Effects 0.000 claims description 9
- 230000003248 secreting effect Effects 0.000 claims description 7
- 238000003780 insertion Methods 0.000 claims description 6
- 230000037431 insertion Effects 0.000 claims description 6
- 239000004382 Amylase Substances 0.000 claims description 5
- 108010065511 Amylases Proteins 0.000 claims description 5
- 102000013142 Amylases Human genes 0.000 claims description 5
- 101150071434 BAR1 gene Proteins 0.000 claims description 5
- 102000005367 Carboxypeptidases Human genes 0.000 claims description 5
- 108010006303 Carboxypeptidases Proteins 0.000 claims description 5
- NPBGTPKLVJEOBE-IUCAKERBSA-N Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N NPBGTPKLVJEOBE-IUCAKERBSA-N 0.000 claims description 5
- 101100378536 Ovis aries ADRB1 gene Proteins 0.000 claims description 5
- 235000019418 amylase Nutrition 0.000 claims description 5
- NMWKYTGJWUAZPZ-WWHBDHEGSA-N (4S)-4-[[(4R,7S,10S,16S,19S,25S,28S,31R)-31-[[(2S)-2-[[(1R,6R,9S,12S,18S,21S,24S,27S,30S,33S,36S,39S,42R,47R,53S,56S,59S,62S,65S,68S,71S,76S,79S,85S)-47-[[(2S)-2-[[(2S)-4-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-amino-3-methylbutanoyl]amino]-3-methylbutanoyl]amino]-3-hydroxypropanoyl]amino]-3-(1H-imidazol-4-yl)propanoyl]amino]-3-phenylpropanoyl]amino]-4-oxobutanoyl]amino]-3-carboxypropanoyl]amino]-18-(4-aminobutyl)-27,68-bis(3-amino-3-oxopropyl)-36,71,76-tribenzyl-39-(3-carbamimidamidopropyl)-24-(2-carboxyethyl)-21,56-bis(carboxymethyl)-65,85-bis[(1R)-1-hydroxyethyl]-59-(hydroxymethyl)-62,79-bis(1H-imidazol-4-ylmethyl)-9-methyl-33-(2-methylpropyl)-8,11,17,20,23,26,29,32,35,38,41,48,54,57,60,63,66,69,72,74,77,80,83,86-tetracosaoxo-30-propan-2-yl-3,4,44,45-tetrathia-7,10,16,19,22,25,28,31,34,37,40,49,55,58,61,64,67,70,73,75,78,81,84,87-tetracosazatetracyclo[40.31.14.012,16.049,53]heptaoctacontane-6-carbonyl]amino]-3-methylbutanoyl]amino]-7-(3-carbamimidamidopropyl)-25-(hydroxymethyl)-19-[(4-hydroxyphenyl)methyl]-28-(1H-imidazol-4-ylmethyl)-10-methyl-6,9,12,15,18,21,24,27,30-nonaoxo-16-propan-2-yl-1,2-dithia-5,8,11,14,17,20,23,26,29-nonazacyclodotriacontane-4-carbonyl]amino]-5-[[(2S)-1-[[(2S)-1-[[(2S)-3-carboxy-1-[[(2S)-1-[[(2S)-1-[[(1S)-1-carboxyethyl]amino]-4-methyl-1-oxopentan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-1-oxopropan-2-yl]amino]-1-oxopropan-2-yl]amino]-3-(1H-imidazol-4-yl)-1-oxopropan-2-yl]amino]-5-oxopentanoic acid Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CSSC[C@H](NC(=O)[C@@H](NC(=O)[C@@H]2CSSC[C@@H]3NC(=O)[C@H](Cc4ccccc4)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](Cc4c[nH]cn4)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]4CCCN4C(=O)[C@H](CSSC[C@H](NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](Cc4c[nH]cn4)NC(=O)[C@H](Cc4ccccc4)NC3=O)[C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](Cc3ccccc3)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N3CCC[C@H]3C(=O)N[C@@H](C)C(=O)N2)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc2ccccc2)NC(=O)[C@H](Cc2c[nH]cn2)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)C(C)C)[C@@H](C)O)C(C)C)C(=O)N[C@@H](Cc2c[nH]cn2)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](Cc2ccc(O)cc2)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1)C(=O)N[C@@H](C)C(O)=O NMWKYTGJWUAZPZ-WWHBDHEGSA-N 0.000 claims description 4
- 108010039627 Aprotinin Proteins 0.000 claims description 4
- OMLWNBVRVJYMBQ-YUMQZZPRSA-N Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OMLWNBVRVJYMBQ-YUMQZZPRSA-N 0.000 claims description 4
- JQFZHHSQMKZLRU-IUCAKERBSA-N Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N JQFZHHSQMKZLRU-IUCAKERBSA-N 0.000 claims description 4
- 102000004190 Enzymes Human genes 0.000 claims description 4
- 108090000790 Enzymes Proteins 0.000 claims description 4
- 108060003199 Glucagon Proteins 0.000 claims description 4
- 108010000521 Human Growth Hormone Proteins 0.000 claims description 4
- 102000015696 Interleukins Human genes 0.000 claims description 4
- 108010063738 Interleukins Proteins 0.000 claims description 4
- 102000004882 Lipase Human genes 0.000 claims description 4
- 108090001060 Lipase Proteins 0.000 claims description 4
- 239000004367 Lipase Substances 0.000 claims description 4
- NVGBPTNZLWRQSY-UWVGGRQHSA-N Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN NVGBPTNZLWRQSY-UWVGGRQHSA-N 0.000 claims description 4
- 102000010780 Platelet-Derived Growth Factor Human genes 0.000 claims description 4
- 108010038512 Platelet-Derived Growth Factor Proteins 0.000 claims description 4
- 241000223258 Thermomyces lanuginosus Species 0.000 claims description 4
- 102000003978 Tissue Plasminogen Activator Human genes 0.000 claims description 4
- 108090000373 Tissue Plasminogen Activator Proteins 0.000 claims description 4
- 102100030951 Tissue factor pathway inhibitor Human genes 0.000 claims description 4
- 102000004887 Transforming Growth Factor beta Human genes 0.000 claims description 4
- 108090001012 Transforming Growth Factor beta Proteins 0.000 claims description 4
- 101800004564 Transforming growth factor alpha Proteins 0.000 claims description 4
- 102400001320 Transforming growth factor alpha Human genes 0.000 claims description 4
- 229960004405 aprotinin Drugs 0.000 claims description 4
- 108010068380 arginylarginine Proteins 0.000 claims description 4
- 108010062796 arginyllysine Proteins 0.000 claims description 4
- 108010006025 bovine growth hormone Proteins 0.000 claims description 4
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 claims description 4
- MASNOZXLGMXCHN-ZLPAWPGGSA-N glucagon Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)C(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C1=CC=CC=C1 MASNOZXLGMXCHN-ZLPAWPGGSA-N 0.000 claims description 4
- 229960004666 glucagon Drugs 0.000 claims description 4
- ZPNFWUPYTFPOJU-LPYSRVMUSA-N iniprol Chemical compound C([C@H]1C(=O)NCC(=O)NCC(=O)N[C@H]2CSSC[C@H]3C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC=4C=CC=CC=4)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC=4C=CC=CC=4)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC2=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC=2C=CC=CC=2)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H]2N(CCC2)C(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N2[C@@H](CCC2)C(=O)N2[C@@H](CCC2)C(=O)N[C@@H](CC=2C=CC(O)=CC=2)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N2[C@@H](CCC2)C(=O)N3)C(=O)NCC(=O)NCC(=O)N[C@@H](C)C(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@H](C(=O)N1)C(C)C)[C@@H](C)O)[C@@H](C)CC)=O)[C@@H](C)CC)C1=CC=C(O)C=C1 ZPNFWUPYTFPOJU-LPYSRVMUSA-N 0.000 claims description 4
- 235000019421 lipase Nutrition 0.000 claims description 4
- 108010054155 lysyllysine Proteins 0.000 claims description 4
- 239000000137 peptide hydrolase inhibitor Substances 0.000 claims description 4
- ZRKFYGHZFMAOKI-QMGMOQQFSA-N tgfbeta Chemical compound C([C@H](NC(=O)[C@H](C(C)C)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(C)C)[C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O)C1=CC=C(O)C=C1 ZRKFYGHZFMAOKI-QMGMOQQFSA-N 0.000 claims description 4
- 229960000187 tissue plasminogen activator Drugs 0.000 claims description 4
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 claims description 3
- 108010013555 lipoprotein-associated coagulation inhibitor Proteins 0.000 claims description 3
- 230000001131 transforming effect Effects 0.000 claims description 3
- 102000051325 Glucagon Human genes 0.000 claims 3
- 101710139626 Tissue factor pathway inhibitor Proteins 0.000 claims 1
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 54
- 102000004169 proteins and genes Human genes 0.000 description 41
- 235000018102 proteins Nutrition 0.000 description 40
- 235000001014 amino acid Nutrition 0.000 description 28
- 229940024606 amino acid Drugs 0.000 description 28
- 239000013612 plasmid Substances 0.000 description 18
- 239000002299 complementary DNA Substances 0.000 description 13
- YFKWIIRWHGKSQQ-WFBYXXMGSA-N Cys-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CS)N YFKWIIRWHGKSQQ-WFBYXXMGSA-N 0.000 description 12
- 108010044940 alanylglutamine Proteins 0.000 description 12
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 11
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 11
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 description 11
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 11
- 238000010276 construction Methods 0.000 description 10
- 239000002609 medium Substances 0.000 description 10
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 9
- 239000000047 product Substances 0.000 description 8
- 108010016616 cysteinylglycine Proteins 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 108020004707 nucleic acids Proteins 0.000 description 7
- 102000039446 nucleic acids Human genes 0.000 description 7
- 150000007523 nucleic acids Chemical class 0.000 description 7
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 6
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 6
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 6
- 241000588724 Escherichia coli Species 0.000 description 6
- XMVLTPMCUJTJQP-FXQIFTODSA-N Glu-Gln-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N XMVLTPMCUJTJQP-FXQIFTODSA-N 0.000 description 6
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 239000002773 nucleotide Substances 0.000 description 6
- 125000003729 nucleotide group Chemical group 0.000 description 6
- 108010048818 seryl-histidine Proteins 0.000 description 6
- 108010005652 splenotritin Proteins 0.000 description 6
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 5
- GCDLPNRHPWBKJJ-WDSKDSINSA-N Cys-Gly-Glu Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GCDLPNRHPWBKJJ-WDSKDSINSA-N 0.000 description 5
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 5
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 5
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 5
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 5
- 101150033985 TPI gene Proteins 0.000 description 5
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 5
- 108700015934 Triose-phosphate isomerases Proteins 0.000 description 5
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 5
- 108010049041 glutamylalanine Proteins 0.000 description 5
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 4
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 4
- SJPZTWAYTJPPBI-GUBZILKMSA-N Asn-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SJPZTWAYTJPPBI-GUBZILKMSA-N 0.000 description 4
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 4
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 4
- ZJICFHQSPWFBKP-AVGNSLFASA-N Glu-Asn-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZJICFHQSPWFBKP-AVGNSLFASA-N 0.000 description 4
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 4
- DURWCDDDAWVPOP-JBDRJPRFSA-N Ile-Cys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N DURWCDDDAWVPOP-JBDRJPRFSA-N 0.000 description 4
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 4
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 4
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 4
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 4
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 102100033598 Triosephosphate isomerase Human genes 0.000 description 4
- XBWKCYFGRXKWGO-SRVKXCTJSA-N Tyr-Cys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O XBWKCYFGRXKWGO-SRVKXCTJSA-N 0.000 description 4
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 4
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 4
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 4
- 238000003776 cleavage reaction Methods 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 108010004073 cysteinylcysteine Proteins 0.000 description 4
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 108010012058 leucyltyrosine Proteins 0.000 description 4
- 238000010369 molecular cloning Methods 0.000 description 4
- 230000007017 scission Effects 0.000 description 4
- 238000010561 standard procedure Methods 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- FAAHJOLJYDXKKU-ZHDGNLTBSA-N (2s)-6-amino-2-[[(2s)-1-[(2s,3r)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[(2-aminoacetyl)amino]-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-hydroxybutanoyl]pyrrolidine-2-carbonyl]amino]hexanoic acid Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)CN)C1=CC=C(O)C=C1 FAAHJOLJYDXKKU-ZHDGNLTBSA-N 0.000 description 3
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 3
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 3
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 3
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 3
- 240000006439 Aspergillus oryzae Species 0.000 description 3
- KOHBWQDSVCARMI-BWBBJGPYSA-N Cys-Cys-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KOHBWQDSVCARMI-BWBBJGPYSA-N 0.000 description 3
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 3
- GNDJOCGXGLNCKY-ACZMJKKPSA-N Gln-Cys-Cys Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O GNDJOCGXGLNCKY-ACZMJKKPSA-N 0.000 description 3
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 3
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 3
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 3
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 3
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 3
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 3
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 3
- LJUIEESLIAZSFR-SRVKXCTJSA-N His-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LJUIEESLIAZSFR-SRVKXCTJSA-N 0.000 description 3
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 3
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 3
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 3
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 3
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 3
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 3
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 3
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 3
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 3
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 3
- VKYDVKAKGDNZED-STECZYCISA-N Tyr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N VKYDVKAKGDNZED-STECZYCISA-N 0.000 description 3
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 3
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 3
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 3
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 108010027338 isoleucylcysteine Proteins 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- SBKVPJHMSUXZTA-MEJXFZFPSA-N (2S)-2-[[(2S)-2-[[(2S)-1-[(2S)-5-amino-2-[[2-[[(2S)-1-[(2S)-6-amino-2-[[(2S)-2-[[(2S)-5-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-amino-3-(1H-indol-3-yl)propanoyl]amino]-3-(1H-imidazol-4-yl)propanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-4-methylpentanoyl]amino]-5-oxopentanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]acetyl]amino]-5-oxopentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylsulfanylbutanoyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 SBKVPJHMSUXZTA-MEJXFZFPSA-N 0.000 description 2
- YFZCWRCBISUCFG-ZDMORDKWSA-N (2s)-2-[[(2s)-6-amino-2-[[(2s)-1-[(2s,3r)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-hydroxybutanoyl]pyrr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N)C1=CC=C(O)C=C1 YFZCWRCBISUCFG-ZDMORDKWSA-N 0.000 description 2
- 101150028074 2 gene Proteins 0.000 description 2
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 2
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 2
- YNQMEIJEWSHOEO-SRVKXCTJSA-N Asn-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YNQMEIJEWSHOEO-SRVKXCTJSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 2
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 2
- 108050001049 Extracellular proteins Proteins 0.000 description 2
- LTXLIIZACMCQTO-GUBZILKMSA-N Gln-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LTXLIIZACMCQTO-GUBZILKMSA-N 0.000 description 2
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 2
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 2
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 2
- LBHOVGUGOBINDL-KKUMJFAQSA-N His-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O LBHOVGUGOBINDL-KKUMJFAQSA-N 0.000 description 2
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 2
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 2
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 2
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 2
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 2
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 2
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 2
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 2
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 210000002288 golgi apparatus Anatomy 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 230000002797 proteolythic effect Effects 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 230000005945 translocation Effects 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 1
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 1
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 1
- 101001007681 Candida albicans (strain WO-1) Kexin Proteins 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 108010033155 GGGCCC-specific type II deoxyribonucleases Proteins 0.000 description 1
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 1
- 102400000321 Glucagon Human genes 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- ULRFSEJGSHYLQI-YESZJQIVSA-N His-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ULRFSEJGSHYLQI-YESZJQIVSA-N 0.000 description 1
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- 101150045458 KEX2 gene Proteins 0.000 description 1
- 125000000570 L-alpha-aspartyl group Chemical group [H]OC(=O)C([H])([H])[C@]([H])(N([H])[H])C(*)=O 0.000 description 1
- 241000235087 Lachancea kluyveri Species 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- 101710140452 Mating factor alpha-1 Proteins 0.000 description 1
- 230000004988 N-glycosylation Effects 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- DSXPMZMSJHOKKK-HJOGWXRNSA-N Phe-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DSXPMZMSJHOKKK-HJOGWXRNSA-N 0.000 description 1
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- RNEFESSBTOQSAC-DCAQKATOSA-N Pro-Ser-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O RNEFESSBTOQSAC-DCAQKATOSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- 241000582914 Saccharomyces uvarum Species 0.000 description 1
- 101900104102 Schizosaccharomyces pombe Triosephosphate isomerase Proteins 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- UDNVOQMPQBEITB-MEYUZBJRSA-N Thr-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UDNVOQMPQBEITB-MEYUZBJRSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- 102000005924 Triose-Phosphate Isomerase Human genes 0.000 description 1
- DANHCMVVXDXOHN-SRVKXCTJSA-N Tyr-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DANHCMVVXDXOHN-SRVKXCTJSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- -1 ammonium sulphate Chemical class 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 239000001166 ammonium sulphate Substances 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 108010051210 beta-Fructofuranosidase Proteins 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 230000030609 dephosphorylation Effects 0.000 description 1
- 238000006209 dephosphorylation reaction Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 108010092809 exonuclease Bal 31 Proteins 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 239000001573 invertase Substances 0.000 description 1
- 235000011073 invertase Nutrition 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 210000001322 periplasm Anatomy 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 150000008300 phosphoramidites Chemical class 0.000 description 1
- 238000013492 plasmid preparation Methods 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 230000001376 precipitating effect Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 210000003934 vacuole Anatomy 0.000 description 1
- 239000007222 ypd medium Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
- C12N15/625—DNA sequences coding for fusion proteins containing a sequence coding for a signal sequence
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/036—Fusion polypeptide containing a localisation/targetting motif targeting to the medium outside of the cell, e.g. type III secretion
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/70—Fusion polypeptide containing domain for protein-protein interaction
- C07K2319/74—Fusion polypeptide containing domain for protein-protein interaction containing a fusion for binding to a cell surface receptor
- C07K2319/75—Fusion polypeptide containing domain for protein-protein interaction containing a fusion for binding to a cell surface receptor containing a fusion for activation of a cell surface receptor, e.g. thrombopoeitin, NPY and other peptide hormones
Definitions
- The present invention relates to a method of constructing synthetic leader peptide sequences for secreting heterologous polypeptides in yeast, and to yeast expression vectors for use in the method.
- Yeast organisms produce a number of proteins which are synthesized intracellularly, but which have a function outside the cell. Such extracellular proteins are referred to as secreted proteins. These secreted proteins are expressed initially inside the cell in a precursor or a pre-protein form containing a presequence ensuring effective direction of the expressed product across the membrane of the endoplasmic reticulum (ER).
- The presequence, normally named a signal peptide, is generally cleaved off from the desired product during translocation. Once it has entered the secretory pathway, the protein is transported to the Golgi apparatus.
- The protein can follow different routes that lead to compartments such as the cell vacuole or the cell membrane, or it can be routed out of the cell to be secreted to the external medium (Pfeffer, S.R. and Rothman, J.E. Ann. Rev. Biochem. 56 (1987), 829-852).
- European published patent application No. 88 632 describes a process by which proteins heterologous to yeast are expressed, processed and secreted by transforming a yeast organism with an expression vehicle harbouring DNA encoding the desired protein and a signal peptide, preparing a culture of the transformed organism, growing the culture and recovering the protein from the culture medium.
- The signal peptide may be the signal peptide of the desired protein itself, a heterologous signal peptide or a hybrid of native and heterologous signal peptide.
- A problem encountered with the use of signal peptides heterologous to yeast might be that the heterologous signal peptide does not ensure efficient translocation and/or cleavage after the signal peptide.
- The S. cerevisiae MFα1 (α-factor) is synthesized as a prepro form of 165 amino acids comprising a signal or prepeptide of 19 amino acids followed by a "leader" or propeptide of 64 amino acids, encompassing three N-linked glycosylation sites, followed by (LysArg(Asp/Glu, Ala)2-3 α-factor)4 (Kurjan, J. and Herskowitz, I. Cell 30 (1982), 933-943).
- The signal-leader part of the preproMFα1 has been widely employed to obtain synthesis and secretion of heterologous proteins in S. cerevisiae.
- EP 16201, 123 294, 123 544, and 163 529 describe processes by which the α-factor signal-leader from Saccharomyces cerevisiae (MFα1 or MFα2) is utilized in the secretion process of expressed heterologous proteins in yeast.
- EP 206783 discloses a system for the secretion of polypeptides from S. cerevisiae in which the α-factor leader sequence has been truncated to eliminate the four α-factor peptides present on the native leader sequence so as to leave the leader peptide itself fused to a heterologous polypeptide via the α-factor processing site LysArgGluAlaGluAla.
- This construction is indicated to lead to efficient processing of smaller peptides (less than 50 amino acids).
- The native α-factor leader sequence has been truncated to leave one or two α-factor peptides between the leader peptide and the polypeptide.
- A number of secreted proteins are routed so as to be exposed to a proteolytic processing system which can cleave the peptide bond at the carboxy end of two consecutive basic amino acids.
- In S. cerevisiae, this enzymatic activity is encoded by the KEX 2 gene (Julius, D.A. et al., Cell 37 (1984b), 1075). Processing of the product by the KEX 2 gene product is needed for the secretion of active S. cerevisiae mating factor α1 (MFα1 or α-factor) but is not involved in the secretion of active S. cerevisiae mating factor a.
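The processing rule described above (cleavage at the carboxy end of two consecutive basic amino acids) can be illustrated with a minimal sketch. It is not taken from the patent: the function name is hypothetical, peptides are assumed to be written in one-letter code, and "basic" is simplified here to lysine (K) and arginine (R) only.

```python
# Simplifying assumption: only Lys (K) and Arg (R) are treated as basic residues.
BASIC_RESIDUES = {"K", "R"}


def dibasic_cleavage_positions(peptide: str) -> list:
    """Return every index i such that peptide[:i] ends in two consecutive
    basic residues, i.e. a candidate cut point on the carboxy side of the pair."""
    seq = peptide.upper()
    return [
        i + 2
        for i in range(len(seq) - 1)
        if seq[i] in BASIC_RESIDUES and seq[i + 1] in BASIC_RESIDUES
    ]


# "KREAEA" is the one-letter form of the LysArgGluAlaGluAla site quoted above;
# the two leading alanines are placeholders.
positions = dibasic_cleavage_positions("AAKREAEA")
```

A candidate position only marks where a dibasic pair ends; whether a given site is actually processed in vivo is not something this sketch can predict.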
- The present invention relates to a method of constructing a synthetic leader peptide sequence for secreting heterologous polypeptides in yeast, the method comprising
- X n is a DNA sequence encoding n amino acids, wherein n is 0 or an integer of from 1 to about 10,
- RS is a restriction endonuclease recognition site for insertion of random DNA fragments, which site is provided at the junction of X n and X m ,
- X m is a DNA sequence encoding m amino acids, wherein m is 0 or an integer of from 1 to about 10,
- (NZT) p is a DNA sequence encoding Asn-Xaa-Thr, wherein p is 0 or 1,
- X q is a DNA sequence encoding q amino acids, wherein q is 0 or an integer of from 1 to about 10,
- PS is a DNA sequence encoding a peptide defining a yeast processing site
- step (b) transforming a yeast host cell with the expression vector of step (a);
- step (c) culturing the transformed host cell of step (b) under appropriate conditions
- step (d) screening the culture of step (c) for secretion of the heterologous polypeptide.
- The term "leader peptide" is understood to indicate a peptide whose function is to allow the heterologous polypeptide to be directed from the endoplasmic reticulum to the Golgi apparatus and further to a secretory vesicle for secretion into the medium (i.e. exportation of the expressed polypeptide across the cell wall or at least through the cellular membrane into the periplasmic space of the cell).
- The term "synthetic", used in connection with leader peptides, is intended to indicate that the leader peptide constructed by the present method is one not found in nature.
- The term "signal peptide" is understood to mean a presequence which is predominantly hydrophobic in nature and present as an N-terminal sequence of the precursor form of an extracellular protein expressed in yeast.
- The function of the signal peptide is to allow the heterologous protein to be secreted to enter the endoplasmic reticulum.
- The signal peptide is normally cleaved off in the course of this process.
- The signal peptide may be heterologous or homologous to the yeast organism producing the protein but, as explained above, a more efficient cleavage of the signal peptide may be obtained when it is homologous to the yeast organism in question.
- The term "heterologous polypeptide" is intended to indicate a polypeptide which is not produced by the host yeast organism in nature.
- the heterologous polypeptide is preferably one the secretion of which by transformed yeast cells may easily be detected, e.g. by established standard methods such as by immunological screening by means of antibodies reactive with the polypeptide in question (cf. for instance Sambrook, Fritsch and Maniatis, Molecular Cloning; A Laboratory Manual, Cold Spring Harbor, New York, 1989) or by screening for a specific biological activity of the heterologous polypeptide.
- a positive result of the screening indicates that a leader peptide useful for the secretion of heterologous polypeptides in yeast has been constructed.
- a random DNA fragment is intended to indicate any sequence of DNA at least 3 nucleotides in length, for instance obtained by digesting genomic DNA (of any organism) with restriction endonuclease(s) or by preparing synthetic DNA, e.g. by the phosphoramidite method described by S.L. Beaucage and M.H. Caruthers, Tetrahedron Letters 22, 1981, pp. 1859-1869.
- the peptide Asn-Xaa-Thr encoded by "(NZT) p " is an asparagine-linked glycosylation site.
- "Xaa" denotes any one of the known amino acids except Pro.
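The Asn-Xaa-Thr rule stated above can be checked mechanically. The following is a minimal sketch, assuming one-letter amino acid codes (N = Asn, P = Pro, T = Thr); the function name and the test peptide are hypothetical and are not taken from the patent.

```python
import re

# Asn-Xaa-Thr, where Xaa is any amino acid except Pro (one-letter: N, not P, T).
# The lookahead lets overlapping sites (e.g. "NNTT") be reported as well.
SEQUON = re.compile(r"(?=N[^P]T)")

def find_sequons(protein: str) -> list[int]:
    """Return 0-based start positions of Asn-Xaa-Thr sites in a one-letter protein string."""
    return [m.start() for m in SEQUON.finditer(protein.upper())]

# Hypothetical peptide, used only to exercise the function.
print(find_sequons("AANGTKKNPTNNTT"))  # [2, 10, 11]; the N-P-T at position 7 is skipped
```

A match only indicates that the sequence motif is present; whether a given site is actually glycosylated in yeast is not something this check can tell.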
- the present invention relates to a yeast expression cloning vector comprising the following sequence
- This vector may be used in the construction of leader peptide sequences according to the method described above.
- the present invention relates to a yeast expression vector comprising the following sequence
- ranDNA is a random DNA fragment inserted in a restriction endonuclease recognition site provided at the junction of X n and X m .
- the leader peptide sequence (once identified by the method of the invention) will be composed of the sequence X n -ranDNA-X m -(NZT) p -X q .
- Such a vector may be used in the production of a heterologous polypeptide of interest.
- the present invention relates to a process for producing a heterologous polypeptide in yeast, the process comprising culturing a yeast cell, which is capable of expressing a heterologous polypeptide and which is transformed with a yeast expression vector as described above including a leader peptide sequence constructed by the method of the invention, in a suitable medium to obtain expression and secretion of the heterologous polypeptide, after which the heterologous polypeptide is recovered from the medium.
- the length of the random DNA fragment inserted in the expression vector is not particularly critical. However, in order to be of a manageable length, the fragment preferably has a length of from 16 to about 600 base pairs. More preferably, the fragment has a length of from about 15 to about 300 base pairs. It is at present considered that a suitable length of the fragment is from about 30 to about 150 base pairs.
- the random DNA fragment preferably encodes a high proportion of polar amino acids. These are selected from the group consisting of Glu, Asp, Lys, Arg, His, Thr, Ser, Asn and Gln.
- a high proportion of is understood to indicate that the DNA fragment encodes a larger number of polar amino acids than do other DNA sequences of a corresponding length.
- the fragment encodes at least one proline.
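As an illustration of the two preferences above (a high proportion of polar residues and at least one proline), the sketch below scores the peptide encoded by a candidate fragment. It assumes the peptide is already available in one-letter code; the function names and the example peptide are hypothetical. The text defines "high proportion" only relative to other sequences of the same length, so no threshold is applied here.

```python
# Polar residues as listed in the text: Glu, Asp, Lys, Arg, His, Thr, Ser, Asn, Gln.
POLAR = set("EDKRHTSNQ")

def polar_fraction(peptide: str) -> float:
    """Fraction of residues in the peptide that belong to the polar group."""
    peptide = peptide.upper()
    if not peptide:
        return 0.0
    return sum(aa in POLAR for aa in peptide) / len(peptide)

def has_proline(peptide: str) -> bool:
    """True if the peptide contains at least one proline."""
    return "P" in peptide.upper()

# Hypothetical encoded peptide, for illustration only.
candidate = "SEPKRTNA"
print(polar_fraction(candidate), has_proline(candidate))  # 0.75 True
```

Comparing the fraction for a candidate against the fractions of other fragments of corresponding length would be one way to apply the relative definition given in the text.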
- n and/or m and/or q are preferably ⁇ 1. In particular, all of n, m and q are ⁇ 1.
- the signal peptide sequence may encode any signal peptide which ensures an effective direction of the expressed heterologous polypeptide into the secretory pathway of the cell.
- the signal peptide may be a naturally occurring signal peptide or functional parts thereof, or it may be a synthetic peptide.
- Suitable signal peptides have been found to be the α-factor signal peptide, the signal peptide of mouse salivary amylase, a modified carboxypeptidase signal peptide, the yeast BAR1 signal peptide or the Humicola lanuginosa lipase signal peptide, or a derivative thereof.
- the mouse salivary amylase signal sequence is described by O. Hagenbüchle et al., Nature 289, 1981, pp. 643-646.
- the carboxypeptidase signal sequence is described by L.A. Valls et al., Cell 48, 1987, pp. 887-897.
- the BAR1 signal peptide is disclosed in WO 87/02670.
- the H. lanuginosa lipase signal peptide is disclosed in EP 305 216.
- the yeast processing site encoded by the DNA sequence PS may suitably be any paired combination of Lys and Arg, such as Lys-Arg, Arg-Lys, Lys-Lys or Arg-Arg, which permits processing of the heterologous polypeptide by the KEX2 protease of Saccharomyces cerevisiae or the equivalent protease in other yeast species (D.A. Julius et al., Cell 37, 1984, 1075 ff.). If KEX2 processing is not convenient, e.g. if it would lead to cleavage of the polypeptide product, a processing site for another protease, comprising an amino acid combination which is not found in the polypeptide product, may be selected instead, e.g. the processing site for FXa, Ile-Glu-Gly-Arg (cf. Sambrook, Fritsch and Maniatis, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York, 1989).
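The choice between a dibasic processing site and an alternative such as Ile-Glu-Gly-Arg depends on whether the product itself contains the motif. A minimal sketch of that check follows, assuming one-letter codes (K = Lys, R = Arg); the function names and example sequences are hypothetical, and a sequence match does not by itself show that a protease would cleave at that position.

```python
import re

# Any paired combination of Lys and Arg: KR, RK, KK or RR (overlaps allowed).
DIBASIC = re.compile(r"(?=([KR][KR]))")

def dibasic_sites(product: str) -> list[tuple[int, str]]:
    """Return (0-based position, pair) for every Lys/Arg pair in the product."""
    return [(m.start(), m.group(1)) for m in DIBASIC.finditer(product.upper())]

def contains_fxa_site(product: str) -> bool:
    """True if the product contains Ile-Glu-Gly-Arg (IEGR)."""
    return "IEGR" in product.upper()

# Hypothetical product sequences, for illustration only.
print(dibasic_sites("AKRRA"))        # [(1, 'KR'), (2, 'RR')]
print(dibasic_sites("GIVEQCCTS"))    # []
print(contains_fxa_site("AAIEGRAA")) # True
```

An empty list from `dibasic_sites` suggests that a dibasic site could be used without the product carrying the same motif internally; a non-empty list is the situation in which the text proposes considering another protease site.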
- the heterologous protein produced by the method of the invention may be any protein which may advantageously be produced in yeast.
- examples of such proteins are aprotinin, tissue factor pathway inhibitor or other protease inhibitors, insulin or insulin precursors, human or bovine growth hormone, interleukin, glucagon, tissue plasminogen activator, transforming growth factor α or β, platelet-derived growth factor, enzymes, or a functional analogue thereof.
- the term "functional analogue" is meant to indicate a polypeptide with a similar function as the native protein (this is intended to be understood as relating to the nature rather than the level of biological activity of the native protein).
- the polypeptide may be structurally similar to the native protein and may be derived from the native protein by addition of one or more amino acids to either or both the C- and N-terminal end of the native protein, substitution of one or more amino acids at one or a number of different sites in the native amino acid sequence, deletion of one or more amino acids at either or both ends of the native protein or at one or several sites in the amino acid sequence, or insertion of one or more amino acids at one or more sites in the native amino acid sequence.
- modifications are well known for several of the proteins mentioned above.
- the random DNA fragment and the sequence 5'-SP-X n -3'-RS-5'-X m -(NZT) p -X q -PS-*gene*-3' may be prepared synthetically by established standard methods, e.g. the phosphoramidite method described by S.L. Beaucage and M.H. Caruthers, Tetrahedron Letters 22, 1981, pp. 1859-1869, or the method described by Matthes et al., EMBO Journal 3, 1984, pp. 801-805. According to the phosphoramidite method, oligonucleotides are synthesized, e.g.
- sequence 5'-SP-X n -3'-RS-5'-X m -(NZT) p -X q -PS-*gene*-3' need not be prepared in a single operation, but may be assembled from two or more oligonucleotides prepared synthetically in this fashion.
- the random DNA fragment or one or more parts of the sequence 5'-SP-X n -3'-RS-5'-X m -(NZT) p -X q -PS-*gene*-3' may also be of genomic or cDNA origin, for instance obtained by preparing a genomic or cDNA library and screening for DNA sequences coding for said parts (typically SP or *gene*) by hybridization using synthetic oligonucleotide probes in accordance with standard techniques (cf. Sambrook, Fritsch and Maniatis, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York, 1989).
- a genomic or cDNA sequence encoding a signal peptide may be joined to a genomic or cDNA sequence encoding the heterologous protein, after which the DNA sequence may be modified by the insertion of synthetic oligonucleotides encoding the sequence X n -3'-RS-5'-X m -(NZT) p -X q -PS in accordance with well-known procedures.
- the random DNA fragment and/or the sequence 5'-SP-X n -3'-RS-5'-X m -(NZT) p -X q -PS-*gene*-3' may be of mixed synthetic and genomic, mixed synthetic and cDNA or mixed genomic and cDNA origin prepared by annealing fragments of synthetic, genomic or cDNA origin (as appropriate), the fragments corresponding to various parts of the entire DNA sequence, in accordance with standard techniques.
- the DNA sequence encoding the signal peptide or the heterologous polypeptide may be of genomic or cDNA origin, while the sequence X n -3'-RS-5'-X m -(NZT) p -X q -PS may be prepared synthetically.
- Preferred DNA constructs encoding insulin precursors are as shown in Sequence Listings ID Nos. 1-13, or suitable modifications thereof.
- suitable modifications of the DNA sequence are nucleotide substitutions which do not give rise to another amino acid sequence of the protein, but which may correspond to the codon usage of the yeast organism into which the DNA construct is inserted or nucleotide substitutions which do give rise to a different amino acid sequence and therefore, possibly, a different protein structure.
- Other examples of possible modifications are insertion of three or multiples of three nucleotides into the sequence, addition of three or multiples of three nucleotides at either end of the sequence and deletion of three or multiples of three nucleotides at either end of or within the sequence.
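Insertions, additions and deletions are given in multiples of three nucleotides because such changes leave the downstream reading frame intact. The sketch below illustrates this with a short made-up open reading frame; the sequence and helper names are hypothetical and are not taken from the Sequence Listings.

```python
def codons(dna: str) -> list[str]:
    """Split a DNA string into complete codons, starting at position 0."""
    return [dna[i:i + 3] for i in range(0, len(dna) - 2, 3)]

def keeps_reading_frame(length_change_nt: int) -> bool:
    """True if an insertion (+) or deletion (-) of this many nucleotides preserves the frame."""
    return length_change_nt % 3 == 0

# Hypothetical 9-nucleotide reading frame, for illustration only.
orf = "ATGGCTTGT"
in_frame = orf[:3] + "AAA" + orf[3:]  # three nucleotides inserted after the first codon
shifted = orf[:3] + "AA" + orf[3:]    # two nucleotides inserted at the same position

print(codons(orf))       # ['ATG', 'GCT', 'TGT']
print(codons(in_frame))  # ['ATG', 'AAA', 'GCT', 'TGT']
print(codons(shifted))   # ['ATG', 'AAG', 'CTT']
```

In the in-frame case the downstream codons GCT and TGT are unchanged, whereas the two-nucleotide insertion alters every codon that follows it.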
- the recombinant expression vector carrying the sequence 5'-SP-X n -3'-RS-5'-X m -(NZT) p -X q -PS-*gene*-3' or 5'-SP-X n -ranDNA-X m -(NZT) p -X q -PS-*gene*-3' may be any vector which is capable of replicating in yeast organisms.
- either DNA sequence should be operably connected to a suitable promoter sequence.
- the promoter may be any DNA sequence which shows transcriptional activity in yeast and may be derived from genes encoding proteins either homologous or heterologous to yeast.
- the promoter is preferably derived from a gene encoding a protein homologous to yeast. Examples of suitable promoters are the Saccharomyces cerevisiae MFα1, TPI, ADH or PGK promoters.
- TPI terminator cf. T. Alber and G. Kawasaki, J. Mol. Appl. Genet. 1, 1982, pp. 419-434.
- the recombinant expression vector of the invention further comprises a DNA sequence enabling the vector to replicate in yeast.
- yeast sequences are the yeast plasmid 2μ replication genes REP 1-3 and origin of replication.
- the vector may also comprise a selectable marker, e.g. the Schizosaccharomyces pombe TPI gene as described by P.R. Russell, Gene 40, 1985, pp. 125-130.
- the vector may be constructed either by first preparing a DNA construct containing the entire sequence 5'-SP-X n -3'-RS-5'-X m -(NZT) p -X q -PS-*gene*-3' and subsequently inserting this fragment into a suitable expression vector, or by sequentially inserting DNA fragments containing genetic information for the individual elements (such as the signal peptide, the sequence X n -3'-RS-5'-X m -(NZT) p -X q or the heterologous polypeptide) followed by ligation.
- the yeast organism used in the method of the invention may be any suitable yeast organism which, on cultivation, produces large amounts of the heterologous polypeptide in question.
- suitable yeast organisms may be strains of the yeast species Saccharomyces cerevisiae, Saccharomyces kluyveri, Schizosaccharomyces pombe or Saccharomyces uvarum.
- the transformation of the yeast cells may for instance be effected by protoplast formation followed by transformation in a manner known per se.
- the medium used to cultivate the cells may be any conventional medium suitable for growing yeast organisms.
- the secreted heterologous protein may be recovered from the medium by conventional procedures including separating the yeast cells from the medium by centrifugation or filtration, precipitating the proteinaceous components of the supernatant or filtrate by means of a salt, e.g. ammonium sulphate, followed by purification by a variety of chromatographic procedures, e.g. ion exchange chromatography, affinity chromatography, or the like.
- Fig. 1 schematically shows the construction of pMT742δ
- Fig. 2 schematically shows the construction of pLaC202
- Fig. 3 shows the DNA sequence and derived amino acid sequence at the cloning site in pLaC202 for random DNA fragments (it should be noted that the sequence is cleaved in the unique ClaI site and that ligation without insertion of random DNA will lead to a change in the reading frame);
- Fig. 4 schematically shows the construction of pLSC6315D#
- Plasmids and DNA materials: the plasmids are of the C-POT type. Such plasmids are described in EP patent application No. 171 142 and are characterized in containing the Schizosaccharomyces pombe triose phosphate isomerase gene (POT) for the purpose of plasmid selection and stabilization.
- a plasmid containing the POT-gene is available from a deposited E. coli strain (ATCC 39685).
- the plasmids furthermore contain the S. cerevisiae triose phosphate isomerase promoter and terminator (P TPI and T TPI ). They are identical to pMT742 (M. Egel-Mitani et al., Gene 73, 1988, pp. 113-120) (see fig. 1) except for the region defined by the SphI-XbaI restriction sites encompassing the P TPI and the coding region for signal
- the P TPI has been modified with respect to the sequence found in pMT742, only in order to facilitate construction work.
- An internal SphI restriction site has been eliminated by SphI cleavage, removal of single-stranded tails and religation.
- DNA sequences upstream of, and without any impact on, the promoter have been removed by Bal31 exonuclease treatment followed by addition of an SphI restriction site linker.
- This promoter construction, present on a 373 bp SphI-EcoRI fragment, is designated P TPI δ, and when used in plasmids already described this promoter modification is indicated by the addition of a δ to the plasmid name, e.g. pMT742δ (fig. 1).
- This vector, containing a unique ClaI site, constitutes one embodiment of the random DNA cloning vector in which the product gene codes for the insulin precursor MI3 (B(1-29)-Ala-Ala-Lys-A(1-21)).
- the following examples concern the leaders cloned via this construct.
- Total DNA was isolated from S. cerevisiae strain MT663 and digested with TaqI, HinPI or TaqI + HinPI. The digests were separated according to size on a 1% agarose gel, and fragments smaller than 600 bp were isolated from each of the three digestions.
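The digestion and size selection described above can be mimicked in silico. The sketch below is illustrative only: the recognition sites and cut positions (TaqI T^CGA, HinPI G^CGC) are standard enzyme properties that are not stated in the text, the molecule is treated as linear and only the top strand is considered, and the input sequence is made up. Both enzymes leave 5'-CG overhangs, which is why such fragments can be ligated into a ClaI-cut (AT^CGAT) vector.

```python
import re

# Recognition site and top-strand cut offset for each enzyme
# (standard properties, assumed here rather than taken from the text).
ENZYMES = {"TaqI": ("TCGA", 1), "HinPI": ("GCGC", 1)}

def digest(dna: str, enzyme_names: list[str]) -> list[str]:
    """Cut a linear DNA string with the named enzymes and return the fragments in order."""
    dna = dna.upper()
    cuts = set()
    for name in enzyme_names:
        site, offset = ENZYMES[name]
        for m in re.finditer(f"(?={site})", dna):
            cuts.add(m.start() + offset)
    bounds = [0] + sorted(cuts) + [len(dna)]
    return [dna[a:b] for a, b in zip(bounds, bounds[1:])]

def size_select(fragments: list[str], max_bp: int = 600) -> list[str]:
    """Keep fragments shorter than max_bp, as in the gel isolation step."""
    return [f for f in fragments if len(f) < max_bp]

# Hypothetical 15-nucleotide sequence containing one TaqI and one HinPI site.
fragments = digest("AAATCGAGGGCGCTT", ["TaqI", "HinPI"])
print(fragments)  # ['AAAT', 'CGAGGG', 'CGCTT']
```

Calling `size_select(fragments)` with the default limit of 600 corresponds to the cut-off used in the example; a real genomic digest would of course produce far more fragments than this toy input.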
- pLaC202, previously digested with ClaI and prevented from self-ligation by dephosphorylation with calf intestine alkaline phosphatase (CIAP), was mixed with the fragment pools described above and ligated.
- Recombinant plasmids were prepared from each of the three types in pools encompassing all 5000 transformants. These plasmid pools were used to transform S. cerevisiae strain MT663 and the resulting TPI transformants were immunoscreened for MI3 secretion.
- Sequencing of the inserts of the eight isolated pLaC202 derivatives showed three different sequences, two of which, pLSC6315 and pLSC5210, most efficiently support MI3 secretion.
- the sequences of the cloned DNA and flanking regions are shown in Sequence Listings ID Nos. 2 and 4, respectively.
- pLSC6315 was chosen for further modification of the cloned synthetic leader sequence.
- pLSC6315 was digested with the ApaI endonuclease followed by treatment with the exonuclease Bal31. After phenol extraction the resulting DNA was digested with XbaI, and DNA fragments smaller than the original 367 bp ApaI-XbaI fragment were isolated.
- pLaC202 was digested with ClaI, and the single-stranded CG tails generated were removed, followed by XbaI digestion and isolation of the 11 kb XbaI-]ClaI[ fragment ("] [" indicates that the single-stranded tails have been trimmed off). This fragment was mixed with the pLSC6315 fragments isolated above and ligated (fig. 6).
- Sequencing of the inserts showed 5 different sequences, two of which (pLAO2 and pLAO5) are more efficient where MI3 secretion is concerned.
- the sequences of the DNA inserts in pLAO2 and pLAO5 are shown, together with the flanking regions, in Sequence Listings ID Nos. 10 and 12, respectively.
- Example 5: Yeast strains harbouring plasmids as described above were grown in YPD medium (Sherman, F. et al., Methods in Yeast Genetics, Cold Spring Harbor Laboratory 1981). For each strain, 6 individual 5 ml cultures were shaken at 30°C for 60 hours, to a final OD600 of approx. 15. After centrifugation the supernatant was removed for HPLC analysis, by which the concentration of secreted insulin precursor was measured as described by Leo Snel et al. (1987) Chromatographia 24, 329-332.
- MOLECULE TYPE protein
- TGT ACC TCC ATC TGC TCC TTG TAC CAA TTG GAA AAC TAC TGC AAC 348 Cys Thr Ser Ile Cys Ser Leu Tyr Gln Leu Glu Asn Tyr Cys Asn
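The pairing of codons and amino acids in the sequence line above can be verified by translating the nucleotides with the standard genetic code. The sketch below is a minimal, self-contained check; the helper names are hypothetical and the output uses one-letter amino acid codes.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) into one-letter amino acid codes."""
    dna = "".join(dna.split()).upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))

# The 15 codons quoted in the sequence line above.
listed = "TGT ACC TCC ATC TGC TCC TTG TAC CAA TTG GAA AAC TAC TGC AAC"
print(translate(listed))  # CTSICSLYQLENYCN
```

The result, CTSICSLYQLENYCN, corresponds residue for residue to Cys Thr Ser Ile Cys Ser Leu Tyr Gln Leu Glu Asn Tyr Cys Asn as printed in the listing.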
Priority Applications (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CS931192A CZ119293A3 (en) | 1990-12-19 | 1991-12-18 | Method of constructing synthetic leading peptide sequence |
KR1019930701890A KR930703450A (en) | 1990-12-19 | 1991-12-18 | Construction method of synthetic leader sequence |
FI932831A FI932831A0 (en) | 1990-12-19 | 1991-12-18 | For the purposes of this Regulation, the following guidelines may be used |
AU91348/91A AU660161B2 (en) | 1990-12-19 | 1991-12-18 | A method of constructing synthetic leader sequences |
JP4502056A JPH06503957A (en) | 1990-12-19 | 1991-12-18 | How to construct a synthetic leader sequence |
SK62593A SK62593A3 (en) | 1990-12-19 | 1991-12-18 | Method of constructing synthetic leader sequences |
NO93932235A NO932235L (en) | 1990-12-19 | 1993-06-17 | PROCEDURE FOR PREPARING SYNTHETIC LEADER SEQUENCES |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DK300090A DK300090D0 (en) | 1990-12-19 | 1990-12-19 | PROCEDURE FOR PREPARING LEADER SEQUENCES |
DK3000/90 | 1990-12-19 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1992011378A1 true WO1992011378A1 (en) | 1992-07-09 |
Family
ID=8118017
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/DK1991/000396 WO1992011378A1 (en) | 1990-12-19 | 1991-12-18 | A method of constructing synthetic leader sequences |
Country Status (17)
Country | Link |
---|---|
EP (1) | EP0563175A1 (en) |
JP (1) | JPH06503957A (en) |
KR (1) | KR930703450A (en) |
AU (1) | AU660161B2 (en) |
CA (1) | CA2098731A1 (en) |
CZ (1) | CZ119293A3 (en) |
DK (1) | DK300090D0 (en) |
FI (1) | FI932831A0 (en) |
HU (1) | HUT68751A (en) |
IE (1) | IE914433A1 (en) |
IL (1) | IL100408A0 (en) |
MX (1) | MX9102684A (en) |
NZ (1) | NZ241011A (en) |
PT (1) | PT99848A (en) |
SK (1) | SK62593A3 (en) |
WO (1) | WO1992011378A1 (en) |
ZA (1) | ZA919932B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1989002463A1 (en) * | 1987-09-07 | 1989-03-23 | Novo-Nordisk A/S | Synthetic yeast leader peptides |
EP0324274A1 (en) * | 1987-12-30 | 1989-07-19 | Chiron Corporation | Improved expression and secretion of heterologous proteins in yeast employing truncated alpha-factor leader sequences |
WO1990001063A1 (en) * | 1988-07-23 | 1990-02-08 | Delta Biotechnology Limited | New secretory leader sequences |
WO1990010075A1 (en) * | 1989-03-03 | 1990-09-07 | Novo Nordisk A/S | Yeast processing system comprising a negatively charged amino acid adjacent to the processing site |
-
1990
- 1990-12-19 DK DK300090A patent/DK300090D0/en not_active Application Discontinuation
-
1991
- 1991-12-17 NZ NZ241011A patent/NZ241011A/en unknown
- 1991-12-18 EP EP92901668A patent/EP0563175A1/en not_active Ceased
- 1991-12-18 JP JP4502056A patent/JPH06503957A/en not_active Expired - Lifetime
- 1991-12-18 CZ CS931192A patent/CZ119293A3/en unknown
- 1991-12-18 ZA ZA919932A patent/ZA919932B/en unknown
- 1991-12-18 IL IL100408A patent/IL100408A0/en unknown
- 1991-12-18 CA CA002098731A patent/CA2098731A1/en not_active Abandoned
- 1991-12-18 AU AU91348/91A patent/AU660161B2/en not_active Ceased
- 1991-12-18 WO PCT/DK1991/000396 patent/WO1992011378A1/en not_active Application Discontinuation
- 1991-12-18 FI FI932831A patent/FI932831A0/en not_active Application Discontinuation
- 1991-12-18 KR KR1019930701890A patent/KR930703450A/en not_active Withdrawn
- 1991-12-18 PT PT99848A patent/PT99848A/en not_active Application Discontinuation
- 1991-12-18 HU HU9301801A patent/HUT68751A/en unknown
- 1991-12-18 SK SK62593A patent/SK62593A3/en unknown
- 1991-12-18 IE IE443391A patent/IE914433A1/en not_active Application Discontinuation
- 1991-12-19 MX MX9102684A patent/MX9102684A/en unknown
Non-Patent Citations (2)
Title |
---|
CHEMICAL ABSTRACTS, Volume 103, No. 15, 14 October 1985, (Columbus, Ohio, US), STEPIEN P.O. et al.: "A human-like preproinsulin leader sequence directs protein secretion in yeast", see page 197, Abstract 117421z, & ICSU Short Rep. 1984, 1, 240-241. * |
Dialog Information Services, File 155, Medline, Dialog Accession No. 07901087, Medline Accession No. 92039087, CLEMENTS J M et al.: "Secretion of human epidermal growth factor from Saccharomyces cerevisiae using synthetic leader sequences", & Gene (Netherlands) Oct 15 1991, 106 (2) p267-71. * |
Cited By (54)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5677172A (en) * | 1992-03-11 | 1997-10-14 | Makarow; Marja | Method for production of proteins in yeast |
US5939287A (en) * | 1992-03-11 | 1999-08-17 | Makarow; Marja | Method for production of proteins in yeast |
US5538863A (en) * | 1993-07-01 | 1996-07-23 | Immunex Corporation | Expression system comprising mutant yeast strain and expression vector encoding synthetic signal peptide |
WO1995034666A1 (en) * | 1994-06-16 | 1995-12-21 | Novo Nordisk A/S | Synthetic leader peptide sequences |
US5639642A (en) * | 1994-06-16 | 1997-06-17 | Novo Nordisk A/S | Synthetic leader peptide sequences |
US5795746A (en) * | 1994-06-16 | 1998-08-18 | Novo Nordisk A/S | Synthetic leader peptide sequences |
CN1131312C (en) * | 1994-06-17 | 2003-12-17 | 诺沃挪第克公司 | N-terminally extended proteins expressed in yeast |
US6500645B1 (en) | 1994-06-17 | 2002-12-31 | Novo Nordisk A/S | N-terminally extended proteins expressed in yeast |
EP0704527A3 (en) * | 1994-08-05 | 1997-08-27 | Pliva Pharm & Chem Works | DNA sequences encoding biosynthetic insulin precursors and process for prepation of insulin |
EP2431462A2 (en) | 1995-03-17 | 2012-03-21 | Novozymes A/S | Novel endoglucanases |
EP1683860A2 (en) | 1995-03-17 | 2006-07-26 | Novozymes A/S | Novel endoglucanases |
EP1975177A1 (en) | 1996-03-01 | 2008-10-01 | Novo Nordisk A/S | An appetite-suppressing peptide, its compositions and use |
EP2295453A2 (en) | 1996-03-01 | 2011-03-16 | Novo Nordisk A/S | An appetite-suppressing peptide, its compositions and use |
US6312923B1 (en) | 1996-12-13 | 2001-11-06 | Chiron Corporation | Method for expression of heterologous proteins in yeast |
US6083723A (en) * | 1996-12-13 | 2000-07-04 | Chiron Corporation | Method for expression of heterologous proteins in yeast |
US6706496B2 (en) | 1996-12-13 | 2004-03-16 | Chiron Corporation | Method for expression of heterologous proteins in yeast |
US6897043B2 (en) | 1996-12-13 | 2005-05-24 | Chiron Corporation | Method for expression of heterologous proteins in yeast |
US6017731A (en) * | 1996-12-13 | 2000-01-25 | Chiron Corporation | Method for expression of heterologous proteins in yeast |
US7166446B2 (en) | 1996-12-13 | 2007-01-23 | Chiron Corporation | Method for expression of heterologous proteins in yeast |
US6183989B1 (en) | 1998-01-23 | 2001-02-06 | Novo Nordisk A/S | Process for making desired polypeptides in yeast |
WO2001049870A1 (en) | 1999-12-29 | 2001-07-12 | Novo Nordisk A/S | Method for making insulin precursors and insulin precursor analogues having improved fermentation yield in yeast |
AT410217B (en) * | 2000-06-15 | 2003-03-25 | Cistem Biotechnologies Gmbh | VECTOR AND A METHOD FOR THE EXPRESSION AND SELECTION OF RANDOMIZED PEPTIDE SEQUENCES |
WO2002077218A1 (en) | 2001-03-22 | 2002-10-03 | Novo Nordisk Health Care Ag | Coagulation factor vii derivatives |
WO2003027147A2 (en) | 2001-09-27 | 2003-04-03 | Novo Nordisk Health Care Ag | Human coagulation factor vii polypeptides |
EP2392655A2 (en) | 2003-09-09 | 2011-12-07 | Novo Nordisk Health Care AG | Coagulation factor VII polypeptides |
WO2005047508A1 (en) | 2003-11-14 | 2005-05-26 | Novo Nordisk A/S | Processes for making acylated insulin |
WO2005054291A1 (en) | 2003-12-03 | 2005-06-16 | Novo Nordisk A/S | Single-chain insulin |
WO2005078116A1 (en) * | 2004-01-16 | 2005-08-25 | Qiuyun Liu | A method of isolating antibacterial peptides and the isolated peptides thereof |
EP2360181A1 (en) | 2005-04-18 | 2011-08-24 | Novo Nordisk A/S | IL-21 variants |
WO2007020256A1 (en) | 2005-08-16 | 2007-02-22 | Novo Nordisk A/S | Method for making mature insulin polypeptides |
EP2316930A1 (en) | 2005-09-14 | 2011-05-04 | Novo Nordisk Health Care AG | Human coagulation factor VII polypeptides |
EP2256129A1 (en) | 2006-02-27 | 2010-12-01 | Novo Nordisk A/S | Insulin derivatives |
WO2008037735A1 (en) | 2006-09-27 | 2008-04-03 | Novo Nordisk A/S | Method for making maturated insulin polypeptides |
WO2009021955A1 (en) | 2007-08-13 | 2009-02-19 | Novo Nordisk A/S | Rapid acting insulin analogues |
WO2009022006A1 (en) | 2007-08-15 | 2009-02-19 | Novo Nordisk A/S | Insulins with an acyl moiety comprising repeating units of alkylene glycol containing amino acids |
EP2708554A1 (en) | 2007-08-15 | 2014-03-19 | Novo Nordisk A/S | Insulin analogues with an acyl and alkylene glycol moiety |
WO2009022013A1 (en) | 2007-08-15 | 2009-02-19 | Novo Nordisk A/S | Insulin analogues with an acyl and aklylene glycol moiety |
WO2010103038A1 (en) | 2009-03-11 | 2010-09-16 | Novo Nordisk A/S | Interleukin-21 variants having antagonistic binding to the il-21 receptor |
WO2011006982A2 (en) | 2009-07-17 | 2011-01-20 | Rigshospitalet | Inhibitors of complement activation |
EP3395828A1 (en) | 2009-07-17 | 2018-10-31 | Rigshospitalet | Inhibitors of complement activation |
WO2011064282A1 (en) | 2009-11-25 | 2011-06-03 | Novo Nordisk A/S | Method for making polypeptides |
WO2011067283A1 (en) | 2009-12-01 | 2011-06-09 | Novo Nordisk A/S | Novel peptidyl alpha-hydroxyglycine alpha-amidating lyases |
WO2011089170A2 (en) | 2010-01-22 | 2011-07-28 | Novo Nordisk A/S | Process for preparing FGF-21 with low degree of O-glycosylation |
EP3815708A1 (en) | 2010-03-05 | 2021-05-05 | Omeros Corporation | Chimeric inhibitor molecules of complement activation |
WO2011107591A1 (en) | 2010-03-05 | 2011-09-09 | Rigshospitalet | Chimeric inhibitor molecules of complement activation |
EP2598527A4 (en) * | 2010-07-28 | 2014-01-08 | Smartcells Inc | Recombinantly expressed insulin polypeptides and uses thereof |
US9074015B2 (en) * | 2010-07-28 | 2015-07-07 | Smartcells, Inc. | Recombinantly expressed insulin polypeptides and uses thereof |
US20130190476A1 (en) * | 2010-07-28 | 2013-07-25 | Thomas M. Lancaster | Recombinantly expressed insulin polypeptides and uses thereof |
WO2013117705A1 (en) | 2012-02-09 | 2013-08-15 | Var2 Pharmaceuticals Aps | Targeting of chondroitin sulfate glycans |
WO2014060401A1 (en) | 2012-10-15 | 2014-04-24 | Novo Nordisk Health Care Ag | Coagulation factor VII polypeptides |
WO2014195452A1 (en) | 2013-06-07 | 2014-12-11 | Novo Nordisk A/S | Method for making mature insulin polypeptides |
CN106029688A (en) * | 2014-02-28 | 2016-10-12 | Novo Nordisk A/S | Mating factor alpha propeptide variants |
US12024542B2 (en) | 2014-02-28 | 2024-07-02 | Novo Nordisk A/S | Mating factor alpha pro-peptide variants |
WO2020012021A1 (en) | 2018-07-13 | 2020-01-16 | Varct Diagnostics Aps | Isolation of circulating cells of fetal origin using recombinant malaria protein VAR2CSA |
Also Published As
Publication number | Publication date |
---|---|
AU9134891A (en) | 1992-07-22 |
HU9301801D0 (en) | 1993-10-28 |
FI932831L (en) | 1993-06-18 |
IL100408A0 (en) | 1992-09-06 |
AU660161B2 (en) | 1995-06-15 |
EP0563175A1 (en) | 1993-10-06 |
MX9102684A (en) | 1992-06-01 |
KR930703450A (en) | 1993-11-30 |
JPH06503957A (en) | 1994-05-12 |
HUT68751A (en) | 1995-07-28 |
NZ241011A (en) | 1993-04-28 |
ZA919932B (en) | 1992-08-26 |
DK300090D0 (en) | 1990-12-19 |
CA2098731A1 (en) | 1992-06-19 |
CZ119293A3 (en) | 1994-02-16 |
FI932831A0 (en) | 1993-06-18 |
PT99848A (en) | 1993-06-30 |
SK62593A3 (en) | 1993-10-06 |
IE914433A1 (en) | 1992-07-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU660161B2 (en) | A method of constructing synthetic leader sequences | |
EP0763117B1 (en) | Synthetic leader peptide sequences | |
EP0792367B1 (en) | A DNA construct encoding the YAP3 signal peptide | |
JP3730255B2 (en) | N-terminal extended protein expressed in yeast | |
US6500645B1 (en) | N-terminally extended proteins expressed in yeast | |
US5316923A (en) | Synthetic yeast leader peptides | |
IE892361L (en) | Peptide and DNA sequences | |
EP0946735A1 (en) | N-terminally extended proteins expressed in yeast | |
EP0868523B1 (en) | Vector for expression of N-terminally extended proteins in yeast cell | |
CA2192942C (en) | Synthetic leader peptide sequences | |
RU2167939C2 (en) | Method of polypeptide producing in yeast |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | AK | Designated states | Kind code of ref document: A1; Designated state(s): AU CA CS FI HU JP KR NO PL SU US |
 | AL | Designated countries for regional patents | Kind code of ref document: A1; Designated state(s): AT BE CH DE DK ES FR GB GR IT LU MC NL SE |
 | WWE | WIPO information: entry into national phase | Ref document number: 1992901668; Country of ref document: EP |
 | WWE | WIPO information: entry into national phase | Ref document number: 62593; Country of ref document: SK. Ref document number: 2098731; Country of ref document: CA. Ref document number: PV1993-1192; Country of ref document: CZ |
 | WWE | WIPO information: entry into national phase | Ref document number: 932831; Country of ref document: FI |
 | WWP | WIPO information: published in national office | Ref document number: 1992901668; Country of ref document: EP |
 | WWP | WIPO information: published in national office | Ref document number: PV1993-1192; Country of ref document: CZ |
 | WWR | WIPO information: refused in national office | Ref document number: PV1993-1192; Country of ref document: CZ |
 | WWR | WIPO information: refused in national office | Ref document number: 1992901668; Country of ref document: EP |
 | WWW | WIPO information: withdrawn in national office | Ref document number: 1992901668; Country of ref document: EP |