WO2015127365A2 - Calcium-independent sortase a mutants - Google Patents
Calcium-independent sortase a mutants Download PDFInfo
- Publication number
- WO2015127365A2 WO2015127365A2 PCT/US2015/017116 US2015017116W WO2015127365A2 WO 2015127365 A2 WO2015127365 A2 WO 2015127365A2 US 2015017116 W US2015017116 W US 2015017116W WO 2015127365 A2 WO2015127365 A2 WO 2015127365A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- seq
- mutant
- amino acid
- sortase
- calcium
- Prior art date
Links
- 239000011575 calcium Substances 0.000 title claims description 93
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 title claims description 89
- 229910052791 calcium Inorganic materials 0.000 title claims description 89
- 108090000250 sortase A Proteins 0.000 claims abstract description 166
- 230000003197 catalytic effect Effects 0.000 claims abstract description 65
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 195
- 108090000623 proteins and genes Proteins 0.000 claims description 140
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 136
- 102000004169 proteins and genes Human genes 0.000 claims description 126
- 102000040430 polynucleotide Human genes 0.000 claims description 112
- 108091033319 polynucleotide Proteins 0.000 claims description 112
- 239000002157 polynucleotide Substances 0.000 claims description 112
- 238000006467 substitution reaction Methods 0.000 claims description 99
- 125000000539 amino acid group Chemical group 0.000 claims description 88
- 238000000034 method Methods 0.000 claims description 79
- 239000002773 nucleotide Substances 0.000 claims description 60
- 102000004190 Enzymes Human genes 0.000 claims description 57
- 108090000790 Enzymes Proteins 0.000 claims description 57
- 239000000203 mixture Substances 0.000 claims description 57
- 210000004027 cell Anatomy 0.000 claims description 56
- 125000003729 nucleotide group Chemical group 0.000 claims description 56
- 150000001413 amino acids Chemical class 0.000 claims description 49
- 210000004899 c-terminal region Anatomy 0.000 claims description 39
- 150000007523 nucleic acids Chemical class 0.000 claims description 35
- 102000039446 nucleic acids Human genes 0.000 claims description 33
- 108020004707 nucleic acids Proteins 0.000 claims description 33
- 238000006243 chemical reaction Methods 0.000 claims description 25
- 230000001404 mediated effect Effects 0.000 claims description 21
- 229920000642 polymer Polymers 0.000 claims description 12
- 238000012217 deletion Methods 0.000 claims description 11
- 230000037430 deletion Effects 0.000 claims description 11
- 150000003384 small molecules Chemical class 0.000 claims description 10
- 150000002632 lipids Chemical class 0.000 claims description 8
- 238000012258 culturing Methods 0.000 claims description 7
- 239000003814 drug Substances 0.000 claims description 7
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 claims description 7
- 239000001963 growth medium Substances 0.000 claims description 7
- 229910052751 metal Inorganic materials 0.000 claims description 7
- 239000002184 metal Substances 0.000 claims description 7
- 239000003053 toxin Substances 0.000 claims description 7
- 231100000765 toxin Toxicity 0.000 claims description 7
- 239000000427 antigen Substances 0.000 claims description 6
- 102000036639 antigens Human genes 0.000 claims description 6
- 108091007433 antigens Proteins 0.000 claims description 6
- 150000001720 carbohydrates Chemical class 0.000 claims description 6
- 102000005701 Calcium-Binding Proteins Human genes 0.000 claims description 5
- 108010045403 Calcium-Binding Proteins Proteins 0.000 claims description 5
- 239000004971 Cross linker Substances 0.000 claims description 5
- 229940124597 therapeutic agent Drugs 0.000 claims description 5
- 239000003054 catalyst Substances 0.000 claims description 4
- 239000002738 chelating agent Substances 0.000 claims description 4
- 239000002872 contrast media Substances 0.000 claims description 4
- 125000003282 alkyl amino group Chemical group 0.000 claims 2
- 230000001747 exhibiting effect Effects 0.000 abstract description 3
- 101100478373 Bacillus subtilis (strain 168) srtD gene Proteins 0.000 description 311
- 101150087718 srtA gene Proteins 0.000 description 311
- 102000004196 processed proteins & peptides Human genes 0.000 description 122
- 229920001184 polypeptide Polymers 0.000 description 117
- 235000001014 amino acid Nutrition 0.000 description 101
- 235000018102 proteins Nutrition 0.000 description 74
- 229940088598 enzyme Drugs 0.000 description 47
- 239000012634 fragment Substances 0.000 description 32
- 239000000758 substrate Substances 0.000 description 30
- 239000000047 product Substances 0.000 description 27
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 22
- 239000000243 solution Substances 0.000 description 20
- 239000013615 primer Substances 0.000 description 19
- 230000000694 effects Effects 0.000 description 18
- 230000001419 dependent effect Effects 0.000 description 17
- 230000035772 mutation Effects 0.000 description 16
- 239000012038 nucleophile Substances 0.000 description 16
- 230000004048 modification Effects 0.000 description 12
- 238000012986 modification Methods 0.000 description 12
- 239000013598 vector Substances 0.000 description 12
- 239000011616 biotin Substances 0.000 description 11
- 229960002685 biotin Drugs 0.000 description 11
- 235000020958 biotin Nutrition 0.000 description 11
- 238000003780 insertion Methods 0.000 description 11
- 230000037431 insertion Effects 0.000 description 11
- 238000000746 purification Methods 0.000 description 11
- 102200074006 rs35908728 Human genes 0.000 description 11
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 10
- 239000003795 chemical substances by application Substances 0.000 description 10
- 239000000126 substance Substances 0.000 description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 description 9
- 102220388294 c.281C>G Human genes 0.000 description 9
- 238000003776 cleavage reaction Methods 0.000 description 9
- 239000000499 gel Substances 0.000 description 9
- 230000007017 scission Effects 0.000 description 9
- 230000014509 gene expression Effects 0.000 description 8
- 238000002372 labelling Methods 0.000 description 8
- 229920001223 polyethylene glycol Polymers 0.000 description 8
- 230000008569 process Effects 0.000 description 8
- 241000894007 species Species 0.000 description 8
- -1 1-adamantyl Chemical group 0.000 description 7
- 108020004414 DNA Proteins 0.000 description 7
- 102000053602 DNA Human genes 0.000 description 7
- 239000002202 Polyethylene glycol Substances 0.000 description 7
- 101800001707 Spacer peptide Proteins 0.000 description 7
- 239000000539 dimer Substances 0.000 description 7
- 238000004519 manufacturing process Methods 0.000 description 7
- 230000002441 reversible effect Effects 0.000 description 7
- 238000001542 size-exclusion chromatography Methods 0.000 description 7
- ABZLKHKQJHEPAX-UHFFFAOYSA-N tetramethylrhodamine Chemical compound C=12C=CC(N(C)C)=CC2=[O+]C2=CC(N(C)C)=CC=C2C=1C1=CC=CC=C1C([O-])=O ABZLKHKQJHEPAX-UHFFFAOYSA-N 0.000 description 7
- 101100339481 Cochliobolus miyabeanus HOG1 gene Proteins 0.000 description 6
- 241000588724 Escherichia coli Species 0.000 description 6
- 241001465754 Metazoa Species 0.000 description 6
- 101150009928 SRM1 gene Proteins 0.000 description 6
- 230000027455 binding Effects 0.000 description 6
- OVBPIULPVIDEAO-LBPRGKRZSA-N folic acid Chemical compound C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-LBPRGKRZSA-N 0.000 description 6
- 238000005924 transacylation reaction Methods 0.000 description 6
- 241000193738 Bacillus anthracis Species 0.000 description 5
- 241000894006 Bacteria Species 0.000 description 5
- 241000495778 Escherichia faecalis Species 0.000 description 5
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 5
- 241000193996 Streptococcus pyogenes Species 0.000 description 5
- 230000008901 benefit Effects 0.000 description 5
- 229910052799 carbon Inorganic materials 0.000 description 5
- 150000001875 compounds Chemical class 0.000 description 5
- 230000021615 conjugation Effects 0.000 description 5
- 235000014113 dietary fatty acids Nutrition 0.000 description 5
- 229930195729 fatty acid Natural products 0.000 description 5
- 239000000194 fatty acid Substances 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 108020004999 messenger RNA Proteins 0.000 description 5
- 239000000178 monomer Substances 0.000 description 5
- 229910052759 nickel Inorganic materials 0.000 description 5
- 241000124008 Mammalia Species 0.000 description 4
- 108010052285 Membrane Proteins Proteins 0.000 description 4
- 241000288906 Primates Species 0.000 description 4
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 4
- 102100030413 Spermidine synthase Human genes 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- 235000014633 carbohydrates Nutrition 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 4
- 230000001351 cycling effect Effects 0.000 description 4
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 4
- 239000013604 expression vector Substances 0.000 description 4
- 150000004665 fatty acids Chemical class 0.000 description 4
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 4
- 235000019152 folic acid Nutrition 0.000 description 4
- 239000011724 folic acid Substances 0.000 description 4
- 230000005291 magnetic effect Effects 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 239000005022 packaging material Substances 0.000 description 4
- 230000035515 penetration Effects 0.000 description 4
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 4
- 108020003175 receptors Proteins 0.000 description 4
- 102000005962 receptors Human genes 0.000 description 4
- 238000011084 recovery Methods 0.000 description 4
- 239000000523 sample Substances 0.000 description 4
- 238000002553 single reaction monitoring Methods 0.000 description 4
- 238000013426 sirius red morphometry Methods 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- 238000003860 storage Methods 0.000 description 4
- 238000001890 transfection Methods 0.000 description 4
- 229920002307 Dextran Polymers 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- 241000186779 Listeria monocytogenes Species 0.000 description 3
- 102000018697 Membrane Proteins Human genes 0.000 description 3
- 102000008300 Mutant Proteins Human genes 0.000 description 3
- 108010021466 Mutant Proteins Proteins 0.000 description 3
- 108091005804 Peptidases Proteins 0.000 description 3
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 3
- 239000004365 Protease Substances 0.000 description 3
- 241000283984 Rodentia Species 0.000 description 3
- 238000007792 addition Methods 0.000 description 3
- 150000003973 alkyl amines Chemical group 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 150000001412 amines Chemical class 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 238000006471 dimerization reaction Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000002209 hydrophobic effect Effects 0.000 description 3
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 238000010348 incorporation Methods 0.000 description 3
- 230000017730 intein-mediated protein splicing Effects 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 239000003068 molecular probe Substances 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 210000001519 tissue Anatomy 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- SLXKOJJOQWFEFD-UHFFFAOYSA-N 6-aminohexanoic acid Chemical compound NCCCCCC(O)=O SLXKOJJOQWFEFD-UHFFFAOYSA-N 0.000 description 2
- 102000004506 Blood Proteins Human genes 0.000 description 2
- 108010017384 Blood Proteins Proteins 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical compound [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 241000282326 Felis catus Species 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- 102000005720 Glutathione transferase Human genes 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- 101000759918 Homo sapiens Ubiquitin carboxyl-terminal hydrolase isozyme L3 Proteins 0.000 description 2
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical group CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- OVBPIULPVIDEAO-UHFFFAOYSA-N N-Pteroyl-L-glutaminsaeure Natural products C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)NC(CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-UHFFFAOYSA-N 0.000 description 2
- 229930193140 Neomycin Natural products 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- 241000009328 Perro Species 0.000 description 2
- 239000004793 Polystyrene Substances 0.000 description 2
- 239000004372 Polyvinyl alcohol Substances 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 239000012505 Superdex™ Substances 0.000 description 2
- 108090000190 Thrombin Proteins 0.000 description 2
- 102100025040 Ubiquitin carboxyl-terminal hydrolase isozyme L3 Human genes 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 125000004429 atom Chemical group 0.000 description 2
- 239000011324 bead Substances 0.000 description 2
- 230000001588 bifunctional effect Effects 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 239000006143 cell culture medium Substances 0.000 description 2
- 235000012000 cholesterol Nutrition 0.000 description 2
- 239000013078 crystal Substances 0.000 description 2
- 210000000805 cytoplasm Anatomy 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 150000002148 esters Chemical class 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 2
- 229940014144 folate Drugs 0.000 description 2
- 229960000304 folic acid Drugs 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 2
- VOZRXNHHFUQHIL-UHFFFAOYSA-N glycidyl methacrylate Chemical compound CC(=C)C(=O)OCC1CO1 VOZRXNHHFUQHIL-UHFFFAOYSA-N 0.000 description 2
- 238000011194 good manufacturing practice Methods 0.000 description 2
- 239000003102 growth factor Substances 0.000 description 2
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 2
- 238000003384 imaging method Methods 0.000 description 2
- 230000005847 immunogenicity Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 238000005304 joining Methods 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 230000029226 lipidation Effects 0.000 description 2
- 239000006166 lysate Substances 0.000 description 2
- 239000006249 magnetic particle Substances 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 125000000956 methoxy group Chemical group [H]C([H])([H])O* 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 229960004927 neomycin Drugs 0.000 description 2
- 230000001537 neural effect Effects 0.000 description 2
- 238000006384 oligomerization reaction Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 238000010647 peptide synthesis reaction Methods 0.000 description 2
- 239000000816 peptidomimetic Substances 0.000 description 2
- 238000002823 phage display Methods 0.000 description 2
- CTRLRINCMYICJO-UHFFFAOYSA-N phenyl azide Chemical compound [N-]=[N+]=NC1=CC=CC=C1 CTRLRINCMYICJO-UHFFFAOYSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 2
- 239000010452 phosphate Substances 0.000 description 2
- 229920000233 poly(alkylene oxides) Polymers 0.000 description 2
- 229920001515 polyalkylene glycol Polymers 0.000 description 2
- 229920001451 polypropylene glycol Polymers 0.000 description 2
- 229920002223 polystyrene Polymers 0.000 description 2
- 229920002451 polyvinyl alcohol Polymers 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 238000002818 protein evolution Methods 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 238000010188 recombinant method Methods 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 229930190553 ribostamycin Natural products 0.000 description 2
- 229960003485 ribostamycin Drugs 0.000 description 2
- NSKGQURZWSPSBC-NLZFXWNVSA-N ribostamycin Chemical compound N[C@H]1[C@H](O)[C@@H](O)[C@H](CN)O[C@@H]1O[C@@H]1[C@@H](O[C@H]2[C@@H]([C@@H](O)[C@H](CO)O2)O)[C@H](O)[C@@H](N)C[C@H]1N NSKGQURZWSPSBC-NLZFXWNVSA-N 0.000 description 2
- NSKGQURZWSPSBC-UHFFFAOYSA-N ribostamycin A Natural products NC1C(O)C(O)C(CN)OC1OC1C(OC2C(C(O)C(CO)O2)O)C(O)C(N)CC1N NSKGQURZWSPSBC-UHFFFAOYSA-N 0.000 description 2
- 229920002477 rna polymer Polymers 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 125000006850 spacer group Chemical group 0.000 description 2
- 230000009870 specific binding Effects 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- 125000003396 thiol group Chemical group [H]S* 0.000 description 2
- 229960004072 thrombin Drugs 0.000 description 2
- 241000701447 unidentified baculovirus Species 0.000 description 2
- 241001515965 unidentified phage Species 0.000 description 2
- 239000011800 void material Substances 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- HDXVFGYNQCBFKD-JGWLITMVSA-N (2r,3s,4r,5r)-6-amino-2,3,4,5-tetrahydroxyhexanal Chemical compound NC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C=O HDXVFGYNQCBFKD-JGWLITMVSA-N 0.000 description 1
- HDXVFGYNQCBFKD-KVTDHHQDSA-N (2s,3s,4r,5r)-6-amino-2,3,4,5-tetrahydroxyhexanal Chemical compound NC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O HDXVFGYNQCBFKD-KVTDHHQDSA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- AXAVXPMQTGXXJZ-UHFFFAOYSA-N 2-aminoacetic acid;2-amino-2-(hydroxymethyl)propane-1,3-diol Chemical compound NCC(O)=O.OCC(N)(CO)CO AXAVXPMQTGXXJZ-UHFFFAOYSA-N 0.000 description 1
- BZTDTCNHAFUJOG-UHFFFAOYSA-N 6-carboxyfluorescein Chemical compound C12=CC=C(O)C=C2OC2=CC(O)=CC=C2C11OC(=O)C2=CC=C(C(=O)O)C=C21 BZTDTCNHAFUJOG-UHFFFAOYSA-N 0.000 description 1
- CJIJXIFQYOPWTF-UHFFFAOYSA-N 7-hydroxycoumarin Natural products O1C(=O)C=CC2=CC(O)=CC=C21 CJIJXIFQYOPWTF-UHFFFAOYSA-N 0.000 description 1
- 108010066676 Abrin Proteins 0.000 description 1
- 102000012440 Acetylcholinesterase Human genes 0.000 description 1
- 108010022752 Acetylcholinesterase Proteins 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 108010000239 Aequorin Proteins 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 108010011170 Ala-Trp-Arg-His-Pro-Gln-Phe-Gly-Gly Proteins 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- 239000012103 Alexa Fluor 488 Substances 0.000 description 1
- 239000012114 Alexa Fluor 647 Substances 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 101800002011 Amphipathic peptide Proteins 0.000 description 1
- 241000282672 Ateles sp. Species 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 241000157302 Bison bison athabascae Species 0.000 description 1
- 102100025250 C-X-C motif chemokine 14 Human genes 0.000 description 1
- 102100024217 CAMPATH-1 antigen Human genes 0.000 description 1
- 108010065524 CD52 Antigen Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 241000282465 Canis Species 0.000 description 1
- 241000282461 Canis lupus Species 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 241000282994 Cervidae Species 0.000 description 1
- 108010012236 Chemokines Proteins 0.000 description 1
- 102000019034 Chemokines Human genes 0.000 description 1
- 102100022641 Coagulation factor IX Human genes 0.000 description 1
- 102100023804 Coagulation factor VII Human genes 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- IGXWBGJHJZYPQS-SSDOTTSWSA-N D-Luciferin Chemical compound OC(=O)[C@H]1CSC(C=2SC3=CC=C(O)C=C3N=2)=N1 IGXWBGJHJZYPQS-SSDOTTSWSA-N 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Polymers OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- XPDXVDYUQZHFPV-UHFFFAOYSA-N Dansyl Chloride Chemical compound C1=CC=C2C(N(C)C)=CC=CC2=C1S(Cl)(=O)=O XPDXVDYUQZHFPV-UHFFFAOYSA-N 0.000 description 1
- CYCGRDQQIOGCKX-UHFFFAOYSA-N Dehydro-luciferin Natural products OC(=O)C1=CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 CYCGRDQQIOGCKX-UHFFFAOYSA-N 0.000 description 1
- 102000016607 Diphtheria Toxin Human genes 0.000 description 1
- 108010053187 Diphtheria Toxin Proteins 0.000 description 1
- 241000271571 Dromaius novaehollandiae Species 0.000 description 1
- 241000701832 Enterobacteria phage T3 Species 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 102000003951 Erythropoietin Human genes 0.000 description 1
- 108090000394 Erythropoietin Proteins 0.000 description 1
- 108010020195 FLAG peptide Proteins 0.000 description 1
- XZWYTXMRWQJBGX-VXBMVYAYSA-N FLAG peptide Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 XZWYTXMRWQJBGX-VXBMVYAYSA-N 0.000 description 1
- 108010076282 Factor IX Proteins 0.000 description 1
- 108010023321 Factor VII Proteins 0.000 description 1
- 108010054218 Factor VIII Proteins 0.000 description 1
- 102000001690 Factor VIII Human genes 0.000 description 1
- 108010014173 Factor X Proteins 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- 239000004230 Fast Yellow AB Substances 0.000 description 1
- 241000282324 Felis Species 0.000 description 1
- BJGNCJDXODQBOB-UHFFFAOYSA-N Fivefly Luciferin Natural products OC(=O)C1CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 BJGNCJDXODQBOB-UHFFFAOYSA-N 0.000 description 1
- MURGITYSBWUQTI-UHFFFAOYSA-N Fluorescin Natural products OC(=O)C1=CC=CC=C1C1C2=CC=C(O)C=C2OC2=CC(O)=CC=C21 MURGITYSBWUQTI-UHFFFAOYSA-N 0.000 description 1
- 241000700662 Fowlpox virus Species 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- KOSRFJWDECSPRO-WDSKDSINSA-N Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O KOSRFJWDECSPRO-WDSKDSINSA-N 0.000 description 1
- 108010024636 Glutathione Proteins 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 101000858068 Homo sapiens C-X-C motif chemokine 14 Proteins 0.000 description 1
- 101001133056 Homo sapiens Mucin-1 Proteins 0.000 description 1
- 101000884271 Homo sapiens Signal transducer CD24 Proteins 0.000 description 1
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 1
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 1
- 206010021703 Indifference Diseases 0.000 description 1
- 102000005755 Intercellular Signaling Peptides and Proteins Human genes 0.000 description 1
- 108010070716 Intercellular Signaling Peptides and Proteins Proteins 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 102000015696 Interleukins Human genes 0.000 description 1
- 108010063738 Interleukins Proteins 0.000 description 1
- 102000004895 Lipoproteins Human genes 0.000 description 1
- 108090001030 Lipoproteins Proteins 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- DDWFXDSYGUXRAY-UHFFFAOYSA-N Luciferin Natural products CCc1c(C)c(CC2NC(=O)C(=C2C=C)C)[nH]c1Cc3[nH]c4C(=C5/NC(CC(=O)O)C(C)C5CC(=O)O)CC(=O)c4c3C DDWFXDSYGUXRAY-UHFFFAOYSA-N 0.000 description 1
- 241000282553 Macaca Species 0.000 description 1
- 108700018351 Major Histocompatibility Complex Proteins 0.000 description 1
- PEEHTFAAVSWFBL-UHFFFAOYSA-N Maleimide Chemical compound O=C1NC(=O)C=C1 PEEHTFAAVSWFBL-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 1
- 241000283923 Marmota monax Species 0.000 description 1
- 241001508691 Martes zibellina Species 0.000 description 1
- 102100034256 Mucin-1 Human genes 0.000 description 1
- MSFSPUZXLOGKHJ-UHFFFAOYSA-N Muraminsaeure Natural products OC(=O)C(C)OC1C(N)C(O)OC(CO)C1O MSFSPUZXLOGKHJ-UHFFFAOYSA-N 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 102000008934 Muscle Proteins Human genes 0.000 description 1
- 108010074084 Muscle Proteins Proteins 0.000 description 1
- 241000282339 Mustela Species 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 1
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- 108010025020 Nerve Growth Factor Proteins 0.000 description 1
- 102000007072 Nerve Growth Factors Human genes 0.000 description 1
- 102000005348 Neuraminidase Human genes 0.000 description 1
- 108010006232 Neuraminidase Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 241000282579 Pan Species 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108010013639 Peptidoglycan Proteins 0.000 description 1
- 108010043958 Peptoids Proteins 0.000 description 1
- 108010004729 Phycoerythrin Proteins 0.000 description 1
- 108700011066 PreScission Protease Proteins 0.000 description 1
- 101800004937 Protein C Proteins 0.000 description 1
- 102000017975 Protein C Human genes 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 101000762949 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) Exotoxin A Proteins 0.000 description 1
- 101001039269 Rattus norvegicus Glycine N-methyltransferase Proteins 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010039491 Ricin Proteins 0.000 description 1
- 241000277331 Salmonidae Species 0.000 description 1
- 101800001700 Saposin-D Proteins 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 108010071390 Serum Albumin Proteins 0.000 description 1
- 102000007562 Serum Albumin Human genes 0.000 description 1
- 102100038081 Signal transducer CD24 Human genes 0.000 description 1
- FBPFZTCFMRRESA-NQAPHZHOSA-N Sorbitol Polymers OCC(O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-NQAPHZHOSA-N 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 241000272534 Struthio camelus Species 0.000 description 1
- 241000701093 Suid alphaherpesvirus 1 Species 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 108010076818 TEV protease Proteins 0.000 description 1
- 101710120037 Toxin CcdB Proteins 0.000 description 1
- 102000003929 Transaminases Human genes 0.000 description 1
- 108090000340 Transaminases Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 241000282485 Vulpes vulpes Species 0.000 description 1
- 229940022698 acetylcholinesterase Drugs 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 238000004115 adherent culture Methods 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 239000007801 affinity label Substances 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 125000006295 amino methylene group Chemical group [H]N(*)C([H])([H])* 0.000 description 1
- 239000002647 aminoglycoside antibiotic agent Substances 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 239000003708 ampul Substances 0.000 description 1
- 238000004873 anchoring Methods 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 210000001130 astrocyte Anatomy 0.000 description 1
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 239000006172 buffering agent Substances 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 102000028861 calmodulin binding Human genes 0.000 description 1
- 108091000084 calmodulin binding Proteins 0.000 description 1
- 239000011111 cardboard Substances 0.000 description 1
- 241001233037 catfish Species 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 239000000919 ceramic Substances 0.000 description 1
- 230000009920 chelation Effects 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000012411 cloning technique Methods 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- FDJOLVPMNUYSCM-UVKKECPRSA-L cobalt(3+);[(2r,3s,4r,5s)-5-(5,6-dimethylbenzimidazol-1-yl)-4-hydroxy-2-(hydroxymethyl)oxolan-3-yl] [(2r)-1-[3-[(2r,3r,4z,7s,9z,12s,13s,14z,17s,18s,19r)-2,13,18-tris(2-amino-2-oxoethyl)-7,12,17-tris(3-amino-3-oxopropyl)-3,5,8,8,13,15,18,19-octamethyl-2,7, Chemical compound [Co+3].N#[C-].C1([C@H](CC(N)=O)[C@@]2(C)CCC(=O)NC[C@@H](C)OP([O-])(=O)O[C@H]3[C@H]([C@H](O[C@@H]3CO)N3C4=CC(C)=C(C)C=C4N=C3)O)[N-]\C2=C(C)/C([C@H](C\2(C)C)CCC(N)=O)=N/C/2=C\C([C@H]([C@@]/2(CC(N)=O)C)CCC(N)=O)=N\C\2=C(C)/C2=N[C@]1(C)[C@@](C)(CC(N)=O)[C@@H]2CCC(N)=O FDJOLVPMNUYSCM-UVKKECPRSA-L 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000001268 conjugating effect Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 229920001577 copolymer Polymers 0.000 description 1
- 230000006240 deamidation Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000012407 engineering method Methods 0.000 description 1
- 229940105423 erythropoietin Drugs 0.000 description 1
- DEFVIWRASFVYLL-UHFFFAOYSA-N ethylene glycol bis(2-aminoethyl)tetraacetic acid Chemical compound OC(=O)CN(CC(O)=O)CCOCCOCCN(CC(O)=O)CC(O)=O DEFVIWRASFVYLL-UHFFFAOYSA-N 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 238000002270 exclusion chromatography Methods 0.000 description 1
- 229960004222 factor ix Drugs 0.000 description 1
- 229940012413 factor vii Drugs 0.000 description 1
- 229960000301 factor viii Drugs 0.000 description 1
- 229940012426 factor x Drugs 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 235000019688 fish Nutrition 0.000 description 1
- 239000011888 foil Substances 0.000 description 1
- 108091011001 folic acid binding proteins Proteins 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 150000002303 glucose derivatives Polymers 0.000 description 1
- 150000002304 glucoses Polymers 0.000 description 1
- 125000002791 glucosyl group Polymers C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 229960003180 glutathione Drugs 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 230000036252 glycation Effects 0.000 description 1
- 150000002314 glycerols Polymers 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 1
- 229920001477 hydrophilic polymer Polymers 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 239000012642 immune effector Substances 0.000 description 1
- 229940121354 immunomodulator Drugs 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 150000002484 inorganic compounds Chemical class 0.000 description 1
- 229910010272 inorganic material Inorganic materials 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 102000006495 integrins Human genes 0.000 description 1
- 108010044426 integrins Proteins 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 229940047124 interferons Drugs 0.000 description 1
- 229940047122 interleukins Drugs 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 229940029329 intrinsic factor Drugs 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 125000005647 linker group Chemical group 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- HWYHZTIRURJOHG-UHFFFAOYSA-N luminol Chemical compound O=C1NNC(=O)C2=C1C(N)=CC=C2 HWYHZTIRURJOHG-UHFFFAOYSA-N 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- CWWARWOPSKGELM-SARDKLJWSA-N methyl (2s)-2-[[(2s)-2-[[2-[[(2s)-2-[[(2s)-2-[[(2s)-5-amino-2-[[(2s)-5-amino-2-[[(2s)-1-[(2s)-6-amino-2-[[(2s)-1-[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]-5-oxopentanoyl]amino]-5 Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)OC)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CCCN=C(N)N)C1=CC=CC=C1 CWWARWOPSKGELM-SARDKLJWSA-N 0.000 description 1
- 210000000274 microglia Anatomy 0.000 description 1
- 108091005601 modified peptides Proteins 0.000 description 1
- 102000035118 modified proteins Human genes 0.000 description 1
- 108091005573 modified proteins Proteins 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- ZTLGJPIZUOVDMT-UHFFFAOYSA-N n,n-dichlorotriazin-4-amine Chemical compound ClN(Cl)C1=CC=NN=N1 ZTLGJPIZUOVDMT-UHFFFAOYSA-N 0.000 description 1
- 210000004897 n-terminal region Anatomy 0.000 description 1
- 239000002105 nanoparticle Substances 0.000 description 1
- 210000004498 neuroglial cell Anatomy 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 239000003900 neurotrophic factor Substances 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 125000001209 o-nitrophenyl group Chemical group [H]C1=C([H])C(*)=C(C([H])=C1[H])[N+]([O-])=O 0.000 description 1
- 210000004248 oligodendroglia Anatomy 0.000 description 1
- 102000027450 oncoproteins Human genes 0.000 description 1
- 108091008819 oncoproteins Proteins 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 150000002894 organic compounds Chemical class 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 239000000123 paper Substances 0.000 description 1
- 239000011087 paperboard Substances 0.000 description 1
- 230000005298 paramagnetic effect Effects 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 229920000765 poly(2-oxazolines) Polymers 0.000 description 1
- 229920005646 polycarboxylate Polymers 0.000 description 1
- 229920002704 polyhistidine Polymers 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 238000011176 pooling Methods 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 230000002335 preservative effect Effects 0.000 description 1
- 125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 1
- 229960000856 protein c Drugs 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 230000006337 proteolytic cleavage Effects 0.000 description 1
- 239000012264 purified product Substances 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 239000004017 serum-free culture medium Substances 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 230000003381 solubilizing effect Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 108010018381 streptavidin-binding peptide Proteins 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 150000003461 sulfonyl halides Chemical class 0.000 description 1
- 230000020382 suppression by virus of host antigen processing and presentation of peptide antigen via MHC class I Effects 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 238000004114 suspension culture Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 125000003441 thioacyl group Chemical group 0.000 description 1
- 125000000341 threoninyl group Chemical group [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 229960000707 tobramycin Drugs 0.000 description 1
- NLVFBUXFDBBNBW-PBSUHMDJSA-N tobramycin Chemical compound N[C@@H]1C[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N NLVFBUXFDBBNBW-PBSUHMDJSA-N 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- FGMPLJWBKKVCDB-UHFFFAOYSA-N trans-L-hydroxy-proline Natural products ON1CCCC1C(O)=O FGMPLJWBKKVCDB-UHFFFAOYSA-N 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 150000004043 trisaccharides Chemical group 0.000 description 1
- 230000004614 tumor growth Effects 0.000 description 1
- ORHBXUUXSCNDEV-UHFFFAOYSA-N umbelliferone Chemical compound C1=CC(=O)OC2=CC(O)=CC=C21 ORHBXUUXSCNDEV-UHFFFAOYSA-N 0.000 description 1
- HFTAFOQKODTIJY-UHFFFAOYSA-N umbelliferone Natural products Cc1cc2C=CC(=O)Oc2cc1OCC=CC(C)(C)O HFTAFOQKODTIJY-UHFFFAOYSA-N 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 229940045999 vitamin b 12 Drugs 0.000 description 1
- 229920003169 water-soluble polymer Polymers 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y304/00—Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
- C12Y304/22—Cysteine endopeptidases (3.4.22)
- C12Y304/2207—Sortase A (3.4.22.70)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/50—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
- C12N9/52—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from bacteria or Archaea
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/20—Fusion polypeptide containing a tag with affinity for a non-protein ligand
- C07K2319/21—Fusion polypeptide containing a tag with affinity for a non-protein ligand containing a His-tag
Definitions
- Protein engineering is becoming a widely used tool in many areas of protein biochemistry.
- One engineering method is controlled protein ligation.
- Native chemical protein ligation relies on efficient preparation of synthetic peptide esters, which can be technically difficult to prepare for many proteins.
- Recombinant technologies can be used to generate protein-protein fusions, joining the C-terminus of one protein with the N-terrninus of another protein.
- Intein-based protein ligation systems can also be used to join proteins.
- a prerequisite for this intein-mediated ligation method is that the target protein is expressed as a correctly folded fusion with the intein, which is often
- the transpeptidation reaction catalyzed by sortases has emerged as a general method for derivatizing proteins with various types of modifications.
- target proteins are engineered to contain a sortase recognition motif (LPXT) near their C- termini .
- LXT sortase recognition motif
- these artificial sortase substrates undergo a transacylation reaction resulting in the exchange of residues C-terminal to the threonine residue with the synthetic oligoglycine peptide , resulting in the protein C-terminus being ligated to the N- terminus of the synthetic peptide .
- SaSrtA S . aureus sortase A
- Ca 2+ dependency that can limit the effectiveness of SaSrtA in low Ca 2+ concentrations , such as the cytoplasm or when Ca 2+ binding compounds, for example phosphate, carbonate, and
- EDTA ethylenediaminetetraacetic acid
- the disclosure provides a sortase A mutant comprising at least three amino acid substitutions relative to a wild- type sortase A, wherein the amino acid substitutions comprise a) a K residue at position 105 ; b) a Q or A residue at position 108 ; and c) at least one amino acid substitution selected from the group
- the wild-type sortase A comprises a S. aureus sortase A
- the wild-type sortase A comprises a protein sequence of SEQ ID NO: 1.
- the wild-type sortase A comprises a protein sequence of SEQ ID NO: 3.
- the wild- type sortase A comprises a protein of SEQ ID NO: 5.
- the mutant comprises a deletion of amino acids 2-25. In some embodiments, comprising a deletion of amino acids 2-59.
- the mutant comprises at least two amino acid substitutions selected from the group consisting of i) -vi) . In some embodiments, the mutant comprises at least three amino acid substitutions selected from the group consisting of i) -vi) . In some embodiments, the mutant comprises at least four amino acid substitu ions selected from the group consisting of i) -vi) . In some embodiments, the mutant comprises at least five amino acid substitutions selected from the group consisting of i) -vi) .
- the mutant comprises at least 60% sequence identity to amino acid residues 60-206 of the wild-type sortase A. In some embodiments, the mutant comprises at least 80% sequence identity to amino acid residues 60-206 of the wild-type sortase A. In some embodiments, the mutant comprises at least 90% sequence identity to amino acid residues 60-206 of the wild-type sortase A. In some embodiments, the mutant comprises one or more C- terminal or N-terminal tags. In some embodiments, the one or more C-terminal or N-terminal tags comprises a His6 tag.
- the mutant exhibits sortase A catalytic activity in the absence of calcium.
- the mutant exhibits sortase A catalytic activity in the absence of exogenous calcium. In some embodiments, the mutant exhibits sortase A catalytic activity in the presence of calcium-binding proteins. In some embodiments, the mutant exhibits sortase A catalytic activity in the presence of calcium concentrations up to 10 m .
- the disclosure provides a
- the disclosure provides a polynucleotide encoding a catalytically active variant or fragment of a mutant srtA described herein.
- the disclosure provides a nucleic acid construct comprising the polynucleotides described herein .
- the disclosure provides a host cell transformed with the nucleic acid constructs described herein .
- the disclosure provides a method of preparing a mutant sortase A comprising: (a) culturing the host cell of claim 22 in a suitable culture medium under suitable conditions to produce the mutant sortase A; and optionally (b) purifying the mutant sortase A to provide a mutant sortase A.
- the disclosure provides an enzyme composition comprising at least one sortase A mutant described herein. In some aspects, the disclosure provides a method comprising performing a sortase-mediated transpeptidation reaction catalyzed by the enzyme compositions described herein .
- the disclosure relates to the use of an enzyme composition described herein for the sortagging of a target protein.
- the disclosure provides a method for sortagging a target protein, comprising: (a)
- transamidate the target protein and the moiety thereby sortagging the target protein.
- the terminal oligoglycine sequence comprises 1-10 N-terminal glycine residues.
- the moiety comprises an amino acid, a peptide, a protein, a polynucleotide, a
- a tag a metal atom, a chelating agent, a contrast agent, a catalyst, a polymer, a recognition element, a small molecule, a lipid, a label, an epitope, a small molecule, a therapeutic agent, a cross- linker, a toxin, a radioisotope, an antigen, or a click chemistry handle .
- the disclosure provides a kit for sortagging a target protein comprising the enzyme composition of claim 24.
- the disclosure provides a sortase A mutant comprising an amino acid sequence at least 80% identical to SEQ ID NO : 9, and wherein the mutant comprises a) a K residue at position 47 of SEQ ID NO: 9; b) a Q or A residue at position 50 of SEQ ID NO: 9; and c) at least one amino acid residue of SEQ ID NO: 9 selected from the group consisting of i) a R residue at position 36 of SEQ ID NO : 9 ; ii) a N residue at position 102 of SEQ ID NO: 9; iii) a A residue at position 107 of SEQ ID NO: 9; iv) a E residue at position 132 of SEQ ID NO: 9; and v) a T residue at position 138 of SEQ ID NO: 9.
- the mutant comprises an amino acid sequence of SEQ ID NO: 16. In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 17. In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 18. In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 19. In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 20. In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 21.
- the mutant comprises at least two amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) . In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 15.
- the mutant comprises at least three amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) . In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 12. In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 13. In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 14.
- the mutant comprises at least four amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) . In some embodiments , the mutant comprises an amino acid sequence of SEQ ID NO: 11. In some embodiments, the mutant comprises an amino acid sequence at least 90% identical to SEQ ID NO: 9. In some embodiments, the mutant comprises an amino acid sequence at least 95% identical to SEQ ID NO: 9. In some embodiments, the mutant comprises an amino acid sequence at least 96% identical to SEQ ID NO: 9. In some
- the mutant comprises an amino acid sequence at least 97% identical to SEQ ID NO: 9. In some embodiments, the mutant comprises an amino acid sequence at least 97% identical to SEQ ID NO: 9.
- the mutant comprises an amino acid sequence at least 98% identical to SEQ ID NO: 9. In some embodiments, the mutant comprises an amino acid sequence at least 98% identical to SEQ ID NO: 9.
- the mutant comprises an amino acid sequence at least 99% identical to SEQ ID NO: 9. In some embodiments, the mutant comprises an amino acid sequence at least 99% identical to SEQ ID NO: 9.
- the mutant comprises an amino acid sequence of SEQ ID NO: 9.
- the disclosure comprises an enzyme composition comprising at least one mutant sortase A described herein.
- the at least one mutant sortase A is selected from the group consisting of SEQ ID NOs : 9-21.
- the at least one mutant sortase A is selected from the group consisting of catalytically active variants of SEQ ID NOs : 9-21.
- the at least one mutant sortase A is selected from the group consisting of catalytically active fragments of SEQ ID NOs : 9-21.
- the disclosure provides a method comprising performing a sortase-mediated transpeptidation reaction catalyzed an enzyme composition described herein .
- the disclosure relates to the use of an enzyme composition described herein for the sortagging of a target protein.
- the disclosure relates to a method for sortagging a target protein, comprising: (a) providing a target protein comprising a sortase
- the oligoglycine sequence comprises 1-10 N-terminal glycine residues.
- the moiety comprises an amino acid, a peptide, a protein, a polynucleotide, a
- a tag a metal atom, a chelating agent, a contrast agent, a catalyst, a polymer, a recognition element, a small molecule, a lipid, a label, an epitope, a small molecule, a therapeutic agent, a cross- linker, a toxin, a radioisotope, an antigen, or a click chemistry handle .
- the disclosure provides a kit for sortagging a target protein comprising an enzyme composition described herein.
- the disclosure provides a
- polynucleotide encoding a sortase A mutant comprising a nucleotide sequence at least 80% identical to SEQ ID NO. 10, wherein the nucleotide sequence encodes a) a K residue at position 47 of SEQ ID NO: 9; b) a Q or A residue at position 50 of SEQ ID NO: 9; and c) at least one amino acid residue of SEQ ID NO: 9 selected from the group consisting of i) a R residue at position 36; ii) a N residue at position 102; iii) a A residue at position 107; iv) a E residue at position 132; and v) a T residue at position 138.
- the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) . In some embodiments, the polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9 selected from the group
- polynucleotide encodes at least four amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) - v) .
- the polynucleotide comprises a nucleotide sequence at least 85% identical to SEQ ID NO: 10. In some embodiments, the polynucleotide comprises a nucleotide sequence at least 90% identical to SEQ ID NO: 10. In some embodiments, the polynucleotide comprises a nucleotide sequence at least 95% identical to SEQ ID NO: 10. In some embodiments, the polynucleotide comprises a nucleotide sequence at least 96% identical to SEQ ID NO: 10. In some embodiments, the polynucleotide comprises a nucleotide sequence at least 97% identical to SEQ ID NO: 10.
- the polynucleotide comprises a nucleotide sequence at least 98% identical to SEQ ID NO: 10. In some embodiments, the polynucleotide comprises a nucleotide sequence at least 99% identical to SEQ ID NO: 10. In some embodiments, the polynucleotide comprises a nucleotide sequence of SEQ ID NO : 10.
- the disclosure provides a nucleic acid construct comprising any of the polynucleotides described herein.
- the nucleic acid construct comprises a nucleotide sequence that encodes one or more C-terminal or N-terminal tags.
- the nucleic acid construct comprises one or more C-terminal or N-terminal tags comprises a His6 tag .
- the disclosure provides a host cell transformed with a nucleic acid construct described herein .
- the disclosure provides a method of preparing a mutant sortase A comprising: (a) culturing the host cell of claim 75 in a suitable culture medium under suitable conditions to produce the mutant sortase A; and optionally (b) purifying the mutant sortase A to provide a mutant sortase A.
- FIG. 1 is a schematic representation of a sortase- catalyzed transacylation reaction.
- FIG . 2 is a schematic representation of the site- specific C-terminal labeling scheme (left) and N-terminal labeling scheme (right) using sortase A.
- labeling begins with a substrate- recognition step (top) , and then proceeds with generation of a thioacyl intermediate (middle) , followed by ligation of an exogenously
- FIGS . 3A-3H are schematic representations of various polypeptide conjugations using sortase-mediated ligation .
- FIG . 3A is a schematic representation showing the sortagging of a molecular probe carrying an oligoglycine tag to a target pro ein having a C- erminal LPXTG- tag .
- FIG. 3B is a schematic representation showing sortase-mediated ligation of a polypeptide to a nucleic acid .
- FIG. 3C is a schematic representation of a sortase- mediated ligation being used to produce a neoglyconjugate by fusing a polypeptide comprising the LPXTG sortase recognition motif to amino-methylene groups in 6- aminohexoses .
- FIG. 3D is a schematic representation of the immobilization of an LPXTG- tagged protein onto an oligoglycine-coated solid surface, for example,
- FIG. 3E is a schematic representation of a cell surface protein genetically engineered to include an extracellular C-terminal region expressing a C-terminal LPXTG motif, which has been labeled with a triglycine- tagged probe by sortagging.
- FIG. 3F is a schematic illustration of the
- dimerization/oligomerization of a protein by sortagging of a bifunctional protein possessing an N-terminal oligoglycine tag and a C-terminal LPXTG tag.
- FIG . 3G is a schematic representation of the circularization of a bifunctional protein containing an N-terminal oligoglycine tag and a C-terminal LPXTG tag utilizing sortase-mediated ligation.
- FIG . 3H is a schematic representation of the site- specific attachment of a lipid utilizing sortase-mediated transpeptidation .
- FIGS . 4A-4U show exemplary sequences contemplated by the present disclosure . Mutations relative to wild- type sortase A are shown in red font . Portions of sequences that are absent in an exemplary wild-type S. aureus sortase A sequence and/or contain a 6XHis tag are highlighted in yellow .
- FIG. 4A is an exemplary wild type S. aureus Sortase A (SaSrtA) amino acid sequence.
- FIG. 4B is an exemplary nucleotide sequence encoding the amino acid sequence shown in FIG. 4A.
- FIG. 4C is an exemplary wild type S. aureus Sortase A (SaSrtA) amino acid sequence.
- FIG. 4D is an exemplary nucleotide sequence encoding the amino acid sequence shown in FIG. 4C.
- FIG. 4E is an exemplary wild type sortase delta 25 amino acid sequence.
- FIG. 4F is an exemplary nucleotide sequence encoding the amino acid sequence shown in FIG. 4E.
- FIG. 4G is an exemplary mutant srtA
- FIG. 4H is an exemplary nucleotide sequence encoding the amino acid sequence shown in FIG. 4G.
- FIG. 41 is an exemplary calcium- independent mutant srtA P94R/D160N/D165A/K190E/K196T amino acid sequence.
- FIG. 4J is an exemplary nucleotide sequence encoding the amino acid sequence shown in FIG . 41.
- FIG . 4K is an exem lary calcium- independent srtA P94S/D160N/D165A/K196T amino acid sequence.
- FIG. 4L is an exemplary calcium-independent srtA
- FIG. 4M is an exemplary calcium- independent srtA P94S/D160N/K196T amino acid sequence.
- FIG. 4N is an exemplary calcium- independent srtA P94S/D160N/D165A amino acid sequence.
- FIG . 40 is an. exemplary calcium-independent srtA P94S/D165A amino acid sequence .
- FIG. 4P is an exemplary calcium-independent srtA P94S amino acid sequence.
- FIG. 4Q is an exemplary calcium-independent srtA P94R amino acid sequence.
- FIG. 4R is an exemplary calcium- independent srtA
- FIG. 4S is an exemplary calcium- independent srtA D165A amino acid sequence.
- FIG. 4T is an exemplary calcium- independent srtA K190E amino acid sequence.
- FIG. 4U is an exemplary calcium- independent srtA K196T amino acid sequence.
- FIGS. 5A and 5B show exemplary alignments of various srtA sequences contemplated by the present disclosure.
- FIG. 5A shows an alignment of SEQ ID NO: 1 and SEQ
- SEQ ID NO: 9 indicating that SEQ ID NO: 9 is at least 65% identical to SEQ ID NO : 1.
- FIG. 5B shows an alignment of SEQ ID NO: 7 and SEQ ID NO: 9, indicating that SEQ ID NO: 9 is at least 98% identical to SEQ ID NO: 7.
- FIG. 6 shows a comparison of activities of various sortases at 0 °C, in the presence and absence of calcium.
- 30uM of a substrate (VHH84D4) containing an LPETGG- 6xHis C-terminal tag was incubated in 50mM Tris, pH 7.5, 150mM NaCl , 500uM GGG-TAMRA and lOmM CaC12 or lOmM EGTA with either 5uM (1) WT SrtA delta 25, (2) 5uM pentamutant SrtA or (3) heptamutant SrtA for up to 6 hours at 0°C.
- the reaction was then visualized by running a sample on a 12% Tris-glycine SDS PAGE gel.
- FIGS. 7A - 7C are chromatograms demonstrating that the calcium-independent srtA mutant of SEQ ID NO: 9 exists predominantly in monomeric form (FIG. 7A) compared to the calcium-dependent srtA mutant of SEQ ID NO: 7 which exists in both dimeric and monomeric form (FIG. 7B) , and the calcium-dependent wild-type srtA which also exists in both dimeric and monomeric form (FIG. 7C) .
- FIG. 8 is a photograph of a gel demonstrating the high purity of monomeric sortase.
- FIG. 9 is a chromatogram demonstrating the stability of monomeric sortase. DETAILED DESCRIPTION OF THE INVENTION
- aspects of the disclosure relate to sortase mutants, and in particular to sortase A mutants that exhibit an indifference to calcium. Work described herein
- sortase A mutants described herein surprisingly and unexpectedly exhibit sortase A catalytic activity (e.g., increased catalytic activity compared to wild type sortase A, ) , and do so in a manner that is independent of the presence or concentration of calcium.
- Sortases, sortase-mediated transacylation reactions, and their use in transacylation (sometimes also referred to as transpeptidation) for protein engineering are well known to those of skill in the art (see, e.g., Ploegh et al., International Patent Application PCT/US2010/000274 , and Ploegh et al . , International Patent Application
- FIG. 1 shows an exemplary transpeptidation reaction catalyzed by sortase, which results in the ligation of species containing a sortase recognition motif (e.g., an LPXT motif or an LPXTG motif) with those bearing an N-terminal sortase recognition motif (e.g., an LPXT motif or an LPXTG motif) with those bearing an N-terminal sortase recognition motif (e.g., an LPXT motif or an LPXTG motif) with those bearing an N-terminal sortase recognition motif (e.g., an LPXT motif or an LPXTG motif) with those bearing an N-terminal sortase recognition motif (e.g., an LPXT motif or an LPXTG motif) with those bearing an N-terminal sortase recognition motif (e.g., an LPXT motif or an LPXTG motif) with those bearing an N-terminal sortase recognition motif (e.g., an LP
- sortases can be used for both C- terminal (left) and N-terminal (right) site-specific labeling.
- the sortase transacylation reaction allows for the facile installation of all kinds of substituents at the C- terminus or N-terminus of a suitably modified protein.
- the sortase reaction can be employed for ligating polypeptides to one another, ligating synthetic peptides to recombinant proteins, linking a reporting molecule to a polypeptide, linking a polypeptide to a label or probe (FIG. 3A) , joining a nucleic acid to a polypeptide (FIG. 3B) , ligating a glycan to a polypeptide (FIG. 3C) , conjugating a polypeptide to a solid support or polymer (FIG. 3D) , site- specific modification of the extracellular C-terminal region of cell surface proteins expressed in living cells (FIG. 3E) , site- specific modification of the extracellular N-terminal region of cell surface proteins expressed in living cells ,
- sortase A derives its ability to site- specifically modify target proteins by recognition of sortase recognition motifs, such as the motif LPXTG.
- sortase recognition motifs such as the motif LPXTG.
- Other suitable sortase recognition motifs are apparent to the skilled artisan. It will be
- a recognition sequence with respect to sequences recognized by sortase, are used interchangeably.
- a recognition sequence further comprises one or more additional amino acids, e.g., at the N or C terminus. Such additional amino acids may provide context that improves the recognition of the recognition motif.
- sortase recognition sequence may refer to a masked or unmasked sortase recognition sequence.
- a calcium-independent mutant sortase has a different substrate specificity with regard to the sortase recognition motif as compared to a calcium-dependent wild type sortase.
- Sortases with different substrate specificities with regard to the sortase recognition motif recognize different sortase recognition sequences. For example, Sortases 1 and 2 are said to have different substrate specificities if SRM1 and SRM2 are different sortase recognition motifs and Sortase 1 recognizes SRM1 and Sortase 2 is active in recognizing SRM2 but has little or no activity
- the substrate specificities overlap in that one of the two sortases recognizes both SRM1 and SR 2 while the other sortase recognizes only one of the SRMs . In some embodiments the substrate specificities do not overlap, e.g., Sortase 1 recognizes SRM1 but does not recognize SRM2 , and Sortase 2 recognizes SRM2 but does not recognize SRM1. In some embodiments, two calcium- independent sortases may alternately or additionally utilize different
- nucleophiles are nucleophiles . In some embodiments the nucleophile specificity overlaps while in some embodiments the nucleophile specificity does not overlap.
- a sortase with an altered substrate specificity with regard to the sortase recognition motif may be generated by engineering one or more mutations in the sortase, e.g., in a region of the protein that is involved in recognition and/or binding of the sortase recognition motif, e.g., the putative substrate recognition loop (e.g., the loop connecting strands ⁇ 6 and ⁇ 7 ( ⁇ 6/ ⁇ 7 loop) in SrtA (Val 161 -Asp 17S ) .
- the putative substrate recognition loop e.g., the loop connecting strands ⁇ 6 and ⁇ 7 ( ⁇ 6/ ⁇ 7 loop) in SrtA (Val 161 -Asp 17S ) .
- a phage- display, yeast display, or other screen of a mutant sortase library randomized in the substrate recognition loop may be performed, and variants with altered
- a sortase with an altered nucleophile specificity may be generated by engineering one or more mutations in the sortase, e.g., in a region of the protein that is involved in
- a phage-display screen of a mutant sortase library randomized in the substrate recognition loop may be performed, and variants with altered substrate specificity may be identified.
- a calcium- independent sortase described herein is modified to alter its substrate specificity with regard to the sortase recognition motif and/or its nucleophile specificity by introducing one or more mutations into the sortase.
- a calcium-dependent sortase is modified to alter its substrate specificity and/or nucleophile specificity and is rendered calcium- independent by introducing the mutations described herein. It will be appreciated that mutations conferring calcium- independence and altered substrate and/or nucleophile specificity may be engineered individually or sequentially in groups of one or more, in any order or combination that results in a desired sequence. In some embodiments two calcium- independent sortases with different substrate
- nucleophile specificities and/or different nucleophile specificities are derived from the same wild type sortase, e.g., S. aureus SrtA.
- the substrate e.g., S. aureus SrtA.
- Sortases with different substrate specificities may be used, for example, to introduce two different moieties to a target protein.
- a target protein may comprise or be modified to comprise first and second SRMs.
- Two sortases each of which specifically recognizes only one of the two SRMs, may be used to conjugate two agents to the protein by reaction with the two SRMs, e.g., at the N- and C- termini .
- the conjugations may take place in a single reaction vessel or may be performed sequentially.
- the two agents may be any of the agents described herein, and may be the same or different.
- a first agent may comprise a toxin and a second agent may comprise a detectable label , e.g., for imaging.
- the disclosure provides a kit comprising two calcium- independent sortases with
- nucleophile specificity is derived from the same wild type sortase , e.g., S . aureus SrtA.
- substrate specificities e.g., S . aureus SrtA.
- substrate specificities e.g., S . aureus SrtA.
- amino acid sequences of sortase A polypeptides and the nucleotide sequences that encode them are known to those of skill in the art and are disclosed in a number of references cited herein, the entire contents of all of which are incorporated herein by reference.
- polypeptide refers to a molecule comprising at least two covalently attached amino acids.
- a polypeptide can be made up of naturally occurring amino acids and peptide bonds and/or synthetic peptidomimetic residues and/or bonds.
- Polypeptides described herein include naturally purified products, products of chemical synthetic procedures, and products produced by recombinant techniques from a prokaryotic or eukaryotic host, including, for example, bacterial (e.g., E. coli) , yeast, higher plant, insect and mammalian cells.
- the disclosure also features biologically active fragments of calcium- independent srtA mutants.
- amino acid refers to naturally occurring and synthetic amino acids, as well as amino acid analogs and amino acid mimetics that function in a manner similar to the naturally occurring amino acids.
- Naturally occurring amino acids are those encoded by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline .
- Amino acid analogs are compounds that have the same basic chemical structure as a naturally occurring amino acid, i.e., a carbon bound to hydrogen, a carboxyl group, an amino group, and an R group, e.g., norleucine. Such analogs have modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical structure as a naturally occurring amino acid .
- Amino acid mimetics refers to chemical compounds that have a structure that is different from the general chemical structure of an amino acid, but that function in a manner similar to a naturally occurring amino acid.
- Sortase A is a polypeptide having a length of 206 amino acids which typically comprises a hydrophobic N- terminal domain (e.g., residues 1 to about 25) which functions as both a signal peptide and a membrane anchoring domain, a central linker domain (e.g., from about residue 26 to about residue 59) , and a C-terminal catalytic domain (e.g., from about residue 60 to about residue 206) .
- a hydrophobic N- terminal domain e.g., residues 1 to about 25
- a central linker domain e.g., from about residue 26 to about residue 59
- C-terminal catalytic domain e.g., from about residue 60 to about residue 206 .
- FIG. 4A shows an exemplary nucleotide sequence (SEQ ID NO: 2; NCBI
- NC_002745.2 encoding the wild type SaSrtA protein.
- FIG. 4C Another exemplary sequence of wild- type S. aureus SrtA is shown in FIG. 4C (SEQ ID NO: 3; GenBank Accession number AAD48437 ; NCBI Reference Sequence:
- wild-type S. aureus SrtA sequences, calcium- independent srtA mutants, and catalytically active fragments or variants thereof disclosed herein may comprise either a K or N at position 57. It should be appreciated that wild-type S. aureus SrtA sequences, calcium- independent srtA mutants, and catalytically active fragments or variants thereof disclosed herein may comprise either an E or G at position 167.
- aspects of the present disclosure relate to calcium- independent sortase A mutants. Such mutants may be produced through processes such as directed evolution, site-specific modification, etc . It should be
- the calcium- independent srtA mutants disclosed herein can be used in any application in which sortagging is desirable, including for example, the sortase -mediated ligation reactions described in FIGS. 3A-3I .
- the calcium- independent srtA mutants comprise at least one amino acid substitution relative to a wild-type sortase A polypeptide, and catalytically active fragments, catalytically active derivatives, or catalytically active variants thereof, and polynucleotides encoding the same.
- calcium- independent refers to the ability of a sortase A enzyme to exhibit catalytic activity in a manner that is independent of the presence of calcium, or independent of the amount of calcium present within a concentration range that is not
- the calcium- independent srtA mutants disclosed herein can be assayed for their ability to exhibit sortase A catalytic activity in a calcium- independent manner by contacting a target protein comprising a C-terminal sortase
- the calcium- independent srtA mutants can be assayed for their ability to exhibit sortase A catalytic activity in the presence of calcium
- calcium-dependent sortase A concentrations which are required for calcium-dependent sortase A to exhibit catalytic activity.
- "calcium-dependent" in connection with a sortase means that the catalytic activity of the sortase relies or depends on the presence and concentration of calcium, such that in the absence of calcium or the absence of a sufficient amount of calcium, the calcium-dependent sortase will not exhibit sortase A catalytic activity or has greatly reduced catalytic activity as compared with its activity when calcium is present in sufficient amounts (e.g., 5 mM - 10 mM) .
- the present disclosure contemplates any srtA variants which exhibit sortase A catalytic activity comparable to the wild-type srtA enzyme, as long as they possess calcium- independent sortase A catalytic activity.
- fragments As used herein in connection with “fragments” ,
- sortase A catalytic activity refers to the ability of a sortase to catalyze the cleavage of a polypeptide within a sortase A consensus recognition sequence and ligate the free primary amino group (NH 2 -CH 2 -) of a oligoglycine sequence to the free C-terminal carboxyl group of the cleaved polypeptide.
- oligoglycine refers to a (Gly) n s equence, wherein n is between 1 and about 10, or more preferably between 2 and about 5, and even more preferably 2 or 3 , glycine residues.
- An "N-terminal" oligoglycine sequence is located at the N-terminus of a polypeptide , such that the polypeptide comprises a free primary amino (NH 2 personallyCH 2 _) group at its N- erminus .
- N- terminal oligoglycine sequence can also include an internal oligoglycine sequence that is capable of forming a oligoglycine sequence under applicable conditions , e.g., by cleavage of an N-terminal peptide sequence by an endogenous host cell enzyme, or by specific proteolytic cleavage in vitro .
- Sortase A catalytic activity of the calcium- independent srtA mutants disclosed herein can be assayed using methods known in the art. The crystal structure of SrtA complexed with a substrate has been determined allowing catalytic active domains of sortase A proteins from various Gram-positive bacterium to be easily discerned by those of skill in the art, see for example, Y. Zong et al . J. Biol Chem. 2004, 279, 31383-31389, which is incorporated herein by reference.
- polynucleotide has one or more alterations (e.g., additions, substitutions, and/or deletions) with respect to a reference polypeptide or polynucleotide, which may be referred to as the "original polypeptide” or “original polynucleotide", respectively.
- An addition may be an insertion or may be at either terminus.
- a variant may be shorter or longer than the reference polypeptide or polynucleotide.
- variant encompasses
- fragments are continuous portions of a polypeptide or polynucleotide that is shorter than the reference polypeptide or polynucleotide.
- a variant comprises or consists of a
- a fragment or variant is at least 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 92.5%, 95%, 96%, 97%, 98%, 99%, or more as long as the reference polypeptide or polynucleotide.
- a fragment may lack an N-terminal and/or C-terminal portion of a reference polypeptide.
- a fragment may lack up to 5%, 10%, 15%, 20%, or 25% of the length of the polypeptide from either or both ends.
- a fragment may be an N-terminal, C-terminal, or internal fragment.
- a variant polypeptide comprises or consists of at least one domain of a reference polypeptide.
- a variant polynucleotide hybridizes to a reference polynucleotide under art-recognized stringent conditions, e.g., high stringency conditions, for sequences of the length of the reference polypeptide.
- a variant polypeptide or polynucleotide comprises or consists of a polypeptide or polynucleotide that is at least 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or more identical in sequence to the reference polypeptide or polynucleotide over at least 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% of the reference polypeptide or polynucleotide.
- a variant polypeptide comprises or consists of a polypeptide that is at least 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or more identical in sequence to the reference polypeptide over at least 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% of the reference polypeptide, with the proviso that, for purposes of computing percent identity, a conservative amino acid substitution is considered identical to the amino acid it replaces .
- a variant polypeptide comprises or consists of a polypeptide that is at least 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or more identical to the reference polypeptide over at least 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% of the reference polypeptide, with the proviso that any one or more amino acid substitutions (up to the total number of such substitutions) may be restricted to conservative substitutions.
- a percent identity is measured over at least 100; 200; 300; 400; 500; 600; 700; 800; 900; 1,000; 1,200; 1,500; 2,000; 2,500; 3,000;
- sequence of a variant polypeptide comprises or consists of a sequence that has N amino acid differences with respect to a reference sequence, wherein N is any integer between 1 and 10 or between 1 and 20 or any integer up to 1%, 2%, 5%, or 10% of the number of amino acids in the reference polypeptide, where an "amino acid difference" refers to a substitution, insertion, or deletion of an amino acid.
- a difference is a conservative substitution. Conservative substitutions may be made, e.g., on the basis of
- a variant is a functional variant, i.e., the variant at least in part retains at least one activity (e.g., calcium- independent sortase A catalytic activity) of the reference polypeptide or polynucleotide. In some embodiments a variant at least in part retains more than one or substantially all known activities of the reference polypeptide or
- an activity may be, e.g., a catalytic activity, binding activity, ability to perform or participate in a biological function or process, etc.
- an activity is one that has (or the lack of which has) a detectable effect on an observable phenotype of a cell or organism.
- an activity of a variant may be at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, or more, of the activity of the reference polypeptide or polynucleotide , up to approximately 100%, approximately 125%, or approximately 150% of the activity of the reference polypeptide or polynucleotide, in various embodiments .
- a variant e.g., a functional variant, comprises or consists of a polypeptide at least 80%, 90%, 92.5%, 95%, 96%, 97%, 98%, 99%. 99.5% or 100% identical to an reference polypeptide or polynucleotide over at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or 100% of the full length of the reference polypeptide or polynucleotide or over at least 70%, 75%, 80%, 85%, 90%, 92.5%, 95%, 96%, 97%, 98%, or 99% or 100% of a functional fragment of the reference polypeptide or polynucleotide.
- an alteration e.g., a substitution or deletion, e.g., in a functional variant, does not alter or delete an amino acid or nucleotide that is known or predicted to be important for an activity, e.g., a known or predicted catalytic residue or residue involved in binding a substrate or cofactor.
- nucleotide (s) amino acid(s), or region (s) exhibiting lower degrees of conservation across species as compared with other amino acids or regions may be selected for alteration.
- Variants may be tested in one or more suitable assays to assess activity.
- a polypeptide or polynucleotide sequence in the NCBI RefSeq database may be used as a reference sequence.
- polynucleotide is a naturally occurring variant or fragment. In some embodiments a variant or fragment of a naturally occurring polypeptide or polynucleotide is not naturally occurring.
- the calcium-independent srtA mutant is selected according to the degree of sequence homology with a wild-type sortase A enzyme. In some embodiments , the calcium- independent srtA mutant is selected according to the degree of sequence homology with S. aureus sortase A. In some embodiments, the calcium- independent srtA mutant is selected according to the degree of sequence homology with SEQ ID NO: 1 (FIG. 4A) . In some embodiments, the calcium- independent srtA mutant is selected according to the degree of sequence homology with SEQ ID NO: 3 (FIG. 4C) . In some
- the calcium- independent srtA mutant is selected according to the degree of sequence homology with SEQ ID NO: 5 (FIG. 4E) . In some embodiments, the calcium- independent srtA mutant is selected according to the degree of sequence homology with SEQ ID NO: 7 (FIG. 4G) .
- Calcium- independent srtA mutants having a desired degree of homology to a wild-type sortase A enzyme can be identified by, e.g., using the wild-type sortase A nucleotide sequences as query sequences in a search against public databases to identify related sequences.
- the calcium- independent srtA mutant comprises an amino acid sequence homologous to amino acids 60-206 of SEQ ID NO: 1, e.g. an amino acid sequence that is at least 55%, at least 60%, at least
- the calcium- independent srtA mutant comprises an amino acid sequence homologous to amino acids 60-206 of SEQ ID NO: 3, e.g. an amino acid sequence that is at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% or higher, homologous thereto.
- a nucleotide sequence encoding a calcium- independent srtA mutant has at least 25%, or preferably at least 30%, or more preferably at least 35% or more identity with the nucleic acid sequence of SEQ ID NO : 2.
- a nucleotide sequence encoding a calcium- independent srtA mutant has at least 25%, or preferably at least 30%, or more preferably at least 35% or more identity with the nucleic acid sequence of SEQ ID NO: 4.
- the calcium- independent srtA mutant has at least 35%, or preferably at least 40%, or more preferably at least 45% similarity with the amino acid sequence of SEQ ID NO: 1. In further embodiments, the calcium- independent srtA mutant has at least 35%, or preferably at least 40%, or more preferably at least 45% similarity with the amino acid sequence of SEQ ID NO: 3.
- the calcium-independent srtA mutant is a variant of SrtA of another Gram-positive bacterium having one or more as substitutions, deletions, insertions , and/or other modifications relative to the native nucleotide and/or amino acid sequence of S . aureus srtA.
- the variant comprises one or more conservative amino acid substitutions relative to SrtA of another Gram-positive bacterium.
- the variant comprises one or more amino acid substitutions relative to SrtA of another Gram-positive bacterium, wherein the one or more as amino acid
- substitutions are predominantly, e.g., at least 50% , or preferably at least 60%, or more preferably at least 70% or more , conservative substitutions .
- fragments of calcium- independent srtA mutants disclosed herein exhibiting calcium- independent sortase A catalytic activity are contemplated herein, and can be utilized in the methods described herein.
- fragments can be identified by producing transaminase fragments by known recombinant techniques or proteolytic techniques , for example , and determining the rate of protein or peptide ligation.
- the fragment sometimes consists of about 80% of the full-length transamidase amino acid sequence, and sometimes about 70%, about 60%, about 50%, about 40% or about 30% of the full-length transamidase amino acid sequence such as that of a wild- type S. aureus Sortase A.
- the fragment lacks an N-terminal portion of the full-length sequence, e.g., the fragment lacks the N-terminal portion extending to the end of the membrane anchor sequence.
- the fragment comprises the C-terminus of a full-length transamidase amino acid sequence.
- a catalytic core region from a sortase is utilized, e.g., a region from about position 60 to about position 206 of SrtA, e.g., S. aureus SrtA.
- the fragment comprises a sortase A lacking N-terminal amino acids 2-25 (SEQ ID NO: 5) (FIG. 4E) .
- An exemplary nucleotide sequence i.e., SEQ ID NO: 6) encoding such fragment is shown in FIG. 4F.
- Calcium-independent srtA mutants can be derived from sortase A sequences from other organisms.
- the disclosure provides mutants of any Ca ++ - dependent SrtA from a species other than S. aureus, wherein the mutants comprise any of the mutations or combinations of mutations described herein at the corresponding positions in the Ca' " -dependent SrtA from a species other than S. aureus.
- a calcium- independent srtA mutant is derived from sortase A sequences from other organisms comprising nucleotide sequences substantially identical or similar to the nucleotide sequences that encode Srt A.
- a similar or substantially identical nucleotide sequence may include modifications to the native sequence, such as
- nucleotide sequences that sometimes are about at least 20%, 30%, 40%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, or 85% or more identical to a native nucleotide sequence, and sometimes are about 90% or 95% or more identical to the native nucleotide sequence (each identity percentage can include a 1%, 2%, 3% or 4% variance) .
- One test for determining whether two nucleic acids are substantially identical is to determine the percentage of identical nucleotide sequences shared between the nucleic acids.
- sequence identity can be performed by a variety of techniques which are available to the skilled artisan. For example, computer programs such as BLAST2, BLASTN, BLASTP, Gapped BLAST, etc., may be used to generate alignments and/or to obtain a percent identity.
- computer programs such as BLAST2, BLASTN, BLASTP, Gapped BLAST, etc., may be used to generate alignments and/or to obtain a percent identity.
- a calcium- independent srtA mutant comprises at least three amino acid substitutions relative to a wild-type sortase A, wherein the amino acid substitutions comprise a) a K residue at position 105; b) a Q or A residue at position 108; and c) at least one amino acid substitution selected from the group
- mutant is used to refer to a one amino acid sequence which has changed by at least one amino acid residue relative to another amino acid sequence.
- the term "mutant” with reference to “sortases” should not be considered to imply that any particular way of generating the mutant sequences is required or that any particular starting materials is required .
- the present disclosure contemplates any suitable method of generating the calcium-independent srtA mutants described herein.
- suitable methods include, but are not limited to introducing mutations into an appropriate wild-type coding sequence, synthesizing the sequences of the calcium-independent srtA mutants de novo, for example, utilizing solid phase peptide synthesis, and in vitro translation a synthetic mRNA, to name only a few. It is to be further understood that the disclosure contemplates calcium- independent mutants of any wild-type sortase A. Those skilled in the art will appreciate that the wild- type sequences of sortase A may vary, e.g., SrtA from various species may have gaps, insertions, and/or vary in length relative to the amino acid sequence of exemplary wild-type S. aureus SrtA.
- the disclosure is not intended to be limited in any way by the original amino acid residue at a particular position in any wild- type sortase A sequence used to generate a calcium- independent srtA mutant.
- any substitution which results in the specified amino acid residue at a position specified herein is contemplated by the disclosure.
- the phrase "X residue at position Y" means that the X residue in the resulting mutant srtA replaces whatever amino acid was present in the original sortase A amino sequence at the position Y in the original sorta.se A amino acid sequence that corresponds to the same position in an exemplary wild-type S.
- aureus srtA amino acid sequence when accounting for any gaps and/or insertions in the original sortase A amino acid sequence relative to the exemplary wild-type S. aureus srtA amino acid sequence.
- the following examples are instructive and are not intended to be limiting in any way . If the original wild-type sortase A used to generate a srtA mutant is the wild-type S. pyogenes srtA (NCBI Gene ID: 901269), "position 94" in the phrase "an amino acid substitution comprising a R residue at position 94" would mean position 115 of the wild-type S. pyogenes srtA as position 115 of the wild-type S .
- pyogenes srtA corresponds to position 94 in the exemplary wild-type S. aureus srtA sequence (NCBI Gene ID: 3238307) when taking into account gaps and/or insertions in the sequence alignment of S. pyogenes srtA and S. aureus srtA.
- N residue at position 115 of the wild- type S. pyogenes srtA amino acid sequence would be replaced by a R residue in the resulting mutant srtA.
- position 94 in the phrase "an amino acid substitution comprising a R residue at position 94" would mean position 123 of the wild-type B .
- anthracis srtA as position 123 of the wild-type B .
- anthracis srtA corresponds to position 94 in the exemplary wild-type S. aureus srtA sequence (NCBI Gene ID: 3238307) when taking into account gaps and/or insertions .in the sequence alignment of B . anthracis srtA and S . aureus srtA.
- the E residue at position 123 of the wild- type B . anthracis srtA amino acid sequence would be replaced by a R residue in the resulting mutant srtA.
- position 94 in the phrase "an amino acid substitution comprising a R residue at position 94" would mean position 112 of the wild- type E. faecalis srtA as position 112 of the wild- type E. faecalis srtA corresponds to position 94 in the exemplary wild- type S . aureus srtA sequence (Gene ID: 3238307) when taking into account gaps and/or insertions in the alignment of E. faecalis srtA and S . aureus srtA.
- the N residue at position 112 of the wild-type E. faecalis srtA amino acid sequence would be replaced by a R residue in the resulting mutant srtA.
- any original wild-type sortase A sequence to be used for generating a calcium- independent srtA mutant with an exemplary wild-type S. aureus sortase A sequence for purposes of determining the positions in the original wild-type sortase A sequence that correspond to the exemplary wild-type S . aureus sortase A sequence when taking into account gaps and/or insertions in the alignment of the two sequences.
- the substitution comprises a E105K substitution.
- E105K substitution in reference to mutating a particular original sortase A sequence (e.g., a wild type sortase A sequence) refers to the substitution of a K residue at a position in the original sortase A amino acid sequence that corresponds to the E105 residue in the corresponding exemplary wild- type S. aureus srtA amino acid sequence .
- the original sortase A sequence to be mutated in accordance with the disclosure is a L. monocytogenes srtA sequence (NCBI Gene ID:
- the phrase "E105K substitution” refers to the substitution of the R residue at position 112 of the L. monocytogenes srtA sequence wi h a K residue as position 112 of L. monocytogenes srtA corresponds to position 105 of the exem lary wild-type S. aureus srtA sequence (NCBI Gene ID : 3238307) when the two sequences are aligned taking into account any gaps and/or insertions.
- the substitution comprises a E108Q
- substitution comprises an E108A substitution. In some embodiments, the substitution comprises a P94R substitution. In some embodiments, the substitution comprises a P94S
- substitution comprises a D160N substitution. In some embodiments, the substitution comprises a D165A substitution. In some embodiments, the substitution comprises a K190E
- the substitution comprises a K196T substitution.
- the wild-type sortase A comprises S. aureus sortase A. In some embodiments, the wild-type sortase A comprises an amino acid sequence of
- the wild-type sortase A comprises an amino acid sequence of SEQ ID NO: 3.
- the calcium- independent srtA mutant comprises a deletion of amino acids 2-25. In some embodiments, the calcium- independent srtA mutant
- the calcium- independent srtA mutant comprises at least two amino acid substitutions selected from the group consisting of i) -vi) .
- the calcium-independent srtA mutant comprises at least two amino acid substitutions comprising i) and iii) . In some embodiments, the calcium-independent srtA mutant comprises at least two amino acid substitutions comprising i) and iv) . In some embodiments , the calcium-independent srtA mutant
- the calcium- independent srtA mutant comprises at least two amino acid substitutions comprising i) and vi) .
- the calcium-independent srtA mutant comprises at least two amino acid substitutions comprising ii) and iii) . In some embodiments, the calcium- independent srtA mutant comprises at least two amino acid substitutions comprising ii) and iv) . In some embodiments, the calcium- independent srtA mutant
- the calcium- independent srtA mutant comprises at least two amino acid substitutions comprising ii) and vi) .
- the calcium- independent srtA mutant comprises at least two amino acid substitutions comprising iii) and iv) . In some embodiments, the calcium- independent srtA mutant comprises at least two amino acid substitutions comprising iii) and v) . In some embodiments, the calcium- independent srtA mutant
- the calcium- independent srtA mutant comprises at least two amino acid substitutions comprising iv) and v) . In some embodiments, the calcium- independent srtA mutant comprises at least two amino acid substi utions comprising iv) and vi) .
- the calcium- independent srtA mutant comprises at least two amino acid substitutions comprising v) and vi) .
- the calcium- independent srtA mutant comprises at least three amino acid substitutions selected from the group consisting of i) -vi) .
- the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising i) , iii) and iv) . In some embodiments, the calcium-independent srtA mutant comprises at least three amino acid substitutions comprising i) , iii) and v) . In some embodiments, the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising i) , iii) and vi) .
- the calcium-independent srtA mutant comprises at least three amino acid substitutions comprising i) , iv) and v) . In some embodiments, the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising i) , iv) and vi) .
- the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising i) , v) and vi) .
- the calcium-independent srtA mutant comprises at least three amino acid substitutions comprising ii) , iii) and iv) . In some embodiments, the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising ii) , iii) and v) . In some embodiments, the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising ii) , iii) and vi) .
- the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising ii) , iv) and v) . In some embodiments, the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising ii) , iv) and vi) .
- the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising ii) , v) and vi) .
- the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising iii) , iv) and v) . In some embodiments , the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising iii), v) and vi) .
- the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising iv) , v) and vi) .
- the calcium- independent srtA mutant comprises at least four amino acid substitutions selected from the group consisting of i) -vi) .
- the calcium- independent srtA mutant comprises at least four amino acid substitutions comprising i) , iii), iv) and v) . In some embodiments, the calcium- independent srtA mutant comprises at least four amino acid substitutions comprising i) , iii) , v) and vi) .
- the calcium- independent srtA mutant comprises at least four amino acid substitutions comprising ii) , iii) , iv) and v) . In some embodiments, the calcium- independent srtA mutant comprises at least four amino acid substitutions comprising ii) , iii) , v) and vi) .
- the calcium- independent srtA mutant comprises at least four amino acid substitutions comprising iii) , iv) , v) and vi) .
- the calcium- independent srtA mutant comprises at least five amino acid substitutions selected from the group consisting of i) -vi) .
- the calcium- independent srtA mutant comprises at least five amino acid substitutions comprising i) , iii) , iv) , v) and vi) .
- the calcium- independent srtA mutant comprises at least five amino acid substitutions comprising ii), iii), iv), v) and vi) .
- the calcium- independent srtA mutants comprises at least 60% identity to SEQ ID NO: 1, or at least 65% identity to SEQ ID NO: 1, or at least 70% identity to SEQ ID NO: 1, or at least 75% identity to SEQ ID NO: 1, or at least 80% identity to SEQ ID NO : 1, or at least 85% identity to SEQ ID NO: 1, or at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or up to 99% identity to SEQ ID NO: 1.
- the calcium- independent srtA mutants comprises at least 60% identity to amino acid residues 60-206 of SEQ ID NO: 1, or at least 65% identity to amino acid residues 60-206 of SEQ ID NO: 1, or at least 70% identity to amino acid residues 60-206 of SEQ ID NO: 1, or at least 75% identity to amino acid residues 60-206 of SEQ ID NO: 1, or at least 80% identity to amino acid residues 60-206 of SEQ ID NO: 1, or at least 85% identity to amino acid residues 60-206 of SEQ ID NO: 1, or at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or up to 99% identity to amino acid residues 60-206 SEQ ID NO: 1.
- the calcium- independent srtA mutants comprises at least 60% identity to SEQ ID NO: 3, or at least 65% identity to SEQ ID NO: 3, or at least 70% identity to SEQ ID NO: 3, or at least 75% identity to SEQ ID NO: 3, or at least 80% identity to SEQ ID NO : 3, or at least 85% identity to SEQ ID NO: 3, or at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or up to 99% identity to SEQ ID NO: 3.
- the calcium- independent srtA mutants comprises at least 60% identity to amino acid residues 60-206 of SEQ ID NO: 3, or at least 65% identity to amino acid residues 60-206 of SEQ ID NO: 3, or at least 70% identity to amino acid residues 60-206 of SEQ ID NO : 3, or at least 75% identity to amino acid residues 60-206 of SEQ ID NO: 3, or at least 80% identity to amino acid residues 60-206 of SEQ ID NO: 3, or at least 85% identity to amino acid residues 60-206 of SEQ ID NO: 3, or at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or up to 99% identity to amino acid residues 60-206 SEQ ID NO: 3.
- the calcium-independent srtA mutants comprises at least 60% identity to SEQ ID NO : 5, or at least 65% identity to SEQ ID NO: 5, or at least 70% identity to SEQ ID NO: 5, or at least 75% identity to SEQ ID NO: 5, or at least 80% identity to SEQ ID NO: 5, or at least 85% identity to SEQ ID NO: 5, or at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or up to 99% identity to SEQ ID NO: 5.
- any of the calcium- independent srtA mutants disclosed herein can include a tag.
- a calcium-independent srtA mutant comprises a C-terminal tag.
- a calcium- independent srtA mutant comprises a N-terminal tag.
- the calcium- independent srtA mutant comprises a N-terminal His6 tag.
- the calcium- independent srtA mutants exhibit sortase A catalytic activity in the absence of calcium. In some embodiments, the calcium- independent srtA mutants exhibit sortase A catalytic activity in the absence of exogenous calcium.
- the calcium- independent srtA mutants exhibit sortase A catalytic activity in the presence of calcium-binding proteins.
- the calcium- independent srtA mutants exhibit sortase A catalytic activity in the presence of calcium concentrations up to 1 mM, up to 2 mM, up to 3 mM, up to 4 mM, up to 5 mM, up to 6 mM, up to 7 mM, up to 8 mM, up to 9 mM, or up to 10 mM or more.
- the calcium-independent srtA mutants exhibit sortase A catalytic activity in the presence of calcium concentrations less than 1 mM. In some embodiments, the calcium- independent srtA mutants exhibit sortase A catalytic activity in the presence of calcium concentrations less than 100 ⁇ , less than 10 ⁇ , less than 1 ⁇ , less than 0.1 ⁇ , or less than 0.01 ⁇ .
- the calcium-independent srtA mutants exhibit at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, or at least up to 80% of wild-type sortase A catalytic activity at 0 mM Ca 2+ compared to wild-type sortase A catalytic activity at 10 mM Ca 2+ .
- the calcium-independent srtA mutants exhibit at least 85%, at least 90%, at least 91%, at least 92%, at least 90%, at least 91%, or at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or up to 100% of wild-type sortase A catalytic activity at 0 mM Ca 2 ' ' compared to wild- type sortase A catalytic activity at 10 mM Ca 2+ .
- the calcium- independent srtA mutants exhibit at least 1- fold, at least 2 - fold, at least 10-fold, at least 20-fold, at least 30-fold, at least 40-fold, at least 50-fold, at least 60-fold, at least 70-fold, at least 80-fold, at least 90-fold, or at least up to 100-fold, or more sortase A catalytic activity at 0 mM Ca 2+ compared to wild-type sortase A catalytic activity at 10 mM Ca 2+ .
- the calcium- independent srtA mutants exhibit at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, or at least up to 80% of sortase A catalytic activity of SEQ ID NO: 1 at 0 mM Ca 2+ compared to sortase A catalytic activity of SEQ ID NO: 1 at 10 mM Ca 2+ .
- the calcium- independent srtA mutants exhibit at least 85%, at least 90%, at least 91%, at least 92%, at least 90%, at least 91%, or at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or up to 100% of sortase A catalytic activity of SEQ ID NO: 1 at 0 mM Ca 2+ compared to sortase A catalytic activity of SEQ ID NO: 1 at 10 mM Ca 2 " .
- the calcium- independent srtA mutants disclosed herein exhibit increased catalytic activity compared to the sortase A of SEQ ID NO: 1.
- the calcium-independent srtA mutants exhibit at least 2-fold, at least 3-fold, at least 4- fold, at least 5-fold, at least 6-fold, at least 7-fold, at least 8-fold, at least 9-fold, at least 10-fold, at least 20-fold, at least 30-fold, at least 40-fold, at least 50-fold, at least 60-fold, at least 70-fold, at least 80-fold, at least 90-fold, or at least up to 100- fold, or more than sortase A catalytic activity of SEQ ID NO: 1 at 0 mM Ca 2+ compared to sortase A catalytic activity of SEQ ID NO: 1 at 10 mM Ca 2+ .
- the calcium- independent srtA mutants exhibit at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, or at least up to 80% of sortase A catalytic activity of SEQ ID NO: 3 at 0 mM Ca 2+ compared to sortase A catalytic activity of SEQ ID NO: 3 at 10 mM Ca 2+ .
- the calcium- independent srtA mutants exhibit at least 85%, at least 90%, at least 91%, at least 92%, at least 90%, at least 91%, or at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or up to 100% of sortase A catalytic activity of SEQ ID NO: 3 at 0 mM Ca 2+ compared to sortase A catalytic activity of SEQ ID NO: 3 at 10 mM Ca 2+ .
- the calcium- independent srtA mutants disclosed herein exhibit increased catalytic activity compared to the sortase A of SEQ ID NO: 3.
- the calcium- independent srtA mutants exhibit at least at least 2-fold, at least 3-fold, at least 4-fold, at least 5-fold, at least 6-fold, at least 7-fold, at least 8-fold, at least 9-fold, at least 10- fold, at least 20-fold, at least 30-fold, at least 40- fold, at least 50-fold, at least 60-fold, at least 70- fold, at least 80-fold, at least 90-fold, or at least up to 100-fold, or more than sortase A catalytic activity of SEQ ID NO: 3 at 0 mM Ca 2+ compared to sortase A catalytic activity of SEQ ID NO: 3 at 10 mM Ca 2+ .
- the calcium- independent srtA mutants exhibit at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, or at least up to 80% of sortase A catalytic activity of SEQ ID NO: 5 at 0 mM Ca 2+ compared to sortase A catalytic activity of SEQ ID NO: 5 at 10 mM Ca 2+ .
- the calcium- independent srtA mutants exhibit at least 85%, at least 90%, at least 91%, at least 92%, at least 90%, at least 91%, or at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or up to
- the calcium- independent srtA mutants disclosed herein exhibit increased catalytic activity compared to the sortase A of SEQ ID NO: 5.
- the calcium- independent srtA mutants exhibit at least 2-fold, at least 3-fold, at least 4- fold, at least 5-fold, at least 6-fold, at least 7-fold, at least 8-fold, at least 9-fold, at least 10-fold, at least 20-fold, at least 30-fold, at least 40-fold, at least 50-fold, at least 60-fold, at least 70-fold, at least 80-fold, at least 90-fold, or at least up to 100- fold, or more than sortase A catalytic activity of SEQ ID NO: 5 at 0 mM Ca 2+ compared to sortase A catalytic activity of SEQ ID NO: 5 at 10 mM Ca 2+ .
- aspects of the present disclosure also relate to calcium- independent mutants comprising an amino acid sequence at least 80% identical to SEQ ID NO: 9.
- a calcium- independent srtA mutant comprises an amino acid sequence at least 80% identical to SEQ ID NO : 9, wherein the mutant comprises a) a K residue at position 47 of SEQ ID NO: 9; b) a Q residue at position 50 of SEQ ID NO: 9; and c) at least one amino acid residue of SEQ ID NO: 9 selected from the group consisting of i) a R residue at position 36 of SEQ ID NO:9; ii) a N residue at position 102 of SEQ ID NO: 9; iii) a A residue at position 107 of SEQ ID NO: 9; iv) a E residue at position 132 of SEQ ID NO : 9; and v) a T residue at position 138 of SEQ ID NO : 9.
- a calcium- independent srtA mutant comprises an amino acid sequence at least 80% identical to SEQ ID NO : 9, wherein the mutant comprises a) a K residue at position 47 of SEQ ID NO : 9; b) a A residue at position 50 of SEQ ID NO: 9; and c) at least one amino acid residue of SEQ ID NO: 9 selected from the group consisting of i) a R residue at position 36 of SEQ ID NO: 9; ii) a N residue at position 102 of SEQ ID NO: 9; iii) a A residue at position 107 of SEQ ID NO: 9; iv) a E residue at position 132 of SEQ ID NO: 9; and v) a T residue at position 138 of SEQ ID NO: 9.
- a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 16 (FIG. 4P) . In some embodiments, a calcium independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 16 comprising a A residue at position 50. In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 17 (FIG. 4Q) . In some embodiments, a calcium independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 17 comprising a A residue at position 50. In some
- a calcium-independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 18 (FIG. 4R) .
- a calcium independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 18 comprising a A residue at position 50.
- a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO : 19 (FIG. 4S) .
- a calcium independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 19 comprising a A residue at position 50.
- a calcium-independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 20 (FIG. 4T) .
- a calcium independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 20 comprising a A residue at position 50.
- a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 21 (FIG. 4U) .
- a calcium independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 21 comprising a A residue at position 50.
- a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) .
- a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 comprising i) and ii) . In some embodiments, a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 comprising i) and iii) . In some embodiments, a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 comprising i) and iv) . In some embodiments, a calcium-independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 comprising i) and v) .
- a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 comprising ii) and iii) . In some embodiments, a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 comprising ii) and iv) . In some embodiments , a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO : 9 comprising ii) and v) .
- a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO : 9 comprising iii) and iv) . In some embodiments , a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 comprising iii) and v) .
- a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 comprising iv) and v) . In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 15 (FIG. 40) . In some embodiments, a calcium independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 15 comprising a A residue at position 50.
- a calcium- independent srtA mutant comprises at least three amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) .
- a calcium- independent srtA mutant comprises at least three amino acid residues of SEQ ID NO: 9 comprising i) , ii) , and iii) . In some embodiments, a calcium- independent srtA mutant comprises at least three amino acid residues of SEQ ID NO: 9 comprising i) , ii) , and iv) . In some embodiments, a calcium- independent srtA mutant comprises at least three amino acid residues of SEQ ID NO: 9 comprising i) , ii) , and v) .
- a calcium- independent srtA mutant comprises at least three amino acid residues of SEQ ID NO: 9 comprising ii) , iii) , and iv) . In some embodiments, a calcium- independent srtA mutant comprises at least three amino acid residues of SEQ ID NO : 9 comprising ii) , iv) , and v) .
- a calcium- independent srtA mutant comprises at least three amino acid residues of SEQ ID NO : 9 comprising iii ) , iv) , and v) .
- a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO : 12. (FIG . 4L) ) In some embodiments , a calcium- independent srtA mutant comprises an amino acid sequence SEQ ID NO:
- a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO : 14 (FIG . 4N) .
- a calcium independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 14 comprising a A residue at position 50.
- a calcium- independent srtA mutant comprises at least four amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) .
- a calcium- independent srtA mutant comprises at least four amino acid residues of SEQ ID NO: 9 comprising i) , ii) , iii) , and iv) . In some embodiments, a calcium-independent srtA mutant comprises at least four amino acid residues of SEQ ID NO: 9 comprising i) , iii) , iv) , and v) . In some embodiments, a calcium- independent srtA mutant comprises at least four amino acid residues of SEQ ID NO: 9 comprising ii) , iii), iv) , and v) .
- a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 11 (FIG. 4K) . In some embodiments, a calcium independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 11 comprising a A residue at position 50.
- a calcium-independent srtA mutant comprises an amino acid sequence at least 90% identical to SEQ ID NO: 9. In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence at least 95% identical to SEQ ID NO: 9. In some embodiments, a calcium-independent srtA mutant comprises an amino acid sequence at least 96% identical to SEQ ID NO: 9. In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence at least 97% identical to SEQ ID NO: 9. In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence at least 98% identical to SEQ ID NO : 9. In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence at least 99% identical to SEQ ID NO : 9. In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 9.
- polynucleotides encoding mutants of sortase A comprising a nucleotide sequence at least 80% identical to SEQ ID NO: 10 are disclosed.
- a nucleotide sequence at least 80% identical to SEQ ID NO: 10 are disclosed.
- polynucleotide encoding a mutant of sortase A comprises a nucleotide sequence at least 80% identical to SEQ ID NO: 10, and encodes a) a K residue at position 47 of SEQ ID NO: 9; b) a Q residue at position 50 of SEQ ID NO: 9; and c) at least one amino acid residue of SEQ ID NO: 9 selected from the group
- a polynucleotide encoding a mutant of sortase A comprises a nucleotide sequence at least 80% identical to SEQ ID NO: 10, and encodes a) a K residue at position 47 of SEQ ID NO: 9; b) a A residue at position 50 of SEQ ID NO: 9; and c) at least one amino acid residue of SEQ ID NO: 9 selected from the group consisting of i) a R residue at position 36; ii) a N residue at position 102; iii) a A residue at position 107; iv) a E residue at position 132; and v) a T residue at position 138.
- the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) .
- the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising i) and ii) . In some embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising i) and iii) . In some embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising i) and iv) . In some
- the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising i) and v) .
- the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising ii) and iii) . In some embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising ii) and iv) . In some embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising ii) and v) .
- the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising iii) and iv) . In some embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising iii) and v) .
- the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising iv) and v) .
- the polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) . In some embodiments, the polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9
- the polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9 comprising i) , ii) and iv) . In some embodiments, the polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9 comprising i) , ii) and v) .
- the polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9
- the polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9 comprising ii) , iii) and v) .
- the polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9
- the polynucleotide encodes at least four amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) .
- the polynucleotide encodes at least four amino acid residues of SEQ ID NO: 9 comprising i) , ii) , iii) , and iv) . In some embodiments , the polynucleotide encodes at least four amino acid residues of SEQ ID NO: 9 comprising ii) , iii) , iv) and v) .
- the polynucleotide encodes at each of the amino acid residues of SEQ ID NO: 9.
- a polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence at least 85% identical to SEQ ID NO: 10. In some embodiments, a polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence at least 90% identical to SEQ ID NO: 10. In some embodiments, a polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence at least 95% identical to SEQ ID NO: 10.
- a polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence at least 96% identical to SEQ ID NO: 10. In some embodiments, a polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence at least 97% identical to SEQ ID NO: 10. In some embodiments, polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence at least 98% identical to SEQ ID NO: 10.
- a polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence at least 99% identical to SEQ ID NO: 10. In some embodiments, a polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence at least 100% identical to SEQ ID NO: 10. In some embodiments, a polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence of SEQ ID NO: 10.
- nucleic acid construct comprising a polynucleotide disclosed herein (e.g., a polynucleotide encoding a calcium- independent srtA mutant) .
- the nucleic acid construct comprises a nucleotide sequence that encodes one or more C-terminal or N-terminal tags.
- polynucleotide refers to a molecule, which is a ribonucleic acid (RNA) or deoxyribonucleic acid (DNA) molecule, either single stranded or double stranded.
- RNA ribonucleic acid
- DNA deoxyribonucleic acid
- the polynucleotides of the present disclosure such as polynucleotides encoding the calcium- independent srtA mutants, can be isolated or synthesized using standard molecular biology techniques and the sequence information provided herein. The synthetic
- polynucleotides may be optimized in codon use, preferably according to the methods described in WO2006/077258 and/or PCT/EP2007/055943 , which are herein incorporated by reference.
- PCT/EP2007/055943 addresses codon-pair optimization.
- the polynucleotides encoding the calcium- independent srtA mutants of the disclosure can be amplified using cDNA, mRNA or alternatively, genomic DNA, as a template and appropriate oligonucleotide primers according to standard PGR amplification techniques. The nucleic acid so amplified can be cloned into an
- a polynucleotide may either be present in isolated form, or be comprised in recombinant nucleic acid molecules or vectors, or be comprised in a host cell.
- isolated polypeptide or protein is intended a polypeptide or protein removed from its native
- polypeptides and proteins expressed in host cells are considered isolated for the purpose of the disclosure, as are native or recombinant polypeptides which have been substantially purified by any suitable technique such as, for example, the single- step purification method
- vector refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked.
- Expression vectors useful in the present disclosure include chromosomal-, episomal- and virus-derived vectors e.g., vectors derived from bacterial plasmids,
- Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques.
- yeast episome yeast chromosomal elements
- viruses such as baculoviruses , papova viruses, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses
- vectors derived from combinations thereof such as those derived from plasmid and bacteriophage genetic elements, such as cosmids and phagemids .
- Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques.
- transformation and “transfection” are intended to refer to a variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium chloride co- precipitation, DEAE-dextran-mediated transfection, transduction, infection, lipofection, cationic
- the vectors, such as expression vectors, of the disclosure can be introduced into host cells to thereby produce proteins or peptides, encoded by nucleic acids as described herein (e.g. calcium- independent srtA mutant proteins, catalytically active fragments, catalytically active variants or catalytically active derivatives thereof) .
- the vectors, such as recombinant expression vectors, of the disclosure can be designed for expression of calcium-independent srtA mutant proteins in
- prokaryotic or eukaryotic cells prokaryotic or eukaryotic cells.
- calcium- independent srtA mutant proteins can be expressed in bacterial cells such as E. coli, insect cells (using baculovirus expression
- a "host cell,” as used herein, is any cell capable of being grown and maintained in cell culture under conditions allowing for production and recovery of useful quantities of a biological product, as defined herein.
- Host cells can be unmodified cells or cell lines, or cell lines which have been genetically modified (e.g., to facilitate production of a biological product) .
- the host cell is a cell line that has been modified to allow for growth under desired conditions, such as in serum-free media, in cell suspension culture, or in adherent cell culture.
- a host cell can be chosen that modulates the expression of the inserted sequences, or modifies and processes the gene product in a specific, desired fashion . Such modifications (e.g. , glycosylation) and processing (e.g., cleavage) of protein products may facilitate optimal functioning of the protein.
- Suitable host cells are preferably prokaryotic microorganisms such as bacteria (e.g., E. coli) , or in some embodiments eukaryotic organisms, for example f ngi, such as yeasts or filamentous fungi, or plant cells.
- the calcium- independent srtA mutant according to the disclosure can be recovered and purified from recombinant cell cultures by methods known in the art (e.g., ion- exchange chromatography, hydrophobic interaction
- exclusion chromatography to further separate the target calcium- independent mutant srtA from the bulk protein to enable recovery of the target calcium- independent srtA mutant in a highly purified state
- the disclosure relates to a method of producing a calcium- independent srtA mutant comprising the steps of: (a) culturing the host cell according to the disclosure in a suitable culture medium under suitable conditions to produce calcium- independent srtA mutant; and optionally (b) purifying said calcium- independent srtA mutant to provide a purified calcium- independent srtA mutant product .
- Another aspect of the present disclosure relates to methods for the production of substantially purified calcium- independent srtA mutant enzyme of the present disclosure. It should be appreciated that such enzyme can be prepared in compliance with Good Manufacturing Practices (GMP) or used in a GMP-com liant process.
- GMP Good Manufacturing Practices
- substantially purified calcium- independent srtA mutant involves cloning a nucleic acid segment encoding the calcium- independent srtA mutant enzyme can be inserted in a vector that contains sequences allowing expression of a sortase-transamidase in another organism, such as E.
- a suitable host organism can then be transformed or transfected with the vector containing the cloned nucleic acid segment. Expression is then performed in that host organism. The expressed enzyme is then purified using standard techniques. Techniques for the purification of cloned proteins are well known in the art and need not be detailed further here (e.g., affinity chromatography on a nickel NTA column for purification of a calcium- independent srtA mutant enzyme extended at its carboxyl terminus with a sufficient number of histidine residues to allow specific binding of the protein molecule to the nickel NTA column through the histidine residues) .
- the disclosure further provides an enzyme composition comprising one or more calcium- independent srtA mutants.
- the disclosure relates to a sortase-mediated transpeptidation reaction catalyzed by an enzyme composition according to the disclosure. Examples of synthetic nucleophiles that can be used in such sortase-mediated transpeptidation reactions are shown in Table 1 below .
- Table 1 Examples of synthetic nucleophiles used in site-specific sortase A transpeptidation reactions .
- an enzyme composition disclosed herein for the sortagging of a protein.
- An enzyme composition of the disclosure may comprise a polypeptide which has the same enzymatic activity, for example the same type of transamidase activity as that provided by a polypeptide of the disclosure.
- An enzyme composition of the disclosure may comprise a polypeptide which has a different type of enzymatic activity than that provided by a polypeptide of the disclosure.
- composition is purified to comprise calcium- independent srtA mutants of a particular sequence (e.g., SEQ ID NO: 9) .
- enzyme composition is purified to comprise calcium- independent srtA mutants of a particular sequence (e.g., SEQ ID NO: 9) .
- the enzyme composition is purified to comprise calcium- independent srtA mutants of a particular sequence (e.g., SEQ ID NO: 9) .
- the enzyme composition is purified to comprise calcium- independent srtA mutants of a particular sequence (e.g., SEQ ID NO: 9) .
- the enzyme composition comprises calcium- independent srtA mutants predominantly in monomeric form.
- the enzyme composition comprises a mixture of both monomeric and dimeric calcium- independent srtA mutants .
- the enzyme composition comprises a mixture of calcium- independent srtA mutants comprising different sequences.
- the enzyme composition comprises at least two, at least three , at least four, or at least five calcium- independent sortase A mutants selected from the group consisting of SEQ ID NO : 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12 , SEQ ID NO: 13 , SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18 , SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21 , and combinations thereof .
- the enzyme composition comprises a group of calcium- independent sortase A mutants comprising SEQ ID NO : 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO:
- the enzyme composition comprises a calcium- independent srtA mutant and a calcium-dependent srtA
- the enzyme composition is a mixture of the enzyme composition
- the enzyme composition comprises a
- the enzyme composition is a mixture of the enzyme composition
- the enzyme comprises: 15 NO: 3 or SEQ ID NO: 5. In some embodiments, the enzyme
- composition comprises an aqueous medium.
- the calcium- independent srtA mutants and enzyme compositions comprising the calcium- independent srtA
- Exemplary such applications include, but are not limited to , specific incorporation of novel functionality into proteins , synthesis of neoglycoconjugates , immobilization of proteins to solid surfaces, protein labeling on living
- eGFP-LPETG Polystyrene beads Parthasart hy et al . (2007) eGFP-LPETGG-His f , Glycidyl methacrylate (GMA) Chan et al. (2007)
- the calcium- independent srtA mutants according to the disclosure may feature a number of significant
- these advantages may include aspects such as lower production costs,
- a calcium- independent srtA mutant or composition of the disclosure may be used in any process which requires the sortagging of a moiety of interest to a target protein .
- sortagging , “sortase- mediated ligation”, “sortase-mediated transpeptidation” , “sortase-mediated transacylation” , are used
- tags include, but are not limited to, amino acids, peptides, proteins, nucleic acids, polynucleotides, sugars, carbohydrates, polymers, lipids, fatty acids, and small molecules. Other suitable tags will be apparent to those of skill in the art and the disclosure is not limited in this aspect.
- a tag comprises a sequence useful for purifying, expressing, solubilizing, and/or detecting a polypeptide.
- a tag comprises an HA, TAP, Myc, 6XHis, Flag, or GST tag, to name few examples .
- a tag comprises a solubility-enhancing tag (e.g., a SUMO tag, NUS A tag, SNUT tag, a Strep tag, or a monomeric mutant of the Ocr protein of bacteriophage T7) . See, e.g., Esposito D and Chatterjee DK. Curr Opin Biotechnol . ; 17(4):353-8 (2006).
- a tag is cleavable, so that it can be removed, e.g., by a protease. In some embodiments, this is achieved by including a protease cleavage site in the tag, e.g., adjacent or linked to a functional portion of the tag.
- exemplary proteases include, e.g., thrombin, TEV protease, Factor Xa, PreScission protease, etc.
- a "self-cleaving" tag is used. See, e.g., PCT/US05/05763.
- a tag comprises a click chemistry handle.
- the calcium-independent srtA mutants disclosed herein can be used to sortag cells to be administered to a subject, e.g., human subjects.
- a method for sortagging a target polypeptide comprises (a) providing a polypeptide comprising a sortase recognition motif; (b) providing a moiety comprising an N-terminal oligoglycine sequence or a terminal alkylamine; and (c) contacting the target polypeptide with the moiety in the presence of an enzyme composition described herein (e.g., a composition comprising at least one calcium- independent srtA mutant disclosed herein) under conditions suitable for the srtA A mutant to ligate the moiety to the target protein, thereby sortagging the target protein.
- an enzyme composition described herein e.g., a composition comprising at least one calcium- independent srtA mutant disclosed herein
- sortagging methods disclosed herein can be used to attach a moiety to any target protein or polypeptide.
- Methods and compositions provided herein can be used to conjugate essentially any polypeptide to any moiety.
- Non- limiting examples of polypeptides that can be produced or conjugated according to methods provided herein include receptors, membrane proteins, cytokines, chemokines, hormones, enzymes, growth factors, growth factor receptors, antibodies, antibody derivatives and other immune effectors,
- interleukins interleukins , interferons, erythropoietin, integrins, soluble major histocompatibility complex antigens, binding proteins, transcription factors, translation factors, oncoproteins or proto-oncoproteins , muscle proteins, myeloproteins, neuroactive proteins, tumor growth suppressors, structural proteins, and blood proteins (e.g., thrombin, serum albumin, Factor VII, Factor VIII, Factor IX, Factor X, Protein C, von
- the polypeptide is a glycoprotein or other polypeptide which requires post- translational modification, such as deamidation, glycation, or the like, for optimal activity.
- the polypeptide is a lipoprotein. Exemplary target polypeptides which have been sortagged are shown in Table 3 below.
- Table 3 Examples of proteins labeled by sortase A transpeptidation .
- the target polypeptide comprises or consists of a polypeptide that is at least 80%, or at least 90%, e.g., at least 95%, 86%, 97%, 98%, 99%, 99.5%,
- the target polypeptide has no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 amino acid differences relative to a naturally occurring sequence.
- the naturally occurring protein is a mammalian protein, e.g., of human origin.
- Naturally occurring sequences e.g., genomic, mRNA, and polypeptide sequences, from a wide variety of species, including human, are known in the art and are available in publicly accessible databases such as those available at the National Center for Biotechnology Information (www.ncbi.nih.gov) or Universal Protein Resource
- Databases include, e.g., GenBank, RefSeq, Gene, UniProtKB/SwissProt , UniProtKB/Trembl , and the like. Sequences, e.g., nucleic acid (e.g., mRNA) and polypeptide sequences, in the NCBI Reference Sequence database may be used as reference sequences.
- a target polypeptide is a protein that is approved by the US Food & Drug
- Such proteins may or may not be one for which a PEGylated version has been tested in clinical trials and/or has been approved for marketing .
- a target polypeptide is a neurotrophic factor, i.e., a factor that promotes survival , development and/or function of neural lineage cells (which term as used herein includes neural
- progenitor cells e.g., astrocytes, oligodendrocytes, microglia.
- glial cells e.g., astrocytes, oligodendrocytes, microglia
- t e target protein is one that forms homodimers or heterodimers , (or homo- or heterooligomers comprising more than two subunits, such as tetramers) .
- the target polypeptide is an enzyme, e.g., an enzyme that is important in metabolism or other physiological processes.
- a target protein comprises a receptor or receptor fragment (e.g., extracellular domain).
- the target polypeptide is sortagged in cells to be administered to a subject.
- the present disclosure contemplates any application for which administration of cells comprising a sortagged
- polypeptide is desirable (e.g., therapeutic, diagnostic, imaging, etc.)
- a "subject” means a human or animal. Usually the animal is a vertebrate such as a primate, rodent, domestic animal or game animal.
- Primates include chimpanzees, cynomologous monkeys, spider monkeys, and macaques, e.g., Rhesus.
- Rodents include mice , rats, woodchucks , ferrets , rabbits and hamsters.
- Domestic and game animals include cows, horses, pigs, deer, bison, buffalo, feline species, e.g., domestic cat, canine species, e.g., dog, fox, wolf, avian species , e.g., chicken, emu, ostrich, and fish, e.g., trout , catfish and salmon .
- Subj ect includes any subset of the foregoing, e.g., all of the above, but excluding one or more groups or species such as humans , primates or rodents .
- the subj ect is a mammal, e.g., a primate, e.g., a human .
- the subj ect is a mammal .
- the mammal can be a human, non- human primate, mouse, rat, dog, cat, horse, or cow, but are not limited to these examples. Mammals other than humans can be advantageously used, for example, as subj ects that represent animal models of disease .
- a subj ect can be male or female .
- proteins e.g., secreted eukaryotic (e.g., mammalian) proteins
- intracellular processing e.g., cleavage of a secretion signal prior to secretion and/or removal of other portion (s) that are not required for biological activity
- Such mature, biologically active versions of target proteins are used in certain embodiments of the disclosure.
- polypeptide according to methods provided herein can be any agent suitable for conjugation to a polypeptide, i.e., capable of being operably linked to a sortase recognition sequence.
- the moiety can confer any of a number of possible functionalities to the polypeptide, such as but not limited to, altered physico-chemical properties, such as solubility and/or stability,- altered pharmacokinetic properties, such as bioavailability, clearance rate, and/or plasma half-life and/or altered biological activity, such as immunogenicity and/or antigenicity.
- Non-limiting examples of moieties include: a small -molecule , a peptide, a polypeptide, a lipid and/or fatty acid, a carbohydrate, a nucleic acid, a reporter molecule (e.g., a reporter enzyme, fluorescent molecule, a radiolabel, an affinity label, or the like), a toxin, a therapeutic agent, a nanoparticle, a resin, a cell, a virus particle, an adjuvant molecule, or a polymer (e.g., a hydrophilic polymer), an affinity tag (e.g., His6) , or the like.
- the moiety is a pharmacological carrier molecule.
- the moiety comprises, consists essentially of, or consists of a member of a prosthetic binding group, such as biotin/avidin,
- biotin/streptavidin maltose binding protein/maltose
- glutathione S-transferase/glutathione glutathione S-transferase/glutathione
- metal/polyhistidine antibody/epitope , antibody/antigen, antibody/protein A or protein G, hapten/anti-hapten, folic acid/folate binding protein, vitamin B 12/intrinsic factor, nucleic acid/complementary nucleic acid,
- the moiety comprises, consists essentially of, or consists of a peptide, a
- peptidomimetic e.g., a peptoid
- an amino acid an amino acid analog
- a polynucleotide or polynucleotide analog e.g., a polynucleotide or a nucleotide or nucleotide analog
- an organic or inorganic compound having a molecular weight between about 500 and about 10,000.
- the moiety comprises, consists essentially of, or consists of a second polypeptide.
- the polypeptide can be any polypeptide.
- a protein which is difficult to produce in a cell e.g., either due to toxicity
- fragments which can be j oined using the methods described herein e.g. , first portion of the protein can be attached to the second portion, reconstituting an active protein using the methods described herein.
- the moiety comprises , consists essentially of , or consists of a reporter molecule , such as a fluorescent molecule (e.g. , umbelliferone ,
- fluorescein fluorescein isothiocyanate , rhodamine , dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin
- a radioisotope e.g., Cu-64, Ga67, Ga-68, Zr-89, Ru-97, Tc-99, Rh-105, Pd-109, In- 1.1.1, 1-123 , I- 125, 1-131, Re-186, Re- 188, Au-198 , Pb-203 , At-211, Pb- 212 or Bi-212
- a detectable enzyme e.g.
- horseradish peroxidase alkaline phosphatase, p-galactosidase, or acetylcholinesterase
- a luminescent material e.g., luminol
- a bioluminescent material e.g., luciferase, luciferin, or aequorin
- the moiety comprises, consists essentially of, or consists of a biologically active molecule, such as a toxin (e.g., abrin, ricin A,
- pseudomonas exotoxin or diphtheria toxin .
- the moiety comprises the polypeptide itself, such that the polypeptide is cyclized by the conjugation.
- cyclized proteins often exhibit desired properties relative to the
- polypeptide can be 'chained' (e.g., dimerized, trimerized, etc) .
- the moiety is a water-soluble polymer, non-peptidic polymer with an average molecular weight of about 200 to about 200,000 Daltons, depending on the desired effect on the properties of the
- the moiety comprises, consists essentially of, or consists of a polymeric group, such as polyalkylene oxide (PAO) , polyalkylene glycol (PAG) , polyethylene glycol (PEG) , methoxypolyethylene glycol (mPEG) , polypropylene glycol (PPG), branched PEGs, copolymers of ethylene glycol and propylene glycol, polyvinyl alcohol (PVA) ,
- PAO polyalkylene oxide
- PAG polyalkylene glycol
- PEG polyethylene glycol
- mPEG methoxypolyethylene glycol
- PPG polypropylene glycol
- PVA polyvinyl alcohol
- polycarboxylate poly-vinylpyrrolidone , polyethylene-co- maleic acid anhydride, polystyrene-co-maleic acid anhydride, dext rboxymethy1 -dextran,
- polyoxyethylated glycerol polyoxyethylated sorbitol, polyoxyethylated glucose, dextran, polyoxazoline, polyacryloylmorp oline, or a serum protein binding- ligand, such as a compound which binds to albumin (e.g., fatty acids, C 5 -C 2 4 fatty acid, aliphatic diacid (e.g. C 5 - C 24 ) ) .
- albumin e.g., fatty acids, C 5 -C 2 4 fatty acid, aliphatic diacid (e.g. C 5 - C 24 )
- compositions provided herein are known in the art and are described, e.g., in U.S. Pat. No. 5,629,384, which is herein incorporated by reference.
- the target polypeptide or moiety comprises an affinity tag that can be used to facilitate recovery and/or isolation of the conjugated polypeptide.
- An affinity tag used in a method or composition provided herein can comprise any peptide or other molecule for which an antibody or other specific binding agent is available.
- Affinity tags known in the art as being useful for protein purification include, but are not limited to, a poly-histidine segment, protein A (e.g., Nilsson et al . , EMBO J. 4:1075 (1985); Nilsson et al . , Methods Enzymol . 198:3 (1991)), glutathione S transferase (e.g., Smith and Johnson, Gene 67:31 (1988)), Glu-Glu affinity tag (e.g., Grussenmeyer et al . , Proc . Natl. Acad. Sci. USA 82:7952 (1985)), substance P, FLAG peptide (e.g., Hopp et al . , Biotechnology 6:1204 (1988)), c-myc tags (detected with anti-myc antibodies) ,
- an affinity tag described herein allows for selective enrichment of desired conjugation products.
- an affinity tag is located N- terminal of a sortase recognition sequence or C- terminal of an oligoglycine sequence so that the tag remains associated with the polypeptide after sortase- catalyzed cleavage and ligation. As such, the affinity tag is retained in the conjugated polypeptide upon cleavage and/or ligation of the sortase recognition sequence by a sortase and affinity
- an affinity tag is located C- terminal of a sortase recognition sequence so that the tag is cleaved from the polypeptide upon sortase- catalyzed cleavage and ligation.
- the affinity tag is located C-terminal of the sortase recognition sequence (i.e., the sortase recognition sequence is between the target polypeptide and the affinity tag) .
- the target polypeptide comprises a spacer peptide.
- a spacer peptide separates the target polypeptide from a sortase recognition sequence and/or an affinity tag, and/or the sortase recognition sequence from an affinity tag.
- a spacer peptide can be of any size, e.g., from several to 30 or more amino acid residues, sufficient to serve the intended purpose. Spacer peptides can enhance conformational flexibility between two or more domains of a protein and/or minimize steric interference with the folding and/or function of two or more domains of a protein.
- a spacer peptide will generally comprise an inert, flexible amino acid sequence, e.g., comprising predominantly glycine, serine, and/or alanine residues.
- a spacer peptide sequence can be modified with one or more proline residues at the beginning and/or at the end of the spacer in order to isolate the spacer as a separate functional domain from neighboring domains of the protein.
- spacer peptides are known in the art.
- Contacting refers to the addition of the target polypeptide to the culture medium in a manner that allows the calcium- independent srtA mutant to ligate the moiety to target polypeptide.
- the contacting includes culturing the cells for a defined period of time in the presence of the target polypeptide. In other embodiments, the contacting includes culturing the cells for a variable period of time until a desired endpoint or other indicator is achieved.
- Conditions which allow the calcium- independent srtA mutant to cleave the sortase recognition sequence and ligate the moiety to the target polypeptide include for example, standard cell growth conditions known to those of skill in the art, e.g. for mammalian cells; 37° C . , 5% C0 2 , and an appropriate cell culture medium.
- the cell culture medium may vary depending upon the host cell and can be determined readily by those of skill in the art.
- the "contacting" step in the method takes place in the absence of calcium, i.e., there is no exogenous calcium added to the culture medium or reaction mixture .
- a sorta.se is immobilized by attaching it to a support.
- a "support” may be any entity or plurality of entities having a surface to which a substance may be attached or on which a substance may be placed. Examples, include, e.g., particles, slides, filters, interior wall or bottom of a vessel (e.g., a culture vessel such as a plate or flask, well of a microwell plate, tube), chips, etc.
- a support may be composed, e.g., of glass, metal, gels (e.g., agarose) , ceramics, polymers, or combinations thereof.
- Immobilization may comprise contacting sortase or a composition containing sortase with an affinity reagent, e.g., an antibody, that binds to sortase, wherein the affinity reagent is attached to a support.
- an affinity reagent e.g., an antibody
- the sortase is tagged, and the affinity reagent binds to the tag.
- sortase may comprise a tag, e.g., a 6X-His tag, which may be used to immobilize the sortase to a metal-ion containing resin or substrate.
- sortase is immobilized to magnetic particles.
- magnetic particles may be magneti sable and paramagnetic, e.g., superparamagnetic, i.e., they may only magnetic in a magnetic field.
- the support is in a column. Unreacted sortase substrates and reaction products may readily be separated from an immobilized sortase.
- the disclosure encompasses agents produced according to methods described herein, and compositions comprising such agents. It will be understood that, in some aspects, the disclosure encompasses methods of using such agents, e.g., for one or more purposes described herein.
- the disclosure further provides packaged products and kits, including calcium- independent srtA mutants described herein or polynucleotides encoding the same, cell lines, cell cultures, populations and compositions, including, as well cells, cultures, populations, and compositions enriched or selected for any calcium- independent srtA mutants or variants thereof, packaged into suitable packaging material.
- a packaged product or kit includes calcium- independent srtA mutants in monomeric form.
- the packaged product or kit includes calcium-independent srtA mutants in dimeric form.
- the packaged product or kit includes a mixed population of monomeric and dimeric calcium- independent srtA mutants.
- a packaged product or kit includes a label, such as a list of the contents of the package, or instructions for using the kit e.g., instructions for sortagging a target polypeptide, isolating or producing a substantially purified calcium- independent srtA mutant disclosed herein, administering sortagged cells, e.g., implanting or transplanting in vivo, or screening for a compound or agent that modulates activity of the calcium- independent srtA mutants or variants thereof .
- a label such as a list of the contents of the package
- instructions for using the kit e.g., instructions for sortagging a target polypeptide, isolating or producing a substantially purified calcium- independent srtA mutant disclosed herein, administering sortagged cells, e.g., implanting or transplanting in vivo, or screening for a compound or agent that modulates activity of the calcium- independent srtA mutants or variants thereof .
- a packaged product or kit includes a container, such as a sealed pouch or shipping container, or an article of manufacture, for example, to carry out a sortase-mediated ligation reaction utilizing a calcium-independent srtA mutant described herein, variant thereof or composition comprising the same, or preserving or storing the calcium- independent srtA mutants , such as a tissue culture dish, tube, flask, roller bottle or plate (e.g., a single multi-well plate or dish such as an 8 , 16, 32 , 64, 96, 384 and 1536 multi- well plate or dish) .
- a container such as a sealed pouch or shipping container, or an article of manufacture, for example, to carry out a sortase-mediated ligation reaction utilizing a calcium-independent srtA mutant described herein, variant thereof or composition comprising the same, or preserving or storing the calcium- independent srtA mutants , such as a tissue culture dish, tube, flask, roller bottle
- packaging material refers to a physical structure housing the product or components of the kit .
- the packaging material can maintain the components sterilel , and can be made of material commonly used for such purposes (e.g. , paper, corrugated fiber, glass , plastic , foil , ampules , etc . ) .
- a label or packaging insert can be included, listing contents or appropriate written instructions, for example, practicing a method of the disclosure.
- a packaged product or kit can therefore include instructions for practicing any of the methods of the disclosure described herein.
- calcium- independent srtA mutants described herein, variants thereof, or enzyme compositions comprising them can be included in a tissue culture dish, tube, flask, roller bottle or plate (e.g., a single multi-well plate or dish such as an 8, 16, 32, 64, 96, 384 and 1536 multi-well plate or dish) together with instructions, e.g., for sortagging, purification, preserving or screening.
- Instructions may be on "printed matter," e.g., on paper or cardboard within the kit, on a label affixed to the package, kit or packaging material, or attached to a tissue culture dish, tube, flask, roller bottle, plate (e.g., a single multi-well plate or dish such as an 8, 16, 32, 64, 96, 384 and 1536 multi-well plate or dish) or vial containing a component of the kit.
- Instructions may comprise voice or video tape and additionally be included on a computer readable medium, such as a disk (floppy diskette or hard disk) , optical CD such as CD- or DVD- ROM/RAM, magnetic tape, electrical storage media such as RAM and ROM and hybrids of these such as magnetic/optical storage media.
- kits can optionally include additional components, such as buffering agent, a preservative, or a reagent.
- additional components such as buffering agent, a preservative, or a reagent.
- Each component of the kit can be enclosed within an individual container or in a mixture and all of the various containers can be within single or multiple packages .
- nucleophiles can be used in a sortase reaction that comprise reactive chemical moieties, for example, moieties, or "handles", suitable for a click chemistry- reaction, as is described in detail in Published PCT International Application WO 2013/036630, the entirety of which is incorporated herein by reference.
- the invention includes embodiments in which exactly one member of the group is present in, employed in, or otherwise relevant to a given product or process.
- the invention also includes embodiments in which more than one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process.
- any one or more nucleic acids , polypeptides , cells , species or types of organism, disorders, subjects, or combinations thereof, can be excluded .
- composition of matter e.g., a nucleic acid, polypeptide, cell, or non-human transgenic animal
- composition of matter according to any of the methods disclosed herein, and methods of using the composition of matter for any of the purposes disclosed herein are aspects of the invention, unless otherwise indicated or unless it would be evident to one of ordinary skill in the art that a contradiction or inconsistency would arise.
- claims or description relate to a method, e.g., it is to be understood that methods of making compositions useful for performing the method, and products produced according to the method, are aspects of the invention, unless otherwise indicated or unless it would be evident to one of ordinary skill in the art that a contradiction or inconsistency would arise.
- the invention includes embodiments in which the endpoints are included, embodiments in which both endpoints are excluded, and embodiments in which one endpoint is included and the other is excluded. It should be assumed that both endpoints are included unless indicated otherwise.
- invention includes embodiments that relate analogously to any intervening value or range defined by any two values in the series, and that the lowest value may be taken as a minimum and the greatest value may be taken as a maximum.
- Numerical values include values expressed as percentages. For any embodiment of the invention in which a numerical value is prefaced by "about” or “approximately” , the invention includes an embodiment in which the exact value is recited. For any embodiment of the invention in which a numerical value is not prefaced by "about” or “approximately” , the invention includes an embodiment in which the value is prefaced by "about” or “approximately” .
- Approximately or “about” generally includes numbers that fall within a range of 1% or in some embodiments within a range of 5% of a number or in some embodiments within a range of 10% of a number in either direction (greater than or less than the number) unless otherwise stated or otherwise evident from the context (except where such number would impermissibly exceed 100% of a possible value) . It should be
- Example 1 A substantially pure, stable, high yield, calcium-independent srtA monomeric mutant
- the present inventors surprisingly and unexpectedly found that a calcium- independent srtA mutant (SEQ ID NO: 9) purifies predominantly in monomeric form compared to a calcium-dependent srtA mutant (SEQ ID NO: 7) that purifies both in dimeric and monomeric forms.
- the sortase purification process involves direct loading of clarified E.coli lysate onto a nickel NTA column. His-tagged sortase binds to the nickel NTA column (via metal chelation) and is eluted using an imidazole buffer. Eluted material from the nickel NTA column is loaded onto a size exclusion chromatography column (SEC) (e.g., a HILOAD ® 16/600 SUPERDEX ® 75 pg) and resulting separations (chromatogram) are show in FIGS. 7A, 7B and 7C.
- SEC size exclusion chromatography column
- the column resolves/separates residual high
- the calcium- independent srtA mutant of SEQ ID NO: 9 exists predominantly in monomeric form (FIG. 7A) , in contrast, as noted above, the calcium-dependent sortase obtained from SEQ ID NO: 7 exists in both dimeric and monomeric form (FIG. 7B) , as does the calcium- dependent wild-type sortase (FIG. 7C) .
- Those skilled in the art will appreciate the unexpected advantages which this confers on the calcium- independent srtA mutant of SEQ ID NO: 9, including a higher recovery yield of 100% monomer, and simpler manufacturing process to isolate monomer as there is far less dimer to separate out, amongst others.
- the lower amount of dimer form observed with the calcium- independent srtA mutant of SEQ ID NO: 9 is due to lower propensity to dimerize based on the outer surface location of the substituted amino acids between the calcium-dependent srtA mutant of SEQ ID NO: 7 and the calcium- independent srtA mutant of SEQ ID NO: 9 (see, e.g., FIG. 5B for an alignment of SEQ ID NO: 7 and SEQ ID NO: 9).
- the lower propensity of the calcium-independent srtA mutant of SEQ ID NO: 9 to dimerize is likely to translate into a more "stable" enzyme as it should also have a lower tendency to dimerize during storage, which may prolong shelf-life or permit storage under less costly conditions (e.g. refrigerated vs. frozen at - 80°C) .
- FIG. 8 shows a gel demonstrating the high purity of monomeric sortase compared to dimeric sortase.
- the gel illustrates the protein constituents present in the three main components from the calcium- independent srtA mutant of SEQ ID NO: 9 preparation separated on the SEC purification column. Collected fractions were pooled comprising the Void, Dimer and Monomeric peaks shown in FIG. 7B.
- the "Load” in FIG. 8 represents the starting material in the clarified E. coli lysate.
- the "Void" lane in FIG. 7B contains some sortase along with a large number of higher molecular weight proteins.
- the "Dimer” lane in FIG. 8 contains
- FIG. 9 shows the SEC results demonstrating the stability of monomeric sortase. Briefly, the dimeric and monomeric pools collected from the SEC purification step described above were stored at 4 °C for 24 hours, and then analyzed by analytical SEC (SUPERDEX ® 75 10/300 GL) . As shown in FIG.
- the calcium- independent srtA mutant of SEQ ID NO: 9 was generated by introducing two additional nucleotide point mutations into a calcium-dependent srtA mutant of SEQ ID NO: 7. These nucleotide point mutations comprised a G->A mutation at position 139, and a G-C mutation at position 148 of SEQ ID NO: 7 .
- the mutations were introduced on primers used to amplify a pET29 mutant delta 59 SrtA construct (see, e.g., Liu et al . , "A general strategy for the evolution of bond-forming enzymes using yeast display," PNAS.
- PCR amplification was then be performed in two stages. First, the construct was amplified with primers shown in Table 4 below such that the forward primer introduced a restriction site for cloning (e.g., a Ndel restriction site), and the reverse primer introduced the two point mutations described above .
- a restriction site for cloning e.g., a Ndel restriction site
- the reverse primer introduced the two point mutations described above .
- the second half of the construct was amplified with the primers shown in Table 5 below such that the forward primer introduced the two point mutations described above, and the reverse primer introduced a restriction site for cloning (e.g., a Xhol restriction site) .
- a restriction site for cloning e.g., a Xhol restriction site
- PCR reactions were performed using the pET29 pentamutant delta 59 SrtA plasmid DNA as a template, under the conditions set forth in Table 6 and cycling conditions set forth in Table 7 below.
- the resulting PGR products from the first round of PGR were analyzed by gel electrophoresis, excised from the gel and purified. The products were then used as a template in a second round of PGR, and amplified using forward primer 1 and reverse primer 2 described above.
- the second round of PCR reactions was carried out under the conditions set forth in Table 8 below under the cycling conditions set forth in Table 9.
- the resulting product from the second round of PGR reactions was analyzed by gel electrophoresis, digested with Ndel and Xhol , and ligated into pET30b so that there is 6x his tag at the 3' end of the construct.
- SEQ ID NO: 1 Exemplary Wild Type S. aureus Sortase A amino acid sequence
- SEQ ID NO: 3 Exemplary Wild Type S. aureus Sortase A amino acid sequence
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Enzymes And Modification Thereof (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Disclosed herein are calcium- independent sortase A mutants exhibiting increased catalytic activity compared to wild-type sortase A.
Description
Inventor(s) : Jessica Ingram, Hidde Ploegh Attorney's Docket No.: WIBR- 1 1-WOl
CALCIUM-INDEPENDENT SORTASE A MUTANTS
RELATED APPLICATIONS
This application claims the benefit of U.S.
Application No. 61/943,042, filed on February 21, 2014. The entire teachings of the above application ( s ) are incorporated herein by reference.
GOVERNMENT SUPPORT
This invention was made with government support under R0A1087879 awarded by the National Institutes of Health. The government has certain rights in the invention .
BACKGROUND OF THE INVENTION
Protein engineering is becoming a widely used tool in many areas of protein biochemistry. One engineering method is controlled protein ligation. Native chemical protein ligation relies on efficient preparation of synthetic peptide esters, which can be technically difficult to prepare for many proteins. Recombinant technologies can be used to generate protein-protein fusions, joining the C-terminus of one protein with the N-terrninus of another protein. Intein-based protein ligation systems can also be used to join proteins. A
prerequisite for this intein-mediated ligation method is that the target protein is expressed as a correctly folded fusion with the intein, which is often
challenging. The difficulties of conventional native and recombinant ligation technologies significantly limit the application of protein ligation.
The transpeptidation reaction catalyzed by sortases has emerged as a general method for derivatizing proteins with various types of modifications. For conventional sortase modifications, target proteins are engineered to contain a sortase recognition motif (LPXT) near their C- termini . When incubated with synthetic peptides
containing one or more N-terminal glycine residues and a recombinant sortase, these artificial sortase substrates undergo a transacylation reaction resulting in the exchange of residues C-terminal to the threonine residue with the synthetic oligoglycine peptide , resulting in the protein C-terminus being ligated to the N- terminus of the synthetic peptide .
S . aureus sortase A (SaSrtA) is a common sortase utilized for conventional sortase modification of target proteins . However, SaSrtA possesses a Ca2+ dependency that can limit the effectiveness of SaSrtA in low Ca2+ concentrations , such as the cytoplasm or when Ca2+ binding compounds, for example phosphate, carbonate, and
ethylenediaminetetraacetic acid (EDTA) , are present .
SUMMARY OF THE INVENTION
In some aspects, the disclosure provides a sortase A mutant comprising at least three amino acid substitutions relative to a wild- type sortase A, wherein the amino acid substitutions comprise a) a K residue at position 105 ; b) a Q or A residue at position 108 ; and c) at least one
amino acid substitution selected from the group
consisting of i) a R residue at position 94; ii) a S residue at position 94; iii) a N residue at position 160; iv) a A residue at position 165; v) a E residue at position 190; and vi) a T residue at position 196.
In some embodiments, the wild-type sortase A comprises a S. aureus sortase A In some embodiments, the wild-type sortase A comprises a protein sequence of SEQ ID NO: 1. In some embodiments, the wild-type sortase A comprises a protein sequence of SEQ ID NO: 3. In some embodiments, the wild- type sortase A comprises a protein of SEQ ID NO: 5.
In some embodiments, the mutant comprises a deletion of amino acids 2-25. In some embodiments, comprising a deletion of amino acids 2-59.
In some embodiments, the mutant comprises at least two amino acid substitutions selected from the group consisting of i) -vi) . In some embodiments, the mutant comprises at least three amino acid substitutions selected from the group consisting of i) -vi) . In some embodiments, the mutant comprises at least four amino acid substitu ions selected from the group consisting of i) -vi) . In some embodiments, the mutant comprises at least five amino acid substitutions selected from the group consisting of i) -vi) .
In some embodiments, the mutant comprises at least 60% sequence identity to amino acid residues 60-206 of the wild-type sortase A. In some embodiments, the mutant comprises at least 80% sequence identity to amino acid residues 60-206 of the wild-type sortase A. In some embodiments, the mutant comprises at least 90% sequence identity to amino acid residues 60-206 of the wild-type sortase A.
In some embodiments, the mutant comprises one or more C- terminal or N-terminal tags. In some embodiments, the one or more C-terminal or N-terminal tags comprises a His6 tag.
In some embodiments, the mutant exhibits sortase A catalytic activity in the absence of calcium.
In some embodiments, the mutant exhibits sortase A catalytic activity in the absence of exogenous calcium. In some embodiments, the mutant exhibits sortase A catalytic activity in the presence of calcium-binding proteins. In some embodiments, the mutant exhibits sortase A catalytic activity in the presence of calcium concentrations up to 10 m .
In some aspects, the disclosure provides a
polynucleotide encoding a mutant srtA described herein.
In some aspects, the disclosure provides a polynucleotide encoding a catalytically active variant or fragment of a mutant srtA described herein.
In some aspects , the disclosure provides a nucleic acid construct comprising the polynucleotides described herein .
In some aspects, the disclosure provides a host cell transformed with the nucleic acid constructs described herein .
In some aspects, the disclosure provides a method of preparing a mutant sortase A comprising: (a) culturing the host cell of claim 22 in a suitable culture medium under suitable conditions to produce the mutant sortase A; and optionally (b) purifying the mutant sortase A to provide a mutant sortase A.
In some aspects, the disclosure provides an enzyme composition comprising at least one sortase A mutant described herein.
In some aspects, the disclosure provides a method comprising performing a sortase-mediated transpeptidation reaction catalyzed by the enzyme compositions described herein .
In some aspects, the disclosure relates to the use of an enzyme composition described herein for the sortagging of a target protein.
In some aspects, the disclosure provides a method for sortagging a target protein, comprising: (a)
providing a target protein comprising a sortase
recognition motif; (b) providing a moiety conjugated to a terminal oligoglycine sequence or a terminal alkylamine ; and (c) contacting the target protein with the moiety in the presence of the enzyme composition of claim 24 under conditions suitable for the sortase A mutant to
transamidate the target protein and the moiety, thereby sortagging the target protein.
In some embodiments, the terminal oligoglycine sequence comprises 1-10 N-terminal glycine residues.
In some embodiments, the moiety comprises an amino acid, a peptide, a protein, a polynucleotide, a
carbohydrate, a tag, a metal atom, a chelating agent, a contrast agent, a catalyst, a polymer, a recognition element, a small molecule, a lipid, a label, an epitope, a small molecule, a therapeutic agent, a cross- linker, a toxin, a radioisotope, an antigen, or a click chemistry handle .
In some aspects, the disclosure provides a kit for sortagging a target protein comprising the enzyme composition of claim 24.
In some aspects, the disclosure provides a sortase A mutant comprising an amino acid sequence at least 80% identical to SEQ ID NO : 9, and wherein the mutant
comprises a) a K residue at position 47 of SEQ ID NO: 9; b) a Q or A residue at position 50 of SEQ ID NO: 9; and c) at least one amino acid residue of SEQ ID NO: 9 selected from the group consisting of i) a R residue at position 36 of SEQ ID NO : 9 ; ii) a N residue at position 102 of SEQ ID NO: 9; iii) a A residue at position 107 of SEQ ID NO: 9; iv) a E residue at position 132 of SEQ ID NO: 9; and v) a T residue at position 138 of SEQ ID NO: 9.
In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 16. In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 17. In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 18. In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 19. In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 20. In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 21.
In some embodiments, the mutant comprises at least two amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) . In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 15.
In some embodiments, the mutant comprises at least three amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) . In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 12. In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 13. In some embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 14.
In some embodiments, the mutant comprises at least four amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) . In some embodiments , the mutant comprises an amino acid sequence of SEQ ID NO: 11.
In some embodiments, the mutant comprises an amino acid sequence at least 90% identical to SEQ ID NO: 9. In some embodiments, the mutant comprises an amino acid sequence at least 95% identical to SEQ ID NO: 9. In some embodiments, the mutant comprises an amino acid sequence at least 96% identical to SEQ ID NO: 9. In some
embodiments, the mutant comprises an amino acid sequence at least 97% identical to SEQ ID NO: 9. In some
embodiments, the mutant comprises an amino acid sequence at least 98% identical to SEQ ID NO: 9. In some
embodiments, the mutant comprises an amino acid sequence at least 99% identical to SEQ ID NO: 9. In some
embodiments, the mutant comprises an amino acid sequence of SEQ ID NO: 9.
In some aspects, the disclosure comprises an enzyme composition comprising at least one mutant sortase A described herein. In some embodiments, the at least one mutant sortase A is selected from the group consisting of SEQ ID NOs : 9-21. In some embodiments, the at least one mutant sortase A is selected from the group consisting of catalytically active variants of SEQ ID NOs : 9-21. In some embodiments, the at least one mutant sortase A is selected from the group consisting of catalytically active fragments of SEQ ID NOs : 9-21.
In some aspects, the disclosure provides a method comprising performing a sortase-mediated transpeptidation reaction catalyzed an enzyme composition described herein .
In some aspects, the disclosure relates to the use of an enzyme composition described herein for the sortagging of a target protein.
In some aspects, the disclosure relates to a method for sortagging a target protein, comprising: (a)
providing a target protein comprising a sortase
recognition motif; (b) providing a moiety conjugated to at least one of an oligoglycine sequence or a terminal alkylamine; and (c) contacting the target protein with the moiety in the presence of an enzyme composition described herein under conditions suitable for the sortase A mutant to transamidate the target protein and the moiety, thereby sortagging the target protein.
In some embodiments, the oligoglycine sequence comprises 1-10 N-terminal glycine residues.
In some embodiments, the moiety comprises an amino acid, a peptide, a protein, a polynucleotide, a
carbohydrate, a tag, a metal atom, a chelating agent, a contrast agent, a catalyst, a polymer, a recognition element, a small molecule, a lipid, a label, an epitope, a small molecule, a therapeutic agent, a cross- linker, a toxin, a radioisotope, an antigen, or a click chemistry handle .
In some aspects, the disclosure provides a kit for sortagging a target protein comprising an enzyme composition described herein.
In some aspects, the disclosure provides a
polynucleotide encoding a sortase A mutant comprising a nucleotide sequence at least 80% identical to SEQ ID NO. 10, wherein the nucleotide sequence encodes a) a K residue at position 47 of SEQ ID NO: 9; b) a Q or A residue at position 50 of SEQ ID NO: 9; and c) at least one amino acid residue of SEQ ID NO: 9 selected from the group consisting of i) a R residue at position 36; ii) a N residue at position 102; iii) a A residue at position 107; iv) a E residue at position 132; and v) a T residue at position 138.
In some embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) . In some embodiments, the polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9 selected from the group
consisting of i) -v) . In some embodiments, the
polynucleotide encodes at least four amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) - v) .
In some embodiments, the polynucleotide comprises a nucleotide sequence at least 85% identical to SEQ ID NO: 10. In some embodiments, the polynucleotide comprises a nucleotide sequence at least 90% identical to SEQ ID NO: 10. In some embodiments, the polynucleotide comprises a nucleotide sequence at least 95% identical to SEQ ID NO: 10. In some embodiments, the polynucleotide comprises a nucleotide sequence at least 96% identical to SEQ ID NO: 10. In some embodiments, the polynucleotide comprises a nucleotide sequence at least 97% identical to SEQ ID NO: 10. In some embodiments, the polynucleotide comprises a nucleotide sequence at least 98% identical to SEQ ID NO: 10. In some embodiments, the polynucleotide comprises a nucleotide sequence at least 99% identical to SEQ ID NO: 10. In some embodiments, the polynucleotide comprises a nucleotide sequence of SEQ ID NO : 10.
In some aspects, the disclosure provides a nucleic acid construct comprising any of the polynucleotides described herein. In some embodiments, the nucleic acid construct comprises a nucleotide sequence that encodes one or more C-terminal or N-terminal tags. In some embodiments , the nucleic acid construct comprises one or more C-terminal or N-terminal tags comprises a His6 tag .
In some aspects, the disclosure provides a host cell transformed with a nucleic acid construct described herein .
In some aspects, the disclosure provides a method of preparing a mutant sortase A comprising: (a) culturing the host cell of claim 75 in a suitable culture medium under suitable conditions to produce the mutant sortase A; and optionally (b) purifying the mutant sortase A to provide a mutant sortase A.
BRIEF DESCRIPTION OF THE DRAWINGS
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawings will be provided by the Office upon request and payment of the necessary fee.
FIG. 1 is a schematic representation of a sortase- catalyzed transacylation reaction.
FIG . 2 is a schematic representation of the site- specific C-terminal labeling scheme (left) and N-terminal labeling scheme (right) using sortase A. Such labeling begins with a substrate- recognition step (top) , and then proceeds with generation of a thioacyl intermediate (middle) , followed by ligation of an exogenously
introduced nucleophile to form a new peptide bond
(bottom) (Popp and Ploegh, Angew. Chem. Int. Ed. 2011, 50, 5024-5032) .
FIGS . 3A-3H are schematic representations of various polypeptide conjugations using sortase-mediated ligation .
FIG . 3A is a schematic representation showing the sortagging of a molecular probe carrying an oligoglycine tag to a target pro ein having a C- erminal LPXTG- tag .
FIG. 3B is a schematic representation showing sortase-mediated ligation of a polypeptide to a nucleic acid .
FIG. 3C is a schematic representation of a sortase- mediated ligation being used to produce a neoglyconjugate by fusing a polypeptide comprising the LPXTG sortase recognition motif to amino-methylene groups in 6- aminohexoses .
FIG. 3D is a schematic representation of the immobilization of an LPXTG- tagged protein onto an oligoglycine-coated solid surface, for example,
polystyrene beads or a biosensor chip .
FIG. 3E is a schematic representation of a cell surface protein genetically engineered to include an extracellular C-terminal region expressing a C-terminal LPXTG motif, which has been labeled with a triglycine- tagged probe by sortagging.
FIG. 3F is a schematic illustration of the
dimerization/oligomerization of a protein by sortagging of a bifunctional protein possessing an N-terminal oligoglycine tag and a C-terminal LPXTG tag.
FIG . 3G is a schematic representation of the circularization of a bifunctional protein containing an N-terminal oligoglycine tag and a C-terminal LPXTG tag utilizing sortase-mediated ligation.
FIG . 3H is a schematic representation of the site- specific attachment of a lipid utilizing sortase-mediated transpeptidation .
FIGS . 4A-4U show exemplary sequences contemplated by the present disclosure . Mutations relative to wild- type sortase A are shown in red font . Portions of sequences that are absent in an exemplary wild-type S. aureus
sortase A sequence and/or contain a 6XHis tag are highlighted in yellow .
FIG. 4A is an exemplary wild type S. aureus Sortase A (SaSrtA) amino acid sequence.
FIG. 4B is an exemplary nucleotide sequence encoding the amino acid sequence shown in FIG. 4A.
FIG. 4C is an exemplary wild type S. aureus Sortase A (SaSrtA) amino acid sequence.
FIG. 4D is an exemplary nucleotide sequence encoding the amino acid sequence shown in FIG. 4C.
FIG. 4E is an exemplary wild type sortase delta 25 amino acid sequence.
FIG. 4F is an exemplary nucleotide sequence encoding the amino acid sequence shown in FIG. 4E.
FIG. 4G is an exemplary mutant srtA
P94R/D160ND165A/K190E/K196T amino acid sequence.
FIG. 4H is an exemplary nucleotide sequence encoding the amino acid sequence shown in FIG. 4G.
FIG. 41 is an exemplary calcium- independent mutant srtA P94R/D160N/D165A/K190E/K196T amino acid sequence.
FIG. 4J is an exemplary nucleotide sequence encoding the amino acid sequence shown in FIG . 41.
FIG . 4K is an exem lary calcium- independent srtA P94S/D160N/D165A/K196T amino acid sequence.
FIG. 4L is an exemplary calcium-independent srtA
D160N/K190E/K196T amino acid sequence.
FIG. 4M is an exemplary calcium- independent srtA P94S/D160N/K196T amino acid sequence.
FIG. 4N is an exemplary calcium- independent srtA P94S/D160N/D165A amino acid sequence.
FIG . 40 is an. exemplary calcium-independent srtA P94S/D165A amino acid sequence .
FIG. 4P is an exemplary calcium-independent srtA P94S amino acid sequence.
FIG. 4Q is an exemplary calcium-independent srtA P94R amino acid sequence.
FIG. 4R is an exemplary calcium- independent srtA
D160N amino acid sequence.
FIG. 4S is an exemplary calcium- independent srtA D165A amino acid sequence.
FIG. 4T is an exemplary calcium- independent srtA K190E amino acid sequence.
FIG. 4U is an exemplary calcium- independent srtA K196T amino acid sequence.
FIGS. 5A and 5B show exemplary alignments of various srtA sequences contemplated by the present disclosure.
FIG. 5A shows an alignment of SEQ ID NO: 1 and SEQ
ID NO: 9, indicating that SEQ ID NO: 9 is at least 65% identical to SEQ ID NO : 1.
FIG. 5B shows an alignment of SEQ ID NO: 7 and SEQ ID NO: 9, indicating that SEQ ID NO: 9 is at least 98% identical to SEQ ID NO: 7.
FIG. 6 shows a comparison of activities of various sortases at 0 °C, in the presence and absence of calcium. 30uM of a substrate (VHH84D4) containing an LPETGG- 6xHis C-terminal tag was incubated in 50mM Tris, pH 7.5, 150mM NaCl , 500uM GGG-TAMRA and lOmM CaC12 or lOmM EGTA with either 5uM (1) WT SrtA delta 25, (2) 5uM pentamutant SrtA or (3) heptamutant SrtA for up to 6 hours at 0°C. The reaction was then visualized by running a sample on a 12% Tris-glycine SDS PAGE gel. Incorporation of GGG-TAMRA label at the C-terminus of the substrate was monitored by a shift to a lower molecular weight upon loss of the 6x His tag (Coomassie stained protein gel , upper panel ) and
presence of a TAMRA-labeled product (fluorescent scan of protein gel, lower panel) .
FIGS. 7A - 7C are chromatograms demonstrating that the calcium-independent srtA mutant of SEQ ID NO: 9 exists predominantly in monomeric form (FIG. 7A) compared to the calcium-dependent srtA mutant of SEQ ID NO: 7 which exists in both dimeric and monomeric form (FIG. 7B) , and the calcium-dependent wild-type srtA which also exists in both dimeric and monomeric form (FIG. 7C) .
FIG. 8 is a photograph of a gel demonstrating the high purity of monomeric sortase.
FIG. 9 is a chromatogram demonstrating the stability of monomeric sortase. DETAILED DESCRIPTION OF THE INVENTION
Aspects of the disclosure relate to sortase mutants, and in particular to sortase A mutants that exhibit an indifference to calcium. Work described herein
demonstrates that the calcium-independent sortase A mutants described herein surprisingly and unexpectedly exhibit sortase A catalytic activity (e.g., increased catalytic activity compared to wild type sortase A, ) , and do so in a manner that is independent of the presence or concentration of calcium.
Sortases, sortase-mediated transacylation reactions, and their use in transacylation (sometimes also referred to as transpeptidation) for protein engineering are well known to those of skill in the art (see, e.g., Ploegh et al., International Patent Application PCT/US2010/000274 , and Ploegh et al . , International Patent Application
PCT/US2011/033303, the entire contents of each of which are incorporated herein by reference) . FIG. 1 shows an exemplary transpeptidation reaction catalyzed by sortase,
which results in the ligation of species containing a sortase recognition motif (e.g., an LPXT motif or an LPXTG motif) with those bearing an N-terminal
oligoglycine moiety. As is shown in FIG. 2, sortases can be used for both C- terminal (left) and N-terminal (right) site-specific labeling.
Those skilled in the art will appreciate that the sortase transacylation reaction allows for the facile installation of all kinds of substituents at the C- terminus or N-terminus of a suitably modified protein. For example, the sortase reaction can be employed for ligating polypeptides to one another, ligating synthetic peptides to recombinant proteins, linking a reporting molecule to a polypeptide, linking a polypeptide to a label or probe (FIG. 3A) , joining a nucleic acid to a polypeptide (FIG. 3B) , ligating a glycan to a polypeptide (FIG. 3C) , conjugating a polypeptide to a solid support or polymer (FIG. 3D) , site- specific modification of the extracellular C-terminal region of cell surface proteins expressed in living cells (FIG. 3E) , site- specific modification of the extracellular N-terminal region of cell surface proteins expressed in living cells ,
dimerization or oligomerization of polypeptides (FIG . 3F) , circularization of polypeptides (FIG . 3G) , site- specific attachment of lipids to polypeptides (FIG. 3H) , and protein purification (FIG . 31) .
As is demonstrated in FIGS . 3A-3I , sortase A derives its ability to site- specifically modify target proteins by recognition of sortase recognition motifs, such as the motif LPXTG. Other suitable sortase recognition motifs are apparent to the skilled artisan. It will be
appreciated that the terms "recognition motif" and
"recognition sequence" , with respect to sequences
recognized by sortase, are used interchangeably. In some embodiments, a recognition sequence further comprises one or more additional amino acids, e.g., at the N or C terminus. Such additional amino acids may provide context that improves the recognition of the recognition motif. Those skilled in the art will appreciate that the term "sortase recognition sequence" may refer to a masked or unmasked sortase recognition sequence.
In some embodiments, a calcium-independent mutant sortase has a different substrate specificity with regard to the sortase recognition motif as compared to a calcium-dependent wild type sortase. Sortases with different substrate specificities with regard to the sortase recognition motif recognize different sortase recognition sequences. For example, Sortases 1 and 2 are said to have different substrate specificities if SRM1 and SRM2 are different sortase recognition motifs and Sortase 1 recognizes SRM1 and Sortase 2 is active in recognizing SRM2 but has little or no activity
recognizing SRM1. In some embodiments the substrate specificities overlap in that one of the two sortases recognizes both SRM1 and SR 2 while the other sortase recognizes only one of the SRMs . In some embodiments the substrate specificities do not overlap, e.g., Sortase 1 recognizes SRM1 but does not recognize SRM2 , and Sortase 2 recognizes SRM2 but does not recognize SRM1. In some embodiments, two calcium- independent sortases may alternately or additionally utilize different
nucleophiles . In some embodiments the nucleophile specificity overlaps while in some embodiments the nucleophile specificity does not overlap.
A sortase with an altered substrate specificity with regard to the sortase recognition motif may be generated
by engineering one or more mutations in the sortase, e.g., in a region of the protein that is involved in recognition and/or binding of the sortase recognition motif, e.g., the putative substrate recognition loop (e.g., the loop connecting strands β6 and β7 ( β 6/β7 loop) in SrtA (Val161-Asp17S) . A crystal structure of S. aureus SrtA and a substrate, illustrating the loops, is
described in Zong, Y., et al . , J Biol Chem. 2004 Jul 23,-279(30) :31383-9.) . In some embodiments, a phage- display, yeast display, or other screen of a mutant sortase library randomized in the substrate recognition loop may be performed, and variants with altered
substrate specificity may be identified. A sortase with an altered nucleophile specificity may be generated by engineering one or more mutations in the sortase, e.g., in a region of the protein that is involved in
recognition and/or binding of the nucleophile, e.g., (e.g., the loop connecting strands β7 and β8 in SrtA) . See In some embodiments, a phage-display screen of a mutant sortase library randomized in the substrate recognition loop may be performed, and variants with altered substrate specificity may be identified.
In some embodiments, a calcium- independent sortase described herein is modified to alter its substrate specificity with regard to the sortase recognition motif and/or its nucleophile specificity by introducing one or more mutations into the sortase. In some embodiments, a calcium-dependent sortase is modified to alter its substrate specificity and/or nucleophile specificity and is rendered calcium- independent by introducing the mutations described herein. It will be appreciated that mutations conferring calcium- independence and altered substrate and/or nucleophile specificity may be
engineered individually or sequentially in groups of one or more, in any order or combination that results in a desired sequence. In some embodiments two calcium- independent sortases with different substrate
specificities and/or different nucleophile specificities are derived from the same wild type sortase, e.g., S. aureus SrtA. In some embodiments the substrate
specificities do not overlap. Sortases with different substrate specificities may be used, for example, to introduce two different moieties to a target protein.
For example, a target protein may comprise or be modified to comprise first and second SRMs. Two sortases, each of which specifically recognizes only one of the two SRMs, may be used to conjugate two agents to the protein by reaction with the two SRMs, e.g., at the N- and C- termini . The conjugations may take place in a single reaction vessel or may be performed sequentially. The two agents may be any of the agents described herein, and may be the same or different. For example, a first agent may comprise a toxin and a second agent may comprise a detectable label , e.g., for imaging.
In some embodiments , the disclosure provides a kit comprising two calcium- independent sortases with
different substrate specificities with regard to the sortase recognition motif and/or with regard to
nucleophile specificity . In some embodiments two calcium- independent sortases with different substrate specificities and/or different nucleophile specificities are derived from the same wild type sortase , e.g., S . aureus SrtA. In some embodiments the substrate
specificities and/or nucleophile specificities do not overla .
Amino acid sequences of sortase A polypeptides and the nucleotide sequences that encode them are known to those of skill in the art and are disclosed in a number of references cited herein, the entire contents of all of which are incorporated herein by reference. A
"polypeptide" , "peptide" or "protein" refers to a molecule comprising at least two covalently attached amino acids. A polypeptide can be made up of naturally occurring amino acids and peptide bonds and/or synthetic peptidomimetic residues and/or bonds. Polypeptides described herein include naturally purified products, products of chemical synthetic procedures, and products produced by recombinant techniques from a prokaryotic or eukaryotic host, including, for example, bacterial (e.g., E. coli) , yeast, higher plant, insect and mammalian cells. The disclosure also features biologically active fragments of calcium- independent srtA mutants.
The term "amino acid" refers to naturally occurring and synthetic amino acids, as well as amino acid analogs and amino acid mimetics that function in a manner similar to the naturally occurring amino acids. Naturally occurring amino acids are those encoded by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline . Amino acid analogs are compounds that have the same basic chemical structure as a naturally occurring amino acid, i.e., a carbon bound to hydrogen, a carboxyl group, an amino group, and an R group, e.g., norleucine. Such analogs have modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical structure as a naturally occurring amino acid . Amino acid mimetics refers to chemical compounds that have a structure that is different from the general chemical structure of an
amino acid, but that function in a manner similar to a naturally occurring amino acid.
Sortase A is a polypeptide having a length of 206 amino acids which typically comprises a hydrophobic N- terminal domain (e.g., residues 1 to about 25) which functions as both a signal peptide and a membrane anchoring domain, a central linker domain (e.g., from about residue 26 to about residue 59) , and a C-terminal catalytic domain (e.g., from about residue 60 to about residue 206) .
An exemplary sortase A is the wild-type S . aureus SrtA. An exemplary amino acid sequence of wild-type S . aureus SrtA is shown in FIG . 4A (SEQ ID NO: 1 ; Gene ID : 1125243 ; NCBI RefSeq Acc . No. NP_375640) FIG. 4B shows an exemplary nucleotide sequence (SEQ ID NO: 2; NCBI
Reference Sequence: NC_002745.2) encoding the wild type SaSrtA protein. Another exemplary sequence of wild- type S. aureus SrtA is shown in FIG. 4C (SEQ ID NO: 3; GenBank Accession number AAD48437 ; NCBI Reference Sequence:
NC_002951.2) . It should be appreciated that wild-type S. aureus SrtA sequences, calcium- independent srtA mutants, and catalytically active fragments or variants thereof disclosed herein may comprise either a K or N at position 57. It should be appreciated that wild-type S. aureus SrtA sequences, calcium- independent srtA mutants, and catalytically active fragments or variants thereof disclosed herein may comprise either an E or G at position 167.
Aspects of the present disclosure relate to calcium- independent sortase A mutants. Such mutants may be produced through processes such as directed evolution, site-specific modification, etc . It should be
appreciated that the calcium- independent srtA mutants
disclosed herein can be used in any application in which sortagging is desirable, including for example, the sortase -mediated ligation reactions described in FIGS. 3A-3I .
In some embodiments, the calcium- independent srtA mutants comprise at least one amino acid substitution relative to a wild-type sortase A polypeptide, and catalytically active fragments, catalytically active derivatives, or catalytically active variants thereof, and polynucleotides encoding the same.
As used herein "calcium- independent" refers to the ability of a sortase A enzyme to exhibit catalytic activity in a manner that is independent of the presence of calcium, or independent of the amount of calcium present within a concentration range that is not
detrimental to proper functioning of the enzyme. The calcium- independent srtA mutants disclosed herein can be assayed for their ability to exhibit sortase A catalytic activity in a calcium- independent manner by contacting a target protein comprising a C-terminal sortase
recognition motif with a tagged N-terminal oligoglycine derivative in the presence of the calcium- independent srtA mutant, and in the absence of calcium, and
determining whether the target protein is ligated to the tagged N-terminal oligoglycine derivative by the calcium- independent srtA mutant in the absence of calcium. In some embodiments, the calcium- independent srtA mutants can be assayed for their ability to exhibit sortase A catalytic activity in the presence of calcium
concentrations which are lower than calcium
concentrations which are required for calcium-dependent sortase A to exhibit catalytic activity. As used herein, "calcium-dependent" in connection with a sortase means
that the catalytic activity of the sortase relies or depends on the presence and concentration of calcium, such that in the absence of calcium or the absence of a sufficient amount of calcium, the calcium-dependent sortase will not exhibit sortase A catalytic activity or has greatly reduced catalytic activity as compared with its activity when calcium is present in sufficient amounts (e.g., 5 mM - 10 mM) .
The present disclosure contemplates any srtA variants which exhibit sortase A catalytic activity comparable to the wild-type srtA enzyme, as long as they possess calcium- independent sortase A catalytic activity. As used herein in connection with "fragments" ,
"derivatives" and "variants", "catalytically active" refers to sortase A catalytic activity. As used herein, "sortase A catalytic activity" refers to the ability of a sortase to catalyze the cleavage of a polypeptide within a sortase A consensus recognition sequence and ligate the free primary amino group (NH2-CH2-) of a oligoglycine sequence to the free C-terminal carboxyl group of the cleaved polypeptide. The term "oligoglycine" refers to a (Gly)n sequence, wherein n is between 1 and about 10, or more preferably between 2 and about 5, and even more preferably 2 or 3 , glycine residues. An "N-terminal" oligoglycine sequence is located at the N-terminus of a polypeptide , such that the polypeptide comprises a free primary amino (NH2„CH2_) group at its N- erminus . An "N- terminal" oligoglycine sequence can also include an internal oligoglycine sequence that is capable of forming a oligoglycine sequence under applicable conditions , e.g., by cleavage of an N-terminal peptide sequence by an endogenous host cell enzyme, or by specific proteolytic cleavage in vitro .
Sortase A catalytic activity of the calcium- independent srtA mutants disclosed herein can be assayed using methods known in the art. The crystal structure of SrtA complexed with a substrate has been determined allowing catalytic active domains of sortase A proteins from various Gram-positive bacterium to be easily discerned by those of skill in the art, see for example, Y. Zong et al . J. Biol Chem. 2004, 279, 31383-31389, which is incorporated herein by reference.
A "variant" of a particular polypeptide or
polynucleotide has one or more alterations (e.g., additions, substitutions, and/or deletions) with respect to a reference polypeptide or polynucleotide, which may be referred to as the "original polypeptide" or "original polynucleotide", respectively. An addition may be an insertion or may be at either terminus. A variant may be shorter or longer than the reference polypeptide or polynucleotide. The term "variant" encompasses
"fragments" . A "fragment" is a continuous portion of a polypeptide or polynucleotide that is shorter than the reference polypeptide or polynucleotide. In some embodiments a variant comprises or consists of a
fragment.. In some embodiments a fragment or variant is at least 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 92.5%, 95%, 96%, 97%, 98%, 99%, or more as long as the reference polypeptide or polynucleotide. In some embodiments a fragment may lack an N-terminal and/or C-terminal portion of a reference polypeptide. For example, a fragment may lack up to 5%, 10%, 15%, 20%, or 25% of the length of the polypeptide from either or both ends. A fragment may be an N-terminal, C-terminal, or internal fragment. In some embodiments a variant polypeptide comprises or consists of at least one domain of a reference polypeptide. In
some embodiments a variant polynucleotide hybridizes to a reference polynucleotide under art-recognized stringent conditions, e.g., high stringency conditions, for sequences of the length of the reference polypeptide. In some embodiments a variant polypeptide or polynucleotide comprises or consists of a polypeptide or polynucleotide that is at least 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or more identical in sequence to the reference polypeptide or polynucleotide over at least 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% of the reference polypeptide or polynucleotide. In some embodiments a variant polypeptide comprises or consists of a polypeptide that is at least 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or more identical in sequence to the reference polypeptide over at least 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% of the reference polypeptide, with the proviso that, for purposes of computing percent identity, a conservative amino acid substitution is considered identical to the amino acid it replaces . In some embodiments a variant polypeptide comprises or consists of a polypeptide that is at least 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or more identical to the reference polypeptide over at least 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% of the reference polypeptide, with the proviso that any one or more amino acid substitutions (up to the total number of such substitutions) may be restricted to conservative substitutions. In some embodiments a percent identity is measured over at least 100; 200; 300; 400; 500; 600; 700; 800; 900; 1,000; 1,200; 1,500; 2,000; 2,500; 3,000;
3,500; 4,000; 4,500; or 5,000 amino acids. In some embodiments the sequence of a variant polypeptide
comprises or consists of a sequence that has N amino acid differences with respect to a reference sequence, wherein N is any integer between 1 and 10 or between 1 and 20 or any integer up to 1%, 2%, 5%, or 10% of the number of amino acids in the reference polypeptide, where an "amino acid difference" refers to a substitution, insertion, or deletion of an amino acid. In some embodiments a difference is a conservative substitution. Conservative substitutions may be made, e.g., on the basis of
similarity in side chain size, polarity, charge,
solubility, hydrophobic!ty, hydrophilicity and/or the amphipathic nature of the residues involved. Suitable conservative substitutions are apparent to the skilled artisa .
In some embodiments a variant is a functional variant, i.e., the variant at least in part retains at least one activity (e.g., calcium- independent sortase A catalytic activity) of the reference polypeptide or polynucleotide. In some embodiments a variant at least in part retains more than one or substantially all known activities of the reference polypeptide or
polynucleotide. An activity may be, e.g., a catalytic activity, binding activity, ability to perform or participate in a biological function or process, etc. In some embodiments an activity is one that has (or the lack of which has) a detectable effect on an observable phenotype of a cell or organism. In some embodiments an activity of a variant may be at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, or more, of the activity of the reference polypeptide or polynucleotide , up to approximately 100%, approximately 125%, or approximately 150% of the activity of the reference polypeptide or polynucleotide, in various embodiments . In some
embodiments a variant, e.g., a functional variant, comprises or consists of a polypeptide at least 80%, 90%, 92.5%, 95%, 96%, 97%, 98%, 99%. 99.5% or 100% identical to an reference polypeptide or polynucleotide over at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or 100% of the full length of the reference polypeptide or polynucleotide or over at least 70%, 75%, 80%, 85%, 90%, 92.5%, 95%, 96%, 97%, 98%, or 99% or 100% of a functional fragment of the reference polypeptide or polynucleotide. In some embodiments an alteration, e.g., a substitution or deletion, e.g., in a functional variant, does not alter or delete an amino acid or nucleotide that is known or predicted to be important for an activity, e.g., a known or predicted catalytic residue or residue involved in binding a substrate or cofactor. In some embodiments nucleotide (s) , amino acid(s), or region (s) exhibiting lower degrees of conservation across species as compared with other amino acids or regions may be selected for alteration. Variants may be tested in one or more suitable assays to assess activity. In certain embodiments a polypeptide or polynucleotide sequence in the NCBI RefSeq database may be used as a reference sequence. In some embodiments a variant or fragment of a naturally occurring polypeptide or
polynucleotide is a naturally occurring variant or fragment. In some embodiments a variant or fragment of a naturally occurring polypeptide or polynucleotide is not naturally occurring.
In some embodiments, the calcium-independent srtA mutant is selected according to the degree of sequence homology with a wild-type sortase A enzyme. In some embodiments , the calcium- independent srtA mutant is selected according to the degree of sequence homology
with S. aureus sortase A. In some embodiments, the calcium- independent srtA mutant is selected according to the degree of sequence homology with SEQ ID NO: 1 (FIG. 4A) . In some embodiments, the calcium- independent srtA mutant is selected according to the degree of sequence homology with SEQ ID NO: 3 (FIG. 4C) . In some
embodiments, the calcium- independent srtA mutant is selected according to the degree of sequence homology with SEQ ID NO: 5 (FIG. 4E) . In some embodiments, the calcium- independent srtA mutant is selected according to the degree of sequence homology with SEQ ID NO: 7 (FIG. 4G) . Calcium- independent srtA mutants having a desired degree of homology to a wild-type sortase A enzyme can be identified by, e.g., using the wild-type sortase A nucleotide sequences as query sequences in a search against public databases to identify related sequences. For example, in some embodiments the calcium- independent srtA mutant comprises an amino acid sequence homologous to amino acids 60-206 of SEQ ID NO: 1, e.g. an amino acid sequence that is at least 55%, at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% or higher, homologous thereto. In some embodiments the calcium- independent srtA mutant comprises an amino acid sequence homologous to amino acids 60-206 of SEQ ID NO: 3, e.g. an amino acid sequence that is at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% or higher, homologous thereto. In some embodiments, a nucleotide sequence encoding a calcium- independent srtA mutant has at least 25%, or preferably at least 30%, or more preferably at least 35% or more identity with the nucleic acid sequence of SEQ ID NO : 2. In some embodimen s , a nucleotide
sequence encoding a calcium- independent srtA mutant has at least 25%, or preferably at least 30%, or more preferably at least 35% or more identity with the nucleic acid sequence of SEQ ID NO: 4.
In further embodiments, the calcium- independent srtA mutant has at least 35%, or preferably at least 40%, or more preferably at least 45% similarity with the amino acid sequence of SEQ ID NO: 1. In further embodiments, the calcium- independent srtA mutant has at least 35%, or preferably at least 40%, or more preferably at least 45% similarity with the amino acid sequence of SEQ ID NO: 3.
In some embodiments, the calcium-independent srtA mutant is a variant of SrtA of another Gram-positive bacterium having one or more as substitutions, deletions, insertions , and/or other modifications relative to the native nucleotide and/or amino acid sequence of S . aureus srtA. In some embodiments, the variant comprises one or more conservative amino acid substitutions relative to SrtA of another Gram-positive bacterium. In further embodiments , the variant comprises one or more amino acid substitutions relative to SrtA of another Gram-positive bacterium, wherein the one or more as amino acid
substitutions are predominantly, e.g., at least 50% , or preferably at least 60%, or more preferably at least 70% or more , conservative substitutions .
It will be appreciated that fragments of calcium- independent srtA mutants disclosed herein exhibiting calcium- independent sortase A catalytic activity are contemplated herein, and can be utilized in the methods described herein. As described in PCT/US2010/000274 , fragments can be identified by producing transaminase fragments by known recombinant techniques or proteolytic techniques , for example , and determining the rate of
protein or peptide ligation. The fragment sometimes consists of about 80% of the full-length transamidase amino acid sequence, and sometimes about 70%, about 60%, about 50%, about 40% or about 30% of the full-length transamidase amino acid sequence such as that of a wild- type S. aureus Sortase A. In some embodiments, the fragment lacks an N-terminal portion of the full-length sequence, e.g., the fragment lacks the N-terminal portion extending to the end of the membrane anchor sequence. In some embodiments the fragment comprises the C-terminus of a full-length transamidase amino acid sequence. In some embodiments, a catalytic core region from a sortase is utilized, e.g., a region from about position 60 to about position 206 of SrtA, e.g., S. aureus SrtA. In some embodiments, the fragment comprises a sortase A lacking N-terminal amino acids 2-25 (SEQ ID NO: 5) (FIG. 4E) . An exemplary nucleotide sequence (i.e., SEQ ID NO: 6) encoding such fragment is shown in FIG. 4F.
Calcium-independent srtA mutants can be derived from sortase A sequences from other organisms. In some embodiments the disclosure provides mutants of any Ca++- dependent SrtA from a species other than S. aureus, wherein the mutants comprise any of the mutations or combinations of mutations described herein at the corresponding positions in the Ca'" -dependent SrtA from a species other than S. aureus. In some embodiments, a calcium- independent srtA mutant is derived from sortase A sequences from other organisms comprising nucleotide sequences substantially identical or similar to the nucleotide sequences that encode Srt A. A similar or substantially identical nucleotide sequence may include modifications to the native sequence, such as
substitutions, deletions, or insertions of one or more
nucleotides. Included are nucleotide sequences that sometimes are about at least 20%, 30%, 40%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, or 85% or more identical to a native nucleotide sequence, and sometimes are about 90% or 95% or more identical to the native nucleotide sequence (each identity percentage can include a 1%, 2%, 3% or 4% variance) . One test for determining whether two nucleic acids are substantially identical is to determine the percentage of identical nucleotide sequences shared between the nucleic acids.
Calculations of sequence identity can be performed by a variety of techniques which are available to the skilled artisan. For example, computer programs such as BLAST2, BLASTN, BLASTP, Gapped BLAST, etc., may be used to generate alignments and/or to obtain a percent identity.
In some embodiments, a calcium- independent srtA mutant comprises at least three amino acid substitutions relative to a wild-type sortase A, wherein the amino acid substitutions comprise a) a K residue at position 105; b) a Q or A residue at position 108; and c) at least one amino acid substitution selected from the group
consisting of i) a R residue at position 94; ii) a S residue at position 94; iii) a N residue at position 160; iv) a A residue at position 165; v) a E residue at position 190; and vi) a T residue at position 196. As used herein, "mutant" is used to refer to a one amino acid sequence which has changed by at least one amino acid residue relative to another amino acid sequence. The term "mutant" with reference to "sortases" should not be considered to imply that any particular way of generating the mutant sequences is required or that any particular starting materials is required . The present disclosure
contemplates any suitable method of generating the calcium-independent srtA mutants described herein.
Examples of suitable methods include, but are not limited to introducing mutations into an appropriate wild-type coding sequence, synthesizing the sequences of the calcium-independent srtA mutants de novo, for example, utilizing solid phase peptide synthesis, and in vitro translation a synthetic mRNA, to name only a few. It is to be further understood that the disclosure contemplates calcium- independent mutants of any wild-type sortase A. Those skilled in the art will appreciate that the wild- type sequences of sortase A may vary, e.g., SrtA from various species may have gaps, insertions, and/or vary in length relative to the amino acid sequence of exemplary wild-type S. aureus SrtA. Accordingly, the disclosure is not intended to be limited in any way by the original amino acid residue at a particular position in any wild- type sortase A sequence used to generate a calcium- independent srtA mutant. Thus, any substitution which results in the specified amino acid residue at a position specified herein is contemplated by the disclosure. For the avoidance of doubt , the phrase "X residue at position Y" means that the X residue in the resulting mutant srtA replaces whatever amino acid was present in the original sortase A amino sequence at the position Y in the original sorta.se A amino acid sequence that corresponds to the same position in an exemplary wild-type S. aureus srtA amino acid sequence when accounting for any gaps and/or insertions in the original sortase A amino acid sequence relative to the exemplary wild-type S. aureus srtA amino acid sequence. The following examples are instructive and are not intended to be limiting in any way .
If the original wild-type sortase A used to generate a srtA mutant is the wild-type S. pyogenes srtA (NCBI Gene ID: 901269), "position 94" in the phrase "an amino acid substitution comprising a R residue at position 94" would mean position 115 of the wild-type S. pyogenes srtA as position 115 of the wild-type S . pyogenes srtA corresponds to position 94 in the exemplary wild-type S. aureus srtA sequence (NCBI Gene ID: 3238307) when taking into account gaps and/or insertions in the sequence alignment of S. pyogenes srtA and S. aureus srtA. In such example, the N residue at position 115 of the wild- type S. pyogenes srtA amino acid sequence would be replaced by a R residue in the resulting mutant srtA.
If the original wild-type sortase A used to generate a srtA mutant is the wild-type B. anthracis srtA (NCBI Gene ID: 1088124), "position 94" in the phrase "an amino acid substitution comprising a R residue at position 94" would mean position 123 of the wild-type B . anthracis srtA as position 123 of the wild-type B . anthracis srtA corresponds to position 94 in the exemplary wild-type S. aureus srtA sequence (NCBI Gene ID: 3238307) when taking into account gaps and/or insertions .in the sequence alignment of B . anthracis srtA and S . aureus srtA. In such example, the E residue at position 123 of the wild- type B . anthracis srtA amino acid sequence would be replaced by a R residue in the resulting mutant srtA.
If the original wild-type sortase A used to generate a srtA mutant is the wild-type E. faecalis srtA (NCBI Gene ID: 1201902), "position 94" in the phrase "an amino acid substitution comprising a R residue at position 94" would mean position 112 of the wild- type E. faecalis srtA as position 112 of the wild- type E. faecalis srtA corresponds to position 94 in the exemplary wild- type S .
aureus srtA sequence (Gene ID: 3238307) when taking into account gaps and/or insertions in the alignment of E. faecalis srtA and S . aureus srtA. In such example, the N residue at position 112 of the wild-type E. faecalis srtA amino acid sequence would be replaced by a R residue in the resulting mutant srtA.
Based on the teachings herein and examples above, those skilled in the art will understand how to align any original wild-type sortase A sequence to be used for generating a calcium- independent srtA mutant with an exemplary wild-type S. aureus sortase A sequence for purposes of determining the positions in the original wild-type sortase A sequence that correspond to the exemplary wild-type S . aureus sortase A sequence when taking into account gaps and/or insertions in the alignment of the two sequences.
In some embodiments, the substitution comprises a E105K substitution. It is to be understood that the phrase "E105K substitution", in reference to mutating a particular original sortase A sequence (e.g., a wild type sortase A sequence) refers to the substitution of a K residue at a position in the original sortase A amino acid sequence that corresponds to the E105 residue in the corresponding exemplary wild- type S. aureus srtA amino acid sequence . For example, if the original sortase A sequence to be mutated in accordance with the disclosure is a L. monocytogenes srtA sequence (NCBI Gene ID:
986837) , the phrase "E105K substitution" refers to the substitution of the R residue at position 112 of the L. monocytogenes srtA sequence wi h a K residue as position 112 of L. monocytogenes srtA corresponds to position 105 of the exem lary wild-type S. aureus srtA sequence (NCBI Gene ID : 3238307) when the two sequences are aligned
taking into account any gaps and/or insertions. In some embodiments, the substitution comprises a E108Q
substitution. In some embodiments, the substitution comprises an E108A substitution. In some embodiments, the substitution comprises a P94R substitution. In some embodiments, the substitution comprises a P94S
substitution. In some embodiments, the substitution comprises a D160N substitution. In some embodiments, the substitution comprises a D165A substitution. In some embodiments, the substitution comprises a K190E
substitution. In some embodiments, the substitution comprises a K196T substitution.
In some embodiments, the wild-type sortase A comprises S. aureus sortase A. In some embodiments, the wild-type sortase A comprises an amino acid sequence of
SEQ ID NO: 1. In some embodiments, the wild-type sortase A comprises an amino acid sequence of SEQ ID NO: 3. In some embodiments, the calcium- independent srtA mutant comprises a deletion of amino acids 2-25. In some embodiments, the calcium- independent srtA mutant
comprises a deletion of amino acids 2-59.
In some embodiments , the calcium- independent srtA mutant comprises at least two amino acid substitutions selected from the group consisting of i) -vi) .
In some embodiments, the calcium-independent srtA mutant comprises at least two amino acid substitutions comprising i) and iii) . In some embodiments, the calcium-independent srtA mutant comprises at least two amino acid substitutions comprising i) and iv) . In some embodiments , the calcium-independent srtA mutant
comprises at least two amino acid substitutions
comprising i) and v) . In some embodiments , the calcium-
independent srtA mutant comprises at least two amino acid substitutions comprising i) and vi) .
In some embodiments, the calcium-independent srtA mutant comprises at least two amino acid substitutions comprising ii) and iii) . In some embodiments, the calcium- independent srtA mutant comprises at least two amino acid substitutions comprising ii) and iv) . In some embodiments, the calcium- independent srtA mutant
comprises at least two amino acid substitutions
comprising ii) and v) . In some embodiments, the calcium- independent srtA mutant comprises at least two amino acid substitutions comprising ii) and vi) .
In some embodiments, the calcium- independent srtA mutant comprises at least two amino acid substitutions comprising iii) and iv) . In some embodiments, the calcium- independent srtA mutant comprises at least two amino acid substitutions comprising iii) and v) . In some embodiments, the calcium- independent srtA mutant
comprises at least two amino acid substitutions
comprising iii) and vi) .
In some embodiments , the calcium- independent srtA mutant comprises at least two amino acid substitutions comprising iv) and v) . In some embodiments, the calcium- independent srtA mutant comprises at least two amino acid substi utions comprising iv) and vi) .
In some embodiments , the calcium- independent srtA mutant comprises at least two amino acid substitutions comprising v) and vi) .
In some embodiments , the calcium- independent srtA mutant comprises at least three amino acid substitutions selected from the group consisting of i) -vi) .
In some embodiments , the calcium- independent srtA mutant comprises at least three amino acid substitutions
comprising i) , iii) and iv) . In some embodiments, the calcium-independent srtA mutant comprises at least three amino acid substitutions comprising i) , iii) and v) . In some embodiments, the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising i) , iii) and vi) .
In some embodiments, the calcium-independent srtA mutant comprises at least three amino acid substitutions comprising i) , iv) and v) . In some embodiments, the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising i) , iv) and vi) .
In some embodiments, the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising i) , v) and vi) .
In some embodiments, the calcium-independent srtA mutant comprises at least three amino acid substitutions comprising ii) , iii) and iv) . In some embodiments, the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising ii) , iii) and v) . In some embodiments, the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising ii) , iii) and vi) .
In some embodiments, the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising ii) , iv) and v) . In some embodiments, the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising ii) , iv) and vi) .
In some embodiments, the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising ii) , v) and vi) .
In some embodiments , the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising iii) , iv) and v) . In some embodiments , the
calcium- independent srtA mutant comprises at least three amino acid substitutions comprising iii), v) and vi) .
In some embodiments, the calcium- independent srtA mutant comprises at least three amino acid substitutions comprising iv) , v) and vi) .
In some embodiments, the calcium- independent srtA mutant comprises at least four amino acid substitutions selected from the group consisting of i) -vi) .
In some embodiments, the calcium- independent srtA mutant comprises at least four amino acid substitutions comprising i) , iii), iv) and v) . In some embodiments, the calcium- independent srtA mutant comprises at least four amino acid substitutions comprising i) , iii) , v) and vi) .
In some embodiments, the calcium- independent srtA mutant comprises at least four amino acid substitutions comprising ii) , iii) , iv) and v) . In some embodiments, the calcium- independent srtA mutant comprises at least four amino acid substitutions comprising ii) , iii) , v) and vi) .
In some embodiments, the calcium- independent srtA mutant comprises at least four amino acid substitutions comprising iii) , iv) , v) and vi) .
In some embodiments, the calcium- independent srtA mutant comprises at least five amino acid substitutions selected from the group consisting of i) -vi) .
In some embodiments, the calcium- independent srtA mutant comprises at least five amino acid substitutions comprising i) , iii) , iv) , v) and vi) .
In some embodiments, the calcium- independent srtA mutant comprises at least five amino acid substitutions comprising ii), iii), iv), v) and vi) .
In some embodiments, the calcium- independent srtA mutants comprises at least 60% identity to SEQ ID NO: 1, or at least 65% identity to SEQ ID NO: 1, or at least 70% identity to SEQ ID NO: 1, or at least 75% identity to SEQ ID NO: 1, or at least 80% identity to SEQ ID NO : 1, or at least 85% identity to SEQ ID NO: 1, or at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or up to 99% identity to SEQ ID NO: 1. In some embodiments, the calcium- independent srtA mutants comprises at least 60% identity to amino acid residues 60-206 of SEQ ID NO: 1, or at least 65% identity to amino acid residues 60-206 of SEQ ID NO: 1, or at least 70% identity to amino acid residues 60-206 of SEQ ID NO: 1, or at least 75% identity to amino acid residues 60-206 of SEQ ID NO: 1, or at least 80% identity to amino acid residues 60-206 of SEQ ID NO: 1, or at least 85% identity to amino acid residues 60-206 of SEQ ID NO: 1, or at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or up to 99% identity to amino acid residues 60-206 SEQ ID NO: 1.
In some embodiments, the calcium- independent srtA mutants comprises at least 60% identity to SEQ ID NO: 3, or at least 65% identity to SEQ ID NO: 3, or at least 70% identity to SEQ ID NO: 3, or at least 75% identity to SEQ ID NO: 3, or at least 80% identity to SEQ ID NO : 3, or at least 85% identity to SEQ ID NO: 3, or at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or up to 99% identity to SEQ ID NO: 3. In some embodiments, the calcium- independent srtA mutants comprises at least 60% identity to amino acid residues 60-206 of SEQ ID NO: 3, or at least 65% identity to amino acid residues 60-206 of SEQ ID NO: 3, or at least 70% identity to amino acid residues 60-206 of SEQ ID NO : 3, or at least 75% identity to amino acid residues 60-206 of SEQ ID NO: 3, or at least 80%
identity to amino acid residues 60-206 of SEQ ID NO: 3, or at least 85% identity to amino acid residues 60-206 of SEQ ID NO: 3, or at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or up to 99% identity to amino acid residues 60-206 SEQ ID NO: 3.
In some embodiments, the calcium-independent srtA mutants comprises at least 60% identity to SEQ ID NO : 5, or at least 65% identity to SEQ ID NO: 5, or at least 70% identity to SEQ ID NO: 5, or at least 75% identity to SEQ ID NO: 5, or at least 80% identity to SEQ ID NO: 5, or at least 85% identity to SEQ ID NO: 5, or at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or up to 99% identity to SEQ ID NO: 5.
It should be appreciated that any of the calcium- independent srtA mutants disclosed herein can include a tag. In some embodiments, a calcium-independent srtA mutant comprises a C-terminal tag. In some embodiments, a calcium- independent srtA mutant comprises a N-terminal tag. In some embodiments, the calcium- independent srtA mutant comprises a N-terminal His6 tag. In some
embodiments , the calcium- independent srtA mutant
comprises a C- terminal His6 tag .
In some embodiments, the calcium- independent srtA mutants exhibit sortase A catalytic activity in the absence of calcium. In some embodiments, the calcium- independent srtA mutants exhibit sortase A catalytic activity in the absence of exogenous calcium.
In some embodiments, the calcium- independent srtA mutants exhibit sortase A catalytic activity in the presence of calcium-binding proteins.
In some embodiments, the calcium- independent srtA mutants exhibit sortase A catalytic activity in the presence of calcium concentrations up to 1 mM, up to 2
mM, up to 3 mM, up to 4 mM, up to 5 mM, up to 6 mM, up to 7 mM, up to 8 mM, up to 9 mM, or up to 10 mM or more.
In some embodiments, the calcium-independent srtA mutants exhibit sortase A catalytic activity in the presence of calcium concentrations less than 1 mM. In some embodiments, the calcium- independent srtA mutants exhibit sortase A catalytic activity in the presence of calcium concentrations less than 100 μΜ, less than 10 μΜ, less than 1 μΜ, less than 0.1 μΜ, or less than 0.01 μΜ.
In some embodiments, the calcium-independent srtA mutants exhibit at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, or at least up to 80% of wild-type sortase A catalytic activity at 0 mM Ca2+ compared to wild-type sortase A catalytic activity at 10 mM Ca2+.
In some embodiments, the calcium-independent srtA mutants exhibit at least 85%, at least 90%, at least 91%, at least 92%, at least 90%, at least 91%, or at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or up to 100% of wild-type sortase A catalytic activity at 0 mM Ca2'' compared to wild- type sortase A catalytic activity at 10 mM Ca2+.
In some embodiments, the calcium- independent srtA mutants exhibit at least 1- fold, at least 2 - fold, at least 10-fold, at least 20-fold, at least 30-fold, at least 40-fold, at least 50-fold, at least 60-fold, at least 70-fold, at least 80-fold, at least 90-fold, or at least up to 100-fold, or more sortase A catalytic activity at 0 mM Ca2+ compared to wild-type sortase A catalytic activity at 10 mM Ca2+.
In some embodiments, the calcium- independent srtA mutants exhibit at least 50%, at least 55%, at least 60%,
at least 65%, at least 70%, at least 75%, or at least up to 80% of sortase A catalytic activity of SEQ ID NO: 1 at 0 mM Ca2+ compared to sortase A catalytic activity of SEQ ID NO: 1 at 10 mM Ca2+ .
In some embodiments, the calcium- independent srtA mutants exhibit at least 85%, at least 90%, at least 91%, at least 92%, at least 90%, at least 91%, or at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or up to 100% of sortase A catalytic activity of SEQ ID NO: 1 at 0 mM Ca2+ compared to sortase A catalytic activity of SEQ ID NO: 1 at 10 mM Ca2" .
In some embodiments, the calcium- independent srtA mutants disclosed herein exhibit increased catalytic activity compared to the sortase A of SEQ ID NO: 1. In some embodiments, the calcium-independent srtA mutants exhibit at least 2-fold, at least 3-fold, at least 4- fold, at least 5-fold, at least 6-fold, at least 7-fold, at least 8-fold, at least 9-fold, at least 10-fold, at least 20-fold, at least 30-fold, at least 40-fold, at least 50-fold, at least 60-fold, at least 70-fold, at least 80-fold, at least 90-fold, or at least up to 100- fold, or more than sortase A catalytic activity of SEQ ID NO: 1 at 0 mM Ca2+ compared to sortase A catalytic activity of SEQ ID NO: 1 at 10 mM Ca2+ .
In some embodiments, the calcium- independent srtA mutants exhibit at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, or at least up to 80% of sortase A catalytic activity of SEQ ID NO: 3 at 0 mM Ca2+ compared to sortase A catalytic activity of SEQ ID NO: 3 at 10 mM Ca2+ .
In some embodiments, the calcium- independent srtA mutants exhibit at least 85%, at least 90%, at least 91%,
at least 92%, at least 90%, at least 91%, or at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or up to 100% of sortase A catalytic activity of SEQ ID NO: 3 at 0 mM Ca2+ compared to sortase A catalytic activity of SEQ ID NO: 3 at 10 mM Ca2+ .
In some embodiments, the calcium- independent srtA mutants disclosed herein exhibit increased catalytic activity compared to the sortase A of SEQ ID NO: 3. In some embodiments, the calcium- independent srtA mutants exhibit at least at least 2-fold, at least 3-fold, at least 4-fold, at least 5-fold, at least 6-fold, at least 7-fold, at least 8-fold, at least 9-fold, at least 10- fold, at least 20-fold, at least 30-fold, at least 40- fold, at least 50-fold, at least 60-fold, at least 70- fold, at least 80-fold, at least 90-fold, or at least up to 100-fold, or more than sortase A catalytic activity of SEQ ID NO: 3 at 0 mM Ca2+ compared to sortase A catalytic activity of SEQ ID NO: 3 at 10 mM Ca2+ .
In some embodiments, the calcium- independent srtA mutants exhibit at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, or at least up to 80% of sortase A catalytic activity of SEQ ID NO: 5 at 0 mM Ca2+ compared to sortase A catalytic activity of SEQ ID NO: 5 at 10 mM Ca2+.
In some embodiments, the calcium- independent srtA mutants exhibit at least 85%, at least 90%, at least 91%, at least 92%, at least 90%, at least 91%, or at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or up to
100% of sortase A catalytic activity of SEQ ID NO: 5 at 0 mM Ca2+ compared to sortase A catalytic activity of SEQ ID NO: 5 at 10 mM Ca2+ .
In some embodiments, the calcium- independent srtA mutants disclosed herein exhibit increased catalytic activity compared to the sortase A of SEQ ID NO: 5. In some embodiments, the calcium- independent srtA mutants exhibit at least 2-fold, at least 3-fold, at least 4- fold, at least 5-fold, at least 6-fold, at least 7-fold, at least 8-fold, at least 9-fold, at least 10-fold, at least 20-fold, at least 30-fold, at least 40-fold, at least 50-fold, at least 60-fold, at least 70-fold, at least 80-fold, at least 90-fold, or at least up to 100- fold, or more than sortase A catalytic activity of SEQ ID NO: 5 at 0 mM Ca2+ compared to sortase A catalytic activity of SEQ ID NO: 5 at 10 mM Ca2+ .
Aspects of the present disclosure also relate to calcium- independent mutants comprising an amino acid sequence at least 80% identical to SEQ ID NO: 9.
In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence at least 80% identical to SEQ ID NO : 9, wherein the mutant comprises a) a K residue at position 47 of SEQ ID NO: 9; b) a Q residue at position 50 of SEQ ID NO: 9; and c) at least one amino acid residue of SEQ ID NO: 9 selected from the group consisting of i) a R residue at position 36 of SEQ ID NO:9; ii) a N residue at position 102 of SEQ ID NO: 9; iii) a A residue at position 107 of SEQ ID NO: 9; iv) a E residue at position 132 of SEQ ID NO : 9; and v) a T residue at position 138 of SEQ ID NO : 9.
In some embodiments , a calcium- independent srtA mutant comprises an amino acid sequence at least 80% identical to SEQ ID NO : 9, wherein the mutant comprises a) a K residue at position 47 of SEQ ID NO : 9; b) a A residue at position 50 of SEQ ID NO: 9; and c) at least one amino acid residue of SEQ ID NO: 9 selected from the
group consisting of i) a R residue at position 36 of SEQ ID NO: 9; ii) a N residue at position 102 of SEQ ID NO: 9; iii) a A residue at position 107 of SEQ ID NO: 9; iv) a E residue at position 132 of SEQ ID NO: 9; and v) a T residue at position 138 of SEQ ID NO: 9.
In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 16 (FIG. 4P) . In some embodiments, a calcium independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 16 comprising a A residue at position 50. In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 17 (FIG. 4Q) . In some embodiments, a calcium independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 17 comprising a A residue at position 50. In some
embodiments, a calcium-independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 18 (FIG. 4R) . In some embodiments, a calcium independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 18 comprising a A residue at position 50. In some
embodiments, a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO : 19 (FIG. 4S) . In some embodiments, a calcium independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 19 comprising a A residue at position 50. In some
embodiments, a calcium-independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 20 (FIG. 4T) . In some embodiments, a calcium independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 20 comprising a A residue at position 50. In some
embodiments, a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 21 (FIG. 4U) . In some embodiments, a calcium independent srtA mutant
comprises an amino acid sequence of SEQ ID NO: 21 comprising a A residue at position 50.
In some embodiments, a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) .
In some embodiments, a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 comprising i) and ii) . In some embodiments, a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 comprising i) and iii) . In some embodiments, a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 comprising i) and iv) . In some embodiments, a calcium-independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 comprising i) and v) .
In some embodiments, a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 comprising ii) and iii) . In some embodiments, a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 comprising ii) and iv) . In some embodiments , a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO : 9 comprising ii) and v) .
In some embodiments , a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO : 9 comprising iii) and iv) . In some embodiments , a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 comprising iii) and v) .
In some embodiments , a calcium- independent srtA mutant comprises at least two amino acid residues of SEQ ID NO: 9 comprising iv) and v) .
In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 15 (FIG. 40) . In some embodiments, a calcium independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 15 comprising a A residue at position 50.
In some embodiments, a calcium- independent srtA mutant comprises at least three amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) .
In some embodiments, a calcium- independent srtA mutant comprises at least three amino acid residues of SEQ ID NO: 9 comprising i) , ii) , and iii) . In some embodiments, a calcium- independent srtA mutant comprises at least three amino acid residues of SEQ ID NO: 9 comprising i) , ii) , and iv) . In some embodiments, a calcium- independent srtA mutant comprises at least three amino acid residues of SEQ ID NO: 9 comprising i) , ii) , and v) .
In some embodiments, a calcium- independent srtA mutant comprises at least three amino acid residues of SEQ ID NO: 9 comprising ii) , iii) , and iv) . In some embodiments, a calcium- independent srtA mutant comprises at least three amino acid residues of SEQ ID NO : 9 comprising ii) , iv) , and v) .
In some embodiments, a calcium- independent srtA mutant comprises at least three amino acid residues of SEQ ID NO : 9 comprising iii ) , iv) , and v) .
In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO : 12. (FIG . 4L) ) In some embodiments , a calcium- independent srtA mutant comprises an amino acid sequence SEQ ID NO:
13 (FIG . 4M) . In some embodiments , a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO : 14 (FIG . 4N) . In some embodiments , a calcium
independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 14 comprising a A residue at position 50.
In some embodiments, a calcium- independent srtA mutant comprises at least four amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) .
In some embodiments, a calcium- independent srtA mutant comprises at least four amino acid residues of SEQ ID NO: 9 comprising i) , ii) , iii) , and iv) . In some embodiments, a calcium-independent srtA mutant comprises at least four amino acid residues of SEQ ID NO: 9 comprising i) , iii) , iv) , and v) . In some embodiments, a calcium- independent srtA mutant comprises at least four amino acid residues of SEQ ID NO: 9 comprising ii) , iii), iv) , and v) . In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 11 (FIG. 4K) . In some embodiments, a calcium independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 11 comprising a A residue at position 50.
In some embodiments, a calcium-independent srtA mutant comprises an amino acid sequence at least 90% identical to SEQ ID NO: 9. In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence at least 95% identical to SEQ ID NO: 9. In some embodiments, a calcium-independent srtA mutant comprises an amino acid sequence at least 96% identical to SEQ ID NO: 9. In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence at least 97% identical to SEQ ID NO: 9. In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence at least 98% identical to SEQ ID NO : 9. In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence at least 99% identical to SEQ ID
NO : 9. In some embodiments, a calcium- independent srtA mutant comprises an amino acid sequence of SEQ ID NO: 9.
It should be appreciated that with the amino acid sequences of the calcium-independent srtA mutants disclosed herein, the skilled person may determine suitable polynucleotides that encode calcium- independent srtA mutants. Accordingly, certain aspects of the disclosure provide polynucleotide sequences comprising the gene encoding calcium- independent srtA mutants, and their coding sequences. In some embodiments,
polynucleotides encoding mutants of sortase A comprising a nucleotide sequence at least 80% identical to SEQ ID NO: 10 are disclosed. In some embodiments, a
polynucleotide encoding a mutant of sortase A (i.e., polynucleotides encoding calcium- independent srtA mutants) comprises a nucleotide sequence at least 80% identical to SEQ ID NO: 10, and encodes a) a K residue at position 47 of SEQ ID NO: 9; b) a Q residue at position 50 of SEQ ID NO: 9; and c) at least one amino acid residue of SEQ ID NO: 9 selected from the group
consisting of i) a R residue at position 36; ii) a N residue at position 102; iii) a A residue at position 107; iv) a E residue at position 132; and v) a T residue at position 138.
In some embodiments, a polynucleotide encoding a mutant of sortase A (i.e., polynucleotides encoding calcium- independent srtA mutants) comprises a nucleotide sequence at least 80% identical to SEQ ID NO: 10, and encodes a) a K residue at position 47 of SEQ ID NO: 9; b) a A residue at position 50 of SEQ ID NO: 9; and c) at least one amino acid residue of SEQ ID NO: 9 selected from the group consisting of i) a R residue at position 36; ii) a N residue at position 102; iii) a A residue at
position 107; iv) a E residue at position 132; and v) a T residue at position 138.
In some embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) .
In some embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising i) and ii) . In some embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising i) and iii) . In some embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising i) and iv) . In some
embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising i) and v) .
In some embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising ii) and iii) . In some embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising ii) and iv) . In some embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising ii) and v) .
In some embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising iii) and iv) . In some embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising iii) and v) .
In some embodiments, the polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 comprising iv) and v) .
In some embodiments, the polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) .
In some embodiments, the polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9
comprising i) , ii) and iii) . In some embodiments, the polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9 comprising i) , ii) and iv) . In some embodiments, the polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9 comprising i) , ii) and v) .
In some embodiments, the polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9
comprising ii) , iii) and iv) . In some embodiments, the polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9 comprising ii) , iii) and v) .
In some embodiments, the polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9
comprising iii) , iv) and v) .
In some embodiments, the polynucleotide encodes at least four amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) .
In some embodiments, the polynucleotide encodes at least four amino acid residues of SEQ ID NO: 9 comprising i) , ii) , iii) , and iv) . In some embodiments , the polynucleotide encodes at least four amino acid residues of SEQ ID NO: 9 comprising ii) , iii) , iv) and v) .
In some embodiments, the polynucleotide encodes at each of the amino acid residues of SEQ ID NO: 9.
In some embodiments, a polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence at least 85% identical to SEQ ID NO: 10. In some embodiments, a polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence at least 90% identical to SEQ ID NO: 10. In some embodiments, a polynucleotide encoding a calcium-
independent srtA mutant comprises a nucleotide sequence at least 95% identical to SEQ ID NO: 10. In some embodiments, a polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence at least 96% identical to SEQ ID NO: 10. In some embodiments, a polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence at least 97% identical to SEQ ID NO: 10. In some embodiments, polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence at least 98% identical to SEQ ID NO: 10. In some embodiments, a polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence at least 99% identical to SEQ ID NO: 10. In some embodiments, a polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence at least 100% identical to SEQ ID NO: 10. In some embodiments, a polynucleotide encoding a calcium- independent srtA mutant comprises a nucleotide sequence of SEQ ID NO: 10.
Certain aspects of the disclosure relate to a nucleic acid construct comprising a polynucleotide disclosed herein (e.g., a polynucleotide encoding a calcium- independent srtA mutant) .
In some embodiments, the nucleic acid construct comprises a nucleotide sequence that encodes one or more C-terminal or N-terminal tags.
The term "polynucleotide" refers to a molecule, which is a ribonucleic acid (RNA) or deoxyribonucleic acid (DNA) molecule, either single stranded or double stranded. The polynucleotides of the present disclosure, such as polynucleotides encoding the calcium- independent srtA mutants, can be isolated or synthesized using
standard molecular biology techniques and the sequence information provided herein. The synthetic
polynucleotides may be optimized in codon use, preferably according to the methods described in WO2006/077258 and/or PCT/EP2007/055943 , which are herein incorporated by reference. PCT/EP2007/055943 addresses codon-pair optimization. The polynucleotides encoding the calcium- independent srtA mutants of the disclosure can be amplified using cDNA, mRNA or alternatively, genomic DNA, as a template and appropriate oligonucleotide primers according to standard PGR amplification techniques. The nucleic acid so amplified can be cloned into an
appropriate vector and characterized by DNA sequence analysis. A polynucleotide may either be present in isolated form, or be comprised in recombinant nucleic acid molecules or vectors, or be comprised in a host cell. Herein standard isolation, hybridization,
transformation and cloning techniques are used (e.g., as described in Sambrook, J., Fritsh, E. F . , and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd, ed . , Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989).
By "isolated" polypeptide or protein is intended a polypeptide or protein removed from its native
environment. For example, recombinant.ly produced
polypeptides and proteins expressed in host cells are considered isolated for the purpose of the disclosure, as are native or recombinant polypeptides which have been substantially purified by any suitable technique such as, for example, the single- step purification method
disclosed in Smith and Johnson, Gene 67:31-40 (1988) .
As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked.
Expression vectors useful in the present disclosure include chromosomal-, episomal- and virus-derived vectors e.g., vectors derived from bacterial plasmids,
bacteriophage, yeast episome , yeast chromosomal elements, viruses such as baculoviruses , papova viruses, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived from combinations thereof, such as those derived from plasmid and bacteriophage genetic elements, such as cosmids and phagemids . Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques. As used herein, the terms
"transformation" and "transfection" are intended to refer to a variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium chloride co- precipitation, DEAE-dextran-mediated transfection, transduction, infection, lipofection, cationic
lipidmediated transfection or electroporation. Suitable methods for transforming or transfecting host cells can be found in Sambrook, et al . (Molecular Cloning: A
Laboratory Manual, 2nd, ed. Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989), Davis et al . , Basic Methods in Molecular Biology (1986) and other laboratory manuals .
It will be appreciated by those skilled in the art that the design of the expression vector can depend on such factors as the choice of the host cell to be
transformed, the level of expression of protein desired, etc. The vectors, such as expression vectors, of the
disclosure can be introduced into host cells to thereby produce proteins or peptides, encoded by nucleic acids as described herein (e.g. calcium- independent srtA mutant proteins, catalytically active fragments, catalytically active variants or catalytically active derivatives thereof) . The vectors, such as recombinant expression vectors, of the disclosure can be designed for expression of calcium-independent srtA mutant proteins in
prokaryotic or eukaryotic cells.
For example, calcium- independent srtA mutant proteins can be expressed in bacterial cells such as E. coli, insect cells (using baculovirus expression
vectors), filamentous fungi, yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, Calif. (1990) . Representative examples of appropriate hosts are
described hereafter.
A "host cell," as used herein, is any cell capable of being grown and maintained in cell culture under conditions allowing for production and recovery of useful quantities of a biological product, as defined herein. Host cells can be unmodified cells or cell lines, or cell lines which have been genetically modified (e.g., to facilitate production of a biological product) . In some embodiments, the host cell is a cell line that has been modified to allow for growth under desired conditions, such as in serum-free media, in cell suspension culture, or in adherent cell culture.
A host cell can be chosen that modulates the expression of the inserted sequences, or modifies and processes the gene product in a specific, desired fashion . Such modifications (e.g. , glycosylation) and
processing (e.g., cleavage) of protein products may facilitate optimal functioning of the protein. Suitable host cells are preferably prokaryotic microorganisms such as bacteria (e.g., E. coli) , or in some embodiments eukaryotic organisms, for example f ngi, such as yeasts or filamentous fungi, or plant cells.
The calcium- independent srtA mutant according to the disclosure can be recovered and purified from recombinant cell cultures by methods known in the art (e.g., ion- exchange chromatography, hydrophobic interaction
chromatography, affinity chromatography and size
exclusion chromatography to further separate the target calcium- independent mutant srtA from the bulk protein to enable recovery of the target calcium- independent srtA mutant in a highly purified state) .
In some aspects the disclosure relates to a method of producing a calcium- independent srtA mutant comprising the steps of: (a) culturing the host cell according to the disclosure in a suitable culture medium under suitable conditions to produce calcium- independent srtA mutant; and optionally (b) purifying said calcium- independent srtA mutant to provide a purified calcium- independent srtA mutant product .
Another aspect of the present disclosure relates to methods for the production of substantially purified calcium- independent srtA mutant enzyme of the present disclosure. It should be appreciated that such enzyme can be prepared in compliance with Good Manufacturing Practices (GMP) or used in a GMP-com liant process.
An exemplary method for the production of
substantially purified calcium- independent srtA mutant involves cloning a nucleic acid segment encoding the calcium- independent srtA mutant enzyme can be inserted in
a vector that contains sequences allowing expression of a sortase-transamidase in another organism, such as E.
coli. A suitable host organism can then be transformed or transfected with the vector containing the cloned nucleic acid segment. Expression is then performed in that host organism. The expressed enzyme is then purified using standard techniques. Techniques for the purification of cloned proteins are well known in the art and need not be detailed further here (e.g., affinity chromatography on a nickel NTA column for purification of a calcium- independent srtA mutant enzyme extended at its carboxyl terminus with a sufficient number of histidine residues to allow specific binding of the protein molecule to the nickel NTA column through the histidine residues) .
In some aspects the disclosure further provides an enzyme composition comprising one or more calcium- independent srtA mutants. In some aspects the disclosure relates to a sortase-mediated transpeptidation reaction catalyzed by an enzyme composition according to the disclosure. Examples of synthetic nucleophiles that can be used in such sortase-mediated transpeptidation reactions are shown in Table 1 below .
Table 1: Examples of synthetic nucleophiles used in site-specific sortase A transpeptidation reactions .
Probe La] Labeling Property Endowed
Site
H-G5K(biotin) L-OH C Term Biophysical handle
H-G5K (A P) K (Biotin) I -OH C Term Biophysical
handle/photocleavage
H-G5K (phenylazide) K (biotin) C Term Biophysical
G-OH handle/photo-crosslinker
H-G3K(FITC) -NH2 C Term Fluorescence
H-G3K (K (TAMRA) ) -NH2 C Term Fluorescence
H-G3YC (biotin) -NH2 C Term Biophysical handle
H-G3YC (Alexa 488) -NH2 C Term Fluorescence
H-AA-Ahx-K (K (TAMRA) ) -NH2 C Term Fluorescence
(S.
pyogenes)
H-G3K(C12-C24) -NH2 C Term Lipidation
H-G3K(l-ad) -NH2 C Term Hydrophobicity
H-G3WK (cholesterol) -NH2 C Term Lipidation
d-Tat (1st residue is G) C Term Cell Penetration
H-G2Y-PTD5-NH2 C Term Cell Penetration
(H2NRRQRRTSKLMKRAhx) 2KYK (GG C Term Cell Penetration
-NH2) -NH2
H-G3K(folate) -NH2 C Term Folic Acid
H2N-PEG G Term Inert Polymer
H-G3K (PEG) -OH C Term Inert Polymer
H-G3 -MAP-NH2 C Term Cell Penetration
Aminoglycoside antibiotics C Term Antibiotic
(various)
C Term GPI Mimic
GPI mimics based on 19 with C Term GPI Mimic
trisaccharide cores
Biotin-PEG-YGLPETGG-NH2 N Term Biophysical handle
Alexa 64 / -LPETGG-NH2 N Term Fluorescence
Alexa 488 -LPETGG-NH2 N Term Fluorescence
Biotin-LPRT-OMe N Term Biophysical handle
FITC-Ahx-LPRT-OMe N Term Fluorescence
FAM-LPETG-NH2 N Term Fluorescence
Biotin-GGLPETG-NH2 N Term Biophysical handle
N3-ALPETG-NH2 N Term Handle for Bioorthognal chmical reactions
[a] 1-Ad = 1-adamantyl, Ahx=aminohexanoic acid, FAM=carboxyfluorescein, FITC = fluorescin isothiocyanate , PEG = polyethylene glycol, TAMRA =
carboxytetramethylrhodamine
In some aspects the disclosure relates to the use of an enzyme composition disclosed herein for the sortagging of a protein. An enzyme composition of the disclosure may comprise a polypeptide which has the same enzymatic activity, for example the same type of transamidase activity as that provided by a polypeptide of the disclosure. An enzyme composition of the disclosure may comprise a polypeptide which has a different type of enzymatic activity than that provided by a polypeptide of the disclosure. In some embodiments, the enzyme
composition is purified to comprise calcium- independent srtA mutants of a particular sequence (e.g., SEQ ID NO: 9) . In some embodiments, the enzyme composition
comprises calcium- independent srtA mutants predominantly in monomeric form. In some embodiments, the enzyme composition comprises a mixture of both monomeric and dimeric calcium- independent srtA mutants . In some embodiments , the enzyme composition comprises a mixture of calcium- independent srtA mutants comprising different sequences. In some embodiments, the enzyme composition comprises at least two, at least three , at least four, or at least five calcium- independent sortase A mutants selected from the group consisting of SEQ ID NO : 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12 , SEQ ID NO: 13 , SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18 , SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21 , and combinations thereof . In some embodiments , the enzyme composition comprises a group of calcium- independent sortase A mutants comprising SEQ ID NO : 9,
SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO:
13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID
NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO : 20, SEQ ID NO: 21, and combinations thereof. In some
5 embodiments, the enzyme composition comprises a calcium- independent srtA mutant and a calcium-dependent srtA
mutant. In some embodiments, the enzyme composition
comprises a calcium- independent srtA mutant of SEQ ID NO: 9 and a calcium-dependent srtA mutant of SEQ ID NO: 7.
10 In some embodiments, the enzyme composition comprises a
wild-type srtA enzyme and a calcium- independent srtA
mutant. In some embodiments, the enzyme composition
comprises a calcium- independent srtA mutant of SEQ ID NO:
9, and a wild-type srtA enzyme of SEQ ID NO: 1, SEQ ID
15 NO: 3 or SEQ ID NO: 5. In some embodiments, the enzyme
composition comprises an aqueous medium.
The calcium- independent srtA mutants and enzyme compositions comprising the calcium- independent srtA
mutants disclosed herein can be used in any application
20 for which sortase-mediated ligation is desirable.
Exemplary such applications include, but are not limited to , specific incorporation of novel functionality into proteins , synthesis of neoglycoconjugates , immobilization of proteins to solid surfaces, protein labeling on living
25 cells, and protein circularization/dimerization, as is
shown in Table 2 below.
Table 2 Summary of applications for sortase-mediated ligation
Target Proteins Molecular Probe References
( 1) Specific incorporation of novel functionality into proteins
GFP-LPETG-Hisg Folate, (D)-Tat, AT-P-022, G2Y- Mao (2004)
PTD5, Gly-emGFP
GFP-LPETG-His6 Polyethyleneglycol (PEG) Parthasart hy et al . (2007)
H-2Kb-LPETG Biotin, phenylazide Popp et photocrosslinker, FITC, al. (2007) tetramethylrhodamine (TAMRA) 3- amino-3 - (o-nitrophenyl) ropionic
acid
CXCL14 -LPETG TAMRA Popp et al. (2007)
GFP-LPETG Fatty acids, cholesterol Antos et al. (2008)
PNA-LPKTGG Amphipathic peptide (MAP) Pritz et al. (2007)
GBl-VcSH3-LPETG-His5 Unlabelled GBl for NMR analysis Kobashigaw a et al . (2009)
(2) Synthesis of neoglycoconj ugates
YALPETGK peptide 6 -deoxy- 6 -amino-glucose, Samantaray
6 -deoxy- 6 -amino-mannose et al .
(2008)
Rev-LPETGK Kanamycin, ribostamycin, neomycin Samantaray et al .
(2008)
Tat -LPETGK Kanamycin, ribostamycin, neomycin Samantaray et al . (2008)
Mrp-LPNTG-Xn Tobramycin Samantaray et al . (2008)
Target Proteins Surface References
(3) Immobilisation of proteins to solid surfaces
eGFP-LPETG Polystyrene beads Parthasart hy et al . (2007) eGFP-LPETGG-Hisf, Glycidyl methacrylate (GMA) Chan et al. (2007)
Tus-LPETGG-His6 Glycidyl methacrylate (GMA) Chan et al. (2007)
Fbp-LPETG C 5 biosensor chip Chow et al. (2008)
Target Proteins Molecular Probe References
(4) Protein labeling on living cell
CD154 -LPETG Biotin, TAMRA Popp et al. (2007)
Neuraminidase-LPETG Biotin Popp et al. (2007)
ODF-LPETGG Alexa Fluor 488, biotin T¾n¾-ki3. St
al. (2008)
LPETGs-ECFP LPETGG-Alexa Fluor 647, LPETGG- Yamamoto biotin et al .
(2009)
(5) Protein circularisation
Glyv,-EGFP-LPETG-Hiss - - - Antos et al. (2009)
The calcium- independent srtA mutants according to the disclosure may feature a number of significant
advantages over existing srtA variants currently used.
Depending on the specific application, these advantages may include aspects such as lower production costs,
higher specificity towards the substrate, reduced
antigenicity, fewer undesirable side activities, higher yields when produced in a suitable microorganism, more suitable pH and temperature ranges, calcium- independence, i.e., the ability to exhibit sortase A catalytic activity in environments in which calcium is low or unavailable , such as the cytoplasm, or in the presence of calcium- binding proteins, or when calcium-binding substances such as phosphate , carbonate, and EDTA are present . In
various embodiments, a calcium- independent srtA mutant or composition of the disclosure may be used in any process which requires the sortagging of a moiety of interest to a target protein . The terms "sortagging" , "sortase-
mediated ligation", "sortase-mediated transpeptidation" , "sortase-mediated transacylation" , are used
interchangeably herein to refer to the process of adding a tag. It should be appreciated that any tag can be used. Examples of suitable tags include, but are not limited to, amino acids, peptides, proteins, nucleic acids, polynucleotides, sugars, carbohydrates, polymers, lipids, fatty acids, and small molecules. Other suitable tags will be apparent to those of skill in the art and the disclosure is not limited in this aspect.
In some embodiments, a tag comprises a sequence useful for purifying, expressing, solubilizing, and/or detecting a polypeptide. In some embodiments, a tag comprises an HA, TAP, Myc, 6XHis, Flag, or GST tag, to name few examples . In some embodiments a tag comprises a solubility-enhancing tag (e.g., a SUMO tag, NUS A tag, SNUT tag, a Strep tag, or a monomeric mutant of the Ocr protein of bacteriophage T7) . See, e.g., Esposito D and Chatterjee DK. Curr Opin Biotechnol . ; 17(4):353-8 (2006). In some embodiments, a tag is cleavable, so that it can be removed, e.g., by a protease. In some embodiments, this is achieved by including a protease cleavage site in the tag, e.g., adjacent or linked to a functional portion of the tag. Exemplary proteases include, e.g., thrombin, TEV protease, Factor Xa, PreScission protease, etc. In some embodiments, a "self-cleaving" tag is used. See, e.g., PCT/US05/05763.
In some embodiments, a tag comprises a click chemistry handle.
In some embodiments, the calcium-independent srtA mutants disclosed herein can be used to sortag cells to be administered to a subject, e.g., human subjects.
In some embodiments, a method for sortagging a target polypeptide comprises (a) providing a polypeptide comprising a sortase recognition motif; (b) providing a moiety comprising an N-terminal oligoglycine sequence or a terminal alkylamine; and (c) contacting the target polypeptide with the moiety in the presence of an enzyme composition described herein (e.g., a composition comprising at least one calcium- independent srtA mutant disclosed herein) under conditions suitable for the srtA A mutant to ligate the moiety to the target protein, thereby sortagging the target protein.
It should be appreciated that the sortagging methods disclosed herein can be used to attach a moiety to any target protein or polypeptide. Methods and compositions provided herein can be used to conjugate essentially any polypeptide to any moiety. Non- limiting examples of polypeptides that can be produced or conjugated according to methods provided herein include receptors, membrane proteins, cytokines, chemokines, hormones, enzymes, growth factors, growth factor receptors, antibodies, antibody derivatives and other immune effectors,
interleukins , interferons, erythropoietin, integrins, soluble major histocompatibility complex antigens, binding proteins, transcription factors, translation factors, oncoproteins or proto-oncoproteins , muscle proteins, myeloproteins, neuroactive proteins, tumor growth suppressors, structural proteins, and blood proteins (e.g., thrombin, serum albumin, Factor VII, Factor VIII, Factor IX, Factor X, Protein C, von
Willebrand factor, etc.). In some embodiments, the polypeptide is a glycoprotein or other polypeptide which requires post- translational modification, such as deamidation, glycation, or the like, for optimal
activity. In some embodiments, the polypeptide is a lipoprotein. Exemplary target polypeptides which have been sortagged are shown in Table 3 below.
Table 3 : Examples of proteins labeled by sortase A transpeptidation .
[a] The numbers denote the probe identities from Table 1
In some embodiments the target polypeptide comprises or consists of a polypeptide that is at least 80%, or at least 90%, e.g., at least 95%, 86%, 97%, 98%, 99%, 99.5%,
Substrate Solution/Cel Labeling Label (s) LaJ
1 Surface
H-2Kb Solution C Terminal __L 2 3 1
CD154 Cell surface C Terminal 1,5 neuraminidase Cell surface C Terminal 1
ODF Cell surface C Terminal 6,7
CRE Solution C Terminal 5
UCHL3 Solution C Terminal 1
P97 Solution C Terminal 5 eGFP Solution C Terminal 9, 10, 11
GFP Solution C Terminal 13 , 14, 15
PNA Solution C Terminal 18 eGFP Solution C Terminal 16, 17
Mrp Solution C Terminal 19
YALPETGK Solution C Terminal 19
(His) 6YALPETGKS Solution C Terminal 20
CD52 peptides Solution C Terminal 21
CD24 Solution C Terminal 21
MUC1 Solution C Terminal 21
LPETG5 -ECFP-TM Cell surface N Terminal 22, 23
LPETG5-PAFR Cell surface N Terminal 24
G3 - /G5-CTXB Solution N Terminal 25,26
G3 -eGFP Solution N Terminal 26
G-UCHL3 Solution N Terminal 26
S . aureus Cell surface N Terminal 27,28,29 surface
peptidoglycan
eGFP Solution N and C 26 and 8
Terminal
UCHL3 Solution N and C 26 and 8
Terminal
or 100% identical to a naturally occurring protein or polypeptide. In some embodiments, the target polypeptide has no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 amino acid differences relative to a naturally occurring sequence. In some embodiments the naturally occurring protein is a mammalian protein, e.g., of human origin. Naturally occurring sequences, e.g., genomic, mRNA, and polypeptide sequences, from a wide variety of species, including human, are known in the art and are available in publicly accessible databases such as those available at the National Center for Biotechnology Information (www.ncbi.nih.gov) or Universal Protein Resource
(www.uniprot.org) . Databases include, e.g., GenBank, RefSeq, Gene, UniProtKB/SwissProt , UniProtKB/Trembl , and the like. Sequences, e.g., nucleic acid (e.g., mRNA) and polypeptide sequences, in the NCBI Reference Sequence database may be used as reference sequences.
In some embodiments, a target polypeptide is a protein that is approved by the US Food & Drug
Administration (or an equivalent regulatory authority such as the European Medicines Evaluation Agency) for use in treating a disease or disorder in humans . Such proteins may or may not be one for which a PEGylated version has been tested in clinical trials and/or has been approved for marketing .
In some embodiments , a target polypeptide is a neurotrophic factor, i.e., a factor that promotes survival , development and/or function of neural lineage cells (which term as used herein includes neural
progenitor cells , neurons , and glial cells , e.g., astrocytes, oligodendrocytes, microglia) .
In some embodiments , t e target protein is one that forms homodimers or heterodimers , (or homo- or
heterooligomers comprising more than two subunits, such as tetramers) .
In some embodiments, the target polypeptide is an enzyme, e.g., an enzyme that is important in metabolism or other physiological processes. In some embodiments, a target protein comprises a receptor or receptor fragment (e.g., extracellular domain).
In some embodiments, the target polypeptide is sortagged in cells to be administered to a subject. The present disclosure contemplates any application for which administration of cells comprising a sortagged
polypeptide is desirable (e.g., therapeutic, diagnostic, imaging, etc.) As used herein, a "subject" means a human or animal. Usually the animal is a vertebrate such as a primate, rodent, domestic animal or game animal.
Primates include chimpanzees, cynomologous monkeys, spider monkeys, and macaques, e.g., Rhesus. Rodents include mice , rats, woodchucks , ferrets , rabbits and hamsters. Domestic and game animals include cows, horses, pigs, deer, bison, buffalo, feline species, e.g., domestic cat, canine species, e.g., dog, fox, wolf, avian species , e.g., chicken, emu, ostrich, and fish, e.g., trout , catfish and salmon . Subj ect includes any subset of the foregoing, e.g., all of the above, but excluding one or more groups or species such as humans , primates or rodents . In certain embodiments, the subj ect is a mammal, e.g., a primate, e.g., a human . Preferably, the subj ect is a mammal . The mammal can be a human, non- human primate, mouse, rat, dog, cat, horse, or cow, but are not limited to these examples. Mammals other than humans can be advantageously used, for example, as subj ects that represent animal models of disease . A subj ect can be male or female .
One of skill in the art will be aware that certain proteins, e.g., secreted eukaryotic (e.g., mammalian) proteins, often undergo intracellular processing (e.g., cleavage of a secretion signal prior to secretion and/or removal of other portion (s) that are not required for biological activity) , to generate a mature form. Such mature, biologically active versions of target proteins are used in certain embodiments of the disclosure.
A "moiety" to be conjugated (ligated) to a
polypeptide according to methods provided herein can be any agent suitable for conjugation to a polypeptide, i.e., capable of being operably linked to a sortase recognition sequence. The moiety can confer any of a number of possible functionalities to the polypeptide, such as but not limited to, altered physico-chemical properties, such as solubility and/or stability,- altered pharmacokinetic properties, such as bioavailability, clearance rate, and/or plasma half-life and/or altered biological activity, such as immunogenicity and/or antigenicity. Non-limiting examples of moieties include: a small -molecule , a peptide, a polypeptide, a lipid and/or fatty acid, a carbohydrate, a nucleic acid, a reporter molecule (e.g., a reporter enzyme, fluorescent molecule, a radiolabel, an affinity label, or the like), a toxin, a therapeutic agent, a nanoparticle, a resin, a cell, a virus particle, an adjuvant molecule, or a polymer (e.g., a hydrophilic polymer), an affinity tag (e.g., His6) , or the like. In one embodiment, the moiety is a pharmacological carrier molecule.
In some embodiments, the moiety comprises, consists essentially of, or consists of a member of a prosthetic binding group, such as biotin/avidin,
biotin/streptavidin, maltose binding protein/maltose,
glutathione S-transferase/glutathione ,
metal/polyhistidine, antibody/epitope , antibody/antigen, antibody/protein A or protein G, hapten/anti-hapten, folic acid/folate binding protein, vitamin B 12/intrinsic factor, nucleic acid/complementary nucleic acid,
sulfhydryl/maleimide, sulfhydryl/haloacetyl derivative, amine/isotriocyanate , amine/succinimidyl ester, or amine/sulfonyl halides .
In some embodiments, the moiety comprises, consists essentially of, or consists of a peptide, a
peptidomimetic (e.g., a peptoid) , an amino acid, an amino acid analog, a polynucleotide or polynucleotide analog, a nucleotide or nucleotide analog, or an organic or inorganic compound having a molecular weight between about 500 and about 10,000.
In some embodiments, the moiety comprises, consists essentially of, or consists of a second polypeptide. The polypeptide can be any polypeptide. For example, a protein which is difficult to produce in a cell (e.g., either due to toxicity) , can be expressed as two
fragments which can be j oined using the methods described herein (e.g. , first portion of the protein can be attached to the second portion, reconstituting an active protein using the methods described herein) .
In some embodiments , the moiety comprises , consists essentially of , or consists of a reporter molecule , such as a fluorescent molecule (e.g. , umbelliferone ,
fluorescein, fluorescein isothiocyanate , rhodamine , dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin) ; a radioisotope (e.g., Cu-64, Ga67, Ga-68, Zr-89, Ru-97, Tc-99, Rh-105, Pd-109, In- 1.1.1, 1-123 , I- 125, 1-131, Re-186, Re- 188, Au-198 , Pb-203 , At-211, Pb- 212 or Bi-212) a detectable enzyme (e.g. , horseradish
peroxidase, alkaline phosphatase, p-galactosidase, or acetylcholinesterase); a luminescent material (e.g., luminol) ; or a bioluminescent material (e.g., luciferase, luciferin, or aequorin) .
In some embodiments, the moiety comprises, consists essentially of, or consists of a biologically active molecule, such as a toxin (e.g., abrin, ricin A,
pseudomonas exotoxin or diphtheria toxin) .
In some embodiments, the moiety comprises the polypeptide itself, such that the polypeptide is cyclized by the conjugation. Advantageously, cyclized proteins often exhibit desired properties relative to the
corresponding linear protein, such as enhanced
solubility, enhanced stability, enhanced plasma half-life and/or decreasing immunogenicity . In other embodiments, the polypeptide can be 'chained' (e.g., dimerized, trimerized, etc) .
In some embodiments, the moiety is a water-soluble polymer, non-peptidic polymer with an average molecular weight of about 200 to about 200,000 Daltons, depending on the desired effect on the properties of the
polypeptide. For example, in some embodiments, the moiety comprises, consists essentially of, or consists of a polymeric group, such as polyalkylene oxide (PAO) , polyalkylene glycol (PAG) , polyethylene glycol (PEG) , methoxypolyethylene glycol (mPEG) , polypropylene glycol (PPG), branched PEGs, copolymers of ethylene glycol and propylene glycol, polyvinyl alcohol (PVA) ,
polycarboxylate, poly-vinylpyrrolidone , polyethylene-co- maleic acid anhydride, polystyrene-co-maleic acid anhydride, dext rboxymethy1 -dextran,
polyoxyethylated glycerol, polyoxyethylated sorbitol, polyoxyethylated glucose, dextran, polyoxazoline,
polyacryloylmorp oline, or a serum protein binding- ligand, such as a compound which binds to albumin (e.g., fatty acids, C5-C24 fatty acid, aliphatic diacid (e.g. C5- C24) ) . Additional polymers useful in methods and
compositions provided herein are known in the art and are described, e.g., in U.S. Pat. No. 5,629,384, which is herein incorporated by reference.
In some aspects, the target polypeptide or moiety comprises an affinity tag that can be used to facilitate recovery and/or isolation of the conjugated polypeptide.
An affinity tag used in a method or composition provided herein can comprise any peptide or other molecule for which an antibody or other specific binding agent is available. Affinity tags known in the art as being useful for protein purification include, but are not limited to, a poly-histidine segment, protein A (e.g., Nilsson et al . , EMBO J. 4:1075 (1985); Nilsson et al . , Methods Enzymol . 198:3 (1991)), glutathione S transferase (e.g., Smith and Johnson, Gene 67:31 (1988)), Glu-Glu affinity tag (e.g., Grussenmeyer et al . , Proc . Natl. Acad. Sci. USA 82:7952 (1985)), substance P, FLAG peptide (e.g., Hopp et al . , Biotechnology 6:1204 (1988)), c-myc tags (detected with anti-myc antibodies) ,
calmodulin binding protein, and streptavidin binding peptide.
In some embodiments, an affinity tag described herein allows for selective enrichment of desired conjugation products. In some embodiments, an affinity tag is located N- terminal of a sortase recognition sequence or C- terminal of an oligoglycine sequence so that the tag remains associated with the polypeptide after sortase- catalyzed cleavage and ligation. As such, the affinity tag is retained in the conjugated
polypeptide upon cleavage and/or ligation of the sortase recognition sequence by a sortase and affinity
purification isolates the intact conjugated polypeptide.
In some embodiments, an affinity tag is located C- terminal of a sortase recognition sequence so that the tag is cleaved from the polypeptide upon sortase- catalyzed cleavage and ligation. For example, the affinity tag is located C-terminal of the sortase recognition sequence (i.e., the sortase recognition sequence is between the target polypeptide and the affinity tag) . As such, after performing the cleavage and ligation reactions, the affinity tags are no longer attached to the target polypeptide, and fragments containing the affinity tag are easily removed by affinity purification.
In some aspects, the target polypeptide comprises a spacer peptide. For example, in some embodiments, a spacer peptide separates the target polypeptide from a sortase recognition sequence and/or an affinity tag, and/or the sortase recognition sequence from an affinity tag. A spacer peptide can be of any size, e.g., from several to 30 or more amino acid residues, sufficient to serve the intended purpose. Spacer peptides can enhance conformational flexibility between two or more domains of a protein and/or minimize steric interference with the folding and/or function of two or more domains of a protein. A spacer peptide will generally comprise an inert, flexible amino acid sequence, e.g., comprising predominantly glycine, serine, and/or alanine residues. In some embodiments, a spacer peptide sequence can be modified with one or more proline residues at the beginning and/or at the end of the spacer in order to isolate the spacer as a separate functional domain from
neighboring domains of the protein. A variety of spacer peptides are known in the art.
"Contacting" according to methods provided herein refers to the addition of the target polypeptide to the culture medium in a manner that allows the calcium- independent srtA mutant to ligate the moiety to target polypeptide. In some embodiments, the contacting includes culturing the cells for a defined period of time in the presence of the target polypeptide. In other embodiments, the contacting includes culturing the cells for a variable period of time until a desired endpoint or other indicator is achieved.
Conditions which allow the calcium- independent srtA mutant to cleave the sortase recognition sequence and ligate the moiety to the target polypeptide, include for example, standard cell growth conditions known to those of skill in the art, e.g. for mammalian cells; 37° C . , 5% C02, and an appropriate cell culture medium. The cell culture medium may vary depending upon the host cell and can be determined readily by those of skill in the art. In some embodiments, the "contacting" step in the method takes place in the absence of calcium, i.e., there is no exogenous calcium added to the culture medium or reaction mixture .
In some embodiments a sorta.se is immobilized by attaching it to a support. As used herein, a "support" may be any entity or plurality of entities having a surface to which a substance may be attached or on which a substance may be placed. Examples, include, e.g., particles, slides, filters, interior wall or bottom of a vessel (e.g., a culture vessel such as a plate or flask, well of a microwell plate, tube), chips, etc. A support
may be composed, e.g., of glass, metal, gels (e.g., agarose) , ceramics, polymers, or combinations thereof.
Immobilization may comprise contacting sortase or a composition containing sortase with an affinity reagent, e.g., an antibody, that binds to sortase, wherein the affinity reagent is attached to a support. In some embodiments the sortase is tagged, and the affinity reagent binds to the tag. In some embodiments sortase may comprise a tag, e.g., a 6X-His tag, which may be used to immobilize the sortase to a metal-ion containing resin or substrate. In some embodiments sortase is immobilized to magnetic particles. It will be understood that magnetic particles may be magneti sable and paramagnetic, e.g., superparamagnetic, i.e., they may only magnetic in a magnetic field. In some embodiments the support is in a column. Unreacted sortase substrates and reaction products may readily be separated from an immobilized sortase.
It will be understood that in some aspects, the disclosure encompasses agents produced according to methods described herein, and compositions comprising such agents. It will be understood that, in some aspects, the disclosure encompasses methods of using such agents, e.g., for one or more purposes described herein.
The disclosure further provides packaged products and kits, including calcium- independent srtA mutants described herein or polynucleotides encoding the same, cell lines, cell cultures, populations and compositions, including, as well cells, cultures, populations, and compositions enriched or selected for any calcium- independent srtA mutants or variants thereof, packaged into suitable packaging material. In some embodiments, a packaged product or kit includes calcium- independent srtA
mutants in monomeric form. In some embodiments, the packaged product or kit includes calcium-independent srtA mutants in dimeric form. In some embodiments, the packaged product or kit includes a mixed population of monomeric and dimeric calcium- independent srtA mutants.
In some aspects, a packaged product or kit includes a label, such as a list of the contents of the package, or instructions for using the kit e.g., instructions for sortagging a target polypeptide, isolating or producing a substantially purified calcium- independent srtA mutant disclosed herein, administering sortagged cells, e.g., implanting or transplanting in vivo, or screening for a compound or agent that modulates activity of the calcium- independent srtA mutants or variants thereof .
In certain embodiments, a packaged product or kit includes a container, such as a sealed pouch or shipping container, or an article of manufacture, for example, to carry out a sortase-mediated ligation reaction utilizing a calcium-independent srtA mutant described herein, variant thereof or composition comprising the same, or preserving or storing the calcium- independent srtA mutants , such as a tissue culture dish, tube, flask, roller bottle or plate (e.g., a single multi-well plate or dish such as an 8 , 16, 32 , 64, 96, 384 and 1536 multi- well plate or dish) .
The term "packaging material" refers to a physical structure housing the product or components of the kit . The packaging material can maintain the components sterilel , and can be made of material commonly used for such purposes (e.g. , paper, corrugated fiber, glass , plastic , foil , ampules , etc . ) . A label or packaging insert can be included, listing contents or appropriate
written instructions, for example, practicing a method of the disclosure.
A packaged product or kit can therefore include instructions for practicing any of the methods of the disclosure described herein. For example, calcium- independent srtA mutants described herein, variants thereof, or enzyme compositions comprising them can be included in a tissue culture dish, tube, flask, roller bottle or plate (e.g., a single multi-well plate or dish such as an 8, 16, 32, 64, 96, 384 and 1536 multi-well plate or dish) together with instructions, e.g., for sortagging, purification, preserving or screening.
Instructions may be on "printed matter," e.g., on paper or cardboard within the kit, on a label affixed to the package, kit or packaging material, or attached to a tissue culture dish, tube, flask, roller bottle, plate (e.g., a single multi-well plate or dish such as an 8, 16, 32, 64, 96, 384 and 1536 multi-well plate or dish) or vial containing a component of the kit. Instructions may comprise voice or video tape and additionally be included on a computer readable medium, such as a disk (floppy diskette or hard disk) , optical CD such as CD- or DVD- ROM/RAM, magnetic tape, electrical storage media such as RAM and ROM and hybrids of these such as magnetic/optical storage media.
Disclosed kits can optionally include additional components, such as buffering agent, a preservative, or a reagent. Each component of the kit can be enclosed within an individual container or in a mixture and all of the various containers can be within single or multiple packages .
Further, some aspects of this disclosure provide that nucleophiles can be used in a sortase reaction that
comprise reactive chemical moieties, for example, moieties, or "handles", suitable for a click chemistry- reaction, as is described in detail in Published PCT International Application WO 2013/036630, the entirety of which is incorporated herein by reference.
* * *
One skilled in the art readily appreciates that the present invention is well adapted to carry out the objects and obtain the ends and advantages mentioned, as well as those inherent therein. The details of the description and the examples herein are representative of certain embodiments, are exemplary, and are not intended as limitations on the scope of the invention.
Modifications therein and other uses will occur to those skilled in the art. These modifications are encompassed within the spirit of the invention. It will be readily apparent to a person skilled in the art that varying substitutions and modifications may be made to the invention disclosed herein without departing from the scope and spirit of the invention.
The articles "a" and "an" as used herein in the specification and in the claims, unless clearly indicated to the contrary, should be understood to include the plural referents. Claims or descriptions that include "or" between one or more members of a group are
considered satisfied if one, more than one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process unless indicated to the contrary or otherwise evident from the context. The invention includes embodiments in which exactly one member of the group is present in, employed in, or otherwise relevant to a given product or process. The invention also includes embodiments in which more than
one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process. Furthermore, it is to be understood that the invention provides all variations, combinations, and permutations in which one or more limitations, elements, clauses, descriptive terms, etc., from one or more of the listed claims is introduced into another claim dependent on the same base claim (or, as relevant, any other claim) unless otherwise indicated or unless it would be evident to one of ordinary skill in the art that a contradiction or inconsistency would arise. It is contemplated that all embodiments described herein are applicable to all different aspects of the invention where appropriate. It is also contemplated that any of the embodiments or aspects can be freely combined with one or more other such embodiments or aspects whenever appropriate. Where elements are presented as lists, e.g., in Markush group or similar format, it is to be understood that each subgroup of the elements is also disclosed, and any element (s) can be removed from the group. It should be understood that, in general, where the invention, or aspects of the invention, is/are referred to as
comprising particular elements, features, etc., certain embodiments of the invention or aspects of the invention consist , or consist essentially of , such elements , features, etc. For purposes of simplicity those
embodiments have not in every case been specifically set forth in so many words herein. It should also be understood that any embodiment or aspect of the invention can be explicitly excluded from the claims , regardless of whether the specific exclusion is recited in the
specification. For example, any one or more nucleic acids , polypeptides , cells , species or types of organism,
disorders, subjects, or combinations thereof, can be excluded .
Where the claims or description relate to a
composition of matter, e.g., a nucleic acid, polypeptide, cell, or non-human transgenic animal, it is to be understood that methods of making or using the
composition of matter according to any of the methods disclosed herein, and methods of using the composition of matter for any of the purposes disclosed herein are aspects of the invention, unless otherwise indicated or unless it would be evident to one of ordinary skill in the art that a contradiction or inconsistency would arise. Where the claims or description relate to a method, e.g., it is to be understood that methods of making compositions useful for performing the method, and products produced according to the method, are aspects of the invention, unless otherwise indicated or unless it would be evident to one of ordinary skill in the art that a contradiction or inconsistency would arise.
Where ranges are given herein, the invention includes embodiments in which the endpoints are included, embodiments in which both endpoints are excluded, and embodiments in which one endpoint is included and the other is excluded. It should be assumed that both endpoints are included unless indicated otherwise.
Furthermore, it is to be understood that unless otherwise indicated or otherwise evident from the context and understanding of one of ordinary skill in the art, values that are expressed as ranges can assume any specific value or subrange within the stated ranges in different embodiments of the invention, to the tenth of the unit of the lower limit of the range, unless the context clearly dictates otherwise. It is also understood that where a
series of numerical values is stated herein, the
invention includes embodiments that relate analogously to any intervening value or range defined by any two values in the series, and that the lowest value may be taken as a minimum and the greatest value may be taken as a maximum. Numerical values, as used herein, include values expressed as percentages. For any embodiment of the invention in which a numerical value is prefaced by "about" or "approximately" , the invention includes an embodiment in which the exact value is recited. For any embodiment of the invention in which a numerical value is not prefaced by "about" or "approximately" , the invention includes an embodiment in which the value is prefaced by "about" or "approximately" . "Approximately" or "about" generally includes numbers that fall within a range of 1% or in some embodiments within a range of 5% of a number or in some embodiments within a range of 10% of a number in either direction (greater than or less than the number) unless otherwise stated or otherwise evident from the context (except where such number would impermissibly exceed 100% of a possible value) . It should be
understood that , unless clearly indicated to the
contrary, in any methods claimed herein that include more than one act, the order of the acts of the method is not necessarily limited to the order in which the acts of the method are recited, but the invention includes
embodiments in which the order is so limited. It should also be understood that unless otherwise indicated or evident from the context, any product or composition described herein may be considered "isolated" .
Examples
Example 1: A substantially pure, stable, high yield, calcium-independent srtA monomeric mutant
The present inventors surprisingly and unexpectedly found that a calcium- independent srtA mutant (SEQ ID NO: 9) purifies predominantly in monomeric form compared to a calcium-dependent srtA mutant (SEQ ID NO: 7) that purifies both in dimeric and monomeric forms.
The sortase purification process involves direct loading of clarified E.coli lysate onto a nickel NTA column. His-tagged sortase binds to the nickel NTA column (via metal chelation) and is eluted using an imidazole buffer. Eluted material from the nickel NTA column is loaded onto a size exclusion chromatography column (SEC) (e.g., a HILOAD® 16/600 SUPERDEX® 75 pg) and resulting separations (chromatogram) are show in FIGS. 7A, 7B and 7C.
The column resolves/separates residual high
molecular weight impurities from the sortase dimer and monomer. As is shown in FIGS. 7A, 7B and 7C,
unexpectedly, the calcium- independent srtA mutant of SEQ ID NO: 9 exists predominantly in monomeric form (FIG. 7A) , in contrast, as noted above, the calcium-dependent sortase obtained from SEQ ID NO: 7 exists in both dimeric and monomeric form (FIG. 7B) , as does the calcium- dependent wild-type sortase (FIG. 7C) . Those skilled in the art will appreciate the unexpected advantages which this confers on the calcium- independent srtA mutant of SEQ ID NO: 9, including a higher recovery yield of 100% monomer, and simpler manufacturing process to isolate monomer as there is far less dimer to separate out, amongst others.
Without wishing to be bound by theory, it is believed that the lower amount of dimer form observed with the calcium- independent srtA mutant of SEQ ID NO: 9 is due to lower propensity to dimerize based on the outer surface location of the substituted amino acids between the calcium-dependent srtA mutant of SEQ ID NO: 7 and the calcium- independent srtA mutant of SEQ ID NO: 9 (see, e.g., FIG. 5B for an alignment of SEQ ID NO: 7 and SEQ ID NO: 9). The lower propensity of the calcium-independent srtA mutant of SEQ ID NO: 9 to dimerize is likely to translate into a more "stable" enzyme as it should also have a lower tendency to dimerize during storage, which may prolong shelf-life or permit storage under less costly conditions (e.g. refrigerated vs. frozen at - 80°C) .
FIG. 8 shows a gel demonstrating the high purity of monomeric sortase compared to dimeric sortase. For example, the gel illustrates the protein constituents present in the three main components from the calcium- independent srtA mutant of SEQ ID NO: 9 preparation separated on the SEC purification column. Collected fractions were pooled comprising the Void, Dimer and Monomeric peaks shown in FIG. 7B. The "Load" in FIG. 8 represents the starting material in the clarified E. coli lysate. The "Void" lane in FIG. 7B contains some sortase along with a large number of higher molecular weight proteins. The "Dimer" lane in FIG. 8 contains
predominantly sortase but also contains some higher molecular weight proteins. The "Monomer" lane in FIG. 8 runs as a single sortase band on the gel, indicating high purity of the monomeric form of the calcium- independent srtA mutant of SEQ ID NO: 9.
FIG . 9 shows the SEC results demonstrating the stability of monomeric sortase. Briefly, the dimeric and monomeric pools collected from the SEC purification step described above were stored at 4 °C for 24 hours, and then analyzed by analytical SEC (SUPERDEX® 75 10/300 GL) . As shown in FIG. 9, the monomeric sortase isolated during the initial purification, when analyzed one day later, still exists in monomeric form indicating that the monomeric form is stable and not in dynamic equilibrium with the dimer. Similarly, the dimeric sortase isolated during the initial purification, when analyzed one day later, still exists in dimeric form. One can note a small amount of monomer in the Dimer Pool as a
consequence of pooling a fraction that was not completely resolved (separated) in the SEC purification step.
Taken together, the work described herein
demonstrates that the calcium- independent srtA mutant of SEQ ID NO: 9 can be utilized to produce high yields of a substantially pure, stable, monomeric sortase enzyme that exhibits increased sortase A catalytic activity compared to wild-type srtA regardless of the availability of calcium .
Example 2 : Exemplary Method of Generating Calcium- Independent Sortase A Mutants
As noted above , given the nucleotide and amino acid sequences of the calcium- independent srtA mutants disclosed herein (e.g., SEQ ID NOs : 9-21) , a skilled artisan can readily appreciate a variety of suitable techniques for generating calcium- independent srtA mutants based on those sequences , e.g., site-directed mutagenesis , in vitro translation of synthetic mRNA, de novo synthesis , for example , utilizing solid phase peptide synthesis . What follows is a description of an
exemplary method of generating a calcium- independent srtA mutant comprising SEQ ID NO: 9. The skilled artisan will appreciate that the method can readily be adapted to generate any calcium- independent srtA mutant. Other suitable methods will be apparent to the skilled artisan based on the teachings herein.
The calcium- independent srtA mutant of SEQ ID NO: 9 was generated by introducing two additional nucleotide point mutations into a calcium-dependent srtA mutant of SEQ ID NO: 7. These nucleotide point mutations comprised a G->A mutation at position 139, and a G-C mutation at position 148 of SEQ ID NO: 7 .
The mutations were introduced on primers used to amplify a pET29 mutant delta 59 SrtA construct (see, e.g., Liu et al . , "A general strategy for the evolution of bond-forming enzymes using yeast display," PNAS.
2011; 108 (28) : 11399-11404) . PCR amplification was then be performed in two stages. First, the construct was amplified with primers shown in Table 4 below such that the forward primer introduced a restriction site for cloning (e.g., a Ndel restriction site), and the reverse primer introduced the two point mutations described above .
Table 4 : Primers for first half of the construct
Next, the second half of the construct was amplified with the primers shown in Table 5 below such that the forward primer introduced the two point mutations
described above, and the reverse primer introduced a restriction site for cloning (e.g., a Xhol restriction site) .
Table 5: Primers for second half of construct
The PCR reactions were performed using the pET29 pentamutant delta 59 SrtA plasmid DNA as a template, under the conditions set forth in Table 6 and cycling conditions set forth in Table 7 below.
Table 6: PCR Reaction Conditions for First Round of
PCR
0.5uL Template DNA (l-2ng total)
5uL lOx HiFi Polymerase buffer (Roche)
1.5uL lOuM forward primer
1.5uL lOuM reverse primer
luL HiFi Polymerase
1.2uL lOmM dNTPs
6uL 25mM MgC12
33uL dH20
Table 7: PCR Cycling Conditions for First Round of
PCR
95C for 5min
35 cycles of
95C for 30s
55C for 30s
72C for 1 min
72C for 7min
4C
The resulting PGR products from the first round of PGR were analyzed by gel electrophoresis, excised from the gel and purified. The products were then used as a template in a second round of PGR, and amplified using forward primer 1 and reverse primer 2 described above. The second round of PCR reactions was carried out under the conditions set forth in Table 8 below under the cycling conditions set forth in Table 9.
Table 8 : PCR Reaction Conditions for First Round of
PCR
5uL PCR product 1
luL PCR product 2
5uL lOx HiFi Buffer (Roche)
6uL 25mM MgCl2
1.2uL 10m dNTPs
luL HiFi Polymerase
30.8uL dH20
Table 9 : PCR Cycling Conditions for First Round of
PCR
95C for 5min
followed by 5 cycles of
95C 30s
55C 30s
72C 1 min
At this point, the following was added to the
reaction :
luL HiFi Polymerase (Roche)
1.5uL lOuM forward primer
1.5uL lOuM reverse primer
And cycled as follows:
95C for 5min
35 cycles of
95C for 30s
55C for 30s
72C for 1 min
72C for 7min
4C
The resulting product from the second round of PGR reactions was analyzed by gel electrophoresis, digested with Ndel and Xhol , and ligated into pET30b so that there is 6x his tag at the 3' end of the construct.
The construct was then verified by DNA sequencing and transformed into chemically competent BL21 (DE3) E.coli for protein expression. Protein was expressed and purified following published methods (Popp, Current Protocols in Protein Science, 2009) .
Sequence Key
SEQ ID NO: 1 - Exemplary Wild Type S. aureus Sortase A amino acid sequence
SEQ ID NO: 2 - Nucleotide sequence encoding Wild Type S. aureus Sortase A
SEQ ID NO: 3 - Exemplary Wild Type S. aureus Sortase A amino acid sequence
SEQ ID NO: 4 - Nucleotide sequence encoding Wild Type S. aureus Sortase A
SEQ ID NO: 5 - WT sortase delta 25 amino acid sequence
SEQ ID NO: 6 - Nucleotide sequence encoding WT sortase delta 25
SEQ ID NO: 7 - srtA P94R/D160N/D165A/K190E/K196T amino acid sequence
SEQ ID NO: 8 - Nucleotide sequence encoding srtA
P94R/D160N/D165A/K190E/K196T
SEQ ID NO: 9 - Calcium- independent srtA
P94R/D160N/D165A/K190E/K196T
SEQ ID NO: 10 - Nucleotide sequence encoding calcium- independent srtA P94R/D160N/D165A/K190E/K196T
SEQ ID NO: 11 - Calcium- independent srtA
P94S/D160N/D165A/K196T amino acid sequence
SEQ ID NO: 12 - Calcium- independent srtA
D160N/K190E/K196T amino acid sequence
SEQ ID NO: 13 - Calcium- independent srtA P94S/D160N/K196T amino acid sequence
SEQ ID NO: 14- Calcium- independent srtA P94S/D160N/D165A amino acid sequence
SEQ ID NO: 15 - Calcium- independent srtA P94S/D165A amino acid sequence
SEQ ID NO: 16 - Calcium- independent srtA P94S amino acid sequence
SEQ ID NO: 17 - Calcium- independent srtA P94R amino acid sequence
SEQ ID NO: 18 - Calcium- independent srtA D160N amino acid sequence
SEQ ID NO: 19 - Calcium- independent srtA D165A amino acid sequence
SEQ ID NO 20 - Calcium- independent srtA K190E amino acid sequence
SEQ ID NO 21 - Calcium- independent srtA K196T amino acid sequence
SEQ ID NO 22 - Forward Primer 1
SEQ ID NO 23 - Reverse Primer 1
SEQ ID NO 24 - Forward Primer 2
SEQ ID NO 25 - Reverse Primer 2
Claims
1. A sortase A mutant comprising at least three amino acid substitutions relative to a wild-type sortase A, wherein the amino acid substitutions comprise a) a K residue at position 105; b) a Q or A residue at position 108; and c) at least one amino acid substitution selected from the group consisting of i) a R residue at position 94; ii) a S residue at position 94; iii) a N residue at position 160; iv) a A residue at position 165; v) a E residue at position 190; and vi) a T residue at position 196.
2. The mutant of claim 1, wherein the wild- type sortase A comprises a S. aureus sortase A
3. The mutant of claim 1, wherein the wild- type sortase A comprises a protein sequence of SEQ ID NO: 1.
4. The mutant of claim 1 , wherein the wild- type sortase A comprises a protein sequence of SEQ ID NO: 3.
5. The mutant of any one of claims 1 to 4, comprising a deletion of amino acids 2-25.
6. The mutant of any one of claims 1 to 5, comprising a deletion of amino acids 2-59.
7. The mutant of any one of claims 1 to 6, comprising at least two amino acid substitutions selected from the group consisting of i) -vi) .
8. The mutant of any one of claims 1 to 7 , comprising at least three amino acid substitutions selected from the group consisting of i) -vi) .
9. The mutant of any one of claims 1 to 8 , comprising at least four amino acid substitutions selected from the group consisting of i) -vi) .
10. The mutant of any one of claims 1 to 9, comprising at least five amino acid substitutions selected from the group consisting of i) -vi) .
11. The mutant of any one of claims 1 to 10, wherein the mutant comprises at least 60% sequence identity to amino acid residues 60-206 of the wild-type sortase A.
12. The mutant of any one of claims 1 to 11, wherein the mutant comprises at least 80% sequence identity to amino acid residues 60-206 of the wild-type sortase A.
13. The mutant of any one of claims 1 to 12, wherein the mutant comprises at least 90% sequence identity to amino acid residues 60-206 of the wild-type sortase A.
14. The mutant of any one of claims 1 to 13, further comprising one or more C-terminal or N-terminal tags.
15. The mutant of claim 14, wherein the one or more C- terminal or N-terminal tags comprises a His6 tag.
16. The mutant of any one of claims 1 to 15, wherein the mutant exhibits sortase A catalytic activity in the absence of calcium.
17. The mutant of any one of claims 1 to 16, wherein the mutant exhibits sortase A catalytic activity in the absence of exogenous calcium.
18. The mutant of any one of claims 1 to 17, wherein the mutant exhibits sortase A catalytic activity in the presence of calcium-binding proteins.
19. The mutant of any one of claims 1 to 18, wherein the mutant exhibits sortase A catalytic activity in the presence of calcium concentrations up to 10 mM .
20. A polynucleotide encoding the mutant of any one of claims 1 to 19.
21. A nucleic acid construct comprising the
polynucleotide of claim 18.
22. A host cell transformed with the nucleic acid construct of claim 19.
23. A method of preparing a mutant sortase A comprising:
(a) culturing the host cell of claim 22 in a suitable culture medium under suitable conditions to produce the mutant sortase A; and optionally
(b) purifying the mutant sortase A to provide a mutant sortase A.
24. An enzyme composition comprising at least one sortase A mutant of any one of claims 1 to 19.
25. A method comprising performing a sortase-mediated transpeptidation reaction catalyzed by the enzyme composition of claim 24.
26. Use of an enzyme composition of claim 24 for the sortagging of a target protein.
27. A method for sortagging a target protein,
comprising :
(a) providing a target protein comprising a sortase recognition motif;
(b) providing a moiety conjugated to a terminal oligoglycine sequence or a terminal alkylamine ; and
(c) contacting the target protein with the moiety in the presence of the enzyme composition of claim 24 under conditions suitable for the sortase A mutant to
transamidate the target protein and the moiety, thereby sortagging the target protein.
28. The method of claim 27, wherein the terminal oligoglycine sequence comprises 1-10 N-terminal glycine residues .
29. The method of claims 27 or 28, wherein the moiety comprises an amino acid, a peptide, a protein, a
polynucleotide, a carbohydrate, a tag, a metal atom, a chelating agent, a contrast agent, a catalyst, a polymer, a recognition element, a small molecule, a lipid, a label, an epitope, a small molecule, a therapeutic agent,
a cross-linker, a toxin, a radioisotope, an antigen, or a click chemistry handle.
30. A kit for sortagging a target protein comprising the enzyme composition of claim 24.
31. A sortase A mutant comprising an amino acid sequence at least 80% identical to SEQ ID NO: 9, and
wherein the mutant comprises a) a K residue at position 47 of SEQ ID NO: 9; b) a Q or A residue at position 50 of SEQ ID NO: 9; and c) at least one amino acid residue of SEQ ID NO: 9 selected from the group consisting of i) a R residue at position 36 of SEQ ID NO: 9; ii) a N residue at position 102 of SEQ ID NO: 9; iii) a A residue at position 107 of SEQ ID NO: 9; iv) a E residue at position 132 of SEQ ID NO: 9; and v) a T residue at position 138 of SEQ ID NO: 9.
32. The mutant of claim 31, comprising an amino acid sequence of SEQ ID NO: 16.
33. The mutant of claim 31, comprising an amino acid sequence of SEQ ID NO: 17.
34. The mutant of claim 31, comprising an amino acid sequence of SEQ ID NO: 18.
35. The mutant of claim 31, comprising an amino acid sequence of SEQ ID NO: 19.
36. The mutant of claim 31, comprising an amino acid sequence of SEQ ID NO: 20.
37. The mutant of claim 31, comprising an amino acid sequence of SEQ ID NO: 21.
38. The mutant of claim 31, comprising at least two amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) .
39. The mutant of claim 38, comprising an amino acid sequence of SEQ ID NO: 15.
40. The mutant of claim 31, comprising at least three amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) .
41. The mutant of claim 40, comprising an amino acid sequence of SEQ ID NO: 12.
42. The mutant of claim 40, comprising an amino acid sequence of SEQ ID NO: 13.
43. The mutant of claim 40, comprising an amino acid sequence of SEQ ID NO: 14.
44. The mutant of claim 31, comprising at least four amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) -v) .
45. The mutant of claim 44, comprising an amino acid sequence of SEQ ID NO: 11.
46. The mutant of claim 31, comprising an amino acid sequence at least 90% identical to SEQ ID NO: 9.
47. The mutant of claim 31, comprising an amino acid sequence at least 95% identical to SEQ ID NO : 9.
48. The mutant of claim 31, comprising an amino acid sequence at least 96% identical to SEQ ID NO: 9.
49. The mutant of claim 31, comprising an amino acid sequence at least 97% identical to SEQ ID NO: 9.
50. The mutant of claim 31, comprising an amino acid sequence at least 98% identical to SEQ ID NO : 9.
51. The mutant of claim 31, comprising an amino acid sequence at least 99% identical to SEQ ID NO: 9.
52. The mutant of claim 31, comprising an amino acid sequence of SEQ ID NO: 9.
53. An enzyme composition comprising at least one mutant sortase A of any one of claims 31 to 52.
54. A method comprising performing a sortase-mediated transpeptidation reaction catalyzed by the enzyme composition of claim 53.
55. Use of an enzyme composition of claim 53 for the sortagging of a target protein.
56. A method for sortagging a target protein,
comprising :
(a) providing a target protein comprising a sortase recognition motif ;
(b) providing a moiety conjugated to at least one of an oligoglycine sequence or a terminal alkylamine ; and
(c) contacting the target protein with the moiety in the presence of the enzyme composition of claim 53 under conditions suitable for the sortase A mutant to
transamidate the target protein and the moiety, thereby sortagging the target protein.
57. The method of claim 56, wherein the oligoglycine sequence comprises 1-10 N-terminal glycine residues.
58. The method of claims 55 or 56, wherein the moiety comprises an amino acid, a peptide, a protein, a
polynucleotide, a carbohydrate, a tag, a metal atom, a chelating agent, a contrast agent, a catalyst, a polymer, a recognition element, a small molecule, a lipid, a label, an epitope, a small molecule, a therapeutic agent, a cross-linker, a toxin, a radioisotope, an antigen, or a click chemistry handle.
59. A kit for sortagging a target protein comprising the enzyme composition of claim 53.
60. A polynucleotide encoding a sortase A mutant comprising a nucleotide sequence at least 80% identical to SEQ ID NO. 10, wherein the nucleotide sequence encodes a) a K residue at position 47 of SEQ ID NO: 9; b) a Q or A residue at position 50 of SEQ ID NO: 9; and c) at least one amino acid residue of SEQ ID NO: 9 selected from the group consisting of i) a R residue at position 36; ii) a N residue at position 102; iii) a A residue at position 107; iv) a E residue at position 132; and v) a T residue at position 138.
61. The polynucleotide of claim 60, wherein the
polynucleotide encodes at least two amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) - v) .
62. The polynucleotide of claim 60, wherein the
polynucleotide encodes at least three amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) - v) .
63. The polynucleotide of claim 60, wherein the
polynucleotide encodes at least four amino acid residues of SEQ ID NO: 9 selected from the group consisting of i) - v) .
64. The polynucleotide of claim 60, comprising a nucleotide sequence at least 85% identical to SEQ ID NO: 10.
65. The polynucleotide of claim 60, comprising a nucleotide sequence at least 90% identical to SEQ ID NO: 10.
66. The polynucleotide of claim 60, comprising a nucleotide sequence at least 95% identical to SEQ ID NO: 10.
67. The polynucleotide of claim 60, comprising a nucleotide sequence at least 96% identical to SEQ ID NO:
10.
68. The polynucleotide of claim 60, comprising a nucleotide sequence at least 97% identical to SEQ ID NO: 10.
69. The polynucleotide of claim 60, comprising a nucleotide sequence at least 98% identical to SEQ ID NO: 10.
70. The polynucleotide of claim 60, comprising a nucleotide sequence at least 99% identical to SEQ ID NO: 10.
71. The polynucleotide of claim 60, comprising a nucleotide sequence of SEQ ID NO: 10.
72. A nucleic acid construct comprising the
polynucleotide of any one of claims 60 to 71.
73. The nucleic acid construct of claim 72, further comprising a nucleotide sequence that encodes one or more C-terminal or N-terminal tags.
74. The nucleic acid construct of claim 73, wherein one or more C-terminal or N-terminal tags comprises a His6 tag.
75. A host cell transformed with the nucleic acid construct of any one of claims 72 to 74.
76. A method of preparing a mutant sortase A comprising : (a) culturing the host cell of claim 75 in a suitable culture medium under suitable conditions to produce the mutant sortase A; and optionally
(b) purifying the mutant sortase A to provide a mutant sortase A .
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201461943042P | 2014-02-21 | 2014-02-21 | |
US61/943,042 | 2014-02-21 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2015127365A2 true WO2015127365A2 (en) | 2015-08-27 |
WO2015127365A3 WO2015127365A3 (en) | 2015-10-15 |
Family
ID=53879254
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2015/017116 WO2015127365A2 (en) | 2014-02-21 | 2015-02-23 | Calcium-independent sortase a mutants |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2015127365A2 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018005716A3 (en) * | 2016-07-01 | 2018-02-08 | Denali Therapeutics Inc. | Albumin variants for enhanced serum half-life |
US20220403360A1 (en) * | 2021-01-28 | 2022-12-22 | Genequantum Healthcare (Suzhou) Co., Ltd. | Ligase fusion proteins and application thereof |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2365523A1 (en) * | 1999-04-15 | 2000-10-26 | The Regents Of The University Of California | Identification of sortase gene |
WO2013016653A1 (en) * | 2011-07-28 | 2013-01-31 | Cell Signaling Technology, Inc. | Multi component detection |
-
2015
- 2015-02-23 WO PCT/US2015/017116 patent/WO2015127365A2/en active Application Filing
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018005716A3 (en) * | 2016-07-01 | 2018-02-08 | Denali Therapeutics Inc. | Albumin variants for enhanced serum half-life |
US20220403360A1 (en) * | 2021-01-28 | 2022-12-22 | Genequantum Healthcare (Suzhou) Co., Ltd. | Ligase fusion proteins and application thereof |
Also Published As
Publication number | Publication date |
---|---|
WO2015127365A3 (en) | 2015-10-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5723584A (en) | Biotinylation of proteins | |
Tsukiji et al. | Sortase‐mediated ligation: a gift from gram‐positive bacteria to protein engineering | |
Popp et al. | Making and breaking peptide bonds: protein engineering using sortase | |
JP4369662B2 (en) | Combinatorial library of monomer domains | |
CA2583009C (en) | Ubiquitin or gamma-crystalline conjugates for use in therapy, diagnosis and chromatography | |
Chapman-Smith et al. | In vivo enzymatic protein biotinylation | |
US20170226495A1 (en) | Sortase molecules and uses thereof | |
WO2005051976A2 (en) | Protein and peptide ligation processes and one-step purification processes | |
JP4377242B2 (en) | Protein tag comprising a biotinylated domain, method for increasing solubility and method for determining folding state | |
JP2008173124A (en) | Identification and isolation of novel polypeptides having WW domains and methods of use thereof | |
KR20220105157A (en) | Protein Purification Using the Split Intein System | |
US20160304571A1 (en) | Synthesis of Site Specifically-Linked Ubiquitin | |
EP2027313B1 (en) | Compositions and methods comprising the use of cell surface displayed homing endonucleases | |
CN108699104A (en) | FKBP structural domains with transglutamin-ase 9 enzyme recognition site | |
WO2015127365A2 (en) | Calcium-independent sortase a mutants | |
TW201309720A (en) | Peptide library | |
JP5865002B2 (en) | Recombinant plasmid vector and protein production method using the same | |
CN108823225B (en) | Expression system for directly realizing fatty acid modification in protein translation process and application | |
US20230296616A1 (en) | Arginine fluorescent probe, preparation method therefor and application thereof | |
CN117062828A (en) | Polypeptides interacting with peptide tags at the loop or terminal and uses thereof | |
WO2002066669A1 (en) | Method for purification of soluble ssao | |
RU2728652C1 (en) | Recombinant plasmid dna pet19b-sav, providing synthesis of full-length streptavidin streptomyces avidinii protein, bacterial strain escherichia coli - producer of soluble full-length protein of streptavidin streptomyces avidinii | |
US20240026343A1 (en) | Method for screening dimerized cyclic peptide | |
RU2418859C1 (en) | Composition of complementing fragments of molecule of photoproteins from hydrozoa and method of its application for analysis of intermolecular interactions | |
US8273540B2 (en) | Methods and compositions relating to E3 ring-E2 fusion proteins |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15751497 Country of ref document: EP Kind code of ref document: A2 |
|
NENP | Non-entry into the national phase in: |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 15751497 Country of ref document: EP Kind code of ref document: A2 |