WO1993001305A1 - A method for the identification of protease inhibitors - Google Patents
A method for the identification of protease inhibitors Download PDFInfo
- Publication number
- WO1993001305A1 WO1993001305A1 PCT/US1992/005745 US9205745W WO9301305A1 WO 1993001305 A1 WO1993001305 A1 WO 1993001305A1 US 9205745 W US9205745 W US 9205745W WO 9301305 A1 WO9301305 A1 WO 9301305A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- protease
- nucleic acid
- coding sequence
- acid sequence
- protein
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 49
- 239000000137 peptide hydrolase inhibitor Substances 0.000 title claims abstract description 35
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 title abstract description 13
- 108091005804 Peptidases Proteins 0.000 claims abstract description 134
- 239000004365 Protease Substances 0.000 claims abstract description 130
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 115
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 76
- 239000003112 inhibitor Substances 0.000 claims abstract description 48
- 230000000694 effects Effects 0.000 claims abstract description 18
- 230000001419 dependent effect Effects 0.000 claims abstract description 17
- 230000001131 transforming effect Effects 0.000 claims abstract description 9
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims abstract 34
- 210000004027 cell Anatomy 0.000 claims description 115
- 108091026890 Coding region Proteins 0.000 claims description 51
- 150000007523 nucleic acids Chemical group 0.000 claims description 43
- 108020004414 DNA Proteins 0.000 claims description 42
- 241000723854 Zucchini yellow mosaic virus Species 0.000 claims description 33
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 claims description 32
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 28
- 241000588724 Escherichia coli Species 0.000 claims description 23
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 claims description 21
- 108020004707 nucleic acids Proteins 0.000 claims description 17
- 102000039446 nucleic acids Human genes 0.000 claims description 17
- 229960005322 streptomycin Drugs 0.000 claims description 16
- 102100023912 40S ribosomal protein S12 Human genes 0.000 claims description 6
- 230000003115 biocidal effect Effects 0.000 claims description 6
- 230000002401 inhibitory effect Effects 0.000 claims description 6
- 108010092841 ribosomal protein S12 Proteins 0.000 claims description 6
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 4
- 229910052799 carbon Inorganic materials 0.000 claims description 4
- 230000001580 bacterial effect Effects 0.000 claims description 3
- 108090000765 processed proteins & peptides Proteins 0.000 abstract description 45
- 230000001404 mediated effect Effects 0.000 abstract description 5
- 238000011282 treatment Methods 0.000 abstract description 5
- 230000002779 inactivation Effects 0.000 abstract description 3
- 230000005764 inhibitory process Effects 0.000 abstract description 3
- 230000004913 activation Effects 0.000 abstract description 2
- 102000035195 Peptidases Human genes 0.000 description 99
- 235000019419 proteases Nutrition 0.000 description 72
- 235000018102 proteins Nutrition 0.000 description 57
- 239000002773 nucleotide Substances 0.000 description 29
- 125000003729 nucleotide group Chemical group 0.000 description 29
- 238000003776 cleavage reaction Methods 0.000 description 28
- 230000007017 scission Effects 0.000 description 28
- 101710132601 Capsid protein Proteins 0.000 description 27
- 101710094648 Coat protein Proteins 0.000 description 27
- 101710125418 Major capsid protein Proteins 0.000 description 27
- 101710141454 Nucleoprotein Proteins 0.000 description 27
- 101710083689 Probable capsid protein Proteins 0.000 description 27
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 26
- 108010076039 Polyproteins Proteins 0.000 description 25
- 102000004196 processed proteins & peptides Human genes 0.000 description 25
- 241000196324 Embryophyta Species 0.000 description 24
- 239000013612 plasmid Substances 0.000 description 19
- 229940088598 enzyme Drugs 0.000 description 18
- 102000004190 Enzymes Human genes 0.000 description 17
- 108090000790 Enzymes Proteins 0.000 description 17
- 150000001413 amino acids Chemical class 0.000 description 17
- 108020004705 Codon Proteins 0.000 description 16
- 101800001776 Nuclear inclusion protein B Proteins 0.000 description 16
- 239000012634 fragment Substances 0.000 description 15
- 230000014509 gene expression Effects 0.000 description 14
- 239000000203 mixture Substances 0.000 description 14
- 241000700605 Viruses Species 0.000 description 13
- 235000001014 amino acid Nutrition 0.000 description 13
- 239000000758 substrate Substances 0.000 description 12
- 239000003981 vehicle Substances 0.000 description 12
- 241001465754 Metazoa Species 0.000 description 10
- 238000010367 cloning Methods 0.000 description 10
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 10
- 230000014616 translation Effects 0.000 description 10
- 238000012163 sequencing technique Methods 0.000 description 9
- 238000013519 translation Methods 0.000 description 9
- 239000013598 vector Substances 0.000 description 9
- 239000002299 complementary DNA Substances 0.000 description 8
- 201000010099 disease Diseases 0.000 description 8
- 238000009472 formulation Methods 0.000 description 8
- 239000002243 precursor Substances 0.000 description 8
- 230000035945 sensitivity Effects 0.000 description 8
- 230000027455 binding Effects 0.000 description 7
- 230000007246 mechanism Effects 0.000 description 7
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 6
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 6
- 239000004480 active ingredient Substances 0.000 description 6
- 238000010369 molecular cloning Methods 0.000 description 6
- 229920001184 polypeptide Polymers 0.000 description 6
- 210000001519 tissue Anatomy 0.000 description 6
- 238000013518 transcription Methods 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 5
- 101800001402 Cytoplasmic inclusion protein Proteins 0.000 description 5
- 108010067902 Peptide Library Proteins 0.000 description 5
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 5
- 210000000349 chromosome Anatomy 0.000 description 5
- 230000000295 complement effect Effects 0.000 description 5
- 238000012217 deletion Methods 0.000 description 5
- 230000037430 deletion Effects 0.000 description 5
- 239000005546 dideoxynucleotide Substances 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 238000001415 gene therapy Methods 0.000 description 5
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 5
- 230000012010 growth Effects 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 230000002797 proteolythic effect Effects 0.000 description 5
- 230000009466 transformation Effects 0.000 description 5
- 230000009261 transgenic effect Effects 0.000 description 5
- 230000003612 virological effect Effects 0.000 description 5
- 102000053602 DNA Human genes 0.000 description 4
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 4
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 4
- 241000710078 Potyvirus Species 0.000 description 4
- 229960000723 ampicillin Drugs 0.000 description 4
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 4
- 239000000969 carrier Substances 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 239000000411 inducer Substances 0.000 description 4
- 210000004698 lymphocyte Anatomy 0.000 description 4
- 108020004999 messenger RNA Proteins 0.000 description 4
- 229940046166 oligodeoxynucleotide Drugs 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 230000001105 regulatory effect Effects 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 150000003839 salts Chemical group 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 108010005843 Cysteine Proteases Proteins 0.000 description 3
- 102000005927 Cysteine Proteases Human genes 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 3
- 239000006391 Luria-Bertani Medium Substances 0.000 description 3
- 101800000577 Maturation protease Proteins 0.000 description 3
- 108010052285 Membrane Proteins Proteins 0.000 description 3
- 108010006035 Metalloproteases Proteins 0.000 description 3
- 102000005741 Metalloproteases Human genes 0.000 description 3
- 108700026244 Open Reading Frames Proteins 0.000 description 3
- 108010076504 Protein Sorting Signals Proteins 0.000 description 3
- 108700022715 Viral Proteases Proteins 0.000 description 3
- 101900314169 Zucchini yellow mosaic virus Capsid protein Proteins 0.000 description 3
- 238000000246 agarose gel electrophoresis Methods 0.000 description 3
- 210000000170 cell membrane Anatomy 0.000 description 3
- 239000001913 cellulose Substances 0.000 description 3
- 229920002678 cellulose Polymers 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 241001233957 eudicotyledons Species 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 102000037865 fusion proteins Human genes 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- 238000011081 inoculation Methods 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- 239000008101 lactose Substances 0.000 description 3
- 108010026228 mRNA guanylyltransferase Proteins 0.000 description 3
- 239000006225 natural substrate Substances 0.000 description 3
- 239000000546 pharmaceutical excipient Substances 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 235000019833 protease Nutrition 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 210000003705 ribosome Anatomy 0.000 description 3
- 239000000829 suppository Substances 0.000 description 3
- 238000013268 sustained release Methods 0.000 description 3
- 239000012730 sustained-release form Substances 0.000 description 3
- 238000002560 therapeutic procedure Methods 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 2
- 208000030507 AIDS Diseases 0.000 description 2
- 241000710929 Alphavirus Species 0.000 description 2
- 108010077805 Bacterial Proteins Proteins 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- 102000029816 Collagenase Human genes 0.000 description 2
- 108060005980 Collagenase Proteins 0.000 description 2
- 241000723607 Comovirus Species 0.000 description 2
- 108020004635 Complementary DNA Proteins 0.000 description 2
- 241000218631 Coniferophyta Species 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- 241000711573 Coronaviridae Species 0.000 description 2
- 102000010911 Enzyme Precursors Human genes 0.000 description 2
- 108010062466 Enzyme Precursors Proteins 0.000 description 2
- 241000701959 Escherichia virus Lambda Species 0.000 description 2
- 241000710831 Flavivirus Species 0.000 description 2
- 241000209510 Liliopsida Species 0.000 description 2
- 241000709757 Luteovirus Species 0.000 description 2
- 206010027476 Metastases Diseases 0.000 description 2
- 108060004795 Methyltransferase Proteins 0.000 description 2
- 102000006404 Mitochondrial Proteins Human genes 0.000 description 2
- 108010058682 Mitochondrial Proteins Proteins 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 241000723638 Nepovirus Species 0.000 description 2
- 108010067372 Pancreatic elastase Proteins 0.000 description 2
- 102000016387 Pancreatic elastase Human genes 0.000 description 2
- 241000710778 Pestivirus Species 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 108010078762 Protein Precursors Proteins 0.000 description 2
- 102000014961 Protein Precursors Human genes 0.000 description 2
- 108010022999 Serine Proteases Proteins 0.000 description 2
- 102000012479 Serine Proteases Human genes 0.000 description 2
- 241000710119 Sobemovirus Species 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 108020000999 Viral RNA Proteins 0.000 description 2
- 208000036142 Viral infection Diseases 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 229940088710 antibiotic agent Drugs 0.000 description 2
- 230000000975 bioactive effect Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- OSASVXMJTNOKOY-UHFFFAOYSA-N chlorobutanol Chemical compound CC(C)(O)C(Cl)(Cl)Cl OSASVXMJTNOKOY-UHFFFAOYSA-N 0.000 description 2
- 208000037976 chronic inflammation Diseases 0.000 description 2
- 230000006020 chronic inflammation Effects 0.000 description 2
- 230000001332 colony forming effect Effects 0.000 description 2
- 239000013256 coordination polymer Substances 0.000 description 2
- 229920001577 copolymer Polymers 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 208000035475 disorder Diseases 0.000 description 2
- 238000009510 drug design Methods 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 108010030074 endodeoxyribonuclease MluI Proteins 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- NAQMVNRVTILPCV-UHFFFAOYSA-N hexane-1,6-diamine Chemical compound NCCCCCCN NAQMVNRVTILPCV-UHFFFAOYSA-N 0.000 description 2
- 210000003630 histaminocyte Anatomy 0.000 description 2
- 230000003301 hydrolyzing effect Effects 0.000 description 2
- 230000036737 immune function Effects 0.000 description 2
- 238000003119 immunoblot Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 230000002132 lysosomal effect Effects 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 230000009401 metastasis Effects 0.000 description 2
- 244000000010 microbial pathogen Species 0.000 description 2
- 210000002850 nasal mucosa Anatomy 0.000 description 2
- 230000008520 organization Effects 0.000 description 2
- 238000004806 packaging method and process Methods 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 125000006850 spacer group Chemical group 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 231100000331 toxic Toxicity 0.000 description 2
- 230000002588 toxic effect Effects 0.000 description 2
- 239000003053 toxin Substances 0.000 description 2
- 231100000765 toxin Toxicity 0.000 description 2
- 108700012359 toxins Proteins 0.000 description 2
- 230000005030 transcription termination Effects 0.000 description 2
- 238000010361 transduction Methods 0.000 description 2
- 230000026683 transduction Effects 0.000 description 2
- 230000032258 transport Effects 0.000 description 2
- GETQZCLCWQTVFV-UHFFFAOYSA-N trimethylamine Chemical compound CN(C)C GETQZCLCWQTVFV-UHFFFAOYSA-N 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 241001430294 unidentified retrovirus Species 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 101150085703 vir gene Proteins 0.000 description 2
- 230000009385 viral infection Effects 0.000 description 2
- SUNMBRGCANLOEG-UHFFFAOYSA-N 1,3-dichloroacetone Chemical class ClCC(=O)CCl SUNMBRGCANLOEG-UHFFFAOYSA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- MIJDSYMOBYNHOT-UHFFFAOYSA-N 2-(ethylamino)ethanol Chemical compound CCNCCO MIJDSYMOBYNHOT-UHFFFAOYSA-N 0.000 description 1
- NEAQRZUHTPSBBM-UHFFFAOYSA-N 2-hydroxy-3,3-dimethyl-7-nitro-4h-isoquinolin-1-one Chemical class C1=C([N+]([O-])=O)C=C2C(=O)N(O)C(C)(C)CC2=C1 NEAQRZUHTPSBBM-UHFFFAOYSA-N 0.000 description 1
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- 241000756998 Alismatales Species 0.000 description 1
- 208000024827 Alzheimer disease Diseases 0.000 description 1
- 241000219317 Amaranthaceae Species 0.000 description 1
- QGZKDVFQNNGYKY-UHFFFAOYSA-O Ammonium Chemical compound [NH4+] QGZKDVFQNNGYKY-UHFFFAOYSA-O 0.000 description 1
- 241001124076 Aphididae Species 0.000 description 1
- 102000004580 Aspartic Acid Proteases Human genes 0.000 description 1
- 108010017640 Aspartic Acid Proteases Proteins 0.000 description 1
- 108091005502 Aspartic proteases Proteins 0.000 description 1
- 102000035101 Aspartic proteases Human genes 0.000 description 1
- 241000208838 Asteraceae Species 0.000 description 1
- 102000004506 Blood Proteins Human genes 0.000 description 1
- BTBUEUYNUDRHOZ-UHFFFAOYSA-N Borate Chemical compound [O-]B([O-])[O-] BTBUEUYNUDRHOZ-UHFFFAOYSA-N 0.000 description 1
- 102400000107 C-terminal peptide Human genes 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 102000008186 Collagen Human genes 0.000 description 1
- 108010035532 Collagen Proteins 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 108091028732 Concatemer Proteins 0.000 description 1
- 240000001980 Cucurbita pepo Species 0.000 description 1
- 235000009852 Cucurbita pepo Nutrition 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 108010016626 Dipeptides Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 108010059378 Endopeptidases Proteins 0.000 description 1
- 102000005593 Endopeptidases Human genes 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108010091443 Exopeptidases Proteins 0.000 description 1
- 102000018389 Exopeptidases Human genes 0.000 description 1
- 241000220485 Fabaceae Species 0.000 description 1
- 208000007212 Foot-and-Mouth Disease Diseases 0.000 description 1
- 241000710198 Foot-and-mouth disease virus Species 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 229940122440 HIV protease inhibitor Drugs 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 241000234269 Liliales Species 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- FYYHWMGAXLPEAU-UHFFFAOYSA-N Magnesium Chemical compound [Mg] FYYHWMGAXLPEAU-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- 102000002274 Matrix Metalloproteinases Human genes 0.000 description 1
- 108010000684 Matrix Metalloproteinases Proteins 0.000 description 1
- 108090000301 Membrane transport proteins Proteins 0.000 description 1
- 102000003939 Membrane transport proteins Human genes 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 108010058846 Ovalbumin Proteins 0.000 description 1
- ZPHBZEQOLSRPAK-UHFFFAOYSA-N Phosphoramidon Natural products C=1NC2=CC=CC=C2C=1CC(C(O)=O)NC(=O)C(CC(C)C)NP(O)(=O)OC1OC(C)C(O)C(O)C1O ZPHBZEQOLSRPAK-UHFFFAOYSA-N 0.000 description 1
- 208000000474 Poliomyelitis Diseases 0.000 description 1
- 108010020346 Polyglutamic Acid Proteins 0.000 description 1
- 108010039918 Polylysine Proteins 0.000 description 1
- 229920000037 Polyproline Polymers 0.000 description 1
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 1
- XBDQKXXYIPTUBI-UHFFFAOYSA-M Propionate Chemical compound CCC([O-])=O XBDQKXXYIPTUBI-UHFFFAOYSA-M 0.000 description 1
- 101710118538 Protease Proteins 0.000 description 1
- 108010007131 Pulmonary Surfactant-Associated Protein B Proteins 0.000 description 1
- 102100032617 Pulmonary surfactant-associated protein B Human genes 0.000 description 1
- 230000004570 RNA-binding Effects 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 208000035415 Reinfection Diseases 0.000 description 1
- 102000009661 Repressor Proteins Human genes 0.000 description 1
- 108010034634 Repressor Proteins Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 229920002684 Sepharose Polymers 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 102000009843 Thyroglobulin Human genes 0.000 description 1
- 108010034949 Thyroglobulin Proteins 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 108010067390 Viral Proteins Proteins 0.000 description 1
- 230000005856 abnormality Effects 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 150000003838 adenosines Chemical class 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 150000001299 aldehydes Chemical class 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 229940019748 antifibrinolytic proteinase inhibitors Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 229960000686 benzalkonium chloride Drugs 0.000 description 1
- CADWTSSKOVRVJC-UHFFFAOYSA-N benzyl(dimethyl)azanium;chloride Chemical compound [Cl-].C[NH+](C)CC1=CC=CC=C1 CADWTSSKOVRVJC-UHFFFAOYSA-N 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 229910002056 binary alloy Inorganic materials 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000008033 biological extinction Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 230000023555 blood coagulation Effects 0.000 description 1
- 210000002798 bone marrow cell Anatomy 0.000 description 1
- 125000005620 boronic acid group Chemical class 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 238000010805 cDNA synthesis kit Methods 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 229960004926 chlorobutanol Drugs 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 230000001886 ciliary effect Effects 0.000 description 1
- 229920001436 collagen Polymers 0.000 description 1
- 230000004154 complement system Effects 0.000 description 1
- 238000013270 controlled release Methods 0.000 description 1
- 239000013601 cosmid vector Substances 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- WZHCOOQXZCIUNC-UHFFFAOYSA-N cyclandelate Chemical compound C1C(C)(C)CC(C)CC1OC(=O)C(O)C1=CC=CC=C1 WZHCOOQXZCIUNC-UHFFFAOYSA-N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 108091007734 digestive enzymes Proteins 0.000 description 1
- 102000038379 digestive enzymes Human genes 0.000 description 1
- 108091007735 digestive proteases Proteins 0.000 description 1
- MUCZHBLJLSDCSD-UHFFFAOYSA-N diisopropyl fluorophosphate Chemical compound CC(C)OP(F)(=O)OC(C)C MUCZHBLJLSDCSD-UHFFFAOYSA-N 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 231100000673 dose–response relationship Toxicity 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 230000013020 embryo development Effects 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 230000007368 endocrine function Effects 0.000 description 1
- 229940066758 endopeptidases Drugs 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- -1 for example Chemical class 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 208000006454 hepatitis Diseases 0.000 description 1
- 231100000283 hepatitis Toxicity 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 239000004030 hiv protease inhibitor Substances 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 235000011167 hydrochloric acid Nutrition 0.000 description 1
- 239000000017 hydrogel Substances 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-M hydroxide Chemical compound [OH-] XLYOFNOQVPJJNP-UHFFFAOYSA-M 0.000 description 1
- 150000004679 hydroxides Chemical class 0.000 description 1
- 210000001822 immobilized cell Anatomy 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 239000007943 implant Substances 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 239000007972 injectable composition Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 150000007529 inorganic bases Chemical class 0.000 description 1
- 238000010255 intramuscular injection Methods 0.000 description 1
- 239000007927 intramuscular injection Substances 0.000 description 1
- 210000001739 intranuclear inclusion body Anatomy 0.000 description 1
- 238000010253 intravenous injection Methods 0.000 description 1
- 230000007794 irritation Effects 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- JJWLVOIRVHMVIS-UHFFFAOYSA-N isopropylamine Chemical compound CC(C)N JJWLVOIRVHMVIS-UHFFFAOYSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 150000002576 ketones Chemical class 0.000 description 1
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 1
- 231100000225 lethality Toxicity 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 239000006193 liquid solution Substances 0.000 description 1
- 239000006194 liquid suspension Substances 0.000 description 1
- 210000004880 lymph fluid Anatomy 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 239000011777 magnesium Substances 0.000 description 1
- 229910052749 magnesium Inorganic materials 0.000 description 1
- ZLNQQNXFFQJAID-UHFFFAOYSA-L magnesium carbonate Chemical compound [Mg+2].[O-]C([O-])=O ZLNQQNXFFQJAID-UHFFFAOYSA-L 0.000 description 1
- 239000001095 magnesium carbonate Substances 0.000 description 1
- 229910000021 magnesium carbonate Inorganic materials 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 230000009061 membrane transport Effects 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 208000037819 metastatic cancer Diseases 0.000 description 1
- 208000011575 metastatic malignant neoplasm Diseases 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 150000007522 mineralic acids Chemical class 0.000 description 1
- 230000011278 mitosis Effects 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 201000009240 nasopharyngitis Diseases 0.000 description 1
- 230000007658 neurological function Effects 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 108010058731 nopaline synthase Proteins 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- QIQXTHQIDYTFRH-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O QIQXTHQIDYTFRH-UHFFFAOYSA-N 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 230000005305 organ development Effects 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 150000007530 organic bases Chemical class 0.000 description 1
- 229940092253 ovalbumin Drugs 0.000 description 1
- 239000006179 pH buffering agent Substances 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 230000007918 pathogenicity Effects 0.000 description 1
- 239000008194 pharmaceutical composition Substances 0.000 description 1
- NMHMNPHRMNGLLB-UHFFFAOYSA-N phloretic acid Chemical compound OC(=O)CCC1=CC=C(O)C=C1 NMHMNPHRMNGLLB-UHFFFAOYSA-N 0.000 description 1
- 108010072906 phosphoramidon Proteins 0.000 description 1
- BWSDNRQVTFZQQD-AYVHNPTNSA-N phosphoramidon Chemical compound O([P@@](O)(=O)N[C@H](CC(C)C)C(=O)N[C@H](CC=1[C]2C=CC=CC2=NC=1)C(O)=O)[C@H]1O[C@@H](C)[C@H](O)[C@@H](O)[C@@H]1O BWSDNRQVTFZQQD-AYVHNPTNSA-N 0.000 description 1
- 235000011007 phosphoric acid Nutrition 0.000 description 1
- 150000003016 phosphoric acids Chemical class 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 239000006187 pill Substances 0.000 description 1
- 244000000003 plant pathogen Species 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 229920001200 poly(ethylene-vinyl acetate) Polymers 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920001515 polyalkylene glycol Polymers 0.000 description 1
- 229920000728 polyester Polymers 0.000 description 1
- 229920002643 polyglutamic acid Polymers 0.000 description 1
- 108010094020 polyglycine Proteins 0.000 description 1
- 229920000232 polyglycine polymer Polymers 0.000 description 1
- 229920000656 polylysine Polymers 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 108010026466 polyproline Proteins 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 239000011591 potassium Substances 0.000 description 1
- 229910052700 potassium Inorganic materials 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- MFDFERRIHVXMIY-UHFFFAOYSA-N procaine Chemical compound CCN(CC)CCOC(=O)C1=CC=C(N)C=C1 MFDFERRIHVXMIY-UHFFFAOYSA-N 0.000 description 1
- 229960004919 procaine Drugs 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 238000011321 prophylaxis Methods 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000001172 regenerating effect Effects 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000000754 repressing effect Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 206010039073 rheumatoid arthritis Diseases 0.000 description 1
- CVHZOJJKTDOEJC-UHFFFAOYSA-N saccharin Chemical compound C1=CC=C2C(=O)NS(=O)(=O)C2=C1 CVHZOJJKTDOEJC-UHFFFAOYSA-N 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- 235000004400 serine Nutrition 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 230000000392 somatic effect Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 230000035322 succinylation Effects 0.000 description 1
- 238000010613 succinylation reaction Methods 0.000 description 1
- OBTWBSRJZRCYQV-UHFFFAOYSA-N sulfuryl difluoride Chemical class FS(F)(=O)=O OBTWBSRJZRCYQV-UHFFFAOYSA-N 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 239000003826 tablet Substances 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- CNHYKKNIIGEXAY-UHFFFAOYSA-N thiolan-2-imine Chemical compound N=C1CCCS1 CNHYKKNIIGEXAY-UHFFFAOYSA-N 0.000 description 1
- 150000003573 thiols Chemical class 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 229960002175 thyroglobulin Drugs 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 231100000033 toxigenic Toxicity 0.000 description 1
- 230000001551 toxigenic effect Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 150000003626 triacylglycerols Chemical class 0.000 description 1
- 230000001810 trypsinlike Effects 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 244000052613 viral pathogen Species 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 230000001018 virulence Effects 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 238000009736 wetting Methods 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/24—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
- C07K14/245—Escherichia (G)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/50—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
- C12N9/503—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from viruses
- C12N9/506—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from viruses derived from RNA viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/70—Fusion polypeptide containing domain for protein-protein interaction
- C07K2319/735—Fusion polypeptide containing domain for protein-protein interaction containing a domain for self-assembly, e.g. a viral coat protein (includes phage display)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/34011—Potyviridae
- C12N2770/34022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
Definitions
- the instant invention relates generally to the identification of protease inhibitors. More
- the invention relates to methods of
- identifying viral protease inhibitors which can in turn be used to treat or prevent viral infection.
- Proteases are enzymes that cleave peptide bonds, thereby altering proteins. Besides degrading proteins, these enzymes play a regulatory role in a variety of physiological processes. Proteases fall into four general classes: serine, cysteine, aspartic acid, and metalloproteases. These classes are distinguished primarily by mechanism (Dunn, B.M., in Proteolytic
- Serine and cysteine proteases have almost identical two-step mechanisms with an acyl-enzyme intermediate. Together they comprise the majority of the known proteases. Aspartic and metallo-proteases catalyze direct hydrolysis of the peptide bond.
- procaryotic and eucaryotic proteins are synthesized as larger biologically inactive precursors which become activated only when acted upon by
- endoproteases proteinases
- proteins proteinsases
- endoproteases proteinsases
- proteins proteinsases
- These enzymes typically recognize specific domains, usually less than ten amino acids in length including the sissile bond, in exposed loops of generally loose secondary structure (Keil, B., in Methods in Protein Sequence Analysis, M. Elzinga, ed., Humana Press, Clifton, N.J., 1982, pp. 291-304).
- Maturation proteases are responsible for both intracellular and extracellular cleavage of protein precursors and many of the proteolytically processed proteins in turn play key roles in physiological
- intracellular proteolytic maturation include secreted proteins, lysosomal enzymes, mitochondrial proteins and membrane proteins. These proteins are highly diverse in function, having endocrine, neurological, and immune functions, as well as acting as growth factors and antibiotics. Secreted proteins that undergo
- extracellular proteolytic processing include the plasma zymogens involved in blood clotting and the immune complement system.
- Maturation proteases which are indirectly involved in human disease are generally distinguished by their high degree of substrate specificity. However, a host of digestive proteases of lesser specificity are also known and are more directly involved in diseases such as chronic inflammation and tumor metastasis. These enzymes include elastases, collagenases, mast cell proteases, and extracellular matrix-degrading metalloproteases, among others.
- Plant viruses which encoce proteases for this purpose include the
- potyviruses comoviruses, nepoviruses, sobemoviruses, and luteoviruses. These viruses cause economically important diseases in all major monocot and dicot families.
- Similarly equipped animal viruses include the
- picomaviruses picomaviruses, retroviruses, alphaviruses, flaviviruses, pestiviruses, coronaviruses, and adenoviruses.
- Diseases caused by these viruses include foot-and-mouth disease, AIDS, the common cold, hepatitis and polio.
- ZYMV Zika virus
- picomaviruses (Dougherty, W.G., et al., Virology (1989) 172:302-310: Bazan, J.F., and R.J. Fletterick, Proc.
- the invention herein is based on the discovery of a unique method for detecting peptide protease inhibitors. These inhibitors can be used directly or indirectly in the treatment of protease-dependent diseases. Alternatively, the inhibitors so identified can be utilized as structural models for the rational design of peptide-mimetics.
- the subject invention is directed to a method for detecting a protease inhibitor which comprises:
- the subject invention is directed to a DNA construct comprising:
- control sequences that are operably linked to the first and second coding sequences whereby the coding sequences can be transcribed and translated in a host cell, and at least one of the control sequences is heterologous to at least one of the coding sequences.
- the DNA construct further includes a second DNA coding sequence for the protease of interest.
- the subject invention is directed to host cells stably transformed with these DNA constructs.
- Figure 1 depicts Protease Inhibitor Selection System I, as applied to ZYMV protease.
- Figure 2 depicts representative examples of Protease Inhibitor Selection System II.
- Figure 3 shows the strategy of cDNA synthesis from ZYMV and cloning methods.
- FIG. 4 shows the nucleotide sequence of the
- Figure 5A shows the organization of the primary translation products of pZPro ⁇ , pZPro7 and placZ ⁇ -CP.
- Figure 5B depicts the results of immunoblot analysis of SDS/PAGE separated proteins from E. coli cells harboring these plasmids.
- Figure 6 depicts the derivation of the pZPro7, pZPro9, pZPro10, pZPro11 and pZPro12 constructs and the organization of the primary translation products.
- the open boxes denote ZYMV 49 kDa protease (Pro) cleavage sites.
- Strep R streptomycin-resistant.
- Amp R
- ampicillin-resistant i.e., transformed.
- Cfu colony- forming units.
- NT not tested.
- proteases an enzyme that cleaves a peptide bond.
- the term includes both endopeptidases (also called proteinases) which are proteases that hydrolyze internal peptide bonds, and exopeptidases, which are proteases that cleave either N- or C-terminal peptide bonds .
- endopeptidases also called proteinases
- exopeptidases which are proteases that cleave either N- or C-terminal peptide bonds .
- Some proteases are highly specific , cleaving only between two particular amino acids within a particular protein.
- Other proteases are less specific, cleaving between more than one amino acid pair and/or cleaving between an amino acid pair in more than one location in the same and/or different proteins.
- proteases include maturation proteases
- proteins proteins, lysosomal enzymes, mitochondrial proteins, membrane proteins, plasma zymogens, digestive enzymes, elastases, collagenases, mast cell proteases,
- extracellular matrix-degrading metalloproteinases such as proteases from potyviruses, comoviruses, nepoviruses, sobemoviruses, and
- luteoviruses luteoviruses
- animal viral proteases such as
- proteases from picomaviruses, retroviruses are proteases from picomaviruses, retroviruses,
- alphaviruses flaviviruses, pestiviruses, coronaviruses, and adenoviruses.
- protease inhibitor is meant a molecule capable of altering the activity of a protease such that the protease is unable to completely hydrolyze a peptide bond for which it is specific.
- Protease inhibitors can be peptides composed solely of genetically encodable amino acids.
- Protease inhibitor also encompasses synthetic peptide derivatives such as peptide aldehydes and ketones, peptide boronic acids, peptide chloromethyl ketones, azapeptides, peptide hydroxamic acids, and peptide thiols.
- protee inhibitor also encompasses synthetic nonpeptide compounds such as
- peptide and "protein” are used in their broadest sense, i.e., any polymer of genetically encodable amino acids (dipeptide or greater) linked through peptide bonds.
- polymer of genetically encodable amino acids linked through peptide bonds.
- oligopeptides polypeptides, protein fragments, muteins, fusion proteins and the like.
- a "host cell” is a cell which has been transformed, or is capable of transformation, by an exogenous nucleotide sequence.
- host cells for use in the present invention may be either procaryote or eucaryote, depending on the specific protease in question and the selection system desired.
- bacterial cells either gram-negative or gram-positive
- Eucaryotic cells can be used, however, in cases where either the protease or its dependent phenotype can be adequately expressed only in such cells, such as cases in which certain types of transport, metabolism, or post-translational modification are required.
- Eucaryotic cells can also be used to select inhibitors of other types of biological activities which can be expressed only in such cells, such as animal virus replication.
- One skilled in the art can readily determine an appropriate host cell for use in the present invention.
- a “replicon” is any genetic element (e.g., plasmid, chromosome, virus) that functions as an autonomous unit of DNA replication; i.e., capable of replication under its own control.
- a “vector” is a replicon, such as a plasmid, phage, or cosmid, to which another DNA segment may be attached so as to bring about the replication of the attached segment.
- double-stranded DNA molecule refers to the polymeric form of deoxyribonucleotides (bases adenine, guanine, thymine, or cytosine) in a double-stranded helix, both relaxed and supercoiled. This term refers only to the primary and secondary structure of the molecule, and does not limit it to any particular
- this term includes double-stranded DNA found, inter alia, in linear DNA molecules (e.g., restriction fragments), viruses, plasmids, and
- chromosomes In discussing the structure of particular double-stranded DNA molecules, sequences may be described herein according to the normal convention of giving only the sequence in the 5' to 3' direction along the
- nontranscribed strand of DNA i.e., the strand having the sequence homologous to the mRNA.
- a DNA "coding sequence” or a “nucleotide sequence encoding” a particular protein is a DNA
- a coding sequence can include, but is not limited to, procaryotic sequences, cDNA from eucaryotic mRNA, genomic DNA sequences from eucaryotic (e.g., mammalian) DNA, and even synthetic DNA sequences.
- a transcription termination sequence will usually be located 3' to the coding sequence.
- a “promoter sequence” is a DNA regulatory region capable of binding RNA polymerase in a cell and initiating transcription of a downstream (3' direction) coding sequence.
- the promoter sequence is bound at the 3' terminus by the translation start codon (ATG) of a coding sequence and extends upstream (5' direction) to include the minimum number of bases or elements necessary to initiate transcription at levels detectable above background.
- a transcription initiation site (conveniently defined by mapping with nuclease Sl), as well as protein binding domains (consensus sequences) responsible for the binding of RNA polymerase.
- Eucaryotic promoters will often, but not always, contain "TATA" boxes and "CAT” boxes.
- Procaryotic promoters contain Shine-Dalgarao sequences in addition to the -10 and -35 consensus sequences.
- control sequences refers collectively to promoter sequences, ribosome binding sites
- a coding sequence is "operably linked to" another coding sequence when RNA polymerase will
- telomere a sequence that is transcribed into mRNA, which is then translated into a chimeric polypeptide encoded by the two coding sequences.
- the coding sequences need not be contiguous to one another so long as the transcribed sequence is ultimately processed to produce the desired chimeric protein.
- a control sequence "directs the transcription" of a coding sequence in a cell when RNA polymerase will bind the promoter sequence and transcribe the coding sequence into mRNA, which is then translated into the polypeptide encoded by the coding sequence.
- a cell has been "transformed" by an exogenous nucleotide sequence when the sequence has been introduced inside the cell membrane.
- An exogenous nucleotide sequence may or may not be integrated (covalently linked) to chromosomal nucleic acid making up the genome of ?the cell.
- the exogenous nucleotide sequence may be maintained on an episomal element, such as a plasmid.
- a stably transformed cell is one in which the exogenous nucleotide sequence has become integrated into the chromosome so that it is inherited by daughter cells through chromosome replication. This stability is demonstrated by the ability of the
- a “clone” is a population of cells derived from a single cell or common ancestor by mitosis.
- a “cell line” is a clone of a primary cell that is capable of stable growth in vitro for many generations.
- a "heterologous" region of a DNA construct is an identifiable segment of DNA within or attached to another DNA molecule that is not found in association with the other molecule in nature.
- the heterologous region encodes a bacterial gene
- the gene will usually be flanked by DNA that does not flank the bacterial gene in the genome of the source bacteria.
- heterologous coding sequence is a construct where the coding sequence itself is not found in nature (e.g., synthetic sequences having codons different from the native gene). Allelic variation or naturally occurring mutational events do not give rise to a heterologous region of DNA, as used herein.
- treatment refers to either (i) the prevention of infection or reinfection (prophylaxis), or (ii) the reduction or elimination of symptoms of the disease of interest (therapy).
- Described herein is a system which can be used to select effective inhibitors of proteases from large pools of random peptide sequences.
- the method utilizes a known protease which can be obtained through standard techniques, i.e. direct isolation, synthesis or
- the nucleotide sequence of the protease can be determined and used to transform a host cell.
- the host cell is also transformed with a
- nucleotide sequence encoding a protein that confers a negative phenotype on the cell, such as sensitivity to a given antibiotic, or inability to grow on a given carbon source, which is dependent on the activity of the cloned protease.
- conferring the dependent phenotype are contained on one or more constructs which have been introduced into the host cell.
- inhibition of the protease confers a selectable phenotype on the cell (e.g., antibiotic resistance, or the ability to grow on a given carbon source).
- the particular inhibitor can be isolated, sequenced and further used as described below.
- the negative phenotype may be expressed by either of two general mechanisms.
- a gene conferring a dominant negative phenotype is expressed as an inactive precursor protein which is activated by protease-mediated cleavage at a site which has been engineered to resemble a natural substrate of the
- protease In the second mechanism, a gene conferring a selectable phenotype is inactivated by protease-mediated cleavage at a similarly engineered site.
- the above-described host cells can then be used to detect effective inhibitors of the protease from large pools of random peptides encoded on another plasmid.
- This additional plasmid contains a gene which encodes a "carrier" protein in which all or part of an exposed domain has been
- the randomized domain may range from four to fifteen amino acids in length.
- the length of the randomized domain may range from four to fifteen amino acids in length.
- peptides intended for use for the design of peptide mimetics will tend to have shorter sequences than
- peptides for use in peptide or gene therapy.
- Such random sequence “libraries” can be constructed by replacing the sequence encoding the exposed domain in the "carrier” protein gene with a set of synthetic oligodeoxynucleotides of random sequence.
- a natural substrate of the protease in question can be conveniently used for the "carrier” protein.
- one of the many well-characterized natural protease inhibitors may be used (Proteinase Inhibitors. A.J. Barrett and G. Salvesen, eds. (Elsevier, Amsterdam, 1986), section C), in which the amino acid sequence of the native binding domain has been randomized.
- flanking domains of the "carrier” may be minimized by flanking the randomized sequence with short "spacers" of polyproline or polyglycine which are highly flexible
- Some of these "random" peptides will, by chance, have structures which are capable of binding tightly to the active site of the protease, thereby preventing it from either activating or inactivating the negative phenotype-conferring protein, depending on the mechanism employed. This, in turn, will confer the selectable phenotype on the host cell when transformed with these constructs.
- the structures of effective inhibitors can then be determined by sequencing the appropriate regions of constructs recovered from such phenotype-selected cells.
- Protease Inhibitor Selection System I Protease Inhibitor Selection System 1 as applied to the ZYMV protease is illustrated in Figure 1.
- a portion of the ZYMV polyprotein is depicted which includes the protease, replicase (NIb), and coat protein (see U.S. Patent Application Serial No. 07/560,130).
- the arrows indicate substrate sites at which the protease cleaves the polyprotein.
- either the replicase or coat protein is replaced with the coding sequence for the protein
- E. coli ribosomal protein S12 which confers sensitivity to streptomycin on streptomycin- resistant hosts such as E. coli strain MC1009 (Post, L.E., and M. Nomura, J. Biol. Chem. (1980) 255:4660-4666).
- a transcription repressor protein may also be used such as the lactose, tryptophan, or phage lambda repressors (Lewin, B., Genes IV. Cell Press, Cambridge, 1990, pp. 240-264). These act by repressing expression of antibiotic resistance genes in hosts in which these genes are transcribed from repressible promoters. In either case the negative phenotype is only displayed when the negative phenotype protein is being actively cleaved out of the polyprotein by the protease.
- protease and the negative phenotype precursor may be convenient to express from separate transcription units.
- the only requirements are that the negative phenotype protein be linked to an extraneous domain by a peptide sequence which is a natural substrate for the protease in question, and that this precursor be inactive until cleaved by the protease.
- Figure 2 illustrates the second mechanism wherein the negative phenotype is conferred by protease-mediated inactivation of a protein conferring a
- Protease Inhibitor Selection System II selectable phenotype
- proteins include secreted or membrane proteins which confer resistance to the antibiotics ampicillin,
- protease substrate peptide sequence is inserted between the signal sequence and the mature protein such that cleavage by the protease renders the protein incapable of membrane transport or insertion and thereby inactive.
- the protease substrate sequence may be inserted into a surface domain of the mature protein such that cleavage by the protease renders the protein
- a special case of this selection system occurs with proteases which are toxic when expressed in E. coli by virtue of their fortuitous inactivation of one or more host proteins which are required for growth (Baum, E.Z., et al., Proc. Natl. Acad. Sci. USA (1990) 87:5573-5577).
- the inactivated host protein(s) confer the selectable phenotype in the presence of inhibitors of the protease.
- the random peptide inhibitor gene library may be delivered to the selector cells by any of several methods, the choice of which will depend to some extent on the size of the library.
- One skilled in the art can readily determine an acceptable technique to use with a given library. For example, chemical transformation with purified plasmid (Sambrook, J., et al., Molecular
- bacteriophage-derived vectors can be used for delivery by transduction.
- a plasmid vector can be converted to a cosmid vector (Sambrook, J., et al., Molecular
- transductions of the packaged cosmids into selector cells can then be accomplished by established methods.
- concatemers of the inhibitor gene library can be used instead of "stuffer" DNA in the cosmid to achieve the necessary size for packaging. This reduces, by an order of magnitude, the number of
- transformants that need to be screened to cover the entire library are transformedants that need to be screened to cover the entire library.
- the stringency of selection by these systems can be adjusted in a variety of ways.
- a number of transcriptional promoters and enhancers of varying strengths are available (Sambrook, J., et al., Molecular Cloning, Cold Spring Harbor Laboratory, 1989, ch. 17), which can be used with the protease, negative phenotype precursor, and inhibitor genes to raise or lower the inhibitor strength required for selection.
- Inducible promoters can be used, such that their strengths may be titrated by adjusting the amount of inducer in the growth medium. For example, by having the inhibitor expressed from an inducible promoter, the potency threshold of a pool of selected inhibitors is raised, and the size of the pool reduced, simply by reducing the amount of inducer present during selection.
- inhibitor inducibility can be used to counterselect stable false positives, such as revertants that have mutated the protease gene, simply by replica plating onto selective medium in the absence of inducer. Only the revertants are able to grow.
- protease inhibitor can be isolated and chemically characterized, using known techniques. These systems can be used to generate inhibitor peptides for any protease which can be
- proteases of many important plant and animal viral pathogens inhibitors of the proteases of other types of microbial pathogens as well as cellular proteases which have been implicated in such disorders as rheumatoid arthritis, Alzheimer's disease, and tumor metastasis, can also be identified.
- the instant invention can be used to identify protease inhibitors which in turn are useful in the treatment of protease-dependent diseases in both plants and animals.
- the inhibitors can be used directly in peptide therapy or can be encoded in a gene and used in gene therapy.
- the identified inhibitors can also serve as structural models for the rational design of peptide-mimetics, that is, synthetic compounds that mimic the protease-binding action of the identified protease inhibitors to bring reactive groups into contact with the protease active site. (See, e.g., Demuth, H.-U., J.
- invention also has a more general application in the construction and use of in vivo systems for the selection of bioactive peptides from peptide libraries.
- systems may be designed for the selection of peptide inhibitors of hydrolytic enzymes other than proteases.
- hydrolytic enzymes other than proteases.
- a number of such enzymes are known which can hydrolyze natural or artificial substrates to produce one or more compounds which are toxic to E. coli (for
- any phenotype of cultured procaryotic or eucaryotic cells which can be altered by the endogenous expression of appropriate peptides such that cells expressing such peptides can be readily distinguished and isolated from cells which either do not express such peptides or which express peptides which do not alter the phenotype, may provide the basis for establishing an in vivo system for the selection of bioactive peptides from peptide libraries.
- most tractable medically important phenotypes will be those manifesting susceptibility to microbial
- the endogenous expression of a random peptide library as an exposed domain of a suitable stable "carrier" protein in a population of cultured mammalian cells of sufficient size to ensure that all or most members of the library are represented in the population may be used to select peptides which interfere with the ability of microbial pathogens such as viruses or bacteria or their toxins to inhibit cell growth.
- microbial pathogens such as viruses or bacteria or their toxins
- Such cell populations are challenged by such pathogens or toxins, only those cells expressing inhibitory peptides will grow, allowing the active peptides to be identified by established methods.
- Such peptides can in turn be used for the development of effective therapies.
- the identified inhibitors can be used to create transgenic plants.
- One commonly used method of gene transfer in plants involves the insertion of the gene of interest into the T-DNA region of a Ti or Ri plasmid derived from A. tumefaciens or A. rhizogenes, respectively.
- Many control sequences are known which when coupled to a heterologous coding sequence and transformed into a host organism show fidelity in gene expression with respect to tissue/organ specificity of the original coding sequence. See, e.g., Benfey, P.N., and Chua, N.H., Science (1989) 244:174-181.
- Suitable control sequences for use in these plasmids include promoters for constitutive leaf-specific expression of the desired gene in the various target plants.
- Other useful control sequences include a
- NOS nopaline synthase gene
- vir vir
- the virulence (vir) gene from either the Ti or Ri plasmid must also be present, either along with the T-DNA portion, or via a binary system where the vir gene is present on a separate vector.
- Such systems, vectors for use therein, and methods of transforming plant cells are described in U.S. Patent No. 4,658,082, and Simpson, R.B., et al., Plant Mol. Biol. (1986) 6:403-415, incorporated herein by reference in their entirety.
- these plasmids can be placed into A. rhizogenes or A. tumefaciens and these vectors used to transform cells of plant species which are ordinarily susceptible to the particular plant pathogen.
- A. tumefaciens or A. rhizogenes will depend on the plant being transformed thereby.
- A. tumefaciens is the preferred organism for transformation. Most dicotyledons, some gymnosperms, and a few monocotyledons (e.g., certain members of the
- A. rhizogenes also has a wide host range, embracing most dicots and some gymnosperms, which
- these cells can be used to regenerate transgenic plants.
- whole plants can be infected with these vectors by wounding the plant and then introducing the vector into the wound site. Any part of the plant can be wounded, including leaves, stems and roots.
- plant tissue in the form of an explant, such as cotyledonary tissue or leaf disks, can be inoculated with these vectors and cultured under conditions which promote plant regeneration. Roots or shoots transformed by inoculation of plant tissue with A. rhizogenes or A. tumefaciens. containing the desired gene, can be used as a source of plant tissue to
- the inhibitors identified by the present method can also be used in gene therapy.
- HIV-specific protease inhibitor genes in which a natural mammalian protease inhibitor serves as carrier for the HIV protease inhibitor domain, can be used in anti-AIDS gene therapy.
- Lymphocytes or bone marrow cells from the patient can be transformed with the protease inhibitor gene in vitro and returned to the patient, where they establish an HIV-resistant subpopulation of lymphocytes which can gradually restore cell-mediated immune function as the patient's untransformed lymphocytes are depleted by the virus.
- proteases active in blood, lymph, or cerebro-spinal fluid which are essential components of disorders such as chronic inflammations, metastatic cancers, and certain viral infections, may be targeted by protease inhibitor gene therapy, in which the inhibitors are secreted by transgenic lymphocytes or other
- the inhibitors identified by the present method can be altered by established methods to improve their pharmaco-kinetic properties.
- the inhibitors may be
- a carrier for example, a statin, a statin, a statin, a statin, a statin, a statin, a statin, a statin, a statin, a statin, a statin, a statin, a statin, a statin, a statin, a statin, a statin, a statin, a statin, a statin, a statin, a statin
- Suitable carriers are typically large, slowly metabolized macromolecules such as: proteins; polysaccharides, such as sepharose, agarose, cellulose, cellulose beads and the like; polymeric amino acids such as polyglutamic acid, polylysine, and the like; amino acid copolymers; and inactive virus particles.
- Especially useful protein substrates are serum albumins, keyhole limpet hemocyanin, immunoglobulin molecules, thyroglobulin, ovalbumin, and other proteins well known to those skilled in the art.
- the protein substrates may be used in their native form or their functional group content may be modified by, for example, succinylation of lysine
- a sulfhydryl group may also be incorporated into the carrier (or inhibitor) by, for example, reaction of amino functions with 2-iminothiolane or the N-hydroxysuccinimide ester of 3-(4-dithiopyridyl) propionate.
- Suitable carriers may also be modified to incorporate spacer arms (such as hexamethylene diamine or other bifunctional molecules of similar size) for attachment of peptides. Methods of coupling peptides to proteins or cells are known to those of skill in the art.
- compositions are prepared as injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to injection may also be prepared.
- the preparation may also be emulsified or the active ingredient encapsulated in liposome vehicles.
- the active immunogenic ingredient is often mixed with vehicles containing excipients which are pharmaceutically acceptable and compatible with the active ingredient. Suitable vehicles are, for example, water, saline, dextrose, glycerol, ethanol, or the like, and combinations thereof.
- the vehicle may contain minor amounts of auxiliary substances such as wetting or emulsifying agents, or pH buffering agents.
- auxiliary substances such as wetting or emulsifying agents, or pH buffering agents.
- Actual methods of preparing such dosage forms are known, or will be apparent, to those skilled in the art. See, e.g., Remington's Pharmaceutical Sciences, Mack Publishing Company, Easton, Pennsylvania, 15th edition, 1975.
- the composition or formulation to be administered will, in any event, contain a quantity of the inhibitor adequate to achieve the desired effect in the individual being treated.
- suppositories and, in some cases, aerosol, intranasal, oral formulations, and sustained release formulations.
- vehicle composition will include traditional binders and carriers, such as, polyalkylene glycols, or
- triglycerides Such suppositories may be formed from mixtures containing the active ingredient in the range of about 0.5V to about 10% (w/w), preferably about 1% to about 2%.
- Oral vehicles include such normally employed excipients as, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium, stearate, sodium saccharin cellulose, magnesium carbonate, and the like. These oral compositions may be taken in the form of solutions, suspensions, tablets, pills, capsules,
- sustained release formulations or powders, and contain from about 10% to about 95% of the active ingredient, preferably about 25% to about 70%.
- Intranasal formulations will usually include vehicles that neither cause irritation to the nasal mucosa nor significantly disturb ciliary function.
- Diluents such as water, aqueous saline or other known substances can be employed with the subject invention.
- the nasal formulations may also contain preservatives such as, but not limited to, chlorobutanol and
- a surfactant may be present to enhance absorption of the subject proteins by the nasal mucosa.
- Controlled or sustained release formulations are made by incorporating the inhibitor into carriers or vehicles such as liposomes, nonresorbable impermeable polymers such as ethylenevinyl acetate copolymers and Hytrel ® copolymers, swellable polymers such as hydrogels, or resorbable polymers such as collagen and certain polyacids or polyesters such as those used to make resorbable sutures.
- carriers or vehicles such as liposomes, nonresorbable impermeable polymers such as ethylenevinyl acetate copolymers and Hytrel ® copolymers, swellable polymers such as hydrogels, or resorbable polymers such as collagen and certain polyacids or polyesters such as those used to make resorbable sutures.
- the inhibitors can also be delivered using implanted mini-pumps, well known in the art.
- the inhibitors may be formulated into pharmaceutical compositions in either neutral or salt forms.
- Pharmaceutically acceptable salts include the acid addition salts (formed with the free amino groups of the active polypeptides) and which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed from free carboxyl groups may also be derived from inorganic bases such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed from free carboxyl groups may also be derived from inorganic bases such as, for
- sodium, potassium, ammonium, calcium, or ferric hydroxides examples include sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine,
- procaine and the like.
- the inhibitor of interest is administered parenterally, usually by
- injectable formulations will contain an effective amount of the active ingredient in a vehicle, the exact amount being readily determined by one skilled in the art.
- the active ingredient may typically range from about 1% to about 95% (w/w) of the composition, or even higher or lower if appropriate.
- the quantity to be administered depends on the animal to be treated and the particular inhibitor used. Effective dosages can be readily established by one of ordinary skill in the art through routine trials establishing dose response curves.
- the subject is treated by administration of the
- the subject may be administered as many doses as is required to effectively treat the individual.
- This example describes the construction and expression in E. coli of a gene which encodes a portion of the ZYMV polyprotein.
- the primary translation product of this gene is a 140 kDa protein which includes the 49 kDa protease and flanking cleavage sites, a portion of the nuclear inclusion 'b' protein (NIb, also referred to as the replicase), including the NIb/coat protein
- a California isolate of ZYMV was obtained from Professor J.A. Dodds of the University of California at Riverside. The virus was propagated by mechanical inoculation of the cotyledons of ten-day-old Cucurbita pepo cv. early straightneck seedlings. Systemically infected leaves were harvested 3-8 weeks after
- the virus was quantified by absorbance at 260 nm using an extinction coefficient of 2.8 A 260 /mg/ml.
- Viral genomic RNA was isolated from purified virions by digestion with protease K in borate buffer (pH 9) containing 1% SDS and 4 mM EDTA for one hour at 37°C, followed by phenol/chloroform extraction and ethanol precipitation. The RNA was redispersed in water,
- PZRl the largest cDNA clone obtained from the first experiment, had a 2.3 kb insert, the ends of which were sequenced using the Sanger dideoxy chain-terminating method as described in the product literature for the Sequenase 2 sequencing kit (United States Biologicals) with the M13 universal and reverse primers encoded on either end of the multiple cloning site in pBluescript.
- the orientation of the insert relative to the viral genome and the multiple cloning site was indicated by the appearance of the polyadenylate tract from the 3' end of the genome in the sequence from the reverse primer.
- the remainder of the clone was sequenced stepwise, 200-300 nucleotides at a time, in both directions from synthetic oligodeoxynucleotide primers complementary to the distal ends of each of the successive sequencing runs. Sequence data were processed and analyzed on a DEC VAX 11/750 minicomputer.
- the second round of cDNA cloning was accomplished in the same manner as the first except that a synthetic oligodeoxynucleotide complementary to the 5' end of pZRl was used as primer and the cDNAs were ligated directly, without linkers, into the EcoRV site of
- sequences of adjacent time points to overlap were filled in using synthetic oligodeoxynucleotide primers made from the sequence near the 5' end of the gap.
- Clone pZB60 was sequenced in both directions from nested deletions in the same manner as for pZB11 except that a unique NcoI site within pZB11 was used as the starting point for deletions in the 5' direction and only those clones with deletions mapping between the 5' end of pZB11 and the 5' end of pZB60 were sequenced.
- a third round of cDNA cloning was conducted as described above for the preparation of pZB11 and pZB60 except that a synthetic oligomer complementary to the 5' end of pZB60 was used as a primer. From this round, pZF18, having an insert of 3.7 kb was obtained and sequenced as described above for pZB11 and pZB60.
- the 5' end of the viral RNA sequence was determined by reverse transcription of purified viral RNA using a synthetic oligonucleotide primer complementary to nucleotides 76-99 at the 5' end of pZF18.
- the Sanger dideoxynucleotide chain-terminating method was used essentially as described in the Promega Gem Seq manual (Promega Corp.).
- the continuous open reading frame of the viral genome was identified with the aid of a computer as described above.
- the coding sequences of the functional ZYMV gene products were identified by amino acid sequence homology to those of other potyviruses (Allison, R., et al., Virology (1986) 154:9-20: Domier, L.L., et al.,
- Figure 4 shows the nucleotide sequence of the ZYMV genome as determined above along with the deduced amino acid sequence.
- the nucleotide sequence is numbered from the 5' terminus.
- the 5' non-coding region extends from nucleotide 1 to nucleotide 139.
- Nucleotides 140-142 initiate the polyprotein coding sequence with a
- the cleavage site between the 46 kDa protein and the cytoplasmic inclusion protein (CI) is believed to occur between the glutamine at codon 1164 (nucleotides 3629-3631) and the glycine at codon 1165 (nucleotides 3632-3634).
- the cleavage site between CI and VPg/protease (VPg and protease are probably not separated in ZYMV) is believed to occur between the glutamine at codon 1798 (nucleotides 5531-5533) and the serine at codon 1799 (nucleotides 5534-5536).
- the cleavage site between VPg/protease and RNA replicase is believed to occur between the glutamine at codon 2284 (nucleotides 6989-6991) and the serine at codon 2285 (nucleotides 6992-6994).
- the cleavage site between the RNA replicase and the coat protein (CP) is believed to occur between the glutamine at codon 2801 (nucleotides 8540-8542) and the serine at codon 2802 (nucleotides 8543-8545). Termination of the polyprotein coincides with termination of the coat protein and is believed to occur at the stop codon (nucleotides 9380-9382) following the glutamine at codon 3080.
- the 3' non-coding sequence then extends from nucleotide 9383 to nucleotide 9593 before terminating in a polyadenylate sequence of
- cDNA clone pZRl contained approximately 80 adenosines at its 3' terminus.
- pZPro5
- a restriction fragment of 1666 base pairs (bp) extending between the PvuII and SspI sites of ZYMV cDNA clone pZB11 (described above) was isolated by agarose gel electrophoresis and ligated into the Smal site of plasmid pTZ18U (Sambrook, J., et al., Molecular Cloning, Cold Spring Harbor Laboratory, 1989; Mead D.A., et al.,
- This restriction fragment comprises a portion of the coding sequence of the ZYMV polyprotein which includes part of the
- cytoplasmic inclusion protein (CI)
- 6 kDa protein the 6 kDa protein
- 49 kDa protease the 4 kDa protease
- NIb protein a portion of the NIb protein. Insertion of this fragment into the Smal site of pTZ18U places the reading frame encoding these proteins in phase with that of the expressible lacZ ⁇ gene of pTZ18U such that expression of this gene from the lac promoter is expected to produce a fusion protein comprised of a small portion of the lacZ ⁇ peptide fused to the amino terminus of the ZYMV polyprotein fragment.
- This construct was denoted pZPro5 and its structure was confirmed by
- pZPro6 and pZPro7 The 2280 bp SalI restriction fragment from ZYMV cDNA clone pZR1 (described above), which comprises a portion of the ZYMV genome including part of the NIb protein, CP, the 3' non-coding sequence, and a portion of the polyadenylate sequence, was inserted into the SalI site of pZPro5 to create pZPro6.
- Dideoxynucleotide sequencing of pZPro6 confirmed that the NIb/CP-encoding reading frame of the inserted fragment was in phase with the open reading frame of pZPro5 such that expression of this construct from the lac promoter is expected to produce a single polypeptide of approximately 140 kDa.
- strain DH5 ⁇ was transformed with pZPro6 and pZPro7, and transformed clones were identified and isolated by selection for ampicillin resistance.
- lacZ ⁇ -ZYMV polyprotein gene was monitored by immunoblotting of SDS/PAGE-resolved proteins from these cells using
- This example describes the construction of expressible genes encoding polyproteins which contain the ZYMV 49 kDa protease and the E. coli ribosomal protein S12. The ability of these gene constructs to confer sensitivity to the antibiotic streptomycin on several streptomycin-resistant E. coli strains by virtue of correct and efficient processing of the polyprotein by the ZYMV 49 kDa protease is demonstrated.
- Streptomycin lethality in E. coli has been ascribed to its ability to interfere with protein synthesis by binding to the S12 subunit of the 30S component of the ribosome (Gorini, L., in Ribosomes, M. Nomura, A. Tissieres, P. Lengyel, eds., Cold Spring Harbor Laboratory, 1974, pp. 791-803). Streptomycin- resistant mutants have been isolated which express altered forms of S12 which retain the ability to
- the CP-encoding sequences in pZPro7 were precisely replaced with the coding sequence for S12 to create pZPro9. This was accomplished as follows.
- the sequence bounded by the Bglll site in NIb and the P1' position of the Nlb/CP cleavage site in pZPro7 was amplified by polymerase chain reaction (PCR, Saiki, R.K., et al., Science (1988) 239:487).
- the S12 coding sequence from the second amino acid to the end was amplified by PCR from plasmid pNO1523 (Dean, D., Gene
- the 3' primer contained an MluI site following the stop codon.
- pZPro9 was then transformed into streptomycin-resistant EL. coli strains MC1009, HB101, and N100
- the Nlb/CP cleavage site was removed from pZPro9 to create pZPro12, which should produce a polyprotein from which S12 cannot be freed by the protease. This was accomplished by cleaving pZPro9 with EcoRV and Hpal, which removed most of the NIb protein including the Nlb/CP cleavage site , and replacing it with the fragment produced by EcoRV alone, which restored most of the NIb protein down to within 12 amino acids of the Nlb/CP cleavage site (see Figure 6). The structure of pZPro12 was confirmed by dideoxynucleotide sequencing. Transformation with pZPro12 has no
- the strep-sensitive phenotype produced by pZPro9 is completely dependent on the presence of a substrate cleavage site at which the protease can cleave functional S12 from the polyprotein.
- pZPro9 polyprotein
- the expression of the pZPro9 polyprotein places a considerable burden on growing E. coli cells.
- the pZPro9 was streamlined by removing the EcoRV fragment described above, which contains most of the NIb protein exclusive of the cleavage sites at either end.
- This construct denoted pZPro10, encodes a
- strep-resistant E. coli strains displayed a strep-sensitive phenotype identical to that of the pZPro9 transformants.
- pZPro10 transformants grew considerably more vigorously than pZPro9 transformants, indicating a significant reduction in the metabolic burden on the host cells.
- inhibitors from peptide libraries or to confirm selected inhibitors on the basis of their ability to restore rapid growth.
- the protease removes itself from the polyprotein of pZPro10, the 14 kDa S12 is left in a 21.5 kDa precursor until freed by the protease.
- the NIb/CP cleavage site was removed from pZPro10 in the same manner that the EcoRV-Hpal restriction fragment was removed from pZPro9 to create pZPro12.
- GATAGTCCTC CGCAAGCCGA GTAAGCAGCG GGTTTTCGCT CGTATCGAGC AGGATGAGGC 480
- TTACAAGCGC ACATACAAGA AGGAAAGGAA GAAAGTGGCG CAAAAGCAAA TTGTGCAAGC 660
- AAAATCAACG AGCTTACCTG CACATCTTGC CAAGAAGGGT AAGGTGTTAC TACTCGAACC 3960
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Virology (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Wood Science & Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biophysics (AREA)
- Zoology (AREA)
- Gastroenterology & Hepatology (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
Abstract
Methods for detecting protease inhibitors are disclosed. Also described are DNA constructs and host cells transformed with these constructs for use in the subject methods. The methods utilize a host cell which exhibits a negative phenotype dependent on the activity of a given protease. Thus, inhibition of the protease confers a selectable phenotype on the cell. The negative phenotype can be conferred by either protease-mediated activation or inactivation of a protein conferring a selectable phenotype. The inhibitor is detected by transforming host cells expressing the genes for the selectable phenotype and a given protease with random peptide sequences. Inhibitors so identified can be used either directly or indirectly in the treatment of protease-dependent disorders.
Description
A METHOD FOR THE IDENTIFICATION OF PROTEASE INHIBITORS
Technical Field
The instant invention relates generally to the identification of protease inhibitors. More
particularly, the invention relates to methods of
identifying viral protease inhibitors which can in turn be used to treat or prevent viral infection.
Background
Proteases are enzymes that cleave peptide bonds, thereby altering proteins. Besides degrading proteins, these enzymes play a regulatory role in a variety of physiological processes. Proteases fall into four general classes: serine, cysteine, aspartic acid, and metalloproteases. These classes are distinguished primarily by mechanism (Dunn, B.M., in Proteolytic
Enzymes, R.J. Geynon and J.S. Bond, eds., IRL Press, Oxford, 1989, pp. 57-82). Serine and cysteine proteases have almost identical two-step mechanisms with an acyl-enzyme intermediate. Together they comprise the majority of the known proteases. Aspartic and metallo-proteases catalyze direct hydrolysis of the peptide bond.
Many procaryotic and eucaryotic proteins are synthesized as larger biologically inactive precursors which become activated only when acted upon by
endoproteases (proteinases). These enzymes typically recognize specific domains, usually less than ten amino acids in length including the sissile bond, in exposed loops of generally loose secondary structure (Keil, B.,
in Methods in Protein Sequence Analysis, M. Elzinga, ed., Humana Press, Clifton, N.J., 1982, pp. 291-304).
Maturation proteases are responsible for both intracellular and extracellular cleavage of protein precursors and many of the proteolytically processed proteins in turn play key roles in physiological
abnormalities which give rise to disease states (Andrews, P.C., et al., Experientia (1987) 43:784-789; Reich, E., et al., Cold Spring Harbor Symposium: Proteases and Biological Control, Cold Spring Harbor Laboratory, Cold Spring Harbor, 1975). Proteins that undergo
intracellular proteolytic maturation include secreted proteins, lysosomal enzymes, mitochondrial proteins and membrane proteins. These proteins are highly diverse in function, having endocrine, neurological, and immune functions, as well as acting as growth factors and antibiotics. Secreted proteins that undergo
extracellular proteolytic processing include the plasma zymogens involved in blood clotting and the immune complement system.
Maturation proteases which are indirectly involved in human disease are generally distinguished by their high degree of substrate specificity. However, a host of digestive proteases of lesser specificity are also known and are more directly involved in diseases such as chronic inflammation and tumor metastasis. These enzymes include elastases, collagenases, mast cell proteases, and extracellular matrix-degrading metalloproteases, among others.
Proteases also play key roles in many infectious diseases. An obligatory step in the
replication of many pathogenic plant and animal viruses involves virus-determined proteolytic processing of the primary viral gene products (Hellen, C.U.T., et al. ,
Biochemistry (1989) 28:9881-9890). Plant viruses which encoce proteases for this purpose include the
potyviruses, comoviruses, nepoviruses, sobemoviruses, and luteoviruses. These viruses cause economically important diseases in all major monocot and dicot families.
Similarly equipped animal viruses include the
picomaviruses, retroviruses, alphaviruses, flaviviruses, pestiviruses, coronaviruses, and adenoviruses. Diseases caused by these viruses include foot-and-mouth disease, AIDS, the common cold, hepatitis and polio.
For example, Zucchini Yellow Mosaic Virus
(ZYMV) , a potyvirus, expresses its genome as a single 350 kDa polyprotein which is cleaved into at least seven mature gene products by three distinct proteolytic activities. Two of the proteases are virus-encoded
(Dougherty, W.G., and J.C. Carrington, Ann. Rev.
Phytopathol. (1988) 26:123-143; Carrington, J.C., et al., EMBO J. (1990) 9:1347-1353), including the potyviral 49 kDa protease. This protease is responsible for at least five of the seven cleavages. This enzyme is a trypsin-like cysteine protease which is structurally and
mechanistically representative of the largest class of viral proteases, including those of the animal
picomaviruses (Dougherty, W.G., et al., Virology (1989) 172:302-310: Bazan, J.F., and R.J. Fletterick, Proc.
Natl. Acad. Sci. USA (1988) 85 :7872-7876). This enzyme is highly specific and appears to recognize a region comprised of about seven amino acids surrounding the sissile bond (Dougherty, W.G., and T.D. Parks, Virology (1989) 172:145-155). Of the five sites cleaved by this enzyme, the two flanking the protease appear to be cleaved intramolecularly, while the remaining three appear to be cleaved intermolecularly (Garcia, J.A., et al., J. Gen. Virol. (1990) 71:2773-2779). Of the latter
three, the site between the NIb protein and the coat protein appears to be the most active.
Disclosure of the Invention
The invention herein is based on the discovery of a unique method for detecting peptide protease inhibitors. These inhibitors can be used directly or indirectly in the treatment of protease-dependent diseases. Alternatively, the inhibitors so identified can be utilized as structural models for the rational design of peptide-mimetics.
Accordingly, in one embodiment, the subject invention is directed to a method for detecting a protease inhibitor which comprises:
(a) providing a population of host cells expressing a first nucleic acid sequence encoding a protease and a second nucleic acid sequence encoding a protein capable of conferring a selectable phenotype on said host cells dependent on the activity of said protease;
(b) providing a pool of nucleic acid constructs wherein at least one of the constructs in the pool comprises a nucleic acid sequence encoding an inhibitor of the protease;
(c) transforming the host cells of (a) with the nucleic acid constructs of (b); and
(d) growing the transformed host cells of (c) under conditions that distinguish cells with the
selectable phenotype, thereby detecting the presence of the protease inhibitor.
In another embodiment, the subject invention is directed to a DNA construct comprising:
(a) a first DNA coding sequence for a protein capable of conferring a selectable phenotype on a host
cell transformed therewith, the selectable phenotype dependent on the activity of a protease; and
(b) control sequences that are operably linked to the first and second coding sequences whereby the coding sequences can be transcribed and translated in a host cell, and at least one of the control sequences is heterologous to at least one of the coding sequences.
In an alternate embodiment, the DNA construct further includes a second DNA coding sequence for the protease of interest.
In yet another embodiment, the subject invention is directed to host cells stably transformed with these DNA constructs.
These and other embodiments of the subject invention will readily occur to those of ordinary skill in the art in view of the disclosure herein..
Brief Description of the Figures
Figure 1 depicts Protease Inhibitor Selection System I, as applied to ZYMV protease.
Figure 2 depicts representative examples of Protease Inhibitor Selection System II.
Figure 3 shows the strategy of cDNA synthesis from ZYMV and cloning methods.
Figure 4 shows the nucleotide sequence of the
ZYMV genome (SEQ ID NO:1).
Figure 5A shows the organization of the primary translation products of pZProβ, pZPro7 and placZα-CP. Figure 5B depicts the results of immunoblot analysis of SDS/PAGE separated proteins from E. coli cells harboring these plasmids.
Figure 6 depicts the derivation of the pZPro7, pZPro9, pZPro10, pZPro11 and pZPro12 constructs and the organization of the primary translation products. The
open boxes denote ZYMV 49 kDa protease (Pro) cleavage sites. StrepR = streptomycin-resistant. AmpR =
ampicillin-resistant, i.e., transformed. Cfu = colony- forming units. NT = not tested.
Detailed Description of the Invention
The practice of the present invention will employ, unless otherwise indicated, conventional
techniques of molecular biology, microbiology, virology, recombinant DNA technology, and immunology, which are within the skill of the art. Such techniques are
explained fully in the literature. See, e.g., Sambrook, Fritsch & Maniatis, Molecular Cloning: A Laboratory Manual, Second Edition (1989); Maniatis, Fritsch &
Sambrook, Molecular Cloning: A Laboratory Manual (1982); DNA Cloning, Vols. I and II (D.N. Glover ed. 1985);
Oligonucleotide Synthesis (M.J. Gait ed. 1984); Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins eds 1984); Animal Cell Culture (R.K. Freshney ed. 1986); Immobilized Cells and Enzvmes (IRL press, 1986); B. Perbal, A
Practical Guide to Molecular Cloning (1984); the series, Methods In Enzymology (S. Colowick and N. Kaplan eds., Academic Press, Inc.); and Handbook of Experimental
Immunology, Vols. I-IV (D.M. Weir and C.C. Blackwell eds., 1986, Blackwell Scientific Publications).
All patents, patent applications, and publications mentioned herein, whether supra or infra, are hereby incorporated by reference in their entirety. A. Definitions
In describing the present invention, the following terms will be employed, and are intended to be defined as indicated below.
By "protease" is meant an enzyme that cleaves a peptide bond. The term includes both endopeptidases (also called proteinases) which are proteases that hydrolyze internal peptide bonds, and exopeptidases, which are proteases that cleave either N- or C-terminal peptide bonds . Some proteases are highly specific , cleaving only between two particular amino acids within a particular protein. Other proteases are less specific, cleaving between more than one amino acid pair and/or cleaving between an amino acid pair in more than one location in the same and/or different proteins.
Exemplary proteases include maturation proteases
responsible for both intracellular and extracellular cleavage of protein precursors, such as secreted
proteins, lysosomal enzymes, mitochondrial proteins, membrane proteins, plasma zymogens, digestive enzymes, elastases, collagenases, mast cell proteases,
extracellular matrix-degrading metalloproteinases; plant viral proteases such as proteases from potyviruses, comoviruses, nepoviruses, sobemoviruses, and
luteoviruses; and animal viral proteases such as
proteases from picomaviruses, retroviruses,
alphaviruses, flaviviruses, pestiviruses, coronaviruses, and adenoviruses.
By "protease inhibitor" is meant a molecule capable of altering the activity of a protease such that the protease is unable to completely hydrolyze a peptide bond for which it is specific. Protease inhibitors can be peptides composed solely of genetically encodable amino acids. "Protease inhibitor" also encompasses synthetic peptide derivatives such as peptide aldehydes and ketones, peptide boronic acids, peptide chloromethyl ketones, azapeptides, peptide hydroxamic acids, and peptide thiols. "Protease inhibitor" also encompasses
synthetic nonpeptide compounds such as
diisopropylphosphofluoridate, sulfonylfluorides,
phosphoramidon, and halomethylcoumarins. For a detailed discussion of protease inhibitors, see Proteinase
Inhibitors, A.J. Barrett and G. Salvesen, eds. (Elsevier, Amsterdam, 1986).
The terms "peptide" and "protein" are used in their broadest sense, i.e., any polymer of genetically encodable amino acids (dipeptide or greater) linked through peptide bonds. Thus, the terms include
oligopeptides, polypeptides, protein fragments, muteins, fusion proteins and the like.
A "host cell" is a cell which has been transformed, or is capable of transformation, by an exogenous nucleotide sequence. As described more fully below, host cells for use in the present invention may be either procaryote or eucaryote, depending on the specific protease in question and the selection system desired. In general, bacterial cells (either gram-negative or gram-positive) are the hosts of choice when the protease and its dependent phenotype can be expressed in active form in these cells. Eucaryotic cells can be used, however, in cases where either the protease or its dependent phenotype can be adequately expressed only in such cells, such as cases in which certain types of transport, metabolism, or post-translational modification are required. Eucaryotic cells can also be used to select inhibitors of other types of biological activities which can be expressed only in such cells, such as animal virus replication. One skilled in the art can readily determine an appropriate host cell for use in the present invention.
A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) that functions as an
autonomous unit of DNA replication; i.e., capable of replication under its own control.
A "vector" is a replicon, such as a plasmid, phage, or cosmid, to which another DNA segment may be attached so as to bring about the replication of the attached segment.
A "double-stranded DNA molecule" refers to the polymeric form of deoxyribonucleotides (bases adenine, guanine, thymine, or cytosine) in a double-stranded helix, both relaxed and supercoiled. This term refers only to the primary and secondary structure of the molecule, and does not limit it to any particular
tertiary forms. Thus, this term includes double-stranded DNA found, inter alia, in linear DNA molecules (e.g., restriction fragments), viruses, plasmids, and
chromosomes. In discussing the structure of particular double-stranded DNA molecules, sequences may be described herein according to the normal convention of giving only the sequence in the 5' to 3' direction along the
nontranscribed strand of DNA (i.e., the strand having the sequence homologous to the mRNA).
A DNA "coding sequence" or a "nucleotide sequence encoding" a particular protein, is a DNA
sequence which is transcribed and translated into a polypeptide in vivo when placed under the control of appropriate regulatory sequences. The boundaries of the coding sequence are determined by a start codon at the 5' (amino) terminus and a translation stop codon at the 3' (carboxy) terminus. A coding sequence can include, but is not limited to, procaryotic sequences, cDNA from eucaryotic mRNA, genomic DNA sequences from eucaryotic (e.g., mammalian) DNA, and even synthetic DNA sequences. A transcription termination sequence will usually be located 3' to the coding sequence.
A "promoter sequence" is a DNA regulatory region capable of binding RNA polymerase in a cell and initiating transcription of a downstream (3' direction) coding sequence. For purposes of defining the present invention, the promoter sequence is bound at the 3' terminus by the translation start codon (ATG) of a coding sequence and extends upstream (5' direction) to include the minimum number of bases or elements necessary to initiate transcription at levels detectable above background. Within the promoter sequence will be found a transcription initiation site (conveniently defined by mapping with nuclease Sl), as well as protein binding domains (consensus sequences) responsible for the binding of RNA polymerase. Eucaryotic promoters will often, but not always, contain "TATA" boxes and "CAT" boxes.
Procaryotic promoters contain Shine-Dalgarao sequences in addition to the -10 and -35 consensus sequences.
DNA "control sequences" refers collectively to promoter sequences, ribosome binding sites,
polyadenylation signals, transcription termination sequences, upstream regulatory domains, enhancers, and the like, which collectively provide for the
transcription and translation of a coding sequence in a host cell.
A coding sequence is "operably linked to" another coding sequence when RNA polymerase will
transcribe the two coding sequences into mRNA, which is then translated into a chimeric polypeptide encoded by the two coding sequences. The coding sequences need not be contiguous to one another so long as the transcribed sequence is ultimately processed to produce the desired chimeric protein.
A control sequence "directs the transcription" of a coding sequence in a cell when RNA polymerase will
bind the promoter sequence and transcribe the coding sequence into mRNA, which is then translated into the polypeptide encoded by the coding sequence.
A cell has been "transformed" by an exogenous nucleotide sequence when the sequence has been introduced inside the cell membrane. An exogenous nucleotide sequence may or may not be integrated (covalently linked) to chromosomal nucleic acid making up the genome of ?the cell. In procaryotes and yeasts, for example, the exogenous nucleotide sequence may be maintained on an episomal element, such as a plasmid. With respect to most other eucaryotic cells, a stably transformed cell is one in which the exogenous nucleotide sequence has become integrated into the chromosome so that it is inherited by daughter cells through chromosome replication. This stability is demonstrated by the ability of the
eucaryotic cell to establish cell lines or clones
comprised of a population of daughter cell containing the exogenous sequence.
A "clone" is a population of cells derived from a single cell or common ancestor by mitosis. A "cell line" is a clone of a primary cell that is capable of stable growth in vitro for many generations.
A "heterologous" region of a DNA construct is an identifiable segment of DNA within or attached to another DNA molecule that is not found in association with the other molecule in nature. Thus, when the heterologous region encodes a bacterial gene, the gene will usually be flanked by DNA that does not flank the bacterial gene in the genome of the source bacteria.
Another example of the heterologous coding sequence is a construct where the coding sequence itself is not found in nature (e.g., synthetic sequences having codons different from the native gene). Allelic variation or
naturally occurring mutational events do not give rise to a heterologous region of DNA, as used herein.
The term "treatment" as used herein refers to either (i) the prevention of infection or reinfection (prophylaxis), or (ii) the reduction or elimination of symptoms of the disease of interest (therapy).
B. General Methods
Described herein is a system which can be used to select effective inhibitors of proteases from large pools of random peptide sequences. The method utilizes a known protease which can be obtained through standard techniques, i.e. direct isolation, synthesis or
recombinant technology. The nucleotide sequence of the protease can be determined and used to transform a host cell. The host cell is also transformed with a
nucleotide sequence encoding a protein that confers a negative phenotype on the cell, such as sensitivity to a given antibiotic, or inability to grow on a given carbon source, which is dependent on the activity of the cloned protease. Genes for the protease and the protein
conferring the dependent phenotype are contained on one or more constructs which have been introduced into the host cell. Thus, inhibition of the protease confers a selectable phenotype on the cell (e.g., antibiotic resistance, or the ability to grow on a given carbon source). Once identified, the particular inhibitor can be isolated, sequenced and further used as described below.
The negative phenotype may be expressed by either of two general mechanisms. In the first, a gene conferring a dominant negative phenotype is expressed as an inactive precursor protein which is activated by protease-mediated cleavage at a site which has been
engineered to resemble a natural substrate of the
protease. In the second mechanism, a gene conferring a selectable phenotype is inactivated by protease-mediated cleavage at a similarly engineered site.
The above-described host cells can then be used to detect effective inhibitors of the protease from large pools of random peptides encoded on another plasmid.
Cells transformed with variants of this plasmid which encode effective inhibitors are identified by selection for the appropriate phenotype. This additional plasmid contains a gene which encodes a "carrier" protein in which all or part of an exposed domain has been
randomized with respect to its amino acid sequence.
Typically, the randomized domain may range from four to fifteen amino acids in length. The length of the
randomized amino acid sequence will depend on the
specific application of the inhibitor and can be readily determined by one skilled in the art. For example, peptides intended for use for the design of peptide mimetics will tend to have shorter sequences than
peptides for use in peptide or gene therapy.
Part or all of this sequence is randomized with respect to the twenty genetically-encodable amino acids. Thus, a fully randomized set of heptapeptide sequences would contain more than 109 different peptides.
Such random sequence "libraries" can be constructed by replacing the sequence encoding the exposed domain in the "carrier" protein gene with a set of synthetic oligodeoxynucleotides of random sequence. A natural substrate of the protease in question can be conveniently used for the "carrier" protein. Alternatively, one of the many well-characterized natural protease inhibitors may be used (Proteinase Inhibitors. A.J. Barrett and G. Salvesen, eds. (Elsevier, Amsterdam,
1986), section C), in which the amino acid sequence of the native binding domain has been randomized.
Structural constraints placed on the randomized sequence by the flanking domains of the "carrier" may be minimized by flanking the randomized sequence with short "spacers" of polyproline or polyglycine which are highly flexible
(Creighton, T.E., Proteins: Structures and Molecular
Properties. W.H. Freeman, New York, .1984, ch. 5).
Some of these "random" peptides will, by chance, have structures which are capable of binding tightly to the active site of the protease, thereby preventing it from either activating or inactivating the negative phenotype-conferring protein, depending on the mechanism employed. This, in turn, will confer the selectable phenotype on the host cell when transformed with these constructs. The structures of effective inhibitors can then be determined by sequencing the appropriate regions of constructs recovered from such phenotype-selected cells.
A representative example of the first mechanism described above, i.e., wherein the activation of a negative phenotype-conferring protein is inhibited
(hereinafter referred to as Protease Inhibitor Selection System I) as applied to the ZYMV protease is illustrated in Figure 1. A portion of the ZYMV polyprotein is depicted which includes the protease, replicase (NIb), and coat protein (see U.S. Patent Application Serial No. 07/560,130). The arrows indicate substrate sites at which the protease cleaves the polyprotein. In this selection system, either the replicase or coat protein is replaced with the coding sequence for the protein
conferring the negative phenotype. An example of the latter includes E. coli ribosomal protein S12, which confers sensitivity to streptomycin on streptomycin-
resistant hosts such as E. coli strain MC1009 (Post, L.E., and M. Nomura, J. Biol. Chem. (1980) 255:4660-4666). A transcription repressor protein may also be used such as the lactose, tryptophan, or phage lambda repressors (Lewin, B., Genes IV. Cell Press, Cambridge, 1990, pp. 240-264). These act by repressing expression of antibiotic resistance genes in hosts in which these genes are transcribed from repressible promoters. In either case the negative phenotype is only displayed when the negative phenotype protein is being actively cleaved out of the polyprotein by the protease.
For other proteases it may be convenient to express the protease and the negative phenotype precursor from separate transcription units. The only requirements are that the negative phenotype protein be linked to an extraneous domain by a peptide sequence which is a natural substrate for the protease in question, and that this precursor be inactive until cleaved by the protease.
Figure 2 illustrates the second mechanism wherein the negative phenotype is conferred by protease-mediated inactivation of a protein conferring a
selectable phenotype (referred to herein as Protease Inhibitor Selection System II). Examples of such
proteins include secreted or membrane proteins which confer resistance to the antibiotics ampicillin,
tetracycline, or kanamycin (Methods in Enzymology,
vol. 43, Academic Press, New York, 1975), or which confer the ability to utilize carbon sources such as lactose or maltose (Bieker, K.L., and T.J. Silhavy, Trends in
Genetics (1990) 6:329-334). These proteins are normally expressed as precursors in which an amino-terminal signal sequence directs transport of the protein across the cell membrane or insertion of the protein into the cell membrane, after which the signal sequence is
proteolytically removed. In these constructs, the protease substrate peptide sequence is inserted between the signal sequence and the mature protein such that cleavage by the protease renders the protein incapable of membrane transport or insertion and thereby inactive. Alternatively, the protease substrate sequence may be inserted into a surface domain of the mature protein such that cleavage by the protease renders the protein
inactive.
A special case of this selection system occurs with proteases which are toxic when expressed in E. coli by virtue of their fortuitous inactivation of one or more host proteins which are required for growth (Baum, E.Z., et al., Proc. Natl. Acad. Sci. USA (1990) 87:5573-5577). In such cases, the inactivated host protein(s) confer the selectable phenotype in the presence of inhibitors of the protease.
The random peptide inhibitor gene library may be delivered to the selector cells by any of several methods, the choice of which will depend to some extent on the size of the library. One skilled in the art can readily determine an acceptable technique to use with a given library. For example, chemical transformation with purified plasmid (Sambrook, J., et al., Molecular
Cloning. Cold Spring Harbor Laboratory, 1989, pp. 1.76-1.84) can be used for libraries of up to 108-109 members, depending on the efficiency. Such a library can
accommodate a complete set of fully randomized
pentapeptides. High voltage electroporation with
purified plasmid (Dower, W.J., et al., Nucleic Acids Res.
(1988) 16:6127-6145) is useful for libraries of 1010-1011 members, nearly sufficient to accommodate a complete set of fully randomized heptapeptides . For larger libraries,
bacteriophage-derived vectors can be used for delivery by transduction.
For example, a plasmid vector can be converted to a cosmid vector (Sambrook, J., et al., Molecular
Cloning. Cold Spring Harbor Laboratory, 1989, ch. 3) simply by insertion of a cos site and an appropriate length of "stuffer" DNA. Concatenate ligation of the library to such a vector can be followed by efficient packaging into phage λ pseudovirions using commercially available preparations. Efficient, large-scale
transductions of the packaged cosmids into selector cells can then be accomplished by established methods. In a further refinement, concatemers of the inhibitor gene library can be used instead of "stuffer" DNA in the cosmid to achieve the necessary size for packaging. This reduces, by an order of magnitude, the number of
transformants that need to be screened to cover the entire library.
The stringency of selection by these systems can be adjusted in a variety of ways. A number of transcriptional promoters and enhancers of varying strengths are available (Sambrook, J., et al., Molecular Cloning, Cold Spring Harbor Laboratory, 1989, ch. 17), which can be used with the protease, negative phenotype precursor, and inhibitor genes to raise or lower the inhibitor strength required for selection. Inducible promoters can be used, such that their strengths may be titrated by adjusting the amount of inducer in the growth medium. For example, by having the inhibitor expressed from an inducible promoter, the potency threshold of a pool of selected inhibitors is raised, and the size of the pool reduced, simply by reducing the amount of inducer present during selection. In addition, such inhibitor inducibility can be used to counterselect
stable false positives, such as revertants that have mutated the protease gene, simply by replica plating onto selective medium in the absence of inducer. Only the revertants are able to grow.
Once detected, the protease inhibitor can be isolated and chemically characterized, using known techniques. These systems can be used to generate inhibitor peptides for any protease which can be
expressed in active form in a suitable host and for which substrate cleavage site sequences are known. In addition to the proteases of many important plant and animal viral pathogens, inhibitors of the proteases of other types of microbial pathogens as well as cellular proteases which have been implicated in such disorders as rheumatoid arthritis, Alzheimer's disease, and tumor metastasis, can also be identified.
C. Use and Administration
The instant invention can be used to identify protease inhibitors which in turn are useful in the treatment of protease-dependent diseases in both plants and animals. The inhibitors can be used directly in peptide therapy or can be encoded in a gene and used in gene therapy. The identified inhibitors can also serve as structural models for the rational design of peptide-mimetics, that is, synthetic compounds that mimic the protease-binding action of the identified protease inhibitors to bring reactive groups into contact with the protease active site. (See, e.g., Demuth, H.-U., J.
Enzyme Inhibition (1990) 3:249-278). The present
invention also has a more general application in the construction and use of in vivo systems for the selection of bioactive peptides from peptide libraries. For example, systems may be designed for the selection of
peptide inhibitors of hydrolytic enzymes other than proteases. A number of such enzymes are known which can hydrolyze natural or artificial substrates to produce one or more compounds which are toxic to E. coli (for
example, see Hydrolytic Enzymes, A. Neuberger and K.
Brocklehurst, eds. Elsevier, Amsterdam, 1987). The expression of such an enzyme in an appropriate host cell allows the selection of peptide inhibitors of the enzyme based on their ability to confer viability on the cells in the presence of toxigenic substrates.
In general, any phenotype of cultured procaryotic or eucaryotic cells which can be altered by the endogenous expression of appropriate peptides such that cells expressing such peptides can be readily distinguished and isolated from cells which either do not express such peptides or which express peptides which do not alter the phenotype, may provide the basis for establishing an in vivo system for the selection of bioactive peptides from peptide libraries. Among the most tractable medically important phenotypes will be those manifesting susceptibility to microbial
pathogenicity.
For example, the endogenous expression of a random peptide library as an exposed domain of a suitable stable "carrier" protein in a population of cultured mammalian cells of sufficient size to ensure that all or most members of the library are represented in the population may be used to select peptides which interfere with the ability of microbial pathogens such as viruses or bacteria or their toxins to inhibit cell growth. When such cell populations are challenged by such pathogens or toxins, only those cells expressing inhibitory peptides will grow, allowing the active peptides to be identified
by established methods. Such peptides can in turn be used for the development of effective therapies.
For the treatment of plant pathogenesis, the identified inhibitors can be used to create transgenic plants. One commonly used method of gene transfer in plants involves the insertion of the gene of interest into the T-DNA region of a Ti or Ri plasmid derived from A. tumefaciens or A. rhizogenes, respectively. Many control sequences are known which when coupled to a heterologous coding sequence and transformed into a host organism show fidelity in gene expression with respect to tissue/organ specificity of the original coding sequence. See, e.g., Benfey, P.N., and Chua, N.H., Science (1989) 244:174-181. Suitable control sequences for use in these plasmids include promoters for constitutive leaf-specific expression of the desired gene in the various target plants. Other useful control sequences include a
promoter and terminator from the nopaline synthase gene (NOS). The NOS promoter and terminator are present in the plasmid pARC2, available from the American Type
Culture Collection and designated ATCC 67238. If such a system is used, the virulence (vir) gene from either the Ti or Ri plasmid must also be present, either along with the T-DNA portion, or via a binary system where the vir gene is present on a separate vector. Such systems, vectors for use therein, and methods of transforming plant cells are described in U.S. Patent No. 4,658,082, and Simpson, R.B., et al., Plant Mol. Biol. (1986) 6:403-415, incorporated herein by reference in their entirety.
Once constructed, these plasmids can be placed into A. rhizogenes or A. tumefaciens and these vectors used to transform cells of plant species which are ordinarily susceptible to the particular plant pathogen. The selection of either A. tumefaciens or A. rhizogenes
will depend on the plant being transformed thereby. In general, A. tumefaciens is the preferred organism for transformation. Most dicotyledons, some gymnosperms, and a few monocotyledons (e.g., certain members of the
Liliales and Arales) are susceptible to infection with A. tumefaciens. A. rhizogenes also has a wide host range, embracing most dicots and some gymnosperms, which
includes members of the Leguminosae, Compositae and
Chenopodiaceae. Alternative techniques which have proven to be effective in genetically transforming plants include particle bombardment and electroporation. See, e.g., Rhodes, CA., et al., Science (1988) 240:204-207: Shigekawa, K., and Dower, W.J., BioTechniques (1988)
6:742-751; Sanford, J.C., et al., Particulate Science and Technology (1987) 5:27-37; and McCabe, D.E.,
BioTechnology (1988) 6:923-926.
Once transformed, these cells can be used to regenerate transgenic plants. For example, whole plants can be infected with these vectors by wounding the plant and then introducing the vector into the wound site. Any part of the plant can be wounded, including leaves, stems and roots. Alternatively, plant tissue, in the form of an explant, such as cotyledonary tissue or leaf disks, can be inoculated with these vectors and cultured under conditions which promote plant regeneration. Roots or shoots transformed by inoculation of plant tissue with A. rhizogenes or A. tumefaciens. containing the desired gene, can be used as a source of plant tissue to
regenerate transgenic plants, either via somatic
embryogenesis or organogenesis. Examples of such methods for regenerating plant tissue are disclosed in Shahin,
E.A., Theor. Appl. Genet. (1985) 69:235-240; U.S. Patent
No. 4,658,082; and Simpson et al., supra.
The inhibitors identified by the present method can also be used in gene therapy. For example, HIV- specific protease inhibitor genes, in which a natural mammalian protease inhibitor serves as carrier for the HIV protease inhibitor domain, can be used in anti-AIDS gene therapy. Lymphocytes or bone marrow cells from the patient can be transformed with the protease inhibitor gene in vitro and returned to the patient, where they establish an HIV-resistant subpopulation of lymphocytes which can gradually restore cell-mediated immune function as the patient's untransformed lymphocytes are depleted by the virus.
Similarly, proteases active in blood, lymph, or cerebro-spinal fluid which are essential components of disorders such as chronic inflammations, metastatic cancers, and certain viral infections, may be targeted by protease inhibitor gene therapy, in which the inhibitors are secreted by transgenic lymphocytes or other
transgenic cell implants.
For therapeutic use in animals, the inhibitors identified by the present method can be altered by established methods to improve their pharmaco-kinetic properties. For example, the inhibitors may be
administered linked to a carrier. For example, a
fragment may be conjugated with a macromolecular carrier. Suitable carriers are typically large, slowly metabolized macromolecules such as: proteins; polysaccharides, such as sepharose, agarose, cellulose, cellulose beads and the like; polymeric amino acids such as polyglutamic acid, polylysine, and the like; amino acid copolymers; and inactive virus particles. Especially useful protein substrates are serum albumins, keyhole limpet hemocyanin, immunoglobulin molecules, thyroglobulin, ovalbumin, and other proteins well known to those skilled in the art.
The protein substrates may be used in their native form or their functional group content may be modified by, for example, succinylation of lysine
residues or reaction with Cys-thiolactone. A sulfhydryl group may also be incorporated into the carrier (or inhibitor) by, for example, reaction of amino functions with 2-iminothiolane or the N-hydroxysuccinimide ester of 3-(4-dithiopyridyl) propionate. Suitable carriers may also be modified to incorporate spacer arms (such as hexamethylene diamine or other bifunctional molecules of similar size) for attachment of peptides. Methods of coupling peptides to proteins or cells are known to those of skill in the art.
It is also possible to administer the inhibitors identified using the instant method alone, or mixed with a pharmaceutically acceptable vehicle or excipient. Typically, the compositions are prepared as injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to injection may also be prepared. The preparation may also be emulsified or the active ingredient encapsulated in liposome vehicles. The active immunogenic ingredient is often mixed with vehicles containing excipients which are pharmaceutically acceptable and compatible with the active ingredient. Suitable vehicles are, for example, water, saline, dextrose, glycerol, ethanol, or the like, and combinations thereof. In addition, if desired, the vehicle may contain minor amounts of auxiliary substances such as wetting or emulsifying agents, or pH buffering agents. Actual methods of preparing such dosage forms are known, or will be apparent, to those skilled in the art. See, e.g., Remington's Pharmaceutical Sciences, Mack Publishing Company, Easton, Pennsylvania, 15th edition, 1975. The
composition or formulation to be administered will, in any event, contain a quantity of the inhibitor adequate to achieve the desired effect in the individual being treated.
Additional formulations which are suitable for other modes of administration include suppositories and, in some cases, aerosol, intranasal, oral formulations, and sustained release formulations. For suppositories, the vehicle composition will include traditional binders and carriers, such as, polyalkylene glycols, or
triglycerides. Such suppositories may be formed from mixtures containing the active ingredient in the range of about 0.5V to about 10% (w/w), preferably about 1% to about 2%. Oral vehicles include such normally employed excipients as, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium, stearate, sodium saccharin cellulose, magnesium carbonate, and the like. These oral compositions may be taken in the form of solutions, suspensions, tablets, pills, capsules,
sustained release formulations, or powders, and contain from about 10% to about 95% of the active ingredient, preferably about 25% to about 70%.
Intranasal formulations will usually include vehicles that neither cause irritation to the nasal mucosa nor significantly disturb ciliary function.
Diluents such as water, aqueous saline or other known substances can be employed with the subject invention. The nasal formulations may also contain preservatives such as, but not limited to, chlorobutanol and
benzalkonium chloride. A surfactant may be present to enhance absorption of the subject proteins by the nasal mucosa.
Controlled or sustained release formulations are made by incorporating the inhibitor into carriers or
vehicles such as liposomes, nonresorbable impermeable polymers such as ethylenevinyl acetate copolymers and Hytrel® copolymers, swellable polymers such as hydrogels, or resorbable polymers such as collagen and certain polyacids or polyesters such as those used to make resorbable sutures. The inhibitors can also be delivered using implanted mini-pumps, well known in the art.
Furthermore, the inhibitors (or complexes thereof) may be formulated into pharmaceutical compositions in either neutral or salt forms. Pharmaceutically acceptable salts include the acid addition salts (formed with the free amino groups of the active polypeptides) and which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed from free carboxyl groups may also be derived from inorganic bases such as, for
example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine,
procaine, and the like.
To treat an animal subject, the inhibitor of interest is administered parenterally, usually by
intramuscular injection in an appropriate vehicle. Other modes of administration, however, such as subcutaneous, intravenous injection and intranasal delivery, are also acceptable. Injectable formulations will contain an effective amount of the active ingredient in a vehicle, the exact amount being readily determined by one skilled in the art. The active ingredient may typically range from about 1% to about 95% (w/w) of the composition, or even higher or lower if appropriate. The quantity to be administered depends on the animal to be treated and the particular inhibitor used. Effective dosages can be
readily established by one of ordinary skill in the art through routine trials establishing dose response curves. The subject is treated by administration of the
particular inhibitor, in at least one dose. Moreover, the subject may be administered as many doses as is required to effectively treat the individual.
Below are examples of specific embodiments for carrying out the present invention. The examples are offered for illustrative purposes only, and are not
intended to limit the scope of the present invention in any way.
EXAMPLES Example 1
The genetic expression of active ZYMV 49 kDa protease in E. coli.
This example describes the construction and expression in E. coli of a gene which encodes a portion of the ZYMV polyprotein. The primary translation product of this gene is a 140 kDa protein which includes the 49 kDa protease and flanking cleavage sites, a portion of the nuclear inclusion 'b' protein (NIb, also referred to as the replicase), including the NIb/coat protein
cleavage site, followed by the coat protein (CP).
Evidence is presented showing that the expression of this gene in E. coli leads to an accumulation of mature CP as a result of efficient cleavage at the NIb/CP cleavage site by the 49 kDa protease. cDNA Cloning and Sequencing of the ZYMV Genome
A California isolate of ZYMV was obtained from Professor J.A. Dodds of the University of California at Riverside. The virus was propagated by mechanical
inoculation of the cotyledons of ten-day-old Cucurbita pepo cv. early straightneck seedlings. Systemically infected leaves were harvested 3-8 weeks after
inoculation and virus was purified therefrom essentially as described by Lisa, V., et al., Phytopathol (1981)
71:667-672. The virus was quantified by absorbance at 260 nm using an extinction coefficient of 2.8 A260/mg/ml.
Viral genomic RNA was isolated from purified virions by digestion with protease K in borate buffer (pH 9) containing 1% SDS and 4 mM EDTA for one hour at 37°C, followed by phenol/chloroform extraction and ethanol precipitation. The RNA was redispersed in water,
quantified by absorbance at 260 nm, and analyzed by agarose gel electrophoresis in the presence of methyl mercuric hydroxide.
DNAs complementary to ZYMV RNA were synthesized essentially according to Gubler, U., et al. Gene (1983) 25:263-269, as described in the technical manual for the Riboclone cDNA Synthesis Kit (Promega Corp.). Figure 3 shows an outline of this procedure. The first strand was synthesized using AMV reverse transcriptase and an oligodeoxythymidylate primer. After second strand synthesis EcoRI linkers were added, digested, and the cDNAs were ligated into the EcoRI site of pBluescript (Stratagene, Inc.). The ligation product was then used to transform competent E. coli XL-1 Blue cells
(Stratagene) which were then plated in the presence of lac inducer (IPTG) and substrate (X-gal) for color selection of recombinants. Plasmid DNA was isolated from colorless clones by the alkaline lysis miniprep method
(Molecular Cloning. A Laboratory Manual, 2d Ed., J.
Sambrook, E. Fritsch, and T. Maniatis, eds., Cold Spring
Harbor Press, New York, 1989) and insert sizes were
estimated after digestion with EcoRI by agarose gel electrophoresis in the presence of ethidium bromide.
PZRl, the largest cDNA clone obtained from the first experiment, had a 2.3 kb insert, the ends of which were sequenced using the Sanger dideoxy chain-terminating method as described in the product literature for the Sequenase 2 sequencing kit (United States Biologicals) with the M13 universal and reverse primers encoded on either end of the multiple cloning site in pBluescript. The orientation of the insert relative to the viral genome and the multiple cloning site was indicated by the appearance of the polyadenylate tract from the 3' end of the genome in the sequence from the reverse primer. The remainder of the clone was sequenced stepwise, 200-300 nucleotides at a time, in both directions from synthetic oligodeoxynucleotide primers complementary to the distal ends of each of the successive sequencing runs. Sequence data were processed and analyzed on a DEC VAX 11/750 minicomputer.
The second round of cDNA cloning was accomplished in the same manner as the first except that a synthetic oligodeoxynucleotide complementary to the 5' end of pZRl was used as primer and the cDNAs were ligated directly, without linkers, into the EcoRV site of
pBluescript (Figure 3). From this cloning two clones were obtained, pZB11 and pZB60, which had inserts of 2.3 kb and 3.8 kb, respectively. For the sequencing of pZB11, nested deletions were prepared from each end of the insert according to Henikoff, S., Gene (1984)
28:357ff as described in the product literature for the Erase-a-base System kit (Promega Corp.). For each direction, approximately twenty-four clones containing deletions spanning the entire length of the insert were sequenced simultaneously from the M13 universal or
reverse primers. Any gaps left by failure of the
sequences of adjacent time points to overlap were filled in using synthetic oligodeoxynucleotide primers made from the sequence near the 5' end of the gap.
Clone pZB60 was sequenced in both directions from nested deletions in the same manner as for pZB11 except that a unique NcoI site within pZB11 was used as the starting point for deletions in the 5' direction and only those clones with deletions mapping between the 5' end of pZB11 and the 5' end of pZB60 were sequenced.
A third round of cDNA cloning was conducted as described above for the preparation of pZB11 and pZB60 except that a synthetic oligomer complementary to the 5' end of pZB60 was used as a primer. From this round, pZF18, having an insert of 3.7 kb was obtained and sequenced as described above for pZB11 and pZB60.
The 5' end of the viral RNA sequence was determined by reverse transcription of purified viral RNA using a synthetic oligonucleotide primer complementary to nucleotides 76-99 at the 5' end of pZF18. The Sanger dideoxynucleotide chain-terminating method was used essentially as described in the Promega Gem Seq manual (Promega Corp.).
The continuous open reading frame of the viral genome was identified with the aid of a computer as described above. The coding sequences of the functional ZYMV gene products were identified by amino acid sequence homology to those of other potyviruses (Allison, R., et al., Virology (1986) 154:9-20: Domier, L.L., et al.,
Nucleic Acids Res (1986) 14:5417-5430; Robaglia, C, et al., J Gen Virol (1989) 70:935-947; Maiss, E., et al., J Gen Virol (1989) 70:513-524). The identity of the coat protein gene was further confirmed by subcloning the presumptive coding sequence into a modified version of
pBluescript from which the gene could be expressed in vitro. In vitro translation of the gene produced a product of the expected size which reacted specifically with antiserum raised against purified ZYMV coat protein when analyzed by Western blotting.
Figure 4 shows the nucleotide sequence of the ZYMV genome as determined above along with the deduced amino acid sequence. The nucleotide sequence is numbered from the 5' terminus. The 5' non-coding region extends from nucleotide 1 to nucleotide 139. Nucleotides 140-142 initiate the polyprotein coding sequence with a
methionine codon in a consensus translation initiation context (Joshi, C.P., Nucleic Acids Res (1987) 15:6643-6653). By homology with the potyviral polyprotein sequences cited above, the cleavage site between the aphid transmission helper component (HC) and the 46 kDa protein is believed to occur between the glycine at codon 766 (nucleotides 2435-2437) and the glycine at codon 767 (nucleotides 2438-2440). The cleavage site between the 46 kDa protein and the cytoplasmic inclusion protein (CI) is believed to occur between the glutamine at codon 1164 (nucleotides 3629-3631) and the glycine at codon 1165 (nucleotides 3632-3634). The cleavage site between CI and VPg/protease (VPg and protease are probably not separated in ZYMV) is believed to occur between the glutamine at codon 1798 (nucleotides 5531-5533) and the serine at codon 1799 (nucleotides 5534-5536). The cleavage site between VPg/protease and RNA replicase (Rep) is believed to occur between the glutamine at codon 2284 (nucleotides 6989-6991) and the serine at codon 2285 (nucleotides 6992-6994). The cleavage site between the RNA replicase and the coat protein (CP) is believed to occur between the glutamine at codon 2801 (nucleotides 8540-8542) and the serine at codon 2802 (nucleotides
8543-8545). Termination of the polyprotein coincides with termination of the coat protein and is believed to occur at the stop codon (nucleotides 9380-9382) following the glutamine at codon 3080. The 3' non-coding sequence then extends from nucleotide 9383 to nucleotide 9593 before terminating in a polyadenylate sequence of
variable length. cDNA clone pZRl contained approximately 80 adenosines at its 3' terminus. pZPro5
A restriction fragment of 1666 base pairs (bp) extending between the PvuII and SspI sites of ZYMV cDNA clone pZB11 (described above) was isolated by agarose gel electrophoresis and ligated into the Smal site of plasmid pTZ18U (Sambrook, J., et al., Molecular Cloning, Cold Spring Harbor Laboratory, 1989; Mead D.A., et al.,
Protein Engineering (1986) 1:67). This restriction fragment comprises a portion of the coding sequence of the ZYMV polyprotein which includes part of the
cytoplasmic inclusion protein (CI), the 6 kDa protein, the 49 kDa protease, and a portion of the NIb protein. Insertion of this fragment into the Smal site of pTZ18U places the reading frame encoding these proteins in phase with that of the expressible lacZα gene of pTZ18U such that expression of this gene from the lac promoter is expected to produce a fusion protein comprised of a small portion of the lacZα peptide fused to the amino terminus of the ZYMV polyprotein fragment. This construct was denoted pZPro5 and its structure was confirmed by
dideoxynucleotide sequencing (Sanger, F., et al., Proc. Natl. Acad. Sci. USA (1977) 74:5463-5467). pZPro6 and pZPro7
The 2280 bp SalI restriction fragment from ZYMV cDNA clone pZR1 (described above), which comprises a portion of the ZYMV genome including part of the NIb protein, CP, the 3' non-coding sequence, and a portion of the polyadenylate sequence, was inserted into the SalI site of pZPro5 to create pZPro6. Dideoxynucleotide sequencing of pZPro6 confirmed that the NIb/CP-encoding reading frame of the inserted fragment was in phase with the open reading frame of pZPro5 such that expression of this construct from the lac promoter is expected to produce a single polypeptide of approximately 140 kDa.
In a further refinement of pZPro6, the 1244 bp Mlul-Nael fragment was removed and the 705 bp MluI-EcoRV fragment from pZR1 was inserted in its place, removing most of the ZYMV 3' non-coding and polyadenylate
sequences, which include several unwanted restriction sites. This construct was denoted pZPro7. E. coli
strain DH5α was transformed with pZPro6 and pZPro7, and transformed clones were identified and isolated by selection for ampicillin resistance.
Expression of the lacZα-ZYMV polyprotein gene in pZPro6 and pZPro7 was monitored by immunoblotting of SDS/PAGE-resolved proteins from these cells using
polyclonal antisera raised in rabbits against denatured ZYMV coat protein (Burnette, W.N., Anal. Biochem. (1981) 112:195). Results are shown in Figure 5B. Extract from cells harboring either pZPro6 or pZPro7 contained a single immunoreactive band which co-migrated with the major species of mature ZYMV coat protein at
approximately 31 kDa. Since the CP-containing primary translation product of 140 kDa was not detected in these extracts, the exclusive appearance of mature CP implies correct and efficient processing of the polyprotein by the ZYMV 49 kDa protease.
To rule out the possibility that mature CP might have been produced either by an endogenous E. coli protease, or by fortuitous initiation of translation near the amino terminus of mature CP, a variant of pZPro7 was also analyzed. This variant, denoted placZα-CP, was made by deleting the sequence encoding the protease and most of the NIb protein from pZPro7, leaving part of the NIb protein and CP in phase with the lacZα peptide in a 46 kDa open reading frame (see Figure 5A). Extracts from cells harboring this construct contained a single
immunoreactive band which migrated with an apparent MW of 46 kDa (Figure 5B, lane 2). The apparent absence in these cells of a species co-migrating with mature CP in the absence of the ZYMV 49 kDa protease indicates that the activity of the latter is indeed responsible for the occurrence of mature CP in cells harboring pZPro6 and pZPro7.
Example 2
Construction and analysis of genes which confer a
negative phenotype on E. coli cells by virtue of the activity of the ZYMV 49 kDa protease according to the scheme described above for Protease Inhibitor Selection System I.
This example describes the construction of expressible genes encoding polyproteins which contain the ZYMV 49 kDa protease and the E. coli ribosomal protein S12. The ability of these gene constructs to confer sensitivity to the antibiotic streptomycin on several streptomycin-resistant E. coli strains by virtue of correct and efficient processing of the polyprotein by the ZYMV 49 kDa protease is demonstrated.
Streptomycin lethality in E. coli has been ascribed to its ability to interfere with protein
synthesis by binding to the S12 subunit of the 30S component of the ribosome (Gorini, L., in Ribosomes, M. Nomura, A. Tissieres, P. Lengyel, eds., Cold Spring Harbor Laboratory, 1974, pp. 791-803). Streptomycin- resistant mutants have been isolated which express altered forms of S12 which retain the ability to
participate in the assembly of ribosomes which can function in the presence of streptomycin. Wildtype S12 has been shown to confer a dominant streptomycin- sensitive phenotype on merodiploids which express both wildtype and streptomycin-resistant forms of S12. The highly sequestered position of S12 in the ribosome suggests that S12-containing polyproteins should be too encumbered to participate in the assembly of functional ribosomes. Thus, the expression of such polyproteins in streptomycin-resistant hosts should be unable to confer streptomycin sensitivity unless mature S12 can be
proteolytically freed from the polyprotein. pZPro9
The CP-encoding sequences in pZPro7 were precisely replaced with the coding sequence for S12 to create pZPro9. This was accomplished as follows. The sequence bounded by the Bglll site in NIb and the P1' position of the Nlb/CP cleavage site in pZPro7 (Schecter, I., and A. Berger, Biochem. Biophys. Res. Commun. (1967) 27:157) was amplified by polymerase chain reaction (PCR, Saiki, R.K., et al., Science (1988) 239:487). The S12 coding sequence from the second amino acid to the end was amplified by PCR from plasmid pNO1523 (Dean, D., Gene
(1981) 15:99-102). The 3' primer contained an MluI site following the stop codon. Following cleavage of the first PCR product by BglII and the second by MluI both were simultaneously ligated into pZPro7 from which the
Bglll-Mlul fragment had been removed. Following
transformation and plasmid DNA purification, the
structure of pZPro9 (see Figure 6) was confirmed by dideoxynucleotide sequencing.
pZPro9 was then transformed into streptomycin-resistant EL. coli strains MC1009, HB101, and N100
(American Type Culture Collection Catalogue of Bacteria and Phages, 1989). After two rounds of single colony isolation in the presence of ampicillin (amp), single colonies of each transformant were grown in Luria-Bertani medium (LB) containing 50 μg/ml amp to mid-log phase and plated on solid LB containing 100 μg/ml amp, 100 μg/ml streptomycin (strep), or both.
Consistently, fewer than one in 105 amp-resistant colony-forming units (cfu) of each transformant was observed to grow in the presence of both amp and strep, while the same hosts harboring pZPro7 plated with similar efficiencies on amp alone or amp and strep (see Figure 6). Thus, by virtue of having S12 in place of CP, pZPro9 is able to confer strep sensitivity on strep-resistant hosts, while its CP-containing parent, pZPro7, is not. The pZPro9 transformants were fully sensitive to as little as 3 μg strep/ml while the parent strains were fully resistant to up to 350 μ/ml. Also, the pZPro9 transformants plated equally well on strep alone or amp alone, indicating that pZPro9 is quickly lost in the absence of amp selection and that there is no discernible tendency to replace the strep-resistant gene in the host chromosome with the S12 gene by homologous recombination.
To confirm that cleavage of the polyprotein by the ZYMV 49 kDa protease to liberate mature S12 is required for strep sensitivity, the Nlb/CP cleavage site was removed from pZPro9 to create pZPro12, which should produce a polyprotein from which S12 cannot be freed by
the protease. This was accomplished by cleaving pZPro9 with EcoRV and Hpal, which removed most of the NIb protein including the Nlb/CP cleavage site , and replacing it with the fragment produced by EcoRV alone, which restored most of the NIb protein down to within 12 amino acids of the Nlb/CP cleavage site (see Figure 6). The structure of pZPro12 was confirmed by dideoxynucleotide sequencing. Transformation with pZPro12 has no
discernible effect on the ability of strep-resistant E. coli strains to grow vigorously in the presence of up to 350 μg/ml streptomycin. Thus, the strep-sensitive phenotype produced by pZPro9 is completely dependent on the presence of a substrate cleavage site at which the protease can cleave functional S12 from the polyprotein.
The expression of the pZPro9 polyprotein, like most large eucaryotic proteins, places a considerable burden on growing E. coli cells. In an attempt to reduce this burden, the pZPro9 was streamlined by removing the EcoRV fragment described above, which contains most of the NIb protein exclusive of the cleavage sites at either end. This construct, denoted pZPro10, encodes a
polyprotein of about 83 kDa, of which 49 kDa is the protease and about 14 kDa is S12 (see Figure 6). Upon transformation with pZPro10, strep-resistant E. coli strains displayed a strep-sensitive phenotype identical to that of the pZPro9 transformants. In addition, pZPro10 transformants grew considerably more vigorously than pZPro9 transformants, indicating a significant reduction in the metabolic burden on the host cells.
Thus, removal of most of the NIb protein had no
discernible effect on the efficiency of removal of functional S12 from the polyprotein by the protease.
However, pZPro10-expressing cells still grew poorly compared to the untransformed host. Recent work
with another viral protease suggests that this is
probably due, at least in part, to fortuitous activity of the protease on host proteins (Baum, E.Z., et al., Proc. Natl. Acad. Sci. USA (1990) 87:5573-5577). Inhibitors of the protease should at least partly restore normal growth. As this growth differential is relatively easy to score, it is possible to use the toxicity of the protease as the negative phenotype, and to select
inhibitors from peptide libraries, or to confirm selected inhibitors on the basis of their ability to restore rapid growth.
Once the protease removes itself from the polyprotein of pZPro10, the 14 kDa S12 is left in a 21.5 kDa precursor until freed by the protease. To confirm that neither this precursor nor the polyprotein itself is able to confer strep sensitivity, the NIb/CP cleavage site was removed from pZPro10 in the same manner that the EcoRV-Hpal restriction fragment was removed from pZPro9 to create pZPro12. This new construct, pZPro11, shown in Figure 6, had no discernible effect on the level of strep resistance shown by strep-resistant E. coli strains.
Thus, again, the presence of a substrate cleavage site adjacent to S12 is required for strep sensitivity, implying that the ZYMV 49 kDa protease is specifically responsible for generating functional S12.
Thus, systems for identifying and selecting protease inhibitors from peptide libraries have been disclosed. Although preferred embodiments of the subject invention have been described in some detail, it is understood that obvious variations can be made without departing from the spirit and the scope of the invention as defined by the appended claims.
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: BALINT, ROBERT
(ii) TITLE OF INVENTION: A METHOD FOR THE IDENTIFICATION OF PROTEASE INHIBITORS
(iii) NUMBER OF SEQUENCES: 1
(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Irell & Manella
(B) STREET: 545 Middlefield Road, Suite 200
(C) CITY: Menlo Park
(D) STATE: CA
(E) COUNTRY: USA
(F) ZIP: 94025
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: Patentln Release #1.0, Version #1.2
(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: US 07/727,837
(B) FILING DATE: 08-JUL-1991
(C) CLASSIFICATION:
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: ROBINS, ROBERTA L.
(B) REGISTRATION NUMBER: 33,208
(C) REFERENCE/DOCKET NUMBER: 7115-0045
(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (415) 327-7250
(B) TELEFAX: (415) 327-2951
(C) TELEX: 706141
(2) INFORMATION FOR SEQ ID NO:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9593 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
AAAATTGAAA CAAATCACAA AGACTACAAG AATCAACGAT CAAGCAAACG AATTTTTGAA 60
CGTATTTACA AACAAGCAAT CTAAAACTCT TACAGTATTA AGAAATTCTC CAATCACTTC 120
GTTTACTTCA GACATAACAA TGGCCTCCAT CATGATTGGT TCAATCTCTG TACCCATTGC 180
AAAGACTGAG CAGTGTGCAA ACACTCAAGT AAGTAATCGG GCTAATATAG TGGCACCTGG 240
CCACATGGCA ACATGCCCAT TGCCACTGAA AACGCACATG TATTACAGGC ATGAGTCCAA 300
GAAGTTGATG CAATCAAACA AGAGCATTGA CATTCTGAAC AACTTCTTCA GCACTGACGA 360
GATGAAGTTT AGGCTCACTC GAAACGAGAT GAGCAAGCTG AAAAAGGGTC CGAGCGGGAG 420
GATAGTCCTC CGCAAGCCGA GTAAGCAGCG GGTTTTCGCT CGTATCGAGC AGGATGAGGC 480
AGCACGCAAG GAAGAGGCTG TTTTCCTCGA AGGAAATTAT GACGATTCCA TCACAAATCT 540
AGCACGTGTT CTTCCACCTG AAGTGACTCA CAACGTTGAT GTGAGCTTGC GATCACCGTT 600
TTACAAGCGC ACATACAAGA AGGAAAGGAA GAAAGTGGCG CAAAAGCAAA TTGTGCAAGC 660
ACCACTTAAT AGCTTGTGCA CACGTGTTCT TAAAATTGCA CGCAATAAAA ATATCCCTGT 720
TGAGATGATT GGCAACAAGA AGGCGAGACA TACACTCACC TTCAAGAGGT TTAGGGGATG 780
TTTTGTTGGA AAGGTGTCAG TTGCGCATGA AGAAGGACGA ATGCGGCACA CTGAGATGTC 840
GTATGAGCAG TTTAAATGGC TTCTTAAAGC CATTTGTCAG GTCACCCATA CAGAGCGAAT 900
TCGTGAGGAA GATATTAAAC CAGGTTGTAG TGGGTGGGTG TTGGGCACTA ATCATACATT 960
GACTAAAAGA TATTCAAGAT TGCCACATTT GGTGATTCGA GGTAGAGACG ACGATGGGAT 1020
TGTGAACGCG CTGGAACAGG TGTTATTTTA TAGCGAAGTT GACCACTATT CGTCGCAACC 1080
GGAAGTTCAG TTCTTCCAAG GATGGCGACG AATGTTTGAT AAGTTTAGGC CTAGCCCAGA 1140
TCATGTGTGC AAAGTTGACC ACAACAACGA GGAATGTGGT GAGTTAGCAG CAATCTTTTG 1200
TCAGGCTCTA TTCCCAGTAG TGAAACTATC GTGCCAAACA TGCAGAGAAA AGCTTAGTAG 1260
AGTTAGCTTT GAGGAATTCA AAGATTCTTT GAACGCAAAC TTTATTATCC ACAAGGATGA 1320
ATGGGGTAGT TTCAAGGAAG GCTCTCAATA CGATAATATT TTCAAATTAA TCAAAGTGGC 1380
AACACAGGCA ACTCAGAATC TCAAGCTCTC ATCTGAAGTT ATGAAATTAG TTCAGAACCA 1440
CACAAGCACT CACATGAAGC AAATACAAGA CATCAATAAG GCGCTCATGA AAGGTTCATT 1500
GGTTGCGCAA GACGAATTGG ACTTAGCTTT GAAACAGCTT CTTGAAATGA CTCAGTGGTT 1560
TAAGAACCAC ATGCACCTGA CTGGTGAGGA GGCATTGAAG ATGTTCAGAA ATAAGCGTTC 1620
TAGCAAGGCC ATGATAAATC CTAGCCTTCT ATGTGGCAAC CAATTGGACA AAAATGGAAA 1680
TTTTGTTTGG GGAGAAAGAG GATACCATTC CAAGCGATTA TTCAAGAACT TCTTCGAAGA 1740
AGTAATACCA AGCGAAGGAT ATACGAAGTA CGTAGTGCGA AACTTTCCAA ATGGTACTCG 1800
TAAGTTGGCC ATAGGCTCAT TGATTGTACC ACTTAATTTG GATAGGGCAC GCACTGCACT 1860
ACTTGGAGAG AGTATTGAGA AGAAGCCACT CACATCAGCG TGTGTCTCCC AACAGAATGG 1920
AAATTATATA CACTCATGCT GCTGTGTAAC GATGGATGAT GGAACCCCGA TGTACTCCGA 1980
GCTTAAGAGC CCGACGAAGA GGCATCTAGT TATAGGAGCT TCTAGTGATC CAAAGTACAT 2040
TGATCTGCCA GCATCTGAGG CAGAACGCAT GTATATAGCA AAGGAAGGTT ATTGCTATCT 2100
CAGTATTTTC CTCGCAATGC TTGTAAATGT TAATGAGAAC GAAGCAAAGG ATTTCACCAA 2160
AATGATTCGT GATGTTTTGA TCCCCATGCT TGGGCAGTGG CCTTCATTGA TGGATGTTGC 2220
AACTGCAGCA TATATTCTAG GTGTATTCCA TCCTGAAACG CGATGCGCTG AATTACCCAG 2280
GATCCTTGTT GACCACGCTA CACAAACCAT GCATGTCATT GATTCTTATG GATCACTAAC 2340
TGTTGGGTAT CACGTGCTCA AGGCTGGAAC TGTCAATCAT TTAATTCAAT TTGCCTCAAA 2400
TGATCTGCAA AGCGAGATGA AACATTACAG AGTTGGTGGG ACACCAACAC AGCGCATTAA 2460 ACTCGAGGAG CAGCTGATTA AAGGAATTTT CAAACCAAAA CTTATGATGC AGCTCCTCCA 2520 TGATGACCCA TACATATTAT TACTTGGCAT GATTTCACCC ACCATTCTTG TACATATGTA 2580 TAGGATGCGT CATTTTGAGC GGGGTATTGA GATATGGATT AAGAGGGATC ATGAAATCGG 2640 AAAGATTTTC GTCATATTAG AGCAGCTCAC ACGCAAGGTT GCTCTGGCAG AAGTTCTTGT 2700 GGATCAACTT AACTTGATAA GTGAAGCTTC ACCACATTTA CTTGAAATTA TGAAGGGTTG 2760 TCAAGATAAT CAGAGGGCAT ACGTACCTGC GCTGGATTTG CTAACGATAC AAGTGGAGCG 2820
TGAGTTTTCA AATAAAGAAC TCAAAACCAA TGGCTATCCA GATTTGCAGC AAACGCTCTT 2880
CGATATGAGG GAAAAAATGT ATGCAAAGCA GCTGCACAAT TCATGGCAAG AGCTAAGCTT 2940
GCTGGAAAAA TCCTGTGTAA CCGTGCGATT GAAGCAATTC TCGATTTTTA CGGAAAGAAA 3000
TTTAATCCAG CGAGCAAAAG AAGGAAAGCG CGCATCTTCG CTACAATTTG TTCACGAGTG 3060
TTTTATCACG ACCCGAGTAC ATGCGAAGAG CATTCGCGAT GCAGGCGTGC GTAAACTAAA 3120
TGAGGCTCTC GTCGGAACTT GTAAATTCTT TTTCTCTTGT GGTTTCAAAA TTTTTGCGCG 3180
ATGCTATAGC GACATAATAT ACCTTGTGAA CCTGTGTTTG GTTTTCTCCT TGGTGCTACA 3240
AATGTCCAAT ACTGTGCGCA GTATGATAGC AGCGACAAGG GAAGAAAAAG AGAGAGCGAT 3300
GGCAAATAAA GCTGATGAAA ATGAAAGGAC GTTAATGCAT ATGTACCACA TTTTCAGCAA 3360
GAAACAGGAT GATGCGCCCA TATACAATGA CTTTCTTGAA CATGTGCGTA ATGTGAGACC 3420
AGATCTTGAG GAAACTCTCT TGTACATGGC TGGCGTAGAA GTTGTTTCAA CACAGGCTAA 3480
GTCAGCGGTT CAGATTCAAT TCGAGAAAAT TATAGCTGTG TTGGCGCTGC TTACCATGTG 3540
CTTTGACGCC GAAAGAAGCG ATGCCATTTT CAAGATTTTG ACAAAACTCA AAACAGTTTT 3600
TGGTACGGTT GGAGAAACGG TCCGACTTCA AGGGCTTGAA GACATTGAAA GCTTGGAGGA 3660
CGATAAAAGA CTCACAATTG ATTTTGATAT TAACACGAAC GAGGCTCAAT CGTCAACAAC 3720
ATTTGATGTC CATTTTGATG ACTGGTGGAA TCGGCAACTA CAGCAAAATC GCACAGTTCC 3780
ACATTACAGG ACCACAGGCA AATTCCTTGA ATTTACCAGA AATACTGCAG CTTTTGTGGC 3840
CAATGAAATA GCATCATCAA GTGAGGGAGA GTTCTTAGTT AGAGGAGCAG TAGGTTCTGC 3900
AAAATCAACG AGCTTACCTG CACATCTTGC CAAGAAGGGT AAGGTGTTAC TACTCGAACC 3960
TACACGCCCT TTGGCGGAGA ATGTTAGTAG ACAGTTAGCA GGTGATCCTT TCTTTCAAAA 4020
CGTTACACTC AGAATGAGAG GGTTAAGTTG TTTTGGTTCA AGCAATATTA CAGTGATGAC 4080
GAGTGGATTT GCTTTTCACT ACTATGTTAA CAATCCACAT CAATTGATGG AATTTGACTC 4140
TGTCATCATA GACGAGTGCC ATGTCACAGA CAGTGCGACC ATAGCTTTCA ATTGTGCACC 4200
TAAAGAGTAC AACTTTGCTG GCAAATTGAT TAAAGTGTCT GCAACGCCGC CAGGGAGAGA 4260
GTGCGATTTC GATACGCAAT TCGCGGTGAA AGTCAAAACA GAGGACCATC TTTCATTCCA 4320
TGCATTCGTT GGCGCACAGA AGACTGGTTC AAATGCTGAC ATGGTTCAGC ATGGTAATAA 4380
CATACTTGTG TATGTTGCAA GTTACAACGA AGTGGACATG CTCTCTAAGT TACTCACTGA 4440
GCGCCAATTT TCAGTTACAA AGGTAGATGG GCGAACAATG CAGCTTGGAA AAACTACCAT 4500
TGAAACGCAT GGAACTAGCC AAAAGCCCCA TTTCATAGTA GCTACAAACA TCATCGAGAA 4560
TGGAGTGACG TTGGATGTTG AGTGTGTTGT TGATTTTGGA CTAAAAGTGG TCGCAGAACT 4620
GGACAGCGAA AATCGGTGTG TGCGCTACAA TAAGAAATCA GTTAGTTATG GAGAGAGGAT 4680
TCAGCGACTA GGAAGAGTGG GGAGATCTAA GCCTGGAACT GCATTGCGTA TAGGGCACAC 4740
AGAAAAAGGC ATCGAAACCA TTCCTGAATT CATTGCCACA GAAGCAGCAG CCTTATCATT 4800
TGCATATGGG CTTCCAGTCA CCACACATGG AGTTTCCACA AATATACTTG GAAAGTGCAC 4860
AGTTAAACAG ATGAAATGTG CTTTGAACTT TGAGCTAACT CCTTTCTTCA CCACTCATTT 4920
AATCCGTCAT GATGGTAGTA TGCATCCACT AATACACGAA GAATTGAAGC AGTTCAAACT 4980
CAGGGATTCA GAAATGGTGC TCAACAAGGT TGCATTACCT CATCAATTTG TGAGCCAATG 5040
GATGGATCAA AGTGAGTATG AACGCATTGG AGTGCACGTT CAATGCCATG AGAGCACACG 5100
CATACCTTTT TACACAAATG GAATACCTGA TAAAGTCTAT GAGAGAATTT GGAAGTGCAT 5160
ACAAGAAAAC AAGAACGATG CGGTTTTTGG TAAGCTTTCA AGTGCTTGTT CAACTAAGGT 5220
TAGTTATACA CTTAGCACTG ATCCAGCAGC ATTACCCAGA ACTATTGCAA TCATCGATCA 5280
CCTGCTTGCC GAGGAAATGA TGAAGCGGAA TCACTTCGAC ACTATCAGCT CAGCTGTAAC 5340
GGGCTATTCA TTTTCCCTTG CTGGAATTGC TGATTCTTTC AGGAAGAGAT ACATGCGCGA 5400
TTACACAGCG CACAACATTG CAATTCTCCA ACAAGCACGT GCCCAGCTGC TTGAATTTAA 5460
TAGTAAGAAT GTGAACATTA ACAATCTGTC CGATTTAGAA GGAATTGGAG TCATTAAGTC 5520
GGTGGTGTTG CAAAGTAAGC AAGAGGTCAG CAGTTTCCTC GGACTTCGCG GTAAATGGGA 5580
TGGAAAGAAA TTTGCGAATG ATGTGATATT GGCGATTATG ACACTCTTAG GAGGTGGGTG 5640
GTTCATGTGG GAATACTTCA CGAAAAAGAT CAATGAACCC GTGCGCGTTG AAAGCAAGAA 5700
ACGTCGATCT CAAAAATTGA AATTCAGGGA TGCGTACGAT AGAAAAGTTG GACGTGAGAT 5760
TTTTGGTGAT GATGATACAA TTGGGCGCAC TTTCGGCGAA GCTTACACGA AGAGAGGAAA 5820
GGTCAAAGGA AACAACAACA CAAAAGGAAT GGGACGGAAA ACTCGCAATT TTGTGCATTT 58880
ATATGGTGTG GAGCCTGAGA ATTACAGTTT TATCAGATTT GTGGACCCTC TCACTGGCCA 59940
TACATTGGAC GAAAGCACCC ATACAGACAT ATCGTTAGTG CAGGAGGAGT TTGGAAGTAT 6000
TAGAGAGAAA TTTCTGGAGA ATGATTTGAT CTCGAGGCAG TCTATTATCA ACAAACCCGG 6060
CATTCAGGCA TATTTTATGG GCAAGGGCAC TGAAGAAGCA CTCAAAGTTG ACTTGACTCC 6120
TCATGTACCA TTGCTTCTGT GCAGAAACAC CAATGCTATT GCGGGATACC CAGAGAGAGA 6180
ACTTTTATTC CAAAGTTGTG AAAGGTTGTT CAATGGCTAC AAAGGTCTGT GGAATGGATC 7500
TTTAAAGGCC GAGCTCAGGC CGCTTGAGAA AGTCAGGGCT AACAAAACAC GAACCTTTAC 7560
AGCAGCGCCA ATTGATACAT TGCTTGGAGC TAAAGTTTGT GTGGATGATT TCAACAATGA 7620
GTTCTACAGG AAAAACCTCA AGTGTCCATG GACGGTCGGC ATGACAAAAT TTTATGGTGG 7680
TTGGGATAAA TTGATGAGAT CATTACCTGA TGGTTGGTTG TATTGTCATG CTGATGGATC 7740
ACAGTTCGAT AGTTCGTTAA CCCCAGCCTT ACTGAACGCA GTGCTCATAA TCAGGTCATT 7800
TTATATGGAG GATTGGTGGG TCGGCCAAGA GATGCTTGAA AATCTTTATG CCGAGATTGT 7860
GTACACTCCA ATTCTTGCTC CTGATGGAAC AATTTTCAAG AAATTTAGAG GTAACAACAG 7920
TGGGCAACCC TCAACAGTGG TGGATAACAC ACTAATGGTT GTGATCTCTA TTTACTATGC 7980
GTGCATGAAA TTTGGTTGGA ACTGCGAGGA GATTGAGAAT AAACTTGTCT TCTTTGCAAA 8040
TGGAGATGAT CTGATACTTG CAGTCAAAGA TGAGGATAGC GGCTTACTTG ATAACATGTC 8100
ATCCTCTTTT TGCGAACTTG GACTGAATTA TGATTTTTCA GAACGTACGC ATAAAAGAGA 8 160
AGATCTTTGG TTCATGTCCC ACCAAGCAAT GCTAGTTGAT GGAATGTACA CTCCAAAACT 8220
CGAGAAAGAG AGAATTGTTT CAATTCTAGA GTGGGATAGA AGCAAAGAAA TTATGCACCG 8280
AACAGAGGCT ATTTGCGCTG CGATGATTGA GGCATGGGGG CACACCGAGC TCTTGCAAGA 8340
AATCAGAAAG TTTTACCTAT GGTTCGTTGA AAAAGAAGAG GTGCGAGAAT TGGCACCCCT 8400
CGGAAAAGCT CCATACATAG CTGAGACAGC ACTTCGTAAG TTATACACTG ACAAGGGAGC 8460
AGATACAAGT GAACTGGCAC GCTACCTACA AGCCCTCCAT CAAGATATCT TCTTTGAGCA 8520
AGGAGACACT GTGATGCTCC AATCAGGCAC TCAGCCAACT GTGGCAGATG CTGGAGCTAC 8580
AAAGAAAGAT AAAGAAGATG ACAAAGGGAA AAACAAGGAC GTTACAGGCT CCGGCTCAGG 8640
Claims
1. A method for detecting a protease
inhibitor, said method comprising:
(a) providing a population of host cells expressing a first nucleic acid sequence encoding a protease and a second nucleic acid sequence encoding a protein capable of conferring a selectable phenotype on said host cells dependent on the activity of said protease;
(b) providing a pool of nucleic acid constructs wherein at least one of said constructs in said pool comprises a nucleic acid sequence encoding an inhibitor of said protease;
(c) transforming said host cells of (a) with said nucleic acid constructs of (b); and
(d) growing said transformed host cells of (c) under conditions that distinguish cells with said
selectable phenotype, thereby detecting the presence of said protease inhibitor.
2. The method of claim 1 wherein said host cells are bacterial cells.
3. The method of claim 2 wherein said
selectable phenotype is the ability of said bacterial cells to grow in the presence of a given antibiotic.
4. The method of claim 3 wherein said second nucleic acid sequence comprises a nucleic acid sequence encoding E. coli ribosomal protein S12 and said
antibiotic is streptomycin.
5. The method of claim 2 wherein said first nucleic acid sequence comprises a nucleic acid sequence encoding ZYMV 49 kDa protease.
6. The method of claim 2 wherein said
selectable phenotype is the ability of said transformed host cells to grow in the presence of a given carbon source.
7. The method of claim 1 wherein said second nucleic acid sequence comprises a nucleic acid sequence encoding a protein which is inactivated by said protease.
8. The method of claim 1 wherein said second nucleic acid sequence comprises a nucleic acid sequence encoding a protein which is activated by said protease.
9. A DNA construct comprising:
(a) a first DNA coding sequence for a protein capable of conferring a selectable phenotype on a host cell transformed therewith, said selectable phenotype dependent on the activity of a protease; and
(b) control sequences that are operably linked to said first and second coding sequences whereby said coding sequences can be transcribed and translated in a host cell, and at least one of said control sequences is heterologous to at least one of said coding sequences.
10. The DNA construct of claim 9 further comprising a second DNA coding sequence for said
protease.
11. The DNA construct of claim 10 wherein said protease is ZYMV 49 kDa protease.
12. The DNA construct of claim 9 wherein said first DNA coding sequence codes for E. coli ribosomal protein S12 and said selectable phenotype is streptomycin resistance.
13. The DNA construct of claim 10 wherein said first DNA coding sequence codes for E. coli ribosomal protein S12 and said selectable phenotype is streptomycin resistance.
14. The DNA construct of claim 11 wherein said first DNA coding sequence codes for E. coli ribosomal protein S12 and said selectable phenotype is streptomycin resistance.
15. A DNA construct comprising:
(a) a DNA coding sequence for a protein capable of inhibiting the action of a given protease, said protein identified by the method of claim 1; and
(b) control sequences that are operably linked to said coding sequence whereby said coding sequence can be transcribed and translated in a host cell, and at least one of said control sequences is heterologous to at least said coding sequence.
16. The DNA construct of claim 15 wherein said protease is ZYMV 49 kDa protease.
17. A host cell stably transformed with a DNA construct according to claim 9.
18. A host cell stably transformed with a DNA construct according to claim 10.
19. The host cell of claim 18 further
transformed with a DNA construct comprising:
(a) a DNA coding sequence for a protein capable of inhibiting the action of a given protease, said protein identified by a method comprising
(i) providing a population of host cells expressing a first nucleic acid sequence encoding a protease and a second nucleic acid sequence encoding a protein capable of conferring a selectable phenotype on said host cells dependent on the activity of said
protease;
(ii) providing a pool of nucleic acid constructs wherein at least one of said constructs in said pool comprises a nucleic acid sequence encoding an inhibitor of said protease;
(iii) transforming said host cells of (i) with said nucleic acid constructs of (ii); and
(iv) growing said transformed host cells of (iii) under conditions that distinguish cells with said selectable phenotype, thereby detecting the presence of said protease inhibitor; and
(b) control sequences that are operably linked to said coding sequence whereby said coding sequence can be transcribed and translated in a host cell, and at least one of said control sequences is heterologous to at least said coding sequence.
20. A host cell stably transformed with a DNA construct according to claim 11.
21. The host cell of claim 20 further transformed with a DNA construct comprising: (a) a DNA coding sequence for a protein capable of inhibiting the action of a given protease, said protein identified by a method comprising
(i) providing a population of host cells expressing a first nucleic acid sequence encoding a protease and a second nucleic acid sequence encoding a protein capable of conferring a selectable phenotype on said host cells dependent on the activity of said protease;
(ii) providing a pool of nucleic acid constructs wherein at least one of said constructs in said pool comprises a nucleic acid sequence encoding an inhibitor of said protease;
(iii) transforming said host cells of (i) with said nucleic acid constructs of (ii); and
(iv) growing said transformed host cells of (iii) under conditions that distinguish cells with said selectable phenotype, thereby detecting the presence of said protease inhibitor; and
(b) control sequences that are operably linked to said coding sequence whereby said coding sequence can be transcribed and translated in a host cell, and at least one of said control sequences is heterologous to at least said coding sequence.
22. A host cell stably transformed with a DNA construct according to claim 12.
23. A host cell stably transformed with a DNA construct according to claim 13.
24. The host cell of claim 23 further transformed with a DNA construct comprising: (a) a DNA coding sequence for a protein capable of inhibiting the action of a given protease, said protein identified by a method comprising
(i) providing a population of host cells expressing a first nucleic acid sequence encoding a protease and a second nucleic acid sequence encoding a protein capable of conferring a selectable phenotype on said host cells dependent on the activity of said
protease;
(ii) providing a pool of nucleic acid constructs wherein at least one of said constructs in said pool comprises a nucleic acid sequence encoding an inhibitor of said protease;
(iii) transforming said host cells of (i) with said nucleic acid constructs of (ii); and
(iv) growing said transformed host cells of (iii) under conditions that distinguish cells with said selectable phenotype, thereby detecting the presence of said protease inhibitor; and
(b) control sequences that are operably linked to said coding sequence whereby said coding sequence can be transcribed and translated in a host cell, and at least one of said control sequences is heterologous to at least said coding sequence.
25. A host cell stably transformed with a DNA construct according to claim 14.
26. The host cell of claim 25 further transformed with a DNA construct comprising:
(a) a DNA coding sequence for a protein capable of inhibiting the action of ZYMV 49 kDa protease, said protein identified by a method comprising (i) providing a population of host cells expressing a first nucleic acid sequence encoding a protease and a second nucleic acid sequence encoding a protein capable of conferring a selectable phenotype on said host cells dependent on the activity of said
protease;
(ii) providing a pool of nucleic acid constructs wherein at least one of said constructs in said pool comprises a nucleic acid sequence encoding an inhibitor of said protease;
(iii) transforming said host cells of (i) with said nucleic acid constructs of (ii); and
(iv) growing said transformed host cells of (iii) under conditions that distinguish cells with said selectable phenotype, thereby detecting the presence of said protease inhibitor; and
(b) control sequences that are operably linked to said coding sequence whereby said coding sequence can be transcribed and translated in a host cell, and at least one of said control sequences is heterologous to at least said coding sequence.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US72783791A | 1991-07-09 | 1991-07-09 | |
US727,837 | 1991-07-09 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1993001305A1 true WO1993001305A1 (en) | 1993-01-21 |
Family
ID=24924278
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1992/005745 WO1993001305A1 (en) | 1991-07-09 | 1992-07-09 | A method for the identification of protease inhibitors |
Country Status (2)
Country | Link |
---|---|
AU (1) | AU2377392A (en) |
WO (1) | WO1993001305A1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1996034976A1 (en) * | 1995-05-01 | 1996-11-07 | Vertex Pharmaceuticals Incorporated | Methods, nucleotide sequences and host cells for assaying exogenous and endogenous protease activity |
US6117639A (en) * | 1998-08-31 | 2000-09-12 | Vertex Pharmaceuticals Incorporated | Fusion proteins, DNA molecules, vectors, and host cells useful for measuring protease activity |
WO2002018588A1 (en) * | 2000-08-29 | 2002-03-07 | Novozymes A/S | Method for screening highly active proteases and inhibitors |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0321973A2 (en) * | 1987-12-23 | 1989-06-28 | BOEHRINGER INGELHEIM INTERNATIONAL GmbH | Expression of the virally encoded protease P2A of HRV2 |
-
1992
- 1992-07-09 WO PCT/US1992/005745 patent/WO1993001305A1/en active Application Filing
- 1992-07-09 AU AU23773/92A patent/AU2377392A/en not_active Abandoned
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0321973A2 (en) * | 1987-12-23 | 1989-06-28 | BOEHRINGER INGELHEIM INTERNATIONAL GmbH | Expression of the virally encoded protease P2A of HRV2 |
Non-Patent Citations (3)
Title |
---|
GENE, Volume 15, issued 1981, D. DEAN, "A Plasmid Cloning Vector for the Direct Selection of Strains Carrying Recombinant Plasmids". * |
JOURNAL OF GENERAL VIROLOGY, Volume 71, issued 1990, R. GRUMET et al., "cDNA Cloning and Sequence Analysis of the 3'-Terminal Region of Zucchini Yellow Mosaic Virus RNA", see especially Figure 1. * |
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES, Volume 87, issued December 1990, E.Z. BAUM et al., "Beta-Galactosidase Containing a Human Immunodeficiency Virus Protease Cleavage Site is Cleaved and Inactivated by Human Immunodeficiency Virus Protease", pages 10023-10027. * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1996034976A1 (en) * | 1995-05-01 | 1996-11-07 | Vertex Pharmaceuticals Incorporated | Methods, nucleotide sequences and host cells for assaying exogenous and endogenous protease activity |
US5861267A (en) * | 1995-05-01 | 1999-01-19 | Vertex Pharmaceuticals Incorporated | Methods, nucleotide sequences and host cells for assaying exogenous and endogenous protease activity |
US6117639A (en) * | 1998-08-31 | 2000-09-12 | Vertex Pharmaceuticals Incorporated | Fusion proteins, DNA molecules, vectors, and host cells useful for measuring protease activity |
US6528276B1 (en) | 1998-08-31 | 2003-03-04 | Vertex Pharmaceuticals Inc. | Fusion proteins, DNA molecules, vectors, and host cells useful for measuring protease activity |
WO2002018588A1 (en) * | 2000-08-29 | 2002-03-07 | Novozymes A/S | Method for screening highly active proteases and inhibitors |
Also Published As
Publication number | Publication date |
---|---|
AU2377392A (en) | 1993-02-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Sasaki et al. | An insect picorna-like virus, Plautia stali intestine virus, has genes of capsid proteins in the 3′ part of the genome | |
US5955647A (en) | Method for using tobacco mosaic virus to overproduce peptides and proteins | |
Bujarski et al. | Genetic recombination between RNA components of a multipartite plant virus | |
EP0989863B1 (en) | System for the in vivo delivery and expression of heterologous genes in the bone marrow | |
Verchot et al. | Evidence that the potyvirus P1 proteinase functions in trans as an accessory factor for genome amplification | |
Rao et al. | Molecular studies on bromovirus capsid protein: II. Functional analysis of the amino-terminal arginine-rich motif and its role in encapsidation, movement, and pathology | |
JP5995926B2 (en) | Promoterless cassette for expression of alphavirus structural proteins | |
AU634168B2 (en) | Potyvirus coat protein genes and plants transformed therewith | |
Kollàr et al. | Defective interfering RNA-mediated resistance against cymbidium ringspot tombusvirus in transgenic plants | |
Urcuqui-Inchima et al. | Potyvirus helper component-proteinase self-interaction in the yeast two-hybrid system and delineation of the interaction domain involved | |
US6177075B1 (en) | Insect viruses and their uses in protecting plants | |
WO1993001305A1 (en) | A method for the identification of protease inhibitors | |
Xu et al. | Rice dwarf phytoreovirus segment S11 encodes a nucleic acid binding protein | |
EP0938574B1 (en) | Method for inducing viral resistance into a plant | |
KIM et al. | Expression, purification, and identification of a novel self-cleavage site of the NIa C-terminal 27-kDa protease of turnip mosaic potyvirus C5 | |
AU692166B2 (en) | Lettuce infectious yellows virus genes | |
Jewell et al. | Limited expression of poliovirus by vaccinia virus recombinants due to inhibition of the vector by proteinase 2A | |
US7078586B2 (en) | Papaya ringspot virus genes | |
US6005166A (en) | Papaya ringspot virus replicase gene | |
US7045685B2 (en) | Insect viruses and their uses in protecting plants | |
AU2002307322A1 (en) | Papaya ringspot virus genes | |
Sarkar et al. | Creation of leaderless FMDV replicon for development of replication defective virus (leaderless FMDV): A strategy towards the development of attenuated vaccine with marker facility | |
US5514570A (en) | Squash mosaic virus genes and plants transformed therewith | |
Hackland | The development of transgenic plants resistant to cucumber mosaic virus and tobacco necrosis virus | |
EP1309708A2 (en) | Method for altering degradation of engineered protein in plant cells |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AU CA JP KR RU |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH DE DK ES FR GB GR IT LU MC NL SE |
|
122 | Ep: pct application non-entry in european phase | ||
NENP | Non-entry into the national phase |
Ref country code: CA |