WO2005036180A1 - Methodes d'analyse utilisant des biomarqueurs concentres avec des molecules d'attraction de biomarqueurs - Google Patents
Methodes d'analyse utilisant des biomarqueurs concentres avec des molecules d'attraction de biomarqueurs Download PDFInfo
- Publication number
- WO2005036180A1 WO2005036180A1 PCT/US2004/033305 US2004033305W WO2005036180A1 WO 2005036180 A1 WO2005036180 A1 WO 2005036180A1 US 2004033305 W US2004033305 W US 2004033305W WO 2005036180 A1 WO2005036180 A1 WO 2005036180A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- biomarker
- molecule
- attractant
- molecules
- atfractant
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6803—General methods of protein analysis not limited to specific proteins or families of proteins
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/574—Immunoassay; Biospecific binding assay; Materials therefor for cancer
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/574—Immunoassay; Biospecific binding assay; Materials therefor for cancer
- G01N33/57484—Immunoassay; Biospecific binding assay; Materials therefor for cancer involving compounds serving as markers for tumor, cancer, neoplasia, e.g. cellular determinants, receptors, heat shock/stress proteins, A-protein, oligosaccharides, metabolites
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6803—General methods of protein analysis not limited to specific proteins or families of proteins
- G01N33/6848—Methods of protein analysis involving mass spectrometry
Definitions
- the invention relates to analysis of molecules present in biological fluids. More specifically, the invention relates to the use of biomarker attractant molecules to concentrate low abundance biomarkers such as peptides, proteins and protein fragments for analysis.
- Biological fluids such as blood and lymph are the repositories of a vast number of molecules that are excreted or otherwise shed by cells.
- the molecules that are present in biological fluids reflect the physiological and pathological states of the cells that are contacted by the fluids or which were contacted by the fluids from which they are derived (for example, lymphatic fluid enters the blood through the thoracic duct). Therefore, a major goal of clinical diagnostics is to correlate the particular molecules (biomarkers) present in biological fluids with particular disease states. Although the biomarker approach to diagnostics holds great promise, the number of biomarkers that are reaching routine clinical use remains low.
- the low molecular mass range (for example, between about 0.3 kDa and 12 kDa) of the plasma proteome remains largely uncharacterized, principally because no adequate method for discovering and utilizing biomarkers in this mass range currently exists.
- the lower limit of effective resolution achieved by conventional 2-D gel electrophoresis (2D-GE) is about 10 kDa.
- mass spectrometry (MS) exhibits its optimal performance in the low molecular mass range, it is nonetheless inadequate for comprehensive analysis of plasma proteins because the dynamic range of concentrations of proteins in plasma is much greater than the dynamic range of the method.
- MS mass spectrometry
- Enzymatic treatment may, however, destroy or mask the information content of the sample by cleaving disease biomarkers and by creating large quantities of interfering enzymatic fragments from high abundance proteins.
- mass spectrometry for biomarker analysis have turned to the native undigested serum proteome as a launch point for biomarker discovery (see, for example, Petricoin et al., "Use of Proteomic Patterns in Serum and Identification of Ovarian Cancer," Lancet, 359: 572-577, 2002).
- PSA prostate-specific antigen
- prostatic tissue first identified in prostatic tissue, was later found to be present in significant part as a complex with ⁇ antichymotrypsin (See, for example, Wang et al., Invest Urol., 17: 159-63, 1979 and Lilja et al, Clin. Chem., 37: 1618-1625, 1991).
- the existence of these two forms of PSA has both complicated diagnostic assays for prostate cancer, and been exploited in such assays to provide better differentiation between benign prostatic hypertrophy and prostate cancer (see, for example, Peter et al., Clinical Chemistry, 46: 474-482, 2000).
- PSP 94 prostate secretory protein of 94 amino acids
- serum proteins See, for example, Dube et al, J. Androl., 8: 182-189, 1987; Wu et al., J. Cell Biochem., 76: 71-83, 1999; and Guo et al., "Serum Bound Forms of PSP94 in Prostate Cancer Patients," J. Cell Biochem., 76: 71-83, 1999).
- the existence of multiple forms of PSP94 has complicated the development of assays for this biomarker (See, for example, U.S. Patent No. 6,107,103).
- biomarkers exist as complexes in serum, it has been not reported that complexes of molecules adhered to each other in biological fluids can be separated from biological fluids and probed for the presence of potential biomarkers.
- biomarker attractant molecules Molecules and substances which bind and concentrate low abundance, low molecular weight biomarkers are referred to herein as "biomarker attractant molecules.”
- a method is disclosed for discovering or identifying potential biomarkers by separating biomarker attractant molecules from biological fluids and detecting the molecules that are adhered to the biomarker attractant molecules, for example, by separating or isolating the biomarker from the biomarker attractant molecules and analyzing those molecules. This method is contrary to known methods of biomarker discovery in that high abundance molecules present in the biological fluid are not separated out of the biological fluid to improve analysis. Rather, the adherence of biomarkers to such high abundance molecules is exploited to increase detection sensitivity.
- a naturally occurring biomarker attractant molecule is isolated from a biological fluid obtained from different populations of subjects exhibiting different biological states (such as the presence and absence of a type of cancer). Molecules differentially bound to the biomarker attractant molecule between the populations are then detected and identified as biomarkers that discriminate between the biological states.
- a biomarker attractant molecule is added to the biological fluid of the subjects and allowed to concentrate low molecular weight molecules for some period of time (such as more than a day). After the period of time, the biomarker attractant molecule is isolated from the biological fluid of the subjects and analyzed for bound biomarkers that discriminate between the biological states.
- the cellular proteins that are the source of the fragments have, in some instances, already been correlated with a particular biological state (such as a disease state or pathological state) in a particular location (such as a particular organ, sub-anatomic structure, cell or organelle).
- a diagnostic method is disclosed in which biomarker attractant molecules are separated from a biological fluid and biomarkers adhered to the biomarker attractant molecule are detected to determine the biological state of a subject.
- proteins, fragments thereof, and combinations of such proteins and protein fragment that have been identified as biomarkers are used to provide a diagnosis for a subject.
- biomarker attractant molecules that are separated from a biological fluid are analyzed for the presence of fragments of particular cellular proteins that are known to correlate with particular biological states, such as biological states in particular anatomic or physiologic locations.
- Such fragments that might otherwise be cleared from the biological fluid and that would otherwise never reach detectable levels are surprisingly concentrated by adherence to biomarker attractant molecules and therefore can be detected.
- biomarker attractant molecule bound fragments of proteins associated with hypoxia and/or apoptosis in the heart muscle are detected and used to diagnose myocardial ischemia or a myocardial infarction (heart attack). Concentration of such fragments by biomarker attractant molecules facilitates their detection.
- particular diagnostic fragment/biomarker attractant molecule complexes or biomarker/biomarker attractant molecule complexes may then be more sensitively detected through further amplification using, for example, enzyme-linked antibodies that recognize the fragment/biomarker attractant complex.
- a capture surface that specifically binds one or more biomarker attractant molecules is used as a bait surface to separate the biomarker attractant molecule(s) from a biological fluid.
- Antibodies directed to one or more of the diagnostic fragments, biomarkers, or complexes thereof with the biomarker attractant molecule, are then applied to the captured complexes.
- Binding of the antibodies to the adhered complexes can be detected using any known antibody detection method including colorimetric methods, chemiluminescent methods, fluorescence methods, evanescent wave detection methods, electrochemical methods, magnetic methods or electrical methods.
- the capture surface is used as a target for laser deso ⁇ tion and molecules bound to the separated biomarker attractant molecules are analyzed by mass spectrometry.
- FIG. 1 is a schematic diagram of a process by which proteins shed by cells are clipped into fragments small enough to pass into the blood stream.
- FIG. 2 is a schematic diagram of a model of biomarker production and distribution between cellular and vascular compartments.
- FIG. 3 is a series of graphs showing the results of a theoretical model of biomarker amplification/concentration according to the disclosed method.
- FIG. 4 is a diagram illustrating a separation protocol used to generate the fractions (A-F) that were analyzed to generate the corresponding mass spectra of FIG. 5.
- FIG. 5 is a diagram illustrating the mass spectra obtained for each of the fractions illustrated in FIG. 4.
- the letters A-F denote the spectra obtained for fractions A-F as illustrated in FIG. 4.
- FIG. 6 is a series of SELDI-TOF mass spectra obtained for serum and fractions thereof that were derived from an ovarian cancer sera study set using the protocol shown in FIG. 4.
- FIG. 7 is a series of mass spectra in which the spectral region between 5000-7500 m/z has been magnified, and comparing albumin associated ions normalized for sample concentration and amplitude.
- FIG. 8 is series of SELDI-TOF mass spectra showing the dependence of ion amplitude on quantity of albumin captured (0.025-0.175 mg) from the serum.
- FIG. 9 is a schematic diagram of an exemplary process by which biomarker attractant molecules (denoted carrier with diagnostic cargo in the figure) may be isolated from biological fluids by binding the biomarker attractant molecule to capture agents inside of a flow cell, and the biomarkers bound to the biomarker attractant molecule are separated therefrom.
- FIG. 10 is a schematic diagram of a process by which the biomarkers bound to the biomarker attractant molecule immobilized by the process of FIG. 8 are eluted directly into a mass spectrometer for analysis, and the mass spectral data is used to detect previously sequenced protein fragments that are indicative of a subject's physiological and/or pathological profile.
- FIG. 10 is a schematic diagram of a process by which the biomarkers bound to the biomarker attractant molecule immobilized by the process of FIG. 8 are eluted directly into a mass spectrometer for analysis, and the mass spectral data is used to detect previously sequenced protein fragments that are indicative of a subject's
- FIG. 11 is a schematic diagram showing a process whereby a fluorescent labeled capture agent (in this case an antibody) immobilized to a silica substrate (A) is used to capture a biomarker attractant molecule (in this case a carrier protein) with biomarkers attached (B).
- a fluorescent labeled capture agent in this case an antibody
- A silica substrate
- B biomarker attractant molecule
- B biomarkers attached
- Light in this case UV light
- C immobilized antibody
- a laser pulse is used to desorb biomarkers directly from the biomarker attractant molecule, for example, after an energy absorbing molecule such as cinnapinic acid is used to coat the substrate with its bound molecules.
- FIG. 12 is a schematic diagram showing a method of diagnosis utilizing the information obtained from analysis of biomarkers that bind to a biomarker attractant molecule.
- biomarker attractant molecule refers to physical or chemical association of another molecule (such as a known or potential biomarker) with the biomarker attractant molecule.
- the association can be either direct or indirect, in that molecules can be adhered in one or more layers to a surface (external or internal) of the biomarker attractant molecule.
- molecules exhibiting hydrophilic and hydrophobic domains (an amphipathic molecule) on opposite sides of the molecule can adhere to a hydrophobic domain on a biomarker attractant molecule, thereby presenting the hydrophilic domain of the adhered molecule to the solution.
- a hydrophilic molecule can then adhere to the hydrophilic side of the amphipathic molecule, and thus become indirectly adhered to the biomarker attractant molecule.
- adhered molecules are molecules that do not form covalent bonds with the biomarker attractant molecule.
- the adhered molecule is bound non-specifically (that is, not to the substantial exclusion of other molecules) to one or more domains on the biomarker attractant molecule.
- the molecule is adhered to the biomarker attractant molecule with an association constant (K a ) of less than about 10 8 , for example, a K a of less than about IO 7 , such as less than about IO 6 or about 10 s .
- Adhered molecules also can be distinguished according to the strength of their association based upon the identity, strength (concentration), and or type of denaturant required to dissociate them from the biomarker attractant molecule.
- the term "biological fluid” refers to a liquid into which biomarkers are released, or a liquid derived from the liquid into which biomarkers are initially released. Such derivation may occur either in vivo or in vitro.
- the biological fluid is a circulating fluid such as blood or lymph, or a fraction thereof, such as serum or plasma.
- the biological fluid remains substantially in a particular locus, for example, synovial fluid, cerebrospinal fluid or interstitial fluid.
- the biological fluid is an excreted fluid, for example, urine, breast milk, saliva, sweat, tears, mucous, nipple aspirants, semen, vaginal fluid, pre-ejaculate and the like.
- a biological fluid also refers to a liquid in which cells are cultured in vitro such as a growth medium, or a liquid in which a cell sample is homogenized, such as a buffer.
- biomarker refers to one or more molecules (or signals due to such molecules in an analytical method such as mass spectrometry) that are differentially released into a biological fluid by any means (including secretion or by leakage through the cell membrane) in one or more subjects in each of two or more populations that exhibit different biological states.
- biomarkers include proteins, lipids, lipoproteins, glycoproteins, nucleic acids (such as circulating nucleic acids, see, for example, Krishna et al., Jpn. J. Clin. Oncol., 34: 307-311, 2004; Ngan et al., Ann. NY Acad. Sci.,, 1022: 263-270, 2004; Carstensen et si., Ann. NY Acad. Sci., 1022: 210-210, 2004; Watanagara et al., Ann. NY Acad. Sci., LU22: 90-99, 2004; Herman, "Circulating Methylated DNA,” Ann. NY Acad.
- biomarker atfractant molecule refers to a molecule, or other substance to which biomarkers in a biological fluid adhere.
- biomarkers adhere to a BAM with a low binding affinity (for example, a binding affinity of less than 10 "3 , 10 "4 , 10°, IO “6 , IO “7 or IO “8 L/mol-min).
- An antibody may be a BAM to the extent that it binds biomarkers, other than through the specific antigen antibody interaction that results from the immune response that stimulated its production.
- biomarker binding to an antibody BAM may occur outside of the complementarity defining region (CDR), or outside of the variable region altogether, for example by binding to the Fc portion of the antibody.
- the BAM is not an antibody.
- a particular BAM may selectively bind a class of biomarkers
- the binding affinities of the biomarkers in a particular class do not differ as significantly as the binding affinities of an antigen to a particular antibody compared to other non- recognized molecules.
- the less specific nature of biomarker binding may be illustrated in certain examples of the BAM in which more than one biomarker binds to the BAM, for example, at least 2, at least 5, at least 10, at least 20, or even 50 or more biomarkers bind to the BAM.
- BAMs have a half-life of existence in a particular biological fluid (for example in the body) that is longer than the half-life of biomarkers that become adhered to the BAMs and thereby concentrate the biomarker in the biological fluid.
- BAMs can have a half-life of greater than about 1 day, such as greater than 2, 5, 10, 20 or 50 days.
- the BAM has as size and/or shape such that it is not substantially filtered from the blood stream by the kidneys.
- the BAM has a molecular weight of greater than 25 kDa, for example, greater than 30, 50, 75, 100, 150, 200 or 300 kDa.
- the BAM molecule has a molecular weight falling within a particular range, for example between 30 and 50 kDa, between 50 and 75 kDa, between 75 and 100 kDa, between 100 and 150 kDa, between 150 and 200 kDa, between 200 and 300 kDa, or any other range between 30 kDa and 300 kDa.
- Biomarkers may adsorb to the surface or be absorbed into the interior of the BAM, or both.
- BAMs include proteins (including natural and engineered proteins such as chimeric proteins proteins with modified amino acid composition, proteins modified posfranslationally, nucleic acids, carbohydrate decorated molecules, and organic polymers), dendrimers and particles (such as microparticles and nanoparticles, including silica, metal, ceramic and carbohydrate microparticles and nanoparticles ), and cellular microparticles (see, for example, Diamant et al, Cellular microparticles: new players in the field of vascular disease?, EurJ Clin Invest. 34: 392-401, 2004).
- proteins including natural and engineered proteins such as chimeric proteins proteins with modified amino acid composition, proteins modified posfranslationally, nucleic acids, carbohydrate decorated molecules, and organic polymers
- dendrimers and particles such as microparticles and nanoparticles, including silica, metal, ceramic and carbohydrate microparticles and nanoparticles
- cellular microparticles see, for example, Diamant et al
- BAMs may be produced or derivatized to provide ionic groups (such as carboxylate, protonated arnine, quaternary ammonium, and sulfate groups), hydrogen-bond acceptors or hydrogen-bond donors, electron donors or electron acceptors, polar groups (such as amino, hydroxyl, ester, sulfhydryl and nitrile groups), hydrophobic groups (such as alkyl, alkenyl and alkynyl groups or groups with specific partition coefficients), peptides, proteins, nucleic acids, carbohydrates, lipids or any combination thereof, on their surfaces or in their interiors.
- ionic groups such as carboxylate, protonated arnine, quaternary ammonium, and sulfate groups
- hydrogen-bond acceptors or hydrogen-bond donors such as amino, hydroxyl, ester, sulfhydryl and nitrile groups
- hydrophobic groups such as alkyl, alkenyl and alkynyl groups or groups with
- the BAM is a protein, such as a naturally occurring protein, it may also be referred to as a "carrier protein” to reflect its role in collecting and concentrating LMM biomarkers from biological fluids.
- carrier proteins include albumin, iron binding proteins (such as transferrin), fibrinogen, alpha-2-macroglobulin, immunoglobulins (such as IgA, IgE and IgG), complement, haptoglobulin, lipoproteins, prealbumin, alpha- 1 -acid glycoprotein, fibronectin, and ceruloplasmin, and fragments, combinations and chemical derivatives thereof.
- carrier proteins include albumin, iron binding proteins (such as transferrin), fibrinogen, alpha-2-macroglobulin, immunoglobulins (such as IgA, IgE and IgG), complement, haptoglobulin, lipoproteins, prealbumin, alpha- 1 -acid glycoprotein, fibronectin, and ceruloplasmin, and fragments, combinations
- the feature can be a biological trait such as a genotypic trait or a phenotypic trait.
- the feature can be a physiological or disease trait, such as the presence, stage or absence of a particular disease, including infectious disease.
- the feature also can be a condition (environmental, social, psychological, time- dependent, etc.) to which the organism or cell has been exposed or not exposed, including exposure/non-exposure to a drug or exposure/non-exposure to a toxin.
- biological states include a pathologic diagnosis, a toxicity state, efficacy of a drug, prognosis of a disease, stage of a disease, condition of an organ, presence of a pathogen (such as a virus), or toxicity of one or more drugs.
- cancers include presence, absence or stage of cancers, such as carcinomas, melanomas, lymphomas, sarcomas, blastomas, leukemias, myelomas, and neural tumors.
- cancers further include prostate cancer, breast cancer, pancreatic cancer, ovarian cancer, testicular cancer and lung cancer.
- capture agent refers to an agent that preferentially binds or selects a particular molecule, substance or material from a sample to the substantial exclusion of others present in the sample.
- capture agents include antibodies, apatamers, and reactive groups that selectively react with a particular functional group present on the molecule, substance or material to be captured and not on others.
- the capture agent may be a magnet.
- the capture agent may be a molecular sieve or size exclusion material that either retains or passes the molecule, substance or material to be captured while passing or retaining others.
- the capture agent may be lectin that binds the sugar.
- denaturant refers to an agent that disrupts the adherence of at least a portion of biomarkers adhered to a biomarker attractant molecule.
- denaturants examples include high (>1M) concentration salt solutions, solutions comprising a chaotropic agent such as urea, guanidinium chloride, and acetonitrile, solutions having low pH (for example, pH ⁇ 5) or high pH (for example, pH>9), a high temperature solution (for example a solution having a temperature greater than 30 degrees celcius), an electrochemical perturbation, or application of electromagnetic radiation (for example, a laser pulse).
- a denaturant may, in some embodiments, be a series of solutions with a continuous or discontinuous gradient of increasing or decreasing salt concentration, increasing or decreasing concentration of a chaotropic agent, increasing or decreasing pH, increasing or decreasing temperature, or any combination thereof.
- a denaturant is other than a reducing agent such as ⁇ -mercaptoethanol or an amino alcohol or diamine compound (such as ethanolamine) that has at least two nucleophilic groups, at least one of which is an amine group.
- reducing agent such as ⁇ -mercaptoethanol or an amino alcohol or diamine compound (such as ethanolamine) that has at least two nucleophilic groups, at least one of which is an amine group.
- non-denaturant refers to an agent that does not disrupt binding of a biomarker to a biomarker attractant molecule. Examples of non-denaturants include low concentration solutions of salts such as buffers and substantially pure water, such as distilled or reverse-osmosis purified water.
- non-denaturing conditions refers to conditions do not significantly disrupt adherence of a biomarker to a biomarker attractant molecule.
- such conditions include those under which biomarkers adhered to a biomarker attractant molecule with an association constant (K a ) of greater than IO 4 , such as greater than IO 5 , 10 6 , IO 7 or 10 s are remain substantially adhered to the biomarker attractant molecule.
- K a association constant
- biomarkers having a rate of association with the biomarker attractant molecule of greater than 10 -6 L/mol min, such as greater than 10 "5 , IO "4 or 10 "3 L/mol min remain adhered to the biomarker atfractant molecule.
- Biomarker attractant molecules can be separated from a biological fluid under non-denaturing conditions to help reduce loss of biomarkers adhered to the biomarker attractant molecule during the separation.
- domain refers to a portion of a molecule that exhibits a particular chemical or physical property.
- a domain can be as small as a particular amino acid residue on a peptide or as large as the entire surface of a biomarker attractant molecule, although more typically a domain will represent some portion of the surface of biomarker attractant molecule.
- a chemical or physical property can be substantially homogeneous, for example, substantially hydrophobic or substantially hydrophilic.
- Domains also can exhibit a set of chemical or physical properties that lead to adherence of particular classes of molecules, for example, phosphorylated or ubiquitinated protein fragments.
- eluting refers to the act of disrupting intermolecular bonds (such as hydrogen bonds, dipole-dipole bonds, hydrophobic interactions and the like) between two molecules, substance, materials and the like, for example, a biomarker and a biomarker attractant molecule, and separating the two, for example, into separate containers or physical locations such as wells on a microtiter plate.
- identity molecule refers to a molecule for which an analytical method is available for identifying its presence in a sample, such as substantially (for example, to a greater than 98% or 99% certainty) or unambiguously identifying its presence in a sample.
- Identifiable molecules can, for example, be identified according to their mass (such by a single or multiple peaks in a mass spectrum or an electrophoretogram), according to their immunochemical properties (such as by direct or sandwich antibody detection), or according to their sequence (such as using a cDNA array or RT- PCR).
- at least an identifiable fragment of a protein refers to fragment of the protein that, if detected, identifies (such as substantially or unambiguously) the protein from which the fragment was derived.
- An identifiable fragment of a protein can include, for example, one or more proteolytic peptide sequences (such as tryptic peptide sequences), the masses or sequences of which can be used to identify the protein with substantial certainty (for example, with >98% or >99% probability), for example unambiguously (such as accurate mass tag).
- An identifiable fragment of a protein can include an epitopic sequence of amino acids that is recognized by one or more antibodies that also recognize the whole protein from which the fragment is derived.
- protein array analysis refers to any technique in which samples comprising proteins or proteins fragments are placed on a substrate in particular locations and probed in a location specific manner, for example, with fluorescent antibodies to the proteins. Protein array analysis is discussed in detail in Liotta et al, Method and Devices for the isolation and analysis of cellular protein content," U.S. Patent Application No. 09/913,667, which is inco ⁇ orated by reference herein.
- Mass spectrometry refers to any technique in which molecules are analyzed by converting them to gas phase ions and measuring the mass-to-charge ratio of the ions.
- mass spectrometric techniques include matrix-assisted laser deso ⁇ tion ionization time-of-flight (MALDI- TOF), surface-enhanced laser deso ⁇ tion/ionization time-of-flight (SELDI-TOF), electrospray ionization mass spectrometry (ESI-MS), and Fourier-transform ion-cyclotron resonance (FT-ICR), and related techniques.
- MALDI- TOF matrix-assisted laser deso ⁇ tion ionization time-of-flight
- SELDI-TOF surface-enhanced laser deso ⁇ tion/ionization time-of-flight
- ESI-MS electrospray ionization mass spectrometry
- FT-ICR Fourier-transform ion-cyclotron resonance
- population refers to one or more subjects, but is more typically two or more subjects, for example, many more, such as more than 10, 100, 500 or 1000.
- separating refers to dividing a substance (such as a biomarker attractant molecule) from other substances (such as other molecules in a biological fluid) in order to obtain a purified form of the substance, such as an isolated form of the substance.
- purifying is a relative term that does not require absolute purity and, for example, refers to separating a substance from other substances to provide a preparation wherein the separated substance represents at least 50% of the preparation.
- isolated refers to substantially separating a substance from other substances to provide an even more purified preparation, for example, a preparation in which the substance represents at least 90%, 95% or 98% or more of the preparation.
- size exclusion refers to any technique that separates molecules based upon their size, including liquid chromatographic techniques such as column and high-pressure liquid chromatography (HPLC) and electrophoretic techniques such as denaturing polyacrylamide electrophoresis.
- subject refers to an animal, and more particularly to a mammal (for example, a human or a veterinary animal such as a dog, a cat, a pig, a horse, a sheep, or a cow). ///. Overview Blood contains thousands of molecules shed from tissues into the blood stream. Similarly, molecules are shed into other biological fluids such as lymph fluid, interstitial fluid, synovial fluid, cerebrospinal fluid, saliva, seminal fluid and the like.
- molecules initially shed into one fluid may become part of another biological fluid.
- lymph fluid is ultimately introduced into blood and blood is filtered by the kidneys to provide urine.
- Many of the molecules carried by these fluids are fragments or enzymatically modified forms of normal functioning proteins emanating from all levels of cellular and extra-cellular compartments. Every cell in the body can be viewed as leaving a record of its physiologic state in its waste products, or the products of its interactions with neighboring cells (FIG. 1). As shown in FIG. 1, such waste products may, for example, enter the bloodstream by passing through the vascular wall. To enter the bloodstream, however, large proteins are often be degraded into fragments small enough to go through or between the endothelial cells lining the blood vessels.
- the lower molecular weight (LMW, such as less than about 12 kDa) proteome has not previously been systematically studied because this region of the proteome has been viewed as a non- interesting dumping ground of metabolic and proteolytic byproducts.
- traditional protein- based discovery tools such as 2-D gel electrophoresis cannot discriminate in the LMW range.
- previous methods for preparing serum for biomarker discovery efforts have destroyed, contaminated, or discarded, the LMW biomarker information archive by enzymatic treatment (e.g. trypsin digestion) or by removing the high abundance serum proteins, such as albumin.
- a method for identifying a biomarker in a biological fluid that includes separating a biomarker attractant molecule from samples of the biological fluid and detecting a biomarker that is differentially adhered to the biomarker atfractant molecules between different populations of subjects that exhibit two or more different biological states.
- the detected biomarker that is adhered to the biomarker attractant molecule in the biological fluid discriminates between the different populations of subjects, and therefore can be used to discriminate between the biological states of subjects. For example, detection of the biomarker adhered to the biomarker attractant molecule in another subject can indicate that the subject exhibits a particular biological state such as a disease state.
- the biomarker attractant molecule is separated from the biological fluid using a capture agent that specifically or ⁇ on-specifically binds the biomarker attractant molecule (such as an antibody anchored to a substrate or a substrate that preferentially attracts the biomarker attractant molecule to its surface), and thereby removes at least a portion of the biomarker atfractant molecule from the fluid.
- biomarker atfractant molecule can be separated from the biological fluid by size exclusion chromatography, for example, by cutoff at a particular molecular weight or by fractionation in to ranges of molecular weight.
- size exclusion chromatography for example, by cutoff at a particular molecular weight or by fractionation in to ranges of molecular weight.
- any method that can concentrate the biomarker attractant molecule away from the biological fluid can be used to provide isolated biomarker atfractant molecules for analysis of their adhered diagnostic cargo.
- the biomarker attractant molecule that is separated from the biological fluid will have a half-life in the biological fluid that is longer than the half-life of biomarker that it amplifies and concentrates.
- the biomarker attractant molecule can have a half-life in the biological fluid of greater than 1 day, for example, 3 days or more, such as 5 days, 10 days, 20 days, 30 days, 50 days, 100 days or more.
- biomarker atfractant molecules will have a molecular weight greater than about 25 kDa, for example, greater than about 40 kDa, such as greater than about 60 kDa or greater than about 80 kDa.
- the biomarker attractant molecule is selected from the group consisting of albumin, transferrin, f ⁇ brinogen, alpha-2- macroglobulin, an immunoglobulin, complement, haptoglobulin, a lipoprotein, prealbumin, alpha 1 acid glycoprotein, fibronectin, and ceruloplasmin, and fragments, chemical modifications and combinations thereof.
- biomarker attractant molecules can be obtained directly from a biological fluid, or can be Biomarker attractant molecules can be native to a subject's body or not native to a mammalian body (such as a human body) and introduced into the biological fluid of the subjects in the different populations to harvest diagnostic information from the biological fluid.
- native biomarker attractant molecules such as blood proteins
- native biomarker attractant molecules can be extracted from a subject, stripped of bound molecules, and reintroduced to harvest a new constellation of bound molecules.
- non-native biomarker attractant molecules include nanoparticles formed, for example, from silica, a ceramic, a magnetic metal oxide, or a biodegradable material.
- the biomarker atfractant molecule also can include a dendrimer.
- nanoparticles and dendrimers can be chemically modified to provide surfaces having particular chemical/physical properties.
- nanoparticles can be modified to include at least one region (or domain) on their surface that is substantially hydrophobic and at least one region (or domain) on their surface that is substantially hydrophilic such that hydrophobic and hydrophilic biomarkers become adhered to the regions of the surface, respectively.
- Another way to modify the surface properties of a non-native biomarker attractant molecule is to conjugate it with a biological molecule, such as a protein or a protein fragment. Conjugated biomolecules can be chosen to specifically or non-specifically adhere particular classes of biomarkers.
- serum proteins such as albumin, transferrin, fibrinogen, alpha 2 macroglobulin, immunoglobulins, complement, haptoglobulin, a lipoprotein, prealbumin, ⁇ lpha 1 acid glycoprotein, fibronectin, and ceruloplasmin, or fragments, chemical modifications and combinations thereof
- conjugation of such proteins to, for example, a nanoparticle or a dendrimer can provide a molecular "mop" that can be especially useful when the goal is to identify biomarkers in a biological fluid.
- Biomarker attractant molecules also can be produced to include properties or groups of atoms that facilitate their separation from a biological fluid, such as magnetism or an affinity tag such as a poly-histidine affinity tag or a sugar recognized by a lectin. Biomarkers can be harvested from any biological fluid through adherence to biomarker attractant molecules.
- biological fluids include blood, plasma, serum, lymph fluid, synovial fluid, cerebrospinal fluid, breast milk, nipple aspirants, sweat, tears, saliva, mucous, pre-ejaculate, semen, vaginal fluid, a cell culture medium, or interstitial fluid, or a fraction thereof, a combination thereof, or a fluid derived therefrom.
- biomarkers While it is possible to detect biomarkers by directly probing a separated biomarker attractant molecule for the presence of adhered molecules (for example, with panels of antibodies to known proteins), in some embodiments, it is advantageous to separate at least a portion of the molecules adhered to the biomarker attractant molecule from the biomarker attractant molecule prior to facilitate their detection.
- Molecules can be separated from a biomarker attractant molecule by dissociating them from the biomarker attractant molecule. This can be accomplished, for example, by contacting the biomarker attractant molecule with a denaturant to elute bound molecules from the biomarker attractant molecule. In this embodiment, it may be advantageous to immobilize the biomarker attractant molecule with a capture agent.
- Elution of biomarkers can be partial or complete, and can include a gradient elution with increasingly stronger denaturants and fractionation of the molecules eluted from the biomarker atfractant molecule during the gradient elution.
- molecules that are eluted from a biomarker atfractant molecule are further separated from the biomarker atfractant molecule by size exclusion chromatography.
- An alternative method of dissociating molecules from biomarker attractant molecules is to directly dissociate them using using a laser. Molecules adhered to biomarker attractant molecules that are discriminatory between biological states and hence diagnostic for particular biological states can be identified according to the disclosed methods. There is virtually no limit to the biological states that can be discriminated using the methods.
- Biomarker attractant molecules can be detected by any method, but some methods include mass specfrometric detection (which can be coupled to pattern recognition, especially for discovery of complex biomarkers comprising multiple molecule or identification of such complex biomarkers). Particular methods of mass specfrometric detection include MALDI- TOF, SELDI-TOF, FT-ICR, and ESI-MS.
- biomarker atfractant molecules include protein array analysis, which also can be coupled to pattern recognition. Other methods include immunochemical analysis of molecules adhered to the biomarker attractant molecule, either with or without dissociation of the molecules from the biomarker attractant molecule.
- the diverse methods of detection that can be used in the disclosed methods make it possible to identify and utilize biomarkers of diverse types.
- the biomarker comprises a protein, a lipid, a carbohydrate or a nucleic acid, or a fragment or combination thereof.
- the biomarker includes two or more different molecules adhered to the biomarker attractant molecule, for example 5 or more, such as more than 10, 20 or 50.
- the biomarker includes two or more different identifiable proteins or protein fragments.
- the biomarker adheres to the biomarker attractant molecule non- covalently, for example, with a binding constant of less than 10 s M "1 .
- the biomarker attractant molecule adheres and thereby concentrates the biomarker because the concentration of the biomarker attractant molecule in the biological fluid is at least 10 times greater (for example, at least 100 times greater, such as at least 10 3 , IO 4 , IO 5 or IO 6 times greater) than the concentration of the biomarker in the biological fluid.
- the biomarker atfractant molecule further concentrates the biomarker because the biomarker attractant molecule has a half-life of greater than 1 day in the biological fluid.
- the biomarker attractant molecule circulates in the biological fluid and collects the biomarker as it is released into the biological fluid, and wherein the biomarker adheres specifically or non-specifically to one or more domains on the surface of the biomarker attractant molecule, the one or more domains on the surface of the biomarker further adhering additional molecules found in the biological fluid.
- the biological fluid comprises serum or plasma and the biomarker, which would otherwise be filtered out of the serum or plasma at a rate equal to or greater than a rate of its introduction in the serum or plasma, is retained and concentrated over time in the serum or plasma because the biomarker atfractant molecule to which it adheres is large enough and in sufficient excess concentration relative to the biomarker to prevent the biomarker from being filtered from the serum or plasma by normal kidney/glomerular function.
- the biomarker adhered to the biomarker attractant molecule has a molecular weight of less than about 12 kDa (for example, less than about 10 kDa, such as less than about 8 kDa, 6 kDa or 5 kDa.).
- the biomarker attractant molecule includes two or more different biomarker attractant molecules.
- a combination of two or more biomarker atfractant molecules such as native carrier proteins can be used to provide more discrimination between biological states.
- the two or more different biomarker attractant molecules include all molecules naturally present in the biological fluid having a molecular weight of greater than about 25 kDa (for example, greater than about 30 kDa, such as greater than 40, kDa, 50 kDa, 70kDa or 90 kDa).
- the biomarker itself can include two or more different molecules differentially adhered between the two or more different biomarker atfractant molecules. Biomarkers identified (or discovered) by the disclosed method are contemplated.
- the method includes separating a biomarker attractant molecule from a biological fluid obtained from the subject, wherein a biomarker of the biological state adheres to the biomarker attractant molecule. Detection of the biomarker adhered to the biomarker attractant molecule indicates that the subject exhibits the biological state. All of the permutations with regard to the separation of the biomarker attractant molecule from the biological fluid, detection of the biomarker, the biomarker itself, the biological fluids and the biomarker attractant molecules that were discussed above in regard to method of identifying a biomarker can be used in the method of detecting a biological state.
- a denaturant used to dissociate a biomarker from a biomarker attractant molecule does not include a reducing agent or a nucleophilic compound comprising at least two nucleophilic groups, wherein at least one of the at least two nucleophilic groups in the nucleophilic compound is an amine group.
- the biological state detected includes a pathological condition.
- the pathological condition can include a disease or condition of a particular organ and the biomarker is a protein or a fragment thereof that is known to be produced in the particular organ (such as the heart) as a result of the disease or condition that becomes concentrated by the biomarker attractant molecule.
- the pathological condition is the presence of a tumor, for example, of a particular type or a toxin exposure.
- a particular pathological condition is detected using a particular biomarker attractant molecule by detecting at least an identifiable fragment of each of two or more particular proteins (previously identified by the disclosed discovery method to be discriminatory) adhered to the particular biomarker attractant molecule are disclosed in Examples 4-7.
- the following examples are provided to illustrate certain particular features and/or embodiments, but these examples should not be construed to limit the invention to the particular features or embodiments described.
- Example 1 This example demonstrates the method and provides experimental results that illustrate how low molecular mass (LMM) biomarkers reaching the blood may be significantly concentrated over time by binding (even with very low affinities such IO "4 L/mol min or lower, for example, IO "5 L/mol-min, or lower) to larger blood proteins having substantial half-lives (such as greater than 1 day, for example greater than 2, 5, 10, 20 or 50 days).
- LMM low molecular mass
- Amplification refers to an increase in the total plasma biomarker concentration due to the accumulation of biomarker molecules over time as they remain bound to carrier proteins.
- the proportion of low molecular mass species detectable by SELDI surface enhanced laser deso ⁇ tion and ionization that are associated with the higher molecular mass serum proteome was determined.
- Human serum was fractionated into high molecular mass and low molecular mass native fractions. Each fraction was assayed by SELDI to assess whether the preponderance of low molecular mass ions is found in the low or the high molecular mass fraction.
- the subpopulation of molecular species bound to albumin compared to the total carrier protein fraction was also determined.
- the tissue compartment generating the biomarker is composed of target cell (representing, for example, a tumor cell) and host cell (representing, for example, a normal cell) compartments, and the net rate of biomarker shed into the vascular compartment from the tissue compartment is represented by k ⁇ .
- the biomarker in the vascular compartment exists either in the bound or free (unbound) state.
- the carrier protein and the biomarker leave the vascular compartment at different rates, represented by k ⁇ b and k oU .
- the binding state of the biomarker determines its rate of clearance from the vascular compartment. Additional abbreviations used in the equations of the model that follows are shown below in Table 1. Table 1: Theoretical Model Abbreviations
- Biomarkers are considered to be molecules shed into the circulatory compartment from perfused tissue (See FIG. 1).
- the type and relative abundance of single or multiple biomarkers can reflect the diseased or physiologic state of the tissue source.
- Biomarkers arise either directly from the cellular target itself, or from an interaction between the target cell and its surrounding host cells. Exchange of cytokines, enzymes, ligands, and metabolic products at the interface between the target cell and the host can potentially generate a population of modified molecules.
- the biomarker becomes evenly distributed within the circulation compartment.
- the total concentration of the biomarker is dependent upon the biomarker production rate, the biomarker clearance /excretion rate, the binding of the biomarker to a circulating carrier protein, and the clearance / excretion rate of the carrier protein (see FIG. 2).
- V is the volume of distribution or total blood volume
- C B is the total concentration of the biomarker b consisting of the sum of the bound and free forms of the biomarker.
- the rate of biomarker production is k in b
- the rate of its elimination, degradation, or excretion is k oUt .
- M B is the mass of the total biomarker within the compartment.
- An individual biomarker molecule can exist in the free form or it can become bound to a circulating carrier protein, "r.”
- the system may be represented as koff in which case the total mass of biomarker as a function of time is
- M B is the mass of the total biomarker
- M b is the mass of the free form biomarker
- M r is the mass of the carrier protein
- M r is the mass of the biomarker bound to the carrier protein
- k discipline n is the rate of biomarker associating with the carrier protein
- k ⁇ is the rate that the bound biomarker dissociates from the carrier protein.
- C b (t) f k ⁇ + k off C br (t) 1 - f k oUt , b + k on (C r (t) - C br (t)) 1 C b (t) (3) dt " l V J I J
- C b is the free form biomarker concentration
- C r is the concentration of the carrier protein
- C ⁇ is the concentration of the biomarker bound to the carrier protein.
- C b is the free biomarker concentration
- C r is the concentration of the carrier protein
- C br is the concentration of the biomarker bound to the carrier protein.
- the total concentration of biomarker can be expressed as the sum of the bound and free forms of the biomarker: thus,
- the following equation represents amplification of biomarker concentration in the presence versus the absence of the earner protein, under the assumption that the biomarker is continuously produced or shed at the tissue source, and leads to the conclusion that biomarker molecules can accumulate over time in a earner-bound form
- the total concentration of a biomarker measured m a blood sample can therefore become elevated due to its association with the earner protem T
- he level of amplification (A) of the biomarker concentration at steady state, due to the presence of the carrier protem can be defined as the following ratio
- the clearance and excretion rate for free biomarkers was chosen to span the known range for small molecules.
- the influence on total biomarker concentration over time of the ca ⁇ ier protein/biomarker complex clearance rate is illustrated in FIG. 3 A, which is a graphical representation of the numerical solutions to Equation 8 for a series of total biomarker clearance rates (half-lives).
- the total biomarker concentration is shown as a function of time after commencement of biomarker production.
- the production rate of the biomarker is assumed to be continuous at 1.0 femtomole/day.
- the biomarker mass is assumed to be 10 kDa, and possess a half-life of one hour.
- FIG. 3C shows the effect that both biomarker affinity for a carrier protein and carrier protein concentration have on the percentage of biomarkers that are bound rather than free in a biological fluid.
- Numerical integration of equation 8 also shows that assuming a low affinity of 10 "4 between the biomarker and the carrier protein, a biomarker of mass 10 kDa with a half-life of one hour produced at a rate of 1.0 fmol/day, half-life of the carrier protein equal to albumin (18 days), and a vast excess of the albumin, virtually all biomarker will exist in the bound form. That is, even if a putative biomarker possesses even a relatively weak affinity for a high abundance carrier protein such as albumin (half life 18 days), the majority of the total circulating biomarker will still exist bound to albumin and be concentrated (amplified) in the biological fluid.
- the serum was fractionated into molecular mass classes under native conditions. Mass fractionation was carried out as follows. Thirty microliters of unfractionated human native serum was introduced into a Sephadex G-25 ® or a Sephadex G-50 molecular sieve spin column according to the manufacture's instructions. The column was centrifuged at 3000 x g for three minutes, and approximately 30 microliters of eluate containing the high molecular mass fraction was collected. The eluate was treated with 50% acetonitrile (w/w in water) to dissociate bound molecules for 30 minutes and was transferred to the inlet of a molecular filtration microcolumn. (Microcon
- Equilibration Buffer provided with the kit for a final volume of 200 ⁇ L and vortex sample.
- the column was rehydrated twice with 400 ⁇ L of Equilibration Buffer and centrifuge through the column insert for 2 minutes at 2,000 ⁇ r ⁇ 200 ⁇ L of diluted serum was introduced into the rehydrated albumin column and centrifuged for 2 minutes at 2,000 ⁇ m.
- the eluate from the column contains the serum without albumin.
- the bound fraction contains the albumin and the low molecular weight species bound to the albumin.
- 400 ⁇ L EAM solution were then added, composed of 50% acetonitrile and 0.1% TFA, to the column to strip the column and dissociate albumin from its bound species.
- EAM solution was centrifuged through the column at 2,000 ⁇ m for 3 minutes.
- the eluate contained the dissociated albumin and low molecular weight species that binds to albumin.
- Analysis of the proteins bound to the column using ion trap mass spectrometry was performed in line with an LCQ Classic MS (ThermoFinnigan, San Jose, CA) with a modified nanospray source. Dynamic exclusion of the three most abundant peptide hits from a full MS scan were selected for MS/MS analysis by collision induced dissociation with normalized collision energy of 35% and an activation time of 30 ms.
- Ion spray voltage was 2.00 kV with a capillary voltage of 26.20 V and a capillary temperature of 160 °C.
- WCX2 protein arrays for SELDI-TOF analysis were processed in a bioprocessor (Ciphergen Biosystems, Inc). 100 ⁇ l of 10 mM HCL was applied to the protein arrays in the bioprocessor and allowed to incubate for 5 minutes. The HCL wash was aspirated and discarded and 100 ⁇ l of H 2 0 was applied and allowed to incubate for one minute. The H 2 0 was aspirated and discarded, then reapplied for another minute. 100 ⁇ l of 10 mM ammonium acetate with 0.1% TritonX was applied to the surface and allowed to incubate for 5 minutes.
- ammonium acetate was aspirated and discarded. A second application of ammonium acetate was applied and allowed to incubate for 5 minutes. The chip surfaces were then dried using a vacuum to remove any excess amount of liquid. Five ⁇ l of raw sera, or molecular mass fraction, or eluate was then applied to each chip surface and allowed to incubate for 55 minutes. Each protein chip was washed six times with 150 ⁇ l of PBS and H 2 0 and then vacuum dried. Cross contamination was eliminated between spots by using a bioprocessor gasket.
- FIG. 5A is the spectrum obtained for a control of the SELDI matrix alone. Spectra generated from whole, native, unfractionated serum applied directly to the WCX2 chip surface.
- FIG 5B is the spectrum after removal of all species greater than 30,000 MW with a Microcon ® YM-30 Molecular Filter from the unfractionated serum.
- FIG. 5C is a direct analysis of high molecular weight native species by MS after fractionation through a Sephadex G-25 Molecular Sieve.
- the MS profile displays the low molecular weight species ionized and desorbed away from the high molecular weight carriers.
- FIG. 5D shows the spectrum after the high molecular weight fraction eluted from the G-25 sieve was treated with a 50% acetonitrile solution to dissociate potential low molecular weight species bound to their high molecular weight carriers, and the dissociated eluate was then passed through the YM-30 Filter. Only the low molecular weight species previously bound to the G-25 sieved high molecular weight fraction should pass through the filter.
- FIG 5E is a spectrum of the ion species associated with HMW earner species depleted of albumin and albumin binding partners.
- FIG 5F is a spectrum showing the ion species associated specifically with albumin only.
- a comparison of the spectrum of FIG. 5B to that of FIG. 5C shows that the majority of ions generated from unfractionated serum are derived from species associated with larger carrier proteins.
- FIG. 5D displays the spectrum of ion species previously bound, and then dissociated and separated from the higher molecular mass fraction. The intensity and number of ion species is augmented compared to FIG. 5A and FIG 5C.
- FIGS. 5E and 5F compare the proportion of SELDI-TOF generated ions associated with albumin compared to all the other serum carrier proteins.
- FIG. 5E displays the ions associated with the non-albumin earner proteins
- FIG. 5F displays the ions generated from species bound only to albumin.
- a significant proportion of the ions in the spectra appear to be derived from species associated with albumin.
- Microcapillary LC MS/MS was used to verify that the albumin bound fraction acquired through stripping the Montage Albumin Deplete Column was entirely albumin and its bound low molecular mass species. Since there was no indication of other high molecular mass proteins bound to the albumin specific column, the low molecular mass species detected were specific to albumin. Furthermore, a number of low molecular mass species that were dissociated from the albumin carrier were positively identified.
- FIG. 8 displays the ion spectra for an example serum sample in which the ion species bound only to albumin are compared for different amounts of albumin captured.
- FIGS. 7A-D show the example of ion 6631.7043, a member of the ion pattern 100% correlated with ovarian cancer in this clinical study set. Matched for dilution and amplitude, the predicate ion is highly associated with albumin, and the ionization intensity is augmented in the albumin bound fraction. Finally the complexity and amplitude of the ion species of the SELDI-TOF spectra is directly related to the amount of albumin captured as shown in FIG. 8.
- the ion species increase in amplitude and complexity when they are derived from larger quantities of bound albumin. This supports the concept that the biomarkers are quantitatively associated with the albumin.
- the theoretical and experimental data support the concept that the vast majority of small mass ions detected by mass spectrometry of native human serum exist in association with circulating earner proteins of higher molecular weight. This conclusion is important for biomarker physiology and biomarker measurement technology.
- concentration of a biomarker measured in serum or plasma is directly related to the clearance rate or half life of the carrier protein, not the biomarker clearance rate itself. As shown above in equation 13, the concentration of the biomarker is a function of the ratio between the biomarker production rate from the tissue and the clearance rate of the carrier protein.
- carrier protein binding amplifies the total biomarker concentration levels measured in serum or plasma. Amplification occurs because the carrier protein acts as a reservoir to accumulate the biomarker over time, as the tissue is continuously producing the biomarker.
- a biomarker produced by a small volume of tissue such as the ovary, prostate, or breast at a low concentration (for example, one femtomole per day) can accumulate to a concentration of one picomole in the serum because it binds with a carrier protein with much larger half-life.
- the existence of the carrier protein can raise the concentration of the biomarker to a range detectable by conventional assay technology.
- the free biomarker would be rapidly cleared by the kidney and would therefore reside at a steady state concentration many fold below the detection limits of assay technology. Another conclusion is that the results extend beyond current mass spectrometry detection technology. Small biomarkers are commonly not the province of two-site sandwich immunoassays, because it is difficult to develop two antibody-binding sites on the same small molecule. In contrast, if one half of the immunoassay sandwich is the carrier protein and the other half was the small biomarker, a sandwich immunoassay can be achieved. The biomarker clearance rate becomes the carrier protein clearance rate because the carrier protein, even if it has low affinity for the biomarker, is in vast excess.
- ⁇ is produced by a one cubic centimeter tumor composed of IO 9 cells, then each cell would produce approximately 16,000 molecules per day. This approximation is consistent with previous experimental findings. As shown in the experimental data presented in FIGS. 5-7, the theoretical predictions of FIG. 3 have been demonstrated. Thus, in analyzing unfractionated native sera, the majority of ions generated by SELDI-TOF analysis are found to be associated with carrier proteins, rather than free in solution phase (compare FIG. 5B to FIGS. 5C-F). Moreover, as shown in FIGS. 6 and 7, ion species altered in disease study sets may be those specifically captured on a single carrier protein. In the example, the carrier protein is albumin.
- each carrier protein may have its own constellation of bound biomarkers. Indeed, the distribution of biomarkers among specific plasma/ serum carrier proteins may contains important diagnostic information.
- Example 2 Identification of Proteins Having Fragments Bound to Natural Circulating BAMS Hundreds of peptides and clipped proteins bound to carrier proteins such as albumin have been sequenced by MS/MS. Following trypsin digestion of the canier bound fragments, identification of the proteins/peptides was performed in line with an LCQ Classic MS (ThermoFinnigan, San Jose, CA) with a modified nanospray source. Dynamic exclusion of the three most abundant peptide hits from a full MS scan were selected for MS/MS analysis by collision induced dissociation with normalized collision energy of 35% and an activation time of 30 ms.
- LCQ Classic MS ThermoFinnigan, San Jose, CA
- Ion spray voltage was 2.00 kV with a capillary voltage of 26.20 V and a capillary temperature of 160 °C.
- Results for MS/MS scans were searched and compared with theoretical spectra in the Sequest Browser database specified for human proteins.
- the sequencing results for trypsin fragments of carrier bound peptides/proteins reveal molecules derived from all parts of cells, ranging from low concentration transcription factors from the nucleus, to membrane bound receptors, to secreted growth factors. Table 3 below illustrates the general types of proteins found. Table 3
- Example 3 Discovery of ovarian cancer biomarkers bound to human serum albumin.
- the Clementine Data Mining Bioinformatics Toolset (SPSS, Inc., Chicago, IL) was used to discover patterns of ions representing biomarkers bound to albumin that discriminate between serum samples obtained from subjects with and without ovarian cancer.
- SPSS Clementine Data Mining Bioinformatics Toolset
- a feature selection was performed by computing and then comparing the aggregate mean values of the total ion count (sum of amps) on a bucket by bucket (bin by bin) basis, and selecting the 100 buckets (bins) with the largest difference in mean values between normal and cancer samples.
- the Clementine system was then trained using un-normalized total ion count per bin values.
- a set of ion species bound to albumin were found to discriminate between 127 non cancer and 115 ovarian cancer cases with 100% specificity and 100% sensitivity.
- a Qstar mass spectrometer (described in Example 1) was used to analyze albumin bound proteins (and protein fragments) in the mass to charge regions of 1000-7760 and 7762-11,000. These ranges excluded the matrix effect area and the region of Transferrin.
- the ion amplitude and density of ion peaks was ten fold greater using the albumin eluted fraction, compared to direct analysis of whole unfractionated sera, making detection of the discriminatory patterns of ions su ⁇ risingly reliable.
- the mass specfrometric analysis of molecules bound to a BAM in this case a serum carrier protein, specifically albumin
- a BAM in this case a serum carrier protein, specifically albumin
- the BAM is natural albumin. Sequencing and identification of the carrier protein associated molecules revealed that they are primarily derived from low abundance cellular and tissue sources, and represent all major classes of cellular compartments, including membrane, cytosol, nucleus, and extracellular matrix.
- Example 4 - Detection of Stage Specific Ovarian Cancer Biomarkers and Biomarkers Indicating Lack of Ovarian Cancer in a Subject.
- albumin bound molecules were analyzed to detect protein and protein fragment biomarkers that discriminate between different populations of subjects having no evidence of ovarian cancer, Stage I ovarian cancer or Stage III ovarian cancer.
- Biomarkers adhered to albumin were analyzed and identified by mass spectrometry.
- Serum Samples Serum was collected prior to physical evaluation, diagnosis and treatment and was stored at -80 centigrade. The study set used consisted of 98 unaffected "high risk" patient samples, and 146 cancer patients.
- Serum samples were obtained from the National Ovarian Cancer Early Detection Program (NOCEDP) and gynecologic oncology clinic at Northwestern University (Chicago, Illinois). Specimens from women enrolled in the NOCEDP who had no evidence of any cancer for 5 years were evaluated as healthy (no cancer) women. Similarly, only preoperative specimens were used from women who were surgically staged and found to have epithelial ovarian carcinoma. Two hundred and forty eight samples were prepared using a Biomek 2000 robotic liquid handler (Beckman Coulter, Inc., Palo Alto, California). All analyses were performed using ProteinChip weak cation exchange interaction chips (WCX2, Ciphergen Biosystems Inc., Fremont, California).
- WCX2 ProteinChip weak cation exchange interaction chips
- a control reference sample was randomly applied to one spot on each protein array as a quality control for overall process integrity, sample preparation and mass spectrometer function.
- the control sample, SRM 1951 A which is comprised of pooled normal human sera, was provided by the National Institute of Standards and Technology (Gaithersburg, Maryland).
- Application of Quality Control Analysis The 248 mass spectra acquired by the QqTOF MS were analyzed with a wide variety of statistical tools to evaluate the spectral quality [i.e. record count and mean amplitude] and statistical variances greater than the population norm. Mean spectral amplitude and the file record count (i.e. the total number of data points within a mass spectrum) were selected as global parameters for statistical analysis.
- acetic acid purchased from Mallinckrodt Baker (Phillipsburg, NJ). H 2 0 was doubly distilled in house with Kontes High Purity Water System. Porcine sequencing grade modified trypsin was purchased from Promega (Madison, WI). Coomassie Brilliant Blue R 250 was purchased from Fluka Chemical (Switzerland). Sequencing of albumin associated proteins was conducted on groups of 30 stage 1 cancers,
- Vivaspin 500 centrifugal membrane filters were purchased from VivaScience. ZipTip g (6 ⁇ m bed) desalting columns and Montage Albumin Depletion Columns were purchased from Millipore. Elution buffer was replaced with a 70% ACN/ 30% dd H 2 0/ 0.2% TFA mixture. Pre cast gels, sample, and running buffers were purchased from Invifrogen Co. Fused silica is from Polymicro Technologies (Phoenix, AZ).
- Albumin and bound peptide purification 40 ⁇ L of human stage specific (pooled) ovarian cancer serum ( ⁇ 5 mg protein) was diluted to 200 ⁇ L with Equilibration Buffer (Millipore) and run through a (Montage) albumin specific affinity column twice. The bound protein was washed thoroughly, and eluted from the column by equilibrating with 70% ACN/ 30% H 2 0/ 0.2% TFA for 30 minutes, followed by a slow spin-through of the elution mixture. The eluate was lyophilized to ⁇ 10 ⁇ L in a HetoVac rotofor (CT 110) and reconstituted in a 95% H 2 OZ 5% ACN / 0.1 % formic acid buffer (Buffer A).
- CT 110 HetoVac rotofor
- Buffer A 95% H 2 OZ 5% ACN / 0.1 % formic acid buffer
- Samples were sometimes desalted with a ZipTip cleanup or with Vivaspin 500 centrifugal membranes and always reconstituted in a 1 : 1 mixture of water and SDS sample buffer (20 ⁇ L total volume).
- ID gel separation and digestion 20 ⁇ L of sample in SDS sample buffer (40 ⁇ L of original serum) was boiled for 5 minutes at 95 °C and run on a ID pre-cast gel (16% Tricene or 4-20% Tris-Glycine) to separate albumin from the proteins/peptides/fragments of interest. The gel was stained with Coomassie Blue for one hour and destained overnight in 30% methanol/ 10 % acetic acid solution.
- Microcapillary reverse phase LC/MS/MS analysis was performed with Dionex's LC Packings liquid chromatography system coupled online to a ThermoFinnigan LCQ Classic ion trap mass spectrometer (San Jose, CA) with a nanospray source. Reverse phase separations were performed with an in-house, slurry packed capillary column.
- the C 18 silica-bonded column is 75 ⁇ m i.d., 360 ⁇ m o.d., 10 cm long fused silica packed with 5 ⁇ m beads with 300 Angstrom pores (Vydac, Hesperia, CA).
- 8 cartridge (Dionex) acts as a desalting column.
- the albumin extraction, gel electrophoresis, protein digestion/extraction, and LC/MS/MS analysis was repeated in more than five distinct trials - each time yielding diminishing returns of new identifications for low abundance peptide hits.
- Repetitive sequencing of peptides in multiple trials further validated the experimental procedure - both within a stage and between ovarian cancer stages. All of the proteins identified adhered to albumin, and their presence or absence in one or more of the no-cancer high risk, stage I cancer and stage III cancer populations are shown in Table 5 below.
- Biomarkers useful for providing diagnoses of ovarian cancer can be chosen from Table 5 as follows: a) Biomarkers that indicate a subject exhibits no cancer can be chosen to include at least an identifiable fragment of each of two or more proteins in Table 5 (such as 3 or more, or 5 or more) that are were identified only in the no cancer population. b) Biomarkers that indicate a subject exhibits ovarian cancer regardless of its stage can be chosen to include at least an identifiable fragment of each of two or more proteins in Table 5 (such as 3 or more, or 5 or more) that were identified only in either or both of Stage I and Stage III populations, and can further include fragments of proteins in Table 5.
- Biomarkers that indicate a subject exhibits Stage I ovarian cancer can be chosen to include at least an identifiable fragment of each of two or more proteins in Table 5 (such as 3 or more, or 5 or more) that were identified only in the Stage I population.
- Biomarkers that indicate a subject exhibits Stage III ovarian cancer can be chosen to include at least an identifiable fragment of each of two or more proteins in Table 5 (such as 3 or more, or 5 or more) that were identified only in the Stage III population.
- Example 5 Provided Example 5 -Prostate Cancer Biomarkers This example demonstrates that the disclosed method can detect serum biomarkers for prostate cancer.
- the sera of subjects in each of a prostate cancer population and a no cancer population (hospital control) were pooled and analyzed as in Example 4, except that potential biomarkers adhered to all serum proteins of MW > 25 kDa were separated from the pooled sera samples, and potential protein and protein fragment biomarkers were eluted therefrom and identified.
- the results of this analysis are shown in Table 6.
- Useful biomarkers for prostate cancer can be selected to include at least an identifiable fragment of each of two or more proteins in Table 6 (such as 3 or more, or 5 or more) that were found only in the prostate cancer population.
- Useful biomarkers for the absence of prostate cancer can be selected to include at least an identifiable fragment of each of two or more proteins in Table 6 (such as 3 or more, or 5 or more) that were found only in the no prostate cancer population.
- Example 6 Breast Cancer This example demonstrates that the disclosed method can detect serum biomarkers for breast cancer.
- the sera of subjects in each of a breast cancer population and a no breast cancer population (hospital control) were pooled and analyzed as in Example 4. The results of this analysis are shown in Table 7.
- Useful biomarkers for breast cancer can be selected to include at least an identifiable fragment of each of two or more proteins in Table 7 (such as 3 or more, or 5 or more) that were found only in the breast cancer population.
- Useful biomarkers for the absence of breast cancer can be selected to include at least an identifiable fragment of each of two or more proteins in Table 7 (such as 3 or more, or 5 or more) that were found only in the no breast cancer population.
- Example 7 Lung Cancer Biomarkers This example demonstrates that the disclosed method can detect serum biomarkers for lung cancer, and biomarkers that differentiate the histopathological type of the lung cancer.
- the sera of subjects in each of an adenocarcinoma population, a squamous cell carcinoma population and a no cancer population (hospital control) were pooled and analyzed as in Example 4. The results of this analysis are shown in Table 8.
- Useful biomarkers for lung cancer can be selected to include at least an identifiable fragment of each of two or more proteins in Table 8 (such as 3 or more, or 5 or more) that were found only in one or both of the adenocarcinoma population and the squamous cell carcinoma population.
- Useful biomarkers for adenocarcinoma can be selected to include at least an identifiable fragment of each of two or more proteins in Table 8 (such as 3 or more, or 5 or more) found only in the adenocarcinoma population.
- Useful biomarkers for squamous cell carcinoma can be selected to include at least an identifiable fragment of each of two or more proteins in Table 8 (such as 3 or more, or 5 or more) found only in the squamous cell carcinoma population.
- Useful biomarkers for the absence of lung cancer can be selected to include at least an identifiable fragment of each of two or more proteins in Table 8 (such as 3 or more, or 5 or more) found only in the no cancer population.
- Example 8 High Throughput Chamber for Collection of Biomarkers from BAMs Collection of BAMs and subsequent elution of their bound biomarker cargo may be accomplished in a high throughput flow chamber.
- the sample body fluid containing the carrier molecule and its associated diagnostic molecule analytes are passed into a column or passed along a surface which contains immobilized antibodies or other ligand directed against a BAM, such as the illustrated carrier protein.
- immobilized antibodies that also comprise a means to couple (for example a means to covalently couple) the captured BAMs are used.
- the analytes can be dissociated off the earner molecule using a denaturation or suitable elution buffer. Because the carrier protein itself is covalently attached to the solid phase through the antibody/ligand it will not be eluted. Thus, only the analytes previously associated with the carrier molecule will be eluted and can be analyzed for the diagnostic information they contain based on their biochemical or physiologic characteristics.
- An example means for coupling the antibody ligand to the captured BAM is the use of light activated coupling described by Holden and Cramer (JACS, 125: 8074-8075, 2003).
- This method couples fluorescently labeled molecules to binding partners using a photo-bleaching reaction that initiates a reactive singlet oxygen to covalently couple with highly localized electron rich sites on the binding partner.
- a serum sample is first passed through a column containing immobilized fluorescein (or Alexa dye) labeled antibodies to albumin.
- the albumin is captured on the antibodies.
- a photo-bleaching illumination is then provided to couple the antibodies to the albumin through the fluorescein moiety. This is then followed by an elution step which elutes the albumin associated analytes but leaves the albumin itself bound to the column.
- the whole process can be in-line and continuous, with sample introduction followed by pulses of light energy to couple the carrier molecule and followed by subsequent elutions of the analytes.
- the capture itself can take place in a filter format, a microwell chamber, etched channels, packed nozzels, in-solution on particles which are subsequently filtered out or captured by another means or in a microfluidic device. Further separation and processing can be done before or after this capture and elution step.
- the output can even go directly into a mass spectrometer, or into a chemical and enzymatic step (for example, a trypsin or cyanogen bromide cleavage) and then be analyzed by MS as illustrated in FIGS. 10.
- FIG. 10 FIG.
- FIG. 11 illustrates an embodiment in which capture and covalent coupling of a BAM to an immobilized antibody is followed directly by laser deso ⁇ tion into a mass spectrometer.
- an antibody labeled with fluorescein is immobilized on a silica surface through a linker molecule (FIG. 11 A).
- a biological fluid comprising a BAM that is bound by the particular antibody (such as the illustrated carrier protein) is passed through the device and the BAM with its diagnostic cargo is captured by the immobilized antibody (FIG. 1 IB).
- UV (or visible) light is then used to excite the fluorescein moiety, which then catalyzes covalent coupling between antibody and the BAM (FIG. 11C).
- a laser pulse is used to desorb the biomarkers, such as peptides, from the BAM and into a mass spectrometer.
- a capture agent such as an antibody or protein A
- the device itself may be constructed from a wide variety of materials, including, for example, a metal, silicon or a plastic.
- methods for attaching capture agents to particular substrate materials include the photoactivated polystyrene method of Bora et al. (J. Immunol. Methods, 268: 171-7, 2002), the method for immobilizing proteins to modified silicon surfaces described by Yakovleva et al. (Anal.
- Chelating peptides may also be used to immobilize antiobodies on a solid support (see, for example, Lo ⁇ tscher et al., J. Chromatogr., 595: 113-119, 1992). Antibodies may also be immobilized on ⁇ polystyrene copolymer surface, such as onto styrene copolymer beads, which could be used in a column format for the disclosed device where a sample is passed through a column of such beads to remove BAMs, from which biomarkers may later be eluted (see, for example, Bale-Oemck et al , Ann Biol Clin, 48 651-654, 1990) A combmation of thiol-terminated silanes and heterobifunctional crosslinkers may be used to immobilize antiobodies on silica surfaces (see, for example, Bhatia et al , Anal Biochem , 178 408-413, 1989) Additional methods exist and are contemplated If the B
- Example 9 Artificial BAMs
- BAMs designed to specifically capture and amplify different classes of LMW biomarkers by altermg, for example, their surface chemistry
- BAMs tailored to monitor specific diseases developing in target tissues may be produced using existing targeting technologies
- Such BAMs agents may be instilled into the blood of a patient at one pomt m time, and then collected at a later time point with their bound diagnostic cargo
- nanoharvesting BAMs, filtered and arrayed m a high throughput platform may be directly quened with mass spectroscopy to obtain an individual global health profile rendered at an affordable cost, all from a drop of blood
- suitable artificial BAMs include chime ⁇ c protems and proteins altered to enhance separation of the BAM from a biological fluid following
- Silica-coated nanoparticles their preparation, and derivatization are described in detail in U.S. Patent No. 6,548,264.
- Another class of artificial BAM useful for the disclosed methods are the ultrafine lightly coated supe ⁇ aramagnetic particles described in U.S. Patent No. 6,207,134. These particles comprise a metal oxide core, such as an iron oxide core and a shell comprising a polyelectrolyte such as a structural polysaccharide or a synthetic polymer such as a polyaminoacid. Mico ⁇ articles and nanoparticles comprising cross-linked monosaccharides and oligosaccharides may also be utilized as BAMs. Examples of such particles are described in U.S. Patent No. 6, 197,757.
- Such particles are biodegradable and may, for example, be separated from biological fluids with lectins specific for the sugar moieties found on their surface.
- the particles comprise cyclodextrins that may be chosen to be of a size that captures biomarkers of particular mass ranges.
- the surface of such particles may be derivatized with, for example, amino or carboxylate groups according to known methods.
- Biodegradable, injectable nanoparticles that are not rapidly cleared from blood, and may be modified to target specific cells or organs, and to change their lipophilicity and hydrophilicity, are described in U.S. Patent No. 5,543,159.
- microspheres and nanospheres that may be used directly or derivatized for use as BAMs are available from Bangs Laboratories, Inc. (Fishers, IN). Examples of such microspheres include polystyrene, latex, silica, and polymethylmethacrylate particles. Covalent coupling of various biomolecules (such as Protein- A) and derivatization of the surface of such spheres are discussed in TechNotes 201 and 205 published by Bangs Laboratories (TechNote 201, October 25, 2002/TechNote 205, March 30, 2002).
- Another class of BAMs that may be used in the disclosed methods and apparatus are dendrimers. Examples of dendrimers, their synthesis, and surface derivatization are discussed in U.S.
- Dendrimers may also be derivatized with antibodies to provide targeted access to particular types of tissue, and collection of biomarkers therefrom. Both polyamidoamine and polyalkylenimine dendrimers may be used. Propylenimine dendrimers with primary amino surface groups, polyamidoamine dendrimers with primary amino surface groups, polyamidoamine dendrimers with carboxylate surface groups and polyamidoamine dendrimers with hydroxyl surface groups are available from Aldrich (Milwaukee, WI). Over 200 modifications of the surface groups of dendrimers are described in Dvornic et al, Poly. Prepr., 40:408, 1999.
- Example 10 - Pattern Recognition Methods Any known pattern recognition method may be used to identify BAM-associated molecules that discriminate between biological states, and to provide a probable diagnosis of a subject's biological state based on BAM-associated molecules.
- Statistical methods of pattern recognition may be used and are reviewed in Jain et al., IEEE Transactions on Pattern Analysis and Machine Intelligence, 22: 4-37, 2000, which is inco ⁇ orated by reference herein.
- Artificial intelligence methods may also be used (see, for example, Zupan and Gasteiger, Neural Networks for Chemists, VCH, 1993).
- pattern recognition relates to the discovery of hidden patterns that characterize subsets of data within a larger data set, for example, data patterns that distinguish between objects of different classifications.
- Pattern recognition methods may be divided into three major components: clustering or classification, association rules, and sequential analysis.
- classification or clustering a set of data is analyzed to generate a set of grouping rules that may, for example, be used to classify diseases.
- An association rule is a rule that implies certain relationships between objects in a database. For example, sets of symptoms often occurring together with certain kinds of diseases may be identified.
- sequential analysis patterns that occur in sequence are discovered. For example, patterns that correlate with progression of disease may be detected. Pattern recognition involves a training (learning) mode and a classification (testing) mode.
- data for a set of objects is used to establish patterns of data that characterize various subsets of objects present amongst all objects (such as patterns of mass spectral data that are characteristic of those samples from subjects with cancer).
- Class (subset) labels may be assigned to the objects before patterns are detected, or after objects are first grouped into subsets based on their similar patterns of data.
- the pattern discovery process is known as supervised learning, and in the latter, unsupervised learning.
- the goal is to extract pattems of data that may be used to classify future data.
- a diagnostic pattern of mass spectral data may be validated by using the pattern to classify the spectra of samples known to be from subjects with or without cancer. If the diagnostic pattern is "robust," it will provide the correct classification of the known samples. Often this process is repeated for each of the known samples used during the training mode in what is called a leave-one-out (LOO) cross-validation.
- LEO leave-one-out
- Both the training mode and the classification mode may involve preprocessing of raw data to separate the desired pattern(s) from background and/or noise. Preprocessing may also include normalization of the data, or any other process that ultimately leads to a compact representation of the pattern(s) within the data. For example, mass spectral data may be normalized such that the highest intensity signal is given an intensity value of 100 and all other signals are given an intensity value equal to their intensity as a percentage of that seen for the highest intensity signal.
- an even simpler way of preprocessing mass spectral data is to convert the data to a series of ones and zeroes for each mass value (or mass/charge ratio), where a one means that a signal (regardless of intensity) is present at a particular mass value and a zero represents the absence of a signal at a particular mass value.
- Feature extraction selection is performed during training to identify a subset of features within the data (such as mass spectral signals at particular mass values) that may be used to classify objects into their respective classes without having to consider all the available features. More specifically, feature selection refers to using an algorithm to select the most discriminatory subset of features within the input feature set, and feature extraction refers to applying an algorithm that creates new features based on transformations or combinations of the original feature set.
- Feature selection may be performed after feature extraction to select the most discriminatory set of extracted features.
- Feature extraction/selection may involve an exhaustive search for the optimal subset of features (i.e. all combinations are tried), or when the number of features is so large that an exhaustive search is computationally infeasible, a heuristic search is used. In either case, feature extraction/selection leads to greater computational efficiency in learning.
- learning involves applying a classifier to partition the feature space into regions characteristic of each class of object within the training set. In other words, combinations of features that are characteristic of objects of a particular class are identified.
- objects may be represented by points in multidimensional feature space with coordinates conesponding to the values of their features, objects of the same class will tend to appear close to each other (cluster) in the feature space.
- the classifier tries to draw boundaries between the portions of the feature space where objects of the same class tend to appear together.
- an object is represented by a point in the feature space and assigned a class according to the class of objects that are nearest to it in the feature space.
- a feedback process may also be employed to refine the preprocessing and feature/extraction steps in order to provide data subsets that supply pattems that best separate objects into their known classes. Pattern recognition methods that involve iterative identification of feature subsets are sometimes called "wrapper" methods.
- Feature subsets that provide improved diagnostic patterns are retained and used to generate new feature subsets, which are then tested for improved diagnostic ability. This process is repeated until an optimal subset of features is identified. If, however, the feedback process is not employed, the technique is known as a "filter” method, and feature selection is performed independently of the learning algorithm.
- classifier methods that may be used for pattern recognition during learning include decision tree methods, statistical methods, neural networks and evolutionary algorithms. These same methods may also be used to extract and select appropriate subsets of features. Combinations of these methods are also possible. For example, one type of algorithm may be used for feature extraction/selection and a second algorithm used as the classifier, in either a filter or wrapper method. Decision tree methods classify objects using a series of tests based on data features.
- the algorithm considers all the possible tests (e.g. presence, absence or range of values of each feature) that can split the objects of the data set into two subsets.
- the test that provides the "best" separation of the objects into known classes (or subsets of similar pattern) is then selected.
- the remaining tests are then explored to find the test that best separates each of these subsets into two smaller subsets with increasing homogeneity of class membership, and the process is repeated until each subset hopefully contains objects of only one class.
- the resulting scheme may be represented by a tree-like structure (dendrogram) with branches corresponding to each test that was used to partition the data into separate classes.
- Unknown objects are classified by applying the series of feature tests used to partition the training data, and assigned the class membership of the subset into which it falls.
- each object in a data set is represented as a vector in multidimensional space where the space is defined by the features or combinations of the features used to describe the objects.
- Statistical decision concepts are then used to establish boundaries that partition the feature space into regions characteristic of different classes of objects. Examples of statistical pattern recognition methods include principal component analysis (PCA) and discriminant analysis.
- PCA principal component analysis
- discriminant analysis Artificial neural networks mimic the pattern-finding capacity of the human brain and may be viewed as a parallel computing system consisting of a large number of simple processors with many interconnections. Neural networks relate particular input patterns to a classification through these interconnections.
- neural networks have the ability to learn complex non-linear relationships between data and the classes of objects they represent.
- Neural networks also have the characteristic of adapting themselves to the data. Examples of neural networks include the Self-Organizing Map (SOM) and multilayer feed-forward neural networks.
- SOM Self-Organizing Map
- Evolutionary algorithms such as genetic algorithms, are optimization techniques that mimic the processes of natural selection, and in the case of genetic algorithms, genetic mutation and recombination.
- individuals represent candidate solutions to the optimization problem being solved. For example, in the feature subset selection problem, each individual would represent a feature subset.
- Cluster analysis is a broad term used to describe methods that group, or cluster, objects into subsets based on their data patterns, often in an unsupervised manner. Since each object may be represented as a point in multidimensional feature space with coordinates corresponding to its values for the features, a cluster analysis will group objects based on their similar positions in the feature space. Generally, clustering techniques employ some measure of similarity to decide cluster membership of objects. Common similarity measures include distances between objects in the feature space (e.g.
- clusters of objects in the feature space may be selected in a manner that minimizes the distances between objects in the same cluster and maximizes the distances between objects in different clusters.
- the -means algorithm and the Self-Organizing Map are examples of cluster analyses.
- Particular examples of pattern recognition techniques that may be used in the disclosed methods including the methods described in U.S. Patent Application Publication No. 20030004402, which is inco ⁇ orated by reference herein, and the methods described in PCT Publication WO 02/42733, which is also inco ⁇ orated by reference herein. Additional methods for pattern recognition are described in U.S. Patent Publication Nos. 20020193950 and 20030134304. Examples of statistical pattern recognition software that is commercially available include
- SPSS SPSS Inc., Chicago, IL
- JMP SAS Inc., Cary NC
- Stata Stata Inc., College Station, TX
- Cluster Artificial neural network software is available from, among other sources, Neurodimension, Inc., Gainesville, FL (Neurosolutions) and The Mathworks, Inc., Natick, MA (MATLAB Neural Network Toolbox).
- Example 11 Diagnostic Method Based on Identity of BAM-associated Biomarkers
- This example describes a particular embodiment of a method for diagnosing the biological state of a subject.
- the molecules bound to one or more biomarker attractant molecules are analyzed for the presence of previously identified biomarkers.
- biomarkers shed for example, by a tumor cell pass into the blood stream where they encounter and bind to a circulating earner protein such as albumin. Over time, such biomarkers accumulate on the carrier protein along with other molecules.
- laser deso ⁇ tion of the bound molecules may be used to introduce them into a mass spectrometer which provides an ion spectrum readout of the molecules bound to the carrier protein.
- ions appearing at particular masses are fragments of known proteins that are correlated with a biological state of a particular anatomic of physiologic location.
- identification of particular ions in a mass spectrum of the molecules bound to a particular BAM is correlated with the presence of a particular biological state. For example, identification of a particular fragment from a particular protein (such as a protein known to be associated with growth or metastasis of a tumor or a particular type of tumor) may be used to make a specific diagnosis.
- BAM associated fragments of proteins associated with hypoxia or apoptosis of the heart muscle may be detected to provide a diagnosis of myocardial infarction (heart attack).
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Immunology (AREA)
- Chemical & Material Sciences (AREA)
- Urology & Nephrology (AREA)
- Biomedical Technology (AREA)
- Hematology (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Pathology (AREA)
- General Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Food Science & Technology (AREA)
- Medicinal Chemistry (AREA)
- Analytical Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Hospice & Palliative Care (AREA)
- Oncology (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Investigating Or Analysing Biological Materials (AREA)
- Other Investigation Or Analysis Of Materials By Electrical Means (AREA)
Abstract
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US50978203P | 2003-10-08 | 2003-10-08 | |
US60/509,782 | 2003-10-08 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2005036180A1 true WO2005036180A1 (fr) | 2005-04-21 |
Family
ID=34435025
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2004/033305 WO2005036180A1 (fr) | 2003-10-08 | 2004-10-08 | Methodes d'analyse utilisant des biomarqueurs concentres avec des molecules d'attraction de biomarqueurs |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2005036180A1 (fr) |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007089731A3 (fr) * | 2006-01-27 | 2008-04-24 | Univ George Mason | Marqueurs de liquide oculaire |
WO2009117785A1 (fr) * | 2008-03-28 | 2009-10-01 | Newsouth Innovations Pty Limited | Procédé de typage tissulaire hla |
EP2035830A4 (fr) * | 2006-06-14 | 2010-07-21 | Univ Johns Hopkins | Complexe proteine/peptide lie a l'albumine en tant que biomarqueur pour une maladie |
CN102460176A (zh) * | 2009-05-21 | 2012-05-16 | 李荣秀 | 差异多肽检测方法及其应用 |
WO2014016829A1 (fr) * | 2012-07-22 | 2014-01-30 | Mx Biotech Ltd. | Particules de fibrinogène insoluble (pfi) et revêtements : composition, procédés de fabrication et applications à des fins de recueil et de culture de cellules de mammifère |
WO2015116988A3 (fr) * | 2014-01-31 | 2015-09-24 | Counterpoint Health Solutions, Inc. | Métabolites liés par des liaisons covalentes utilisables en tant que biomarqueurs |
CN106456749A (zh) * | 2014-03-11 | 2017-02-22 | 小利兰·斯坦福大学托管委员会 | 抗SIRPα抗体和双特异性巨噬细胞增强型抗体 |
US9603837B2 (en) | 2012-03-28 | 2017-03-28 | Ixcela, Inc. | IPA as a therapeutic agent and a biomarker for disease risk for Huntington's disease |
CN111133526A (zh) * | 2017-07-18 | 2020-05-08 | 生命分析有限公司 | 发掘可用于机器学习技术中的新颖特征,例如用于诊断医疗状况的机器学习技术 |
CN111220720A (zh) * | 2019-12-05 | 2020-06-02 | 上海上药第一生化药业有限公司 | 胰蛋白酶及其酶原的纯度检测方法 |
US11041847B1 (en) | 2019-01-25 | 2021-06-22 | Ixcela, Inc. | Detection and modification of gut microbial population |
US11062792B2 (en) | 2017-07-18 | 2021-07-13 | Analytics For Life Inc. | Discovering genomes to use in machine learning techniques |
US20230223111A1 (en) * | 2021-03-31 | 2023-07-13 | PrognomIQ, Inc. | Multi-omic assessment |
WO2024017335A1 (fr) * | 2022-07-22 | 2024-01-25 | 电子科技大学长三角研究院(衢州) | Procédé de communication moléculaire directionnelle à relais basé sur de multiples attracteurs |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1999041613A1 (fr) * | 1998-02-12 | 1999-08-19 | Immunivest | Methodes et reactifs pour l'isolation rapide et efficace de cellules cancereuses circulantes |
WO2002077176A2 (fr) * | 2001-03-22 | 2002-10-03 | The Government Of The United States Of America As Represented By The Secretary, Department Of Health And Human Services | Proteines presentant une expression aberrante dans des tumeurs microdissequees capturees au laser |
US6471968B1 (en) * | 2000-05-12 | 2002-10-29 | Regents Of The University Of Michigan | Multifunctional nanodevice platform |
-
2004
- 2004-10-08 WO PCT/US2004/033305 patent/WO2005036180A1/fr active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1999041613A1 (fr) * | 1998-02-12 | 1999-08-19 | Immunivest | Methodes et reactifs pour l'isolation rapide et efficace de cellules cancereuses circulantes |
US6471968B1 (en) * | 2000-05-12 | 2002-10-29 | Regents Of The University Of Michigan | Multifunctional nanodevice platform |
WO2002077176A2 (fr) * | 2001-03-22 | 2002-10-03 | The Government Of The United States Of America As Represented By The Secretary, Department Of Health And Human Services | Proteines presentant une expression aberrante dans des tumeurs microdissequees capturees au laser |
Non-Patent Citations (4)
Title |
---|
A. I. MEHTA ET AL: "Ovarian cancer biomarker amplification by serum carrier protein binding", MOLECULAR CELL PROTEOMICS, vol. 2, no. 9, 2003 - 1 September 2003 (2003-09-01), pages 844, XP002314936 * |
M. ZHOU ET AL: "Identification of protein-protein interactions with human serum", MOLECULAR CELL PROTEOMICS, vol. 2, no. 9, 2003 - 1 September 2003 (2003-09-01), pages 945, XP002314937 * |
MEHTA ARPITA I ET AL: "Biomarker amplification by serum carrier protein binding", DISEASE MARKERS, vol. 19, no. 1, 2003, pages 1 - 10, XP009043071, ISSN: 0278-0240 * |
PETRICOIN E F ET AL: "Use of proteomic patterns in serum to identify ovarian cancer", LANCET, XX, XX, vol. 359, no. 9306, 16 February 2002 (2002-02-16), pages 572 - 577, XP004339269, ISSN: 0140-6736 * |
Cited By (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007089731A3 (fr) * | 2006-01-27 | 2008-04-24 | Univ George Mason | Marqueurs de liquide oculaire |
US8741662B2 (en) | 2006-06-14 | 2014-06-03 | The Johns Hopkin University | Albumin-bound protein/peptide complex as a biomarker for disease |
EP2035830A4 (fr) * | 2006-06-14 | 2010-07-21 | Univ Johns Hopkins | Complexe proteine/peptide lie a l'albumine en tant que biomarqueur pour une maladie |
WO2009117785A1 (fr) * | 2008-03-28 | 2009-10-01 | Newsouth Innovations Pty Limited | Procédé de typage tissulaire hla |
CN102460176B (zh) * | 2009-05-21 | 2014-09-24 | 李荣秀 | 差异多肽检测方法及其应用 |
CN102460176A (zh) * | 2009-05-21 | 2012-05-16 | 李荣秀 | 差异多肽检测方法及其应用 |
US9603837B2 (en) | 2012-03-28 | 2017-03-28 | Ixcela, Inc. | IPA as a therapeutic agent and a biomarker for disease risk for Huntington's disease |
US9636325B2 (en) | 2012-03-28 | 2017-05-02 | Ixcela, Inc. | Indole lactic acid or a mixture of indole lactic acid and indole-3-propionic acid for treatment of Huntington's disease |
US9744155B2 (en) | 2012-03-28 | 2017-08-29 | Ixcela, Inc. | IPA as a therapeutic agent, as a protective agent, and as a biomarker of disease risk |
US10278958B2 (en) | 2012-03-28 | 2019-05-07 | Ixcela, Inc. | IPA as a protective agent |
WO2014016829A1 (fr) * | 2012-07-22 | 2014-01-30 | Mx Biotech Ltd. | Particules de fibrinogène insoluble (pfi) et revêtements : composition, procédés de fabrication et applications à des fins de recueil et de culture de cellules de mammifère |
WO2015116988A3 (fr) * | 2014-01-31 | 2015-09-24 | Counterpoint Health Solutions, Inc. | Métabolites liés par des liaisons covalentes utilisables en tant que biomarqueurs |
CN105934520A (zh) * | 2014-01-31 | 2016-09-07 | 伊克斯塞拉有限公司 | 作为生物标志物的共价键代谢物 |
CN106456749B (zh) * | 2014-03-11 | 2021-03-30 | 小利兰·斯坦福大学托管委员会 | 抗SIRPα抗体和双特异性巨噬细胞增强型抗体 |
CN106456749A (zh) * | 2014-03-11 | 2017-02-22 | 小利兰·斯坦福大学托管委员会 | 抗SIRPα抗体和双特异性巨噬细胞增强型抗体 |
CN111133526A (zh) * | 2017-07-18 | 2020-05-08 | 生命分析有限公司 | 发掘可用于机器学习技术中的新颖特征,例如用于诊断医疗状况的机器学习技术 |
CN111133526B (zh) * | 2017-07-18 | 2024-04-12 | 生命分析有限公司 | 发掘可用于机器学习技术中的新颖特征,例如用于诊断医疗状况的机器学习技术 |
EP3655895A4 (fr) * | 2017-07-18 | 2021-04-21 | Analytics For Life Inc. | Découverte de nouvelles caractéristiques à utiliser dans des techniques d'apprentissage automatique, telles que des techniques d'apprentissage automatique pour diagnostiquer des états de santé |
US12243624B2 (en) | 2017-07-18 | 2025-03-04 | Analytics For Life Inc. | Discovering novel features to use in machine learning techniques, such as machine learning techniques for diagnosing medical conditions |
US11062792B2 (en) | 2017-07-18 | 2021-07-13 | Analytics For Life Inc. | Discovering genomes to use in machine learning techniques |
US11139048B2 (en) | 2017-07-18 | 2021-10-05 | Analytics For Life Inc. | Discovering novel features to use in machine learning techniques, such as machine learning techniques for diagnosing medical conditions |
US11287420B2 (en) | 2019-01-25 | 2022-03-29 | Ixcela, Inc. | Detection and modification of gut microbial population |
US11287417B2 (en) | 2019-01-25 | 2022-03-29 | Ixcela, Inc. | Detection and modification of gut microbial population |
US11287419B2 (en) | 2019-01-25 | 2022-03-29 | Ixcela, Inc. | Detection and modification of gut microbial population |
US11287418B2 (en) | 2019-01-25 | 2022-03-29 | Ixcela, Inc. | Detection and modification of gut microbial population |
US11041847B1 (en) | 2019-01-25 | 2021-06-22 | Ixcela, Inc. | Detection and modification of gut microbial population |
CN111220720A (zh) * | 2019-12-05 | 2020-06-02 | 上海上药第一生化药业有限公司 | 胰蛋白酶及其酶原的纯度检测方法 |
CN111220720B (zh) * | 2019-12-05 | 2022-07-22 | 上海上药第一生化药业有限公司 | 胰蛋白酶及其酶原的纯度检测方法 |
US20230223111A1 (en) * | 2021-03-31 | 2023-07-13 | PrognomIQ, Inc. | Multi-omic assessment |
WO2024017335A1 (fr) * | 2022-07-22 | 2024-01-25 | 电子科技大学长三角研究院(衢州) | Procédé de communication moléculaire directionnelle à relais basé sur de multiples attracteurs |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6010597B2 (ja) | 卵巣癌用のバイオマーカー | |
JP4644123B2 (ja) | 卵巣がんを検出するためのバイオマーカーの使用 | |
EP1756566B1 (fr) | Biomarqueurs du cancer de l'ovaire | |
US8389222B2 (en) | Apolipoprotein fingerprinting technique and methods related thereto | |
JP5159619B2 (ja) | 卵巣癌のバイオマーカー:ctap3関連タンパク質 | |
AU2006336091B2 (en) | Method and markers for the diagnosis of renal diseases | |
EP1838867B1 (fr) | Isoforme d'apolipoproteine a-ii utilisee en tant que biomarqueur pour le cancer de la prostate | |
US8623667B2 (en) | Method for diagnosing cancer using lectin | |
WO2005036180A1 (fr) | Methodes d'analyse utilisant des biomarqueurs concentres avec des molecules d'attraction de biomarqueurs | |
Gulcicek et al. | Proteomics and the analysis of proteomic data: an overview of current protein‐profiling technologies | |
US20050100967A1 (en) | Detection of endometrial pathology | |
JP4864875B2 (ja) | 血液悪性腫瘍における診断、分類、治療に対する反応および臨床的挙動の予測、治療の階層化、ならびに疾患のモニタリングのための血漿プロテオミクスパターンの使用法 | |
Jimenez et al. | High-throughput and targeted in-depth mass spectrometry-based approaches for biofluid profiling and biomarker discovery | |
CN101087889A (zh) | 乳癌的生物标记 | |
US20050158745A1 (en) | Methods and compositions for detection of nasopharyngeal carcinoma | |
US20130261015A1 (en) | Polypeptide markers for the diagnosis of cancers, and methods for the diagnosis of cancers using the same | |
Meani et al. | Clinical application of proteomics in ovarian cancer prevention and treatment | |
CA2525740A1 (fr) | Biomarqueurs permettant d'effectuer un diagnostic differentiel de la pancreatite et du cancer du pancreas | |
WO2006071843A2 (fr) | Biomarqueurs destines au cancer du sein |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
122 | Ep: pct application non-entry in european phase |