+

WO2000066777A2 - Analyse de la courbe de denaturation thermique de l'adn - Google Patents

Analyse de la courbe de denaturation thermique de l'adn Download PDF

Info

Publication number
WO2000066777A2
WO2000066777A2 PCT/US2000/011084 US0011084W WO0066777A2 WO 2000066777 A2 WO2000066777 A2 WO 2000066777A2 US 0011084 W US0011084 W US 0011084W WO 0066777 A2 WO0066777 A2 WO 0066777A2
Authority
WO
WIPO (PCT)
Prior art keywords
data
dna
samples
target
melting curve
Prior art date
Application number
PCT/US2000/011084
Other languages
English (en)
Other versions
WO2000066777A3 (fr
Inventor
George Tice, Jr.
Ganesh Vaidyanathan
Aaron J. Owens
Michael Schaffer
Mark A. Jensen
James W. Hazel
Original Assignee
Qualicon, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualicon, Inc. filed Critical Qualicon, Inc.
Priority to EP00931949A priority Critical patent/EP1185697A2/fr
Priority to JP2000615399A priority patent/JP2002543408A/ja
Priority to AU49753/00A priority patent/AU4975300A/en
Publication of WO2000066777A2 publication Critical patent/WO2000066777A2/fr
Publication of WO2000066777A3 publication Critical patent/WO2000066777A3/fr

Links

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/62Systems in which the material investigated is excited whereby it emits light or causes a change in wavelength of the incident light
    • G01N21/63Systems in which the material investigated is excited whereby it emits light or causes a change in wavelength of the incident light optically excited
    • G01N21/64Fluorescence; Phosphorescence
    • G01N21/6428Measuring fluorescence of fluorescent products of reactions or of fluorochrome labelled reactive substances, e.g. measuring quenching effects, using measuring "optrodes"
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • C12Q1/6851Quantitative amplification
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6888Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
    • C12Q1/689Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for bacteria
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/70Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving virus or bacteriophage
    • C12Q1/701Specific hybridization probes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/16Primer sets for multiplex assays

Definitions

  • This invention relates to the field of molecular biology and more particularly to DNA-based diagnostic protocols.
  • dsDNA double-stranded DNA
  • separation of the nucleic acid products of an assay such as a DNA amplification reaction, from the starting matrix. Separation methods include analytical centrifugation, equilibrium dialysis, melting temperature profiles, and polyacrylamide gel electrophoresis and the like. Such methodology is complex, expensive and time consuming.
  • Higuchi employs an intercalating fluorescent dye (ethidium bromide) for labeling during the amplification and subsequent homogeneous detection of the dsDNA.
  • Yamamoto et al. in U.S. Patent 5,670,315 teach measurement of fluorescence during Polymerase Chain Reaction (PCR) using a specified dye (a pyrilium) as a quantitative indicator of PCR product.
  • PCR is the well-known method in which extremely small amounts of genetic material are rapidly amplified as set forth in U.S. Patent 4,683,195 by Mullis et al. and U.S. Patent 4,683,202 by Mullis.
  • the above cited methods are useful for the detection and quantitation of amplified target DNA.
  • some of these methods can not adequately distinguish single stranded DNA (ssDNA) from dsDNA.
  • All of the methods use purified, homogeneous DNA sample in the stated amplification reactions, and some methods use variable-temperature analyses with sensitive DNA-intercalating dyes.
  • none of the methods, using variable-temperature analyses with sensitive DNA-intercalating agents analyze the pattern of fluorescence signal in the manner described herein, namely monitoring and interpreting the change in fluorescence while slowly raising the temperature of the DNA sample to a denaturing level. By so doing, the analysis of the patterns generated can be used to distinguish specific target dsDNA from other nucleic acid material, for example an internal positive control DNA fragment.
  • Applicants have made the unexpected and surprising discovery that using divergent modeling techniques for the analysis of melting curve data and comparing the model's predictions improves the quality of data obtained from the assay.
  • This method can definitively detect the presence of a specific dsDNA fragment in the presence of other non-target amplified material.
  • the method of the invention is applicable to the detection of a specific target dsDNA regardless of its source.
  • the dsDNA usually is derived by a primer-directed amplification protocol.
  • Applicants have discovered that the information needed to identify specific dsDNA exists in melting curves, and can be extracted despite i) extraneous information derived from a starting matrix (such as occurs in food samples) ii) amplified background material and iii) interfering enzymatic reactions such as those that cause smears in gel electrophoresis. Further, it was found that appropriate, predictive, modeling techniques can be applied to data derived from these curves and used to overcome these difficulties and be probative of the presence of target DNA.
  • an important aspect of this invention is to provide a rapid, inexpensive, easily-carried-out, homogeneous test for identifying i) a target DNA, or ii) an organism by identifying a specific piece of DNA target unique to that organism.
  • the test provides improved reliability, particularly in the presence of a starting matrix that ordinarily interferes with DNA-based identification assays.
  • a further aspect provides a more definitive result than temperature dependent fluorescence (TDF). TDF is described in World Patents WO 97/46712 and 46714 by Wittwer et al. or endpoint fluorescence measurements.
  • a still further aspect of this invention is that it equals or exceeds the sensitivity of gel detection methods.
  • This invention pertains to the use of computer-processed melting curve analysis to achieve target DNA strand detection. More particularly, this invention concerns a computational method for homogeneous detection of a specific target DNA sequence in a solution of unknown sequences including any starting matrix comprising: i. providing a solution containing a specific DNA sequence at an effective concentration; ii. adding an effective amount of reagents sufficient to amplify a specific
  • DNA sequence including an intercalating dye that emits a significant fluorescent signal when bound to double stranded DNA and an insignificant signal in the presence of single stranded DNA; iii. applying successive serial treatments to amplify the specific target DNA in the reaction; iv. subjecting the solution of step ii to a single temperature cycle rising from an annealing condition to a melted condition while exciting fluorescence and monitoring the excited fluorescence at selected intervals to produce a series of analog output signals representative of the melting curve of DNA in the solution; v.
  • Figures la-c show, as a function of temperature, the progression from a) smooth melting curve data, to b) the transformed (i.e., -d(logF)/dT) data, and c) the sub-sampled data for a DNA fragment amplified from Listeria monocytogenes.
  • Figure 2 plots the sub-sampled, transformed -d(logF)/dT fluorescence data generated by raising the temperature of multiple samples to a denaturing level.
  • the plot is a composite of many experiments, and includes both Salmonella spiked (added bacterial DNA) and unspiked (no added bacterial DNA) samples presented in Example 1.
  • Figure 3 plots the sub-sampled, transformed -d(logF)/dT fluorescence data generated by raising the temperature of multiple samples to a denaturing level.
  • the plot is a composite of many experiments, and includes both Salmonella spiked and unspiked samples presented in Example 2.
  • Figure 4 plots the sub-sampled, transformed -d(logF)/dT fluorescence data generated by raising the temperature of multiple samples to a denaturing level.
  • the plot is a composite of many experiments, and includes both Escherichia coli spiked and unspiked samples presented in Example 1.
  • Figure 5 plots the sub-sampled, transformed -d(logF)/dT fluorescence data generated by raising the temperature of multiple samples to a denaturing level.
  • the plot is a composite of many experiments, and includes both Listeria monocytogenes spiked and unspiked samples presented in Example 1.
  • Figure 6 plots the sub-sampled, transformed -d(logF)/dT fluorescence data generated by raising the temperature of a sample to a denaturing level.
  • the plot includes typical product "smears" that were generated in Examples 1 and 2.
  • a smear is defined as a nonspecific amplified DNA product having a wide range of fragment sizes. This range is typically from 100 base pairs to greater than 2,000 base pairs.
  • Figure 7 is a flow sheet for the data analysis methods of the invention. It is believed that the information rich areas defined by InfoEvolve could be modeled using Neural Networks, and conversely the sub-sampled data could be modeled with InfoEvolve, as indicated by the arrows connecting the respective boxes. However, this practice is not explicitly demonstrated in this application.
  • Figure 8 shows typical raw melting curve data as a function of temperature.
  • Figure 9 shows a typical transformed melting curve spectrum.
  • Figure 10 is a visualization of cells with two features and two output states.
  • Figure 11 is an information map for the smear versus non-smear Salmonella example of InfoEvolve.
  • Figure 12 is an information map showing the information rich regions in the temperature spectrum for distinguishing positive versus negative samples in the second Salmonella example of InfoEvolve.
  • Figure 13 is a plot of the L. mono melting curve of Example 4.
  • Figure 14 is a plot of the melting curve of the positive control of Example 4.
  • Figure 15 is a plot of the melting curve of the solution with both positive control and
  • SEQ ID NO:l is the synthetic oligonucleotide sequence for a primer specific for amplifying a section of the Salmonella genome.
  • SEQ ID NO:2 is the synthetic oligonucleotide sequence for a primer specific for amplifying a section of the Salmonella genome.
  • SEQ ID NO: 3 is the synthetic oligonucleotide sequence for a primer specific for amplifying a section of the SV-40 genome.
  • SEQ ID NO:4 is the synthetic oligonucleotide sequence for a primer specific for amplifying a section of the SV-40 genome.
  • SEQ ID NO: 5 is the synthetic oligonucleotide sequence for a primer specific for amplifying a section of the Escherichia coli 0157:H7 genome.
  • SEQ ID NO:6 is the synthetic oligonucleotide sequence for a primer specific for amplifying a section of the Escherichia coli 0157:H7 genome.
  • SEQ ID NO:7 is the synthetic oligonucleotide sequence for a primer specific for amplifying a section of the Listeria monocytogenes genome.
  • SEQ ID NO: 8 is the synthetic oligonucleotide sequence for a primer specific for amplifying a section of the Listeria monocytogenes genome.
  • SEQ ID NO: 9 is the synthetic oligonucleotide sequence for a primer specific for amplifying a section of the genus Listeria genome.
  • SEQ ID NO: 10 is the synthetic oligonucleotide sequence for a primer specific for amplifying a section of the genus Listeria genome.
  • These primers are used in amplification of strain specific DNA fragments. All are included as part of the Qualicon BAX® for Screening kits, manufactured by Qualicon Inc., Wilmington, DE, that are useful in detecting genus Listeria (product number 17710603), Salmonella (product number 17710604), Listeria monocytogenes (product number 17710605), Escherichia coli 0157:H7 (product number 17710606), and the SV-40 fragment is a control reaction.
  • This invention pertains to the use of computerized melting curve analysis to achieve target DNA strand detection as stated below: a computational method for homogeneous detection of a specific target DNA sequence in a solution of unknown sequences including any starting matrix comprising: i. providing a solution containing a specific DNA sequence at an effective concentration; ii. adding an effective amount of reagents sufficient to amplify a specific DNA sequence including an intercalating dye that emits a significant fluorescent signal when bound to double stranded DNA and an insignificant signal in the presence of single stranded DNA; iii. applying successive serial treatments to amplify the specific target DNA in a reaction; iv.
  • step ii subjecting the solution of step ii to a single temperature cycle rising from an annealing condition to a melted condition while exciting fluorescence and monitoring the excited fluorescence at selected intervals to produce a series of analog output signals representative of the melting curve of DNA in the solution; v. inputting the output signals to a computer programmed to analyze the curve in digital form, smoothing and transforming the data of said curve to form a spectrum and analyzing the spectrum to determine a probative predictive model therefrom based on values previously established by training the computer program with known DNA sequences; and vi. identifying the unknown sequence by a predictive value in excess of a selected threshold value.
  • significant FIU it is meant any Fluorescence in Arbitrary Units (FIU) signal that is at least one standard deviation above the mean of a reagent blank FIU value.
  • insignificant FIU it is meant any FIU signal that is less than the mean of a reagent blank FIU value.
  • smoothing it is meant removing the random scatter of data points throughout the overall trend.
  • training it is meant determining the weights or parameters of the model which give the best match of the observed data.
  • predictive value it is meant a number that is generated by the computational method of the invention, that has a value of between zero and one. Wherein zero is defined as having no correlation to a trained positive value, and one is defined as having a 100% correlation to a trained positive value.
  • threshold value it is meant an arbitrary value used to compare with the predictive value. If the predictive value is greater than the threshold value the sample is considered positive, i.e., contains target DNA. The closer the threshold value is to one, the more confident you are in calling the sample positive. It is preferred that the threshold value is greater than 0.4, preferably greater than 0.5, more preferably greater than 0.7, and most preferably greater than 0.9.
  • an effective amount it is meant the minimal-to-optimal amount of reagent necessary to produce or detect a product, when used in concert with other defined reagents in protocols outlined in the invention.
  • effective concentration it is meant the minimum-to-optimum amount of DNA in a solution that can be amplified by the method of the invention to form a detectable product.
  • the "selected length" of said product is determined by the sequence of the target DNA and the selection of primers that will amplify a specific DNA fragment when the protocol outlined in the invention is followed.
  • the effective concentration of DNA will be affected by components introduced by the tested sample.
  • starting matrix all constituents contained in a sample solution including, DNA, buffer components, food components, and other additives.
  • the preferred method uses any well characterized amplification protocol, including but not limited to, PCR, strand displacement, ligase chain reaction (LCR), and the particularly preferred method of nucleic acid sequence based amplification (NASBA) with PCR (Food Microbiology Fundamentals and Frontiers, 1997, M. E. Doyle, L. R. Beuchat, and T. J. Montville, ASM Publication, pp. 723-724).
  • Intercalating dyes such as ethidium bromide, YoPro®, and SYBR® Green (manufactured by Molecular Probes) are useful in practicing the technique.
  • YoPro® Quinolinium, 4-[(3-methyl-2(3H)-benzoxazolylidene)- methyl]-l-[3-(trimethylammonio)propyl]-,diiodide
  • SYBR® Green Molecular Probes, Inc., Eugene, OR
  • the mathematical basis for determining the nature of the melting curve may involve the determination of information rich regions which may be done either by visual isolation of significant features or, preferably, by algorithmic creation of an information map.
  • the smoothing step is best performed by a Savitzky-Golay algorithm.
  • the data establishing a melting curve for definitive comparison is analyzed by one or more of several divergent modeling procedures. These include neural net analysis and "InfoEvolve" analysis. Where less precise results are adequate (i.e., lesser than 80% accuracy), multiple linear regression (MLR), partial least squares regression (PLSR), and logistic regression (LR) can be used to establish a predictive model. If more than one mathematical protocol is employed to model the same test data, and this is preferred, the results are polled to improve performance of predictive accuracy.
  • a "smear” is a test in which an excess of amplified genetic matter is present producing multiple bands in electrophoresis indicating a non-predictive test.
  • the predictive method can reproducibly identify samples generating smears, thereby allowing for the removal of these samples from further analysis or consideration. Further, the method of the invention may be done in the presence of a positive control.
  • the invention comprises subjecting a sample having an adequate number of copies of a target DNA sequence (such as derived from a nucleic acid amplification technique) to a single temperature cycle passing from the annealed state of the target DNA to the denatured state and obtaining temperature and fluorescence data.
  • a target DNA sequence such as derived from a nucleic acid amplification technique
  • fluorescence detection means such as the ABI 7700 available from The Perkin-Elmer Corporation.
  • the data is exported and processed with various transformations in preparation for analysis.
  • the processed data is then fed into a predictive modeling system, or systems, which have been trained using melting curve data derived from known examples tested under pristine conditions.
  • the procedure preferred is to (a) interpolate the data to get evenly spaced temperature data points (T), (b) take the log of fluorescence (F), and (c) smooth the results.
  • the Savitzky-Golay algorithm known to those skilled in the art of mathematical and statistical analysis, is preferred for smoothing.
  • the negative derivative, -d(logF)/dT is calculated and the data is preferably reduced to 11 to 13 data points using a target specific range between 70°C and 95°C with a preferred spacing of one degree Celsius.
  • Figures la, b, and c show this progression using data obtained from Listeria monocytogenes. In Figure la, the smoothed fluorescence in arbitrary units (FIU) is shown.
  • Figure lb there is seen the plot of the derivative and Figure lc shows the transformed data.
  • Data from target DNAs are best reduced to a lower resolution by reduction to a manageable number of data points. Examples of the best number of data points for specific target DNAs are described in Table I along with the temperature range spanned.
  • the process of the present invention may be used to detect the presence of a wide variety of target nucleic acids.
  • target nucleic acids include, but are not limited to, DNA derived from microorganisms such as bacteria, yeast and fungi, as well as viruses, insects, plants, animals and humans.
  • pathogenic microorganisms known to contaminate food. These include but are not limited to E. coli 017:H57. Salmonella, Listeria monocytogenes, genus Listeria, and Campylobacter which, despite the attention paid to preventive measures, sicken and kill significant numbers of people each year making rapid and reliable identification highly important.
  • this process can detect whether a sample contains multiple-sized DNA amplification products that are defined as a smear.
  • An artificial neural network or a parallel distributed processing network, is a computational circuit which is implemented as a multi-layered arrangement of interconnected processing elements.
  • the network architecture contains at least three layers of processing elements or units: a first, or "input”, layer; a second, intermediate or “hidden”, layer; and a third, or "output” layer. There can also be additional hidden layers.
  • the layers are typically fully connected with weights (i.e., one from each input unit to each hidden unit, one from each hidden unit to each output unit), and the output of each unit in the hidden and output layers is given by the weighted sum of the connections coming from the prior layer, followed by a nonlinear transfer function like the sigmoid or hyperbolic tangent.
  • weights i.e., one from each input unit to each hidden unit, one from each hidden unit to each output unit
  • the weights of the network are obtained with respect to a training data set by an iterative procedure of changing the connection weights in order to minimize the sum of squares of predicted outputs versus the observed outputs.
  • Methods of training the network include the Delta Rule (Rumelhart and McClelland, 1986), the Levenberg-Marquardt method, conjugate gradient, quasi-Newton methods (Hertz et al, 1991), and the stiff backpropagation method (Filkin, U.S. Patent 5,046,020).
  • the objective of the data analysis method is to take an observed melting curve, with counting rate of photons (F) as a function of temperature (T), and to input that information into a computer.
  • the computer is programmed to make a prediction of whether or not the corresponding sample was contaminated with a specific DNA of a pathogen.
  • the temperatures T(t) output by the thermal cycler are not necessarily equally spaced, use linear interpolation to produce a sequence of equally- spaced temperatures T(t') and the corresponding fluorescence F(t').
  • the equal temperature step is 0.1 degrees centigrade.
  • the equally spaced observations of fluorescence are next transformed to produce a signal that looks like a spectrum.
  • both the smoothing and the derivative are performed using the Savitsky-Golay smoothing algorithm (PLS Toolbox for Matlab, EigenvectorResearch, Inc.), with a smoothing interval of 2.5 degrees Celsius.
  • the Melting Curve Spectrum (block B) corresponding to the raw Melting Curve Data in Figure 8 is shown in Figure 9. Note that the spectrum is smooth, positive, and with well- distinguished peaks.
  • the Melting Curve Spectrum (block B) is then analyzed by two parallel methods, as shown in Figure 7. In the left alternative, block C, the spectrum is either averaged or subsampled to produce a spectrum with fewer points. This is done because empirical models are generally more robust when trained on fewer features.
  • the distance in temperature is increased from 0.1 °C to 1.0°C either by averaging 10 successive data points or by using each tenth data point (sub-sampling).
  • the sub-sampled Melting Curve Spectrum in block C can then be used with either a neural network model (block E) or an InfoEvolve model (block F).
  • the Melting Curve Spectrum can be used with an InfoEvolve model to select a set of information-rich Temperatures. (See the discussion of InfoEvolve following).
  • the Melting Curve Spectrum corresponding to these Temperatures can then be used either with a neural network model (block E) or an InfoEvolve Model (block F).
  • the neural network used (block E) in the preferred implementation is a feedforward multi-layer perceptron (Rumelhart and McClelland, 1986), with one input unit for each of the 10 to 14 subsampled points on the Melting Curve Spectrum (-d log(F)/dT), a number, M, of hidden units, and a single output unit with zero target corresponding to a non-spike sample and one to a spiked sample.
  • the network is trained using stiff backpropagation (Filkin, U.S. Patent 5,046,020), with the DuPont proprietary software package called "The Neural Webkit" (van Stekelenborg, 1997).
  • the input elements are scaled together, which is appropriate for a spectrum.
  • Example 7 There are cases in which two (or more) simultaneous analyses are conducted in the same reaction (see Example 7).
  • the amplified DNA from a target organism (Target) is detected simultaneously with the amplified product from an internal positive control DNA (INPC).
  • the Target and INPC fragments are produced from the same substrate pool, so their peak heights in the Melting Curve Spectrum are related.
  • Neural networks can model two related simultaneous outputs like Target and INPC.
  • the temperature range of the Melting Curve Spectrum must encompass both peak temperatures for Target and INPC.
  • the neural network can also be trained using the data on information rich regions generated by InfoEvolve (block D). This gives a second neural network model. It can also be trained on multiple data sets, with inputs either from block C or block D, to give several parallel neural network models, each capable of making a prediction of the not spiked versus, spiked classification. DESCRIPTION OF INFOEVOLVE
  • the second preferred predictive model, InfoEvolve, block F of Figure 7, is a methodology that applies the principle of evolution to information theory. It is the subject of the provisional U.S. Application No. 60/131,804, filed on April 30, 1999, and commonly owned and filed of even date with the instant application, the contents of which are incorporated herein by reference.
  • the method is nonlinear, robust and inherently parallel in architecture, making it suitable for the modeling of complex, multi-dimensional processes. In addition, it simultaneously produces solutions to the inverse problem of data or process control as a natural consequence.
  • An important benefit is the identification of the most information rich inputs, or combination of inputs, for the modeling task at hand. This aids in developing optimal strategies for decision making. For example, in DNA identification, the ability to locate the most information rich portion of a melting curve temperature spectrum permits development of the most robust predictive empirical model.
  • a related data visualization tool using concepts derived from image analysis is part also of the conception. This enables easy visualization of information rich zones in a multi-dimensional data space leading to methods for identifying the most information rich pathway from a starting point in a process space to a targeted ending point.
  • a fundamental postulate of the approach is that there is an underlying order in many apparently disordered systems and, by evolving the optimum (most information rich) representation of the data set, the underlying structure can be revealed.
  • the method uses both local and global information measures in characterizing the information content of multi-dimensional feature spaces. Empirical results show that local information dominates the predictive capability of the model.
  • the method is a globally-influenced, local technique in contrast to many other methods that use global optimizations over the entire data set as the major mechanism by which to converge to a solution.
  • the InfoEvolve modeling/discovery framework can be broadly divided into two stages.
  • the first stage represents the process of "Feature Identification” (going from box B to D in Figure 7) where the most information rich combinations of input features for a particular modeling problem are evolved.
  • the second stage represents the process of "Model Evolution” (box D to F in Figure 7), where the best predictive model is evolved using the information rich features discovered in the first stage.
  • Feature identification stage a potentially large list of input features, many of which might be irrelevant to the modeling problem at hand, is provided.
  • a genetic framework is used to evolve the most information rich subset of input features so that the dimensionality of the modeling problem is reduced. This is very important in order to develop more robust predictive models.
  • the information content of the subspace can be measured using entropic measures derived from information theory (Shannon, 1997).
  • the local information content of a cell can be described by the relative distribution of output states present in that cell. For example, if all the data points which project into a cell represent one output state, that cell is considered to be relatively information rich. Conversely, if multiple output states are represented in the population of data points within a given cell, that cell is considered to be relatively information poor.
  • Figure 10 illustrates both information rich and information poor cells within a representative feature subspace. In the figure the round dots represent output-state 1 and the square dots represent output-state 2.
  • the global entropy of each subspace in our original random pool of subspaces is then used as a fitness function to drive an evolutionary process to evolve the most information rich subspaces.
  • Standard genetic algorithms have been adapted to perform this task (Holland, 1975: Goldberg, 1989; Mitchell, 1997).
  • the final pool of information rich input feature combinations can be used directly as inputs to the Model Evolution phase of InfoEvolve. Alternatively, this pool can be analyzed statistically to isolate the most frequently occurring inputs by creating a histogram of input feature frequencies derived from the final pool. We refer to this histogram as an Information Map.
  • an exhaustive search over all sub dimensions of this subset can be performed to determine an optimum modeling architecture involving this feature subset. For example, if we select 12 input features from the Information Map, we can test all one dimensional projections of these 12 features against a test set to determine a test error rate. We describe how the test error rate is measured for a given subset of feature subspaces in the following paragraph. We can repeat this process by considering all two dimensional, three dimensional and higher dimensional projections respectively for modeling accuracy against the test set. Once an optimum dimensionality has been determined, all the feature subspace combinations consistent with the optimum dimensional projection can be computed and this set of subspaces can be used as inputs into the Model Evolution phase.
  • the pool of feature subspaces determined via the methods described above is used as inputs to evolve the optimum subset of feature subspaces which results in the minimum test error.
  • a genetic framework is also used in this step, but the fitness function which drives the second evolutionary stage is based on the predicted error in the test data set used to develop the empirical model.
  • test data point is then projected into each feature subspace, and an entropically weighted probability vector is calculated over all the subspaces present in the subset to predict the likelihood of that test point being in each possible output state.
  • An output state for the test point is then predicted by selecting the state which has the highest probability value. This process is repeated for all the test data points, and an overall test error rate is calculated based on the difference between the actual and predicted output states.
  • This test error is used as the fitness function in the second evolutionary stage.
  • the test error is calculated for each subset of feature subspaces in our original random pool, and is used to drive an evolutionary process which results in evolving the subset of feature subspaces which results in the minimum error.
  • This final subset of feature subspaces derived from information rich features or feature combinations constitutes the empirical model evolved by InfoEvolve. New data points can be projected into this subset of feature subspaces and probability vectors can be calculated as described above in order to predict an output state.
  • HIERARCHICAL DNA IDENTIFICATION STRATEGY AND HYBRID MODELING We can develop a powerful DNA fragment identification method by using the two modeling schemes outlined above back-to-back in a sequential fashion. The first level of identification is to separate non-smear from smeared samples. This is followed by identifying the specific DNA fragment of interest for the non smeared samples. In practice, this hierarchical method has proven to be more accurate than using a single 3 state model with positives, negatives and other nucleic acid material representing the possible output categories.
  • the foods and bacterial strains used in this study were selected for the following reasons. Ground beef and ground turkey were selected because these foods are commonly tested, have a high level of animal commensals, high fat content or have non uniform physical properties. Soft cheese was chosen for high fat content, high level of soft cheese organisms and are mechanically challenging. Mozzarella cheese contains high background of hard cheese organisms. Milk was included in the testing panel because of the high lactic bacteria background and it is optically dense with strong light scattering properties. Cocoa was tested because this food contains high levels of polyphenolic compounds (PCR inhibitors) and is optically dense with strong light absorbing properties. Finally coleslaw was chosen because of the low pH associated with this food. The bacterial strains that were chosen are either common contaminants of the foods being tested or represent different ribotypes of the BAX® target organisms.
  • Salmonella Sample Preparation In a 2 ml screw cap tube, 5 microliters (ul) of the incubated diluted enrichment were added to 200 ul of the lysis reagent (5 ul BAX® lysis buffer and 62.5 ul BAX® Protease) containing a 1 : 10,000 dilution of the DNA intercalating dye SYBR® Green (Molecular Probes). The tubes were incubated at 37°C for 20 minutes followed by 95°C for 10 minutes. Fifty microliters of this crude bacterial lysate was used to hydrate the one BAX® Salmonella sample tablet that was contained in each of the PCR tubes of the type used with the Perkin Elmer 7700 Sequence Detector instrument. The tubes were capped and thermal cycled according to the following protocol in a Perkin Elmer 9600 thermal cycler:
  • E. coli Sample Preparation In a 2 ml screw cap tube, three (3) microliters of the overnight enrichment was added to 400 ul of the lysis reagent (5 ml BAX® lysis buffer + 62.5 ul BAX® Protease) containing a 1 :25,000 dilution of the DNA intercalating dye SYBR® Green (Molecular Probes). The tubes were incubated at 37°C for 20 minutes followed by 95°C for 10 minutes. Fifty (50) microliters of this crude bacterial lysate was used to hydrate one BAX® E. coli sample that were contained in PCR tubes used with the Perkin Elmer 7700 Sequence Detector instrument. The tubes were capped and thermal cycled according to the following protocol in a Perkin Elmer 9600 thermal cycler:
  • melting curves were generated on the Perkin Elmer 7700 DNA Sequence Detector by running the following conditions:
  • the multicomponent data was exported from the instrument and was used in the analysis.
  • the production of the specific DNA fragment was verified by adding 15 ul of BAX® Loading Dye to the amplified sample. A 15 ul aliquot was then loaded into a well of a 2% agarose gel containing ethidium bromide. The gel was run at 180 volts for 30 minutes. The specific product was then visualized using UV transillumination.
  • the raw fluorescence data was imported into Microsoft Excel for processing. For each well, the fluorescence data was normalized by subtracting the lowest fluorescence value from the rest of the data points. From this stage divergent approaches were used for modeling the data. 1. Melting Curve
  • the normalized data is then smoothed with a Savitzky-Golay smoothing algorithm.
  • the negative derivative was taken of the logarithm of the fluorescence with respect to temperature (-dlog(F)/dT) and plotted, -dlog(F)/dT (y-axis) vs. Temp (x-axis).
  • the data is interpolated to a 0.1 °C resolution using a cubic spline interpolating function.
  • the logarithm of the interpolated data is then taken and smoothed with a Savitzky-Golay smoothing algorithm over 2.5 degrees.
  • the negative derivative is taken of the log fluorescence with respect to temperature (-d(log F)/dT) and parsed at a 1.0°C interval using the data ranges below:
  • the goal of modeling the data is to differentiate a positive sample from a blank reaction based on the melting curve data.
  • the positive samples are assigned a classification of ' 1 ', while the blank reactions were assigned a classification of '0'.
  • the combined data set is then randomly split into a training set and a test set using an approximate 75:25 ratio.
  • the training set is used to train a neural network after which the test set is used to verify the trained model.
  • any data generated using the same reaction and melting curve conditions should be able to be processed with the model, to obtain a predicted result.
  • Modeling involves minimizing the errors in the classification of the training and test samples. Three modeling approaches were used to analyze the data: Neural Networks, InfoEvolve, and Logistic Regression. Salmonella Results:
  • Salmonella samples were produced during these experiments.
  • the data set consisted of 119 positive samples (samples spiked with Salmonella) and 30 blank reactions (unspiked samples). All of the samples which were spiked with Salmonella were positive based on the agarose gel results. All, but one of the thirty blank reactions were negative on the gel. This one sample showed no Salmonella melting curve pattern. By examining the remaining melting curves, there was a 100% agreement between the melting curve pattern and the spiked/unspiked status of the samples.
  • E. coli samples were produced during these experiments.
  • the data set consisted of 48 positive samples (samples spiked with E. coli) and 10 blank reactions (unspiked samples). All but one of the samples spiked with E. coli were positive based on the agarose gel results. The spiked sample which didn't have a band on the gel shows a characteristic E. coli melting curve, yet the pattern is within the noise of the system.
  • the data set analyzed consisted of 96 positive samples (samples spiked with L. mono) and 24 blank reactions (unspiked samples). Of the samples which were spiked with L. mono, 78 were positive based on the agarose gel results. Of the 18 spiked samples which were not detected by the gel method, 14 occurred in chocolate and 4 occurred in coleslaw. All of the spiked samples had characteristic L. mono melting curve patterns and were detected as positives by the method of the invention. The 24 blank reactions were all negative on the gel, and were negative on analysis of the melting curve patterns.
  • a training set and test set were created.
  • the training set consisted of 89 samples, while the test set consisted of 31 samples.
  • Each set contained a proportional distribution of spiked and unspiked samples.
  • the Neural Network and InfoEvolve models were able to classify all of the test samples correctly.
  • the logistic regression model failed on one sample.
  • Foods were purchased from local grocery stores and were stored at 4°C. Thirty different foods were pre-enriched according the BAM procedure. Following the prescribed enrichment, samples were spiked with Salmonella newport or were left unspiked, see Table III. The enrichments were then diluted 1 : 10 in BHI (Difco) and then incubated at 37°C for 3 hours.
  • Liquid Egg TSB 1 10 0, 10 4 /mL, 10 5 /mL
  • Red Wheat Bran LB 1 10 0, 10 4 /mL, 10 5 /mL
  • Thyme TSB 1:10 10 7 /mL
  • Oregano TSB 1 100 10 7 /mL
  • Cinnamon TSB 1 100 10 7 /mL
  • Hershey's Cocoa Non fat dry milk 1 10 0, 10 7 /mL
  • PVPP Polyvinvlpolypvrrolidone
  • Salmonella Sample Preparation In a 2 ml screw cap tube, five (5) microliters of the enrichment or PVPP treated sample was added to 200 ul of the lysis reagent (5 ml BAX® lysis buffer and 62.5 ul BAX® Protease) containing a 1 : 10,000 dilution of the DNA intercalating dye SYBR® Green (Molecular Probes). The tubes were incubated at 37°C for 20 minutes followed by 95°C for 10 minutes. Following the 95°C incubation, 50 ul of a 4 mg/ml BSA solution was added to the lysate. This was done for both PVPP treated and untreated samples. As a control, some samples were left untreated.
  • the lysis reagent 5 ml BAX® lysis buffer and 62.5 ul BAX® Protease
  • SYBR® Green Molecular Probes
  • the negative derivative is taken of the smoothed fluorescence with respect to temperature (-dlog(F)/dT) and plotted, -dlog(F)/dT (y-axis) vs. Temperature (x-axis).
  • the data is interpolated to a 0.1 °C resolution using a cubic spline interpolating function.
  • the logarithm of the interpolated data is then taken and then smoothed with a Savitzky-Golay smoothing algorithm over 2.5 degrees.
  • the negative derivative is taken of the log fluorescence with respect to temperature (-d(log F)/dT) and parsed at a 1.0°C interval using the data range for Salmonella: 82.0°C to 93.0°C (12 data points).
  • the models used for the prediction of the samples are those created using the original 149 Salmonella data points previously mentioned.
  • Results Of the 309 data samples produced during these experiments, 204 were spiked with Salmonella and 105 samples were 'blank' reactions. Of the 204 spiked samples, 143 samples were positive on an agarose gel and 61 were negative on the gel. The negative samples can be attributed to the inhibition of PCR or inadequate gel or PCR sensitivity. Of the 105 'blank' reactions, 95 were negative on the gel, and 10 were positive on the gel. The positive samples can be attributed to natural food contamination (e.g., liquid egg samples) or technical errors.
  • natural food contamination e.g., liquid egg samples
  • the output of each of the modeling methods is a number between one and zero.
  • a T represents a 'spiked' prediction while a '0' represents an 'unspiked' prediction.
  • the number for each of the methods below shows the number of samples which agreed with the expected prediction.
  • the 'Expected Prediction' column displays a one or a zero based on the spike status and gel result. This number is what the model would be expected to predict based on the training samples.
  • EXAMPLE 3 EVALUATION OF THE HOMOGENEOUS DETECTION FOR THE DISCRIMINATION OF NON-SPECIFIC PCR PRODUCT FROM SPECIFIC PCR PRODUCT
  • the PCR process can generate non specific DNA products in the absence or presence of the target DNA. The mechanism that causes the formation of these products is not well understood. In order to have an acceptable detection performance, the method should be able to discriminate specific target DNA fragments from non-specific DNA products such as primer dimers and multiple sized fragments. Generation of non specific PCR products An overnight culture of E. coli O157:H7 was prepared by transferring one colony from an agar plate and placing it into 10 mL of BHI (Difco Laboratories) broth.
  • the culture was incubated overnight (22 ⁇ 2 hrs) at 35°C.
  • 3 ul of the overnight culture was added to 400 ul BAX® Lysis Reagent (5 ml BAX® Lysis Buffer + 62.5 ul BAX® Protease). This was done in triplicate.
  • the tubes were placed in a 37°C water bath for 20 minutes, followed by a 95°C water bath for 10 minutes. After lysis, the samples were cooled on ice.
  • two BAX® E coli Sample tablets contained in PCR tubes were hydrated with 50 ul of the lysate. The tubes were capped and thermal cycled on a Perkin-Elmer 9600 Thermal Cycler using the following conditions:
  • the production of the specific E coli fragment was verified by adding 15 ul of BAX® Loading Dye to one sample from each lysate. A 15 ul aliquot was then loaded into a well of a 2% agarose gel containing ethidium bromide. The gel was run at 180 volts for 30 minutes. The specific product then visualized using UV transillumination. In order to generate non-specific PCR products, 10 ul of the E.
  • the raw fluorescence data was imported into Microsoft Excel for processing. For each well, the fluorescence data was normalized by subtracting the lowest fluorescence value from the rest of the data points. From this stage divergent approaches are used for visualizing the data and modeling the data.
  • the normalized data was then smoothed with a Savitzky-Golay smoothing algorithm.
  • the negative derivative was taken of the fluorescence with respect to temperature (-dlog(F)/dT) and plotted, -dlog(F)/dT (y-axis) vs. Temp (x-axis).
  • the data was interpolated to a 0.1 °C resolution using a cubic spline interpolating function.
  • the logarithm of the interpolated data was then taken.
  • This data was then smoothed with a Savitzky-Golay smoothing algorithm over 2.5 degrees.
  • the negative derivative was taken of the log fluorescence with respect to temperature (-d(log F)/dT).
  • the derivative data was then parsed at a 1.0°C interval from 82.0°C to 93.0°C (12 data points). These twelve points are a section of the melting curve which has proven to contain the most information about the sample.
  • the modeling process utilizes the 12 data points described above.
  • the goal of modeling the data is to differentiate smears from non-smears (in this case Salmonella and blank samples). For this reason, the data was combined with Salmonella and blank samples.
  • the smear samples were assigned a classification of ' 1', while the non-smear samples were assigned a classification of '0'.
  • the combined data set was then randomly split into a training set and a test set using an approximate 75:25 ratio.
  • the training set was used to train a neural network.
  • the test set was used to verify the effectiveness of the trained model.
  • the intent of the modeling process was to minimize errors in the training and test sets. Results
  • a total of thirty smear samples were generated. Eight of the thirty samples were generated artificially by the previously mentioned protocol. The remaining 22 smears occurred naturally in various experiments and consisted of 8 environmental samples, 12 Salmonella samples, and 2 unspiked reactions.
  • EXAMPLE 4 THE DETECTION OF MULTIPLE DNA FRAGMENTS IN A SINGLE TUBE USING MELTING CURVE ANALYSIS It would be advantageous to have the ability to be able to homogeneously distinguish multiple DNA fragments in a single reaction tube. This would allow the incorporation of an amplification control reaction in the same tube as the test reaction. In addition, it could allow for the amplification and detection of a multiplex reaction where a sample is being screened for two or more specific DNA targets. This example shows the feasibility of using melting curve analysis for the homogeneous detection of multiple DNA fragments in a single tube.
  • a 1 :10,000 dilution of SYBR® Green was made in BAX® lysis buffer (Qualicon, Inc.). To an aliquot of the diluted dye solution was added a control plasmid to yield a level of about 10 3 -10 5 copies/reaction. In addition, amplifying primers to the control plasmid were added to a final concentration of 0.1 uM. L. monocytogenes DNA was added to both diluted dye solutions, with and without plasmid control reagents, to yield reactions containing 2, 0.2, 0.02 or 0 ng DNA/reaction. These solutions were then used to hydrate a BAX ® for Screening/ . monocytogenes sample tablet that was contained in a PCR reaction tube. The tube was capped and thermal cycled under the following conditions: 94°C 2.0 minutes 1 cycle 94°C 15 seconds 38 cycles
  • the multicomponent data was exported from the instrument and was used in the analysis.
  • the raw fluorescence data was imported into Microsoft Excel for processing. For each well, the fluorescence data was normalized by subtracting the lowest fluorescence value from the rest of the data points. The normalized data was then smoothed with a Savitzky- Golay smoothing algorithm. The negative derivative was taken of the fluorescence with respect to temperature (-dF/dT) and plotted, -dF/dT (y-axis) vs. Temp (x-axis). The plots were visually inspected to determine the absence or presence of the specific DNA targets. These results were compared to the results obtained from agarose gel detection. Results
  • the melting curve profiles for the Listeria monocytogenes amplification product and the positive control amplification product were produced according to the instant method ( Figures 13 and 14).
  • the melting curve profile of the combined products was also produced ( Figure 15).
  • the profile difference between the two products implied that a trained model would be capable of identifying a Listeria amplification product from a positive control when amplified in a single tube.
  • a crude cell lysate of Salmonella typhimurium was prepared by taking a five microliter aliquot of an 10 8 /ml culture and adding it to 200 ul of lysis reagent (lysis buffer and 62.5 ul of Protease) containing a 1:10,000 dilution of SYBR® GreenTM (Molecular Probes, Eugene OR). The tube was capped and incubated for 37°C for 20 minutes followed by 95°C for 10 minutes. There were four replicate lysates prepared. Fifty microliters of this crude bacterial lysate was used to hydrate eight BAX® Salmonella sample tablets that were contained in PCR tubes used with the Perkin Elmer 7700 Sequence Detector instrument. In addition, fifty microliters of lysis reagent containing dye was dispensed into fifty-six PCR tubes. All tubes were capped and thermal cycled according to the following protocol in a Perkin Elmer 9600 thermal cycler:
  • PCR Product Titer Following PCR, all of the fifty-six lysis reagent containing dye reactions were pooled and was used as the diluent to titer out the Salmonella specific product.
  • a 1 :10 dilution of the Salmonella specific product was made by adding thirty microliters of the PCR product to two-hundred seventy microliters of lysis reagent diluent.
  • a two-fold serial dilution of this material was made by mixing one hundred microliters of the 1 :10 diluted material with one hundred microliters of the lysis reagent diluent.
  • the raw fluorescence data was imported into Microsoft Excel for processing. For each well, the fluorescence data was normalized by subtracting the lowest fluorescence value from the rest of the data points. The normalized data is then smoothed with a Savitzky- Golay smoothing algorithm. The negative derivative is taken of the fluorescence with respect to temperature (-dF/dT) and plotted, -dF/dT (y-axis) vs. Temp (x-axis). The plots were visual inspected to determine the absence or presence of the specific DNA targets. These results were compared to the results obtained from agarose gel detection. Results
  • the level of sensitivity for the gel based detection was a 1 :10 dilution of the starting material whereas the level of sensitivity for melting curve analysis was 1 :40. These results indicate that melting curve analysis is approximately four times more sensitive than gel based detection.
  • This tablet contained the following components
  • the trehalose and carbowax were dissolved in deionized water.
  • the solution was cooled to 4°C.
  • the deoxy nucleotides, primers, SV40 DNA, Surfact-Amps, SYBR® Green, BSA and AmpliTaq were added to the cooled solution of trehalose and carbowax.
  • the solution was filtered through a 5 um cartridge, and then sprayed into a liquid nitrogen chamber at a reagent spray rate of approximately 120 ml/min through a 1.1 mm tip.
  • the frozen blend was then collected in the tray at the end of the chamber due to gravity.
  • the frozen blend was then freeze dried in a freeze drier (such as the GT6, available from Finn Aqua of Germany).
  • the freeze drying program consisted of primary drying at a product temperature of -40°C and a chamber pressure of 50 micron for 50 hours. Secondary drying was done at 25°C for 20 hours. The freeze dried blend was then sized through a 24 mesh screen. The sized blend was then tableted using a 3/32 tool to a final weight of 7.6 mg/tablet.
  • Lvsate Preparation In a 2 ml screw cap tube, 5 microliters (ul) of the incubated diluted enrichment were added to 200 ul of the lysis reagent (5 ml BAX® lysis buffer and 62.5 ul BAX® Protease). The tubes were incubated at 37°C for 20 minutes followed by 95°C for
  • the normalized data was then smoothed with a Savitzky-Golay smoothing algorithm.
  • the negative derivative was taken of the logarithm of the fluorescence with respect to temperature (-dlog(F)/dT) and plotted, -dlog(F)/dT (y-axis) vs. Temp (x-axis).
  • a melting temperature range for the internal positive control reaction was established. All samples that had a melting curve peak in this temperature range were considered positive and if no peak was found in that range the sample was called negative. The decision of whether a sample was positive or negative was made independent of peak height.
  • 336 Salmonella samples were produced during these experiments.
  • the data set consisted of 252 positive samples (samples spiked with Salmonella) and 84 blank reactions (unspiked samples).
  • 103 samples did not produce a Salmonella-specific band on the gel. This includes the 83 unspiked samples (one of the 84 unspiked sample was positive) and 20 spiked samples. All of the samples contained the chemistry necessary for the internal positive control reaction.
  • Two Salmonella- ga ⁇ ve reactions both were spiked failed to produce an internal positive control band on the gel. These samples were considered indeterminate because neither the Salmonella reaction nor the internal positive control reaction worked.
  • the remainder of the samples showed either or both the Salmonella and internal positive control reactions on the gel establishing a basis for multiplexing internal controls with the Salmonella specific reaction.
  • a training set and test set were created from these samples.
  • the training set consisted of 252 samples, while the test set consisted of 84 samples.
  • Each set contained a proportional distribution of spiked and unspiked samples.
  • the Neural Network modeling technique was able to classify all of the test samples correctly for an effective 100% agreement to the gel method.
  • EXAMPLE 7 SIMULTANEOUS MODELING OF TARGET AND INTERNAL POSITIVE CONTROL FOR SALMONELLA
  • 'INPC' internal positive control DNA
  • 'Target' specific target organism DNA
  • the signals for Target and INPC are thus related in these reactions. It is common practice with neural networks to model two related outputs simultaneously, with one model having two outputs. This Example demonstrates a preferred implementation of this method, although other variations exist for multiple output reactions.
  • Salmonella DNA was serially diluted in BAX® lysis buffer to yield concentrations of 10-0.005 ng/ml. These solutions were then used to hydrate a BAX® for Screening/Salmonella sample tablet with built-in control (Qualicon Kit catalog #17710604) that was contained in an optical PCR reaction tube. In addition, tablets were hydrated with lysis buffer only and served as a negative 'Target' control. The tubes were capped and processed on an instrument that integrates thermal cycling and the fluorescence detection of DNA fragments. This instrument is described in US Provisional Application No. 60/165,267 filed on November 12, 1999, the disclosure of which is hereby incorporated by reference. The samples were thermal cycled using the following program: 94°C 2.0 minutes 1 cycle
  • melting curves were generated on the instrument by running the following conditions: 1) a pre-melt incubation at 68.0°C for 120 seconds followed by
  • the raw fluorescence data was imported into Microsoft Excel for processing. For each well, the fluorescence data was normalized by subtracting the lowest fluorescence value from the rest of the data points. 1. Melting Curve
  • the normalized data was then smoothed with a Savitzky-Golay smoothing algorithm.
  • the negative derivative was taken of the logarithm of the fluorescence with respect to temperature (-dlog(F)/dT) and plotted, -dlog(F)/dT (y-axis) vs. Temp (x-axis).
  • a melting temperature range containing both the target organism ('Target') and the internal positive control reaction ('INPC') was established. All samples that had a target melting curve peak in the appropriate temperature range were considered Target positive, and if no target peak was found in that range the sample was called Target negative. The decision of whether a sample was positive or negative was made independent of peak height. Similarly, the samples were classified as INPC positive or negative depending on whether a peak was observed in the appropriate temperature range.
  • the variable 'IsPeak' is defined as positive if either of Target or INPC (or both) is positive and negative if neither Target nor INPC are positive.
  • 'IsPeak' is positive if there is an appropriate peak (Target or INPC) in the melting curve and is negative if there is no corresponding peak, i.e., for a null spectrum or blank sample. It was found empirically that classification accuracy increased using IsPeak as an output rather than INPC. 2. Simultaneous Modeling of the Target and IsPeak Data
  • the data was interpolated to a 0.1 °C resolution using a cubic spline interpolating function.
  • the logarithm of the interpolated data was then taken and smoothed with a Savitzky-Golay smoothing algorithm over 2.5 degrees.
  • the negative derivative was taken of the log fluorescence with respect to temperature (-d(log F)/dT) using the data range of 72.0°C to 93.0°C.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Engineering & Computer Science (AREA)
  • Immunology (AREA)
  • Analytical Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Biophysics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Genetics & Genomics (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Optics & Photonics (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • General Physics & Mathematics (AREA)
  • Pathology (AREA)
  • Virology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

L'invention concerne un procédé homogène de détection d'un ADN cible, tel que des séquences distinctes de bactéries particulières. On calcule la courbe de dénaturation thermique et on effectue une analyse informatisée en utilisant un modèle de prédiction en fonction du schéma posologique avec un ADN connu.
PCT/US2000/011084 1999-04-30 2000-04-26 Analyse de la courbe de denaturation thermique de l'adn WO2000066777A2 (fr)

Priority Applications (3)

Application Number Priority Date Filing Date Title
EP00931949A EP1185697A2 (fr) 1999-04-30 2000-04-26 Analyse de la courbe de denaturation thermique de l'adn
JP2000615399A JP2002543408A (ja) 1999-04-30 2000-04-26 Dna融解曲線分析に基づくアッセイ
AU49753/00A AU4975300A (en) 1999-04-30 2000-04-26 Dna melting curve analysis

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US13190199P 1999-04-30 1999-04-30
US60/131,901 1999-04-30

Publications (2)

Publication Number Publication Date
WO2000066777A2 true WO2000066777A2 (fr) 2000-11-09
WO2000066777A3 WO2000066777A3 (fr) 2001-08-02

Family

ID=22451516

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/011084 WO2000066777A2 (fr) 1999-04-30 2000-04-26 Analyse de la courbe de denaturation thermique de l'adn

Country Status (4)

Country Link
EP (1) EP1185697A2 (fr)
JP (1) JP2002543408A (fr)
AU (1) AU4975300A (fr)
WO (1) WO2000066777A2 (fr)

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004537050A (ja) * 2001-07-25 2004-12-09 ポスコ 核酸ハイブリッド化検出方法
EP1499745A2 (fr) * 2002-04-26 2005-01-26 University of Utah Research Foundation Caracterisation d'acides nucleiques simple brin par analyse de la denaturation d'une structure secondaire a l'aide d'un colorant d'acide nucleique specifique de double brin
US6852986B1 (en) 1999-11-12 2005-02-08 E. I. Du Pont De Nemours And Company Fluorometer with low heat-generating light source
US7169555B2 (en) 1999-12-23 2007-01-30 Roche Diagnostics Corporation Method for indentifying microbial DNA or RNA
WO2010111509A1 (fr) * 2009-03-25 2010-09-30 Life Technologies Corporation Adn témoin de discrimination positive/d'extraction
WO2011011391A1 (fr) 2009-07-22 2011-01-27 E.I. Dupont De Nemours And Company Séquences et leur utilisation pour la détection et la caractérisation de e. coli o157:h7
WO2011048227A1 (fr) 2009-10-22 2011-04-28 Biotools Biotechnological & Medical Laboratories, S.A. Composition, méthode et trousse pour la détection de bactéries par séquençage
US8271205B2 (en) 2008-09-19 2012-09-18 Corbett Research Pty Ltd Method and system for analysis of melt curves, particularly dsDNA and protein melt curves
WO2013049489A1 (fr) 2011-09-28 2013-04-04 E.I. Dupont De Nemours And Company Séquences et leur utilisation pour la détection et la caractérisation de bactéries stec
EP2695951A1 (fr) * 2009-02-27 2014-02-12 Bio-rad Laboratories, Inc. Détection SNP par regroupement de courbes de fusion
WO2015026757A2 (fr) 2013-08-20 2015-02-26 E. I. Du Pont De Nemours And Company Séquences et leur utilisation pour la détection de salmonella enteritidis et/ou salmonella typhimurium
WO2016011255A1 (fr) * 2014-07-18 2016-01-21 E. I. Du Pont De Nemours And Company Séquences et leur utilisation pour la détection de listeria monocytogenes
US10019550B2 (en) 2012-09-27 2018-07-10 Epistem Limited Optical processing
WO2019152336A1 (fr) * 2018-01-30 2019-08-08 Biofire Defense, Llc Méthodes et systèmes de validation d'un dosage d'amplification d'acide nucléique
WO2020051301A1 (fr) 2018-09-06 2020-03-12 Hygiena, Llc Séquences et leur utilisation pour la détection et la caractérisation de e. coli o157:h7
WO2020205491A1 (fr) 2019-04-05 2020-10-08 Hygiena, Llc Séquences et leur utilisation pour la détection et la caractérisation du genre cronobacter
DE102020207587A1 (de) 2020-06-18 2021-12-23 Robert Bosch Gesellschaft mit beschränkter Haftung Verfahren und Steuergerät zum Auswerten eines Lumineszenzsignals in einem Analysegerät zum Analysieren einer Probe biologischen Materials und Analysegerät zum Analysieren einer Probe biologischen Materials
US11634782B2 (en) 2019-03-20 2023-04-25 Hygiena, Llc Quantification of microorganisms in samples and methods of determining quantification conditions thereof
US11915795B2 (en) * 2016-12-23 2024-02-27 The Regents Of The University Of California Method and device for digital high resolution melt
CN119626522A (zh) * 2025-02-11 2025-03-14 深圳易致生物科技有限公司 智能分子诊断和分析方法、装置、设备及存储介质
US12291745B2 (en) 2019-01-29 2025-05-06 Biofire Defense, Llc Methods and systems for validation of a nucleic acid amplification assay

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008539727A (ja) * 2005-05-03 2008-11-20 ハンディーラブ インコーポレイテッド 凍結乾燥ペレット
KR101095093B1 (ko) 2007-02-02 2011-12-16 디지탈 지노믹스(주) 경쟁적 혼성화 반응을 이용한 가중치 합 연산 방법
JP5866280B2 (ja) * 2009-05-15 2016-02-17 バイオファイアー・ディフェンス・エルエルシー 融解曲線の自動解析システムおよび自動解析法
CN109520564B (zh) * 2018-11-05 2020-10-30 福建石子科技有限公司 一种智能分析模型的监控应用方法

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1990010270A1 (fr) * 1989-02-28 1990-09-07 E.I. Du Pont De Nemours And Company Appareil et procede de regulation d'un traitement utilisant un reseau de traitement forme reparti en parallele
EP0686699A2 (fr) * 1994-06-08 1995-12-13 The Perkin-Elmer Corporation Appareil et procédé pour déterminer la concentration de l'acide nucléique de but dans le PCR
WO1998036096A1 (fr) * 1997-02-14 1998-08-20 E.I. Du Pont De Nemours And Company Detection d'adn en double brin dans une solution homogene
US5871908A (en) * 1992-02-05 1999-02-16 Evotec Biosystems Gmbh Process for the determination of in vitro amplified nucleic acids

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1990010270A1 (fr) * 1989-02-28 1990-09-07 E.I. Du Pont De Nemours And Company Appareil et procede de regulation d'un traitement utilisant un reseau de traitement forme reparti en parallele
US5871908A (en) * 1992-02-05 1999-02-16 Evotec Biosystems Gmbh Process for the determination of in vitro amplified nucleic acids
EP0686699A2 (fr) * 1994-06-08 1995-12-13 The Perkin-Elmer Corporation Appareil et procédé pour déterminer la concentration de l'acide nucléique de but dans le PCR
WO1998036096A1 (fr) * 1997-02-14 1998-08-20 E.I. Du Pont De Nemours And Company Detection d'adn en double brin dans une solution homogene

Cited By (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7109495B2 (en) 1999-11-12 2006-09-19 E.I. Du Pont De Nemours And Company Fluorometer with low heat-generating light source
US7968856B2 (en) 1999-11-12 2011-06-28 E.I. Du Pont De Nemours And Company Fluorometer with low heat-generating light source
US6852986B1 (en) 1999-11-12 2005-02-08 E. I. Du Pont De Nemours And Company Fluorometer with low heat-generating light source
US7291724B2 (en) 1999-12-23 2007-11-06 Roche Diagnostics Corporation Oligonucleotides for rapidly identifying microbial DNA or RNA
US7169555B2 (en) 1999-12-23 2007-01-30 Roche Diagnostics Corporation Method for indentifying microbial DNA or RNA
JP2004537050A (ja) * 2001-07-25 2004-12-09 ポスコ 核酸ハイブリッド化検出方法
EP1499745A2 (fr) * 2002-04-26 2005-01-26 University of Utah Research Foundation Caracterisation d'acides nucleiques simple brin par analyse de la denaturation d'une structure secondaire a l'aide d'un colorant d'acide nucleique specifique de double brin
EP1499745A4 (fr) * 2002-04-26 2006-03-15 Univ Utah Res Found Caracterisation d'acides nucleiques simple brin par analyse de la denaturation d'une structure secondaire a l'aide d'un colorant d'acide nucleique specifique de double brin
US7297484B2 (en) 2002-04-26 2007-11-20 Idaho Technology Characterization of single stranded nucleic acids by melting analysis of secondary structure using double strand-specific nucleic acid dye
US7524632B2 (en) 2002-04-26 2009-04-28 University Of Utah Research Foundation Species identification by melting analysis of secondary structure of single stranded nucleic acids
US8271205B2 (en) 2008-09-19 2012-09-18 Corbett Research Pty Ltd Method and system for analysis of melt curves, particularly dsDNA and protein melt curves
US9864832B2 (en) 2009-02-27 2018-01-09 Bio-Rad Laboratories, Inc. SNP detection by melt curve clustering
EP2695951A1 (fr) * 2009-02-27 2014-02-12 Bio-rad Laboratories, Inc. Détection SNP par regroupement de courbes de fusion
WO2010111509A1 (fr) * 2009-03-25 2010-09-30 Life Technologies Corporation Adn témoin de discrimination positive/d'extraction
US9834815B2 (en) 2009-03-25 2017-12-05 Life Technologies Corporation Discriminatory positive/extraction control DNA
WO2011011391A1 (fr) 2009-07-22 2011-01-27 E.I. Dupont De Nemours And Company Séquences et leur utilisation pour la détection et la caractérisation de e. coli o157:h7
WO2011048227A1 (fr) 2009-10-22 2011-04-28 Biotools Biotechnological & Medical Laboratories, S.A. Composition, méthode et trousse pour la détection de bactéries par séquençage
WO2013049489A1 (fr) 2011-09-28 2013-04-04 E.I. Dupont De Nemours And Company Séquences et leur utilisation pour la détection et la caractérisation de bactéries stec
US9528162B2 (en) 2011-09-28 2016-12-27 E I Du Pont De Nemours And Company Sequences and their use for detection and characterization of STEC bacteria
US8802371B2 (en) 2011-09-28 2014-08-12 E. I. Du Pont De Nemours And Company Sequences and their use for detection and characterization of STEC bacteria
US10347363B2 (en) 2012-09-27 2019-07-09 Genedrive Diagnostics Ltd Optical processing
EP2901137B1 (fr) * 2012-09-27 2019-03-06 Epistem Limited Procédé, dispositif et support de données physiques pour traiter des données de courbe de fluorescence de dissociation et / ou d'association
US10019550B2 (en) 2012-09-27 2018-07-10 Epistem Limited Optical processing
WO2015026757A2 (fr) 2013-08-20 2015-02-26 E. I. Du Pont De Nemours And Company Séquences et leur utilisation pour la détection de salmonella enteritidis et/ou salmonella typhimurium
WO2016011255A1 (fr) * 2014-07-18 2016-01-21 E. I. Du Pont De Nemours And Company Séquences et leur utilisation pour la détection de listeria monocytogenes
AU2015289612B2 (en) * 2014-07-18 2021-03-25 Qualicon Diagnostics, Llc Sequences and their use for detection of Listeria monocytogenes
CN106661626A (zh) * 2014-07-18 2017-05-10 纳幕尔杜邦公司 用于单核细胞增多性李斯特菌检测的序列和它们的用法
US11915795B2 (en) * 2016-12-23 2024-02-27 The Regents Of The University Of California Method and device for digital high resolution melt
EP3746565A4 (fr) * 2018-01-30 2021-11-03 BioFire Defense, LLC Méthodes et systèmes de validation d'un dosage d'amplification d'acide nucléique
CN112105747A (zh) * 2018-01-30 2020-12-18 拜奥法尔防护有限责任公司 用于验证核酸扩增测定的方法和系统
WO2019152336A1 (fr) * 2018-01-30 2019-08-08 Biofire Defense, Llc Méthodes et systèmes de validation d'un dosage d'amplification d'acide nucléique
WO2020051301A1 (fr) 2018-09-06 2020-03-12 Hygiena, Llc Séquences et leur utilisation pour la détection et la caractérisation de e. coli o157:h7
US12291745B2 (en) 2019-01-29 2025-05-06 Biofire Defense, Llc Methods and systems for validation of a nucleic acid amplification assay
US11634782B2 (en) 2019-03-20 2023-04-25 Hygiena, Llc Quantification of microorganisms in samples and methods of determining quantification conditions thereof
WO2020205491A1 (fr) 2019-04-05 2020-10-08 Hygiena, Llc Séquences et leur utilisation pour la détection et la caractérisation du genre cronobacter
DE102020207587A1 (de) 2020-06-18 2021-12-23 Robert Bosch Gesellschaft mit beschränkter Haftung Verfahren und Steuergerät zum Auswerten eines Lumineszenzsignals in einem Analysegerät zum Analysieren einer Probe biologischen Materials und Analysegerät zum Analysieren einer Probe biologischen Materials
CN119626522A (zh) * 2025-02-11 2025-03-14 深圳易致生物科技有限公司 智能分子诊断和分析方法、装置、设备及存储介质
CN119626522B (zh) * 2025-02-11 2025-05-09 深圳易致生物科技有限公司 智能分子诊断和分析方法、装置、设备及存储介质

Also Published As

Publication number Publication date
EP1185697A2 (fr) 2002-03-13
AU4975300A (en) 2000-11-17
WO2000066777A3 (fr) 2001-08-02
JP2002543408A (ja) 2002-12-17

Similar Documents

Publication Publication Date Title
WO2000066777A2 (fr) Analyse de la courbe de denaturation thermique de l'adn
Malorny et al. Standardization of diagnostic PCR for the detection of foodborne pathogens
Gurbanov et al. Rapid classification of heavy metal-exposed freshwater bacteria by infrared spectroscopy coupled with chemometrics using supervised method
Leriche et al. Alteration of raw-milk cheese by Pseudomonas spp.: monitoring the sources of contamination using fluorescence spectroscopy and metabolic profiling
Nilsson et al. A RAPD-PCR method for large-scale typing of Bacillus cereus
CN101556242A (zh) 用傅立叶红外光谱鉴别微生物的方法
Giacometti et al. Application of MALDI-TOF MS for the subtyping of Arcobacter butzleri strains and comparison with their MLST and PFGE types
Guibet et al. Discrimination and classification of Enterococci by Fourier transform infrared (FT-IR) spectroscopy
Spitaels et al. MALDI-TOF MS as a novel tool for dereplication and characterization of microbiota in bacterial diversity studies
WO2020170051A1 (fr) Quantification par apprentissage machine d'organismes cibles à l'aide de dosages d'amplification d'acide nucléique
Osek et al. Listeria monocytogenes in foods—From culture identification to whole‐genome characteristics
EP1335028B1 (fr) Analyse d'essais multiples d'amplifications d'acides nucléiques en temps réel
Yu et al. Spectroscopic differentiation and quantification of microorganisms in apple juice
EP2419846B1 (fr) Procédés pour une quantification d'acides nucléiques
CN116179664A (zh) 基于内参确定微生物的高通量检测方法和系统及试剂盒
US20220220547A1 (en) System and method for detecting inhibition of a biological assay
EP3969993A1 (fr) Systèmes et procédés d'évaluation de bien-être de répertoire immunologique
Griffin et al. Towards the development of a sterile model cheese for assessing the potential of hyperspectral imaging as a non-destructive fungal detection method
US20210005286A1 (en) Methods for the analysis of dissociation melt curve data
Kim et al. Integration of MALDI-TOF MS and machine learning to classify enterococci: A comparative analysis of supervised learning algorithms for species prediction
US20030180716A1 (en) Microbe indentifying method, microbe identifying apparatus, method for creating database for microbe identification, microbe identifying program, and recording medium on which the same is recorded
JP4020640B2 (ja) 微生物識別方法および微生物識別装置
Al-Mariri et al. Identification of Listeria Species by Fourier-transform Infrared Spectroscopy
US20030175687A1 (en) Methods for the detection and identification of microorganisms
Dziuba Identification of selected Leuconostoc species with the use of FTIR spectroscopy and artificial neural networks

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG UZ VN YU ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG UZ VN YU ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

ENP Entry into the national phase in:

Ref country code: JP

Ref document number: 2000 615399

Kind code of ref document: A

Format of ref document f/p: F

WWE Wipo information: entry into national phase

Ref document number: 2000931949

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWP Wipo information: published in national office

Ref document number: 2000931949

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 2000931949

Country of ref document: EP

点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载