Title: Apparatus, Process and Methods for Use With
Quantitative PCR
Field of the Invention
The invention relates to the field of nucleic acid quantification, particularly as it relates to the polymerase chain reaction (11PCR").
Background to the Invention
Quantitative kinetic PCR
The polymerase chain reaction (PCR) is a reiterative process in which as little as a single target DNA molecule can be amplified to quantifiable amounts. This is conventionally accomplished using an enzyme-based replication mechanism, in which reaction temperature is repetitively cycled (called thermocycling) for a sufficient number of times to amplify the target to saturating amounts (generally 50 thermocycles). The introduction of DNA fluorescence for monitoring PCR amplification in real time, led to a major advancement for quantitative PCR. Although several variants in fluorescence chemistry have been developed, they all share the same key property; that is fluorescence intensity is related to the mass of amplified DNA. Fluorescent detection of amplified DNA not only allowed the kinetics of PCR amplification to be determined with high resolution, it also allowed automation via software data analysis.
In addition to indicating whether a specific DNA target is present within a sample (e.g. the presence of a specific pathogen such as HIV), real time PCR also potentially allows the number of target DNA molecules to be quantified with accuracy and sensitivity (down to a single target molecule), and with an enormous dynamic range (1-107 target molecules). PCR is the basis for a number of new biomedical diagnostic applications, in addition to quantitative applications for molecular genetics and genomics research (e.g. gene expression analysis, pathogen detection, genetic analysis, among many others).
Despite the nearly eight years since the commercial introduction of fluorescence-based quantitative PCR, several caveats still remain. In simple
terms not only is the accuracy and reliability unsatisfactory, but also the extent and source(s) of these deficiencies are largely unrecognized.
To date, all known real time PCR quantification methodologies are based upon the principle of defining a common point within the amplification curves, that allow amplification reactions to be compared to each other.
Generally referred to as the "threshold method", the inventors of this approach (Higuchi et al. 1993; EP0640828) defined this point by selecting a threshold fluorescence, from which a fractional threshold cycle (Ct; also called the crossing point or CP) is derived for each amplification reaction. This approach is illustrated by Rutledge and Cote (2003), which describes the underlying mathematics and the application of standard curves for converting Ct values into the number of target molecules.
The threshold approach has historically dominated the field of quantitative PCR. Although several variants have appeared which attempt to automate the selection of the fluorescence threshold, e.g. based upon maximum or minimum values of the first and/or second derivatives of the reaction fluorescence curve (Wittwer, US 6.303.305 EM ; McMillan, US 6,713,297 B2), none of these modify how target quantification is accomplished. The threshold approach as currently practiced has several weaknesses, directly relating to two aspects encompassing amplification efficiency. First is that amplification efficiency must be known in order to achieve target quantification. Amplification efficiencies in turn can only be determined accurately via construction of standard curves (US 2002/0058262 A1 , US 6,691 ,041 B2), which are difficult to construction, prone to errors and are impractical for high-throughput applications. Second, it must be assumed that amplification efficiencies of samples are identical to that of DNA standards used in the construction of the standard curves (US 6,691 ,041 B2), an assumption that has been found to be invalid in many cases. Liu and Saint (2002) provide the only known exception to the threshold method for quantitative application of real time PCR. They demonstrated that curve fitting of fluorescence readings to the classic Boltzmann sigmoid function (a process commonly used for nonlinear regression analysis of
sigmoidal datasets (Motulsky and Christopoulos 2003), allowed target quantity to be determined from a single amplification reaction, and equally important, without the need to know amplification efficiency. The application of sigmoidal mathematics furthermore illustrates that, contrary to that demanded by the exponential mathematics upon which the threshold methods are based, amplification efficiency is never constant during PCR amplification, but rather each thermocycle produces progressively lower amplification efficiency, a process that eventually leads to cessation of amplification.
Notwithstanding these revelations, several deficiencies of the Liu and Saint approach have prevented its implementation and have led to a general disregard of this approach. In addition to the complexities of curve fitting that compromises its practicality, a significant weakness of the Liu and Saint ' approach is its inability to convert fluorescence units into the number of target molecules. Current research and diagnostic tools Involving quantitative real time
PCR are not fully satisfactory. The "threshold-based method" that has dominated quantitative application of real time PCR to date can give rise to difficulties in implementation, insufficient reliability, and difficulties with quality control. In particular, the necessary assumption that amplification efficiency of samples is equal to that of the DNA standards used for absolute quantification has been shown to be invalid in many applications, resulting in large and unpredictable errors. Occurrences of such errors are of paramount importance for many diagnostic applications that require a high level of quantitative accuracy and reliability, such as for determination of viral load or residual disease. The innate limitations of the threshold-based methods have thus precluded broad implementation of quantitative kinetic PCR in many types of biomedical and diagnostic applications.
Summary of the Invention There is disclosed herein an apparatus process, and method that permit PCR quantification of unique target polynucleic acid sequences, including DNA and RNA sequences (called the "target") contained within a sample. This is accomplished by linear regression analysis of data generated
during PCR amplification (using standard, commercially available instαi mentation and enzyme kit), from which values for two parameters are derived. These are then inputted into a novel mathematical function from which target quantity is calculated. The final step is conversion of quantitative units to the number of target molecules, based upon correlation of reaction fluorescence to DNA mass, a process called herein "optical calibration".
The present invention greatly simplifies target quantification by circumventing the need for curve fitting, using instead an easy to implement approach based upon linear regression analysis. Combined with methods for establishing and monitoring the accuracy of quantitative scale via optical calibration, the present invention not only permits conversion of fluorescence units Into the number of target molecules, but as well provides effective quality control and assurance over quantitative scale. This enables unprecedented capabilities for verification of quantitative accuracy, and for development of generally applicable quantitative standards for calibrating real time PCR thermocyclers.
Brief Description of the Figures
FIGURE 1 A is a graphical depiction of an embodiment of sigmoidal functions describing amplicon accumulation and cycle efficiency.
FIGURE 1 B is a graphical depiction of an example of cycle efficiency plotted against reaction florescence. FIGURE 2A is a graphical depiction of an example of loss in amplicon efficiency in relation to DNA accumulation.
FIGURE 2B is a graphical depiction of an example of fluorescence readings
(actual and predicted) in relation to cycle number.
FIGURE 2C is a graphical depiction of an example of ratio-derived amplification efficiency plotted against cycle number.
FIGURE 3A is a graphical depiction of an example of amplification profiles of reaction fluorescence vs. cycle number with varying reaction volume.
FIGURE 3B is a graphical depiction of an example of ratio-derived cycle efficiency vs. reaction fluorescence with varying reaction volume.
FIGURE 4A is a graphical depiction of an example of amplification profiles of reaction fluorescence vs. cycle number with varying annealing and elongation times.
FIGURE 4B is a graphical depiction of an example of ratio-derived cycle efficiency vs. reaction fluorescence with varying annealing and elongation times.
FIGURE 5 is a graphical depiction of an example of drifting plateau produced by low base amplification efficiencies.
FIGURE 6 is a graphical depiction of an example of the relationship between fluorescence and DNA mass.
FIGURE 7 is a graphical depiction of mathematical modeling of PCR amplification. FIGURE 8 is a depiction of a screen shot of an MS Excel ™ worksheet used in an example of LRE quantification.
Detailed Description of the Preferred Embodiments
Linear Regression of Efficiency (LRE)
It is disclosed herein that a linear relationship exists between reaction fluorescence and amplification efficiency (Figures 1 and 2). This relationship permits determination of two fundamental kinetic parameters of PCR amplification via linear regression, that in combination with a novel sigmoid function, enables target quantification.
Also disclosed herein is the extension of this quantitative capability to calibration-of-scale by correlating reaction fluorescence to DNA mass that in turn permits the number of target molecules to be calculated. Based upon optical calibration, this aspect of the invention also permits effective monitoring accuracy-of-scale, in addition to greatly facilitating high-throughput applications, in part by circumventing the need to prepare quantitative standards for each individual target, as is required using current threshold- based methodologies.
PCR quantification via LRE is an eight-step process, although certain steps may be useful alone or in sub-combinations. For LRE, the process can be conducted in eight steps, for example:
1. A typical real time PCR amplification is conducted and fluorescence is assayed at a selected temperature such that reaction fluorescence is directly proportional to the amount of DNA present in the reaction; reaction fluorescence is measured following each thermocycle, producing an profile of the DNA amplification process (e.g. Figure 2B)
2. Raw reaction fluorescence readings (Fc) are exported for analysis (e.g. into a spreadsheet)
3. Background fluorescence is subtracted from all reaction fluorescence readings
4. Amplification efficiency is calculated for each thermocycle from the ratio of reaction fluorescence (FQ) to the reaction
fluorescence from the previous thermocyde (Fc-i), generating what is called the ratio-based estimate of amplification efficiency (ER):
5. ER is plotted against reaction fluorescence (Fc). producing a line from which the Y intercept and slope are determined using a standard linear regression analysis (Figure 2). These represent two key kinetic parameters that define the shape of the amplification curve, called maximal amplification efficiency (Emax) and rate-of-decay of amplification efficiency (ΔE) as described by:
The maximal fluorescence (Fmaχ) produced by the amplification is in turn determined by the ratio of Emaχ to its rate-of-decay (ΔE):
Equation 2 can also be used to predict the efficiency of each cycle (Ec) based upon its corresponding fluorescence (Fc), which enables quality assessment of the fluorescence dataset based upon the level of correlation between ER with Ec:
6. Erøx and Fmax are placed into a novel mathematical function (equation 5) from which target quantity is calculated using cycle number (C) and the corresponding reaction fluorescence reading (Fc). This produces a target quantity in the fluorescence i units (F0), calculated for each thermocycle (within the amplification curve), from which an average and standard deviation of target quantity is calculated:
A related sigmoid function can be used to predict reaction fluorescence (FP) once Fo is calculated. This enables assessment of the quality of the fluorescence dataset based upon the level of correlation between Fp and Fc:
7. Target quantity is converted into DNA mass (Mo) by multiplication with a calibration factor (CF) that relates reaction fluorescence units (FU) to DNA mass (nanogram) and is expressed in units of nanograms/FU.
Mo = CFxFo Equation 7
8. DNA mass of the target quantity is converted into number of DNA molecules based upon the size of the amplified region (amplicon size; As) where No is the number of target molecules:
N0 = (Mox9.1x1011)/As Equation 8
Attributes of LRE quantification
Many capabilities provided by LRE in certain embodiments are not provided by threshold-based methods, such as:
1. Target quantification that does not require construction of standard curves; threshold-based quantification cannot be accurately accomplished without standard curves
2. Variation in maximal amplification efficiency (Emax) within samples does not greatly impact target quantification accuracy; using threshold-based quantification even small differences in maximal amplification efficiency from that of the standard curve can produce large errors
3. Amplification efficacy can be assessed for individual amplification reactions via maximal amplification efficiency; this enables presence of PCR inhibitors or ineffective reaction preparation to be assessed for each individual amplification reaction
4. Target quantification can be accomplished from data obtained from a single amplification reaction; threshold-based quantification can only be accomplished by comparison to other samples, with absolute quantification (i.e. the number of target molecules) requiring PCR amplification of DNA standards in which the quantity of target is known 5. Facilitates development of high-throughput applications by abrogating the need for preparing quantified standards for each
target combined with effective assessment of accuracy-of-scale through optical calibration
6. Rapid optimization of reaction conditions is greatly facilitated via accurate but simple to conduct assessment of amplification efficacy within individual amplification reactions
7. Easy to implement, requiring little or no setup; even the most basic application of a threshold method requires extensive setup and testing
Establishing Quantitative Scale via Optical Calibration The accuracy of quantitative determinations using real time PCR is directly dependent upon the accuracy of quantitative scale. Historically, quantitative scale has been acquired from PCR-generated standard curves constructed for each target, a process prone to errors including ineffective quantification of the DNA standard (Rasmussen 2002, Bu St in 2002, Rutledge and Cδte 2003). This situation that is further exacerbated by the requirement that a DNA standard be prepared for each target. Accuracy-of-scale is paramount to the success of most quantitative applications of real time PCR, particularly for biomedical applications (e.g. measuring viral load and residual disease). An alternative approach encompasses a process called "optical calibration" that exploits the discovery that quantitative scale is directly linked to am pi icon fluorescence through its relationship to DNA mass, such that target quantification can be achieved by correlating reaction fluorescence to the DNA mass present in the reaction mixture. Optical calibration is important in some applications due to the uncalibrated nature of fluorescence units produced by real time PCR1 such that fluorescence units generated by each individual thermocycler is unique to both the optical characteristics of its instrumentation and the optical characteristics of the reaction mixture. Thus it is generally important to derive a conversion factor for each thermocycler and reaction configuration. DNA mass can be accurately determined based upon the reaction fluorescence as disclosed herein. This in turn enables the
number of target molecules to be calculated, without having to resort to a PCR-generated standard curve.
Founded upon this principle, absolute quantification of multiple targets can be effectively achieved using a single, preestablishβd quantitative scale, circumventing the need to prepare quantified DNA standards for each target, as well as enabling an unprecedented level of quantitative accuracy to be established.
In addition to facilitating development of large-scale applications such as high throughput analyses for clinical diagnostics and genomics research, optical calibration as described herein enables a higher level of quality control and assurance of quantitative scale, than attainable using current methodologies. Furthermore, it enables development of universal standards that could facilitate development of worldwide standards for establishment and verification of quantitative scale, a deficiency that currently plagues quantitative PCR (Bustin 2002).
LRE quantification generates target quantity in fluorescence units (Equation 5), which although useful for comparing the relative quantities of target in different samples, is in some applications less useful than quantification in units of target molecules. This is similar to the deficiency of quantification via curve-fitting as described by Liu and Saint (2002) that also produces target quantity in fluorescence units. Conversion of fluorescence units into the number of target molecules substantially Increases the utility of the LRE-based quantification, particularly for biomedical diagnostics in which accurate, verifiable quantification of the number of target molecules is paramount (e.g. for viral load and residual disease).
Optical calibration is a multi-faceted process in which fluorescence units are correlated to DNA mass through derivation of a calibration factor (CF). In addition to conversion of F0 into the number of target molecules (No), optical calibration enables assessment of optical precision and accuracy of real-time instruments in an effective, accurate and timely manor. Furthermore, verification of accuracy-of-scale can be accomplished by comparing optical calibrations obtained by two independent methods, a capability lacking under currently applied methodologies.
The nucleic acid standard
Optical calibration requires a nucleic acid standard that preferably has the following characteristics:
1. Known size, preferably accurate to the base pair
2. Complete sequence available
3. Easily obtained in highly pure form and lacking contaminants that could act as an inhibitor of PCR; preferably available commercially
In some instances the nucleic acid will be DNA. In some cases it will be
RNA. In some instances, the nucleic acid standard is preferably DNA when the target is DNA and RNA when the target is RNA.
Commercially available DNA from the bacteria phage lambda is an effective standard for optical calibration in many cases. A quantified stock of the nucleic acid standard is prepared, from which diluted samples are made.
Two general methods for optical calibration
Optical calibration encompasses two distinct methods that are linked through the use of the same quantified stock of the nucleic acid standard. The first method uses PCR amplification of a target(s) within the nucleic acid standard, using a diluted sample containing an appropriate amount of the nucleic acid standard. The nucleic acid mass of the target (M0) is determined by first calculating the number of nucleic acid molecules of the standard contained in the sample (No):
N0 = (SMx9.1x1011ySs Equation 9
Where SM is the mass of the nucleic acid standard in nanograms and Ss is the molecular size in base pairs (where the standard is DNA) of the nucleic acid standard. The nucleic acid mass of the target quantity (M0) is then calculated:
M0 = (N0XAs)/ 9.1x1011 Equation 10
Where As is the size of the target amplicon in base pairs. Equations 9 and 10 can be combined and simplified:
M0 = (SMxAs)/Ss Equation 11
Following PCR amplification of the target, the LRE process is used to determine the target quantity in fluorescence units (Fo) and the conversion factor (CF0) determined :
CF0 = Mo/Fo Equation 12
Where CFo is the number nanograms per fluorescence unit (ng/FU) based upon real time PCR quantification of the nucleic acid standard.
The second method is called "direct calibration" and does not involve PCR amplification. Direct calibration can be conducted by preparing mock PCR reaction mixes identical to that used for PCR amplification, except that they contain the nucleic acid standard in an amount(s) similar to that generated within a typical amplification profile (approximately 10-50 ng for a 25 μl reaction volume). Reaction mixes lacking the nucleic acid standard are also prepared for determining background fluorescence (Fb). Note also that is ) not necessary to use the same standard as that used for PCR-baεed calibration, such that any source of nucleic acid can be used assuming that it is accurately quantified.
The mock reaction mixes are placed into the real time thermocycler and the block temperate increased to the same temperature used to read reaction fluorescence during PCR amplification. This is necessary because reaction fluorescence is temperature dependent. Once that the selected temperature is reached, fluorescence reading(s) are taken (Fd). The conversion factor based upon direct calibration (CFd) can then be calculated using the equation:
CFd = SM/(Fd-Fb) Equation 13
Where CFd is the "direct" calibration factor in units of ng/FU, SM is the mass of the nucleic acid standard in nanograms, Fo is the fluorescence of the mock PCR reaction at the read temperature and Fb is the background fluorescence at the read temperature produced by a mock PCR reaction containing no DNA standard. It should be noted that a primary assumption upon which this approach is based, is that fluorescence readings produced by the real time thermocycle (i.e. the instrument optics) have a linear relationship to DNA mass. This is not to say however, that alternative analyses could be used to compensate for such a lack of linearity, and thus a lack of linearity between reaction fluorescence and DNA mass does not necessarily invalidate this process.
Evaluating quantitative accuracy via optical precision
In addition to enabling the conversion of Fo into No1 optical calibration can also be used to evaluate optical precision, that in turn has a direct impact on quantitative accuracy. The most fundamental approach is assessment of the correlation between CFd and CF0; that is quantitative calibrations derived by two independent methods who's level of correlation provides an indication of the precision of the optical calibration process. Additionally, the level of optical stability over time can be evaluated through the correlation between multiple applications of the optical calibration process, conducted over time. This in turn provides an indication of how frequently a real time thermocycler needs be recalibrated in order to maintain a specified tolerance for accuracy- of-scale, and thus ultimately quantitative accuracy.
Assessment of quantitative accuracy can be extended further by evaluating optical precision produced by individual wells (both intra- and inter- run variation in absolute units of fluorescence generated by the instrument), in addition to assessing the correlation of the absolute units of fluorescence generated from each well (inter-well variation in fluorescence units). This is based upon the principle that the absolute value of the fluorescence readings
directly determines the quantitative value of the target. That is, fluorescence is the sole quantitative unit generated during PCR amplification, such that aberrancies in determination of reaction fluorescence impacts the resulting quantitative determination. Thus the overall quantitative accuracy produced by a real time thermocycler machine is directly dependent on the precision, stability, and accuracy of its optics.
Attributes of optical calibration
In some embodiments, optical calibration allows one or more specific advantages, such as: 1. Direct calibration enables optical calibration without PCR amplification; effective, easy to execute, and reliable 2. Enables verification of optical calibration by assessing the correlation between amplification of a quantified standard (CFo) with that of direct calibration (CFd) 3. Provides a single quantitative scale for all targets; prevailing threshold-based methods requires preparation of quantified standard for each Individual target; this process eliminates this limitation through the application of a single DNA standard, greatly facilitating high throughput applications 4. Use of an easily accessible DNA standard facilitates standardization of optical calibration and thus for establishment of quantitative scale, a process that is also platform independent (i.e. not unique to any individual real time thermocycler machine, model or manufacture) 5. Direct calibration can be verified via application of multiple DNA sources; that Is direct calibration is not limited to a single source of
DNA standard 6. Verification of PCR-based optical calibration through amplification of multiple targets within the DNA standard; that is to conduct optical calibrations via amplification of multiple targets using the same DNA standard preparation
7. Verification of PCR-based calibration via direct calibration using the same DNA standard preparation; this provides a direct link between the two calibrations methods providing further verification of the accuracy of quantitative scale 8. Enables standardization by adoption of specified DNA sources for preparation of standards for optical calibration, based upon several desirable attributes such as accessibility, purity, known size and sequence
In an embodiment of the invention there is provided a method for identifying a patient having a condition of interest involving or identifiable by the abundance of a least one selected nucleic acid sequence, said method comprising:
a) identifying a target nucleic acid sequence of interest; b) amplifying the target sequence using fluorescence-based quantitative Polymerase Chain Reaction; c) determining the reaction fluorescence for each cycle d) determining the ratio-based estimate of efficiency (ER), for each thermocycle wherein
"Fc" is reaction fluorescence following a selected thermocycle and "Fc-i" is reaction fluorescence following the previous thermocycle; e) plotting ER against F0 to produce a line as described by:
ER = ΔE χ Fc +Em∞
f) determine the Y intercept and slope of the line from step e, where in the intercept is the maximal amplification efficiency
(Em*χ) and the slope is the rate-of-decay of the base amplification efficiency (ΔE); g) determining Fmax wherein
h) generating a determination of target quantity (Fo) in fluorescence units (FU) for a given cycle (C), wherein
i) determining the target quantity in polynucleic acid mass (Mo) using a conversion factor (CF), wherein
where CF is determined using a process called optical calibration, in which reaction fluorescence is correlated to polynucleic acid mass j) calculating the number of polynucleic acid molecules based upon polynucleic acid mass of the target (Mo) based upon the size of the amplified region (A
s), wherein
k) comparing the number of polynucleic acid molecules calculated above to a predetermined number or range of polynuleic acid molecules associated with a condition of interest.
In some instances it is useful to have a process for quantifying amplification efficiency for a nucleic acid. For instance, one may wish to optimize reaction conditions for a specific nucleic acid, and a ready measure of efficiency can assist in this.
In some instances it is useful to have a process for determining the extreme limit of a signal used in reporting nucleic acid incorporation during or as a result of the polymerase chain reaction. Knowledge of such a limit can be used in selecting a signaling/reporting system and well as a detection system.
In some instances it is useful to have a process for determining the rate of decay of amplification efficiency relating to the amplification of a nucleic acid in a sample. Knowledge of the decay rate of efficiency can be used in determining reaction conditions and duration.
In some instances it is useful to have a process for determining the quantity of the predict of amplification of a nucleic acid sequence after a selected number of thermo cycles in the polymerase chain reaction. Quantity may be determined based on signal output, or may be converted into actual copy number for the amplified sequence. Nucleic acid quantity can be used to assess the abundance of a nucleic acid sequence of interest from a sample, for instance to determine if it exceeds a threshold level which is considered "safe" or "normal" for the healthy population.
In some instances it is useful to have a machine readable media such as a read-only memory, a random access memory, an optical or magnetic disk or tape, or other device for storage of information, containing instructions for carrying out one or more of the processes listed above. Such media can be used together with a polymerase chain reaction apparatus to assess PCR results on an ongoing basis, or may be used on a separate processor to examine cycle-specific signal data obtained from a PCR apparatus.
It will be understood that the "signal" to be assessed relating to nucleic acid incorporation may be of any suitable type. At present, fluorescence is a popular signal method used to track PCR reactions. However, In the past radio-labeled nucleotide incorporation was used, and such methods may still be preferred for particular applications. Additionally other methods will be useful for certain situations. So long as a method provides an assayable signal, it is contemplated by this invention. While the examples relate to signals which increase with the level of nucleotide incorporation, it will be understood that methods may also arise where signal decreases with the level of nucleotide incorporation (for example due to quenching up on incorporation). It will be apparent to one skilled in the art, in light of the disclosure herein, to apply the methods and processes disclosed herein to such signal systems as well.
EXAMPLES
Detailed information relating to the Figures
Figure 1 depicts sigmoidal modeling of the polymerase chain reaction reveals symmetry between amplicon DNA accumulation and loss in amplification rate. (A) Plots of the sigmoid functions describing amplicon accumulation (equation 1; solid line) and cycle efficiency (equation 2; dashed line) illustrate a symmetrical relationship, where Cy2 defines the fractional cycle when both reach precisely half of their respective maxima. F0 is target quantity expressed in fluorescence units, EmaX is the maximal amplification efficiency and Fmaχ is the maximal reaction fluorescence. (B) Plotting cycle efficiency against reaction fluorescence produces a line, indicating that the progressive loss in cycle efficiency is directly related to amplicon accumulation. This generates a linear representation of PCR amplification, where the Y intercept defines the maximal amplification efficiency (Emax) when Fc=O and the X intercept defines the end-point or plateau phase (Fmaχ) when Ec=O, with the slope defining the rate-of-loss in cycle efficiency (ΔE) that occurs as amplification progresses.
Figure 2 Fundamentals of linear regression of efficiency (LRE). (A) Loss in amplification efficiency is directly related to amplicon DNA accumulation. Ratio-derived amplification efficiency (ER, equation 1 ) is plotted against reaction fluorescence (Fc), demonstrating a linear relationship between amplicon DNA mass and cycle efficiency. The intercepts define the starting or "maximal" efficiency (Emax; Fc=0) and the end-point or plateau phase (Fmax; ER=O), with the slope defining the rate-of-decay in cycle efficiency (ΔE). This in turn allows two of three kinetic parameters to be determined using linear regression analysis. (B) Actual fluorescence readings (dots) plotted with predicted reaction fluorescence (FP) calculated using equation 6 (circles) with Emax and ΔE values derived from (A). Arrows indicate the cycles that were included in the linear regression analysis shown in (A). (C) Ratio-derived amplification efficiency, ER (dots; equation 1) plotted with Ec predicted by equation 4 (circles) using Emax derived from (A). The sporadic fluctuation in ER produced by cycles 10-14 reflect high error-of-measurement produced by low
reaction fluorescence. Arrows indicate the cycles that where included in the linear regression analysis conducted in (A).
Figure 3 Reaction volume is a determinant of Fmax. (A) Amplification profiles of reactions containing an identical quantity of target but of increasingly greater volume (20 μl (•), 30 μl (T), 40 μl (♦), 50 μl (A) and 60 μl (■)). (B) Ratio-derived estimates of cycle efficiency (ECl equation 3) were determined from the fluorescence readings generated by each amplification reaction and plotted against the respective reaction fluorescence (Fc), producing a linear representation of each of the amplification profiles shown in (A) (Fig. 1B). This demonstrates that increasing reaction volume reduces the rate-of-loss in cycle efficiency (slope, ΔE) but has no impact on the maximal amplification efficiency (Y intercept, Emax). producing a proportional increase in maximal reaction fluorescence (X intercept, FmaX) consistent with that predicted by equation 5. Table III below relates to Figure 3 and provides a summary of the linear regression analysis (LRE analysis, equation 4) of the plots shown in Figure 3 (B), which reveals that amplicoπ concentration at Fmax (FU/μl) is only modestly impacted by the three fold increase in reaction volume. Vol., reaction volume; r2, linear correlation, coefficient; Fmax FU/μl, florescence units per μl of reaction at Fmax.
Vol. r max
CuI) r2 Emax ΔE ■"max FU/μl
20 0.9984 92.2% -1 .18x10 4 7,81 0 391
30 0.9995 93.5% -8.19x10 5 1 1 ,4OO 380
40 0.9996 93.1 % -6.47x10"6 14,4OO 360
50 0.Θ994 91 .8% -5.09x10 5 18,000 360
6O 0.9996 93.4% -4.70X10 6 19,900 332
Table III relating to Figure 3.
Figure 4 Time of annealing and elongation is a determinant of maximal amplification efficiency. (A) Amplification profiles of reactions containing identical quantity of target but with different annealing and elongation (A&E)
times (4 min (T)1 2 miπ (A)1 1 min (■) and 0.5 miπ (•)). This reveals that both maximal reaction fluorescence and slope of the amplification profile decrease with decreasing A&E time, along with a progressive shift in profile position consistent with a reduced maximal amplification efficiency. (B) LRE plots show that reduced time of A&E produces a decreased maximal amplification efficiency (Y intercept, Emax), but has minimal, if any impact on the rate-of-loss in cycle efficiency (slope, ΔE). Table IV relates to figure 4 and provides a summary of the linear regression analysis (LRE analysis; equation 4) of the plots shown in (B). Vol., reaction volume; r2, linear correlation coefficient.
A&E
(min) r2 tmax ΔE Fmax
.4.0 0.9996 94.7% -8.74x10-5 10,800
2.0 0.9988 90.1% -9.63x10* 9,360
1.0 0.9981 81.0% -1.04x10"* 7,790
0.5 0.9892 65.7% -9.60x10"5 6,840
Table IV relating to Figure 4,
Figure 5 Drifting plateau produced by low base amplification efficiencies.
Actual (Fc) compared to predicted (Fp) reaction fluorescence of the amplifications presented in Figure 4A reveal a trend for amplification to continue within the plateau region, that becomes more pronounced as base amplification efficiency is reduced. Drift in the plateau region has a major impact on curving fitting to a sigmoid function whereas LRE is unaffected (Figure 4).
Figure 6 Optical calibration via two distinct processes. A quantified, lambda DNA stock solution was employed as a standard for optical calibration using both direct (A and B) and PCR-based (C) determination of the calibration factor (CFd and CFo respectively) used for converting fluorescence units into DNA mass. A linear relationship of DNA mass with fluorescence is found over the range of 15-50 πg of lambda DNA when placed Into mock PCR reactions
(25 μl containing all of the components in a standard amplification reaction). The tubes were placed into the thermocycler, heated to the read temperature used for real time PCR (68 0C) and multiple optical reads taken. Following subtraction of the fluorescence produced in the absence of lambda DNA (i.e. 0 ng), the average of the fluorescence readings were plotted against the corresponding mass of lambda DNA, confirming a linear relationship of DNA mass with reaction fluorescence (^=0.995). (Table V below) A summary of the data shown in (A) in which CFd (direct calibration factor; CFd = DNA mass/fluor.) is calculated for each lambda DNA mass, from which an average is determined. (Table Vl below) PCR-based optical calibration in which three targets were amplified from the same stock of lambda DNA standard used for direct calibration, with each amplification reaction containing 500 fg of this lambda DNA preparation (SM). The predicted target quantity in DNA mass (M0; nanograms) was calculated using equation 11 based upon the known size of the lambda DNA molecule (48,502 bp; S5) and the amplicon size of each target (As). Following four replicate amplification for each target was conducted using the same thermocycling conditions and read temperature (68 0C), LRE was used to determine Fo (target quantity in fluorescence units) from which a calibration factor (CF0) was calculated for each target and an average taken. Both methods produced an average CF that agree within ~4%, providing verification of accuracy, at least based upon a shared DNA standard. Fluor., fluorescence.
Tables V and Vl relating to Figure 6.
Figure 7 Mathematical modeling of PCR amplification and assessment of the quantitative precision provided by LRE analysis. (A) Replicate amplifications were conducted on ten fold dilutions of lambda genomic DNA and LRE quantification conducted on each amplification profile. Actual (•) and predicted (o, equation 7) reaction fluorescence for each target quantity are plotted against cycle number. The table below depicts a summary of the LRE analysis conducted on the fluorescence profiles presented in Figure 7A, along with the calculated target quantity (F0, equation 6. Table II). R2, nonlinear correlation coefficient (see Materials and Methods and related Figures)
Table VII related to Figure 7.
Figure 8 Screen shot of the MS Excel worksheet used for the LRE quantification conducted on the 18,800 molecule (1 pg) lambda genome amplification presented in Fig. 7. Graphs of the LRE analysis and comparison of actual to predicted fluorescence profiles are presented, with arrows denoting the range of cycles included in the analysis. Following background subtraction, reaction fluorescence readings from four replicate reactions were averaged (F0), imported into MS Excel, and ratio-derived estimate of cycle efficiency (Ec) calculated using equation 1. Linear regression analysis was then conducted on the seven cycles within the central region of the amplification profile (cycles 20-26), from which values for EmaX (intercept) and ΔE (slope) were determined (equation 2), and Fmaχ calculated using equation 3. These values wήrAthfinJopiittfirUnto. ejquatjnnJ>.and a. F0.estimate calculated for each cycle using cycle number and the corresponding fluorescence reading, from which an average F0 was determined (cycles 19- 26), along with the coefficient of variance (CV). Predicted reaction fluorescence (Fp) were then calculated for each cycle using equation 6.
Mathematical derivations A significant advancement in quantitative PCR, is disclosed herein, based upon curve-fitting of fluorescence readings to the four-parametric sigmoid function:
where C is cycle number, F
c is reaction fluorescence at cycle C, Fmax is the maximal reaction fluorescence at which amplification ceases, C
1/2 is the fractional cycle at which reaction fluorescence reaches half of F
maχ, k is the slope of the curve, and Fb is the background reaction fluorescence. Although not commonly used for analysis of PCR datasets, nonlinear regression can provide values for these four parameters by fitting
fluorescence readings to equation 14. This in turn allows the kinetics within individual amplification reactions to be investigated, revealing previously unrecognized principles of PCR amplification such as that fact that amplification efficiency is never constant, but is highly dynamic as described by a second sigmoid function (Liu and Saint 2002):
where Ec is the amplification efficiency at cycle C, also referred to as "cycle efficiency". Under this model, amplification efficiency decreases continuously such that each cycle has a unique amplification efficiency. The principle of this and other insights can be illustrated by plots of these equations, the most notable being a striking symmetry between amplicon DNA accumulation and loss in amplification efficiency (Figure 1).
Notwithstanding the complexity of these sigmoid functions, it is unnecessary to invoke nonlinear mathematics in order to examine amplification dynamics. An underutilized approach is based upon the classic definition of amplification efficiency, that is, the relative increase in amplicon DNA over a single thermocycle:
ER =— ^ — 1 Equation 1
FC-i
where Fc-i is the fluorescence of the preceding cycle and ER the "ratio- derived" cycle efficiency. This allows the apparent interrelationship between amplicon DNA accumulation and loss in amplification rate to be examined directly, without having to resort to nonlinear regression. Indeed, a simple plot of ER against F0 reveals a linear relation, indicating that loss in amplification efficiency is directly correlated to amplicon DNA accumulation (Figure 2A).
The dynamics of amplification efficiency can thus be described by the linear equation:
where the slope (ΔE) defines the rate-of-decay in amplification efficiency and the intercept (EB) is defined as the "base" amplification efficiency, that is, the apparent amplification efficiency in the absence of target DNA (when Fc=O). It is also evident that as PCR amplification enters the plateau phase, Fc approaches a maximum (F
maX) as ER approaches zero, so that:
indicating that F
max is determined by the ratio of base amplification efficiency to its rate-of-decay. By analogy to cycle efficiency, E
R can be replaced by E
c:
Equation 4 allows determination of cycle efficiency based solely upon reaction fluorescence. A subtle distinction between base amplification efficiency and the initial amplification efficiency as previously defined by Rutledge (2004) can be illustrated such that when C=O, Fc=F
0:
Thus, although essentially identical at low target quantities, the difference between E
B and Eo reflects the apparent reduction in amplification efficiency produced by addition of the target during reaction preparation. The dynamics of amplification efficiency could thus be illustrated as a series of steps in which Fo represents the target quantity and Fc represents amplicon DNA:
In addition to several practical implications, this allows for further elaboration of the sigmoid functions described by equations 14 and 15. For example, a direct relationship between EB and k can be demonstrated by considering the scenario that if
when C=O
1 then E
c ~ E
B/2 (Figure 1) so that equation 15 becomes:
Returning to the four-parametric sigmoid function, equation 14 can be modified by first substituting for k using equation 17, which simplifies to:
When C=O, F
0 equals the target quantity such that:
where Fo is target quantity in fluorescence units. Another fundamental aspect apparent from the linear equation 2, is that E
B and ΔE are the sole determinants of the shape of the amplification curve, that in turn is independent of target quantity, such that Cu
2 is the sole parameter defining target quantity.
These derivatives can be extended still further through rearrangement of equation 18:
so that substituting for C1/2 and simplifying, equation 19 becomes:
Importantly, equation 5 allows target quantity to be determined directly from reaction fluorescence, once estimates for EB and ΔE are obtained via linear regression using equation 2, a process termed "linear regression of efficiency" or LRE. The effectiveness of LRE for target quantitative is illustrated in another example.
Similarly, rearranging equation 19:
so that substituting for C
1/2 and simplifying, equation 18 becomes.
Fp = Equation 6
which illustrates that the kinetics of PCR amplification can be traced back to Just three fundamental parameters: target quantity (F0), base amplification efficiency (E6) and the rate-of-decay of amplification efficiency (ΔE), if it is remembered that Fmax is determined by E0 and ΔE as described by equation 3.
LRE quantification
The general effectiveness of these derivatives for modeling real time PCR can be demonstrated the linear regression analysis presented in Figure 2A, and comparing the actual Fc readings to that predicted by equation 6 (Fp), which produces a nonlinear correlation coefficient of 0.99996 over the complete curve (Figure 2B). A similar level of correlation can also be shown for ER from equation 2 with that predicted by equation 4 (Ec; Figure 2C)1 which produces a nonlinear correlation coefficient of 0.99895 over the thermocycles included in Figure 2A.
Further verification that these LRE can effectively describe PCR amplification can be illustrated by examining the impact of increasing reaction volume, which was found to produce a proportional increase in Fmax (Figure 3A). Linear regression analysis reveals that while maximal efficiency is unaffected (E=max; intercept), there is a progressive reduction in rate-of-decay in amplification efficiency (ΔE; elope) with increasing reaction volume (Figure 3B and Table III).
Furthermore, reducing amplification efficiency by reducing the time of annealing and elongation greatly impacts Fmax, along with the shape and position of the amplification curve (Figure 4A). Linear regression analysis reveals that reducing time of annealing and elongation reduces maximal efficiency (Emaχ; intercept) whilst having a modest if any impact on the rate-of- decay in amplification efficiency (ΔE; slope) (Figure 4B and Table IV). Table IV).
These data also provide direct experimental support for the relationship of F^x to Emax and ΔE, as described by equation 3. The ability to investigate the kinetics of PCR amplification (i.e. mechanistic modeling) can also have important practical applications, as for example, in the evaluation and optimization of amplification efficacy. These encompass issues familiarity to most, but which to date have received little or no experimental support. With respect to reaction preparation, examples include the relative impact of primer and enzyme concentration, or of amplicon size on quantitative efficacy. Of possibly greater importance is the ability to define in analytical terms, the impact of thermocycling parameters. As demonstrated here, the time of annealing and elongation has a major influence on amplification efficiency. Indeed, of all parameters examined to date, time of annealing and elongation has been found to have the greatest impact on amplification efficiency, significantly greater for example, than differences observed between primer pairs or produced by changes in the temperature of annealing and elongation (data not shown). Furthermore, establishment of a set of universal amplification conditions has been developed, in which high amplification efficiencies (95-100%) are routinely achieved, without having to resort to optimizing conditions unique to each individual primer-pair. Combined with the quantitative capabilities of LRE, these additional attributes , significantly impact development of high-throughput applications.
Furthermore, it is evident that LRE will have a major impact on extending the quantitative capabilities and applications. Indeed, all of the quantitative attributes of sigmoidal curve-fitting using Boltzmann-based sigmoid functions, are directly applicable to an LRE-based approach. These include the ability to monitor amplification efficacy within individual reactions, and to conduct quantification without having to construct standard curves. A major advantage of LRE is however, circumventing the complexities of nonlinear regression (i.e. curve fitting), and in particular, the difficulties produced by a drifting plateau..
Drifting plateau
A major deficiency in the curving fitting approach as described by Liu and Saint (2002) is the inability to conducted accurate quantification under
conditions in which amplification does not complete cease in the plateau phase. Although not documented to date, this continued amplification within the plateau phase can be illustrated by comparing actual to predicted reaction fluorescence (Figure 5) in which actual fluorescence increase above that predicted by sigmoidal curve-fitting. Also referred to a plateau drift this persistent increase in reaction fluorescence is inversely related to maximal efficiency, such that the rate of continued amplification increase as the predicted cycle efficiency approaches zero (Figure 5). A major deficiency for quantification via curve fitting is that a drifting plateau dramatically impacts the quantitative effectiveness. LRE circumvents this deficiency (Figure 4).
Optical Calibration
A linear relationship between reaction fluorescence and DNA mass was demonstrated using a quantified stock of lambda DNA as a DNA standard (Figure 6A). Based upon equation 13, a calibration factor (CFd) was calculated from the average of the. DNA mass to fluorescence ratio for each mock reaction (Table V). Verification of this CFd value was then obtained by conducting PCR amplifications using the same lambda DNA preparation, based upon Mo to F0 ratios generated from three targets within the lambda DNA (equation 12). The average CF0 was found to be nearly identical to that obtained via direct calibration, differing from CFd by only 4%.
Development of a protocol for large-scale qPCR applications
A major deficiency of existing qPCR applications is the need to optimize both primer design and cycling regime for each individual target. This requires construction of a standard curve for each target, that in turn requires preparation of a quantified DNA standard unique to each target. Selection and preparation of these standards are technically challenging and time consuming, greatly hindering the design of large numbers of primer pairs as is required for large-scale applications, such as for example, that required for high-throughput gene expression analysis. Combined with optical calibration, LRE analysis completely abrogates the requirement for standard curve construction, facilitating the simultaneous analysis of large numbers of targets. By virtue of allowing amplification
efficiency to be determined for each individual amplification reaction, LRE also facilitates testing and routine quality control of both primer design and cycling regime. Large-scale application of LRE analysis has subsequently led to development of a universal cycling protocol that generates both high amplification efficiencies and a very low frequency of non-specific amplification products (i.e. produces high amplification specificity). This protocol preferably incorporates a two-step cycling regime consisting of a denaturation step at or about 950C for 10 seconds, and a combined annealing and elongation (A&E) step at 650C for 180 seconds. The selection of temperature and time of the A&E step, influences amplification specificity and high amplification efficiency. It is within the skil of one in the art, in light of the disclosure herein, to select suitable annealing and elongation temperatures. Typical temperature ranges are 60 to 75 degrees Celsius, sometimes 65 to 70 degree Celsius. (These temperature ranges are driven significantly by the activity and stability of the Taq enzyme. One skilled in the art in possession of a different polymerase would be able to select a suitable temperature in light of the disclosure herein.)
The application of LRE analysis was further utilized to examine the efficacy of primer design methodologies. To date, it is common practice to utilize specialized software programs to design primers, based upon the objective of reducing the frequency of non-specific amplification products and promoting the general effectiveness of primer pair combinations. Despite the" general perception that careful selection of the DNA sequence encompassed by a primer is required for a high rate of success, LRE analysis, in combination with the cycle regime described above, led to the discovery that only a single parameter is important for effective primer design. This parameter is primer melting temperature (Tm), a parameter determined solely by primer length and base pair composition; the actual DNA sequence within a primer was generally found to have no substantive impact on amplification efficiency, nor on the determination of amplification specificity.
In addition to the simplicity of this approach to facilitate large-scale primer design, this discovery greatly facilitates large-scale application of qPCR by allowing the position of the primer within the target to be selected
without restriction. In practice this involves selecting a 3 prime starting position, generally near to the stop codon for mRNA quantification, and to increase the length of the primer until the calculated Tm reaches 700C. Combined with the universal cycling regime described above, LRE analysis of a large number of primer pairs has demonstrated that >95% of all primer pairs produce high amplification specificity, with all primer pairs producing amplification efficiencies within the range of 95-100% as determined by LRE analysis.
Materials and Methods
Figure 8 Data Analysis
SigmaPlot (Version 8) was used to generate the plots presented In Fig.
8 based upon sigmoidal curve-fitting of the fluorescence readings taken from the 4.0 min amplification reaction presented in Fig. 4, which produced: *=1.51 ,
All other analyses were conducted using MS
Excel.
Nonlinear correlation coefficients of predicted fluorescence profiles presented in Fig. 7 were calculated over the range of cycles included in the
LRE analysis (Fig. 8), using the equation:
R2 = 1_ ∑< >2 2
where Fc is reaction fluorescence and FP is the predicted reaction fluorescence at cycle C, with F^ being the average reaction fluorescence over the range of cycles used in the LRE analysis.
PCR amplification
Bacteriophage lambda DNA (New England BioLabs) was used as template for all amplification reactions, which were conducted with a MX3000P spectrofluorometric thermal cycler (Stratagene) using QuantiTect™ SYBR® Green PCR Kit (Qiagen Inc.). Data presented in Fig. 3 and 4 were
generated using primer pair Lam K7-K8 (amplicon size 225 bp), and primer pair Lam K7-K12 (amplicon size 150 bp) was used to generate the data presented in Fig. 7.
Lam K7: CTGCTGGCCGGAACTAATGAATTTATTGGT Lam K8: ACCGAGTTCAGAAATAAATAACGCGTCGCCGGAA
Lam K12: ATGCCACG ATG CCTCATCACTGTTG Unless otherwise indicated, four replicate amplification reactions containing 10 pg lambda DNA and 500 πM of each primer were conducted, v starting with a 15 min incubation at 95 0C, followed by a cycling regime of 95 0C-10 sec, 65 °C-180 sec, and reaction fluorescence determined by averaging five fluorescent readings taken at the end of each cycle. Each run was completed with a melting curve analysis to confirm the specificity of amplification and lack of primer dimers. Following fluorescence background subtraction, fluorescence readings from the four replicates were averaged and exported into MS Excel for analysis.
References
Inclusion of a reference is neither an admission nor a suggestion that it is relevant to the patentability of anything disclosed herein.
Bustin, S.A. (2002) Quantification of rπRNA using real-time reverse transcription PCR (RT-PCR): trends and problems. J MoI Endocrinol,
29, 23-39. Higuchi, R., Fockler, C, Dollinger, G. and Watson, R. (1993) Kinetic PCR analysis: real-time monitoring of DNA amplification reactions.
Biotechnology, 11, 1026-1030. Liu, W. and Saint, D.A. (Jun 7, 2002) Validation of a quantitative method for real time PCR kinetics. Biochem Biophys Res Commun, 294, 347-353. Motulsky, H., A. Christopoulos, Fitting Models to Biological Data using Linear and Nonlinear Regression: A practical guide to curve fitting (GraphPad
Software, Inc., 2003). Rasmussen, R. (2001 ) Quantification on the LightCycler. In Meuer, S.,
Wittwer, C. and Nakagawara, K. (eds.), Rapid Cycle Real-Time PCR: Methods and Applications. Springer Press, Heidelberg, pp. 21-34.
Rutledge, R.G. and Cόte, C. (Aug 15, 2003) Mathematics of quantitative kinetic PCR and the application of standard curves. Nucleic Acids Res,
31, e93.
US 6,691 ,041 B2 Method for the efficiency-corrected real-time quantification of nucleic acids.
US 2002/0058262 A1 Method for determining the efficiency of nucleic acid amplifications. US 6,569,627 B2 Monitoring hybridization during PCR SYBR Green I.