WO2002008469A9 - Procedes, systemes et articles manufactures destines a evaluer des donnees biologiques - Google Patents
Procedes, systemes et articles manufactures destines a evaluer des donnees biologiquesInfo
- Publication number
- WO2002008469A9 WO2002008469A9 PCT/US2001/023629 US0123629W WO0208469A9 WO 2002008469 A9 WO2002008469 A9 WO 2002008469A9 US 0123629 W US0123629 W US 0123629W WO 0208469 A9 WO0208469 A9 WO 0208469A9
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- algorithm
- allele
- ofthe
- quality value
- data
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 161
- 238000004519 manufacturing process Methods 0.000 title claims abstract description 4
- 108700028369 Alleles Proteins 0.000 claims abstract description 464
- 230000008569 process Effects 0.000 claims description 59
- 150000007523 nucleic acids Chemical class 0.000 claims description 46
- 108020004707 nucleic acids Proteins 0.000 claims description 44
- 102000039446 nucleic acids Human genes 0.000 claims description 44
- 238000001514 detection method Methods 0.000 claims description 33
- 239000002773 nucleotide Substances 0.000 claims description 22
- 125000003729 nucleotide group Chemical group 0.000 claims description 22
- 238000006243 chemical reaction Methods 0.000 claims description 20
- 238000007781 pre-processing Methods 0.000 claims description 10
- 150000001413 amino acids Chemical class 0.000 claims description 9
- 239000012634 fragment Substances 0.000 description 47
- 208000003028 Stuttering Diseases 0.000 description 36
- 239000003550 marker Substances 0.000 description 29
- 238000012545 processing Methods 0.000 description 19
- 238000013459 approach Methods 0.000 description 15
- 230000037230 mobility Effects 0.000 description 14
- 230000006870 function Effects 0.000 description 11
- 230000000694 effects Effects 0.000 description 10
- 108020004414 DNA Proteins 0.000 description 9
- 230000003321 amplification Effects 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 8
- 230000002068 genetic effect Effects 0.000 description 8
- 238000009499 grossing Methods 0.000 description 8
- 238000003199 nucleic acid amplification method Methods 0.000 description 8
- 239000000243 solution Substances 0.000 description 8
- 238000003860 storage Methods 0.000 description 8
- 238000012360 testing method Methods 0.000 description 8
- 238000012935 Averaging Methods 0.000 description 7
- 101001024425 Mus musculus Ig gamma-2A chain C region secreted form Proteins 0.000 description 7
- 238000004364 calculation method Methods 0.000 description 7
- 238000010586 diagram Methods 0.000 description 6
- 239000000975 dye Substances 0.000 description 6
- 238000005070 sampling Methods 0.000 description 6
- 238000003491 array Methods 0.000 description 5
- 108090000623 proteins and genes Proteins 0.000 description 5
- 230000004044 response Effects 0.000 description 5
- 230000008901 benefit Effects 0.000 description 4
- 238000013507 mapping Methods 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 230000003287 optical effect Effects 0.000 description 4
- 239000013598 vector Substances 0.000 description 4
- 241000288140 Gruiformes Species 0.000 description 3
- 238000007476 Maximum Likelihood Methods 0.000 description 3
- 108091092878 Microsatellite Proteins 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 208000037656 Respiratory Sounds Diseases 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 3
- 239000000470 constituent Substances 0.000 description 3
- 238000001502 gel electrophoresis Methods 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 102000004169 proteins and genes Human genes 0.000 description 3
- 238000011002 quantification Methods 0.000 description 3
- 206010037833 rales Diseases 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- 241000252505 Characidae Species 0.000 description 2
- 101100480705 Cupriavidus necator (strain ATCC 17699 / DSM 428 / KCTC 22496 / NCIMB 10442 / H16 / Stanier 337) tauE gene Proteins 0.000 description 2
- 241000984642 Cura Species 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 206010011878 Deafness Diseases 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000000546 chi-square test Methods 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 239000012141 concentrate Substances 0.000 description 2
- 238000011109 contamination Methods 0.000 description 2
- 238000013480 data collection Methods 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000009472 formulation Methods 0.000 description 2
- 230000014509 gene expression Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 238000011068 loading method Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 239000001048 orange dye Substances 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 230000008707 rearrangement Effects 0.000 description 2
- 239000001044 red dye Substances 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 238000004513 sizing Methods 0.000 description 2
- 230000003068 static effect Effects 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 206010071602 Genetic polymorphism Diseases 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 238000000211 autoradiogram Methods 0.000 description 1
- 238000013476 bayesian approach Methods 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 239000001045 blue dye Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 239000010432 diamond Substances 0.000 description 1
- 230000003467 diminishing effect Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000003205 genotyping method Methods 0.000 description 1
- 239000001046 green dye Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 238000012804 iterative process Methods 0.000 description 1
- 238000003064 k means clustering Methods 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000002493 microarray Methods 0.000 description 1
- 238000000329 molecular dynamics simulation Methods 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 238000000734 protein sequencing Methods 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- 238000002922 simulated annealing Methods 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000011179 visual inspection Methods 0.000 description 1
- 239000001043 yellow dye Substances 0.000 description 1
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
Definitions
- This invention relates to data methods and systems for assigning values to nucleic information.
- the methods and systems are used for assigning values to nucleic information.
- variations between individuals may signify predisposition to a disease or other
- amplified material By determining the fragment length, one can determine the number of
- Figure 4 depicts a flow chart ofthe steps performed by the committee machine of Figure 1 for use with methods and systems consistent with certain embodiments ofthe present invention.
- the parameter k is called the "Baseline Window Size”. Similarly, the baselining component
- the positive structure can be eliminated by executing
- nucleotide lengths 110, 114, 117, 120, and 125 may be used.
- primer peak locates within the first half of the signal and the size standard fragments locate in the
- An in-lane size standard is a set of peaks resulting from running a size standard on an
- H [2722, 6219, 1060, 5380, 7726, 1082, 7424, 1263, 7335, 7937, 1562].
- edges (i,j) and (j,k) define a ratio r ⁇ . of lengths:
- the global methods determine the size/f ⁇ of a fragment at scan number J: by evaluating a function/! The function depends on the method:
- a scan number x corresponds to time (since the capillary length, or well-to-read distance, is fixed), and so is inversely proportional to mobility.
- certain embodiments ofthe Envelope Caller may include the following:
- the heuristic caller assumes that there are a maximum of two
- committee machine 110 If there is no call for the Heuristic algorithm, and the same call for the Envelope method and the Optimizer, committee machine 110 passes those calls to the user and assigns a confidence value of 0.621. If only the Optimizer produces a call, committee machine 110 assigns a confidence value of 0.692 correct. And finally, any cases that do not fit into the above scenarios are assigned the calls given by the Heuristic algorithm and are assigned a confidence value of 0.771.
- the above listed determination of agreement is exemplary. One skilled in the art will appreciate that other determinations of confidences are available. For example, additional algorithms may be used to produce more accurate confidence levels according to certain embodiments.
- preprocessing simply involves sampling the original signal to reduce its dimensionality. This can be performed by calculating the most important features of the signal; the peaks and valleys. By representing the signal in such a compact form, the search space is reduced significantly.
- the peaks form the set of candidate allele peaks that will be considered as possibilities for the allele calls.
- the next two boxes show the varying the parameters and the calculation ofthe residual. This process is iterated, and in the final box, a winning set of allele peaks (it could be a set of one peak) is declared. Actual output ofthe algorithm is contained in Figure 15.
- the call is made by finding the maximums in each of panels 1 and 2.
- the individual algorithms may not be optimal when employed alone.
- CTR cathode
- LCD liquid crystal display
- This input device typically has two degrees of freedom in two axes, a first axis (e.g., x) and a second axis (e.g., y), that allows the device to specify positions in a plane.
Landscapes
- Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Analytical Chemistry (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biotechnology (AREA)
- Evolutionary Biology (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Theoretical Computer Science (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Investigating Or Analysing Biological Materials (AREA)
- Complex Calculations (AREA)
- Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA002416764A CA2416764A1 (fr) | 2000-07-21 | 2001-07-23 | Procedes, systemes et articles manufactures destines a evaluer des donnees biologiques |
AU2002211212A AU2002211212A1 (en) | 2000-07-21 | 2001-07-23 | Methods, systems, and articles of manufacture for evaluating biological data |
EP01979226A EP1349960A2 (fr) | 2000-07-21 | 2001-07-23 | Procedes, systemes et articles manufactures destines a evaluer des donnees biologiques |
JP2002513951A JP2004516455A (ja) | 2000-07-21 | 2001-07-23 | 生物学的データを評価するための方法、システム、および製品 |
Applications Claiming Priority (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US21969700P | 2000-07-21 | 2000-07-21 | |
US60/219,697 | 2000-07-21 | ||
US22755600P | 2000-08-23 | 2000-08-23 | |
US60/227,556 | 2000-08-23 | ||
US72491000A | 2000-11-28 | 2000-11-28 | |
US09/724,910 | 2000-11-28 | ||
US29012901P | 2001-05-10 | 2001-05-10 | |
US60/290,129 | 2001-05-10 |
Publications (3)
Publication Number | Publication Date |
---|---|
WO2002008469A2 WO2002008469A2 (fr) | 2002-01-31 |
WO2002008469A3 WO2002008469A3 (fr) | 2003-07-17 |
WO2002008469A9 true WO2002008469A9 (fr) | 2003-11-20 |
Family
ID=27499158
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2001/023629 WO2002008469A2 (fr) | 2000-07-21 | 2001-07-23 | Procedes, systemes et articles manufactures destines a evaluer des donnees biologiques |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP1349960A2 (fr) |
JP (1) | JP2004516455A (fr) |
AU (1) | AU2002211212A1 (fr) |
CA (1) | CA2416764A1 (fr) |
WO (1) | WO2002008469A2 (fr) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2005531853A (ja) * | 2002-06-28 | 2005-10-20 | アプレラ コーポレイション | Snp遺伝子型クラスタリングのためのシステムおよび方法 |
JP4713138B2 (ja) * | 2004-12-06 | 2011-06-29 | 株式会社日立ソリューションズ | 遺伝子情報の表示方法及び表示装置及びプログラム |
EP1862929A1 (fr) * | 2006-02-28 | 2007-12-05 | Hitachi Software Engineering Co., Ltd. | Procédé et dispositif d'évaluation de résultat de genotypage |
CN101467032B (zh) | 2006-04-14 | 2013-03-27 | 日本电气株式会社 | 个体识别方法及设备 |
WO2015029670A1 (fr) * | 2013-08-27 | 2015-03-05 | 日産自動車株式会社 | Mécanisme piston/manivelle à plusieurs liaisons pour moteur à combustion interne |
JP2017532699A (ja) * | 2014-09-05 | 2017-11-02 | ナントミクス,エルエルシー | 起源の判定のためのシステムと方法 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5580728A (en) * | 1994-06-17 | 1996-12-03 | Perlin; Mark W. | Method and system for genotyping |
US6019896A (en) * | 1998-03-06 | 2000-02-01 | Molecular Dynamics, Inc. | Method for using a quality metric to assess the quality of biochemical separations |
WO1999053423A1 (fr) * | 1998-04-16 | 1999-10-21 | Northeastern University | Systeme expert d'analyse des electrophoregrammes d'adn |
-
2001
- 2001-07-23 JP JP2002513951A patent/JP2004516455A/ja active Pending
- 2001-07-23 EP EP01979226A patent/EP1349960A2/fr not_active Withdrawn
- 2001-07-23 AU AU2002211212A patent/AU2002211212A1/en not_active Abandoned
- 2001-07-23 CA CA002416764A patent/CA2416764A1/fr not_active Abandoned
- 2001-07-23 WO PCT/US2001/023629 patent/WO2002008469A2/fr active Search and Examination
Also Published As
Publication number | Publication date |
---|---|
WO2002008469A3 (fr) | 2003-07-17 |
AU2002211212A1 (en) | 2002-02-05 |
CA2416764A1 (fr) | 2002-01-31 |
WO2002008469A2 (fr) | 2002-01-31 |
EP1349960A2 (fr) | 2003-10-08 |
JP2004516455A (ja) | 2004-06-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20020116135A1 (en) | Methods, systems, and articles of manufacture for evaluating biological data | |
US10347365B2 (en) | Systems and methods for visualizing a pattern in a dataset | |
US20040117130A1 (en) | System and method for improving the accuracy of DNA sequencing and error probability estimation through application of a mathematical model to the analysis of electropherograms | |
Perlin et al. | Toward fully automated genotyping: genotyping microsatellite markers by deconvolution | |
Ewing et al. | Base-calling of automated sequencer traces usingPhred. I. Accuracy assessment | |
Price et al. | Whole-genome analysis of Alu repeat elements reveals complex evolutionary history | |
US8392126B2 (en) | Method and system for determining the accuracy of DNA base identifications | |
US7406385B2 (en) | System and method for consensus-calling with per-base quality values for sample assemblies | |
US20220101944A1 (en) | Methods for detecting copy-number variations in next-generation sequencing | |
US20050149271A1 (en) | Methods and apparatus for complex gentics classification based on correspondence anlysis and linear/quadratic analysis | |
Coffa et al. | Analysis of MLPA data using novel software coffalyser .NET by MRC-Holland | |
WO2002008469A9 (fr) | Procedes, systemes et articles manufactures destines a evaluer des donnees biologiques | |
US7912652B2 (en) | System and method for mutation detection and identification using mixed-base frequencies | |
WO2020104394A1 (fr) | Procédé et produit de programme d'ordinateur pour l'analyse d'adn fœtal par séquençage massif | |
CN116312783A (zh) | 一种dna合成难度预测的系统及其应用 | |
Walther et al. | Basecalling with lifetrace | |
US11328794B2 (en) | Method for determining relatedness of genomic samples using partial sequence information | |
WO2024140881A1 (fr) | Procédé et dispositif de détermination de la concentration d'adn fœtal | |
WO2003102211A2 (fr) | Méthode de détection des variations de l'adn dans des données de séquences | |
Vranckx et al. | Analysis of MALDI‐TOF MS Spectra using the BioNumerics Software | |
Talenti et al. | The evolution and convergence of mutation spectra across mammals | |
Hellenthal | Population structure, demography and recent admixture | |
Zavala et al. | Benchmarking for genotyping and imputation using degraded DNA for forensic applications across diverse populations | |
Parsley | Benchmark of Tools and Methods Used in the Analysis of Massively Parallel Reporter Assays | |
Gafurov et al. | Probabilistic Models of k-mer Frequencies |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2416764 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2001979226 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2002211212 Country of ref document: AU |
|
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
WWP | Wipo information: published in national office |
Ref document number: 2001979226 Country of ref document: EP |
|
COP | Corrected version of pamphlet |
Free format text: PAGES 1/40-40/40, DRAWINGS, REPLACED BY NEW PAGES 1/36-36/36; DUE TO LATE TRANSMITTAL BY THE RECEIVING OFFICE |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 2001979226 Country of ref document: EP |
|
DPE2 | Request for preliminary examination filed before expiration of 19th month from priority date (pct application filed from 20040101) |