WO2000065088A2 - Amorces servant a l'identification, le typage ou la classification d'acides nucleiques - Google Patents
Amorces servant a l'identification, le typage ou la classification d'acides nucleiques Download PDFInfo
- Publication number
- WO2000065088A2 WO2000065088A2 PCT/EP2000/003636 EP0003636W WO0065088A2 WO 2000065088 A2 WO2000065088 A2 WO 2000065088A2 EP 0003636 W EP0003636 W EP 0003636W WO 0065088 A2 WO0065088 A2 WO 0065088A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- primers
- primer
- extendible
- hla
- nucleic acid
- Prior art date
Links
- 108020004707 nucleic acids Proteins 0.000 title claims abstract description 27
- 102000039446 nucleic acids Human genes 0.000 title claims abstract description 27
- 150000007523 nucleic acids Chemical class 0.000 title claims abstract description 27
- 238000000034 method Methods 0.000 claims abstract description 49
- 239000011159 matrix material Substances 0.000 claims abstract description 39
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 36
- 108700028369 Alleles Proteins 0.000 claims abstract description 30
- 108020004465 16S ribosomal RNA Proteins 0.000 claims abstract description 20
- 102000054765 polymorphisms of proteins Human genes 0.000 claims abstract description 12
- 238000004458 analytical method Methods 0.000 claims abstract description 7
- 239000012634 fragment Substances 0.000 claims description 16
- 102100028972 HLA class I histocompatibility antigen, A alpha chain Human genes 0.000 claims description 13
- 108010075704 HLA-A Antigens Proteins 0.000 claims description 13
- 125000003729 nucleotide group Chemical group 0.000 claims description 13
- 102100028976 HLA class I histocompatibility antigen, B alpha chain Human genes 0.000 claims description 11
- 102100028971 HLA class I histocompatibility antigen, C alpha chain Human genes 0.000 claims description 11
- 108010058607 HLA-B Antigens Proteins 0.000 claims description 11
- 108010052199 HLA-C Antigens Proteins 0.000 claims description 11
- 239000002773 nucleotide Substances 0.000 claims description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 8
- 108010072685 Uracil-DNA Glycosidase Proteins 0.000 claims description 7
- AHCYMLUZIRLXAA-SHYZEUOFSA-N Deoxyuridine 5'-triphosphate Chemical compound O1[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(=O)NC(=O)C=C1 AHCYMLUZIRLXAA-SHYZEUOFSA-N 0.000 claims description 5
- 239000000427 antigen Substances 0.000 claims description 4
- 108091007433 antigens Proteins 0.000 claims description 4
- 102000036639 antigens Human genes 0.000 claims description 4
- XNKICCFGYSXSAI-UHFFFAOYSA-N 1,1-diphenylpropan-2-amine Chemical compound C=1C=CC=CC=1C(C(N)C)C1=CC=CC=C1 XNKICCFGYSXSAI-UHFFFAOYSA-N 0.000 claims description 2
- 238000012512 characterization method Methods 0.000 claims description 2
- 238000004624 confocal microscopy Methods 0.000 claims description 2
- 238000009396 hybridization Methods 0.000 claims description 2
- 238000012408 PCR amplification Methods 0.000 claims 1
- 102000006943 Uracil-DNA Glycosidase Human genes 0.000 claims 1
- 238000000492 total internal reflection fluorescence microscopy Methods 0.000 claims 1
- 239000000243 solution Substances 0.000 description 43
- 108020004414 DNA Proteins 0.000 description 20
- 239000013598 vector Substances 0.000 description 19
- 238000006243 chemical reaction Methods 0.000 description 17
- 239000000523 sample Substances 0.000 description 13
- 101100284398 Bos taurus BoLA-DQB gene Proteins 0.000 description 12
- 101001100327 Homo sapiens RNA-binding protein 45 Proteins 0.000 description 7
- 102100038823 RNA-binding protein 45 Human genes 0.000 description 7
- 102100037111 Uracil-DNA glycosylase Human genes 0.000 description 7
- 230000003321 amplification Effects 0.000 description 7
- 230000006872 improvement Effects 0.000 description 7
- 238000003199 nucleic acid amplification method Methods 0.000 description 7
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 6
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 6
- 230000009467 reduction Effects 0.000 description 6
- 210000004027 cell Anatomy 0.000 description 5
- 238000002844 melting Methods 0.000 description 5
- 230000008018 melting Effects 0.000 description 5
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 5
- 230000006399 behavior Effects 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 239000000975 dye Substances 0.000 description 4
- 238000013467 fragmentation Methods 0.000 description 4
- 238000006062 fragmentation reaction Methods 0.000 description 4
- 239000011521 glass Substances 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 102100036241 HLA class II histocompatibility antigen, DQ beta 1 chain Human genes 0.000 description 3
- 108010065026 HLA-DQB1 antigen Proteins 0.000 description 3
- 108010067148 HLA-DQbeta antigen Proteins 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 238000010186 staining Methods 0.000 description 3
- JEPVUMTVFPQKQE-AAKCMJRZSA-N 2-[(1s,2s,3r,4s)-1,2,3,4,5-pentahydroxypentyl]-1,3-thiazolidine-4-carboxylic acid Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C1NC(C(O)=O)CS1 JEPVUMTVFPQKQE-AAKCMJRZSA-N 0.000 description 2
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 2
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 2
- 241000143060 Americamysis bahia Species 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 238000000018 DNA microarray Methods 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- 102100029966 HLA class II histocompatibility antigen, DP alpha 1 chain Human genes 0.000 description 2
- 102100031618 HLA class II histocompatibility antigen, DP beta 1 chain Human genes 0.000 description 2
- 102100036242 HLA class II histocompatibility antigen, DQ alpha 2 chain Human genes 0.000 description 2
- 108010093061 HLA-DPA1 antigen Proteins 0.000 description 2
- 108010045483 HLA-DPB1 antigen Proteins 0.000 description 2
- 108010086786 HLA-DQA1 antigen Proteins 0.000 description 2
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 238000000205 computational method Methods 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 239000005546 dideoxynucleotide Substances 0.000 description 2
- 239000013024 dilution buffer Substances 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000007670 refining Methods 0.000 description 2
- MPLHNVLQVRSVEE-UHFFFAOYSA-N texas red Chemical compound [O-]S(=O)(=O)C1=CC(S(Cl)(=O)=O)=CC=C1C(C1=CC=2CCCN3CCCC(C=23)=C1O1)=C2C1=C(CCC1)C3=[N+]1CCCC3=C2 MPLHNVLQVRSVEE-UHFFFAOYSA-N 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 102100040485 HLA class II histocompatibility antigen, DRB1 beta chain Human genes 0.000 description 1
- 108010064885 HLA-DR3 Antigen Proteins 0.000 description 1
- 108010039343 HLA-DRB1 Chains Proteins 0.000 description 1
- 101000941450 Lasioglossum laticeps Lasioglossin-1 Proteins 0.000 description 1
- 108700018351 Major Histocompatibility Complex Proteins 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- BLRPTPMANUNPDV-UHFFFAOYSA-N Silane Chemical compound [SiH4] BLRPTPMANUNPDV-UHFFFAOYSA-N 0.000 description 1
- 229910000831 Steel Inorganic materials 0.000 description 1
- ODMZDQKUHYGKKN-UHFFFAOYSA-N TCA C Natural products CC(CCCC(C)C1=CCC2(C)OC3=C(CC12)C(=O)C(O)CC3)C(=O)O ODMZDQKUHYGKKN-UHFFFAOYSA-N 0.000 description 1
- 239000008186 active pharmaceutical agent Substances 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 230000001174 ascending effect Effects 0.000 description 1
- 230000003190 augmentative effect Effects 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- SPTYHKZRPFATHJ-HYZXJONISA-N dT6 Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)CO)[C@@H](O)C1 SPTYHKZRPFATHJ-HYZXJONISA-N 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000013213 extrapolation Methods 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 102000054767 gene variant Human genes 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 238000007654 immersion Methods 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 102000004169 proteins and genes Human genes 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 229910000077 silane Inorganic materials 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 239000010959 steel Substances 0.000 description 1
- 230000020382 suppression by virus of host antigen processing and presentation of peptide antigen via MHC class I Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 238000002054 transplantation Methods 0.000 description 1
- 241001624918 unidentified bacterium Species 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
- C12Q1/6874—Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6834—Enzymatic or biochemical coupling of nucleic acids to a solid phase
- C12Q1/6837—Enzymatic or biochemical coupling of nucleic acids to a solid phase using probe arrays or probe chips
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6881—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for tissue or cell typing, e.g. human leukocyte antigen [HLA] probes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/172—Haplotypes
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
Definitions
- DNA-sequence analysis is rapidly becoming a standard tool in modern, molecular biology research. Examples of applications include: Sequencing of unknown DNA-sequences, Identifying novel genes in stretches of sequenced DNA, Predicting protein-sequence and -structure from DNA-sequence alone and Identification of known gene-variations (sometimes called "typing a gene").
- HLA Human Leucocyte Antigen
- MHC Major Histocompatibility Complex
- Another application where a rapid and accurate identification of a gene is desired is when trying to identify unknown bacteria.
- a rapid identification of the bacteria causing the illness of a patient makes it possible to administer the correct medication early in the treatment of the disease, thus reducing the discomfort for the patient.
- every self- replicating organism so far studied use ribosomes when translating mRNA to proteins, analysis of one of the genes coding for the ribosome, for instance the 16S rRNA in the case of prokaryotes, could be used to identify the organism in question.
- APEX Arrayed Primer Extension
- the array primer extension method APEX for resequencing would need more than 16,000 primers if all DQB alleles would be sequenced from a 500 bp long PCR fragment. If all DQB alleles in pairs should be combined the number of primers might be even higher which would be the situation for a heterozygote found in most individuals. But this might not be necessary, if some variations always or never occur together. This needs to be studied though, and a way found to determine the least number of primers (and what their sequences are) required for unambiguously identifying those genes.
- An object of this invention is to find and implement an efficient algorithm capable of doing just that.
- the algorithm should preferably also take into account the melting points of the primers, so that the extension reaction can take place under optimal conditions for all of the primers on the chip. It should also minimise the number of "self-extended” primers, i.e. primers that can extend themselves without any sample DNA.
- This algorithm is then to be tested and evaluated on the HLA and 16S rRNA- genes.
- HLA is chosen partly because of the importance of rapid typing of these genes, leading to the fact that there are many other methods to which APEX can be compared. It is also because the HLA-genes are "easy” to work with, since they rarely contain any insertions or deletions. These kinds of variations in the gene could potentially create problems when designing primers for APEX.
- the 16S rRNA contains insertions and deletions and can thus be used to see if the algorithm can handle such variations.
- the invention provides a method of identifying a set of extendible primers for use in the identification, typing or classification of a nucleic acid of known sequence having known polymorphisms wherein: i) all possible nucleotide sequences of a chosen length of the nucleic acid are identified and their corresponding extendible primers, ii) at least one extendible primer is removed from the set wherein the at least one primer removed identifies a segment of the nucleic acid identified by at least one other primer.
- the method includes between step i) and ii): ia) potential extensions for each primer are identified with respect to each nucleotide sequence, ib) for each extendible primer the identified potential extensions are compared to determine which pairs of sequences can be discriminated by the primer.
- a matrix of primers and pairs of primer extensions is prepared in binary form and is subjected to analysis by a set covering problem (SCP) algorithm as described in more detail below.
- SCP set covering problem
- the invention also includes a set of extendible primers, for use in the identification, typing or classification of a nucleic acid of known sequence having known polymorphisms, identified by the method as defined.
- the primers are attached by 5'-ends to a surface of a support on which they are presented in the form of an array.
- the invention provides a set of extendible primers, for use in the identification, typing or classification of a human leucocyte antigen (HLA) gene as indicated, the set comprising about the number of primers indicated and being capable of distinguishing about the number of alleles indicated:
- HLA human leucocyte antigen
- the invention provides a set of extendible primers, for use in the identification, typing or classification of 16S rRNA, wherein the set comprises about 210 primers and is capable of distinguishing at least about 1207 different sequences.
- the approximate number of primers is indicated. As indicated below, it may be possible by the use of the algorithms exemplified or other algorithms to generate slightly smaller sets of primers capable of distinguishing the number of alleles or sequences indicated, and these sets are envisaged according to the invention. Of course, other primers may be present in addition to those indicated as essential, and may be useful for checking purposes.
- the number of alleles or sequences indicated represents the approximate known number of polymorphisms or different sequences, and these will surely increase with time.
- the invention provides a method of identification, typing or classification of a nucleic acid of known sequence having known polymorphisms, by the use of the set of extendible primers as defined, which method comprises applying the nucleic acid or fragments thereof to the set of extendible primers under hybridisation conditions and effecting template-directed chain extension of extendible primers that have formed hybrids.
- template-directed chain extension is effected using four different fluorescently labelled chain-terminating nucleotide analogues, and results are analysed by an imaging system such as total internal reflection fluorescence (TIRF) or scanning confocal microscopy.
- TIRF total internal reflection fluorescence
- the various steps of the method may be performed as described in the literature for the known APEX technique.
- the invention provides a kit for use in the identification, typing or characterisation of a nucleic acid of known sequence having known polymorphisms, comprising the set of extendible primers as defined.
- the invention provides an array of sets of extendible primers as defined, for the simultaneous identification, typing or classification of two or more different HLA genes.
- the present invention it has been realised that where a number of different alleles are to be identified, the total number of primers required to distinguish each of the alleles could be reduced as some primers would be common to all of the alleles, for example.
- complete sets of primers for identification of each allele are identified and then the total number of primers in the combined sets is reduced using predetermined rules.
- the present invention is based on the premise that as the primers are used to identify the presence or absence of a particular nucleotide sequence in any allele, the specific nucleotide that extends any particular primer is of less relevance than simply whether the primer has been extended.
- SCP Set Covering Problem
- Figure 1 is a diagram of a signal matrix in accordance with the present invention
- Figure 2 is a diagram of the corresponding binary matrix for the signal matrix of Figure 1 ;
- Figure 3 is a flow diagram of the steps for reducing the primer set in accordance with the present invention. The following is an explanation to assist in an understanding of the principles underlying the manner in which the number of primers used in the identification of a plurality of sequences may be reduced.
- the number of primers required to identify k sequences grows as 0(k»l), where / is the length of the sequences as each sequence requires / primers.
- / is the length of the sequences as each sequence requires / primers.
- the less the sequences differ from one another the fewer primers are required as many of the primers required for identification of a first sequence may also be of use in identification of another sequence. This effect becomes more pronounced the greater the number of sequences to be identified and the greater the similarities.
- a signal matrix of k x n can be constructed. Each element in the matrix represents the signal, if any, that is generated by a particular primer with respect to a particular sequence.
- the signal will either be one of the four nucleotides 'A', 'C ⁇ 'G', or T or no signal '-'.
- Figure 1 is an example of such a signal matrix where, for example, the signal generated by primer 2 with respect to sequence 3 is T.
- the signal matrix is then converted into a binary matrix that represents whether the signals for any particular primer differ with respect to different sequences.
- the same signal 'G' is generated for both sequences 1 and 2 but a different signal T is generated with respect to sequence 3.
- the binary matrix is constructed by considering each column (each primer) of the signal matrix and comparing each signal in that column in turn.
- the first row of the matrix represents a comparison of the signals for the first and second sequences
- the second row represents a comparison of the signals for the first and third sequences
- the third row represents a comparison of the signals for the second and third sequences.
- Binary '0' represents the comparison revealing the same signal
- binary '1 ' represents the comparison reveals different signals.
- the binary matrix renders the data contained within that matrix suitable for mathematical analysis.
- SCP Set Covering Problem
- the most simple heuristic is the greedy algorithm, where columns are added one at a time.
- the column to be added in each step is chosen so as to cover as many uncovered rows as possible (a row is covered if it has at least one non-zero element).
- S r is the set of columns already included in the solution at iteration r
- R r is the set of rows with no non-zero elements at iteration r
- column; ' / is selected according to:
- C j / P other terms can be used.
- Example terms are c,, C j / log 2 Pj or c, / (P j )2.
- Greedy algorithms of this type are described in "An Efficient Heuristic for Large Set Covering Problems", Vasko, Wilson, Naval Research Logistics Quarterly 1984, 31 :163-171 the contents of which is incorporated herein by reference. The difference is in how much emphasis to place on the cost of the column versus how many rows the column covers. It is shown, however, that this entire class of heuristics share the same worst case behaviour. If we denote the set of columns in the solution as S and the solution value as Z, then the worst case behaviour can be described as:
- Lagrangian relaxation heuristic is believed to be some kind of Lagrangian relaxation heuristic, where in each iteration the Lagrange multipliers for each column are used to calculate the Lagrangian cost for the columns.
- Lagrangian relaxation heuristic is described in "A Heuristic Method for the Set Covering Problem", Capara et al Technical Report OR-95-8, Operations Research Group, University of Bologna 1995 the content of which is incorporated herein by reference.
- a near optimal vector of these costs is then calculated by a subgradient algorithm, before being used as input to a greedy algorithm. This is repeated until no improvements in the solution can be made.
- Lagrangian subgradient methods the Lagrangian of the original problem is considered instead of the original problem. In this case, the Lagrangian will be
- u is the Lagrangian multiplier for row / ' .
- q(u) is the Lagrangian cost associated with column j, and is defined by
- Equation 5 An optimal solution to Equation 4 is given by
- L(u) can also be seen as an estimate of the lower bound for the solution, i.e. the sum of the costs for the columns in the optimal solution to the SCP will be > L(u).
- the solution to the SCP can be found by finding an optimal multiplier vector u instead, but this will require much computation especially for a large SCP. But near-optimal multiplier vectors can be found within short time by using the subgradient vector s(u), defined by
- u can be refined iteratively by using for example
- Equation 8 where ⁇ > 0 is a step-size parameter and UB is an upper bound on the value of the solution.
- the initial u° can be defined arbitrarily.
- To solve the SCP first a near-optimal multiplier vector u is found. This and Equation 6 is then used as a basis to form a feasible solution. The upper bound UB can then be updated to the value of this feasible solution (if it is better than the previous best solution), and a new near-optimal multiplier vector found and so on until convergence is reached.
- Another alternative computational method that may be employed to solve such a SCP is 'surrogate relaxation' in which in each iteration a corresponding continuous problem is solved and made feasible before a sub-gradient algorithm is applied.
- genetic algorithms may be employed in which the 'genome' consists of n bits, one bit for each of the columns.
- a primer in the selected reduced set may generate a negative, '-', signal rather than a positive signal, A, C, G, T.
- A, C, G, T a positive signal
- the least number of positive signals as well as the least number of differences in the signal pattern is preferably larger than one.
- all possible primers are selected (10) using the standard APEX procedure to produce a first set of primers.
- a substring of the sequence to be analysed is used to construct one primer, then the substring is displaced by one base and another primer is constructed. This process is carried out from the start of the sequence until the entire sequence has been covered. Both strands of DNA are used and this is repeated for all sequences.
- the primers should be long enough to be capable of discriminating between exact matches and mismatches involving one or two nucleotide pairs. Conveniently, the primers are 13bp long as this has been found to be sufficient to ensure the reaction, or longer to increase hybrid stability. However, to avoid steric hindrance on the chip each primer may be 5'-tailed. In this example, twelve T's are added to the 5'-end of the primer so that the final length of the primers is 25bp.
- primers that are not suitable as primers are rejected (12) and the rest is included in a primary primer set.
- Unsuitable primers are those where the three bases at the 3'-end are complementary to any substring of the primer. In some instances this can result in the primer being extended by a neighbouring primer and not the sample DNA as a template and for that reason such primers are considered unsuitable.
- any primers that would produce ambiguous signals are identified and rejected (14).
- a primer produces an ambiguous signal where it is not known which of the four bases is in the relevant position.
- Each of the remaining primers in the primary set primer is then compared to each sequence in turn to determine whether the primer is extendible by each sequence and if the primer is extendible the base with which it would be extended is determined.
- a signal matrix of the primers with respect to each of the sequences is thus generated (16).
- the three bases in the 3'-end of the primer must hybridise to the DNA. Otherwise the enzyme responsible for the extension will not be able to add a nucleotide to the primer. Of the rest of the primer (the poly-T tail excluded), at most two mismatches are allowed, otherwise the primer-DNA duplex is considered to be too unstable to be extended. In ordinary PCR, all the bases must match in order for the primer to be extended. But then the temperature is raised to the melting point, T m , of the primer in the extension step. In APEX, this reaction is carried out at 45°C, which is around 10°-20° below T m of most primers. This means that the primers will hybridise to the DNA despite a few mismatches, which is why two mismatches are allowed here.
- a primer could hybridise to a sequence in more than one position, and sometimes a primer could hybridise to both strands of one allele and give different signals. In those cases all the different signals are combined to form one resulting signal (e.g. 'A' and 'C together forms 'M', which is the NC-IUB (NC-IUB, 1985) code for this combination).
- the entries for each row are compared against one another, in other words for each primer the signals produced by the primer for each sequence are compared against each other.
- a binary matrix is thus generated (18) of the primers with respect to the identity or difference of signals for pairs of sequences.
- the binary matrix contains non-zero entries where the primer is able to distinguish between a pair of sequences.
- the number of pairs of sequences that each primer can distinguish between are counted and a score is allocated to each primer (20) in dependence on the total number of pairs of sequences counted. Thus, the number of non-zero elements for each primer are counted.
- Primers that are unable to distinguish between any pairs of sequences are rejected (22) and the remaining primers are sorted (24) in order of their score with the primers with the higher scores at the beginning.
- a core of primers is created next (26). The primer with the highest score is selected. Where two primers with equal scores exist, the number of positive signals is determined for each and the primer with the greater number of positive signals is chosen. If both primers remain equal, one is then selected arbitrarily over the other. After the main primer has been selected, the first twenty (five times the desired redundancy which is four here) primers giving positive signals for each sequence in turn are selected for the core. All remaining primers are rejected.
- a greedy algorithm is then run (28) using the core set of primers to identify the minimum number of primers necessary to distinguish each sequence.
- primers are added one at a time with each primer being selected in turn in relation to the number of uncovered rows it is capable of covering.
- the reduced set of primers is checked for any sequences that has fewer than four positive signals and extra primers are added as necessary to meet this minimum requirement.
- a redundancy check is then performed (30) to identify whether any more primers can be removed. During the redundancy check each primer is "tentatively" removed in turn to see whether the remaining primers meet the minimum requirements.
- next primer is tried. Otherwise the primer is temporarily removed from the set, and the process continues with the next primer in line. This process continues until no more primers can be removed, in which case the last primer to be removed is added back to the set, and the next primer in line tentatively removed and so on.
- This can be viewed as a depth-first search of a tree where the nodes are combinations of primers, and the number of primers in each node is one less than in a node one level above. The root node thus contains all primers from the greedy algorithm. It has p (the number of primers after the greedy algorithm) primers in it.
- CFT a modified algorithm
- This algorithm consists of three main phases: A subgradient phase where a near-optimal multiplier vector is found, a heuristic phase where a solution to the SCP is found and column-fixing, designed to improve the results of the heuristic phase.
- a near-optimal multiplier vector u is found using Equation 8. At the beginning, the starting vector u° used is defined as
- Equation 9 Later calls use the last vector u before column fixing, and apply a small perturbation before using it as the starting vector.
- the perturbation is randomly (and uniformly) distributed in the range ⁇ 10% for each element.
- the sequence of multiplier vectors is considered to have converged when the improvement in L(u) in the last 50 iterations is smaller than 0.1 %, or when the number of iterations reached 10 x m.
- the factor A in Equation 8 was set to 0.1 at the beginning, and was updated as follows: Every 20 iterations, the best and worst lower bounds L(u) during those 20 iterations are compared to each other. If the difference is larger than 1 %, the value of ⁇ is halved.
- ⁇ is multiplied with 1.5.
- the upper bound, UB used is the sum of the costs of the first primers that together cover all rows four times. Otherwise it is the value of the best solution found so far.
- the last vector from the subgradient phase is used to generate a sequence of multiplier vectors (again using Equation 8), and a feasible solution constructed for each of the multiplier vectors.
- the procedure used to generate a feasible solution is a variation of the greedy algorithm, where each column is scored according to
- R is the set of uncovered rows in each step.
- the column with the lowest q i.e. the columns with the best "gain/cost"-ratio, is added in each step to the solution. This continues until no improvements to the best solution (i.e. minimum number of primers) have been made for 50 iterations.
- the heuristic phase column fixing is applied to the solution. Columns that are absolutely necessary in order for a row to be covered (i.e. if there are only e columns covering a row and each row is to be covered e times) are fixed. These fixed columns are then used as a starting point for the greedy algorithm, and the first max ⁇ [200/mj, 1 ⁇ columns chosen therein are fixed as well.
- ⁇ max ⁇ c. (w * ), ⁇ + £ ,. M ;
- u,(K : - 1) is the contribution of row / ' to the gap between the estimated lower and upper bound of the problem. This is then split uniformly between all columns in the solution covering that row. Columns with small ⁇ j (contributing the least to the gap) are then likely to be part of the optimal solution. The p columns with the smallest ⁇ are then fixed before the entire algorithm is applied again to the resulting sub-problem. (Column fixing here has nothing to do with column fixing after the heuristic phase, so columns fixed there need no longer be fixed here), p is the smallest value satisfying
- the number of columns fixed in this step was also set to be at least one more than in the previous iteration (if no improvements were made). Otherwise the same number of columns would be fixed in a number of iterations before the value of ⁇ is large enough to allow more columns to be fixed.
- the algorithm is iterated until either the value of the best solution is less than the estimated lower bound, all columns in the best solution found so far are already fixed in the refining step or a time limit is exceeded.
- the time limit in this case was arbitrarily set to as many seconds as there were rows in the problem. However, the time limit is only checked before the refining step. If it is not exceeded, a whole iteration of the algorithm will be executed before another check is done. Here too a check was done afterwards to see if primers could be removed without breaking any constraints.
- the primers were initially sorted in order of score, this need not be performed.
- the algorithms for stripping out redundant primers are capable of operating with any order of primers including a wholly random order. However, slightly better results were obtained when ordering by score was performed.
- the HLA-sequences were available internally from Amersham Pharmacia Biotech (release December 1997), and included 91 alleles from HLA-A, 202 HLA-B, 47 HLA-C, 11 HLA-DPA1 (coding for the ⁇ -chain), 74 HLA-DPB1 ( ⁇ -chain), 18 HLA-DQA1 , 34 HLA-DQB1 , 192 HLA- DR1 and 35 sequences in all of HLA-DR3, -DR4 and -DR5. The length of these sequences range from ⁇ 250bp to -1100bp.
- the 16S rRNA-sequences were collected from GenBank
- Table 1 Details about data sets. The program was written using the Microsoft ® Visual C++ ® , version 5.0 compiler. It was executed on a PC with a Pentium ® MMX 233 MHz processor, 64 MB RAM and Windows ® 95, unless otherwise indicated. All execution times are for the entire program, including I/O.
- the binary SCP matrices were quite dense. The density (i.e. the number of non-zero elements in the matrix) usually lies around a few percent, of course depending on the application. A higher density means that fewer columns are needed in order to cover all rows. This is offset in this case by the fact that all rows were required to be covered multiple times. Another consequence of this high density is that the number of primers needed according to the greedy algorithm could be much higher than in the optimal solution. (Recall that the worst case behaviour of the greedy algorithm is a function of the largest column-sum of elements.)
- Table 2 Some details about the binary SCP matrix. Data are calculated for all primers in the primary set.
- the program could be considered as consisting of two phases.
- the first phase involves constructing all primers and finding out what kind of signal they will get for each sequence.
- the second phase is the optimisation phase, were the SCP is solved.
- Table 3 Number of primers in different stages of the algorithm and time to get signals for all primers.
- the number of primers in the core are for homozygotes.
- One explanation to this high density is that the sequences in the data sets are quite similar to each other, so that most primers will hybridise to and give signal for more than one sequence (either the same or different signals).
- This is also indicated in Table 3, where for some data sets there is a noticeable drop from the number of primers in the first set to the number of primers in the primary set. Most of this reduction is due to a primer having the same signal for all sequences, which in turn means that all sequences have a substring that is similar enough for the primer to hybridise to and that the nucleotide after the primer is the same for all sequences.
- the 16S rRNA data set has a much lower density, and no reduction in the primers going from the first set of primers to the primary set.
- sequences in this data set come from organisms which might be only distantly related to each other, there need not be as much similarity between the sequences as there is in the HLA data sets.
- Table 4 No. of primers after the greedy algorithm and time spent by it. Also final nr. of primers after check for redundancy and the total time spent solving the SCP. *Value from a 300MHz Pentium II with 512MB RAM running Windows NT 4.0. The computation was halted before completion due to time constraints.
- results from combining HLA sequences in order to differentiate between heterozygous individuals can be found in Table 7.
- CFT was only used for the two smallest data sets due to the time requirements. It performed slightly better than the greedy algorithm on those, but only by one primer on each data set.
- Table 7 Results from heterozygous pairs. Number of primers needed, the time spent, how many heterozygotes that did not differ by at least four signals from any other heterozygote and the percentage of total number of heterozygotes. * Value from a 300MHz Pentium II with 512MB
- Table 8 Heterozygous pairs that do not differ enough in their signal patterns, and how many signals they differ with.
- Table 9 Number of primers needed to discriminate between heterozygote HLA samples.
- Primers can be arranged on the surface of a support in such a way that different studied types, genes, alleles, species etc. form easily recognised characters such as figures or letters. These character forming primers can be additional primers of common origin from the gene of interest and be used for validation of the process.
- DNA Four homozygote for DQB cell lines, with alleles 0402, 0301 , 06011 and 0201.
- Amplification reagents PCR mix from the Amersham Pharmacia Biotech HLA DQB typing kit, a prototype kit.
- SAP will degrade (dephosphorylate) all free dNTPs and UDG will remove all dU from the DNA and after heating the strands will be broken at these points. This step is applicable to any DNA fragment.
- A, C,G, T amino TTT AGC CTT AAC GCC T X TGAC GTCA, where X is A, C. G or T.
- Cy2 - ddCTP (equal to fluorescein) 50 ⁇ M Cy3 - ddATP 50 ⁇ M
- the DQB amplification was done according to the method described by Williams et al. -96 using a 33% dUTP mix. After 40 cycles (95°C, 30 sec; 55°C, 30 sec; 72°C, 30 sec), one microliter of the PCR products was tested on a 1.5% agarose gel, before the fragmentation step.
- the samples were frozen and stored until they were used.
- the detection system is a total internal reflection fluorescence (TIRF) system, where microscopic slides are placed on top of a prism with oil on to link a laser beam in to the glass slide.
- the system has light of five different wave lengths from five different lasers to vary between. In this experiment only four were used.
- TIRF total internal reflection fluorescence
- the DNA from the four DQB homozygote cell lines were amplified according to the protocol in Williams et al. -96 with two different concentrations of dUTP. In addition to this, DNA from six different heterozygotes were amplified. All amplifications worked well and the expected 300 bp fragment were seen from all samples.
- Primer chips were washed and fragmented PCR products were incubated on the chip according to the protocol. The image was compared to the expected pattern. The expected pattern was similar to but somewhat different from the recorded pattern, the reason for this is that the set up was planned for a 500 bp fragment, but the actual fragment used was a 300 bp PCR fragment.
- Figure 4 shows the results from a cell line homozygous for the DQB 0204 allele.
- the pattern shown in the image is very close or similar to the expected results from exon 2.
- Another improvement that can be made is the following: As is, the program works only with discrete signals, e.g. either there is a signal 'A' or there is not, either there is a signal 'G' or there is not and so on. A more precise approach would be to predict how strong the signals will be for each primer on each sequence. A rough estimate of the signal strength should be possible given some thermodynamic data about the primers, most notably their melting points. With this information, and knowing the concentration of DNA in the sample among other things, the proportion of primers on the chip that will actually react with the sample DNA should be possible to estimate. It would thus allow a rough estimation of what strength the different signals will have. It will not be very precise, and the estimate might possibly be off by a factor 2 or more, but it will still give some information about what signals to expect from the chip.
- the temperature at which the reaction on the chip is carried out could be optimised as well. Since the sequences are known, it is possible to estimate the melting point of any primer to any sequence when there are a few mismatches. This could be done for all primers on all sequences, and a range of temperatures calculated. The actual temperature to use could then be chosen so as to be as optimal for as many primers on as many sequences as possible, instead of as now at a standard temperature.
- the algorithm itself could be improved.
- the complexity of the redundancy-check phase can be slightly reduced by having a vector consisting of the sums of the rows in each node. For each child-node, the column to be removed is then subtracted from this vector of sums. This operation can be carried out in O(m), and the final complexity will then be 0(m ⁇ N(p, p)) instead.
- the greedy algorithm another possible improvement is to check the primer set for redundancy each time a primer was added.
- the complexity for the greedy algorithm will be the same, as the check will take 0(m xp) (i.e. same as each iteration in the greedy algorithm) each time (with the improvement just mentioned). The check could take longer, but that is unlikely as that would imply that one primer could make several other primers redundant.
- the main advantage is, of course, that no redundancy check with its rather high complexity is needed afterwards.
- this method is only capable of identifying known gene- variants. If applied to a sample with a previously unknown variant, it is very probable that this new variant will be falsely identified as one of the known variants. It would be very advantageous if this method could be augmented in some way to recognise this fact, and give a warning if there could be an unknown variant in the sample. It could be done by giving a warning when the signal pattern gained differs from the signal pattern from any known variants, but this might not be enough. There is no guarantee that the new variant could not differ in some place not affecting any of the existing primers, which would lead to the new variant being indistinguishable from any of the known variants. Some other way is probably needed as well.
- TTTGCAAGTCCTCCTC ⁇ T ⁇ TCTCCTCCCGGT ⁇ TCCACAACCCGGTA ⁇ TTGGCCAGGTGGACA ⁇ TTGCGGTTCCTGGAG ⁇ TTCAGCCAGAAGGAC ⁇ TTGACTCGCCTCTGC ⁇ TTTCCAGGACTCGGC
- TTTTTGTACAGACGC TTCGGTCTCCTTCTT TTGCAATGGGGAGCC TTTGGATCTGGATAA ⁇ ⁇ GATGAAGATGAG
- GGTCACACCCCG GGGAGTTCCGGGC AGGAGGAGACAAC GGGTGGACACAAC TCTGCTCGGTGAC TGGGGCGGCTTGA GCGCACGTCCTCC TAGGATTTCGTGTA
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Engineering & Computer Science (AREA)
- Analytical Chemistry (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Cell Biology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU50625/00A AU5062500A (en) | 1999-04-26 | 2000-04-20 | Primers for identifying typing or classifying nucleic acids |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP99303215 | 1999-04-26 | ||
EP99303215.0 | 1999-04-26 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2000065088A2 true WO2000065088A2 (fr) | 2000-11-02 |
WO2000065088A3 WO2000065088A3 (fr) | 2001-08-09 |
Family
ID=8241346
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2000/003636 WO2000065088A2 (fr) | 1999-04-26 | 2000-04-20 | Amorces servant a l'identification, le typage ou la classification d'acides nucleiques |
Country Status (2)
Country | Link |
---|---|
AU (1) | AU5062500A (fr) |
WO (1) | WO2000065088A2 (fr) |
Cited By (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001036679A2 (fr) * | 1999-11-15 | 2001-05-25 | Hartwell John G | METHODES DE GENERATION DE FRAGMENTS D'ADNc SIMPLE BRIN |
WO2001092572A1 (fr) * | 2000-06-01 | 2001-12-06 | Nisshinbo Industries, Inc. | Ensemble et procede de determination du type de hla |
WO2002029659A1 (fr) * | 2000-10-04 | 2002-04-11 | International Reagents Corporation | Systeme medical utilisant des puces d'adn |
WO2003066893A1 (fr) * | 2002-02-04 | 2003-08-14 | Vermicon Ag | Procede de detection specifique rapide de bacteries pathogenes presentes dans des denrees alimentaires |
US6713257B2 (en) | 2000-08-25 | 2004-03-30 | Rosetta Inpharmatics Llc | Gene discovery using microarrays |
WO2004029289A2 (fr) * | 2002-09-26 | 2004-04-08 | Roche Diagnostics Gmbh | Detection de la susceptibilite de developper des maladies auto-immunes |
EP1536021A1 (fr) * | 2003-11-27 | 2005-06-01 | Consortium National de Recherche en Genomique (CNRG) | Méthode pour le typage de HLA |
US7507568B2 (en) | 2002-09-25 | 2009-03-24 | The Proctor & Gamble Company | Three dimensional coordinates of HPTPbeta |
US7589212B2 (en) | 2006-06-27 | 2009-09-15 | Procter & Gamble Company | Human protein tyrosine phosphatase inhibitors and methods of use |
US7622593B2 (en) | 2006-06-27 | 2009-11-24 | The Procter & Gamble Company | Human protein tyrosine phosphatase inhibitors and methods of use |
US7632862B2 (en) | 2002-09-25 | 2009-12-15 | Procter & Gamble Company | Pharmaceutical compositions that modulate HPTPbeta activity |
US7795444B2 (en) | 2006-06-27 | 2010-09-14 | Warner Chilcott Company | Human protein tyrosine phosphatase inhibitors and methods of use |
US7807447B1 (en) | 2000-08-25 | 2010-10-05 | Merck Sharp & Dohme Corp. | Compositions and methods for exon profiling |
EP2371865A2 (fr) | 2006-04-07 | 2011-10-05 | Warner Chilcott Company, LLC | Anticorps se liant à la protéine tyrosine phosphatase béta humaine (HPTP-ß) et leurs utilisations |
US8569348B2 (en) | 2009-07-06 | 2013-10-29 | Aerpio Therapeutics Inc. | Compounds, compositions, and methods for preventing metastasis of cancer cells |
EP2722395A1 (fr) | 2001-10-15 | 2014-04-23 | Bioarray Solutions Ltd | Analyse multiplexée de loci polymorphes par l'interrogation concurrente et la détection au moyen d'enzyme |
US8846685B2 (en) | 2006-06-27 | 2014-09-30 | Aerpio Therapeutics Inc. | Human protein tyrosine phosphatase inhibitors and methods of use |
US8883832B2 (en) | 2009-07-06 | 2014-11-11 | Aerpio Therapeutics Inc. | Compounds, compositions, and methods for preventing metastasis of cancer cells |
AU2012200697B2 (en) * | 2001-05-07 | 2015-02-19 | Agriculture Victoria Services Pty Ltd | Modification of plant and seed development and plant responses to stresses and stimuli (4) |
US9096555B2 (en) | 2009-01-12 | 2015-08-04 | Aerpio Therapeutics, Inc. | Methods for treating vascular leak syndrome |
US12043664B2 (en) | 2011-10-13 | 2024-07-23 | EyePoint Pharmaceuticals, Inc. | Methods for treating vascular leak syndrome and cancer |
CN118506875A (zh) * | 2024-07-12 | 2024-08-16 | 中国科学院心理研究所 | Rna病毒引物优选的设计的方法、设备、介质和程序产品 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1995013396A2 (fr) * | 1993-11-11 | 1995-05-18 | U-Gene Research B.V. | Procede d'identification de micro-organismes, et dispositifs pour sa mise en oeuvre |
WO1995015400A1 (fr) * | 1993-12-03 | 1995-06-08 | The Johns Hopkins University | Genotypage par analyse simultanee de multiples loci a microsatellites |
US5883238A (en) * | 1993-03-18 | 1999-03-16 | N.V. Innogenetics S.A. | Process for typing HLA-B using specific primers and probes sets |
WO1999019509A2 (fr) * | 1997-10-10 | 1999-04-22 | Visible Genetics Inc. | Procede et trousse d'amplification, de sequençage et de typage de genes hla de classe 1 classiques |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH08308596A (ja) * | 1995-03-10 | 1996-11-26 | Wakunaga Pharmaceut Co Ltd | Hlaの検出 |
-
2000
- 2000-04-20 WO PCT/EP2000/003636 patent/WO2000065088A2/fr active Application Filing
- 2000-04-20 AU AU50625/00A patent/AU5062500A/en not_active Abandoned
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5883238A (en) * | 1993-03-18 | 1999-03-16 | N.V. Innogenetics S.A. | Process for typing HLA-B using specific primers and probes sets |
WO1995013396A2 (fr) * | 1993-11-11 | 1995-05-18 | U-Gene Research B.V. | Procede d'identification de micro-organismes, et dispositifs pour sa mise en oeuvre |
WO1995015400A1 (fr) * | 1993-12-03 | 1995-06-08 | The Johns Hopkins University | Genotypage par analyse simultanee de multiples loci a microsatellites |
WO1999019509A2 (fr) * | 1997-10-10 | 1999-04-22 | Visible Genetics Inc. | Procede et trousse d'amplification, de sequençage et de typage de genes hla de classe 1 classiques |
Non-Patent Citations (11)
Title |
---|
BEIN G ET AL.: "Rapid HLA-DRB1 genotyping by nested PCR amplification" TISSUE ANTIGENS, vol. 39, 1992, pages 68-73, XP000949259 * |
BUNCE M ET AL.: "The PCR-SSP manager computer program: A tool for maintaining sequence alignments and automatically updating the specificities of PCR-SSP primers and primer mixes" TISSUE ANTIGENS, vol. 52, 1998, pages 158-174, XP000972134 * |
CEREB N ET AL: "LOCUS-SPECIFIC AMPLIFICATION OF HLA CLASS I GENES FROM GENOMIC DNA: LOCUS-SPECIFIC SEQUENCES IN THE FIRST AND THIRD INTRONS OF HLA-A, -B, AND -C ALLELES" TISSUE ANTIGENS,DK,MUNKSGAARD, COPENHAGEN, vol. 45, 1995, pages 1-11, XP000197333 ISSN: 0001-2815 * |
DATABASE WPI , 1997 Derwent Publications Ltd., London, GB; AN 059711 XP002133096 "Detection and typing of class I MHC HLA-DR antigens - can check multiple specimens easily and type all HLA-DR (D-related) antigens known to be present in the Japanese population" -& JP 08 308596 A (WAKUNAGA SEIYAKU KK), 26 November 1996 (1996-11-26) * |
DOI K AND IMAI H: "Greedy algorithms for finding a small set of primers satisfying cover and length resolution conditions in PCR experiments" GENOME INF. SER., vol. 8, 1997, pages 43-52, XP000900076 * |
LEVINE J E ET AL.: "SSOP typing of the tenth international histocompatibility workshop reference cell lines for HLA genes" TISSUE ANTIGENS, vol. 44, 1994, pages 174-183, XP000972173 * |
LO V M: "HEURISTIC ALGORITHMS FOR TASK ASSIGNMENT IN DISTRIBUTED SYSTEMS" IEEE TRANSACTIONS ON COMPUTERS,US,IEEE INC. NEW YORK, vol. 37, no. 11, 1 November 1988 (1988-11-01), pages 1384-1397, XP000005083 ISSN: 0018-9340 * |
METSPALU A ET AL.: "Arrared primer extension (APEX) for mutation detection using gene specific DNA chips" AMERICAN JOURNAL OF HUMAN GENETICS, vol. 61, no. 4Sup, 1997, page A224 XP000900002 * |
OLERUP O ET AL.: "HLA-DQB1 and -DQA1 typing by PCR ampification with sequence-specific primers (PCR-SSP) in 2 hours" TISSUE ANTIGENS, vol. 41, 1993, pages 119-134, XP000972084 * |
PIRRUNG M C ET AL.: "Design and use of a solid phase DNA-based computational device to solve satisfiability (SAT) problems" FASEB JOURNAL, vol. 11, no. 9, 1997, page A1214 XP002133095 * |
VASKO F J: "An efficient heuristic for large set covering problems" NAVAL RESEARCH LOGISTICS QUARTERLY, vol. 31, 1984, pages 163-171, XP000890158 cited in the application * |
Cited By (51)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001036679A2 (fr) * | 1999-11-15 | 2001-05-25 | Hartwell John G | METHODES DE GENERATION DE FRAGMENTS D'ADNc SIMPLE BRIN |
WO2001036679A3 (fr) * | 1999-11-15 | 2001-11-22 | John G Hartwell | METHODES DE GENERATION DE FRAGMENTS D'ADNc SIMPLE BRIN |
WO2001092572A1 (fr) * | 2000-06-01 | 2001-12-06 | Nisshinbo Industries, Inc. | Ensemble et procede de determination du type de hla |
US6713257B2 (en) | 2000-08-25 | 2004-03-30 | Rosetta Inpharmatics Llc | Gene discovery using microarrays |
US7807447B1 (en) | 2000-08-25 | 2010-10-05 | Merck Sharp & Dohme Corp. | Compositions and methods for exon profiling |
WO2002029659A1 (fr) * | 2000-10-04 | 2002-04-11 | International Reagents Corporation | Systeme medical utilisant des puces d'adn |
AU2012200697B2 (en) * | 2001-05-07 | 2015-02-19 | Agriculture Victoria Services Pty Ltd | Modification of plant and seed development and plant responses to stresses and stimuli (4) |
EP2722395A1 (fr) | 2001-10-15 | 2014-04-23 | Bioarray Solutions Ltd | Analyse multiplexée de loci polymorphes par l'interrogation concurrente et la détection au moyen d'enzyme |
WO2003066893A1 (fr) * | 2002-02-04 | 2003-08-14 | Vermicon Ag | Procede de detection specifique rapide de bacteries pathogenes presentes dans des denrees alimentaires |
US7507568B2 (en) | 2002-09-25 | 2009-03-24 | The Proctor & Gamble Company | Three dimensional coordinates of HPTPbeta |
US7632862B2 (en) | 2002-09-25 | 2009-12-15 | Procter & Gamble Company | Pharmaceutical compositions that modulate HPTPbeta activity |
US7769575B2 (en) | 2002-09-25 | 2010-08-03 | Warner Chilcott, LLC | Three dimensional coordinates of HPTPbeta |
WO2004029289A2 (fr) * | 2002-09-26 | 2004-04-08 | Roche Diagnostics Gmbh | Detection de la susceptibilite de developper des maladies auto-immunes |
WO2004029289A3 (fr) * | 2002-09-26 | 2004-07-22 | Roche Diagnostics Gmbh | Detection de la susceptibilite de developper des maladies auto-immunes |
EP1536021A1 (fr) * | 2003-11-27 | 2005-06-01 | Consortium National de Recherche en Genomique (CNRG) | Méthode pour le typage de HLA |
US8435740B2 (en) | 2003-11-27 | 2013-05-07 | Commissariat A L'energie Atomique Et Aux Energies Alternatives | Method for HLA typing |
WO2005052189A3 (fr) * | 2003-11-27 | 2005-10-20 | Consortium Nat De Rech En Geno | Methode de typage hla |
US20120157347A1 (en) * | 2003-11-27 | 2012-06-21 | Commissariat A L'energie Atomique | Method for hla typing |
WO2005052189A2 (fr) * | 2003-11-27 | 2005-06-09 | Consortium National De Recherche En Genomique (Cnrg) | Methode de typage hla |
US7820377B2 (en) | 2003-11-27 | 2010-10-26 | Commissariat A L'energie Atomique | Method for HLA typing |
EP2267159A3 (fr) * | 2003-11-27 | 2011-08-10 | Commissariat à l'Énergie Atomique et aux Énergies Alternatives | Méthode pour le typage de HLA-B |
JP2007512014A (ja) * | 2003-11-27 | 2007-05-17 | コンソルシャム ナショナル ドゥ ルシェルシュ アン ジェノミック(セーエヌエルジェー) | Hlaタイピング方法 |
JP2011200244A (ja) * | 2003-11-27 | 2011-10-13 | Commissariat A L'energie Atomique Et Aux Energies Alternatives | Hlaタイピング方法 |
EP2272986A3 (fr) * | 2003-11-27 | 2011-11-02 | Commissariat à l'Énergie Atomique et aux Énergies Alternatives | Procédé pour le typage de HLA |
US9926367B2 (en) | 2006-04-07 | 2018-03-27 | Aerpio Therapeutics, Inc. | Antibodies that bind human protein tyrosine phosphatase beta (HPTPbeta) and uses thereof |
EP2371865A2 (fr) | 2006-04-07 | 2011-10-05 | Warner Chilcott Company, LLC | Anticorps se liant à la protéine tyrosine phosphatase béta humaine (HPTP-ß) et leurs utilisations |
EP3252079A1 (fr) | 2006-04-07 | 2017-12-06 | Aerpio Therapeutics, Inc. | Anticorps se liant à la protéine tyrosine phosphatase béta humaine (hptp-ss) et leurs utilisations |
US11814425B2 (en) | 2006-04-07 | 2023-11-14 | Eye Point Pharmaceuticals, Inc. | Antibodies that bind human protein tyrosine phosphatase beta (HPTPbeta) and uses thereof |
US8524235B2 (en) | 2006-04-07 | 2013-09-03 | Aeripo Therapeutics Inc. | Method for treating coronary artery disease using antibody binding human protein tyrosine phosphatase beta(HPTPbeta) |
US8106078B2 (en) | 2006-06-27 | 2012-01-31 | Warner Chilcott Company, Llc | Human protein tyrosine phosphatase inhibitors and methods of use |
US7622593B2 (en) | 2006-06-27 | 2009-11-24 | The Procter & Gamble Company | Human protein tyrosine phosphatase inhibitors and methods of use |
US8329916B2 (en) | 2006-06-27 | 2012-12-11 | Aerpio Therapeutics Inc. | Human protein tyrosine phosphatase inhibitors and method of use |
US7589212B2 (en) | 2006-06-27 | 2009-09-15 | Procter & Gamble Company | Human protein tyrosine phosphatase inhibitors and methods of use |
US8258311B2 (en) | 2006-06-27 | 2012-09-04 | Aerpio Therapeutics Inc. | Human protein tyrosine phosphatase inhibitors and methods of use |
US8846685B2 (en) | 2006-06-27 | 2014-09-30 | Aerpio Therapeutics Inc. | Human protein tyrosine phosphatase inhibitors and methods of use |
US10463650B2 (en) | 2006-06-27 | 2019-11-05 | Aerpio Pharmaceuticals, Inc. | Human protein tyrosine phosphatase inhibitors and methods of use |
US8895563B2 (en) | 2006-06-27 | 2014-11-25 | Aerpio Therapeutics, Inc. | Human protein tyrosine phosphatase inhibitors and methods of use |
US8946232B2 (en) | 2006-06-27 | 2015-02-03 | Aerpio Therapeutics, Inc. | Human protein tyrosine phosphatase inhibitors and methods of use |
US8188125B2 (en) | 2006-06-27 | 2012-05-29 | Aerpio Therapeutics Inc. | Human protein tyrosine phosphatase inhibitors and methods of use |
US8338615B2 (en) | 2006-06-27 | 2012-12-25 | Aerpio Therapeutics Inc. | Human protein tyrosine phosphatase inhibitors and methods of use |
US9126958B2 (en) | 2006-06-27 | 2015-09-08 | Aerpio Therapeutics, Inc. | Human protein tyrosine phosphatase inhibitors and methods of use |
US7795444B2 (en) | 2006-06-27 | 2010-09-14 | Warner Chilcott Company | Human protein tyrosine phosphatase inhibitors and methods of use |
US9795594B2 (en) | 2006-06-27 | 2017-10-24 | Aerpio Therapeutics, Inc. | Human protein tyrosine phosphatase inhibitors and methods of use |
USRE46592E1 (en) | 2006-06-27 | 2017-10-31 | Aerpio Therapeutics, Inc. | Human protein tyrosine phosphatase inhibitors and methods of use |
US9096555B2 (en) | 2009-01-12 | 2015-08-04 | Aerpio Therapeutics, Inc. | Methods for treating vascular leak syndrome |
US9174950B2 (en) | 2009-07-06 | 2015-11-03 | Aerpio Therapeutics, Inc. | Compounds, compositions, and methods for preventing metastasis of cancer cells |
US9949956B2 (en) | 2009-07-06 | 2018-04-24 | Aerpio Therapeutics, Inc. | Compounds, compositions, and methods for preventing metastasis of cancer cells |
US8883832B2 (en) | 2009-07-06 | 2014-11-11 | Aerpio Therapeutics Inc. | Compounds, compositions, and methods for preventing metastasis of cancer cells |
US8569348B2 (en) | 2009-07-06 | 2013-10-29 | Aerpio Therapeutics Inc. | Compounds, compositions, and methods for preventing metastasis of cancer cells |
US12043664B2 (en) | 2011-10-13 | 2024-07-23 | EyePoint Pharmaceuticals, Inc. | Methods for treating vascular leak syndrome and cancer |
CN118506875A (zh) * | 2024-07-12 | 2024-08-16 | 中国科学院心理研究所 | Rna病毒引物优选的设计的方法、设备、介质和程序产品 |
Also Published As
Publication number | Publication date |
---|---|
WO2000065088A3 (fr) | 2001-08-09 |
AU5062500A (en) | 2000-11-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2000065088A2 (fr) | Amorces servant a l'identification, le typage ou la classification d'acides nucleiques | |
US6703228B1 (en) | Methods and products related to genotyping and DNA analysis | |
Ellegren | Sequencing goes 454 and takes large‐scale genomics into the wild | |
Hudson | Sequencing breakthroughs for genomic ecology and evolutionary biology | |
Sanchez et al. | A multiplex assay with 52 single nucleotide polymorphisms for human identification | |
Slate et al. | Gene mapping in the wild with SNPs: guidelines and future directions | |
Twyford et al. | Next-generation hybridization and introgression | |
CA2771330C (fr) | Procedes et materiaux permettant l'identification de races canines | |
Kenta et al. | Multiplex SNP‐SCALE: a cost‐effective medium‐throughput single nucleotide polymorphism genotyping method | |
US20100261189A1 (en) | System and method for detection of HLA Variants | |
KR20060103813A (ko) | 뉴클레오티드 서열의 실질적인 대표체 | |
EP1056889B1 (fr) | Procedes associes a la determination d'un genotype et a l'analyse de l'adn | |
US20140141436A1 (en) | Methods and Compositions for Very High Resolution Genotyping of HLA | |
Garosi et al. | Defining best practice for microarray analyses in nutrigenomic studies | |
CN108138226B (zh) | 单核苷酸多态性和插入缺失的多等位基因基因分型 | |
Lakshmi et al. | Mouse genomic representational oligonucleotide microarray analysis: detection of copy number variations in normal and tumor specimens | |
US20110053789A1 (en) | Mircoarray methods | |
US20040023275A1 (en) | Methods for genomic analysis | |
Lockhart et al. | DNA arrays and gene expression analysis in the brain | |
US6963805B2 (en) | Methods for identifying the evolutionarily conserved sequences | |
Tsukagoshi et al. | Isolation and characterization of 17 polymorphic microsatellite loci with tri-and tetra-nucleotide repeat motifs for Ayu (Plecoglossus altivelis) using next-generation sequencing | |
US20080026367A9 (en) | Methods for genomic analysis | |
US20040126800A1 (en) | Regulatory single nucleotide polymorphisms and methods therefor | |
Tebbutt | Genotyping of single nucleotide polymorphisms by arrayed primer extension | |
KR20250034300A (ko) | 재조합 이벤트를 검출하기 위한 방법 및 시스템 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
AK | Designated states |
Kind code of ref document: A3 Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A3 Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
122 | Ep: pct application non-entry in european phase | ||
NENP | Non-entry into the national phase |
Ref country code: JP |