US20130310549A1 - Primers for analyzing methylated sequences and methods of use thereof - Google Patents
Primers for analyzing methylated sequences and methods of use thereof Download PDFInfo
- Publication number
- US20130310549A1 US20130310549A1 US13/770,646 US201313770646A US2013310549A1 US 20130310549 A1 US20130310549 A1 US 20130310549A1 US 201313770646 A US201313770646 A US 201313770646A US 2013310549 A1 US2013310549 A1 US 2013310549A1
- Authority
- US
- United States
- Prior art keywords
- primer
- sequence
- seq
- primers
- nucleic acid
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000000034 method Methods 0.000 title abstract description 79
- 208000035657 Abasia Diseases 0.000 claims abstract description 40
- 150000007523 nucleic acids Chemical class 0.000 claims description 89
- 102000039446 nucleic acids Human genes 0.000 claims description 78
- 108020004707 nucleic acids Proteins 0.000 claims description 78
- 125000003729 nucleotide group Chemical group 0.000 claims description 51
- 239000002773 nucleotide Substances 0.000 claims description 49
- 108090000623 proteins and genes Proteins 0.000 claims description 32
- -1 4,4′-Dimethoxytrityloxy Chemical group 0.000 claims description 29
- 108091029430 CpG site Proteins 0.000 claims description 28
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 claims description 27
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 claims description 13
- 238000000137 annealing Methods 0.000 claims description 13
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical group O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 claims description 9
- 229940104302 cytosine Drugs 0.000 claims description 8
- 102000004169 proteins and genes Human genes 0.000 claims description 7
- 229920001519 homopolymer Polymers 0.000 claims description 6
- 229940035893 uracil Drugs 0.000 claims description 6
- 230000001973 epigenetic effect Effects 0.000 claims description 3
- IMRNWKAKFIGPOS-UHFFFAOYSA-N 3-[3-[bis(4-methoxyphenyl)-phenylmethoxy]propoxy-[di(propan-2-yl)amino]phosphanyl]oxypropanenitrile Chemical compound C1=CC(OC)=CC=C1C(OCCCOP(OCCC#N)N(C(C)C)C(C)C)(C=1C=CC(OC)=CC=1)C1=CC=CC=C1 IMRNWKAKFIGPOS-UHFFFAOYSA-N 0.000 claims description 2
- 101710134106 Twist-related protein Proteins 0.000 claims 2
- 239000003550 marker Substances 0.000 claims 2
- 238000007069 methylation reaction Methods 0.000 abstract description 81
- 230000011987 methylation Effects 0.000 abstract description 80
- 102000053602 DNA Human genes 0.000 description 49
- 108020004414 DNA Proteins 0.000 description 49
- 238000012163 sequencing technique Methods 0.000 description 45
- 230000003321 amplification Effects 0.000 description 43
- 238000003199 nucleic acid amplification method Methods 0.000 description 43
- 238000001514 detection method Methods 0.000 description 29
- 239000012634 fragment Substances 0.000 description 29
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 28
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 28
- CTMZLDSMFCVUNX-VMIOUTBZSA-N cytidylyl-(3'->5')-guanosine Chemical class O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=C(C(N=C(N)N3)=O)N=C2)O)[C@@H](CO)O1 CTMZLDSMFCVUNX-VMIOUTBZSA-N 0.000 description 28
- 238000006243 chemical reaction Methods 0.000 description 20
- 108091029523 CpG island Proteins 0.000 description 19
- 210000004027 cell Anatomy 0.000 description 18
- 238000009396 hybridization Methods 0.000 description 18
- 230000000295 complement effect Effects 0.000 description 17
- 238000007855 methylation-specific PCR Methods 0.000 description 17
- 108091034117 Oligonucleotide Proteins 0.000 description 15
- 239000011324 bead Substances 0.000 description 15
- 239000000523 sample Substances 0.000 description 15
- LSNNMFCWUKXFEE-UHFFFAOYSA-M Bisulfite Chemical compound OS([O-])=O LSNNMFCWUKXFEE-UHFFFAOYSA-M 0.000 description 14
- 206010028980 Neoplasm Diseases 0.000 description 13
- 201000011510 cancer Diseases 0.000 description 13
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 13
- 239000007787 solid Substances 0.000 description 13
- 239000000758 substrate Substances 0.000 description 13
- 238000004458 analytical method Methods 0.000 description 12
- 230000000875 corresponding effect Effects 0.000 description 12
- 230000003287 optical effect Effects 0.000 description 12
- 108091093088 Amplicon Proteins 0.000 description 10
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 description 10
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 10
- 238000010348 incorporation Methods 0.000 description 10
- 102100037369 Nidogen-1 Human genes 0.000 description 9
- 201000010099 disease Diseases 0.000 description 9
- ONCZDRURRATYFI-QTCHDTBASA-N methyl (2z)-2-methoxyimino-2-[2-[[(e)-1-[3-(trifluoromethyl)phenyl]ethylideneamino]oxymethyl]phenyl]acetate Chemical group CO\N=C(/C(=O)OC)C1=CC=CC=C1CO\N=C(/C)C1=CC=CC(C(F)(F)F)=C1 ONCZDRURRATYFI-QTCHDTBASA-N 0.000 description 9
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 9
- 102000004190 Enzymes Human genes 0.000 description 8
- 108090000790 Enzymes Proteins 0.000 description 8
- DZBUGLKDJFMEHC-UHFFFAOYSA-N acridine Chemical compound C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 8
- 239000007850 fluorescent dye Substances 0.000 description 8
- BBEAQIROQSPTKN-UHFFFAOYSA-N pyrene Chemical compound C1=CC=C2C=CC3=CC=CC4=CC=C1C2=C43 BBEAQIROQSPTKN-UHFFFAOYSA-N 0.000 description 8
- 108090000364 Ligases Proteins 0.000 description 7
- 102000003960 Ligases Human genes 0.000 description 7
- 239000000975 dye Substances 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- LRSASMSXMSNRBT-UHFFFAOYSA-N 5-methylcytosine Chemical compound CC1=CNC(=O)N=C1N LRSASMSXMSNRBT-UHFFFAOYSA-N 0.000 description 6
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 6
- 101000924577 Homo sapiens Adenomatous polyposis coli protein Proteins 0.000 description 6
- 239000003153 chemical reaction reagent Substances 0.000 description 6
- 239000003795 chemical substances by application Substances 0.000 description 6
- GLNDAGDHSLMOKX-UHFFFAOYSA-N coumarin 120 Chemical compound C1=C(N)C=CC2=C1OC(=O)C=C2C GLNDAGDHSLMOKX-UHFFFAOYSA-N 0.000 description 6
- IINNWAYUJNWZRM-UHFFFAOYSA-L erythrosin B Chemical compound [Na+].[Na+].[O-]C(=O)C1=CC=CC=C1C1=C2C=C(I)C(=O)C(I)=C2OC2=C(I)C([O-])=C(I)C=C21 IINNWAYUJNWZRM-UHFFFAOYSA-L 0.000 description 6
- 238000002844 melting Methods 0.000 description 6
- 230000008018 melting Effects 0.000 description 6
- 230000035772 mutation Effects 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 102100034540 Adenomatous polyposis coli protein Human genes 0.000 description 5
- 108700028369 Alleles Proteins 0.000 description 5
- 238000013459 approach Methods 0.000 description 5
- 238000001574 biopsy Methods 0.000 description 5
- 230000014509 gene expression Effects 0.000 description 5
- 230000000379 polymerizing effect Effects 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- 239000001509 sodium citrate Substances 0.000 description 5
- HBEDSQVIWPRPAY-UHFFFAOYSA-N 2,3-dihydrobenzofuran Chemical compound C1=CC=C2OCCC2=C1 HBEDSQVIWPRPAY-UHFFFAOYSA-N 0.000 description 4
- PXBFMLJZNCDSMP-UHFFFAOYSA-N 2-Aminobenzamide Chemical compound NC(=O)C1=CC=CC=C1N PXBFMLJZNCDSMP-UHFFFAOYSA-N 0.000 description 4
- OBYNJKLOYWCXEP-UHFFFAOYSA-N 2-[3-(dimethylamino)-6-dimethylazaniumylidenexanthen-9-yl]-4-isothiocyanatobenzoate Chemical compound C=12C=CC(=[N+](C)C)C=C2OC2=CC(N(C)C)=CC=C2C=1C1=CC(N=C=S)=CC=C1C([O-])=O OBYNJKLOYWCXEP-UHFFFAOYSA-N 0.000 description 4
- 238000001712 DNA sequencing Methods 0.000 description 4
- XPDXVDYUQZHFPV-UHFFFAOYSA-N Dansyl Chloride Chemical compound C1=CC=C2C(N(C)C)=CC=CC2=C1S(Cl)(=O)=O XPDXVDYUQZHFPV-UHFFFAOYSA-N 0.000 description 4
- 102100031780 Endonuclease Human genes 0.000 description 4
- 101000988401 Homo sapiens PDZ and LIM domain protein 3 Proteins 0.000 description 4
- 102100029177 PDZ and LIM domain protein 3 Human genes 0.000 description 4
- 239000004793 Polystyrene Substances 0.000 description 4
- AUNGANRZJHBGPY-SCRDCRAPSA-N Riboflavin Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-SCRDCRAPSA-N 0.000 description 4
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 4
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 4
- 238000002835 absorbance Methods 0.000 description 4
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 4
- ZYGHJZDHTFUPRJ-UHFFFAOYSA-N coumarin Chemical compound C1=CC=C2OC(=O)C=CC2=C1 ZYGHJZDHTFUPRJ-UHFFFAOYSA-N 0.000 description 4
- 239000005547 deoxyribonucleotide Substances 0.000 description 4
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 4
- 208000035475 disorder Diseases 0.000 description 4
- YQGOJNYOYNNSMM-UHFFFAOYSA-N eosin Chemical compound [Na+].OC(=O)C1=CC=CC=C1C1=C2C=C(Br)C(=O)C(Br)=C2OC2=C(Br)C(O)=C(Br)C=C21 YQGOJNYOYNNSMM-UHFFFAOYSA-N 0.000 description 4
- VYXSBFYARXAAKO-UHFFFAOYSA-N ethyl 2-[3-(ethylamino)-6-ethylimino-2,7-dimethylxanthen-9-yl]benzoate;hydron;chloride Chemical compound [Cl-].C1=2C=C(C)C(NCC)=CC=2OC2=CC(=[NH+]CC)C(C)=CC2=C1C1=CC=CC=C1C(=O)OCC VYXSBFYARXAAKO-UHFFFAOYSA-N 0.000 description 4
- 239000012530 fluid Substances 0.000 description 4
- GVEPBJHOBDJJJI-UHFFFAOYSA-N fluoranthrene Natural products C1=CC(C2=CC=CC=C22)=C3C2=CC=CC3=C1 GVEPBJHOBDJJJI-UHFFFAOYSA-N 0.000 description 4
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 4
- 239000000499 gel Substances 0.000 description 4
- 239000011521 glass Substances 0.000 description 4
- 230000006607 hypermethylation Effects 0.000 description 4
- 238000003384 imaging method Methods 0.000 description 4
- 150000002540 isothiocyanates Chemical class 0.000 description 4
- 239000003960 organic solvent Substances 0.000 description 4
- 229920002223 polystyrene Polymers 0.000 description 4
- 238000003753 real-time PCR Methods 0.000 description 4
- 229920002477 rna polymer Polymers 0.000 description 4
- 150000003839 salts Chemical class 0.000 description 4
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 238000012176 true single molecule sequencing Methods 0.000 description 4
- 238000005406 washing Methods 0.000 description 4
- QGKMIGUHVLGJBR-UHFFFAOYSA-M (4z)-1-(3-methylbutyl)-4-[[1-(3-methylbutyl)quinolin-1-ium-4-yl]methylidene]quinoline;iodide Chemical compound [I-].C12=CC=CC=C2N(CCC(C)C)C=CC1=CC1=CC=[N+](CCC(C)C)C2=CC=CC=C12 QGKMIGUHVLGJBR-UHFFFAOYSA-M 0.000 description 3
- GOLORTLGFDVFDW-UHFFFAOYSA-N 3-(1h-benzimidazol-2-yl)-7-(diethylamino)chromen-2-one Chemical compound C1=CC=C2NC(C3=CC4=CC=C(C=C4OC3=O)N(CC)CC)=NC2=C1 GOLORTLGFDVFDW-UHFFFAOYSA-N 0.000 description 3
- 108091006146 Channels Proteins 0.000 description 3
- 108010037462 Cyclooxygenase 2 Proteins 0.000 description 3
- 230000007067 DNA methylation Effects 0.000 description 3
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 3
- 102100038280 Prostaglandin G/H synthase 2 Human genes 0.000 description 3
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 3
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 3
- 230000002159 abnormal effect Effects 0.000 description 3
- 238000004873 anchoring Methods 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 239000012472 biological sample Substances 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 239000005289 controlled pore glass Substances 0.000 description 3
- 150000002118 epoxides Chemical class 0.000 description 3
- 230000005284 excitation Effects 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 238000005286 illumination Methods 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 150000002500 ions Chemical class 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 239000002751 oligonucleotide probe Substances 0.000 description 3
- 102000040430 polynucleotide Human genes 0.000 description 3
- 108091033319 polynucleotide Proteins 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- 238000012175 pyrosequencing Methods 0.000 description 3
- 230000002285 radioactive effect Effects 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 238000000492 total internal reflection fluorescence microscopy Methods 0.000 description 3
- GIANIJCPTPUNBA-QMMMGPOBSA-N (2s)-3-(4-hydroxyphenyl)-2-nitramidopropanoic acid Chemical compound [O-][N+](=O)N[C@H](C(=O)O)CC1=CC=C(O)C=C1 GIANIJCPTPUNBA-QMMMGPOBSA-N 0.000 description 2
- DUFUXAHBRPMOFG-UHFFFAOYSA-N 1-(4-anilinonaphthalen-1-yl)pyrrole-2,5-dione Chemical compound O=C1C=CC(=O)N1C(C1=CC=CC=C11)=CC=C1NC1=CC=CC=C1 DUFUXAHBRPMOFG-UHFFFAOYSA-N 0.000 description 2
- ZTTARJIAPRWUHH-UHFFFAOYSA-N 1-isothiocyanatoacridine Chemical compound C1=CC=C2C=C3C(N=C=S)=CC=CC3=NC2=C1 ZTTARJIAPRWUHH-UHFFFAOYSA-N 0.000 description 2
- RUDINRUXCKIXAJ-UHFFFAOYSA-N 2,2,3,3,4,4,5,5,6,6,7,7,8,8,9,9,10,10,11,11,12,12,13,13,14,14,14-heptacosafluorotetradecanoic acid Chemical compound OC(=O)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)F RUDINRUXCKIXAJ-UHFFFAOYSA-N 0.000 description 2
- IOOMXAQUNPWDLL-UHFFFAOYSA-N 2-[6-(diethylamino)-3-(diethyliminiumyl)-3h-xanthen-9-yl]-5-sulfobenzene-1-sulfonate Chemical compound C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=C(S(O)(=O)=O)C=C1S([O-])(=O)=O IOOMXAQUNPWDLL-UHFFFAOYSA-N 0.000 description 2
- LAXVMANLDGWYJP-UHFFFAOYSA-N 2-amino-5-(2-aminoethyl)naphthalene-1-sulfonic acid Chemical compound NC1=CC=C2C(CCN)=CC=CC2=C1S(O)(=O)=O LAXVMANLDGWYJP-UHFFFAOYSA-N 0.000 description 2
- 102100030162 2-oxoglutarate dehydrogenase-like, mitochondrial Human genes 0.000 description 2
- CPBJMKMKNCRKQB-UHFFFAOYSA-N 3,3-bis(4-hydroxy-3-methylphenyl)-2-benzofuran-1-one Chemical compound C1=C(O)C(C)=CC(C2(C3=CC=CC=C3C(=O)O2)C=2C=C(C)C(O)=CC=2)=C1 CPBJMKMKNCRKQB-UHFFFAOYSA-N 0.000 description 2
- FWBHETKCLVMNFS-UHFFFAOYSA-N 4',6-Diamino-2-phenylindol Chemical compound C1=CC(C(=N)N)=CC=C1C1=CC2=CC=C(C(N)=N)C=C2N1 FWBHETKCLVMNFS-UHFFFAOYSA-N 0.000 description 2
- YSCNMFDFYJUPEF-OWOJBTEDSA-N 4,4'-diisothiocyano-trans-stilbene-2,2'-disulfonic acid Chemical compound OS(=O)(=O)C1=CC(N=C=S)=CC=C1\C=C\C1=CC=C(N=C=S)C=C1S(O)(=O)=O YSCNMFDFYJUPEF-OWOJBTEDSA-N 0.000 description 2
- YJCCSLGGODRWKK-NSCUHMNNSA-N 4-Acetamido-4'-isothiocyanostilbene-2,2'-disulphonic acid Chemical compound OS(=O)(=O)C1=CC(NC(=O)C)=CC=C1\C=C\C1=CC=C(N=C=S)C=C1S(O)(=O)=O YJCCSLGGODRWKK-NSCUHMNNSA-N 0.000 description 2
- OSWZKAVBSQAVFI-UHFFFAOYSA-N 4-[(4-isothiocyanatophenyl)diazenyl]-n,n-dimethylaniline Chemical compound C1=CC(N(C)C)=CC=C1N=NC1=CC=C(N=C=S)C=C1 OSWZKAVBSQAVFI-UHFFFAOYSA-N 0.000 description 2
- SJQRQOKXQKVJGJ-UHFFFAOYSA-N 5-(2-aminoethylamino)naphthalene-1-sulfonic acid Chemical compound C1=CC=C2C(NCCN)=CC=CC2=C1S(O)(=O)=O SJQRQOKXQKVJGJ-UHFFFAOYSA-N 0.000 description 2
- ZWONWYNZSWOYQC-UHFFFAOYSA-N 5-benzamido-3-[[5-[[4-chloro-6-(4-sulfoanilino)-1,3,5-triazin-2-yl]amino]-2-sulfophenyl]diazenyl]-4-hydroxynaphthalene-2,7-disulfonic acid Chemical compound OC1=C(N=NC2=CC(NC3=NC(NC4=CC=C(C=C4)S(O)(=O)=O)=NC(Cl)=N3)=CC=C2S(O)(=O)=O)C(=CC2=C1C(NC(=O)C1=CC=CC=C1)=CC(=C2)S(O)(=O)=O)S(O)(=O)=O ZWONWYNZSWOYQC-UHFFFAOYSA-N 0.000 description 2
- NJYVEMPWNAYQQN-UHFFFAOYSA-N 5-carboxyfluorescein Chemical compound C12=CC=C(O)C=C2OC2=CC(O)=CC=C2C21OC(=O)C1=CC(C(=O)O)=CC=C21 NJYVEMPWNAYQQN-UHFFFAOYSA-N 0.000 description 2
- YERWMQJEYUIJBO-UHFFFAOYSA-N 5-chlorosulfonyl-2-[3-(diethylamino)-6-diethylazaniumylidenexanthen-9-yl]benzenesulfonate Chemical compound C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=C(S(Cl)(=O)=O)C=C1S([O-])(=O)=O YERWMQJEYUIJBO-UHFFFAOYSA-N 0.000 description 2
- AXGKYURDYTXCAG-UHFFFAOYSA-N 5-isothiocyanato-2-[2-(4-isothiocyanato-2-sulfophenyl)ethyl]benzenesulfonic acid Chemical compound OS(=O)(=O)C1=CC(N=C=S)=CC=C1CCC1=CC=C(N=C=S)C=C1S(O)(=O)=O AXGKYURDYTXCAG-UHFFFAOYSA-N 0.000 description 2
- HWQQCFPHXPNXHC-UHFFFAOYSA-N 6-[(4,6-dichloro-1,3,5-triazin-2-yl)amino]-3',6'-dihydroxyspiro[2-benzofuran-3,9'-xanthene]-1-one Chemical compound C=1C(O)=CC=C2C=1OC1=CC(O)=CC=C1C2(C1=CC=2)OC(=O)C1=CC=2NC1=NC(Cl)=NC(Cl)=N1 HWQQCFPHXPNXHC-UHFFFAOYSA-N 0.000 description 2
- WQZIDRAQTRIQDX-UHFFFAOYSA-N 6-carboxy-x-rhodamine Chemical compound OC(=O)C1=CC=C(C([O-])=O)C=C1C(C1=CC=2CCCN3CCCC(C=23)=C1O1)=C2C1=C(CCC1)C3=[N+]1CCCC3=C2 WQZIDRAQTRIQDX-UHFFFAOYSA-N 0.000 description 2
- YALJZNKPECPZAS-UHFFFAOYSA-N 7-(diethylamino)-3-(4-isothiocyanatophenyl)-4-methylchromen-2-one Chemical compound O=C1OC2=CC(N(CC)CC)=CC=C2C(C)=C1C1=CC=C(N=C=S)C=C1 YALJZNKPECPZAS-UHFFFAOYSA-N 0.000 description 2
- SGAOZXGJGQEBHA-UHFFFAOYSA-N 82344-98-7 Chemical compound C1CCN2CCCC(C=C3C4(OC(C5=CC(=CC=C54)N=C=S)=O)C4=C5)=C2C1=C3OC4=C1CCCN2CCCC5=C12 SGAOZXGJGQEBHA-UHFFFAOYSA-N 0.000 description 2
- LRFVTYWOQMYALW-UHFFFAOYSA-N 9H-xanthine Chemical compound O=C1NC(=O)NC2=C1NC=N2 LRFVTYWOQMYALW-UHFFFAOYSA-N 0.000 description 2
- 229930024421 Adenine Natural products 0.000 description 2
- 102100040152 Adenylyl-sulfate kinase Human genes 0.000 description 2
- 241000972773 Aulopiformes Species 0.000 description 2
- FYEHYMARPSSOBO-UHFFFAOYSA-N Aurin Chemical compound C1=CC(O)=CC=C1C(C=1C=CC(O)=CC=1)=C1C=CC(=O)C=C1 FYEHYMARPSSOBO-UHFFFAOYSA-N 0.000 description 2
- 102100035656 BCL2/adenovirus E1B 19 kDa protein-interacting protein 3 Human genes 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 102100021973 Carbonyl reductase [NADPH] 1 Human genes 0.000 description 2
- 206010008805 Chromosomal abnormalities Diseases 0.000 description 2
- 208000031404 Chromosome Aberrations Diseases 0.000 description 2
- 206010009944 Colon cancer Diseases 0.000 description 2
- AUNGANRZJHBGPY-UHFFFAOYSA-N D-Lyxoflavin Natural products OCC(O)C(O)C(O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-UHFFFAOYSA-N 0.000 description 2
- 108010017826 DNA Polymerase I Proteins 0.000 description 2
- 102000004594 DNA Polymerase I Human genes 0.000 description 2
- 102100028216 DNA polymerase zeta catalytic subunit Human genes 0.000 description 2
- 102100028285 DNA repair protein REV1 Human genes 0.000 description 2
- 102000017930 EDNRB Human genes 0.000 description 2
- QTANTQQOYSUMLC-UHFFFAOYSA-O Ethidium cation Chemical compound C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 QTANTQQOYSUMLC-UHFFFAOYSA-O 0.000 description 2
- 108700024394 Exon Proteins 0.000 description 2
- 102100031813 Fibulin-2 Human genes 0.000 description 2
- 102100037042 Forkhead box protein E1 Human genes 0.000 description 2
- 102100030943 Glutathione S-transferase P Human genes 0.000 description 2
- 102100035340 Guanine nucleotide-binding protein subunit beta-4 Human genes 0.000 description 2
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 2
- 101000585732 Homo sapiens 2-oxoglutarate dehydrogenase-like, mitochondrial Proteins 0.000 description 2
- 101000803294 Homo sapiens BCL2/adenovirus E1B 19 kDa protein-interacting protein 3 Proteins 0.000 description 2
- 101000896985 Homo sapiens Carbonyl reductase [NADPH] 1 Proteins 0.000 description 2
- 101000579381 Homo sapiens DNA polymerase zeta catalytic subunit Proteins 0.000 description 2
- 101000967299 Homo sapiens Endothelin receptor type B Proteins 0.000 description 2
- 101001065274 Homo sapiens Fibulin-2 Proteins 0.000 description 2
- 101001029304 Homo sapiens Forkhead box protein E1 Proteins 0.000 description 2
- 101001010139 Homo sapiens Glutathione S-transferase P Proteins 0.000 description 2
- 101001024249 Homo sapiens Guanine nucleotide-binding protein subunit beta-4 Proteins 0.000 description 2
- 101001050321 Homo sapiens Junctional adhesion molecule C Proteins 0.000 description 2
- 101000971638 Homo sapiens Kinesin-like protein KIF1A Proteins 0.000 description 2
- 101000972291 Homo sapiens Lymphoid enhancer-binding factor 1 Proteins 0.000 description 2
- 101000624947 Homo sapiens Nesprin-1 Proteins 0.000 description 2
- 101000595669 Homo sapiens Pituitary homeobox 2 Proteins 0.000 description 2
- 101000835083 Homo sapiens Tissue factor pathway inhibitor 2 Proteins 0.000 description 2
- 101000652324 Homo sapiens Transcription factor SOX-17 Proteins 0.000 description 2
- 102100023429 Junctional adhesion molecule C Human genes 0.000 description 2
- 102100021527 Kinesin-like protein KIF1A Human genes 0.000 description 2
- 102100022699 Lymphoid enhancer-binding factor 1 Human genes 0.000 description 2
- 102100025825 Methylated-DNA-protein-cysteine methyltransferase Human genes 0.000 description 2
- 101100412856 Mus musculus Rhod gene Proteins 0.000 description 2
- KWYHDKDOAIKMQN-UHFFFAOYSA-N N,N,N',N'-tetramethylethylenediamine Chemical compound CN(C)CCN(C)C KWYHDKDOAIKMQN-UHFFFAOYSA-N 0.000 description 2
- QPCDCPDFJACHGM-UHFFFAOYSA-N N,N-bis{2-[bis(carboxymethyl)amino]ethyl}glycine Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(=O)O)CCN(CC(O)=O)CC(O)=O QPCDCPDFJACHGM-UHFFFAOYSA-N 0.000 description 2
- 108091007369 NEUR proteins Proteins 0.000 description 2
- 102100023306 Nesprin-1 Human genes 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- BELBBZDIHDAJOR-UHFFFAOYSA-N Phenolsulfonephthalein Chemical compound C1=CC(O)=CC=C1C1(C=2C=CC(O)=CC=2)C2=CC=CC=C2S(=O)(=O)O1 BELBBZDIHDAJOR-UHFFFAOYSA-N 0.000 description 2
- 102100036090 Pituitary homeobox 2 Human genes 0.000 description 2
- 241000205160 Pyrococcus Species 0.000 description 2
- 241000205156 Pyrococcus furiosus Species 0.000 description 2
- 241000205192 Pyrococcus woesei Species 0.000 description 2
- 102100033909 Retinoic acid receptor beta Human genes 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- 108010090804 Streptavidin Proteins 0.000 description 2
- 101150005983 TWIST gene Proteins 0.000 description 2
- 241000422848 Taxodium mucronatum Species 0.000 description 2
- 229910052771 Terbium Inorganic materials 0.000 description 2
- 101100242191 Tetraodon nigroviridis rho gene Proteins 0.000 description 2
- 241000205180 Thermococcus litoralis Species 0.000 description 2
- 241000204666 Thermotoga maritima Species 0.000 description 2
- 241000589596 Thermus Species 0.000 description 2
- 241000589500 Thermus aquaticus Species 0.000 description 2
- 241000589499 Thermus thermophilus Species 0.000 description 2
- 102100026134 Tissue factor pathway inhibitor 2 Human genes 0.000 description 2
- 101710183280 Topoisomerase Proteins 0.000 description 2
- 108700009124 Transcription Initiation Site Proteins 0.000 description 2
- 102100030243 Transcription factor SOX-17 Human genes 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- 229960000643 adenine Drugs 0.000 description 2
- GFFGJBXGBJISGV-UHFFFAOYSA-N adenyl group Chemical group N1=CN=C2N=CNC2=C1N GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 2
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 230000004663 cell proliferation Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 239000013522 chelant Substances 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 229960000956 coumarin Drugs 0.000 description 2
- 235000001671 coumarin Nutrition 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 238000012350 deep sequencing Methods 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 229960000633 dextran sulfate Drugs 0.000 description 2
- 238000007847 digital PCR Methods 0.000 description 2
- OOYIOIOOWUGAHD-UHFFFAOYSA-L disodium;2',4',5',7'-tetrabromo-4,5,6,7-tetrachloro-3-oxospiro[2-benzofuran-1,9'-xanthene]-3',6'-diolate Chemical compound [Na+].[Na+].O1C(=O)C(C(=C(Cl)C(Cl)=C2Cl)Cl)=C2C21C1=CC(Br)=C([O-])C(Br)=C1OC1=C(Br)C([O-])=C(Br)C=C21 OOYIOIOOWUGAHD-UHFFFAOYSA-L 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- XHXYXYGSUXANME-UHFFFAOYSA-N eosin 5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC(Br)=C(O)C(Br)=C1OC1=C(Br)C(O)=C(Br)C=C21 XHXYXYGSUXANME-UHFFFAOYSA-N 0.000 description 2
- 230000001747 exhibiting effect Effects 0.000 description 2
- ZFKJVJIDPQDDFY-UHFFFAOYSA-N fluorescamine Chemical compound C12=CC=CC=C2C(=O)OC1(C1=O)OC=C1C1=CC=CC=C1 ZFKJVJIDPQDDFY-UHFFFAOYSA-N 0.000 description 2
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-M hydrogensulfate Chemical compound OS([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-M 0.000 description 2
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 2
- 230000002452 interceptive effect Effects 0.000 description 2
- 229940107698 malachite green Drugs 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 108040008770 methylated-DNA-[protein]-cysteine S-methyltransferase activity proteins Proteins 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- LKKPNUDVOYAOBB-UHFFFAOYSA-N naphthalocyanine Chemical compound N1C(N=C2C3=CC4=CC=CC=C4C=C3C(N=C3C4=CC5=CC=CC=C5C=C4C(=N4)N3)=N2)=C(C=C2C(C=CC=C2)=C2)C2=C1N=C1C2=CC3=CC=CC=C3C=C2C4=N1 LKKPNUDVOYAOBB-UHFFFAOYSA-N 0.000 description 2
- 238000007899 nucleic acid hybridization Methods 0.000 description 2
- 230000005257 nucleotidylation Effects 0.000 description 2
- AFAIELJLZYUNPW-UHFFFAOYSA-N pararosaniline free base Chemical compound C1=CC(N)=CC=C1C(C=1C=CC(N)=CC=1)=C1C=CC(=N)C=C1 AFAIELJLZYUNPW-UHFFFAOYSA-N 0.000 description 2
- 229960003531 phenolsulfonphthalein Drugs 0.000 description 2
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 2
- ZWLUXSQADUDCSB-UHFFFAOYSA-N phthalaldehyde Chemical compound O=CC1=CC=CC=C1C=O ZWLUXSQADUDCSB-UHFFFAOYSA-N 0.000 description 2
- IEQIEDJGQAUEQZ-UHFFFAOYSA-N phthalocyanine Chemical compound N1C(N=C2C3=CC=CC=C3C(N=C3C4=CC=CC=C4C(=N4)N3)=N2)=C(C=CC=C2)C2=C1N=C1C2=CC=CC=C2C4=N1 IEQIEDJGQAUEQZ-UHFFFAOYSA-N 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- AJMSJNPWXJCWOK-UHFFFAOYSA-N pyren-1-yl butanoate Chemical compound C1=C2C(OC(=O)CCC)=CC=C(C=C3)C2=C2C3=CC=CC2=C1 AJMSJNPWXJCWOK-UHFFFAOYSA-N 0.000 description 2
- 239000011535 reaction buffer Substances 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 108091008761 retinoic acid receptors β Proteins 0.000 description 2
- MYFATKRONKHHQL-UHFFFAOYSA-N rhodamine 123 Chemical compound [Cl-].COC(=O)C1=CC=CC=C1C1=C2C=CC(=[NH2+])C=C2OC2=CC(N)=CC=C21 MYFATKRONKHHQL-UHFFFAOYSA-N 0.000 description 2
- 229940043267 rhodamine b Drugs 0.000 description 2
- 229960002477 riboflavin Drugs 0.000 description 2
- 235000019192 riboflavin Nutrition 0.000 description 2
- 239000002151 riboflavin Substances 0.000 description 2
- 235000019515 salmon Nutrition 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N silicon dioxide Inorganic materials O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- 239000001488 sodium phosphate Substances 0.000 description 2
- 229910000162 sodium phosphate Inorganic materials 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 210000002784 stomach Anatomy 0.000 description 2
- COIVODZMVVUETJ-UHFFFAOYSA-N sulforhodamine 101 Chemical compound OS(=O)(=O)C1=CC(S([O-])(=O)=O)=CC=C1C1=C(C=C2C3=C4CCCN3CCC2)C4=[O+]C2=C1C=C1CCCN3CCCC2=C13 COIVODZMVVUETJ-UHFFFAOYSA-N 0.000 description 2
- YBBRCQOCSYXUOC-UHFFFAOYSA-N sulfuryl dichloride Chemical class ClS(Cl)(=O)=O YBBRCQOCSYXUOC-UHFFFAOYSA-N 0.000 description 2
- GZCRRIHWUXGPOV-UHFFFAOYSA-N terbium atom Chemical compound [Tb] GZCRRIHWUXGPOV-UHFFFAOYSA-N 0.000 description 2
- WGTODYJZXSJIAG-UHFFFAOYSA-N tetramethylrhodamine chloride Chemical compound [Cl-].C=12C=CC(N(C)C)=CC2=[O+]C2=CC(N(C)C)=CC=C2C=1C1=CC=CC=C1C(O)=O WGTODYJZXSJIAG-UHFFFAOYSA-N 0.000 description 2
- MPLHNVLQVRSVEE-UHFFFAOYSA-N texas red Chemical compound [O-]S(=O)(=O)C1=CC(S(Cl)(=O)=O)=CC=C1C(C1=CC=2CCCN3CCCC(C=23)=C1O1)=C2C1=C(CCC1)C3=[N+]1CCCC3=C2 MPLHNVLQVRSVEE-UHFFFAOYSA-N 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 2
- PJOHVEQSYPOERL-SHEAVXILSA-N (e)-n-[(4r,4as,7ar,12br)-3-(cyclopropylmethyl)-9-hydroxy-7-oxo-2,4,5,6,7a,13-hexahydro-1h-4,12-methanobenzofuro[3,2-e]isoquinoline-4a-yl]-3-(4-methylphenyl)prop-2-enamide Chemical compound C1=CC(C)=CC=C1\C=C\C(=O)N[C@]1(CCC(=O)[C@@H]2O3)[C@H]4CC5=CC=C(O)C3=C5[C@]12CCN4CC1CC1 PJOHVEQSYPOERL-SHEAVXILSA-N 0.000 description 1
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 1
- 102100026210 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase gamma-2 Human genes 0.000 description 1
- YKBGVTZYEHREMT-KVQBGUIXSA-N 2'-deoxyguanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](CO)O1 YKBGVTZYEHREMT-KVQBGUIXSA-N 0.000 description 1
- VGONTNSXDCQUGY-RRKCRQDMSA-N 2'-deoxyinosine Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(N=CNC2=O)=C2N=C1 VGONTNSXDCQUGY-RRKCRQDMSA-N 0.000 description 1
- MWBWWFOAEOYUST-UHFFFAOYSA-N 2-aminopurine Chemical compound NC1=NC=C2N=CNC2=N1 MWBWWFOAEOYUST-UHFFFAOYSA-N 0.000 description 1
- 102100039776 39S ribosomal protein L4, mitochondrial Human genes 0.000 description 1
- 101800000504 3C-like protease Proteins 0.000 description 1
- CKTSBUTUHBMZGZ-ULQXZJNLSA-N 4-amino-1-[(2r,4s,5r)-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-tritiopyrimidin-2-one Chemical compound O=C1N=C(N)C([3H])=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 CKTSBUTUHBMZGZ-ULQXZJNLSA-N 0.000 description 1
- WBSMIPAMAXNXFS-UHFFFAOYSA-N 5-Nitro-2-(3-phenylpropylamino)benzoic acid Chemical compound OC(=O)C1=CC([N+]([O-])=O)=CC=C1NCCCC1=CC=CC=C1 WBSMIPAMAXNXFS-UHFFFAOYSA-N 0.000 description 1
- LQLQRFGHAALLLE-UHFFFAOYSA-N 5-bromouracil Chemical compound BrC1=CNC(=O)NC1=O LQLQRFGHAALLLE-UHFFFAOYSA-N 0.000 description 1
- 102100026802 72 kDa type IV collagenase Human genes 0.000 description 1
- 102100022144 Achaete-scute homolog 2 Human genes 0.000 description 1
- 102100036464 Activated RNA polymerase II transcriptional coactivator p15 Human genes 0.000 description 1
- 101710111216 Activated RNA polymerase II transcriptional coactivator p15 Proteins 0.000 description 1
- 102100031933 Adhesion G protein-coupled receptor F5 Human genes 0.000 description 1
- 101100482081 Agrobacterium vitis (strain S4 / ATCC BAA-846) iaaM gene Proteins 0.000 description 1
- 102100033327 Ankyrin repeat domain-containing protein 40 Human genes 0.000 description 1
- 102100036451 Apolipoprotein C-I Human genes 0.000 description 1
- 102100029647 Apoptosis-associated speck-like protein containing a CARD Human genes 0.000 description 1
- 101100170834 Arabidopsis thaliana ERDJ3A gene Proteins 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- 102100033648 Arf-GAP with Rho-GAP domain, ANK repeat and PH domain-containing protein 3 Human genes 0.000 description 1
- 102100034224 Armadillo repeat-containing X-linked protein 2 Human genes 0.000 description 1
- 102100036880 Armadillo repeat-containing protein 7 Human genes 0.000 description 1
- 102100028820 Aspartate-tRNA ligase, cytoplasmic Human genes 0.000 description 1
- 108700020463 BRCA1 Proteins 0.000 description 1
- 102000036365 BRCA1 Human genes 0.000 description 1
- 101150072950 BRCA1 gene Proteins 0.000 description 1
- 102100026349 Beta-1,4-galactosyltransferase 1 Human genes 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 102100037674 Bis(5'-adenosyl)-triphosphatase Human genes 0.000 description 1
- 102100024506 Bone morphogenetic protein 2 Human genes 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 102100036169 CAAX box protein 1 Human genes 0.000 description 1
- 102100028742 CAP-Gly domain-containing linker protein 4 Human genes 0.000 description 1
- 102100037917 CD109 antigen Human genes 0.000 description 1
- 102100027997 COP9 signalosome complex subunit 4 Human genes 0.000 description 1
- 102100024154 Cadherin-13 Human genes 0.000 description 1
- 102100025588 Calcitonin gene-related peptide 1 Human genes 0.000 description 1
- 102100036293 Calcium-binding mitochondrial carrier protein SCaMC-3 Human genes 0.000 description 1
- 102100025338 Calcium-binding tyrosine phosphorylation-regulated protein Human genes 0.000 description 1
- 102100022789 Calcium/calmodulin-dependent protein kinase type IV Human genes 0.000 description 1
- 102100033592 Calponin-3 Human genes 0.000 description 1
- 102100039510 Cancer/testis antigen 2 Human genes 0.000 description 1
- 102100035249 Carbonyl reductase [NADPH] 3 Human genes 0.000 description 1
- 108010072135 Cell Adhesion Molecule-1 Proteins 0.000 description 1
- 102100024649 Cell adhesion molecule 1 Human genes 0.000 description 1
- 102100023126 Cell surface glycoprotein MUC18 Human genes 0.000 description 1
- 206010008342 Cervix carcinoma Diseases 0.000 description 1
- 102100028487 Checkpoint protein HUS1 Human genes 0.000 description 1
- 102100028486 Checkpoint protein HUS1B Human genes 0.000 description 1
- 102100038220 Chromodomain-helicase-DNA-binding protein 6 Human genes 0.000 description 1
- 208000037051 Chromosomal Instability Diseases 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 1
- 102100033283 Creatine kinase U-type, mitochondrial Human genes 0.000 description 1
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 1
- 102100025176 Cyclin-A1 Human genes 0.000 description 1
- 108010025468 Cyclin-Dependent Kinase 6 Proteins 0.000 description 1
- 102100026804 Cyclin-dependent kinase 6 Human genes 0.000 description 1
- 102100024458 Cyclin-dependent kinase inhibitor 2A Human genes 0.000 description 1
- 102100031051 Cysteine and glycine-rich protein 1 Human genes 0.000 description 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 1
- IGXWBGJHJZYPQS-SSDOTTSWSA-N D-Luciferin Chemical compound OC(=O)[C@H]1CSC(C=2SC3=CC=C(O)C=C3N=2)=N1 IGXWBGJHJZYPQS-SSDOTTSWSA-N 0.000 description 1
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 108010063113 DNA Polymerase II Proteins 0.000 description 1
- 102000010567 DNA Polymerase II Human genes 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 230000030933 DNA methylation on cytosine Effects 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000005721 Death-Associated Protein Kinases Human genes 0.000 description 1
- 108010031042 Death-Associated Protein Kinases Proteins 0.000 description 1
- 102100038587 Death-associated protein kinase 1 Human genes 0.000 description 1
- CYCGRDQQIOGCKX-UHFFFAOYSA-N Dehydro-luciferin Natural products OC(=O)C1=CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 CYCGRDQQIOGCKX-UHFFFAOYSA-N 0.000 description 1
- 102100033553 Delta-like protein 4 Human genes 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 102100037985 Dickkopf-related protein 3 Human genes 0.000 description 1
- SHIBSTMRCDJXLN-UHFFFAOYSA-N Digoxigenin Natural products C1CC(C2C(C3(C)CCC(O)CC3CC2)CC2O)(O)C2(C)C1C1=CC(=O)OC1 SHIBSTMRCDJXLN-UHFFFAOYSA-N 0.000 description 1
- 102100022820 Disintegrin and metalloproteinase domain-containing protein 28 Human genes 0.000 description 1
- 102100023317 DnaJ homolog subfamily C member 10 Human genes 0.000 description 1
- 102100033575 Doublesex- and mab-3-related transcription factor B1 Human genes 0.000 description 1
- 102100039812 E3 ubiquitin-protein ligase RNF182 Human genes 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 102100031814 EGF-containing fibulin-like extracellular matrix protein 1 Human genes 0.000 description 1
- 102100027259 Ena/VASP-like protein Human genes 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 101150078651 Epha4 gene Proteins 0.000 description 1
- 102100021616 Ephrin type-A receptor 4 Human genes 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 102100038595 Estrogen receptor Human genes 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 102000012216 Fanconi Anemia Complementation Group F protein Human genes 0.000 description 1
- 108010022012 Fanconi Anemia Complementation Group F protein Proteins 0.000 description 1
- 102100034553 Fanconi anemia group J protein Human genes 0.000 description 1
- 102100023599 Fibroblast growth factor-binding protein 3 Human genes 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- BJGNCJDXODQBOB-UHFFFAOYSA-N Fivefly Luciferin Natural products OC(=O)C1CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 BJGNCJDXODQBOB-UHFFFAOYSA-N 0.000 description 1
- 108010010285 Forkhead Box Protein L2 Proteins 0.000 description 1
- 102100035137 Forkhead box protein L2 Human genes 0.000 description 1
- 102100027570 Forkhead box protein Q1 Human genes 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 102100030280 G-protein coupled receptor 39 Human genes 0.000 description 1
- 108010021779 GATA5 Transcription Factor Proteins 0.000 description 1
- 102000008412 GATA5 Transcription Factor Human genes 0.000 description 1
- 206010017993 Gastrointestinal neoplasms Diseases 0.000 description 1
- 241000237858 Gastropoda Species 0.000 description 1
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 1
- 102100025783 Glutamyl aminopeptidase Human genes 0.000 description 1
- 102100036528 Glutathione S-transferase Mu 3 Human genes 0.000 description 1
- 108010066705 H-cadherin Proteins 0.000 description 1
- 102100027489 Helicase-like transcription factor Human genes 0.000 description 1
- 102100031017 Hepatoma-derived growth factor-like protein 1 Human genes 0.000 description 1
- 108091027305 Heteroduplex Proteins 0.000 description 1
- 102100031672 Homeobox protein CDX-1 Human genes 0.000 description 1
- 102100029240 Homeobox protein Hox-B5 Human genes 0.000 description 1
- 101000691589 Homo sapiens 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase gamma-2 Proteins 0.000 description 1
- 101000667416 Homo sapiens 39S ribosomal protein L4, mitochondrial Proteins 0.000 description 1
- 101000627872 Homo sapiens 72 kDa type IV collagenase Proteins 0.000 description 1
- 101000901109 Homo sapiens Achaete-scute homolog 2 Proteins 0.000 description 1
- 101000610212 Homo sapiens Adenylyl-sulfate kinase Proteins 0.000 description 1
- 101000775045 Homo sapiens Adhesion G protein-coupled receptor F5 Proteins 0.000 description 1
- 101000732368 Homo sapiens Ankyrin repeat domain-containing protein 40 Proteins 0.000 description 1
- 101000928628 Homo sapiens Apolipoprotein C-I Proteins 0.000 description 1
- 101000733571 Homo sapiens Arf-GAP with Rho-GAP domain, ANK repeat and PH domain-containing protein 3 Proteins 0.000 description 1
- 101000925939 Homo sapiens Armadillo repeat-containing X-linked protein 2 Proteins 0.000 description 1
- 101000927955 Homo sapiens Armadillo repeat-containing protein 7 Proteins 0.000 description 1
- 101000696909 Homo sapiens Aspartate-tRNA ligase, cytoplasmic Proteins 0.000 description 1
- 101000766145 Homo sapiens Beta-1,4-galactosyltransferase 1 Proteins 0.000 description 1
- 101000762366 Homo sapiens Bone morphogenetic protein 2 Proteins 0.000 description 1
- 101000947164 Homo sapiens CAAX box protein 1 Proteins 0.000 description 1
- 101000767061 Homo sapiens CAP-Gly domain-containing linker protein 4 Proteins 0.000 description 1
- 101000738399 Homo sapiens CD109 antigen Proteins 0.000 description 1
- 101000858667 Homo sapiens COP9 signalosome complex subunit 4 Proteins 0.000 description 1
- 101000741445 Homo sapiens Calcitonin Proteins 0.000 description 1
- 101000932890 Homo sapiens Calcitonin gene-related peptide 1 Proteins 0.000 description 1
- 101000935132 Homo sapiens Calcium-binding tyrosine phosphorylation-regulated protein Proteins 0.000 description 1
- 101000974816 Homo sapiens Calcium/calmodulin-dependent protein kinase type IV Proteins 0.000 description 1
- 101000945410 Homo sapiens Calponin-3 Proteins 0.000 description 1
- 101000889345 Homo sapiens Cancer/testis antigen 2 Proteins 0.000 description 1
- 101000737274 Homo sapiens Carbonyl reductase [NADPH] 3 Proteins 0.000 description 1
- 101000623903 Homo sapiens Cell surface glycoprotein MUC18 Proteins 0.000 description 1
- 101000839968 Homo sapiens Checkpoint protein HUS1 Proteins 0.000 description 1
- 101000839969 Homo sapiens Checkpoint protein HUS1B Proteins 0.000 description 1
- 101000883731 Homo sapiens Chromodomain-helicase-DNA-binding protein 5 Proteins 0.000 description 1
- 101000883736 Homo sapiens Chromodomain-helicase-DNA-binding protein 6 Proteins 0.000 description 1
- 101001135413 Homo sapiens Creatine kinase U-type, mitochondrial Proteins 0.000 description 1
- 101000934314 Homo sapiens Cyclin-A1 Proteins 0.000 description 1
- 101000922020 Homo sapiens Cysteine and glycine-rich protein 1 Proteins 0.000 description 1
- 101000956145 Homo sapiens Death-associated protein kinase 1 Proteins 0.000 description 1
- 101000872077 Homo sapiens Delta-like protein 4 Proteins 0.000 description 1
- 101000951342 Homo sapiens Dickkopf-related protein 3 Proteins 0.000 description 1
- 101000915391 Homo sapiens Disabled homolog 2 Proteins 0.000 description 1
- 101000756727 Homo sapiens Disintegrin and metalloproteinase domain-containing protein 23 Proteins 0.000 description 1
- 101000756756 Homo sapiens Disintegrin and metalloproteinase domain-containing protein 28 Proteins 0.000 description 1
- 101000908042 Homo sapiens DnaJ homolog subfamily C member 10 Proteins 0.000 description 1
- 101000871973 Homo sapiens Doublesex- and mab-3-related transcription factor B1 Proteins 0.000 description 1
- 101000667703 Homo sapiens E3 ubiquitin-protein ligase RNF182 Proteins 0.000 description 1
- 101001065272 Homo sapiens EGF-containing fibulin-like extracellular matrix protein 1 Proteins 0.000 description 1
- 101001057143 Homo sapiens Ena/VASP-like protein Proteins 0.000 description 1
- 101000882584 Homo sapiens Estrogen receptor Proteins 0.000 description 1
- 101000848171 Homo sapiens Fanconi anemia group J protein Proteins 0.000 description 1
- 101000827773 Homo sapiens Fibroblast growth factor-binding protein 3 Proteins 0.000 description 1
- 101000861406 Homo sapiens Forkhead box protein Q1 Proteins 0.000 description 1
- 101001009541 Homo sapiens G-protein coupled receptor 39 Proteins 0.000 description 1
- 101000719019 Homo sapiens Glutamyl aminopeptidase Proteins 0.000 description 1
- 101001071716 Homo sapiens Glutathione S-transferase Mu 3 Proteins 0.000 description 1
- 101001038390 Homo sapiens Guided entry of tail-anchored proteins factor 1 Proteins 0.000 description 1
- 101001081105 Homo sapiens Helicase-like transcription factor Proteins 0.000 description 1
- 101001083786 Homo sapiens Hepatoma-derived growth factor-like protein 1 Proteins 0.000 description 1
- 101000777808 Homo sapiens Homeobox protein CDX-1 Proteins 0.000 description 1
- 101000840553 Homo sapiens Homeobox protein Hox-B5 Proteins 0.000 description 1
- 101001001418 Homo sapiens Inhibitor of growth protein 4 Proteins 0.000 description 1
- 101000599852 Homo sapiens Intercellular adhesion molecule 1 Proteins 0.000 description 1
- 101000960337 Homo sapiens Intercellular adhesion molecule 5 Proteins 0.000 description 1
- 101000977765 Homo sapiens Iroquois-class homeodomain protein IRX-4 Proteins 0.000 description 1
- 101000614618 Homo sapiens Junctophilin-3 Proteins 0.000 description 1
- 101000614436 Homo sapiens Keratin, type I cytoskeletal 14 Proteins 0.000 description 1
- 101001026998 Homo sapiens Ketosamine-3-kinase Proteins 0.000 description 1
- 101001006895 Homo sapiens Krueppel-like factor 11 Proteins 0.000 description 1
- 101000981765 Homo sapiens Leucine-rich repeat-containing G-protein coupled receptor 6 Proteins 0.000 description 1
- 101001065550 Homo sapiens Lymphocyte antigen 6K Proteins 0.000 description 1
- 101001099308 Homo sapiens Meiotic recombination protein REC8 homolog Proteins 0.000 description 1
- 101000581428 Homo sapiens Mini-chromosome maintenance complex-binding protein Proteins 0.000 description 1
- 101000593405 Homo sapiens Myb-related protein B Proteins 0.000 description 1
- 101001022780 Homo sapiens Myosin light chain kinase, smooth muscle Proteins 0.000 description 1
- 101000709248 Homo sapiens NAD-dependent protein deacetylase sirtuin-7 Proteins 0.000 description 1
- 101000979909 Homo sapiens NMDA receptor synaptonuclear signaling and neuronal migration factor Proteins 0.000 description 1
- 101001125327 Homo sapiens Na(+)/H(+) exchange regulatory cofactor NHE-RF1 Proteins 0.000 description 1
- 101000928278 Homo sapiens Natriuretic peptides B Proteins 0.000 description 1
- 101000602237 Homo sapiens Neuroblastoma suppressor of tumorigenicity 1 Proteins 0.000 description 1
- 101000979321 Homo sapiens Neurofilament medium polypeptide Proteins 0.000 description 1
- 101000985296 Homo sapiens Neuron-specific calcium-binding protein hippocalcin Proteins 0.000 description 1
- 101000604058 Homo sapiens Neuronal pentraxin-1 Proteins 0.000 description 1
- 101001023729 Homo sapiens Neuropilin and tolloid-like protein 2 Proteins 0.000 description 1
- 101001072765 Homo sapiens Neutral alpha-glucosidase AB Proteins 0.000 description 1
- 101000603223 Homo sapiens Nischarin Proteins 0.000 description 1
- 101000866805 Homo sapiens Non-histone chromosomal protein HMG-17 Proteins 0.000 description 1
- 101000586302 Homo sapiens Oncostatin-M-specific receptor subunit beta Proteins 0.000 description 1
- 101100110004 Homo sapiens PYCARD gene Proteins 0.000 description 1
- 101000891014 Homo sapiens Peptidyl-prolyl cis-trans isomerase FKBP14 Proteins 0.000 description 1
- 101001094017 Homo sapiens Phosphatase and actin regulator 3 Proteins 0.000 description 1
- 101001001500 Homo sapiens Phosphatidylinositol N-acetylglucosaminyltransferase subunit H Proteins 0.000 description 1
- 101000701366 Homo sapiens Phospholipid-transporting ATPase IB Proteins 0.000 description 1
- 101000945272 Homo sapiens Phosphorylase b kinase regulatory subunit alpha, liver isoform Proteins 0.000 description 1
- 101000874141 Homo sapiens Probable ATP-dependent RNA helicase DDX43 Proteins 0.000 description 1
- 101000611943 Homo sapiens Programmed cell death protein 4 Proteins 0.000 description 1
- 101000903887 Homo sapiens Protein BEX1 Proteins 0.000 description 1
- 101000995300 Homo sapiens Protein NDRG2 Proteins 0.000 description 1
- 101000995332 Homo sapiens Protein NDRG4 Proteins 0.000 description 1
- 101000762425 Homo sapiens Protein boule-like Proteins 0.000 description 1
- 101001064096 Homo sapiens Protein disulfide-thiol oxidoreductase Proteins 0.000 description 1
- 101000579425 Homo sapiens Proto-oncogene tyrosine-protein kinase receptor Ret Proteins 0.000 description 1
- 101001130554 Homo sapiens Putative RNA-binding protein 15B Proteins 0.000 description 1
- 101000956729 Homo sapiens RNA helicase Mov10l1 Proteins 0.000 description 1
- 101000668170 Homo sapiens RNA-binding motif, single-stranded-interacting protein 2 Proteins 0.000 description 1
- 101001077488 Homo sapiens RNA-binding protein Raly Proteins 0.000 description 1
- 101000712958 Homo sapiens Ras association domain-containing protein 1 Proteins 0.000 description 1
- 101000712969 Homo sapiens Ras association domain-containing protein 5 Proteins 0.000 description 1
- 101001060859 Homo sapiens Ras-related protein Rab-32 Proteins 0.000 description 1
- 101000606537 Homo sapiens Receptor-type tyrosine-protein phosphatase delta Proteins 0.000 description 1
- 101000699848 Homo sapiens Retrotransposon Gag-like protein 8C Proteins 0.000 description 1
- 101000927796 Homo sapiens Rho guanine nucleotide exchange factor 7 Proteins 0.000 description 1
- 101000867039 Homo sapiens SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily E member 1-related Proteins 0.000 description 1
- 101000740178 Homo sapiens Sal-like protein 4 Proteins 0.000 description 1
- 101000821449 Homo sapiens Secreted and transmembrane protein 1 Proteins 0.000 description 1
- 101000864743 Homo sapiens Secreted frizzled-related protein 1 Proteins 0.000 description 1
- 101000864786 Homo sapiens Secreted frizzled-related protein 2 Proteins 0.000 description 1
- 101000701391 Homo sapiens Serine/threonine-protein kinase 31 Proteins 0.000 description 1
- 101000987315 Homo sapiens Serine/threonine-protein kinase PAK 3 Proteins 0.000 description 1
- 101000621061 Homo sapiens Serum paraoxonase/arylesterase 2 Proteins 0.000 description 1
- 101000836394 Homo sapiens Sestrin-1 Proteins 0.000 description 1
- 101000826125 Homo sapiens Single-stranded DNA-binding protein 2 Proteins 0.000 description 1
- 101000642262 Homo sapiens Spondin-1 Proteins 0.000 description 1
- 101000643632 Homo sapiens Synaptonemal complex protein 3 Proteins 0.000 description 1
- 101000891881 Homo sapiens Synaptotagmin-6 Proteins 0.000 description 1
- 101000837401 Homo sapiens T-cell leukemia/lymphoma protein 1A Proteins 0.000 description 1
- 101000946863 Homo sapiens T-cell surface glycoprotein CD3 delta chain Proteins 0.000 description 1
- 101000658632 Homo sapiens Testis-specific Y-encoded-like protein 6 Proteins 0.000 description 1
- 101000831567 Homo sapiens Toll-like receptor 2 Proteins 0.000 description 1
- 101000757378 Homo sapiens Transcription factor AP-2-alpha Proteins 0.000 description 1
- 101000819074 Homo sapiens Transcription factor GATA-4 Proteins 0.000 description 1
- 101000894871 Homo sapiens Transcription regulator protein BACH1 Proteins 0.000 description 1
- 101000904724 Homo sapiens Transmembrane glycoprotein NMB Proteins 0.000 description 1
- 101000847952 Homo sapiens Trypsin-3 Proteins 0.000 description 1
- 101000713585 Homo sapiens Tubulin beta-4A chain Proteins 0.000 description 1
- 101000713623 Homo sapiens Tubulin gamma-2 chain Proteins 0.000 description 1
- 101000610602 Homo sapiens Tumor necrosis factor receptor superfamily member 10C Proteins 0.000 description 1
- 101000610609 Homo sapiens Tumor necrosis factor receptor superfamily member 10D Proteins 0.000 description 1
- 101000760781 Homo sapiens Tyrosyl-DNA phosphodiesterase 2 Proteins 0.000 description 1
- 101000819146 Homo sapiens UDP-glucose 4-epimerase Proteins 0.000 description 1
- 101000772888 Homo sapiens Ubiquitin-protein ligase E3A Proteins 0.000 description 1
- 101000935117 Homo sapiens Voltage-dependent P/Q-type calcium channel subunit alpha-1A Proteins 0.000 description 1
- 101000788669 Homo sapiens Zinc finger MYM-type protein 2 Proteins 0.000 description 1
- 101000760246 Homo sapiens Zinc finger protein 345 Proteins 0.000 description 1
- 101000976622 Homo sapiens Zinc finger protein 42 homolog Proteins 0.000 description 1
- 101000785609 Homo sapiens Zinc finger protein 655 Proteins 0.000 description 1
- 241000714260 Human T-lymphotropic virus 1 Species 0.000 description 1
- 241000714259 Human T-lymphotropic virus 2 Species 0.000 description 1
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 1
- 102100035677 Inhibitor of growth protein 4 Human genes 0.000 description 1
- 102100037877 Intercellular adhesion molecule 1 Human genes 0.000 description 1
- 102100039919 Intercellular adhesion molecule 5 Human genes 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 102100023531 Iroquois-class homeodomain protein IRX-4 Human genes 0.000 description 1
- 102100040488 Junctophilin-3 Human genes 0.000 description 1
- 102100040445 Keratin, type I cytoskeletal 14 Human genes 0.000 description 1
- 102100037378 Ketosamine-3-kinase Human genes 0.000 description 1
- 102100027797 Krueppel-like factor 11 Human genes 0.000 description 1
- 102100024140 Leucine-rich repeat-containing G-protein coupled receptor 6 Human genes 0.000 description 1
- 239000000232 Lipid Bilayer Substances 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- DDWFXDSYGUXRAY-UHFFFAOYSA-N Luciferin Natural products CCc1c(C)c(CC2NC(=O)C(=C2C=C)C)[nH]c1Cc3[nH]c4C(=C5/NC(CC(=O)O)C(C)C5CC(=O)O)CC(=O)c4c3C DDWFXDSYGUXRAY-UHFFFAOYSA-N 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- 102100032129 Lymphocyte antigen 6K Human genes 0.000 description 1
- 102100038882 Meiotic recombination protein REC8 homolog Human genes 0.000 description 1
- 102000030612 Melanocortin 5 receptor Human genes 0.000 description 1
- 108010088565 Melanocortin 5 receptor Proteins 0.000 description 1
- 102100026262 Metalloproteinase inhibitor 2 Human genes 0.000 description 1
- 102100026261 Metalloproteinase inhibitor 3 Human genes 0.000 description 1
- 108091092878 Microsatellite Proteins 0.000 description 1
- 102100027372 Mini-chromosome maintenance complex-binding protein Human genes 0.000 description 1
- 241000713333 Mouse mammary tumor virus Species 0.000 description 1
- 101100340130 Mus musculus Hoxd1 gene Proteins 0.000 description 1
- 101100095608 Mus musculus Serinc3 gene Proteins 0.000 description 1
- 102000013609 MutL Protein Homolog 1 Human genes 0.000 description 1
- 108010026664 MutL Protein Homolog 1 Proteins 0.000 description 1
- 102100034670 Myb-related protein B Human genes 0.000 description 1
- 241000204031 Mycoplasma Species 0.000 description 1
- 102100035044 Myosin light chain kinase, smooth muscle Human genes 0.000 description 1
- 102100027445 N-acetylserotonin O-methyltransferase-like protein Human genes 0.000 description 1
- 101710111485 N-acetylserotonin O-methyltransferase-like protein Proteins 0.000 description 1
- 102100022913 NAD-dependent protein deacetylase sirtuin-2 Human genes 0.000 description 1
- 102100034376 NAD-dependent protein deacetylase sirtuin-7 Human genes 0.000 description 1
- 102100024546 NMDA receptor synaptonuclear signaling and neuronal migration factor Human genes 0.000 description 1
- 102100029447 Na(+)/H(+) exchange regulatory cofactor NHE-RF1 Human genes 0.000 description 1
- 102100036836 Natriuretic peptides B Human genes 0.000 description 1
- 102100037142 Neuroblastoma suppressor of tumorigenicity 1 Human genes 0.000 description 1
- 102100023055 Neurofilament medium polypeptide Human genes 0.000 description 1
- 102100038436 Neuronal pentraxin-1 Human genes 0.000 description 1
- 102100035485 Neuropilin and tolloid-like protein 2 Human genes 0.000 description 1
- 101100426589 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) trp-3 gene Proteins 0.000 description 1
- 102100036592 Neutral alpha-glucosidase AB Human genes 0.000 description 1
- 102100038995 Nischarin Human genes 0.000 description 1
- 102100031346 Non-histone chromosomal protein HMG-17 Human genes 0.000 description 1
- 101710202677 Non-specific lipid-transfer protein Proteins 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 102100030098 Oncostatin-M-specific receptor subunit beta Human genes 0.000 description 1
- 238000002944 PCR assay Methods 0.000 description 1
- 108091093037 Peptide nucleic acid Chemical class 0.000 description 1
- 102100040350 Peptidyl-prolyl cis-trans isomerase FKBP14 Human genes 0.000 description 1
- 102100020739 Peptidyl-prolyl cis-trans isomerase FKBP4 Human genes 0.000 description 1
- 102100035269 Phosphatase and actin regulator 3 Human genes 0.000 description 1
- 102100036162 Phosphatidylinositol N-acetylglucosaminyltransferase subunit H Human genes 0.000 description 1
- 102100022428 Phospholipid transfer protein Human genes 0.000 description 1
- 102100030447 Phospholipid-transporting ATPase IB Human genes 0.000 description 1
- 102100033548 Phosphorylase b kinase regulatory subunit alpha, liver isoform Human genes 0.000 description 1
- 239000004952 Polyamide Substances 0.000 description 1
- 239000004698 Polyethylene Substances 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 229920012196 Polyoxymethylene Copolymer Polymers 0.000 description 1
- 239000004743 Polypropylene Substances 0.000 description 1
- 108010069820 Pro-Opiomelanocortin Proteins 0.000 description 1
- 102100027467 Pro-opiomelanocortin Human genes 0.000 description 1
- 102100035724 Probable ATP-dependent RNA helicase DDX43 Human genes 0.000 description 1
- 206010036790 Productive cough Diseases 0.000 description 1
- 102100040992 Programmed cell death protein 4 Human genes 0.000 description 1
- 102100036371 Proprotein convertase subtilisin/kexin type 4 Human genes 0.000 description 1
- 101710180646 Proprotein convertase subtilisin/kexin type 4 Proteins 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 102100024042 Protein BEX1 Human genes 0.000 description 1
- 102000055027 Protein Methyltransferases Human genes 0.000 description 1
- 108700040121 Protein Methyltransferases Proteins 0.000 description 1
- 102100034436 Protein NDRG2 Human genes 0.000 description 1
- 102100034432 Protein NDRG4 Human genes 0.000 description 1
- 102100024493 Protein boule-like Human genes 0.000 description 1
- 102100030728 Protein disulfide-thiol oxidoreductase Human genes 0.000 description 1
- 102100028286 Proto-oncogene tyrosine-protein kinase receptor Ret Human genes 0.000 description 1
- 102100031409 Putative RNA-binding protein 15B Human genes 0.000 description 1
- 241001467519 Pyrococcus sp. Species 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 102100038488 RNA helicase Mov10l1 Human genes 0.000 description 1
- 101710086015 RNA ligase Proteins 0.000 description 1
- 102100039690 RNA-binding motif, single-stranded-interacting protein 2 Human genes 0.000 description 1
- 102100025052 RNA-binding protein Raly Human genes 0.000 description 1
- 238000001069 Raman spectroscopy Methods 0.000 description 1
- 102100033243 Ras association domain-containing protein 1 Human genes 0.000 description 1
- 102100033239 Ras association domain-containing protein 5 Human genes 0.000 description 1
- 102100027915 Ras-related protein Rab-32 Human genes 0.000 description 1
- 102100039666 Receptor-type tyrosine-protein phosphatase delta Human genes 0.000 description 1
- 102100037420 Regulator of G-protein signaling 4 Human genes 0.000 description 1
- 101710140404 Regulator of G-protein signaling 4 Proteins 0.000 description 1
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 1
- 102100027611 Rho-related GTP-binding protein RhoB Human genes 0.000 description 1
- 101150054980 Rhob gene Proteins 0.000 description 1
- 108091006464 SLC25A23 Proteins 0.000 description 1
- 108091006957 SLC35D1 Proteins 0.000 description 1
- 108091006947 SLC39A4 Proteins 0.000 description 1
- 101150062766 STARD8 gene Proteins 0.000 description 1
- 102100031482 SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily E member 1-related Human genes 0.000 description 1
- 102100037192 Sal-like protein 4 Human genes 0.000 description 1
- 102100021853 Secreted and transmembrane protein 1 Human genes 0.000 description 1
- 102100030058 Secreted frizzled-related protein 1 Human genes 0.000 description 1
- 102100030054 Secreted frizzled-related protein 2 Human genes 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 108010022999 Serine Proteases Proteins 0.000 description 1
- 102000012479 Serine Proteases Human genes 0.000 description 1
- 102100030618 Serine/threonine-protein kinase 31 Human genes 0.000 description 1
- 102100027911 Serine/threonine-protein kinase PAK 3 Human genes 0.000 description 1
- 102100022824 Serum paraoxonase/arylesterase 2 Human genes 0.000 description 1
- 102100027288 Sestrin-1 Human genes 0.000 description 1
- 241000580858 Simian-Human immunodeficiency virus Species 0.000 description 1
- 102100023007 Single-stranded DNA-binding protein 2 Human genes 0.000 description 1
- 108010041216 Sirtuin 2 Proteins 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 102100036428 Spondin-1 Human genes 0.000 description 1
- 102100026755 StAR-related lipid transfer protein 8 Human genes 0.000 description 1
- 108010022348 Sulfate adenylyltransferase Proteins 0.000 description 1
- LSNNMFCWUKXFEE-UHFFFAOYSA-N Sulfurous acid Chemical compound OS(O)=O LSNNMFCWUKXFEE-UHFFFAOYSA-N 0.000 description 1
- 102100036235 Synaptonemal complex protein 3 Human genes 0.000 description 1
- 102100040763 Synaptotagmin-6 Human genes 0.000 description 1
- 230000006052 T cell proliferation Effects 0.000 description 1
- 102100028676 T-cell leukemia/lymphoma protein 1A Human genes 0.000 description 1
- 102100035891 T-cell surface glycoprotein CD3 delta chain Human genes 0.000 description 1
- 102000003629 TRPC3 Human genes 0.000 description 1
- 102100034912 Testis-specific Y-encoded-like protein 6 Human genes 0.000 description 1
- 241001237851 Thermococcus gorgonarius Species 0.000 description 1
- 241001235254 Thermococcus kodakarensis Species 0.000 description 1
- 240000002003 Thermococcus sp. JDF-3 Species 0.000 description 1
- 108010031372 Tissue Inhibitor of Metalloproteinase-2 Proteins 0.000 description 1
- 108010031429 Tissue Inhibitor of Metalloproteinase-3 Proteins 0.000 description 1
- 102100024333 Toll-like receptor 2 Human genes 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102100022972 Transcription factor AP-2-alpha Human genes 0.000 description 1
- 102100021380 Transcription factor GATA-4 Human genes 0.000 description 1
- 102100023935 Transmembrane glycoprotein NMB Human genes 0.000 description 1
- 101150037542 Trpc3 gene Proteins 0.000 description 1
- 102100034396 Trypsin-3 Human genes 0.000 description 1
- 102100036788 Tubulin beta-4A chain Human genes 0.000 description 1
- 102100036827 Tubulin gamma-2 chain Human genes 0.000 description 1
- 102000044209 Tumor Suppressor Genes Human genes 0.000 description 1
- 108700025716 Tumor Suppressor Genes Proteins 0.000 description 1
- 102100040115 Tumor necrosis factor receptor superfamily member 10C Human genes 0.000 description 1
- 102100040110 Tumor necrosis factor receptor superfamily member 10D Human genes 0.000 description 1
- 102100024578 Tyrosyl-DNA phosphodiesterase 2 Human genes 0.000 description 1
- 102100021436 UDP-glucose 4-epimerase Human genes 0.000 description 1
- 102100032284 UDP-glucuronic acid/UDP-N-acetylgalactosamine transporter Human genes 0.000 description 1
- 108010005656 Ubiquitin Thiolesterase Proteins 0.000 description 1
- 102100025038 Ubiquitin carboxyl-terminal hydrolase isozyme L1 Human genes 0.000 description 1
- 102100030434 Ubiquitin-protein ligase E3A Human genes 0.000 description 1
- 108010018161 UlTma DNA polymerase Proteins 0.000 description 1
- 229910052770 Uranium Inorganic materials 0.000 description 1
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 102100025330 Voltage-dependent P/Q-type calcium channel subunit alpha-1A Human genes 0.000 description 1
- 102100025085 Zinc finger MYM-type protein 2 Human genes 0.000 description 1
- 102100024719 Zinc finger protein 345 Human genes 0.000 description 1
- 102100023550 Zinc finger protein 42 homolog Human genes 0.000 description 1
- 102100026494 Zinc finger protein 655 Human genes 0.000 description 1
- 102100023140 Zinc transporter ZIP4 Human genes 0.000 description 1
- 230000001594 aberrant effect Effects 0.000 description 1
- 229920006243 acrylic copolymer Polymers 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- IRLPACMLTUPBCL-FCIPNVEPSA-N adenosine-5'-phosphosulfate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@@H](CO[P@](O)(=O)OS(O)(=O)=O)[C@H](O)[C@H]1O IRLPACMLTUPBCL-FCIPNVEPSA-N 0.000 description 1
- 150000003838 adenosines Chemical class 0.000 description 1
- 125000003275 alpha amino acid group Chemical group 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 238000000149 argon plasma sintering Methods 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 239000000090 biomarker Substances 0.000 description 1
- 108010005713 bis(5'-adenosyl)triphosphatase Proteins 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- 150000004648 butanoic acid derivatives Chemical class 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 1
- 201000010881 cervical cancer Diseases 0.000 description 1
- 210000003679 cervix uteri Anatomy 0.000 description 1
- 230000003196 chaotropic effect Effects 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 108091006090 chromatin-associated proteins Proteins 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000004624 confocal microscopy Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 238000001446 dark-field microscopy Methods 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- VGONTNSXDCQUGY-UHFFFAOYSA-N desoxyinosine Natural products C1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 VGONTNSXDCQUGY-UHFFFAOYSA-N 0.000 description 1
- 229960002086 dextran Drugs 0.000 description 1
- QONQRTHLHBTMGP-UHFFFAOYSA-N digitoxigenin Natural products CC12CCC(C3(CCC(O)CC3CC3)C)C3C11OC1CC2C1=CC(=O)OC1 QONQRTHLHBTMGP-UHFFFAOYSA-N 0.000 description 1
- SHIBSTMRCDJXLN-KCZCNTNESA-N digoxigenin Chemical compound C1([C@@H]2[C@@]3([C@@](CC2)(O)[C@H]2[C@@H]([C@@]4(C)CC[C@H](O)C[C@H]4CC2)C[C@H]3O)C)=CC(=O)OC1 SHIBSTMRCDJXLN-KCZCNTNESA-N 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 1
- 235000011180 diphosphates Nutrition 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000005672 electromagnetic field Effects 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 230000008995 epigenetic change Effects 0.000 description 1
- 210000003238 esophagus Anatomy 0.000 description 1
- 238000000105 evaporative light scattering detection Methods 0.000 description 1
- 238000007387 excisional biopsy Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000012953 feeding on blood of other organism Effects 0.000 description 1
- 230000005669 field effect Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- LIYGYAHYXQDGEP-UHFFFAOYSA-N firefly oxyluciferin Natural products Oc1csc(n1)-c1nc2ccc(O)cc2s1 LIYGYAHYXQDGEP-UHFFFAOYSA-N 0.000 description 1
- 235000019688 fish Nutrition 0.000 description 1
- 238000001917 fluorescence detection Methods 0.000 description 1
- 238000000799 fluorescence microscopy Methods 0.000 description 1
- 238000001215 fluorescent labelling Methods 0.000 description 1
- 238000007672 fourth generation sequencing Methods 0.000 description 1
- 210000001652 frontal lobe Anatomy 0.000 description 1
- 230000030279 gene silencing Effects 0.000 description 1
- 238000012252 genetic analysis Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 230000011365 genetic imprinting Effects 0.000 description 1
- 210000004602 germ cell Anatomy 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- ZJYYHGLJYGJLLN-UHFFFAOYSA-N guanidinium thiocyanate Chemical compound SC#N.NC(N)=N ZJYYHGLJYGJLLN-UHFFFAOYSA-N 0.000 description 1
- 229940029575 guanosine Drugs 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 102000055235 human DAB2 Human genes 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000007654 immersion Methods 0.000 description 1
- 238000007901 in situ hybridization Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 239000004816 latex Substances 0.000 description 1
- 229920000126 latex Polymers 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 239000013554 lipid monolayer Substances 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 235000019689 luncheon sausage Nutrition 0.000 description 1
- 201000005202 lung cancer Diseases 0.000 description 1
- 208000020816 lung neoplasm Diseases 0.000 description 1
- 210000001165 lymph node Anatomy 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 238000002493 microarray Methods 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 108091005601 modified peptides Chemical class 0.000 description 1
- 210000000214 mouth Anatomy 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 238000013188 needle biopsy Methods 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 239000003921 oil Substances 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- JJVOROULKOMTKG-UHFFFAOYSA-N oxidized Photinus luciferin Chemical compound S1C2=CC(O)=CC=C2N=C1C1=NC(=O)CS1 JJVOROULKOMTKG-UHFFFAOYSA-N 0.000 description 1
- 101800000607 p15 Proteins 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 239000013610 patient sample Substances 0.000 description 1
- 210000003800 pharynx Anatomy 0.000 description 1
- 150000003014 phosphoric acid esters Chemical class 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 229920002647 polyamide Polymers 0.000 description 1
- 239000004417 polycarbonate Substances 0.000 description 1
- 229920000515 polycarbonate Polymers 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 229920001155 polypropylene Polymers 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 238000007388 punch biopsy Methods 0.000 description 1
- 229940048084 pyrophosphate Drugs 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 239000002096 quantum dot Substances 0.000 description 1
- 239000010453 quartz Substances 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 238000001758 scanning near-field microscopy Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 210000000582 semen Anatomy 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000007389 shave biopsy Methods 0.000 description 1
- 239000000741 silica gel Substances 0.000 description 1
- 229910002027 silica gel Inorganic materials 0.000 description 1
- 229910052710 silicon Inorganic materials 0.000 description 1
- 239000010703 silicon Substances 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- FQENQNTWSFEDLI-UHFFFAOYSA-J sodium diphosphate Chemical compound [Na+].[Na+].[Na+].[Na+].[O-]P([O-])(=O)OP([O-])([O-])=O FQENQNTWSFEDLI-UHFFFAOYSA-J 0.000 description 1
- 239000012064 sodium phosphate buffer Substances 0.000 description 1
- 229940048086 sodium pyrophosphate Drugs 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 108010068698 spleen exonuclease Proteins 0.000 description 1
- 210000003802 sputum Anatomy 0.000 description 1
- 208000024794 sputum Diseases 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- CSABAZBYIWDIDE-UHFFFAOYSA-N sulfino hydrogen sulfite Chemical compound OS(=O)OS(O)=O CSABAZBYIWDIDE-UHFFFAOYSA-N 0.000 description 1
- 108010067247 tacrolimus binding protein 4 Proteins 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 210000001550 testis Anatomy 0.000 description 1
- 235000019818 tetrasodium diphosphate Nutrition 0.000 description 1
- 239000001577 tetrasodium phosphonato phosphate Substances 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 102000055046 tissue-factor-pathway inhibitor 2 Human genes 0.000 description 1
- 108010016054 tissue-factor-pathway inhibitor 2 Proteins 0.000 description 1
- 101150116154 tms1 gene Proteins 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- 125000002264 triphosphate group Chemical class [H]OP(=O)(O[H])OP(=O)(O[H])OP(=O)(O[H])O* 0.000 description 1
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 1
- 229940038773 trisodium citrate Drugs 0.000 description 1
- 230000005641 tunneling Effects 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 210000003932 urinary bladder Anatomy 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 210000001215 vagina Anatomy 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000002569 water oil cream Substances 0.000 description 1
- 229940075420 xanthine Drugs 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/6858—Allele-specific amplification
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/154—Methylation markers
Definitions
- the present invention generally relates to a primers configured to specifically hybridize to either a methylated or an unmethylated CpG site of a template nucleic acid or adjacent to a methylated or an unmethylated CpG site of a template nucleic acid, and methods of use thereof.
- DNA methylation is an important regulator of gene expression and may play a role in the development and progression of a number of diseases, such as cancer. Methylation is typically limited to cytosines located 5′ to a guanine (i.e., CpG sequences), however other forms of methylation are known. Research suggests that genes with high levels of methylation in a promoter region are transcriptionally silent, which may allow unchecked cell proliferation. When a promoter region has excessive methylation, the methylation is typically most prevalent in sequences having CpG repeats, so called “CpG islands.” Undermethylation (hypomethylation) has also been implicated in the development and progression of cancer through different mechanisms.
- CMOS complementary metal-oxide-semiconductor
- CpG sites e.g., CpG islands
- Those include sequencing methods in which genomic DNA is isolated and treated with bisulfite. Because methylated cytosines are not affected by bisulfite treatment, the unmethylated Cs, e.g., within a CpG site, are converted to uracil, while methylated Cs are not converted. After sequencing, comparison of the starting DNA and the bisulfate treated DNA indicates the location of methylation sites.
- MSP methylation specific PCR
- One primer set is complimentary to sequences whose Cs are converted to Us by bisulfite treatment, and the other primer set is complimentary to non-converted Cs.
- both the methylated and unmethylated DNA are amplified.
- Comparison of the amplification products gives insight as to the methylation in a given sequence. See Herman et al., “Methylation-specific PCR: A novel PCR assay for methylation status of CpG islands,” P.N.A.S ., vol. 93, p.
- MSP multi-sequence polymerase chain reaction
- Using two different primer sets limits the application of MSP.
- the amplification process cannot be maximally efficient because of the need to operate in temperature regimes appropriate for both primer sets.
- the invention provides compositions and methods for performing methylation specific amplification of nucleic acids using a single primer set. Aspects of the invention are accomplished by using primers configured such that they are able to specifically hybridize to either a methylated or an unmethylated site, e.g., a CpG island, of a template nucleic acid. Aspects of the invention are also accomplished by using primers configured such that they are able to specifically hybridize adjacent to a methylated or an unmethylated site, e.g., a CpG island, of a template nucleic acid. Accordingly, there is no need to have different primer sequences that distinguish between converted uracil sequences and unconverted cytosine sequences. Thus, amplification reactions are performed on nucleic acid with a single set of primers, which reduces costs and lowers assay complexity.
- primers that include an abasic region that interacts with either the methylated or the unmethylated CpG site of the template nucleic acid.
- a primer may contain one or more abasic regions corresponding to expected locations of methylation sites. Often the abasic region will be linked to a guanine moiety.
- the primer may contain any known abasic (spacer) molecule that is known in the art.
- Another way to accomplish this goal is to provide primers including at least one mismatched nucleotide that has similar annealing characteristics to both uracil and cytosine such that the primer hybridizes to either the methylated or the unmethylated CpG site of the template nucleic acid.
- Another way to accomplish this goal is to provide primers that hybridize to sequences adjacent to methylated or the unmethylated CpG site(s) of the template nucleic acid.
- the primer set allows for amplification of the entire site. Analysis of the amplification products gives information about the methylation status of the sample.
- the primers may include an adaptor sequence such that amplicons are produced with adaptors already attached. In other embodiments, adaptors are attached to the amplicons after the amplification reaction.
- the adaptor sequence may optionally include a homopolymer region, e.g., a poly-A region.
- the primer can specifically hybridize to either the methylated or the unmethylated CpG site of the template nucleic acid under conditions of high stringency. In some instances, it is useful to use sequencing to analyze the amplification products. In these embodiments it may be helpful to have a universal adaptor on the amplicons so that they can be hybridized to a universal primer on a solid support for the sequencing reaction.
- Primers of the invention may be used in a number of applications where it is desirable to analyze methylation of a sequence, e.g., epigenetics or diagnosing a disease, e.g., cancer.
- a methylation pattern of a nucleic acid may be analyzed by obtaining template nucleic acid, contacting the template nucleic acid with an agent (e.g., bisulfite) that modifies unmethylated cytosine, hybridizing a primer configured such that it is able to specifically hybridize to either a methylated or an unmethylated CpG site of the template nucleic acid, amplifying the template nucleic acid, and analyzing a methylation pattern of the amplified nucleic acid.
- an agent e.g., bisulfite
- any amplification or analysis method may be used in the methods of the invention. For example, PCR amplification, direct sequencing, fluorescent probe hybridization, etc.
- the template nucleic acid is amplified with PCR.
- the amplified nucleic acid in analyzed by sequencing the nucleic acid.
- FIG. 1 depicts a TWIST sequence with various regions of potential methylation, and forward and reverse primers having abasic regions which can be used to amplify the TWIST sequence;
- FIG. 2 depicts a NID sequence with various regions of potential methylation, and forward and reverse primers having abasic regions which can be used to amplify the NID sequence;
- FIG. 3 depicts a TWIST sequence with various regions of potential methylation, and forward and reverse primers having mismatches which can be used to amplify the TWIST sequence;
- FIG. 4 depicts a NID sequence with various regions of potential methylation, and forward and reverse primers having mismatches which can be used to amplify the NID sequence;
- FIG. 5 depicts a TWIST sequence with various regions of potential methylation, and forward and reverse primers which can be used to amplify CpG islands within the TWIST sequence;
- FIG. 6 shows fluorescence intensity due to production of amplification products as a function of the concentration of the primers shown in FIG. 5 ;
- FIG. 7 depicts a NID sequence with various regions of potential methylation, and forward and reverse primers which can be used to amplify CpG islands within the NID sequence;
- FIG. 8 shows fluorescence intensity due to production of amplification products as a function of the concentration of the primers shown in FIG. 7 .
- the present invention generally relates to primers configured to able to specifically hybridize to either a methylated or an unmethylated site, e.g., a CpG site, e.g., a CpG island of a template nucleic acid, or to hybridize adjacent to a methylated or an unmethylated site, and methods of use thereof.
- the invention provides a primer configured such that it is able to specifically hybridize to either a methylated or an unmethylated CpG site of a template nucleic acid.
- Methylation generally refers to cytosine methylation at positions C5 or N4 of cytosine, the N6 position of adenine or other types of nucleic acid methylation.
- methylation is analyzed by treating a nucleic acid sequence with bisulfate and then comparing the sequences of the bisulfite-treated and untreated nucleic acid sequences to identify one or more methylation regions.
- methylation is detected using methylation specific polymerase chain reaction.
- DNA methylation is a chemical modification of DNA performed by enzymes called methyltransferases, in which a methyl group (m) is added to certain cytosines (C) of DNA, to yield 5-methylcytosine.
- This non-mutational (epigenetic) process (mC) is a critical factor in gene expression regulation. See, e.g., J. G. Herman, Seminars in Cancer Biology, 9: 359-67, 1999. Research suggests genes with high levels of 5-methylcytosine in a promoter region are transcriptionally silent, which allows unchecked cell proliferation. Additionally, it is likely that there a correlation between gene transcription and undermethylation.
- Methylation appears to influence gene expression by affecting the interactions with DNA of both chromatin proteins and specific transcription factors.
- 5-Methylcytosine performs much like a regular cytosine, pairing up with a guanine. However, some areas of genome are methylated more heavily than others and highly methylated areas tend to be less transcriptionally active, through a mechanism not fully understood. Methylation of cytosines can also persist from the germ line of one of the parents into the zygote, marking the chromosome as being inherited from this parent (genetic imprinting). DNA methylation frequently occurs in repeated sequences, e.g., CpG islands, and helps to suppress the expression and mobility of transposable elements.
- Methylation patterns of DNA from cancer cells are significantly different from those of normal cells. Therefore, detection of methylation patterns in appropriately selected genes of cancer cells can lead to discrimination of cancer cells from normal (i.e., non-cancerous) cells, thereby providing an approach to early detection of cancer.
- MSP methylation specific PCR
- a nucleic acid sample is treated with a methylation reactant, typically bisulfite, and then amplified in the presence of two sets of primers.
- One primer set is complimentary to sequences with converted Cs and the second primer set is complimentary to non-converted Cs.
- both the methylated and unmethylated DNA can be simultaneously amplified, and the amplification products compared (e.g., sequenced) to determine methylation sites in a given sequence.
- the MSP method, and variations on the MSP method are described in greater detail in U.S. Pat. Nos. 6,265,171, 6,331,393, 6,977,146, 7,186,512, and 7,229,759 all of which are incorporated by reference herein in their entireties.
- the methods are useful for detecting the methylation status of at least one gene.
- This generally means determining the presence or absence of 5-methylcytosine at one or a plurality of (functionally relevant) CpG dinucleotides within the DNA sequence of the at least one gene.
- aberrant methylation which may be referred to as hypermethylation, of the at least one gene may be detected.
- the methylation status is determined in one or more CpG sites, e.g., CpG islands in the at least one gene. These CpG sites are often found in the promoter region of the gene(s).
- CpG dinucleotides are typically concentrated in the promoter regions and exons of human genes and the methylation status of these CpG residues is of functional importance to whether the at least one gene is expressed. Since CpG dinucleotides susceptible to methylation are typically concentrated in the promoter region, exons and introns of human genes, promoter, exon and intron regions may be assessed in order to determine the methylation status of the at least one gene.
- a “promoter” is a region extending typically between approximately 1 Kb, 500 by or 150 to 300 by upstream from the transcription start site. The CpG site may surround or be positioned around the transcription start site of the at least one gene.
- methods for detecting methylation status rely upon a reagent which selectively modifies unmethylated cytosine residues in the DNA contained in the sample to produce detectable modified residues but which does not modify methylated cytosine residues.
- Any suitable reagent may be utilized in the methods of the invention. Examples include bisulphite, hydrogen sulphite and disulphite reagents and suitable mixtures thereof.
- the reagent comprises, consists essentially of or consists of a bisulphite reagent.
- the invention includes primers having an abasic region and a region complementary to a template having a methylated CpG sequence or a UpG sequence.
- the abasic region of the primer may only span one base of the template, for example, a C of a CpG sequence that is likely to be methylated.
- the invention includes primers whose abasic region spans more than one base, for example three or five bases.
- Primers of the invention may also include more than one abasic region, for example, a first abasic region separated by sequence consisting of A, C, T, and G, and then a second abasic region.
- the primer could comprise less than about 20, less than about 15, less than about 10, less than about 8, less than about 6, less than about 5, less than about 4, or less than about 3 abasic regions.
- the primer could comprise less than about 80%, less than about 60%, less than about 50%, less than about 30%, less than about 20%, less than about 15%, less than about 10%, less than about 5%, or less than about 2% of the sequence comprises abasic regions.
- a number of suitable primer sequences are disclosed, however, the concept is generally applicable to any primer sequence that would be used to analyze for methylation of a nucleic acid strand, e.g., to investigate methylation in a template.
- kits for providing a set of abasic primers for use in analyzing methylated sequences might include a reaction buffer, a DNA polymerase, and a primer having an abasic region and a region complementary to a template having a methylated CpG sequence or a UpG sequence.
- Some kits may additionally include a set of deoxyribonucleotides needed for the amplification.
- a template suspected of having a methylated cytosine through a series of steps including: 1) providing a template suspected of having a methylated cytosine, 2) providing a primer having an abasic region and a region complementary to a template having a methylated CpG sequence or a UpG sequence, 3) mixing the template and the primer in the presence of free deoxyribonucleotides; and, 4) amplifying the template using PCR techniques. Once the template has been amplified, the amplified template can be isolated and sequenced. Typically the template suspected of having a methylated cytosine will be contacted with a bisulfite solution prior to the amplification in order to convert unmethylated Cs to Us.
- the invention additionally relates to primers having a sequence complementary to a template having a methylated CpG sequence or a UpG sequence, the primer having at least one mismatched base corresponding to a cytosine in the methylated CpG sequence or a uracil in the UpG sequence.
- These mismatches would be chosen to have similar annealing characteristics to Us and Cs and specific to the PCR technique used. Following the first few cycles of the PCR, the mismatched base(s) will become incorporated in the amplified product and subsequent annealing cycles would not involve a mismatched base, rather, the primer becomes a perfect match for amplified templates.
- primary PCR reactions can be performed on genomic DNA with a single set of primers, followed by sequencing to identify and quantify clinically relevant sequences nested between the PCR primers.
- the mismatched bases of the primer may only span one base of the template, for example, a C of a CpG sequence that is likely to be methylated.
- the invention includes primers whose mismatched bases span more than one base, for example three or five bases.
- Primers of the invention may also include more than one group of mismatches, for example, a first mismatched base separated by sequence consisting of A, C, T, and G, and then a second mismatched base.
- the primer could comprise less than about 20, less than about 15, less than about 10, less than about 8, less than about 6, less than about 5, less than about 4, or less than about 3 mismatched bases.
- the primer could comprise less than about 80%, less than about 60%, less than about 50%, less than about 30%, less than about 20%, less than about 15%, less than about 10%, less than about 5%, or less than about 2% of the sequence comprises mismatched bases.
- a number of suitable primer sequences are disclosed, however, the concept is generally applicable to any primer sequence that would be used for amplification (e.g., MSP) or to investigate methylation in a nucleic acid strand.
- kits for providing a set of primers having mismatched bases for use in MSP might include a reaction buffer, a DNA polymerase, and a primer including a sequence complementary to a template having a methylated CpG sequence or a UpG sequence, the primer having at least one mismatched base corresponding to a cytosine in the methylated CpG sequence or a uracil in the UpG sequence.
- Some kits may additionally include a set of deoxyribonucleotides needed for the amplification.
- a template suspected of having a methylated cytosine through a series of steps including: 1) providing a template suspected of having a methylated cytosine, 2) providing a primer having a sequence complementary to a template having a methylated CpG sequence or a UpG sequence, the primer having at least one mismatched base corresponding to a cytosine in the methylated CpG sequence or a uracil in the UpG sequence, 3) mixing the template and the primer in the presence of free deoxyribonucleotides; and, 4) amplifying the template using PCR techniques. Once the template has been amplified, the amplified template can be isolated and sequenced. Typically the template suspected of having a methylated cytosine will be contacted with a bisulfite solution prior to the amplification in order to convert unmethylated Cs to Us.
- mismatches would be chosen to have similar annealing characteristics to Us and Cs and specific PCR. Following the first few cycles of the PCR, the mismatched base(s) become incorporated in the amplified product and subsequent annealing cycles do not involve a mismatched base, but it is now a perfect match. This approach also eliminates the necessity to perform a primary PCR with different primers that are complimentary to bisulfite converted and unconverted primer sequences.
- Primers of the invention are designed to be substantially complementary to each template (i.e., strand of the oligonucleotide to be amplified).
- the primers should have sufficient complementarity with a 5′ and 3′ oligonucleotide to hybridize and permit amplification of a nucleic acid sequence suspected to be at least partially methylated, e.g., including a CpG sequence.
- Primers of the invention are employed in the amplification process, which is an enzymatic chain reaction that produces exponentially increasing quantities of template relative to the number of reaction steps involved (e.g., polymerase chain reaction or PCR).
- one primer is complementary to a template corresponding to a negative ( ⁇ ) strand of the locus of interest, the antisense primer, and the other is complementary to a template corresponding to the positive (+) strand, the sense primer.
- the product of the chain reaction is a discrete nucleic acid duplex with termini corresponding to the ends of the specific primers employed.
- Suitable conditions for primer extension are readily determined by those skilled in the art. These conditions include incubation temperature, incubation time, assay reagents, stabilizing factors, polymerizing agent, pH, and ionic strength sufficient to promote base pairing between the primer and desired template or target sequence.
- Polymerizing agents include enzymes capable of extending a primer by adding or substituting a nucleotide or modified nucleotide at the 3′ end of the primer. Suitable polymerizing enzymes include all manner of nucleotide polymerases.
- Polymerases may be isolated or cloned from a variety of organisms including viruses, bacteria, archaebacteria, fungi, mycoplasma, prokaryotes, and eukaryotes. Polymerases exhibiting thermal stability may also be employed, such as for example, polymerases from Thermus species, including Thermus aquaticus, Thermus brocianus, Therms thermophilus , and Thermus flavus; Pyrococcus species, including Pyrococcus furiosus, Pyrococcus sp. GB-D, and Pyrococcus woesei, Thermococcus litoralis , and Thermogata maritime .
- Thermus species including Thermus aquaticus, Thermus brocianus, Therms thermophilus , and Thermus flavus
- Pyrococcus species including Pyrococcus furiosus, Pyrococcus sp. GB-D, and Pyrococcus woesei, Ther
- Biodegradable Biologically active proteolytic fragments, recombinant polymerases, genetically engineered polymerizing enzymes, and modified polymerases are included in the definition of polymerizing agent. It should be understood that the invention can employ various types of polymerases from various species and origins without undue experimentation.
- Suitable conditions for primer extension include hybridization and stringency conditions allowing desired hybridization between the primer and template or target sequence.
- two nucleic acid sequences are said to be capable of specifically hybridizing to one another if the two molecules are capable of forming an anti-parallel, double-stranded nucleic acid structure or hybrid under hybridizing conditions, whereas they are substantially unable to form a double-stranded structure or hybrid when incubated with a non-target nucleic acid sequence under the same conditions.
- a nucleic acid molecule is said to be the “complement” of another nucleic acid molecule if it exhibits complete Watson-Crick base pair complementarity.
- Two molecules are said to be “substantially complementary” if they can hybridize to one another with sufficient stability to permit them to remain annealed to one another under at least conventional low-stringency conditions. Similarly, the molecules are said to be “complementary” if they can hybridize to one another with sufficient stability to permit them to remain annealed to one another under conventional high-stringency conditions.
- Stringency conditions in referring to homology or substantial similarity in the hybridization context, can be combined conditions of salt, temperature, organic solvents or other parameters that are typically known to influence hybridization.
- high stringency conditions include conditions selected to be 5 or more degrees higher than the thermal melting point (Tm) for a specific sequence at a defined ionic strength and pH.
- the Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched sequence.
- Primers can be prepared by a variety of methods including but not limited to cloning of appropriate sequences and direct chemical synthesis using methods well known in the art (Narang et al., Methods Enzymol. 68:90 (1979); Brown et al., Methods Enzymol. 68:109 (1979)). Primers can also be obtained from commercial sources such as Operon Technologies, Amersham Pharmacia Biotech, Sigma, Integrated DNA Technologies, and Life Technologies. The primers can have an identical melting temperature. The lengths of the primers can be extended or shortened at the 5′ end or the 3′ end to produce primers with desired melting temperatures. Also, the annealing position of each primer pair can be designed such that the sequence and, length of the primer pairs yield the desired melting temperature.
- Computer programs can also be used to design primers, including but not limited to Array Designer Software (Arrayit Inc.), Oligonucleotide Probe Sequence Design Software for Genetic Analysis (Olympus Optical Co.), NetPrimer, and DNAsis from Hitachi Software Engineering.
- T m melting or annealing temperature
- the annealing temperature of the primers can be recalculated and increased after any cycle of amplification, including but not limited to cycle 1, 2, 3, 4, 5, cycles 6-10, cycles 10-15, cycles 15-20, cycles 20-25, cycles 25-30, cycles 30-35, or cycles 35-40.
- the 5′ half of the primers is incorporated into the products from each loci of interest, thus the T m can be recalculated based on both the sequences of the 5′ half and the 3′ half of each primer.
- a primer of the invention may contain an abasic region or a mismatch at any position on the primer, provided that the primer is stable enough to endure the thermocycling of PCR amplification and provided that the primer does not self-anneal.
- a number of abasic regions are commercially available from suppliers, such as Integrated DNA Technologies, and can be custom incorporated into primer sequences.
- Exemplary abasic regions include, but are not limited to, O-dimethoxytrityl-1′,2′-dideoxyribose-3′-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite, O-dimethoxytrityl-1′-methoxy-2′-dideoxyribose-3′-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite, [4-(4,4′-Dimethoxytrityloxy)butyramidomethyl)-1-(2-nitrophenyl)-ethyl]-2-cyanoethyl-(N,N-diisopropyl)-phosphoramidite, O-Dimethoxytrityl-1′-Deoxyribose-2′-O-Triisopropylsilyloxymethyl-3′-[(2-cyanoethyl)-(N,N
- Primers containing mismatches can be ordered from any commercial supplier, e.g., Integrated DNA Technologies, provided that at least one mismatch is incorporated corresponding to a suspected methylation site. These mismatches would be chosen to have similar annealing characteristics to Us and Cs and specific to the PCR technique used.
- FIGS. 1-4 A number of exemplary sequences are shown in FIGS. 1-4 , including both forward and reverse primers, including both abasic regions and mismatches.
- the top sequence is listed 5′ to 3′, thus the lower sequence and the primers are listed 3′ to 5′.
- FIG. 1 shows abasic forward and reverse primers suitable for amplifying portions of a sequence coding a TWIST gene. The “+” symbols indicate regions that are likely to have methylation.
- Forward primers include SEQ ID NO. 1: 3′-GATGGGGTTGTTAT*G*GGTTAGG-5′; SEQ ID NO. 2: 3′-TT*GAT*GTTTTTTGGGTTG-5′; and SEQ ID NO.
- Reverse primers include SEQ ID NO. 4: 3′-TT*GT*GGAGGAAGG*GA*GG-5′; SEQ ID NO. 5: 3′-AGGG*GGTTAGGAGG*GGG-5′; and SEQ ID NO. 6: 3′-T*GGT*GTT*GGGTTAGGT*GT-5′, wherein “*” is an abasic region spanning one base of the template.
- FIG. 2 shows abasic forward and reverse primers suitable for amplifying portions of a sequence coding a NID gene.
- the “+” symbols indicate regions that are likely to have methylation.
- Forward primers include SEQ ID NO. 7: 3′-TTTTTTTATT*GGGTTG-5′; SEQ ID NO. 8: 3′-G*GGTTTTTAAGGAGTTTTATTTT*-5′; and SEQ ID NO. 9: 3′-T*GTAAGGTTTGGGGTAG*GG-5′, wherein “*” is an abasic region spanning one base of the template.
- Reverse primers include SEQ ID NO. 10: 3′-AG*GTAAAGGGAATTT*G-5′; SEQ ID NO.
- FIG. 3 shows mismatched forward and reverse primers suitable for amplifying portions of a sequence coding a TWIST gene.
- the “+” symbols indicate regions that are likely to have methylation.
- Forward primers include SEQ ID NO. 13: 3′-GATGGGGTTGTTATNGNGGTTAGG-5′; SEQ ID NO. 14: 3′-TTNGATNGTTTTGGGTTG-5′; and SEQ ID NO. 15: 3′-GTTAGGGTTNGGGGGNGTTG-5′, wherein “N” is a single mismatched base.
- Reverse primers include SEQ ID NO. 16: 3′-TTNGTNGGAGGAAGGNGANGG-5′; SEQ ID NO. 17: 3′-AGGGNGGTTAGGAGGNGGG-5′; and SEQ ID NO. 18: 3′-TNGGTNGTTNGGGTTAGGTNGT-5′, wherein “N” is a single mismatched base.
- FIG. 4 shows mismatched forward and reverse primers suitable for amplifying portions of a sequence coding a NID gene.
- the “+” symbols indicate regions that are likely to have methylation.
- Forward primers include SEQ ID NO. 19: 3′-TTTTTTTATTNGGGTTG-5′; SEQ ID NO. 20: 3′-GNGGTTTTTAAGGAGTTTTATTTTN-5′; and SEQ ID NO. 21: 3′-TNGTAAGGTTTGGGGTAGNGG-5′, wherein “N” is a single mismatched base.
- Reverse primers include SEQ ID NO. 22: 3′-AGNGTAAAGGGAATTTNG-5′; SEQ ID NO. 23: 3′-TAGNGGNGAGTGNGGTTG-5′; and SEQ ID NO. 24: 3′-AGGTTGGTAGGGNGNGG-5′, wherein “N” is a single mismatched base.
- the primers of the invention will additionally include a label.
- the label may be directly attached to the primer or the label may be attached via a linker, for example a polymer linker.
- the label attached to the primer may be directly or indirectly detectable.
- the exact label may be selected based, at least in part, on the particular type of detection method used. Exemplary detection methods include radioactive detection, optical absorbance detection, e.g., UV-visible absorbance detection, optical emission detection, e.g., fluorescence, phosphorescence, chemiluminescence, or Raman scattering.
- Preferred labels include optically-detectable labels, such as fluorescent labels.
- fluorescent labels include, but are not limited to, 4-acetamido-4′-isothiocyanatostilbene-2,2′ disulfonic acid; acridine and derivatives: acridine, acridine isothiocyanate; 5-(2′-aminoethyl)aminonaphthalene-1-sulfonic acid (EDANS); 4-amino-N-[3-vinylsulfonyl)phenyl]naphthalimide-3,5 disulfonate; N-(4-anilino-1-naphthyl)maleimide; anthranilamide; BODIPY; alexa; fluorescien; conjugated multi-dyes; Brilliant Yellow; coumarin and derivatives; coumarin, 7-amino-4-methylcoumarin (AMC, Coumarin 120), 7-amino-4-trifluoromethylcouluarin (Coumaran 151); cyanine dyes; cyanosine
- the amplicons produced with the disclosed methods include a detectable barcode-type label to facilitate sorting of amplified products.
- a detectable barcode-type label can be any barcode-type label known in the art including, for example, radio-frequency tags, semiconductor chips, barcoded magnetic beads (e.g., from Applied Biocode, Inc., Santa Fe Springs, Calif.), and nucleic acid sequences. When assessing methylation status, it may be useful to incorporate a barcode into a nucleic acid amplification product that is suspected to have methylation at a CpGsite, or is adjacent to a methylation site.
- primers may include a barcode such that the barcode will be incorporated into the amplified produces.
- the unique barcode sequence could be incorporated into the 5′ end of the primer, or the barcode sequence could be incorporated into the 3′ end of the primer.
- the primers may additionally comprise adaptors, e.g., as discussed below, such that the adaptors are incorporated into the amplified products.
- the barcodes and/or the adaptors may be incorporated into the amplified products after amplification.
- a suitable restriction enzyme or other endonuclease
- the same steps may be used to add an adaptor, e.g., a universal adaptor to the amplification products.
- Barcode sequences typically include a set of oligonucleotides ranging from about 4 to about 20 oligonucleotide bases (e.g., 8-10 oligonucleotide bases), which uniquely encode a discrete library member preferably without containing significant homology to any sequence in the targeted genome.
- the barcode sequence generally includes features useful in sequencing reactions. For example the barcode sequences are designed to have minimal or no homopolymer regions, i.e., 2 or more of the same base in a row such as AA or CCC, within the barcode sequence.
- the barcode sequences are also designed so that they are at least one edit distance away from the base addition order when performing base-by-base sequencing, ensuring that the first and last base do not match the expected bases of the sequence.
- the barcode sequences are designed to be correlated to a particular subject, allowing subject samples to be distinguished.
- Designing barcodes is shown U.S. Pat. No. 6,235,475, the contents of which are incorporated by reference herein in their entirety.
- the barcode sequences range from about 2 nucleotides to about 25 nucleotides, e.g., about 5 nucleotides to about 10 nucleotides. Since the barcode sequence is sequenced along with the template nucleic acid to which it is attached, the oligonucleotide length should be of minimal length so as to permit the longest read from the template nucleic acid attached. Generally, the barcode sequences are spaced from the template nucleic acid molecule by at least one base (minimizes homopolymeric combinations).
- adaptor oligonucleotides are included in the primers.
- the adaptors include a homopolymer region, e.g., a region of poly(A) or poly(T), that can hybridize to a universal primer for the sequence reaction. See also Sabot et al. (U.S. patent application number 2009/0226975), Adessi et al. (U.S. Pat. No. 7,115,400), and Kawashima et al. (U.S. patent application number 2005/0100900), the content of each of which is incorporated by reference herein in its entirety.
- ligase may be any enzyme capable of ligating an oligonucleotide (RNA or DNA) to the primers.
- Suitable ligases include T4 DNA ligase and T4 RNA ligase (such ligases are available commercially, from New England Biolabs). Methods for using ligases are well known in the art.
- the polymerase may be any enzyme capable of adding nucleotides to the 3′ and the 5′ terminus of template nucleic acid molecules.
- a set of primers of the invention may be substituted for one or more set(s) of primers used in methylation identification and analysis techniques of the art.
- primers may also be used in quantitative (“real-time”) PCR (qPCR), which allows for quantitative analysis of specific sequences during the amplification.
- qPCR quantitative (“real-time”) PCR
- MSP primers specific for templates with methylated Cs, and Cs that have been converted to Us after contacting the template with bisulfite, are used in PCR. By comparing the amplification products of the two primer reactions, it is possible to distinguish between the chemically modified methylated and unmethylated DNA, which adds an improved sensitivity of methylation detection.
- MSP eliminates the false positive results inherent to other PCR-based approaches which rely on differential restriction enzyme cleavage to distinguish methylated from unmethylated DNA. MSP also allows examination of all CpG sites, not just those within sequences recognized by methylation-sensitive restriction enzymes. This markedly increases the number of such sites which can be assessed and will allow rapid, fine mapping of methylation patterns throughout CpG rich regions.
- the primers of the invention may be used for sensitive and specific test for detecting and diagnosing different diseases or disorders, particularly diseases or disorders associated with hypermethylation, particularly cancer. In some embodiments, it is possible to detect and diagnose a disease or disorder using only MSP. In some embodiments, the assessment a disease or disorder requires a combination of different types of assays, e.g., in addition to MSP.
- a sample is only analyzed for methylation.
- both a sequence mutation and abnormal methylation are analyzed from a patient sample.
- the sequence mutation and the abnormal methylation may occur on the same chromosome, or on different chromosomes.
- one or more chromosomal abnormalities e.g., chromosomal instability
- Samples may include any bodily fluid such as blood, a blood fraction, saliva, sputum, urine, semen, transvaginal fluid, cerebrospinal fluid, or stool.
- Other such samples may include one or more cells or a tissue biopsy, such as a cell or biopsy from the brain, mouth, throat, esophagus, stomach, lymph node, stomach, intestine (large or small), kidney, bladder, liver, pancreas, skin, muscle, bone, bone marrow, breast, ovary, vagina, cervix, uterus, testicle or prostate.
- the sample may be obtained by methods known in the art, such as a phlebotomy, cheek swab, fine needle aspiration, core needle biopsy, vacuum assisted biopsy, direct and frontal lobe biopsy, shave biopsy, punch biopsy, excisional biopsy, or cutterage biopsy.
- nucleic acids are extracted to assess nucleic acid sequence mutations, chemical sequence modifications, and/or chromosomal abnormalities.
- Primers of the invention may be prepared so that they hybridize under high stringency conditions. Nucleic acid hybridization may be affected by such conditions as salt concentration, temperature, or organic solvents, in addition to base composition, length of complementary strands, and number of nucleotide base mismatches between hybridizing nucleic acids, as is readily appreciated by those skilled in the art. Stringency of hybridization reactions is readily determinable by one of ordinary skill in the art, and generally is an empirical calculation dependent upon sequence length, washing temperature, and salt concentration. In general, longer sequences require higher temperatures for proper annealing, while shorter sequences need lower temperatures. Hybridization generally depends on the ability of denatured DNA to reanneal when complementary strands are present in an environment below its melting temperature.
- Stringent conditions or high stringency conditions typically: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50° C.; (2) employ during hybridization a denaturing agent, such as formamide, for example, 50% (v/v) formamide with 0.1% bovine serum albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 mM sodium citrate at 42° C.; or (3) employ 50% formamide, 5 ⁇ SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 ⁇ Denhardt's solution, sonicated salmon sperm DNA (50 .mu.g/ml), 0.1% SDS, and 10% dextran sulfate at 42° C., with was
- Moderately stringent conditions may be identified as described by Sambrook et al., Molecular Cloning: A Laboratory Manual, New York: Cold Spring Harbor Press, 1989 (the contents of which are incorporated by reference herein in their entirety), and include the use of washing solution and hybridization conditions (e.g., temperature, ionic strength and % SDS) less stringent that those described above.
- washing solution and hybridization conditions e.g., temperature, ionic strength and % SDS
- An example of moderately stringent conditions is overnight incubation at 37° C.
- primers of the invention may be prepared and hybridized under low stringency conditions.
- a primer of the invention may contain a relatively large number of abasic regions or mismatches, and/or it may be desirable to have the primer anneal with a diverse set of target sequences, e.g., when the target sequence is not known.
- the conditions for hybridization can be adjusted to obtain specificity suitable for the application.
- Nucleic acids may be obtained by methods known in the art. Generally, nucleic acids can be extracted from a biological sample by a variety of techniques such as those described by Maniatis, et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y., pp. 280-281, (1982), the contents of which is incorporated by reference herein in its entirety.
- the isolated nucleic acid molecules may be single-stranded, double-stranded, or double-stranded with single-stranded regions (for example, stem- and loop-structures).
- the isolated nucleic acid can be deoxyribonucleic acid (DNA) and/or ribonucleic acid (RNA).
- genomic DNA is isolated from the biological sample.
- Extracts may be prepared using standard techniques in the art, for example, by chemical or mechanical lysis of the cell. Extracts then may be further treated, for example, by filtration and/or centrifugation and/or with chaotropic salts such as guanidinium isothiocyanate or urea or with organic solvents such as phenol and/or HCC13 to denature any contaminating and potentially interfering proteins.
- chaotropic salts such as guanidinium isothiocyanate or urea
- organic solvents such as phenol and/or HCC13
- primers of the invention may be used for screening for the presence or absence of methylation of a nucleic acid sequence, such as de-methylation, methylation, hypomethylation and hypermethylation.
- Any one or combination of methods may be used for detecting methylation as well as the different types of genetic markers from the patient's isolated nucleic acid. Suitable methods include real-time or quantitative PCR, digital PCR, PCR in flowing or stationary droplets, well plates, slugs or fluid flowing segments, and the like, in capillary tubes, microfluidic chips, or standard thermocycler based PCR methods known to those having ordinary skill in the art.
- Additional detection methods can utilize binding to microarrays for subsequent fluorescent or non-fluorescent detection, barcode mass detection using a mass spectrometric methods, detection of emitted radiowaves, detection of scattered light from aligned barcodes, fluorescence detection using quantitative PCR or digital PCR methods.
- Still other techniques include, for example, Northern blot, selective hybridization, cleaved amplified polymorphic sequence analysis, short tandem repeat analysis, the use of supports coated with oligonucleotide probes, amplification of the nucleic acid by RT-PCR, quantitative PCR or ligation-PCR, etc.
- These methods can include the use of a nucleic acid probe (for example, an oligonucleotide) that can selectively or specifically detect the target nucleic acid in the sample to detect changes at the level of a single nucleotide polymorphism, whole DNA-fingerprint analysis, allele specific analysis.
- Amplification is accomplished according to various methods known to the person skilled in the art, such as PCR, LCR, transcription-mediated amplification (TMA), strand-displacement amplification (SDA), NASBA, the use of allele-specific oligonucleotides (ASO), allele-specific amplification, Southern blot, single-strand conformational analysis (SSCA), in-situ hybridization (e.g., FISH), migration on a gel, heteroduplex analysis, etc.
- ASO allele-specific oligonucleotides
- SSCA single-strand conformational analysis
- FISH in-situ hybridization
- migration on a gel heteroduplex analysis, etc.
- the quantity of nucleic acid detected can be compared to a reference value, for example a median or mean value observed in patients who do not have cancer, or to a value measured in parallel in a non-cancerous sample.
- a reference value for example a median or mean value observed in patients who do not have cancer
- amplified templates will be sequenced. Sequencing may be achieved by any method known in the art. DNA sequencing techniques include classic di-deoxy sequencing reactions (Sanger method) using labeled terminators or primers and gel separation in slab or capillary, sequencing by synthesis using reversibly terminated labeled nucleotides, pyrosequencing, 454 sequencing, allele specific hybridization to a library of labeled oligonucleotide probes, sequencing by synthesis using allele specific hybridization to a library of labeled clones that is followed by ligation, real time monitoring of the incorporation of labeled nucleotides during a polymerization step, polony sequencing, and SOLiD sequencing. Sequencing of separated molecules has more recently been demonstrated by sequential or single extension reactions using polymerases or ligases as well as by single or sequential differential hybridizations with libraries of probes.
- the target nucleic acid or the amplified nucleic acid or both are detected using sequencing.
- Sequencing-by-synthesis is a common technique used in next generation procedures and works well with the instant invention.
- other sequencing methods can be used, including sequence-by-ligation, sequencing-by-hybridization, gel-based techniques and others.
- sequencing involves hybridizing a primer to a template to form a template/primer duplex, contacting the duplex with a polymerase in the presence of a detectably-labeled nucleotides under conditions that permit the polymerase to add nucleotides to the primer in a template-dependent manner.
- detectable labels include radiolabels, florescent labels, enzymatic labels, etc.
- the detectable label may be an optically detectable label, such as a fluorescent label.
- Exemplary fluorescent labels include cyanine, rhodamine, fluorescien, coumarin, BODIPY, alexa, or conjugated multi-dyes. Numerous techniques are known for detecting sequences and some are exemplified below. However, the exact means for detecting and compiling sequence data does not affect the function of the invention described herein.
- a sequencing technique that can be used in the methods of the provided invention includes, for example, Helicos True Single Molecule Sequencing (tSMS) (Harris T. D. et al. (2008) Science 320:106-109).
- tSMS Helicos True Single Molecule Sequencing
- a DNA sample is cleaved into strands of approximately 100 to 200 nucleotides, and a polyA sequence is added to the 3′ end of each DNA strand.
- Each strand is labeled by the addition of a fluorescently labeled adenosine nucleotide.
- the DNA strands are then hybridized to a flow cell, which contains millions of oligo-T capture sites that are immobilized to the flow cell surface.
- the templates can be at a density of about 100 million templates/cm 2 .
- the flow cell is then loaded into an instrument, e.g., HeliScopeTM sequencer, and a laser illuminates the surface of the flow cell, revealing the position of each template.
- a CCD camera can map the position of the templates on the flow cell surface.
- the template fluorescent label is then cleaved and washed away.
- the sequencing reaction begins by introducing a DNA polymerase and a fluorescently labeled nucleotide.
- the oligo-T nucleic acid serves as a primer.
- the polymerase incorporates the labeled nucleotides to the primer in a template directed manner. The polymerase and unincorporated nucleotides are removed.
- the templates that have directed incorporation of the fluorescently labeled nucleotide are detected by imaging the flow cell surface. After imaging, a cleavage step removes the fluorescent label, and the process is repeated with other fluorescently labeled nucleotides until the desired read length is achieved. Sequence information is collected with each nucleotide addition step. Further description of tSMS is shown for example in Lapidus et al. (U.S. Pat. No. 7,169,560), Lapidus et al. (U.S. patent application number 2009/0191565), Quake et al. (U.S. Pat. No. 6,818,395), Harris (U.S. Pat. No. 7,282,337), Quake et al. (U.S. patent application number 2002/0164629), and Braslaysky, et al., PNAS (USA), 100: 3960-3964 (2003), the contents of each of these references is incorporated by reference herein in its entirety.
- 454 sequencing involves two steps. In the first step, DNA is sheared into fragments of approximately 300-800 base pairs, and the fragments are blunt ended. Oligonucleotide adaptors are then ligated to the ends of the fragments. The adaptors serve as primers for amplification and sequencing of the fragments.
- the fragments can be attached to DNA capture beads, e.g., streptavidin-coated beads using, e.g., Adaptor B, which contains 5′-biotin tag.
- the fragments attached to the beads are PCR amplified within droplets of an oil-water emulsion. The result is multiple copies of clonally amplified DNA fragments on each bead.
- the beads are captured in wells (pico-liter sized). Pyrosequencing is performed on each DNA fragment in parallel. Addition of one or more nucleotides generates a light signal that is recorded by a CCD camera in a sequencing instrument. The signal strength is proportional to the number of nucleotides incorporated. Pyrosequencing makes use of pyrophosphate (PPi) which is released upon nucleotide addition. PPi is converted to ATP by ATP sulfurylase in the presence of adenosine 5′ phosphosulfate. Luciferase uses ATP to convert luciferin to oxyluciferin, and this reaction generates light that is detected and analyzed.
- PPi pyrophosphate
- SOLiD sequencing genomic DNA is sheared into fragments, and adaptors are attached to the 5′ and 3′ ends of the fragments to generate a fragment library.
- internal adaptors can be introduced by ligating adaptors to the 5′ and 3′ ends of the fragments, circularizing the fragments, digesting the circularized fragment to generate an internal adaptor, and attaching adaptors to the 5′ and 3′ ends of the resulting fragments to generate a mate-paired library.
- clonal bead populations are prepared in microreactors containing beads, primers, template, and PCR components.
- templates are denatured and beads are enriched to separate the beads with extended templates.
- Templates on the selected beads are subjected to a 3′ modification that permits bonding to a glass slide.
- the sequence can be determined by sequential hybridization and ligation of partially random oligonucleotides with a central determined base (or pair of bases) that is identified by a specific fluorophore. After a color is recorded, the ligated oligonucleotide is cleaved and removed and the process is then repeated.
- Ion Torrent sequencing U.S. patent application numbers 2009/0026082, 2009/0127589, 2010/0035252, 2010/0137143, 2010/0188073, 2010/0197507, 2010/0282617, 2010/0300559), 2010/0300895, 2010/0301398, and 2010/0304982, the content of each of which is incorporated by reference herein in its entirety.
- Ion Torrent sequencing DNA is sheared into fragments of approximately 300-800 base pairs, and the fragments are blunt ended. Oligonucleotide adaptors are then ligated to the ends of the fragments.
- the adaptors serve as primers for amplification and sequencing of the fragments.
- the fragments can be attached to a surface and is attached at a resolution such that the fragments are individually resolvable. Addition of one or more nucleotides releases a proton (H + ), which signal detected and recorded in a sequencing instrument. The signal strength is proportional to the number of nucleotides incorporated.
- Illumina sequencing is based on the amplification of DNA on a solid surface using fold-back PCR and anchored primers. Genomic DNA is fragmented, and adapters are added to the 5′ and 3′ ends of the fragments. DNA fragments that are attached to the surface of flow cell channels are extended and bridge amplified. The fragments become double stranded, and the double stranded molecules are denatured. Multiple cycles of the solid-phase amplification followed by denaturation can create several million clusters of approximately 1,000 copies of single-stranded DNA molecules of the same template in each channel of the flow cell.
- Primers DNA polymerase and four fluorophore-labeled, reversibly terminating nucleotides are used to perform sequential sequencing. After nucleotide incorporation, a laser is used to excite the fluorophores, and an image is captured and the identity of the first base is recorded. The 3′ terminators and fluorophores from each incorporated base are removed and the incorporation, detection and identification steps are repeated.
- SMRT single molecule, real-time
- each of the four DNA bases is attached to one of four different fluorescent dyes. These dyes are phospholinked.
- a single DNA polymerase is immobilized with a single molecule of template single stranded DNA at the bottom of a zero-mode waveguide (ZMW).
- ZMW is a confinement structure which enables observation of incorporation of a single nucleotide by DNA polymerase against the background of fluorescent nucleotides that rapidly diffuse in an out of the ZMW (in microseconds). It takes several milliseconds to incorporate a nucleotide into a growing strand.
- the fluorescent label is excited and produces a fluorescent signal, and the fluorescent tag is cleaved off. Detection of the corresponding fluorescence of the dye indicates which base was incorporated. The process is repeated.
- a nanopore is a small hole, of the order of 1 nanometer in diameter. Immersion of a nanopore in a conducting fluid and application of a potential across it results in a slight electrical current due to conduction of ions through the nanopore. The amount of current which flows is sensitive to the size of the nanopore. As a DNA molecule passes through a nanopore, each nucleotide on the DNA molecule obstructs the nanopore to a different degree. Thus, the change in the current passing through the nanopore as the DNA molecule passes through the nanopore represents a reading of the DNA sequence.
- a sequencing technique that can be used in the methods of the provided invention involves using a chemical-sensitive field effect transistor (chemFET) array to sequence DNA (for example, as described in US Patent Application Publication No. 20090026082).
- chemFET chemical-sensitive field effect transistor
- DNA molecules can be placed into reaction chambers, and the template molecules can be hybridized to a sequencing primer bound to a polymerase.
- Incorporation of one or more triphosphates into a new nucleic acid strand at the 3′ end of the sequencing primer can be detected by a change in current by a chemFET.
- An array can have multiple chemFET sensors.
- single nucleic acids can be attached to beads, and the nucleic acids can be amplified on the bead, and the individual beads can be transferred to individual reaction chambers on a chemFET array, with each chamber having a chemFET sensor, and the nucleic acids can be sequenced.
- Another example of a sequencing technique that can be used in the methods of the provided invention involves using a electron microscope (Moudrianakis E. N. and Beer M. Proc Natl Acad Sci USA. 1965 March; 53:564-71).
- individual DNA molecules are labeled using metallic labels that are distinguishable using an electron microscope. These molecules are then stretched on a flat surface and imaged using an electron microscope to measure sequences.
- Sequences can be read that originate from a single molecule or that originate from amplifications from a single molecule Millions of independent amplifications of single molecules can be performed in parallel either on a solid surface or in tiny compartments in water/oil emulsion.
- the DNA sample to be sequenced can be diluted and/or dispersed sufficiently to obtain one molecule in each compartment. This dilution can be followed by DNA amplification to generate copies of the original DNA sequences and creating “clusters” of molecules all having the same sequence. These clusters can then be sequenced. Many millions of reads can be generated in one run. Sequence can be generated starting at the 5′ end of a given strand of an amplified sequence and/or sequence can be generated from starting from the 5′ end of the complementary sequence. In a preferred embodiment, sequence from strands is generated, i.e. paired end reads (see for example, Harris, U.S. Pat. No. 7,767,400).
- Nucleotides useful in the invention include any nucleotide or nucleotide analog, whether naturally-occurring or synthetic.
- preferred nucleotides include phosphate esters of deoxyadenosine, deoxycytidine, deoxyguanosine, deoxythymidine, adenosine, cytidine, guanosine, and uridine.
- nucleotides useful in the invention include an adenine, cytosine, guanine, thymine base, a xanthine or hypoxanthine; 5-bromouracil, 2-aminopurine, deoxyinosine, or methylated cytosine, such as 5-methylcytosine, and N4-methoxydeoxycytosine.
- bases of polynucleotide mimetics such as methylated nucleic acids, e.g., 2′-O-methRNA, peptide nucleic acids, modified peptide nucleic acids, locked nucleic acids and any other structural moiety that can act substantially like a nucleotide or base, for example, by exhibiting base-complementarity with one or more bases that occur in DNA or RNA and/or being capable of base-complementary incorporation, and includes chain-terminating analogs.
- a nucleotide corresponds to a specific nucleotide species if they share base-complementarity with respect to at least one base.
- Nucleotides for nucleic acid sequencing according to the invention preferably include a detectable label that is directly or indirectly detectable.
- Preferred labels include optically-detectable labels, such as fluorescent labels.
- fluorescent labels include, but are not limited to, 4-acetamido-4′-isothiocyanatostilbene-2,2′ disulfonic acid; acridine and derivatives: acridine, acridine isothiocyanate; 5-(2′-aminoethyl)aminonaphthalene-1-sulfonic acid (EDANS); 4-amino-N-[3-vinylsulfonyl)phenyl]naphthalimide-3,5 disulfonate; N-(4-anilino-1-naphthyl)maleimide; anthranilamide; BODIPY; Brilliant Yellow; coumarin and derivatives; coumarin, 7-amino-4-methylcoumarin (AMC, Coumarin 120), 7-
- Nucleic acid polymerases generally useful in the invention include DNA polymerases, RNA polymerases, reverse transcriptases, and mutant or altered forms of any of the foregoing. DNA polymerases and their properties are described in detail in, among other places, DNA Replication 2nd edition, Kornberg and Baker, W. H. Freeman, New York, N.Y. (1991).
- Known conventional DNA polymerases useful in the invention include, but are not limited to, Pyrococcus furiosus (Pfu) DNA polymerase (Lundberg et al., 1991, Gene, 108: 1, Stratagene), Pyrococcus woesei (Pwo) DNA polymerase (Hinnisdaels et al., 1996, Biotechniques, 20:186-8, Boehringer Mannheim), Thermus thermophilus (Tth) DNA polymerase (Myers and Gelfand 1991, Biochemistry 30:7661), Bacillus stearothermophilus DNA polymerase (Stenesh and McGowan, 1977, Biochim Biophys Acta 475:32), Thermococcus litoralis (Tli) DNA polymerase (also referred to as VentTM DNA polymerase, Cariello et al., 1991, Polynucleotides Res, 19: 4193, New England Biolabs), 9.degree.NmTM DNA polymerase (New England Biolabs
- thermococcus sp Thermus aquaticus (Taq) DNA polymerase (Chien et al., 1976, J. Bacteoriol, 127: 1550), DNA polymerase, Pyrococcus kodakaraensis KOD DNA polymerase (Takagi et al., 1997, Appl. Environ. Microbiol. 63:4504), JDF-3 DNA polymerase (from thermococcus sp.
- Thermophilic DNA polymerases include, but are not limited to, ThermoSequenase®, 9.degree.NmTM, TherminatorTM, Taq, Tne, Tma, Pfu, Tfl, Tth, Tli, Stoffel fragment, VentTM and Deep VentTM DNA polymerase, KOD DNA polymerase, Tgo, JDF-3, and mutants, variants and derivatives thereof.
- a highly-preferred form of any polymerase is a 3′ exonuclease-deficient mutant.
- Reverse transcriptases useful in the invention include, but are not limited to, reverse transcriptases from HIV, HTLV-1, HTLV-II, FeLV, FIV, SIV, AMV, MMTV, MoMuLV and other retroviruses (see Levin, Cell 88:5-8 (1997); Verma, Biochim Biophys Acta. 473:1-38 (1977); Wu et al., CRC Crit. Rev Biochem. 3:289-347 (1975)).
- nucleic acid template molecules are attached to a substrate (also referred to herein as a surface) and subjected to analysis by single molecule sequencing as described herein. Nucleic acid template molecules are attached to the surface such that the template/primer duplexes are individually optically resolvable.
- Substrates for use in the invention can be two- or three-dimensional and can include a planar surface (e.g., a glass slide) or can be shaped.
- a substrate can include glass (e.g., controlled pore glass (CPG)), quartz, plastic (such as polystyrene (low cross-linked and high cross-linked polystyrene), polycarbonate, polypropylene and poly(methymethacrylate)), acrylic copolymer, polyamide, silicon, metal (e.g., alkanethiolate-derivatized gold), cellulose, nylon, latex, dextran, gel matrix (e.g., silica gel), polyacrolein, or composites.
- CPG controlled pore glass
- plastic such as polystyrene (low cross-linked and high cross-linked polystyrene), polycarbonate, polypropylene and poly(methymethacrylate)
- acrylic copolymer polyamide
- silicon e.g., metal (e.g., alkanethiolate-derivatized gold)
- cellulose e.g., nylon, latex, dextran, gel matrix (e.g.
- Suitable three-dimensional substrates include, for example, spheres, microparticles, beads, membranes, slides, plates, micromachined chips, tubes (e.g., capillary tubes), microwells, microfluidic devices, channels, filters, or any other structure suitable for anchoring a nucleic acid.
- Substrates can include planar arrays or matrices capable of having regions that include populations of template nucleic acids or primers. Examples include nucleoside-derivatized CPG and polystyrene slides; derivatized magnetic slides; polystyrene grafted with polyethylene glycol, and the like.
- Substrates are preferably coated to allow optimum optical processing and nucleic acid attachment. Substrates for use in the invention can also be treated to reduce background. Exemplary coatings include epoxides, and derivatized epoxides (e.g., with a binding molecule, such as an oligonucleotide or streptavidin).
- Various methods can be used to anchor or immobilize the nucleic acid molecule to the surface of the substrate.
- the immobilization can be achieved through direct or indirect bonding to the surface.
- the bonding can be by covalent linkage. See, Joos et al., Analytical Biochemistry 247:96-101, 1997; Oroskar et al., Clin. Chem. 42:1547-1555, 1996; and Khandjian, Mol. Bio. Rep. 11:107-115, 1986.
- a preferred attachment is direct amine bonding of a terminal nucleotide of the template or the 5′ end of the primer to an epoxide integrated on the surface.
- the bonding also can be through non-covalent linkage.
- biotin-streptavidin (Taylor et al., J. Phys. D. Appl. Phys. 24:1443, 1991) and digoxigenin with anti-digoxigenin (Smith et al., Science 253:1122, 1992) are common tools for anchoring nucleic acids to surfaces and parallels.
- the attachment can be achieved by anchoring a hydrophobic chain into a lipid monolayer or bilayer.
- Other methods for known in the art for attaching nucleic acid molecules to substrates also can be used.
- exemplary detection methods include radioactive detection, optical absorbance detection, e.g., UV-visible absorbance detection, optical emission detection, e.g., fluorescence or chemiluminescence.
- extended primers can be detected on a substrate by scanning all or portions of each substrate simultaneously or serially, depending on the scanning method used.
- fluorescence labeling selected regions on a substrate may be serially scanned one-by-one or row-by-row using a fluorescence microscope apparatus, such as described in Fodor (U.S. Pat. No. 5,445,934) and Mathies et al. (U.S. Pat. No. 5,091,652).
- Devices capable of sensing fluorescence from a single molecule include scanning tunneling microscope (siM) and the atomic force microscope (AFM). Hybridization patterns may also be scanned using a CCD camera (e.g., Model TE/CCD512SF, Princeton Instruments, Trenton, N.J.) with suitable optics (Ploem, in Fluorescent and Luminescent Probes for Biological Activity Mason, T. G. Ed., Academic Press, Landon, pp. 1-11 (1993), such as described in Yershov et al., Proc. Natl. Acad. Sci. 93:4913 (1996), or may be imaged by TV monitoring.
- CCD camera e.g., Model TE/CCD512SF, Princeton Instruments, Trenton, N.J.
- suitable optics Ploem, in Fluorescent and Luminescent Probes for Biological Activity Mason, T. G. Ed., Academic Press, Landon, pp. 1-11 (1993), such as described in Y
- a phosphorimager device For radioactive signals, a phosphorimager device can be used (Johnston et al., Electrophoresis, 13:566, 1990; Drmanac et al., Electrophoresis, 13:566, 1992; 1993).
- Other commercial suppliers of imaging instruments include General Scanning Inc., (Watertown, Mass. on the World Wide Web at genscan.com), Genix Technologies (Waterloo, Ontario, Canada; on the World Wide Web at confocal.com), and Applied Precision Inc. Such detection methods are particularly useful to achieve simultaneous scanning of multiple attached template nucleic acids.
- Optical setups include near-field scanning microscopy, far-field confocal microscopy, wide-field epi-illumination, light scattering, dark field microscopy, photoconversion, single and/or multiphoton excitation, spectral wavelength discrimination, fluorophor identification, evanescent wave illumination, and total internal reflection fluorescence (TIRF) microscopy.
- TIRF total internal reflection fluorescence
- certain methods involve detection of laser-activated fluorescence using a microscope equipped with a camera.
- Suitable photon detection systems include, but are not limited to, photodiodes and intensified CCD cameras.
- an intensified charge couple device (ICCD) camera can be used.
- ICCD intensified charge couple device
- the use of an ICCD camera to image individual fluorescent dye molecules in a fluid near a surface provides numerous advantages. For example, with an ICCD optical setup, it is possible to acquire a sequence of images (movies) of fluorophores.
- TIRF microscopy uses totally internally reflected excitation light and is well known in the art. See, e.g., the World Wide Web at nikon-instruments.jp/eng/page/products/tirf.aspx.
- detection is carried out using evanescent wave illumination and total internal reflection fluorescence microscopy.
- An evanescent light field can be set up at the surface, for example, to image fluorescently-labeled nucleic acid molecules.
- the optical field does not end abruptly at the reflective interface, but its intensity falls off exponentially with distance.
- This surface electromagnetic field called the “evanescent wave”
- the thin evanescent optical field at the interface provides low background and facilitates the detection of single molecules with high signal-to-noise ratio at visible wavelengths.
- the evanescent field also can image fluorescently-labeled nucleotides upon their incorporation into the attached template/primer complex in the presence of a polymerase. Total internal reflectance fluorescence microscopy is then used to visualize the attached template/primer duplex and/or the incorporated nucleotides with single molecule resolution.
- Some embodiments of the invention use non-optical detection methods such as, for example, detection using nanopores (e.g., protein or solid state) through which molecules are individually passed so as to allow identification of the molecules by noting characteristics or changes in various properties or effects such as capacitance or blockage current flow (see, for example, Stoddart et al, Proc. Nat. Acad. Sci., 106:7702, 2009; Purnell and Schmidt, ACS Nano, 3:2533, 2009; Branton et al, Nature Biotechnology, 26:1146, 2008; Polonsky et al, U.S. Application 2008/0187915; Mitchell & Howorka, Angew. Chem. Int. Ed. 47:5565, 2008; Borsenberger et al, J. Am. Chem. Soc., 131, 7530, 2009); or other suitable non-optical detection methods.
- nanopores e.g., protein or solid state
- Alignment and/or compilation of sequence results obtained from the image stacks produced as generally described above utilizes look-up tables that take into account possible sequences changes (due, e.g., to errors, mutations, etc.). Essentially, sequencing results obtained as described herein are compared to a look-up type table that contains all possible reference sequences plus 1 or 2 base errors.
- a plurality of nucleic acid molecules being sequenced are bound to a solid support.
- a capture sequence/universal priming site can be added at the 3′ and/or 5′ end of the template.
- the nucleic acids may be bound to the solid support by hybridizing the capture sequence to a complementary sequence covalently attached to the solid support.
- the capture sequence (also referred to as a universal capture sequence) is a nucleic acid sequence complimentary to a sequence attached to a solid support that may dually serve as a universal primer.
- the capture sequence is polyN n , where N is U, A, T, G, or C, e.g., 20-70, 40-60, e.g., about 50.
- the capture sequence could be polyT 40-50 or its complement.
- a member of a coupling pair such as, e.g., antibody/antigen, receptor/ligand, or the avidin-biotin pair as described in, e.g., U.S. Patent Application No. 2006/0252077
- a coupling pair such as, e.g., antibody/antigen, receptor/ligand, or the avidin-biotin pair as described in, e.g., U.S. Patent Application No. 2006/0252077
- hypermethylated genes that have been linked to various types of cancer have been identified.
- hypermethylated genes that have been linked with susceptibility to or incidence of colorectal cancer include, for example, FOXE1, SOX17, SYNE1, BOLL, CABYR, EFEMP1, FBLN2, FOXL2, GNB4, GSTM3, HoxD1, Jph3, Neuralized (NEURL), PPP1R14a, TP53AP1, RAB32, APC2, GPNMB, MMP2, EVL, STARD8, PTPRD, CD109, LGR6, RET, CHD5, RNF182, ICAM5, ARMCX2, CBR1, DDX43, DMRTB1, FBLN2, H1ST2H2AA, ICAM1, LY6K, NEF3, POMC, STK31, SYCP3, TCL1A, TFPI-2, TLR2, UCHL1, ZFP42, ASCL2, ATP8A2, CTAG2, EPHA4, FANCF, FOXQ1, HUS1B
- hypermethylated markers that have been shown to be associated with susceptibility to or incidence of gastrointestinal cancer include, without limitation, NDRG4/NDRG2 subfamily gene, GATA4, OSMR, GATA5, SFRP1, ADAM23, JPH3, SFRP2, APC, MGMT, TFPI2, BNIP3, FOXE1, SYNE1, SOX17, PHACTR3, JAM3, or any combination thereof.
- Nucleic acid sequences and the corresponding amino acid sequences for each of the genes referenced herein are well known and have been published in sequence databases that are freely accessible to the general public such as the GenBank, the National Institutes of Health genetic sequence database, and Protein, which includes a collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB.
- sequences are readily available to the skilled artisan for use in designing chimeric primers and chimeric probes having sequences specific for the DNA encoding such genes for use in the methods of the invention.
- Two primers, Forward A4 (FORA4) (SEQ ID NO. 25: 3′-TTT*GAT*GTTTTTTGGGTTG) and Reverse SM REV1 (SEQ ID NO. 26: 3′-GAGGGAAGGGGGAGGG) were prepared as shown in FIG. 5 .
- the primers were complimentary to sequences adjacent to regions of TWIST that are known to have potential methylation sites. As shown in FIG. 5 , there were 14 CpG islands between the primer sites. Some or all of the CpG islands could be methylated in a sample, and the methylation pattern may be indicative of a disease, e.g., cancer.
- Forward primer FORA4 contains two abasic regions, allowing it to specifically hybridize with methylated or unmethylated template DNA.
- the corresponding mismatch primer, wherein “*” is replaced with “N” could be prepared using the methods described herein, but is not shown.
- Reverse primer SM REV1 did not contain any abasic (or mismatched) sites.
- FIG. 6 shows the amplification efficiency of the primers as a function of concentration. As shown in FIG. 6 , using as little as 300 nM primers, it is possible to reliably detect the amplicons using only 40 cycles of PCR, and regardless of the methylation pattern of the original template. Analysis of the amplicons, e.g., via deep sequencing, will give information about the methylation of this region of TWIST.
- Two primers, Forward A3 (FORA3) (SEQ ID NO. 27: 3′-TTTTTTTATTGGGGTTGTTT) and Reverse SM REV3 (SEQ ID NO. 28: 3′-GTTGTGGTTTTAATAGGTTG) were prepared as shown in FIG. 7 .
- the primers were complimentary to sequences adjacent to regions of NID that are known to have potential methylation sites. As shown in FIG. 7 , there were 13 CpG islands between the primer sites. Some or all of the CpG islands could be methylated in a sample, and the methylation pattern may be indicative of a disease, e.g., cancer.
- Forward primer FORA3 contains one mismatch region (shown as lower-case or underlined “G”), allowing it to specifically hybridize with methylated or unmethylated template DNA.
- the corresponding abasic primer, wherein “G” is replaced with “*” could be prepared using the methods described herein, but is not shown.
- Reverse primer SM REV3 did not contain any mismatched (or abasic) sites.
- FIG. 8 shows the amplification efficiency of the primers as a function of concentration. As shown in FIG. 8 , using as little as 300 nM primers, it is possible to reliably detect the amplicons using only 38 cycles of PCR, and regardless of the methylation pattern of the original template. Analysis of the amplicons, e.g., via deep sequencing, will give information about the methylation of this region of NID.
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Analytical Chemistry (AREA)
- Immunology (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Microbiology (AREA)
- General Health & Medical Sciences (AREA)
- Pathology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Hospice & Palliative Care (AREA)
- Oncology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Primers having abasic regions or mismatches for amplifying sequences suspected of having methylation. Primers having abasic regions or mismatches for amplifying sequences adjacent to suspected or known methylated sequences. Methods of using primers having abasic regions or mismatches for identification of methylated sequences or sequences adjacent to suspected or known methylation sequences.
Description
- This application is a continuation of U.S. patent application Ser. No. 13/472,209, filed May 15, 2012, now U.S. Pat. No. 8,377,657, which is incorporated by reference herein in its entirety.
- The present invention generally relates to a primers configured to specifically hybridize to either a methylated or an unmethylated CpG site of a template nucleic acid or adjacent to a methylated or an unmethylated CpG site of a template nucleic acid, and methods of use thereof.
- DNA methylation is an important regulator of gene expression and may play a role in the development and progression of a number of diseases, such as cancer. Methylation is typically limited to cytosines located 5′ to a guanine (i.e., CpG sequences), however other forms of methylation are known. Research suggests that genes with high levels of methylation in a promoter region are transcriptionally silent, which may allow unchecked cell proliferation. When a promoter region has excessive methylation, the methylation is typically most prevalent in sequences having CpG repeats, so called “CpG islands.” Undermethylation (hypomethylation) has also been implicated in the development and progression of cancer through different mechanisms.
- Several methods have been developed to identify and quantify methylation, especially in CpG sites, e.g., CpG islands, that are implicated in silencing promoters. Those include sequencing methods in which genomic DNA is isolated and treated with bisulfite. Because methylated cytosines are not affected by bisulfite treatment, the unmethylated Cs, e.g., within a CpG site, are converted to uracil, while methylated Cs are not converted. After sequencing, comparison of the starting DNA and the bisulfate treated DNA indicates the location of methylation sites.
- Perhaps the most widely-used method of probing methylation patterns is methylation specific PCR (MSP) which uses two sets of primers for an amplification reaction. One primer set is complimentary to sequences whose Cs are converted to Us by bisulfite treatment, and the other primer set is complimentary to non-converted Cs. Using these two separate primer sets, both the methylated and unmethylated DNA are amplified. Comparison of the amplification products gives insight as to the methylation in a given sequence. See Herman et al., “Methylation-specific PCR: A novel PCR assay for methylation status of CpG islands,” P.N.A.S., vol. 93, p. 9821-26 (1996), which is incorporated herein by reference in its entirety. This technique can detect methylation changes as small as ±0.1%. In addition to methylation of CpG islands, many of the sequences surrounding clinically relevant hypermethylated CpG islands can also be hypermethylated, and are potential biomarkers.
- A problem with MSP is that it requires the use of two different primer sets, one for sequences containing methylated Cs, and the other for sequences containing Us, which were converted from unmethylated Cs by bisulfite treatment. Using two different primer sets limits the application of MSP. In addition to the costs associated with producing and maintaining two separate primer sets, the amplification process cannot be maximally efficient because of the need to operate in temperature regimes appropriate for both primer sets.
- The invention provides compositions and methods for performing methylation specific amplification of nucleic acids using a single primer set. Aspects of the invention are accomplished by using primers configured such that they are able to specifically hybridize to either a methylated or an unmethylated site, e.g., a CpG island, of a template nucleic acid. Aspects of the invention are also accomplished by using primers configured such that they are able to specifically hybridize adjacent to a methylated or an unmethylated site, e.g., a CpG island, of a template nucleic acid. Accordingly, there is no need to have different primer sequences that distinguish between converted uracil sequences and unconverted cytosine sequences. Thus, amplification reactions are performed on nucleic acid with a single set of primers, which reduces costs and lowers assay complexity.
- One way that this is accomplished is by providing primers that include an abasic region that interacts with either the methylated or the unmethylated CpG site of the template nucleic acid. For example, a primer may contain one or more abasic regions corresponding to expected locations of methylation sites. Often the abasic region will be linked to a guanine moiety. The primer may contain any known abasic (spacer) molecule that is known in the art. Another way to accomplish this goal is to provide primers including at least one mismatched nucleotide that has similar annealing characteristics to both uracil and cytosine such that the primer hybridizes to either the methylated or the unmethylated CpG site of the template nucleic acid. Another way to accomplish this goal is to provide primers that hybridize to sequences adjacent to methylated or the unmethylated CpG site(s) of the template nucleic acid. The primer set allows for amplification of the entire site. Analysis of the amplification products gives information about the methylation status of the sample.
- In some embodiments, the primers may include an adaptor sequence such that amplicons are produced with adaptors already attached. In other embodiments, adaptors are attached to the amplicons after the amplification reaction. The adaptor sequence may optionally include a homopolymer region, e.g., a poly-A region. In some embodiment, the primer can specifically hybridize to either the methylated or the unmethylated CpG site of the template nucleic acid under conditions of high stringency. In some instances, it is useful to use sequencing to analyze the amplification products. In these embodiments it may be helpful to have a universal adaptor on the amplicons so that they can be hybridized to a universal primer on a solid support for the sequencing reaction.
- Primers of the invention may be used in a number of applications where it is desirable to analyze methylation of a sequence, e.g., epigenetics or diagnosing a disease, e.g., cancer. For example, a methylation pattern of a nucleic acid may be analyzed by obtaining template nucleic acid, contacting the template nucleic acid with an agent (e.g., bisulfite) that modifies unmethylated cytosine, hybridizing a primer configured such that it is able to specifically hybridize to either a methylated or an unmethylated CpG site of the template nucleic acid, amplifying the template nucleic acid, and analyzing a methylation pattern of the amplified nucleic acid. Any amplification or analysis method may be used in the methods of the invention. For example, PCR amplification, direct sequencing, fluorescent probe hybridization, etc. In some instances, the template nucleic acid is amplified with PCR. In some instances, the amplified nucleic acid in analyzed by sequencing the nucleic acid.
-
FIG. 1 depicts a TWIST sequence with various regions of potential methylation, and forward and reverse primers having abasic regions which can be used to amplify the TWIST sequence; -
FIG. 2 depicts a NID sequence with various regions of potential methylation, and forward and reverse primers having abasic regions which can be used to amplify the NID sequence; -
FIG. 3 depicts a TWIST sequence with various regions of potential methylation, and forward and reverse primers having mismatches which can be used to amplify the TWIST sequence; -
FIG. 4 depicts a NID sequence with various regions of potential methylation, and forward and reverse primers having mismatches which can be used to amplify the NID sequence; -
FIG. 5 depicts a TWIST sequence with various regions of potential methylation, and forward and reverse primers which can be used to amplify CpG islands within the TWIST sequence; -
FIG. 6 shows fluorescence intensity due to production of amplification products as a function of the concentration of the primers shown inFIG. 5 ; -
FIG. 7 depicts a NID sequence with various regions of potential methylation, and forward and reverse primers which can be used to amplify CpG islands within the NID sequence; -
FIG. 8 shows fluorescence intensity due to production of amplification products as a function of the concentration of the primers shown inFIG. 7 . - The present invention generally relates to primers configured to able to specifically hybridize to either a methylated or an unmethylated site, e.g., a CpG site, e.g., a CpG island of a template nucleic acid, or to hybridize adjacent to a methylated or an unmethylated site, and methods of use thereof. In certain aspects, the invention provides a primer configured such that it is able to specifically hybridize to either a methylated or an unmethylated CpG site of a template nucleic acid.
- Methylation generally refers to cytosine methylation at positions C5 or N4 of cytosine, the N6 position of adenine or other types of nucleic acid methylation. In some embodiments, methylation is analyzed by treating a nucleic acid sequence with bisulfate and then comparing the sequences of the bisulfite-treated and untreated nucleic acid sequences to identify one or more methylation regions. In some embodiments, methylation is detected using methylation specific polymerase chain reaction.
- DNA methylation is a chemical modification of DNA performed by enzymes called methyltransferases, in which a methyl group (m) is added to certain cytosines (C) of DNA, to yield 5-methylcytosine. This non-mutational (epigenetic) process (mC) is a critical factor in gene expression regulation. See, e.g., J. G. Herman, Seminars in Cancer Biology, 9: 359-67, 1999. Research suggests genes with high levels of 5-methylcytosine in a promoter region are transcriptionally silent, which allows unchecked cell proliferation. Additionally, it is likely that there a correlation between gene transcription and undermethylation.
- Methylation appears to influence gene expression by affecting the interactions with DNA of both chromatin proteins and specific transcription factors. 5-Methylcytosine performs much like a regular cytosine, pairing up with a guanine. However, some areas of genome are methylated more heavily than others and highly methylated areas tend to be less transcriptionally active, through a mechanism not fully understood. Methylation of cytosines can also persist from the germ line of one of the parents into the zygote, marking the chromosome as being inherited from this parent (genetic imprinting). DNA methylation frequently occurs in repeated sequences, e.g., CpG islands, and helps to suppress the expression and mobility of transposable elements. Because 5-methylcytosine is chemically very similar to thymidine, CpG sites are frequently mutated and become rare in the genome, except at CpG islands where they remain unmethylated. Epigenetic changes of this type thus have the potential to direct increased frequencies of permanent genetic mutation.
- Methylation patterns of DNA from cancer cells are significantly different from those of normal cells. Therefore, detection of methylation patterns in appropriately selected genes of cancer cells can lead to discrimination of cancer cells from normal (i.e., non-cancerous) cells, thereby providing an approach to early detection of cancer.
- A common method for assessing methylation status, e.g., the presence of CpG islands, is methylation specific PCR, also known as MSP. In MSP a nucleic acid sample is treated with a methylation reactant, typically bisulfite, and then amplified in the presence of two sets of primers. One primer set is complimentary to sequences with converted Cs and the second primer set is complimentary to non-converted Cs. Using these two separate primer sets, both the methylated and unmethylated DNA can be simultaneously amplified, and the amplification products compared (e.g., sequenced) to determine methylation sites in a given sequence. The MSP method, and variations on the MSP method, are described in greater detail in U.S. Pat. Nos. 6,265,171, 6,331,393, 6,977,146, 7,186,512, and 7,229,759 all of which are incorporated by reference herein in their entireties.
- As stated, the methods are useful for detecting the methylation status of at least one gene. This generally means determining the presence or absence of 5-methylcytosine at one or a plurality of (functionally relevant) CpG dinucleotides within the DNA sequence of the at least one gene. In particular, aberrant methylation, which may be referred to as hypermethylation, of the at least one gene may be detected. Typically, the methylation status is determined in one or more CpG sites, e.g., CpG islands in the at least one gene. These CpG sites are often found in the promoter region of the gene(s). Thus, CpG dinucleotides are typically concentrated in the promoter regions and exons of human genes and the methylation status of these CpG residues is of functional importance to whether the at least one gene is expressed. Since CpG dinucleotides susceptible to methylation are typically concentrated in the promoter region, exons and introns of human genes, promoter, exon and intron regions may be assessed in order to determine the methylation status of the at least one gene. A “promoter” is a region extending typically between approximately 1 Kb, 500 by or 150 to 300 by upstream from the transcription start site. The CpG site may surround or be positioned around the transcription start site of the at least one gene.
- In some embodiments, methods for detecting methylation status rely upon a reagent which selectively modifies unmethylated cytosine residues in the DNA contained in the sample to produce detectable modified residues but which does not modify methylated cytosine residues. Any suitable reagent may be utilized in the methods of the invention. Examples include bisulphite, hydrogen sulphite and disulphite reagents and suitable mixtures thereof. In an embodiment of the invention, the reagent comprises, consists essentially of or consists of a bisulphite reagent. In some embodiments, the invention includes primers having an abasic region and a region complementary to a template having a methylated CpG sequence or a UpG sequence. In some embodiments, the abasic region of the primer may only span one base of the template, for example, a C of a CpG sequence that is likely to be methylated. However, the invention includes primers whose abasic region spans more than one base, for example three or five bases. Primers of the invention may also include more than one abasic region, for example, a first abasic region separated by sequence consisting of A, C, T, and G, and then a second abasic region. The primer could comprise less than about 20, less than about 15, less than about 10, less than about 8, less than about 6, less than about 5, less than about 4, or less than about 3 abasic regions. The primer could comprise less than about 80%, less than about 60%, less than about 50%, less than about 30%, less than about 20%, less than about 15%, less than about 10%, less than about 5%, or less than about 2% of the sequence comprises abasic regions. A number of suitable primer sequences are disclosed, however, the concept is generally applicable to any primer sequence that would be used to analyze for methylation of a nucleic acid strand, e.g., to investigate methylation in a template.
- The invention additionally provides kits for providing a set of abasic primers for use in analyzing methylated sequences. For example, such a kit might include a reaction buffer, a DNA polymerase, and a primer having an abasic region and a region complementary to a template having a methylated CpG sequence or a UpG sequence. Some kits may additionally include a set of deoxyribonucleotides needed for the amplification.
- Using the primers or the kits above, it is possible to amplify a template suspected of having a methylated cytosine through a series of steps including: 1) providing a template suspected of having a methylated cytosine, 2) providing a primer having an abasic region and a region complementary to a template having a methylated CpG sequence or a UpG sequence, 3) mixing the template and the primer in the presence of free deoxyribonucleotides; and, 4) amplifying the template using PCR techniques. Once the template has been amplified, the amplified template can be isolated and sequenced. Typically the template suspected of having a methylated cytosine will be contacted with a bisulfite solution prior to the amplification in order to convert unmethylated Cs to Us.
- The invention additionally relates to primers having a sequence complementary to a template having a methylated CpG sequence or a UpG sequence, the primer having at least one mismatched base corresponding to a cytosine in the methylated CpG sequence or a uracil in the UpG sequence. These mismatches would be chosen to have similar annealing characteristics to Us and Cs and specific to the PCR technique used. Following the first few cycles of the PCR, the mismatched base(s) will become incorporated in the amplified product and subsequent annealing cycles would not involve a mismatched base, rather, the primer becomes a perfect match for amplified templates. Using these primers, it is no longer necessary to have two different primer sets that distinguish between bisulfite converted sequences and unconverted sequences when performing methylation analysis. In an embodiment, primary PCR reactions can be performed on genomic DNA with a single set of primers, followed by sequencing to identify and quantify clinically relevant sequences nested between the PCR primers.
- In some embodiments, the mismatched bases of the primer may only span one base of the template, for example, a C of a CpG sequence that is likely to be methylated. However, the invention includes primers whose mismatched bases span more than one base, for example three or five bases. Primers of the invention may also include more than one group of mismatches, for example, a first mismatched base separated by sequence consisting of A, C, T, and G, and then a second mismatched base. The primer could comprise less than about 20, less than about 15, less than about 10, less than about 8, less than about 6, less than about 5, less than about 4, or less than about 3 mismatched bases. The primer could comprise less than about 80%, less than about 60%, less than about 50%, less than about 30%, less than about 20%, less than about 15%, less than about 10%, less than about 5%, or less than about 2% of the sequence comprises mismatched bases. A number of suitable primer sequences are disclosed, however, the concept is generally applicable to any primer sequence that would be used for amplification (e.g., MSP) or to investigate methylation in a nucleic acid strand.
- The present invention also relates to kits for providing a set of primers having mismatched bases for use in MSP. For example, such a kit might include a reaction buffer, a DNA polymerase, and a primer including a sequence complementary to a template having a methylated CpG sequence or a UpG sequence, the primer having at least one mismatched base corresponding to a cytosine in the methylated CpG sequence or a uracil in the UpG sequence. Some kits may additionally include a set of deoxyribonucleotides needed for the amplification.
- Using the primers or the kits of the invention, it is possible to amplify a template suspected of having a methylated cytosine through a series of steps including: 1) providing a template suspected of having a methylated cytosine, 2) providing a primer having a sequence complementary to a template having a methylated CpG sequence or a UpG sequence, the primer having at least one mismatched base corresponding to a cytosine in the methylated CpG sequence or a uracil in the UpG sequence, 3) mixing the template and the primer in the presence of free deoxyribonucleotides; and, 4) amplifying the template using PCR techniques. Once the template has been amplified, the amplified template can be isolated and sequenced. Typically the template suspected of having a methylated cytosine will be contacted with a bisulfite solution prior to the amplification in order to convert unmethylated Cs to Us.
- These mismatches would be chosen to have similar annealing characteristics to Us and Cs and specific PCR. Following the first few cycles of the PCR, the mismatched base(s) become incorporated in the amplified product and subsequent annealing cycles do not involve a mismatched base, but it is now a perfect match. This approach also eliminates the necessity to perform a primary PCR with different primers that are complimentary to bisulfite converted and unconverted primer sequences.
- Primers of the invention are designed to be substantially complementary to each template (i.e., strand of the oligonucleotide to be amplified). In other words, the primers should have sufficient complementarity with a 5′ and 3′ oligonucleotide to hybridize and permit amplification of a nucleic acid sequence suspected to be at least partially methylated, e.g., including a CpG sequence.
- Primers of the invention are employed in the amplification process, which is an enzymatic chain reaction that produces exponentially increasing quantities of template relative to the number of reaction steps involved (e.g., polymerase chain reaction or PCR). Typically, one primer is complementary to a template corresponding to a negative (−) strand of the locus of interest, the antisense primer, and the other is complementary to a template corresponding to the positive (+) strand, the sense primer. Annealing the primers to denatured nucleic acid followed by extension with an enzyme, such as the large fragment of DNA Polymerase I and nucleotides, results in newly synthesized + and − strands containing the target locus sequence. Because these newly synthesized sequences are also templates, repeated cycles of denaturing, primer annealing, and extension results in exponential production of the region (i.e., the target locus sequence) defined by the primer. The product of the chain reaction is a discrete nucleic acid duplex with termini corresponding to the ends of the specific primers employed.
- Suitable conditions for primer extension are readily determined by those skilled in the art. These conditions include incubation temperature, incubation time, assay reagents, stabilizing factors, polymerizing agent, pH, and ionic strength sufficient to promote base pairing between the primer and desired template or target sequence. Polymerizing agents include enzymes capable of extending a primer by adding or substituting a nucleotide or modified nucleotide at the 3′ end of the primer. Suitable polymerizing enzymes include all manner of nucleotide polymerases.
- Polymerases may be isolated or cloned from a variety of organisms including viruses, bacteria, archaebacteria, fungi, mycoplasma, prokaryotes, and eukaryotes. Polymerases exhibiting thermal stability may also be employed, such as for example, polymerases from Thermus species, including Thermus aquaticus, Thermus brocianus, Therms thermophilus, and Thermus flavus; Pyrococcus species, including Pyrococcus furiosus, Pyrococcus sp. GB-D, and Pyrococcus woesei, Thermococcus litoralis, and Thermogata maritime. Biologically active proteolytic fragments, recombinant polymerases, genetically engineered polymerizing enzymes, and modified polymerases are included in the definition of polymerizing agent. It should be understood that the invention can employ various types of polymerases from various species and origins without undue experimentation.
- Suitable conditions for primer extension include hybridization and stringency conditions allowing desired hybridization between the primer and template or target sequence. As used herein, two nucleic acid sequences are said to be capable of specifically hybridizing to one another if the two molecules are capable of forming an anti-parallel, double-stranded nucleic acid structure or hybrid under hybridizing conditions, whereas they are substantially unable to form a double-stranded structure or hybrid when incubated with a non-target nucleic acid sequence under the same conditions. A nucleic acid molecule is said to be the “complement” of another nucleic acid molecule if it exhibits complete Watson-Crick base pair complementarity. Two molecules are said to be “substantially complementary” if they can hybridize to one another with sufficient stability to permit them to remain annealed to one another under at least conventional low-stringency conditions. Similarly, the molecules are said to be “complementary” if they can hybridize to one another with sufficient stability to permit them to remain annealed to one another under conventional high-stringency conditions. Stringency conditions in referring to homology or substantial similarity in the hybridization context, can be combined conditions of salt, temperature, organic solvents or other parameters that are typically known to influence hybridization. Typically, high stringency conditions include conditions selected to be 5 or more degrees higher than the thermal melting point (Tm) for a specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched sequence. These techniques are well known in the art. For example, conventional stringency conditions are described in Sambrook, J., et al., Molecular Cloning, a Laboratory Manual, 2nd Edition, Cold Spring Harbor Press, Cold Spring Harbor, N.Y. (1989), and Haymes, B. D., et al., Nucleic Acid Hybridization, A Practical Approach, IRL Press, Washington, D.C. (1985), both of which are incorporated by reference in their entireties.
- Primers can be prepared by a variety of methods including but not limited to cloning of appropriate sequences and direct chemical synthesis using methods well known in the art (Narang et al., Methods Enzymol. 68:90 (1979); Brown et al., Methods Enzymol. 68:109 (1979)). Primers can also be obtained from commercial sources such as Operon Technologies, Amersham Pharmacia Biotech, Sigma, Integrated DNA Technologies, and Life Technologies. The primers can have an identical melting temperature. The lengths of the primers can be extended or shortened at the 5′ end or the 3′ end to produce primers with desired melting temperatures. Also, the annealing position of each primer pair can be designed such that the sequence and, length of the primer pairs yield the desired melting temperature. Computer programs can also be used to design primers, including but not limited to Array Designer Software (Arrayit Inc.), Oligonucleotide Probe Sequence Design Software for Genetic Analysis (Olympus Optical Co.), NetPrimer, and DNAsis from Hitachi Software Engineering. The Tm (melting or annealing temperature) of each primer is calculated using software programs such as Oligo Design, available from Invitrogen Corp.
- The annealing temperature of the primers can be recalculated and increased after any cycle of amplification, including but not limited to
cycle - A primer of the invention may contain an abasic region or a mismatch at any position on the primer, provided that the primer is stable enough to endure the thermocycling of PCR amplification and provided that the primer does not self-anneal. A number of abasic regions are commercially available from suppliers, such as Integrated DNA Technologies, and can be custom incorporated into primer sequences. Exemplary abasic regions include, but are not limited to, O-dimethoxytrityl-1′,2′-dideoxyribose-3′-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite, O-dimethoxytrityl-1′-methoxy-2′-dideoxyribose-3′-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite, [4-(4,4′-Dimethoxytrityloxy)butyramidomethyl)-1-(2-nitrophenyl)-ethyl]-2-cyanoethyl-(N,N-diisopropyl)-phosphoramidite, O-Dimethoxytrityl-1′-Deoxyribose-2′-O-Triisopropylsilyloxymethyl-3′-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite, (4,4′-Dimethoxytrityloxy)-dodecyl-1-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite), O-Dimethoxytritylhexaethyleneglycol, 1-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite, O-Dimethoxytrityl-triethyleneglycol, 1-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite; and 3-(4,4′-Dimethoxytrityloxy)propyl-1-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite. Primers containing mismatches can be ordered from any commercial supplier, e.g., Integrated DNA Technologies, provided that at least one mismatch is incorporated corresponding to a suspected methylation site. These mismatches would be chosen to have similar annealing characteristics to Us and Cs and specific to the PCR technique used.
- A number of exemplary sequences are shown in
FIGS. 1-4 , including both forward and reverse primers, including both abasic regions and mismatches. InFIGS. 1-4 the top sequence is listed 5′ to 3′, thus the lower sequence and the primers are listed 3′ to 5′.FIG. 1 shows abasic forward and reverse primers suitable for amplifying portions of a sequence coding a TWIST gene. The “+” symbols indicate regions that are likely to have methylation. Forward primers include SEQ ID NO. 1: 3′-GATGGGGTTGTTAT*G*GGTTAGG-5′; SEQ ID NO. 2: 3′-TT*GAT*GTTTTTTGGGTTG-5′; and SEQ ID NO. 3: 3′-GTTAGGGTT*GGGGG*GTTG-5′, wherein “*” is an abasic region spanning one base of the template. Reverse primers include SEQ ID NO. 4: 3′-TT*GT*GGAGGAAGG*GA*GG-5′; SEQ ID NO. 5: 3′-AGGG*GGTTAGGAGG*GGG-5′; and SEQ ID NO. 6: 3′-T*GGT*GTT*GGGTTAGGT*GT-5′, wherein “*” is an abasic region spanning one base of the template. -
FIG. 2 shows abasic forward and reverse primers suitable for amplifying portions of a sequence coding a NID gene. The “+” symbols indicate regions that are likely to have methylation. Forward primers include SEQ ID NO. 7: 3′-TTTTTTTATT*GGGTTG-5′; SEQ ID NO. 8: 3′-G*GGTTTTTAAGGAGTTTTATTTT*-5′; and SEQ ID NO. 9: 3′-T*GTAAGGTTTGGGGTAG*GG-5′, wherein “*” is an abasic region spanning one base of the template. Reverse primers include SEQ ID NO. 10: 3′-AG*GTAAAGGGAATTT*G-5′; SEQ ID NO. 11: 3′-TAG*GG*GAGTG*GGTTG-5′; and SEQ ID NO. 12: 3′-AGGTTGGTAGGG*G*GG-5′, wherein “*” is an abasic region spanning one base of the template. -
FIG. 3 shows mismatched forward and reverse primers suitable for amplifying portions of a sequence coding a TWIST gene. The “+” symbols indicate regions that are likely to have methylation. Forward primers include SEQ ID NO. 13: 3′-GATGGGGTTGTTATNGNGGTTAGG-5′; SEQ ID NO. 14: 3′-TTNGATNGTTTTTTGGGTTG-5′; and SEQ ID NO. 15: 3′-GTTAGGGTTNGGGGGNGTTG-5′, wherein “N” is a single mismatched base. Reverse primers include SEQ ID NO. 16: 3′-TTNGTNGGAGGAAGGNGANGG-5′; SEQ ID NO. 17: 3′-AGGGNGGTTAGGAGGNGGG-5′; and SEQ ID NO. 18: 3′-TNGGTNGTTNGGGTTAGGTNGT-5′, wherein “N” is a single mismatched base. -
FIG. 4 shows mismatched forward and reverse primers suitable for amplifying portions of a sequence coding a NID gene. The “+” symbols indicate regions that are likely to have methylation. Forward primers include SEQ ID NO. 19: 3′-TTTTTTTATTNGGGTTG-5′; SEQ ID NO. 20: 3′-GNGGTTTTTAAGGAGTTTTATTTTN-5′; and SEQ ID NO. 21: 3′-TNGTAAGGTTTGGGGTAGNGG-5′, wherein “N” is a single mismatched base. Reverse primers include SEQ ID NO. 22: 3′-AGNGTAAAGGGAATTTNG-5′; SEQ ID NO. 23: 3′-TAGNGGNGAGTGNGGTTG-5′; and SEQ ID NO. 24: 3′-AGGTTGGTAGGGNGNGG-5′, wherein “N” is a single mismatched base. - In some embodiments, the primers of the invention will additionally include a label. The label may be directly attached to the primer or the label may be attached via a linker, for example a polymer linker. The label attached to the primer may be directly or indirectly detectable. In certain embodiments, the exact label may be selected based, at least in part, on the particular type of detection method used. Exemplary detection methods include radioactive detection, optical absorbance detection, e.g., UV-visible absorbance detection, optical emission detection, e.g., fluorescence, phosphorescence, chemiluminescence, or Raman scattering. Preferred labels include optically-detectable labels, such as fluorescent labels. Examples of fluorescent labels include, but are not limited to, 4-acetamido-4′-isothiocyanatostilbene-2,2′ disulfonic acid; acridine and derivatives: acridine, acridine isothiocyanate; 5-(2′-aminoethyl)aminonaphthalene-1-sulfonic acid (EDANS); 4-amino-N-[3-vinylsulfonyl)phenyl]naphthalimide-3,5 disulfonate; N-(4-anilino-1-naphthyl)maleimide; anthranilamide; BODIPY; alexa; fluorescien; conjugated multi-dyes; Brilliant Yellow; coumarin and derivatives; coumarin, 7-amino-4-methylcoumarin (AMC, Coumarin 120), 7-amino-4-trifluoromethylcouluarin (Coumaran 151); cyanine dyes; cyanosine; 4′,6-diaminidino-2-phenylindole (DAPI); 5′5″-dibromopyrogallol-sulfonaphthalein (Bromopyrogallol Red); 7-diethylamino-3-(4′-isothiocyanatophenyl)-4-methylcoumarin; diethylenetriamine pentaacetate; 4,4′-diisothiocyanatodihydro-stilbene-2,2′-disulfonic acid; 4,4′-diisothiocyanatostilbene-2,2′-disulfonic acid; 5-[dimethylamino]naphthalene-1-sulfonyl chloride (DNS, dansylchloride); 4-dimethylaminophenylazophenyl-4′-isothiocyanate (DABITC); eosin and derivatives; eosin, eosin isothiocyanate, erythrosin and derivatives; erythrosin B, erythrosin, isothiocyanate; ethidium; fluorescein and derivatives; 5-carboxyfluorescein (FAM), 5-(4,6-dichlorotriazin-2-yl)aminofluorescein (DTAF), 2′,7′-dimethoxy-4′5′-dichloro-6-carboxyfluorescein, fluorescein, fluorescein isothiocyanate, QFITC, (XRITC); fluorescamine; IR144; IR1446; Malachite Green isothiocyanate; 4-methylumbelliferoneortho cresolphthalein; nitrotyrosine; pararosaniline; Phenol Red; B-phycoerythrin; o-phthaldialdehyde; pyrene and derivatives: pyrene, pyrene butyrate, succinimidyl 1-pyrene; quantum dots; Reactive Red 4 (Cibacron™ Brilliant Red 3B-A) rhodamine and derivatives: 6-carboxy-X-rhodamine (ROX), 6-carboxyrhodamine (R6G), lissamine rhodamine B sulfonyl chloride rhodamine (Rhod), rhodamine B, rhodamine 123, rhodamine X isothiocyanate, sulforhodamine B, sulforhodamine 101, sulfonyl chloride derivative of sulforhodamine 101 (Texas Red); N,N,N′,N′ tetramethyl-6-carboxyrhodamine (TAMRA); tetramethyl rhodamine; tetramethyl rhodamine isothiocyanate (TRITC); riboflavin; rosolic acid; terbium chelate derivatives; Atto dyes, Cy3; Cy5; Cy5.5; Cy7; IRD 700; IRD 800; La Jolla Blue; phthalo cyanine; and naphthalo cyanine. Labels other than fluorescent labels are contemplated by the invention, including other optically-detectable labels.
- In some embodiments, the amplicons produced with the disclosed methods include a detectable barcode-type label to facilitate sorting of amplified products. A detectable barcode-type label can be any barcode-type label known in the art including, for example, radio-frequency tags, semiconductor chips, barcoded magnetic beads (e.g., from Applied Biocode, Inc., Santa Fe Springs, Calif.), and nucleic acid sequences. When assessing methylation status, it may be useful to incorporate a barcode into a nucleic acid amplification product that is suspected to have methylation at a CpGsite, or is adjacent to a methylation site.
- In some instances, primers may include a barcode such that the barcode will be incorporated into the amplified produces. For example, the unique barcode sequence could be incorporated into the 5′ end of the primer, or the barcode sequence could be incorporated into the 3′ end of the primer. The primers may additionally comprise adaptors, e.g., as discussed below, such that the adaptors are incorporated into the amplified products.
- In alternate embodiments, the barcodes and/or the adaptors may be incorporated into the amplified products after amplification. For example, a suitable restriction enzyme (or other endonuclease) may be used to cut off an end of an amplification product so that a barcode can be added with a ligase. The same steps may be used to add an adaptor, e.g., a universal adaptor to the amplification products. These methods provide additional functionality for later processes, for example, sorting and sequencing.
- Attaching barcode sequences to nucleic acids is shown in U.S. Pub. 2008/0081330 and PCT/US09/64001, the content of each of which is incorporated by reference herein in its entirety. Methods for designing sets of barcode sequences and other methods for attaching barcode sequences are shown in U.S. Pat. Nos. 6,138,077; 6,352,828; 5,636,400; 6,172,214; 6235,475; 7,393,665; 7,544,473; 5,846,719; 5,695,934; 5,604,097; 6,150,516; RE39,793; 7,537,897; 6,172,218; and 5,863,722, the content of each of which is incorporated by reference herein in its entirety.
- Barcode sequences typically include a set of oligonucleotides ranging from about 4 to about 20 oligonucleotide bases (e.g., 8-10 oligonucleotide bases), which uniquely encode a discrete library member preferably without containing significant homology to any sequence in the targeted genome. The barcode sequence generally includes features useful in sequencing reactions. For example the barcode sequences are designed to have minimal or no homopolymer regions, i.e., 2 or more of the same base in a row such as AA or CCC, within the barcode sequence. The barcode sequences are also designed so that they are at least one edit distance away from the base addition order when performing base-by-base sequencing, ensuring that the first and last base do not match the expected bases of the sequence. In certain embodiments, the barcode sequences are designed to be correlated to a particular subject, allowing subject samples to be distinguished. Designing barcodes is shown U.S. Pat. No. 6,235,475, the contents of which are incorporated by reference herein in their entirety.
- In certain embodiments, the barcode sequences range from about 2 nucleotides to about 25 nucleotides, e.g., about 5 nucleotides to about 10 nucleotides. Since the barcode sequence is sequenced along with the template nucleic acid to which it is attached, the oligonucleotide length should be of minimal length so as to permit the longest read from the template nucleic acid attached. Generally, the barcode sequences are spaced from the template nucleic acid molecule by at least one base (minimizes homopolymeric combinations).
- In certain embodiments adaptor oligonucleotides are included in the primers. In some embodiments, the adaptors include a homopolymer region, e.g., a region of poly(A) or poly(T), that can hybridize to a universal primer for the sequence reaction. See also Sabot et al. (U.S. patent application number 2009/0226975), Adessi et al. (U.S. Pat. No. 7,115,400), and Kawashima et al. (U.S. patent application number 2005/0100900), the content of each of which is incorporated by reference herein in its entirety. Any method known in the art may be used to join the adaptors with the primers, for example, a ligase, a polymerase, Topo cloning (e.g., Invitrogen's topoisomerase vector cloning system using a topoisomerase enzyme), or chemical ligation or conjugation. The ligase may be any enzyme capable of ligating an oligonucleotide (RNA or DNA) to the primers. Suitable ligases include T4 DNA ligase and T4 RNA ligase (such ligases are available commercially, from New England Biolabs). Methods for using ligases are well known in the art. The polymerase may be any enzyme capable of adding nucleotides to the 3′ and the 5′ terminus of template nucleic acid molecules.
- Generally, a set of primers of the invention, including a forward and a reverse primer, may be substituted for one or more set(s) of primers used in methylation identification and analysis techniques of the art. In addition to PCR, primers may also be used in quantitative (“real-time”) PCR (qPCR), which allows for quantitative analysis of specific sequences during the amplification. A method for using PCR to assess methylation of a group of CpG sites within a CpG island by using two sets of primers is described in U.S. Pat. No. 6,017,704 incorporated by reference herein in its entirety. Primer sets of the invention are generally compatible with other PCR protocols described in the literature.
- In the example of MSP, primers specific for templates with methylated Cs, and Cs that have been converted to Us after contacting the template with bisulfite, are used in PCR. By comparing the amplification products of the two primer reactions, it is possible to distinguish between the chemically modified methylated and unmethylated DNA, which adds an improved sensitivity of methylation detection. In addition, MSP eliminates the false positive results inherent to other PCR-based approaches which rely on differential restriction enzyme cleavage to distinguish methylated from unmethylated DNA. MSP also allows examination of all CpG sites, not just those within sequences recognized by methylation-sensitive restriction enzymes. This markedly increases the number of such sites which can be assessed and will allow rapid, fine mapping of methylation patterns throughout CpG rich regions.
- The primers of the invention may be used for sensitive and specific test for detecting and diagnosing different diseases or disorders, particularly diseases or disorders associated with hypermethylation, particularly cancer. In some embodiments, it is possible to detect and diagnose a disease or disorder using only MSP. In some embodiments, the assessment a disease or disorder requires a combination of different types of assays, e.g., in addition to MSP.
- In some embodiments a sample is only analyzed for methylation. In other embodiments, both a sequence mutation and abnormal methylation are analyzed from a patient sample. The sequence mutation and the abnormal methylation may occur on the same chromosome, or on different chromosomes. Optionally, one or more chromosomal abnormalities (e.g., chromosomal instability) may be detected in combination with a sequence mutation and abnormal methylation to further improve diagnostic accuracy.
- Methods of the invention involve obtaining a biological sample, from a subject. Samples may include any bodily fluid such as blood, a blood fraction, saliva, sputum, urine, semen, transvaginal fluid, cerebrospinal fluid, or stool. Other such samples may include one or more cells or a tissue biopsy, such as a cell or biopsy from the brain, mouth, throat, esophagus, stomach, lymph node, stomach, intestine (large or small), kidney, bladder, liver, pancreas, skin, muscle, bone, bone marrow, breast, ovary, vagina, cervix, uterus, testicle or prostate.
- The sample may be obtained by methods known in the art, such as a phlebotomy, cheek swab, fine needle aspiration, core needle biopsy, vacuum assisted biopsy, direct and frontal lobe biopsy, shave biopsy, punch biopsy, excisional biopsy, or cutterage biopsy. Once the sample is obtained, nucleic acids are extracted to assess nucleic acid sequence mutations, chemical sequence modifications, and/or chromosomal abnormalities.
- Primers of the invention may be prepared so that they hybridize under high stringency conditions. Nucleic acid hybridization may be affected by such conditions as salt concentration, temperature, or organic solvents, in addition to base composition, length of complementary strands, and number of nucleotide base mismatches between hybridizing nucleic acids, as is readily appreciated by those skilled in the art. Stringency of hybridization reactions is readily determinable by one of ordinary skill in the art, and generally is an empirical calculation dependent upon sequence length, washing temperature, and salt concentration. In general, longer sequences require higher temperatures for proper annealing, while shorter sequences need lower temperatures. Hybridization generally depends on the ability of denatured DNA to reanneal when complementary strands are present in an environment below its melting temperature. The higher the degree of desired homology between the sequence and hybridizable sequence, the higher the relative temperature that can be used. As a result, it follows that higher relative temperatures would tend to make the reaction conditions more stringent, while lower temperatures less so. For additional details and explanation of stringency of hybridization reactions, see Ausubel et al., Current Protocols in Molecular Biology, Wiley Interscience Publishers, (1995), the contents of which are incorporated by reference herein in their entirety.
- Stringent conditions or high stringency conditions typically: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50° C.; (2) employ during hybridization a denaturing agent, such as formamide, for example, 50% (v/v) formamide with 0.1% bovine serum albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 mM sodium citrate at 42° C.; or (3) employ 50% formamide, 5×SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5×Denhardt's solution, sonicated salmon sperm DNA (50 .mu.g/ml), 0.1% SDS, and 10% dextran sulfate at 42° C., with washes at 42° C. in 0.2×SSC (sodium chloride/sodium citrate) and 50% formamide at 55° C., followed by a high-stringency wash consisting of 0.1×SSC containing EDTA at 55° C.
- Moderately stringent conditions may be identified as described by Sambrook et al., Molecular Cloning: A Laboratory Manual, New York: Cold Spring Harbor Press, 1989 (the contents of which are incorporated by reference herein in their entirety), and include the use of washing solution and hybridization conditions (e.g., temperature, ionic strength and % SDS) less stringent that those described above. An example of moderately stringent conditions is overnight incubation at 37° C. in a solution including: 20% formamide, 5×SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5×Denhardt's solution, 10% dextran sulfate, and 20 mg/ml denatured sheared salmon sperm DNA, followed by washing the filters in 1×SSC at about 37° C. to 50° C. The skilled artisan will recognize how to adjust the temperature, ionic strength, etc. as necessary to accommodate factors such as sequence length and the like.
- In other embodiments, primers of the invention may be prepared and hybridized under low stringency conditions. For example, a primer of the invention may contain a relatively large number of abasic regions or mismatches, and/or it may be desirable to have the primer anneal with a diverse set of target sequences, e.g., when the target sequence is not known. In general the conditions for hybridization can be adjusted to obtain specificity suitable for the application.
- Nucleic acids may be obtained by methods known in the art. Generally, nucleic acids can be extracted from a biological sample by a variety of techniques such as those described by Maniatis, et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y., pp. 280-281, (1982), the contents of which is incorporated by reference herein in its entirety. The isolated nucleic acid molecules may be single-stranded, double-stranded, or double-stranded with single-stranded regions (for example, stem- and loop-structures). The isolated nucleic acid can be deoxyribonucleic acid (DNA) and/or ribonucleic acid (RNA). In a particular embodiment, genomic DNA is isolated from the biological sample.
- It may be necessary to first prepare an extract of the cell and then perform further steps—i.e., differential precipitation, column chromatography, extraction with organic solvents and the like—in order to obtain a sufficiently pure preparation of nucleic acid. Extracts may be prepared using standard techniques in the art, for example, by chemical or mechanical lysis of the cell. Extracts then may be further treated, for example, by filtration and/or centrifugation and/or with chaotropic salts such as guanidinium isothiocyanate or urea or with organic solvents such as phenol and/or HCC13 to denature any contaminating and potentially interfering proteins.
- In a particular, primers of the invention may be used for screening for the presence or absence of methylation of a nucleic acid sequence, such as de-methylation, methylation, hypomethylation and hypermethylation. Any one or combination of methods may be used for detecting methylation as well as the different types of genetic markers from the patient's isolated nucleic acid. Suitable methods include real-time or quantitative PCR, digital PCR, PCR in flowing or stationary droplets, well plates, slugs or fluid flowing segments, and the like, in capillary tubes, microfluidic chips, or standard thermocycler based PCR methods known to those having ordinary skill in the art. Additional detection methods can utilize binding to microarrays for subsequent fluorescent or non-fluorescent detection, barcode mass detection using a mass spectrometric methods, detection of emitted radiowaves, detection of scattered light from aligned barcodes, fluorescence detection using quantitative PCR or digital PCR methods.
- Still other techniques include, for example, Northern blot, selective hybridization, cleaved amplified polymorphic sequence analysis, short tandem repeat analysis, the use of supports coated with oligonucleotide probes, amplification of the nucleic acid by RT-PCR, quantitative PCR or ligation-PCR, etc. These methods can include the use of a nucleic acid probe (for example, an oligonucleotide) that can selectively or specifically detect the target nucleic acid in the sample to detect changes at the level of a single nucleotide polymorphism, whole DNA-fingerprint analysis, allele specific analysis. Amplification is accomplished according to various methods known to the person skilled in the art, such as PCR, LCR, transcription-mediated amplification (TMA), strand-displacement amplification (SDA), NASBA, the use of allele-specific oligonucleotides (ASO), allele-specific amplification, Southern blot, single-strand conformational analysis (SSCA), in-situ hybridization (e.g., FISH), migration on a gel, heteroduplex analysis, etc. If necessary, the quantity of nucleic acid detected can be compared to a reference value, for example a median or mean value observed in patients who do not have cancer, or to a value measured in parallel in a non-cancerous sample. Thus, it is possible to demonstrate a variation in the level of expression.
- In some embodiments, amplified templates will be sequenced. Sequencing may be achieved by any method known in the art. DNA sequencing techniques include classic di-deoxy sequencing reactions (Sanger method) using labeled terminators or primers and gel separation in slab or capillary, sequencing by synthesis using reversibly terminated labeled nucleotides, pyrosequencing, 454 sequencing, allele specific hybridization to a library of labeled oligonucleotide probes, sequencing by synthesis using allele specific hybridization to a library of labeled clones that is followed by ligation, real time monitoring of the incorporation of labeled nucleotides during a polymerization step, polony sequencing, and SOLiD sequencing. Sequencing of separated molecules has more recently been demonstrated by sequential or single extension reactions using polymerases or ligases as well as by single or sequential differential hybridizations with libraries of probes.
- In certain embodiments, the target nucleic acid or the amplified nucleic acid or both are detected using sequencing. Sequencing-by-synthesis is a common technique used in next generation procedures and works well with the instant invention. However, other sequencing methods can be used, including sequence-by-ligation, sequencing-by-hybridization, gel-based techniques and others. In general, sequencing involves hybridizing a primer to a template to form a template/primer duplex, contacting the duplex with a polymerase in the presence of a detectably-labeled nucleotides under conditions that permit the polymerase to add nucleotides to the primer in a template-dependent manner. Signal from the detectable label is then used to identify the incorporated base and the steps are sequentially repeated in order to determine the linear order of nucleotides in the template. Exemplary detectable labels include radiolabels, florescent labels, enzymatic labels, etc. In particular embodiments, the detectable label may be an optically detectable label, such as a fluorescent label. Exemplary fluorescent labels include cyanine, rhodamine, fluorescien, coumarin, BODIPY, alexa, or conjugated multi-dyes. Numerous techniques are known for detecting sequences and some are exemplified below. However, the exact means for detecting and compiling sequence data does not affect the function of the invention described herein.
- A sequencing technique that can be used in the methods of the provided invention includes, for example, Helicos True Single Molecule Sequencing (tSMS) (Harris T. D. et al. (2008) Science 320:106-109). In the tSMS technique, a DNA sample is cleaved into strands of approximately 100 to 200 nucleotides, and a polyA sequence is added to the 3′ end of each DNA strand. Each strand is labeled by the addition of a fluorescently labeled adenosine nucleotide. The DNA strands are then hybridized to a flow cell, which contains millions of oligo-T capture sites that are immobilized to the flow cell surface. The templates can be at a density of about 100 million templates/cm2. The flow cell is then loaded into an instrument, e.g., HeliScope™ sequencer, and a laser illuminates the surface of the flow cell, revealing the position of each template. A CCD camera can map the position of the templates on the flow cell surface. The template fluorescent label is then cleaved and washed away. The sequencing reaction begins by introducing a DNA polymerase and a fluorescently labeled nucleotide. The oligo-T nucleic acid serves as a primer. The polymerase incorporates the labeled nucleotides to the primer in a template directed manner. The polymerase and unincorporated nucleotides are removed. The templates that have directed incorporation of the fluorescently labeled nucleotide are detected by imaging the flow cell surface. After imaging, a cleavage step removes the fluorescent label, and the process is repeated with other fluorescently labeled nucleotides until the desired read length is achieved. Sequence information is collected with each nucleotide addition step. Further description of tSMS is shown for example in Lapidus et al. (U.S. Pat. No. 7,169,560), Lapidus et al. (U.S. patent application number 2009/0191565), Quake et al. (U.S. Pat. No. 6,818,395), Harris (U.S. Pat. No. 7,282,337), Quake et al. (U.S. patent application number 2002/0164629), and Braslaysky, et al., PNAS (USA), 100: 3960-3964 (2003), the contents of each of these references is incorporated by reference herein in its entirety.
- Another example of a DNA sequencing technique that can be used in the methods of the provided invention is 454 sequencing (Roche) (Margulies, M et al. 2005, Nature, 437, 376-380). 454 sequencing involves two steps. In the first step, DNA is sheared into fragments of approximately 300-800 base pairs, and the fragments are blunt ended. Oligonucleotide adaptors are then ligated to the ends of the fragments. The adaptors serve as primers for amplification and sequencing of the fragments. The fragments can be attached to DNA capture beads, e.g., streptavidin-coated beads using, e.g., Adaptor B, which contains 5′-biotin tag. The fragments attached to the beads are PCR amplified within droplets of an oil-water emulsion. The result is multiple copies of clonally amplified DNA fragments on each bead. In the second step, the beads are captured in wells (pico-liter sized). Pyrosequencing is performed on each DNA fragment in parallel. Addition of one or more nucleotides generates a light signal that is recorded by a CCD camera in a sequencing instrument. The signal strength is proportional to the number of nucleotides incorporated. Pyrosequencing makes use of pyrophosphate (PPi) which is released upon nucleotide addition. PPi is converted to ATP by ATP sulfurylase in the presence of
adenosine 5′ phosphosulfate. Luciferase uses ATP to convert luciferin to oxyluciferin, and this reaction generates light that is detected and analyzed. - Another example of a DNA sequencing technique that can be used in the methods of the provided invention is SOLiD technology (Applied Biosystems). In SOLiD sequencing, genomic DNA is sheared into fragments, and adaptors are attached to the 5′ and 3′ ends of the fragments to generate a fragment library. Alternatively, internal adaptors can be introduced by ligating adaptors to the 5′ and 3′ ends of the fragments, circularizing the fragments, digesting the circularized fragment to generate an internal adaptor, and attaching adaptors to the 5′ and 3′ ends of the resulting fragments to generate a mate-paired library. Next, clonal bead populations are prepared in microreactors containing beads, primers, template, and PCR components. Following PCR, the templates are denatured and beads are enriched to separate the beads with extended templates. Templates on the selected beads are subjected to a 3′ modification that permits bonding to a glass slide. The sequence can be determined by sequential hybridization and ligation of partially random oligonucleotides with a central determined base (or pair of bases) that is identified by a specific fluorophore. After a color is recorded, the ligated oligonucleotide is cleaved and removed and the process is then repeated.
- Another example of a DNA sequencing technique that can be used in the methods of the provided invention is Ion Torrent sequencing (U.S. patent application numbers 2009/0026082, 2009/0127589, 2010/0035252, 2010/0137143, 2010/0188073, 2010/0197507, 2010/0282617, 2010/0300559), 2010/0300895, 2010/0301398, and 2010/0304982), the content of each of which is incorporated by reference herein in its entirety. In Ion Torrent sequencing, DNA is sheared into fragments of approximately 300-800 base pairs, and the fragments are blunt ended. Oligonucleotide adaptors are then ligated to the ends of the fragments. The adaptors serve as primers for amplification and sequencing of the fragments. The fragments can be attached to a surface and is attached at a resolution such that the fragments are individually resolvable. Addition of one or more nucleotides releases a proton (H+), which signal detected and recorded in a sequencing instrument. The signal strength is proportional to the number of nucleotides incorporated.
- Another example of a sequencing technology that can be used in the methods of the provided invention is Illumina sequencing. Illumina sequencing is based on the amplification of DNA on a solid surface using fold-back PCR and anchored primers. Genomic DNA is fragmented, and adapters are added to the 5′ and 3′ ends of the fragments. DNA fragments that are attached to the surface of flow cell channels are extended and bridge amplified. The fragments become double stranded, and the double stranded molecules are denatured. Multiple cycles of the solid-phase amplification followed by denaturation can create several million clusters of approximately 1,000 copies of single-stranded DNA molecules of the same template in each channel of the flow cell. Primers, DNA polymerase and four fluorophore-labeled, reversibly terminating nucleotides are used to perform sequential sequencing. After nucleotide incorporation, a laser is used to excite the fluorophores, and an image is captured and the identity of the first base is recorded. The 3′ terminators and fluorophores from each incorporated base are removed and the incorporation, detection and identification steps are repeated.
- Another example of a sequencing technology that can be used in the methods of the provided invention includes the single molecule, real-time (SMRT) technology of Pacific Biosciences. In SMRT, each of the four DNA bases is attached to one of four different fluorescent dyes. These dyes are phospholinked. A single DNA polymerase is immobilized with a single molecule of template single stranded DNA at the bottom of a zero-mode waveguide (ZMW). A ZMW is a confinement structure which enables observation of incorporation of a single nucleotide by DNA polymerase against the background of fluorescent nucleotides that rapidly diffuse in an out of the ZMW (in microseconds). It takes several milliseconds to incorporate a nucleotide into a growing strand. During this time, the fluorescent label is excited and produces a fluorescent signal, and the fluorescent tag is cleaved off. Detection of the corresponding fluorescence of the dye indicates which base was incorporated. The process is repeated.
- Another example of a sequencing technique that can be used in the methods of the provided invention is nanopore sequencing (Soni G V and Meller A. (2007) Clin Chem 53: 1996-2001). A nanopore is a small hole, of the order of 1 nanometer in diameter. Immersion of a nanopore in a conducting fluid and application of a potential across it results in a slight electrical current due to conduction of ions through the nanopore. The amount of current which flows is sensitive to the size of the nanopore. As a DNA molecule passes through a nanopore, each nucleotide on the DNA molecule obstructs the nanopore to a different degree. Thus, the change in the current passing through the nanopore as the DNA molecule passes through the nanopore represents a reading of the DNA sequence.
- Another example of a sequencing technique that can be used in the methods of the provided invention involves using a chemical-sensitive field effect transistor (chemFET) array to sequence DNA (for example, as described in US Patent Application Publication No. 20090026082). In one example of the technique, DNA molecules can be placed into reaction chambers, and the template molecules can be hybridized to a sequencing primer bound to a polymerase. Incorporation of one or more triphosphates into a new nucleic acid strand at the 3′ end of the sequencing primer can be detected by a change in current by a chemFET. An array can have multiple chemFET sensors. In another example, single nucleic acids can be attached to beads, and the nucleic acids can be amplified on the bead, and the individual beads can be transferred to individual reaction chambers on a chemFET array, with each chamber having a chemFET sensor, and the nucleic acids can be sequenced.
- Another example of a sequencing technique that can be used in the methods of the provided invention involves using a electron microscope (Moudrianakis E. N. and Beer M. Proc Natl Acad Sci USA. 1965 March; 53:564-71). In one example of the technique, individual DNA molecules are labeled using metallic labels that are distinguishable using an electron microscope. These molecules are then stretched on a flat surface and imaged using an electron microscope to measure sequences.
- Sequences can be read that originate from a single molecule or that originate from amplifications from a single molecule Millions of independent amplifications of single molecules can be performed in parallel either on a solid surface or in tiny compartments in water/oil emulsion. The DNA sample to be sequenced can be diluted and/or dispersed sufficiently to obtain one molecule in each compartment. This dilution can be followed by DNA amplification to generate copies of the original DNA sequences and creating “clusters” of molecules all having the same sequence. These clusters can then be sequenced. Many millions of reads can be generated in one run. Sequence can be generated starting at the 5′ end of a given strand of an amplified sequence and/or sequence can be generated from starting from the 5′ end of the complementary sequence. In a preferred embodiment, sequence from strands is generated, i.e. paired end reads (see for example, Harris, U.S. Pat. No. 7,767,400).
- Nucleotides useful in the invention include any nucleotide or nucleotide analog, whether naturally-occurring or synthetic. For example, preferred nucleotides include phosphate esters of deoxyadenosine, deoxycytidine, deoxyguanosine, deoxythymidine, adenosine, cytidine, guanosine, and uridine. Other nucleotides useful in the invention include an adenine, cytosine, guanine, thymine base, a xanthine or hypoxanthine; 5-bromouracil, 2-aminopurine, deoxyinosine, or methylated cytosine, such as 5-methylcytosine, and N4-methoxydeoxycytosine. Also included are bases of polynucleotide mimetics, such as methylated nucleic acids, e.g., 2′-O-methRNA, peptide nucleic acids, modified peptide nucleic acids, locked nucleic acids and any other structural moiety that can act substantially like a nucleotide or base, for example, by exhibiting base-complementarity with one or more bases that occur in DNA or RNA and/or being capable of base-complementary incorporation, and includes chain-terminating analogs. A nucleotide corresponds to a specific nucleotide species if they share base-complementarity with respect to at least one base.
- Nucleotides for nucleic acid sequencing according to the invention preferably include a detectable label that is directly or indirectly detectable. Preferred labels include optically-detectable labels, such as fluorescent labels. Examples of fluorescent labels include, but are not limited to, 4-acetamido-4′-isothiocyanatostilbene-2,2′ disulfonic acid; acridine and derivatives: acridine, acridine isothiocyanate; 5-(2′-aminoethyl)aminonaphthalene-1-sulfonic acid (EDANS); 4-amino-N-[3-vinylsulfonyl)phenyl]naphthalimide-3,5 disulfonate; N-(4-anilino-1-naphthyl)maleimide; anthranilamide; BODIPY; Brilliant Yellow; coumarin and derivatives; coumarin, 7-amino-4-methylcoumarin (AMC, Coumarin 120), 7-amino-4-trifluoromethylcouluarin (Coumaran 151); cyanine dyes; cyanosine; 4′,6-diaminidino-2-phenylindole (DAPI); 5′5″-dibromopyrogallol-sulfonaphthalein (Bromopyrogallol Red); 7-diethylamino-3-(4′-isothiocyanatophenyl)-4-methylcoumarin; diethylenetriamine pentaacetate; 4,4′-diisothiocyanatodihydro-stilbene-2,2′-disulfonic acid; 4,4′-diisothiocyanatostilbene-2,2′-disulfonic acid; 5-[dimethylamino]naphthalene-1-sulfonyl chloride (DNS, dansylchloride); 4-dimethylaminophenylazophenyl-4′-isothiocyanate (DABITC); eosin and derivatives; eosin, eosin isothiocyanate, erythrosin and derivatives; erythrosin B, erythrosin, isothiocyanate; ethidium; fluorescein and derivatives; 5-carboxyfluorescein (FAM), 5-(4,6-dichlorotriazin-2-yl)aminofluorescein (DTAF), 2′,7′-dimethoxy-4′5′-dichloro-6-carboxyfluorescein, fluorescein, fluorescein isothiocyanate, QFITC, (XRITC); fluorescamine; IR144; IR1446; Malachite Green isothiocyanate; 4-methylumbelliferoneortho cresolphthalein; nitrotyrosine; pararosaniline; Phenol Red; B-phycoerythrin; o-phthaldialdehyde; pyrene and derivatives: pyrene, pyrene butyrate, succinimidyl 1-pyrene; butyrate quantum dots; Reactive Red 4 (Cibacron™ Brilliant Red 3B-A) rhodamine and derivatives: 6-carboxy-X-rhodamine (ROX), 6-carboxyrhodamine (R6G), lissamine rhodamine B sulfonyl chloride rhodamine (Rhod), rhodamine B, rhodamine 123, rhodamine X isothiocyanate, sulforhodamine B, sulforhodamine 101, sulfonyl chloride derivative of sulforhodamine 101 (Texas Red); N,N,N′,N′ tetramethyl-6-carboxyrhodamine (TAMRA); tetramethyl rhodamine; tetramethyl rhodamine isothiocyanate (TRITC); riboflavin; rosolic acid; terbium chelate derivatives; Cy3; Cy5; Cy5.5; Cy7; IRD 700; IRD 800; La Jolta Blue; phthalo cyanine; and naphthalo cyanine. Preferred fluorescent labels are cyanine-3 and cyanine-5. Labels other than fluorescent labels are contemplated by the invention, including other optically-detectable labels.
- Nucleic acid polymerases generally useful in the invention include DNA polymerases, RNA polymerases, reverse transcriptases, and mutant or altered forms of any of the foregoing. DNA polymerases and their properties are described in detail in, among other places, DNA Replication 2nd edition, Kornberg and Baker, W. H. Freeman, New York, N.Y. (1991). Known conventional DNA polymerases useful in the invention include, but are not limited to, Pyrococcus furiosus (Pfu) DNA polymerase (Lundberg et al., 1991, Gene, 108: 1, Stratagene), Pyrococcus woesei (Pwo) DNA polymerase (Hinnisdaels et al., 1996, Biotechniques, 20:186-8, Boehringer Mannheim), Thermus thermophilus (Tth) DNA polymerase (Myers and Gelfand 1991, Biochemistry 30:7661), Bacillus stearothermophilus DNA polymerase (Stenesh and McGowan, 1977, Biochim Biophys Acta 475:32), Thermococcus litoralis (Tli) DNA polymerase (also referred to as Vent™ DNA polymerase, Cariello et al., 1991, Polynucleotides Res, 19: 4193, New England Biolabs), 9.degree.Nm™ DNA polymerase (New England Biolabs), Stoffel fragment, ThermoSequenase® (Amersham Pharmacia Biotech UK), Therminator™ (New England Biolabs), Thermotoga maritima (Tma) DNA polymerase (Diaz and Sabino, 1998 Braz J. Med. Res, 31:1239), Thermus aquaticus (Taq) DNA polymerase (Chien et al., 1976, J. Bacteoriol, 127: 1550), DNA polymerase, Pyrococcus kodakaraensis KOD DNA polymerase (Takagi et al., 1997, Appl. Environ. Microbiol. 63:4504), JDF-3 DNA polymerase (from thermococcus sp. JDF-3, Patent application WO 0132887), Pyrococcus GB-D (PGB-D) DNA polymerase (also referred as Deep Vent™ DNA polymerase, Juncosa-Ginesta et al., 1994, Biotechniques, 16:820, New England Biolabs), UlTma DNA polymerase (from thermophile Thermotoga maritima; Diaz and Sabino, 1998 Braz J. Med. Res, 31:1239; PE Applied Biosystems), Tgo DNA polymerase (from thermococcus gorgonarius, Roche Molecular Biochemicals), E. coli DNA polymerase I (Lecomte and Doubleday, 1983, Polynucleotides Res. 11:7505), T7 DNA polymerase (Nordstrom et al., 1981, J. Biol. Chem. 256:3112), and archaeal DP1I/DP2 DNA polymerase II (Cann et al, 1998, Proc. Natl. Acad. Sci. USA 95:14250).
- Both mesophilic polymerases and thermophilic polymerases are contemplated. Thermophilic DNA polymerases include, but are not limited to, ThermoSequenase®, 9.degree.Nm™, Therminator™, Taq, Tne, Tma, Pfu, Tfl, Tth, Tli, Stoffel fragment, Vent™ and Deep Vent™ DNA polymerase, KOD DNA polymerase, Tgo, JDF-3, and mutants, variants and derivatives thereof. A highly-preferred form of any polymerase is a 3′ exonuclease-deficient mutant.
- Reverse transcriptases useful in the invention include, but are not limited to, reverse transcriptases from HIV, HTLV-1, HTLV-II, FeLV, FIV, SIV, AMV, MMTV, MoMuLV and other retroviruses (see Levin, Cell 88:5-8 (1997); Verma, Biochim Biophys Acta. 473:1-38 (1977); Wu et al., CRC Crit. Rev Biochem. 3:289-347 (1975)).
- In a preferred embodiment, nucleic acid template molecules are attached to a substrate (also referred to herein as a surface) and subjected to analysis by single molecule sequencing as described herein. Nucleic acid template molecules are attached to the surface such that the template/primer duplexes are individually optically resolvable. Substrates for use in the invention can be two- or three-dimensional and can include a planar surface (e.g., a glass slide) or can be shaped. A substrate can include glass (e.g., controlled pore glass (CPG)), quartz, plastic (such as polystyrene (low cross-linked and high cross-linked polystyrene), polycarbonate, polypropylene and poly(methymethacrylate)), acrylic copolymer, polyamide, silicon, metal (e.g., alkanethiolate-derivatized gold), cellulose, nylon, latex, dextran, gel matrix (e.g., silica gel), polyacrolein, or composites.
- Suitable three-dimensional substrates include, for example, spheres, microparticles, beads, membranes, slides, plates, micromachined chips, tubes (e.g., capillary tubes), microwells, microfluidic devices, channels, filters, or any other structure suitable for anchoring a nucleic acid. Substrates can include planar arrays or matrices capable of having regions that include populations of template nucleic acids or primers. Examples include nucleoside-derivatized CPG and polystyrene slides; derivatized magnetic slides; polystyrene grafted with polyethylene glycol, and the like.
- Substrates are preferably coated to allow optimum optical processing and nucleic acid attachment. Substrates for use in the invention can also be treated to reduce background. Exemplary coatings include epoxides, and derivatized epoxides (e.g., with a binding molecule, such as an oligonucleotide or streptavidin).
- Various methods can be used to anchor or immobilize the nucleic acid molecule to the surface of the substrate. The immobilization can be achieved through direct or indirect bonding to the surface. The bonding can be by covalent linkage. See, Joos et al., Analytical Biochemistry 247:96-101, 1997; Oroskar et al., Clin. Chem. 42:1547-1555, 1996; and Khandjian, Mol. Bio. Rep. 11:107-115, 1986. A preferred attachment is direct amine bonding of a terminal nucleotide of the template or the 5′ end of the primer to an epoxide integrated on the surface. The bonding also can be through non-covalent linkage. For example, biotin-streptavidin (Taylor et al., J. Phys. D. Appl. Phys. 24:1443, 1991) and digoxigenin with anti-digoxigenin (Smith et al., Science 253:1122, 1992) are common tools for anchoring nucleic acids to surfaces and parallels. Alternatively, the attachment can be achieved by anchoring a hydrophobic chain into a lipid monolayer or bilayer. Other methods for known in the art for attaching nucleic acid molecules to substrates also can be used.
- Any detection method can be used that is suitable for the type of label employed. Thus, exemplary detection methods include radioactive detection, optical absorbance detection, e.g., UV-visible absorbance detection, optical emission detection, e.g., fluorescence or chemiluminescence. For example, extended primers can be detected on a substrate by scanning all or portions of each substrate simultaneously or serially, depending on the scanning method used. For fluorescence labeling, selected regions on a substrate may be serially scanned one-by-one or row-by-row using a fluorescence microscope apparatus, such as described in Fodor (U.S. Pat. No. 5,445,934) and Mathies et al. (U.S. Pat. No. 5,091,652). Devices capable of sensing fluorescence from a single molecule include scanning tunneling microscope (siM) and the atomic force microscope (AFM). Hybridization patterns may also be scanned using a CCD camera (e.g., Model TE/CCD512SF, Princeton Instruments, Trenton, N.J.) with suitable optics (Ploem, in Fluorescent and Luminescent Probes for Biological Activity Mason, T. G. Ed., Academic Press, Landon, pp. 1-11 (1993), such as described in Yershov et al., Proc. Natl. Acad. Sci. 93:4913 (1996), or may be imaged by TV monitoring. For radioactive signals, a phosphorimager device can be used (Johnston et al., Electrophoresis, 13:566, 1990; Drmanac et al., Electrophoresis, 13:566, 1992; 1993). Other commercial suppliers of imaging instruments include General Scanning Inc., (Watertown, Mass. on the World Wide Web at genscan.com), Genix Technologies (Waterloo, Ontario, Canada; on the World Wide Web at confocal.com), and Applied Precision Inc. Such detection methods are particularly useful to achieve simultaneous scanning of multiple attached template nucleic acids.
- A number of approaches can be used to detect incorporation of fluorescently-labeled nucleotides into a single nucleic acid molecule. Optical setups include near-field scanning microscopy, far-field confocal microscopy, wide-field epi-illumination, light scattering, dark field microscopy, photoconversion, single and/or multiphoton excitation, spectral wavelength discrimination, fluorophor identification, evanescent wave illumination, and total internal reflection fluorescence (TIRF) microscopy. In general, certain methods involve detection of laser-activated fluorescence using a microscope equipped with a camera. Suitable photon detection systems include, but are not limited to, photodiodes and intensified CCD cameras. For example, an intensified charge couple device (ICCD) camera can be used. The use of an ICCD camera to image individual fluorescent dye molecules in a fluid near a surface provides numerous advantages. For example, with an ICCD optical setup, it is possible to acquire a sequence of images (movies) of fluorophores.
- Some embodiments of the present invention use TIRF microscopy for imaging. TIRF microscopy uses totally internally reflected excitation light and is well known in the art. See, e.g., the World Wide Web at nikon-instruments.jp/eng/page/products/tirf.aspx. In certain embodiments, detection is carried out using evanescent wave illumination and total internal reflection fluorescence microscopy. An evanescent light field can be set up at the surface, for example, to image fluorescently-labeled nucleic acid molecules. When a laser beam is totally reflected at the interface between a liquid and a solid substrate (e.g., a glass), the excitation light beam penetrates only a short distance into the liquid. The optical field does not end abruptly at the reflective interface, but its intensity falls off exponentially with distance. This surface electromagnetic field, called the “evanescent wave”, can selectively excite fluorescent molecules in the liquid near the interface. The thin evanescent optical field at the interface provides low background and facilitates the detection of single molecules with high signal-to-noise ratio at visible wavelengths.
- The evanescent field also can image fluorescently-labeled nucleotides upon their incorporation into the attached template/primer complex in the presence of a polymerase. Total internal reflectance fluorescence microscopy is then used to visualize the attached template/primer duplex and/or the incorporated nucleotides with single molecule resolution.
- Some embodiments of the invention use non-optical detection methods such as, for example, detection using nanopores (e.g., protein or solid state) through which molecules are individually passed so as to allow identification of the molecules by noting characteristics or changes in various properties or effects such as capacitance or blockage current flow (see, for example, Stoddart et al, Proc. Nat. Acad. Sci., 106:7702, 2009; Purnell and Schmidt, ACS Nano, 3:2533, 2009; Branton et al, Nature Biotechnology, 26:1146, 2008; Polonsky et al, U.S. Application 2008/0187915; Mitchell & Howorka, Angew. Chem. Int. Ed. 47:5565, 2008; Borsenberger et al, J. Am. Chem. Soc., 131, 7530, 2009); or other suitable non-optical detection methods.
- Alignment and/or compilation of sequence results obtained from the image stacks produced as generally described above utilizes look-up tables that take into account possible sequences changes (due, e.g., to errors, mutations, etc.). Essentially, sequencing results obtained as described herein are compared to a look-up type table that contains all possible reference sequences plus 1 or 2 base errors.
- In some embodiments, a plurality of nucleic acid molecules being sequenced are bound to a solid support. To immobilize the nucleic acid on a solid support, a capture sequence/universal priming site can be added at the 3′ and/or 5′ end of the template. The nucleic acids may be bound to the solid support by hybridizing the capture sequence to a complementary sequence covalently attached to the solid support. The capture sequence (also referred to as a universal capture sequence) is a nucleic acid sequence complimentary to a sequence attached to a solid support that may dually serve as a universal primer. In some embodiments, the capture sequence is polyNn, where N is U, A, T, G, or C, e.g., 20-70, 40-60, e.g., about 50. For example, the capture sequence could be polyT40-50 or its complement. As an alternative to a capture sequence, a member of a coupling pair (such as, e.g., antibody/antigen, receptor/ligand, or the avidin-biotin pair as described in, e.g., U.S. Patent Application No. 2006/0252077) may be linked to each fragment to be captured on a surface coated with a respective second member of that coupling pair.
- Numerous examples of hypermethylated genes that have been linked to various types of cancer have been identified. Examples of hypermethylated genes that have been linked with susceptibility to or incidence of colorectal cancer include, for example, FOXE1, SOX17, SYNE1, BOLL, CABYR, EFEMP1, FBLN2, FOXL2, GNB4, GSTM3, HoxD1, Jph3, Neuralized (NEURL), PPP1R14a, TP53AP1, RAB32, APC2, GPNMB, MMP2, EVL, STARD8, PTPRD, CD109, LGR6, RET, CHD5, RNF182, ICAM5, ARMCX2, CBR1, DDX43, DMRTB1, FBLN2, H1ST2H2AA, ICAM1, LY6K, NEF3, POMC, STK31, SYCP3, TCL1A, TFPI-2, TLR2, UCHL1, ZFP42, ASCL2, ATP8A2, CTAG2, EPHA4, FANCF, FOXQ1, HUS1B, JAM3, LEF1, MOV10L1, NPPB, PWWP1, RASSF5, REC8L1, SALL4, BEX1, BNIP3, CCK, CDX1, CNN3, CXX1, IRX4, MC5R, RSNL2, SMARCA3, SPON1, SYT6, TRPC3, TSPYL6, ZNF345, DKK3, ZNF655, B4GALT1, C10orf119, C10orf13, CBR1, COPS4, COVA1, CSRP1, DARS, DNAJC10, FKBP14, FN3KRP, GANAB, HUS1, KLF11, MRPL4, MYLK, NELF, NETO2, PAPSS2, RBMS2, RHOB, SECTM1, SIRT2, SIRT7, SLC35D1, SLC9A3R1, TTRAP, TUBG2, F1120277, MYBL2, GPR116, QSMR, PC4, SLC39A4, UBE3A, PDLIM3, UBE21, or any combination thereof.
- Examples of hypermethylated genes that have been linked with susceptibility to or incidence of prostate cancer include, for example, GSTP1, APC, PTGS2, T1G1, EDNRB, RASS1a, GSTP1, APC, PTGS2, T1G1, EDNRB, RASS1a, CD3D, APOC1, NBL1, ING4, LEF1, CENTD3, MGC15396, FKBP4, PLTP, TFAP2A, ATXN1, BMP2, ENPEP, MCAM, SSBP2, PDLIM3, NDP, or any combination thereof.
- Examples of hypermethylated genes that have been linked with susceptibility to or incidence of breast cancer include, for example, PITX2, PITX2, BACH1, CKMT, GALE, HMG20B, KRT14, OGDHL, PON2, SESN1, KIF1A (kinesin family member 1A) PDLIM3, MAL (T cell proliferation protein), or any combination thereof.
- Examples of hypermethylated genes that have been linked with susceptibility to or incidence of lung cancer include, for example, p161NK4a, APC, TMS1, RASSF1, DAPK, PRSS3 (serine protease family member-trypsinogen IV—a putative tumor suppressor gene), human DAB2 interactive protein gene, apoptosis-associated speck-lick protein containing a CARD, p16, FHIT, H-cadherin, RARβ, RARB2, PHKA2, CBR3, CAMK4, HOXB5, ZNF198, RGS4, RBM15B, PDLIM3, PAK3, PIGH, TUBB4, NISCH or any combination thereof.
- Examples of hypermethylated markers that have been shown to be associated with susceptibility to or incidence of gastrointestinal cancer include, without limitation, NDRG4/NDRG2 subfamily gene, GATA4, OSMR, GATA5, SFRP1, ADAM23, JPH3, SFRP2, APC, MGMT, TFPI2, BNIP3, FOXE1, SYNE1, SOX17, PHACTR3, JAM3, or any combination thereof.
- Examples of hypermethylated genes that have been linked with susceptibility to or incidence of cervical cancer include, for example, ESR1, DAP-kinase, APC, TIMP-3, RAR-beta, CALCA, TSLC1, TIMP-2, DcR1, DcR2, BRCA1, p15, RassflA, MLH1, MGMT, PDCD4, TFPI2, ARMC7, TRM-HUMAN, OGDHL, PTGS2, CDK6, GPR39, HMGN2, C130RF18, ASMTL, DLL4, NP-659450.1, NP-078820.1, CLU, HPCA, PLCG2, RALY, GNB4, CCNA1, NPTX1, C90RF19, or any combination thereof.
- Nucleic acid sequences and the corresponding amino acid sequences for each of the genes referenced herein are well known and have been published in sequence databases that are freely accessible to the general public such as the GenBank, the National Institutes of Health genetic sequence database, and Protein, which includes a collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB. Thus, such sequences are readily available to the skilled artisan for use in designing chimeric primers and chimeric probes having sequences specific for the DNA encoding such genes for use in the methods of the invention.
- Two primers, Forward A4 (FORA4) (SEQ ID NO. 25: 3′-TTT*GAT*GTTTTTTGGGTTG) and Reverse SM REV1 (SEQ ID NO. 26: 3′-GAGGGAAGGGGGAGGG) were prepared as shown in
FIG. 5 . The primers were complimentary to sequences adjacent to regions of TWIST that are known to have potential methylation sites. As shown inFIG. 5 , there were 14 CpG islands between the primer sites. Some or all of the CpG islands could be methylated in a sample, and the methylation pattern may be indicative of a disease, e.g., cancer. - Forward primer FORA4 contains two abasic regions, allowing it to specifically hybridize with methylated or unmethylated template DNA. The corresponding mismatch primer, wherein “*” is replaced with “N” could be prepared using the methods described herein, but is not shown. Reverse primer SM REV1 did not contain any abasic (or mismatched) sites.
- Using the disclosed primers, amplicons were produced and amplified using qPCR.
FIG. 6 shows the amplification efficiency of the primers as a function of concentration. As shown inFIG. 6 , using as little as 300 nM primers, it is possible to reliably detect the amplicons using only 40 cycles of PCR, and regardless of the methylation pattern of the original template. Analysis of the amplicons, e.g., via deep sequencing, will give information about the methylation of this region of TWIST. - Two primers, Forward A3 (FORA3) (SEQ ID NO. 27: 3′-TTTTTTTATTGGGGTTGTTT) and Reverse SM REV3 (SEQ ID NO. 28: 3′-GTTGTGGTTTTAATAGGTTG) were prepared as shown in
FIG. 7 . The primers were complimentary to sequences adjacent to regions of NID that are known to have potential methylation sites. As shown inFIG. 7 , there were 13 CpG islands between the primer sites. Some or all of the CpG islands could be methylated in a sample, and the methylation pattern may be indicative of a disease, e.g., cancer. - Forward primer FORA3 contains one mismatch region (shown as lower-case or underlined “G”), allowing it to specifically hybridize with methylated or unmethylated template DNA. The corresponding abasic primer, wherein “G” is replaced with “*” could be prepared using the methods described herein, but is not shown. Reverse primer SM REV3 did not contain any mismatched (or abasic) sites.
- Using the disclosed primers, amplicons were produced and amplified using qPCR.
FIG. 8 shows the amplification efficiency of the primers as a function of concentration. As shown inFIG. 8 , using as little as 300 nM primers, it is possible to reliably detect the amplicons using only 38 cycles of PCR, and regardless of the methylation pattern of the original template. Analysis of the amplicons, e.g., via deep sequencing, will give information about the methylation of this region of NID. - References and citations to other documents, such as patents, patent applications, patent publications, journals, books, web contents, have been made throughout this disclosure. All documents are hereby incorporated herein by reference in their entirety for all purposes.
- The invention may be embodies in other specific forms without departing from the spirit or essential characteristics thereof. The foregoing embodiments are therefore to be considered in all respects illustrative rather than limiting on the invention described herein. Scope of the invention is thus indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced from within.
Claims (20)
1. A primer, comprising an abasic region, that can hybridize to either a methylated CpG site or a corresponding unmethylated CpG site of a template nucleic acid because the abasic region can interact with both the methylated CpG site and the corresponding unmethylated CpG site of the template nucleic acid.
2. The primer according to claim 1 , wherein the primer hybridizes to a sequence known to be an epigenetic marker.
3. The primer according to claim 1 , wherein the primer hybridizes to a sequence coding for a TWIST-related protein or a NID-related protein.
4. The primer according to claim 1 , wherein the abasic region spans at least one base of the template.
5. The primer according to claim 1 , wherein the abasic region is covalently linked to a guanine moiety.
6. The primer according to claim 1 , wherein the abasic region comprises a moiety selected from the group consisting of:
O-dimethoxytrityl-1′,2′-dideoxyribose-3′-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite;
O-dimethoxytrityl-1′-methoxy-2′-dideoxyribose-3′-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite;
[4-(4,4′-Dimethoxytrityloxy)butyramidomethyl)-1-(2-nitrophenyl)-ethyl]-2-cyanoethyl-(N,N-diisopropyl)-phosphoramidite;
O-Dimethoxytrityl-1′-Deoxyribose-2′-O-Triisopropylsilyloxymethyl-3′-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite;
(4,4′-Dimethoxytrityloxy)-dodecyl-1-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite);
O-Dimethoxytritylhexaethyleneglycol, 1-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite;
O-Dimethoxytrityl-triethyleneglycol, 1-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite; and
3-(4,4′-Dimethoxytrityloxy)propyl-1-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite.
7. The primer according to claim 1 , wherein the primer is able to specifically hybridize to either the methylated or the unmethylated CpG site of the template nucleic acid under conditions of high stringency.
8. The primer according to claim 1 , wherein the primer further comprises an adaptor sequence.
9. The primer according to claim 8 , wherein the adaptor sequence comprises a homopolymer region.
10. The primer according to claim 1 , additionally comprising a barcode.
11. A primer, comprising at least one mismatched nucleotide that has similar annealing characteristics to both uracil and cytosine, that can hybridize to either a methylated CpG site or a corresponding unmethylated CpG site of a template nucleic acid because the mismatched nucleotide can interact with both the methylated CpG site and the corresponding unmethylated CpG site of the template nucleic acid.
12. The primer according to claim 11 , wherein the primer comprises the nucleotide sequence selected from the group consisting of: SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, and SEQ ID NO: 24.
13. The primer according to claim 11 , wherein the primer hybridizes to a sequence known to be an epigenetic marker.
14. The primer according to claim 11 , wherein the primer hybridizes to a sequence coding for a TWIST-related protein or a NID-related protein.
15. The primer according to claim 11 , wherein the mismatched nucleotide spans at least one base of the template.
16. The primer according to claim 11 , wherein the mismatched nucleotide is covalently linked to a guanine moiety.
17. The primer according to claim 11 , wherein the primer is able to specifically hybridize to either the methylated or the unmethylated CpG site of the template nucleic acid under conditions of high stringency.
18. The primer according to claim 11 , wherein the primer further comprises an adaptor sequence.
19. The primer according to claim 18 , wherein the adaptor sequence comprises a homopolymer region.
20. The primer according to claim 11 , additionally comprising a barcode.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/770,646 US20130310549A1 (en) | 2012-05-15 | 2013-02-19 | Primers for analyzing methylated sequences and methods of use thereof |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/472,209 US8377657B1 (en) | 2012-05-15 | 2012-05-15 | Primers for analyzing methylated sequences and methods of use thereof |
US13/770,646 US20130310549A1 (en) | 2012-05-15 | 2013-02-19 | Primers for analyzing methylated sequences and methods of use thereof |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/472,209 Continuation US8377657B1 (en) | 2012-05-15 | 2012-05-15 | Primers for analyzing methylated sequences and methods of use thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
US20130310549A1 true US20130310549A1 (en) | 2013-11-21 |
Family
ID=47682779
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/472,209 Expired - Fee Related US8377657B1 (en) | 2012-05-15 | 2012-05-15 | Primers for analyzing methylated sequences and methods of use thereof |
US13/770,646 Abandoned US20130310549A1 (en) | 2012-05-15 | 2013-02-19 | Primers for analyzing methylated sequences and methods of use thereof |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/472,209 Expired - Fee Related US8377657B1 (en) | 2012-05-15 | 2012-05-15 | Primers for analyzing methylated sequences and methods of use thereof |
Country Status (1)
Country | Link |
---|---|
US (2) | US8377657B1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018136728A1 (en) * | 2017-01-20 | 2018-07-26 | Children's Hospital Medical Center | Methods and compositions relating to oprm1 dna methylation for personalized pain management |
US10878939B2 (en) | 2014-02-24 | 2020-12-29 | Children's Hospital Medical Center | Methods and compositions for personalized pain management |
US11597978B2 (en) | 2011-11-30 | 2023-03-07 | Children's Hospital Medical Center | Personalized pain management and anesthesia: preemptive risk identification and therapeutic decision support |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5786146A (en) | 1996-06-03 | 1998-07-28 | The Johns Hopkins University School Of Medicine | Method of detection of methylated nucleic acid using agents which modify unmethylated cytosine and distinguishing modified methylated and non-methylated nucleic acids |
AU2003274914A1 (en) | 2002-08-21 | 2004-03-11 | Epoch Biosciences, Inc. | Abasic site endonuclease assay |
EP1608784B1 (en) * | 2003-03-21 | 2009-01-14 | MDS Analytical Technologies (US) Inc. | Global linear non-biased nucleic acid amplification |
KR101110013B1 (en) | 2007-10-05 | 2012-02-29 | (주)바이오니아 | Primers for pcr amplification comprising abasic parts within the primer sequences |
-
2012
- 2012-05-15 US US13/472,209 patent/US8377657B1/en not_active Expired - Fee Related
-
2013
- 2013-02-19 US US13/770,646 patent/US20130310549A1/en not_active Abandoned
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11597978B2 (en) | 2011-11-30 | 2023-03-07 | Children's Hospital Medical Center | Personalized pain management and anesthesia: preemptive risk identification and therapeutic decision support |
US11746377B2 (en) | 2011-11-30 | 2023-09-05 | Children's Hospital Medical Center | Personalized pain management and anesthesia: preemptive risk identification and therapeutic decision support |
US12077821B2 (en) | 2011-11-30 | 2024-09-03 | Children's Hospital Medical Center | Personalized pain management and anesthesia: preemptive risk identification and therapeutic decision support |
US10878939B2 (en) | 2014-02-24 | 2020-12-29 | Children's Hospital Medical Center | Methods and compositions for personalized pain management |
WO2018136728A1 (en) * | 2017-01-20 | 2018-07-26 | Children's Hospital Medical Center | Methods and compositions relating to oprm1 dna methylation for personalized pain management |
US11618924B2 (en) | 2017-01-20 | 2023-04-04 | Children's Hospital Medical Center | Methods and compositions relating to OPRM1 DNA methylation for personalized pain management |
US12146193B2 (en) | 2017-01-20 | 2024-11-19 | Children's Hospital Medical Center | Methods and compositions relating to OPRM1 DNA methylation for personalized pain management |
Also Published As
Publication number | Publication date |
---|---|
US8377657B1 (en) | 2013-02-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20210324468A1 (en) | Compositions and methods for screening mutations in thyroid cancer | |
ES3001090T3 (en) | Methods for analyzing nucleic acid sequence information using GC bias, optionally to detect fetal nucleic acid abnormalities | |
CN109952381B (en) | Method for multiplex detection of methylated DNA | |
WO2018069450A1 (en) | Methylation biomarkers for lung cancer | |
JP2007075114A (en) | Detection of gene methylation | |
US20200277654A1 (en) | Method for Detecting multiple DNA Mutations and Copy Number Variations | |
JP2007125014A (en) | Gene methylation test method | |
TW202417642A (en) | Methylation markers for identifying cancer and the applications | |
EP4074840A1 (en) | Pcr method and pcr kit for increasing allelic discrimination | |
CN102016067A (en) | Detection of GSTP1 hypermethylation in prostate cancer | |
JP2007530026A (en) | Nucleic acid sequencing | |
US8377657B1 (en) | Primers for analyzing methylated sequences and methods of use thereof | |
US20130309667A1 (en) | Primers for analyzing methylated sequences and methods of use thereof | |
JP5258760B2 (en) | Method for amplifying methylated or unmethylated nucleic acid | |
US20130310550A1 (en) | Primers for analyzing methylated sequences and methods of use thereof | |
JPWO2005021743A1 (en) | Nucleic acid amplification primer and colorectal cancer inspection method using the same | |
US11447832B2 (en) | Compositions and methods for oncology precision assays | |
CN118451202A (en) | Tumor detection method and application | |
KR101587635B1 (en) | Method for Detecting Methylation of Thyroid Cancer Specific Methylation Marker Gene for Thyroid Cancer Diagnosis | |
KR20230124636A (en) | Compositions and methods for highly sensitive detection of target sequences in multiplex reactions | |
JP4359497B2 (en) | Nucleic acid amplification primer, nucleic acid amplification primer set, and cancer testing method using the same | |
US20220307077A1 (en) | Conservative concurrent evaluation of dna modifications | |
KR20250011954A (en) | Compositions and methods for oncological assays | |
JP2024531713A (en) | Method for detecting a target polynucleotide in a sample | |
WO2025078657A1 (en) | Amplification-free target enrichment workflow for direct detection of nucleic acid modifications |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: PHYSICIANS CHOICE LABORATORY SERVICES, LLC, SOUTH Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:PREDICTIVE BIOSCIENCES, INC.;REEL/FRAME:032705/0646 Effective date: 20140401 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |