WO2007058992A2 - Mutations and polymorphisms of hdac6 - Google Patents
Mutations and polymorphisms of hdac6 Download PDFInfo
- Publication number
- WO2007058992A2 WO2007058992A2 PCT/US2006/043899 US2006043899W WO2007058992A2 WO 2007058992 A2 WO2007058992 A2 WO 2007058992A2 US 2006043899 W US2006043899 W US 2006043899W WO 2007058992 A2 WO2007058992 A2 WO 2007058992A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- cancer
- hdac6
- aryl
- alkyl
- heteroaryl
- Prior art date
Links
- 230000035772 mutation Effects 0.000 title claims abstract description 159
- 102000054765 polymorphisms of proteins Human genes 0.000 title claims abstract description 29
- 102100022537 Histone deacetylase 6 Human genes 0.000 claims abstract description 196
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 176
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 169
- 229920001184 polypeptide Polymers 0.000 claims abstract description 161
- 238000000034 method Methods 0.000 claims abstract description 156
- 101000899330 Homo sapiens Histone deacetylase 6 Proteins 0.000 claims abstract description 48
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims abstract description 38
- 201000010099 disease Diseases 0.000 claims abstract description 35
- 238000011282 treatment Methods 0.000 claims abstract description 35
- 239000003814 drug Substances 0.000 claims abstract description 34
- 229940079593 drug Drugs 0.000 claims abstract description 31
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 30
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 30
- 239000002157 polynucleotide Substances 0.000 claims abstract description 30
- 101100230661 Homo sapiens HDAC6 gene Proteins 0.000 claims abstract description 7
- WWGBHDIHIVGYLZ-UHFFFAOYSA-N N-[4-[3-[[[7-(hydroxyamino)-7-oxoheptyl]amino]-oxomethyl]-5-isoxazolyl]phenyl]carbamic acid tert-butyl ester Chemical compound C1=CC(NC(=O)OC(C)(C)C)=CC=C1C1=CC(C(=O)NCCCCCCC(=O)NO)=NO1 WWGBHDIHIVGYLZ-UHFFFAOYSA-N 0.000 claims abstract 41
- 125000003118 aryl group Chemical group 0.000 claims description 130
- 206010028980 Neoplasm Diseases 0.000 claims description 100
- 102000054766 genetic haplotypes Human genes 0.000 claims description 75
- 201000011510 cancer Diseases 0.000 claims description 68
- -1 2- phenylethenyl Chemical group 0.000 claims description 53
- 125000001072 heteroaryl group Chemical group 0.000 claims description 48
- 125000000753 cycloalkyl group Chemical group 0.000 claims description 47
- 125000000217 alkyl group Chemical group 0.000 claims description 46
- 125000000592 heterocycloalkyl group Chemical group 0.000 claims description 43
- 125000003710 aryl alkyl group Chemical group 0.000 claims description 42
- 125000004446 heteroarylalkyl group Chemical group 0.000 claims description 41
- 125000001424 substituent group Chemical group 0.000 claims description 39
- 239000003795 chemical substances by application Substances 0.000 claims description 36
- 150000001875 compounds Chemical class 0.000 claims description 31
- 108700028369 Alleles Proteins 0.000 claims description 30
- 208000031261 Acute myeloid leukaemia Diseases 0.000 claims description 26
- 125000001797 benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])* 0.000 claims description 22
- 229910052799 carbon Inorganic materials 0.000 claims description 21
- 125000005344 pyridylmethyl group Chemical group [H]C1=C([H])C([H])=C([H])C(=N1)C([H])([H])* 0.000 claims description 21
- 239000013598 vector Substances 0.000 claims description 21
- 208000010833 Chronic myeloid leukaemia Diseases 0.000 claims description 20
- 208000032791 BCR-ABL1 positive chronic myelogenous leukemia Diseases 0.000 claims description 18
- 208000033761 Myelogenous Chronic BCR-ABL Positive Leukemia Diseases 0.000 claims description 18
- 239000002253 acid Substances 0.000 claims description 18
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 claims description 18
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 claims description 15
- 125000000266 alpha-aminoacyl group Chemical group 0.000 claims description 14
- 206010006187 Breast cancer Diseases 0.000 claims description 13
- 125000002252 acyl group Chemical group 0.000 claims description 13
- 229910052717 sulfur Inorganic materials 0.000 claims description 13
- 208000014829 head and neck neoplasm Diseases 0.000 claims description 12
- 201000001441 melanoma Diseases 0.000 claims description 12
- 208000002154 non-small cell lung carcinoma Diseases 0.000 claims description 12
- 208000029729 tumor suppressor gene on chromosome 11 Diseases 0.000 claims description 12
- 208000026310 Breast neoplasm Diseases 0.000 claims description 11
- 230000002068 genetic effect Effects 0.000 claims description 11
- 229910052757 nitrogen Inorganic materials 0.000 claims description 11
- 125000004122 cyclic group Chemical group 0.000 claims description 10
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 claims description 10
- 239000005517 L01XE01 - Imatinib Substances 0.000 claims description 9
- 239000002246 antineoplastic agent Substances 0.000 claims description 9
- 125000004186 cyclopropylmethyl group Chemical group [H]C([H])(*)C1([H])C([H])([H])C1([H])[H] 0.000 claims description 9
- UIVFUQKYVFCEKJ-OPTOVBNMSA-N gimatecan Chemical compound C1=CC=C2C(\C=N\OC(C)(C)C)=C(CN3C4=CC5=C(C3=O)COC(=O)[C@]5(O)CC)C4=NC2=C1 UIVFUQKYVFCEKJ-OPTOVBNMSA-N 0.000 claims description 9
- 229950009073 gimatecan Drugs 0.000 claims description 9
- 239000003112 inhibitor Substances 0.000 claims description 9
- 229910052760 oxygen Inorganic materials 0.000 claims description 9
- 229960005184 panobinostat Drugs 0.000 claims description 9
- OHRURASPPZQGQM-GCCNXGTGSA-N romidepsin Chemical compound O1C(=O)[C@H](C(C)C)NC(=O)C(=C/C)/NC(=O)[C@H]2CSSCC\C=C\[C@@H]1CC(=O)N[C@H](C(C)C)C(=O)N2 OHRURASPPZQGQM-GCCNXGTGSA-N 0.000 claims description 9
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 8
- HKVAMNSJSFKALM-GKUWKFKPSA-N Everolimus Chemical compound C1C[C@@H](OCCO)[C@H](OC)C[C@@H]1C[C@@H](C)[C@H]1OC(=O)[C@@H]2CCCCN2C(=O)C(=O)[C@](O)(O2)[C@H](C)CC[C@H]2C[C@H](OC)/C(C)=C/C=C/C=C/[C@@H](C)C[C@@H](C)C(=O)[C@H](OC)[C@H](O)/C(C)=C/[C@@H](C)C(=O)C1 HKVAMNSJSFKALM-GKUWKFKPSA-N 0.000 claims description 8
- 150000001721 carbon Chemical group 0.000 claims description 8
- 239000003153 chemical reaction reagent Substances 0.000 claims description 8
- 125000001316 cycloalkyl alkyl group Chemical group 0.000 claims description 8
- 229960005167 everolimus Drugs 0.000 claims description 8
- RTKIYFITIVXBLE-QEQCGCAPSA-N trichostatin A Chemical compound ONC(=O)/C=C/C(/C)=C/[C@@H](C)C(=O)C1=CC=C(N(C)C)C=C1 RTKIYFITIVXBLE-QEQCGCAPSA-N 0.000 claims description 8
- 206010009944 Colon cancer Diseases 0.000 claims description 7
- 208000008839 Kidney Neoplasms Diseases 0.000 claims description 7
- 206010058467 Lung neoplasm malignant Diseases 0.000 claims description 7
- 208000034578 Multiple myelomas Diseases 0.000 claims description 7
- 206010035226 Plasma cell myeloma Diseases 0.000 claims description 7
- 206010038389 Renal cancer Diseases 0.000 claims description 7
- 238000011319 anticancer therapy Methods 0.000 claims description 7
- HESCAJZNRMSMJG-HGYUPSKWSA-N epothilone A Natural products O=C1[C@H](C)[C@H](O)[C@H](C)CCC[C@H]2O[C@H]2C[C@@H](/C(=C\c2nc(C)sc2)/C)OC(=O)C[C@H](O)C1(C)C HESCAJZNRMSMJG-HGYUPSKWSA-N 0.000 claims description 7
- QXRSDHAAWVKZLJ-TYFQHMATSA-N epothilone b Chemical compound C/C([C@@H]1C[C@@H]2O[C@@]2(C)CCC[C@@H]([C@H]([C@H](C)C(=O)C(C)(C)[C@H](O)CC(=O)O1)O)C)=C\C1=CSC(C)=N1 QXRSDHAAWVKZLJ-TYFQHMATSA-N 0.000 claims description 7
- 201000010982 kidney cancer Diseases 0.000 claims description 7
- 201000005202 lung cancer Diseases 0.000 claims description 7
- 208000020816 lung neoplasm Diseases 0.000 claims description 7
- 238000004519 manufacturing process Methods 0.000 claims description 7
- 229950007460 patupilone Drugs 0.000 claims description 7
- 108010091666 romidepsin Proteins 0.000 claims description 7
- 229960003452 romidepsin Drugs 0.000 claims description 7
- JWOGUUIOCYMBPV-GMFLJSBRSA-N (3S,6S,9S,12R)-3-[(2S)-Butan-2-yl]-6-[(1-methoxyindol-3-yl)methyl]-9-(6-oxooctyl)-1,4,7,10-tetrazabicyclo[10.4.0]hexadecane-2,5,8,11-tetrone Chemical compound N1C(=O)[C@H](CCCCCC(=O)CC)NC(=O)[C@H]2CCCCN2C(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CC1=CN(OC)C2=CC=CC=C12 JWOGUUIOCYMBPV-GMFLJSBRSA-N 0.000 claims description 6
- 208000003609 Bile Duct Adenoma Diseases 0.000 claims description 6
- 206010005003 Bladder cancer Diseases 0.000 claims description 6
- 208000003174 Brain Neoplasms Diseases 0.000 claims description 6
- 208000001333 Colorectal Neoplasms Diseases 0.000 claims description 6
- 206010017993 Gastrointestinal neoplasms Diseases 0.000 claims description 6
- 206010025323 Lymphomas Diseases 0.000 claims description 6
- 206010029260 Neuroblastoma Diseases 0.000 claims description 6
- 206010033128 Ovarian cancer Diseases 0.000 claims description 6
- 206010061535 Ovarian neoplasm Diseases 0.000 claims description 6
- 206010061902 Pancreatic neoplasm Diseases 0.000 claims description 6
- 206010060862 Prostate cancer Diseases 0.000 claims description 6
- 208000000236 Prostatic Neoplasms Diseases 0.000 claims description 6
- 208000005718 Stomach Neoplasms Diseases 0.000 claims description 6
- 208000007097 Urinary Bladder Neoplasms Diseases 0.000 claims description 6
- 208000008385 Urogenital Neoplasms Diseases 0.000 claims description 6
- 206010017758 gastric cancer Diseases 0.000 claims description 6
- 208000005017 glioblastoma Diseases 0.000 claims description 6
- 230000012010 growth Effects 0.000 claims description 6
- 125000005885 heterocycloalkylalkyl group Chemical group 0.000 claims description 6
- 206010020718 hyperplasia Diseases 0.000 claims description 6
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 claims description 6
- 208000026037 malignant tumor of neck Diseases 0.000 claims description 6
- BMGQWWVMWDBQGC-IIFHNQTCSA-N midostaurin Chemical compound CN([C@H]1[C@H]([C@]2(C)O[C@@H](N3C4=CC=CC=C4C4=C5C(=O)NCC5=C5C6=CC=CC=C6N2C5=C43)C1)OC)C(=O)C1=CC=CC=C1 BMGQWWVMWDBQGC-IIFHNQTCSA-N 0.000 claims description 6
- 229950010895 midostaurin Drugs 0.000 claims description 6
- 208000008443 pancreatic carcinoma Diseases 0.000 claims description 6
- 201000011549 stomach cancer Diseases 0.000 claims description 6
- 201000005112 urinary bladder cancer Diseases 0.000 claims description 6
- WAEXFXRVDQXREF-UHFFFAOYSA-N vorinostat Chemical compound ONC(=O)CCCCCCC(=O)NC1=CC=CC=C1 WAEXFXRVDQXREF-UHFFFAOYSA-N 0.000 claims description 6
- 229960000237 vorinostat Drugs 0.000 claims description 6
- 125000004169 (C1-C6) alkyl group Chemical group 0.000 claims description 5
- OONFNUWBHFSNBT-HXUWFJFHSA-N AEE788 Chemical compound C1CN(CC)CCN1CC1=CC=C(C=2NC3=NC=NC(N[C@H](C)C=4C=CC=CC=4)=C3C=2)C=C1 OONFNUWBHFSNBT-HXUWFJFHSA-N 0.000 claims description 5
- 229930189037 Trapoxin Natural products 0.000 claims description 5
- NIJJYAXOARWZEE-UHFFFAOYSA-N Valproic acid Chemical compound CCCC(C(O)=O)CCC NIJJYAXOARWZEE-UHFFFAOYSA-N 0.000 claims description 5
- YLMAHDNUQAMNNX-UHFFFAOYSA-N imatinib methanesulfonate Chemical compound CS(O)(=O)=O.C1CN(C)CCN1CC1=CC=C(C(=O)NC=2C=C(NC=3N=C(C=CN=3)C=3C=NC=CC=3)C(C)=CC=2)C=C1 YLMAHDNUQAMNNX-UHFFFAOYSA-N 0.000 claims description 5
- 125000004178 (C1-C4) alkyl group Chemical group 0.000 claims description 4
- 125000002777 acetyl group Chemical group [H]C([H])([H])C(*)=O 0.000 claims description 4
- 125000004442 acylamino group Chemical group 0.000 claims description 4
- 229940041181 antineoplastic drug Drugs 0.000 claims description 4
- 239000003596 drug target Substances 0.000 claims description 4
- 238000011269 treatment regimen Methods 0.000 claims description 4
- OBKXEAXTFZPCHS-UHFFFAOYSA-N 4-phenylbutyric acid Chemical compound OC(=O)CCCC1=CC=CC=C1 OBKXEAXTFZPCHS-UHFFFAOYSA-N 0.000 claims description 3
- JTDYUFSDZATMKU-UHFFFAOYSA-N 6-(1,3-dioxo-2-benzo[de]isoquinolinyl)-N-hydroxyhexanamide Chemical compound C1=CC(C(N(CCCCCC(=O)NO)C2=O)=O)=C3C2=CC=CC3=C1 JTDYUFSDZATMKU-UHFFFAOYSA-N 0.000 claims description 3
- DWIYBCKFYUQVLU-UHFFFAOYSA-N 7-[4-(4-cyanophenyl)phenoxy]-n-hydroxyheptanamide Chemical compound C1=CC(OCCCCCCC(=O)NO)=CC=C1C1=CC=C(C#N)C=C1 DWIYBCKFYUQVLU-UHFFFAOYSA-N 0.000 claims description 3
- FERIUCNNQQJTOY-UHFFFAOYSA-M Butyrate Chemical compound CCCC([O-])=O FERIUCNNQQJTOY-UHFFFAOYSA-M 0.000 claims description 3
- FERIUCNNQQJTOY-UHFFFAOYSA-N Butyric acid Natural products CCCC(O)=O FERIUCNNQQJTOY-UHFFFAOYSA-N 0.000 claims description 3
- DLVJMFOLJOOWFS-UHFFFAOYSA-N Depudecin Natural products CC(O)C1OC1C=CC1C(C(O)C=C)O1 DLVJMFOLJOOWFS-UHFFFAOYSA-N 0.000 claims description 3
- JWOGUUIOCYMBPV-UHFFFAOYSA-N OT-Key 11219 Natural products N1C(=O)C(CCCCCC(=O)CC)NC(=O)C2CCCCN2C(=O)C(C(C)CC)NC(=O)C1CC1=CN(OC)C2=CC=CC=C12 JWOGUUIOCYMBPV-UHFFFAOYSA-N 0.000 claims description 3
- IPBVNPXQWQGGJP-UHFFFAOYSA-N acetic acid phenyl ester Natural products CC(=O)OC1=CC=CC=C1 IPBVNPXQWQGGJP-UHFFFAOYSA-N 0.000 claims description 3
- 108010082820 apicidin Proteins 0.000 claims description 3
- 229930186608 apicidin Natural products 0.000 claims description 3
- GYKLFBYWXZYSOW-UHFFFAOYSA-N butanoyloxymethyl 2,2-dimethylpropanoate Chemical compound CCCC(=O)OCOC(=O)C(C)(C)C GYKLFBYWXZYSOW-UHFFFAOYSA-N 0.000 claims description 3
- DLVJMFOLJOOWFS-INMLLLKOSA-N depudecin Chemical compound C[C@@H](O)[C@@H]1O[C@H]1\C=C\[C@H]1[C@H]([C@H](O)C=C)O1 DLVJMFOLJOOWFS-INMLLLKOSA-N 0.000 claims description 3
- 229940087476 femara Drugs 0.000 claims description 3
- 230000001939 inductive effect Effects 0.000 claims description 3
- HPJKCIUCZWXJDR-UHFFFAOYSA-N letrozole Chemical compound C1=CC(C#N)=CC=C1C(N1N=CN=C1)C1=CC=C(C#N)C=C1 HPJKCIUCZWXJDR-UHFFFAOYSA-N 0.000 claims description 3
- 238000012544 monitoring process Methods 0.000 claims description 3
- 229940049953 phenylacetate Drugs 0.000 claims description 3
- WLJVXDMOQOGPHL-UHFFFAOYSA-N phenylacetic acid Chemical compound OC(=O)CC1=CC=CC=C1 WLJVXDMOQOGPHL-UHFFFAOYSA-N 0.000 claims description 3
- 229950009215 phenylbutanoic acid Drugs 0.000 claims description 3
- MFBOGIVSZKQAPD-UHFFFAOYSA-M sodium butyrate Chemical compound [Na+].CCCC([O-])=O MFBOGIVSZKQAPD-UHFFFAOYSA-M 0.000 claims description 3
- YECWTLGLNDDPGE-PIFXLSLCSA-N trichostatin C Chemical compound C(/[C@@H](C)C(=O)C=1C=CC(=CC=1)N(C)C)=C(/C)\C=C\C(=O)NO[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O YECWTLGLNDDPGE-PIFXLSLCSA-N 0.000 claims description 3
- YECWTLGLNDDPGE-UHFFFAOYSA-N trichostatin D Natural products C=1C=C(N(C)C)C=CC=1C(=O)C(C)C=C(C)C=CC(=O)NOC1OC(CO)C(O)C(O)C1O YECWTLGLNDDPGE-UHFFFAOYSA-N 0.000 claims description 3
- 229960000604 valproic acid Drugs 0.000 claims description 3
- 229950000578 vatalanib Drugs 0.000 claims description 3
- YCOYDOIWSSHVCK-UHFFFAOYSA-N vatalanib Chemical compound C1=CC(Cl)=CC=C1NC(C1=CC=CC=C11)=NN=C1CC1=CC=NC=C1 YCOYDOIWSSHVCK-UHFFFAOYSA-N 0.000 claims description 3
- DEQANNDTNATYII-OULOTJBUSA-N (4r,7s,10s,13r,16s,19r)-10-(4-aminobutyl)-19-[[(2r)-2-amino-3-phenylpropanoyl]amino]-16-benzyl-n-[(2r,3r)-1,3-dihydroxybutan-2-yl]-7-[(1r)-1-hydroxyethyl]-13-(1h-indol-3-ylmethyl)-6,9,12,15,18-pentaoxo-1,2-dithia-5,8,11,14,17-pentazacycloicosane-4-carboxa Chemical compound C([C@@H](N)C(=O)N[C@H]1CSSC[C@H](NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](CC=2C3=CC=CC=C3NC=2)NC(=O)[C@H](CC=2C=CC=CC=2)NC1=O)C(=O)N[C@H](CO)[C@H](O)C)C1=CC=CC=C1 DEQANNDTNATYII-OULOTJBUSA-N 0.000 claims description 2
- 108010016076 Octreotide Proteins 0.000 claims description 2
- 229910052739 hydrogen Inorganic materials 0.000 claims description 2
- 125000004123 n-propyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])* 0.000 claims description 2
- VMZMNAABQBOLAK-DBILLSOUSA-N pasireotide Chemical compound C([C@H]1C(=O)N2C[C@@H](C[C@H]2C(=O)N[C@H](C(=O)N[C@H](CC=2C3=CC=CC=C3NC=2)C(=O)N[C@H](C(N[C@@H](CC=2C=CC(OCC=3C=CC=CC=3)=CC=2)C(=O)N1)=O)CCCCN)C=1C=CC=CC=1)OC(=O)NCCN)C1=CC=CC=C1 VMZMNAABQBOLAK-DBILLSOUSA-N 0.000 claims description 2
- 229960005415 pasireotide Drugs 0.000 claims description 2
- 108700017947 pasireotide Proteins 0.000 claims description 2
- 229940072272 sandostatin Drugs 0.000 claims description 2
- 125000001475 halogen functional group Chemical group 0.000 claims 4
- FWZRWHZDXBDTFK-ZHACJKMWSA-N panobinostat Chemical group CC1=NC2=CC=C[CH]C2=C1CCNCC1=CC=C(\C=C\C(=O)NO)C=C1 FWZRWHZDXBDTFK-ZHACJKMWSA-N 0.000 claims 3
- 210000000481 breast Anatomy 0.000 claims 2
- 230000002496 gastric effect Effects 0.000 claims 2
- 208000037841 lung tumor Diseases 0.000 claims 2
- 208000025113 myeloid leukemia Diseases 0.000 claims 1
- XRASPMIURGNCCH-UHFFFAOYSA-N zoledronic acid Chemical compound OP(=O)(O)C(P(O)(O)=O)(O)CN1C=CN=C1 XRASPMIURGNCCH-UHFFFAOYSA-N 0.000 claims 1
- 229940002005 zometa Drugs 0.000 claims 1
- 230000000694 effects Effects 0.000 abstract description 37
- 239000013604 expression vector Substances 0.000 abstract description 31
- 238000000338 in vitro Methods 0.000 abstract description 6
- 238000003745 diagnosis Methods 0.000 abstract description 5
- 230000004043 responsiveness Effects 0.000 abstract description 5
- 206010071602 Genetic polymorphism Diseases 0.000 abstract description 2
- 238000012863 analytical testing Methods 0.000 abstract description 2
- 108090000623 proteins and genes Proteins 0.000 description 156
- 108010023925 Histone Deacetylase 6 Proteins 0.000 description 155
- 210000004027 cell Anatomy 0.000 description 93
- 239000000523 sample Substances 0.000 description 60
- 150000007523 nucleic acids Chemical group 0.000 description 59
- 102000003964 Histone deacetylase Human genes 0.000 description 58
- 108090000353 Histone deacetylase Proteins 0.000 description 58
- 239000002773 nucleotide Substances 0.000 description 58
- 125000003729 nucleotide group Chemical group 0.000 description 58
- 102000004169 proteins and genes Human genes 0.000 description 58
- 230000014509 gene expression Effects 0.000 description 53
- 108091034117 Oligonucleotide Proteins 0.000 description 50
- 235000018102 proteins Nutrition 0.000 description 50
- 102000039446 nucleic acids Human genes 0.000 description 48
- 108020004707 nucleic acids Proteins 0.000 description 48
- 108020004414 DNA Proteins 0.000 description 37
- 235000001014 amino acid Nutrition 0.000 description 36
- 238000004458 analytical method Methods 0.000 description 36
- 230000004044 response Effects 0.000 description 33
- 108020004999 messenger RNA Proteins 0.000 description 30
- 238000004949 mass spectrometry Methods 0.000 description 28
- 150000001413 amino acids Chemical class 0.000 description 27
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 26
- 238000001514 detection method Methods 0.000 description 26
- 125000005647 linker group Chemical group 0.000 description 26
- 238000003205 genotyping method Methods 0.000 description 25
- 230000001105 regulatory effect Effects 0.000 description 24
- 239000013615 primer Substances 0.000 description 22
- 210000001519 tissue Anatomy 0.000 description 22
- 239000003550 marker Substances 0.000 description 19
- 239000007787 solid Substances 0.000 description 18
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 16
- 230000006870 function Effects 0.000 description 16
- 239000000203 mixture Substances 0.000 description 15
- 238000003199 nucleic acid amplification method Methods 0.000 description 15
- 230000003321 amplification Effects 0.000 description 14
- 238000003556 assay Methods 0.000 description 14
- 230000026731 phosphorylation Effects 0.000 description 14
- 238000006366 phosphorylation reaction Methods 0.000 description 14
- 238000009396 hybridization Methods 0.000 description 13
- 230000004048 modification Effects 0.000 description 13
- 238000012986 modification Methods 0.000 description 13
- 238000013518 transcription Methods 0.000 description 13
- 230000035897 transcription Effects 0.000 description 13
- 125000003275 alpha amino acid group Chemical group 0.000 description 12
- 239000008280 blood Substances 0.000 description 12
- 210000004369 blood Anatomy 0.000 description 12
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 12
- 230000004927 fusion Effects 0.000 description 12
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 12
- 125000005843 halogen group Chemical group 0.000 description 12
- 230000001225 therapeutic effect Effects 0.000 description 12
- 108091028043 Nucleic acid sequence Proteins 0.000 description 11
- 239000011324 bead Substances 0.000 description 11
- 230000008859 change Effects 0.000 description 11
- 238000005516 engineering process Methods 0.000 description 11
- 239000012634 fragment Substances 0.000 description 11
- 239000011159 matrix material Substances 0.000 description 11
- 238000003752 polymerase chain reaction Methods 0.000 description 11
- 230000010741 sumoylation Effects 0.000 description 11
- 238000012360 testing method Methods 0.000 description 11
- 241001465754 Metazoa Species 0.000 description 10
- 150000007513 acids Chemical class 0.000 description 10
- 125000000539 amino acid group Chemical group 0.000 description 10
- 230000006907 apoptotic process Effects 0.000 description 10
- 230000037396 body weight Effects 0.000 description 10
- 239000002299 complementary DNA Substances 0.000 description 10
- 230000001404 mediated effect Effects 0.000 description 10
- 238000003259 recombinant expression Methods 0.000 description 10
- WEVYNIUIFUYDGI-UHFFFAOYSA-N 3-[6-[4-(trifluoromethoxy)anilino]-4-pyrimidinyl]benzamide Chemical compound NC(=O)C1=CC=CC(C=2N=CN=C(NC=3C=CC(OC(F)(F)F)=CC=3)C=2)=C1 WEVYNIUIFUYDGI-UHFFFAOYSA-N 0.000 description 9
- 102000002669 Small Ubiquitin-Related Modifier Proteins Human genes 0.000 description 9
- 108010043401 Small Ubiquitin-Related Modifier Proteins Proteins 0.000 description 9
- 125000003545 alkoxy group Chemical group 0.000 description 9
- 230000004071 biological effect Effects 0.000 description 9
- 210000000349 chromosome Anatomy 0.000 description 9
- 230000000295 complement effect Effects 0.000 description 9
- 238000010205 computational analysis Methods 0.000 description 9
- 210000004881 tumor cell Anatomy 0.000 description 9
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 8
- 102000053602 DNA Human genes 0.000 description 8
- 101000692455 Homo sapiens Platelet-derived growth factor receptor beta Proteins 0.000 description 8
- 102100026547 Platelet-derived growth factor receptor beta Human genes 0.000 description 8
- 125000003282 alkyl amino group Chemical group 0.000 description 8
- 230000033115 angiogenesis Effects 0.000 description 8
- 239000012472 biological sample Substances 0.000 description 8
- 238000002405 diagnostic procedure Methods 0.000 description 8
- 125000005842 heteroatom Chemical group 0.000 description 8
- 239000003276 histone deacetylase inhibitor Substances 0.000 description 8
- 210000004962 mammalian cell Anatomy 0.000 description 8
- 238000005259 measurement Methods 0.000 description 8
- 230000008569 process Effects 0.000 description 8
- 230000004853 protein function Effects 0.000 description 8
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 7
- 241000588724 Escherichia coli Species 0.000 description 7
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 7
- 125000004103 aminoalkyl group Chemical group 0.000 description 7
- 230000000692 anti-sense effect Effects 0.000 description 7
- 238000013459 approach Methods 0.000 description 7
- 239000002585 base Substances 0.000 description 7
- 238000006243 chemical reaction Methods 0.000 description 7
- 238000003795 desorption Methods 0.000 description 7
- HHZIURLSWUIHRB-UHFFFAOYSA-N nilotinib Chemical compound C1=NC(C)=CN1C1=CC(NC(=O)C=2C=C(NC=3N=C(C=CN=3)C=3C=NC=CC=3)C(C)=CC=2)=CC(C(F)(F)F)=C1 HHZIURLSWUIHRB-UHFFFAOYSA-N 0.000 description 7
- 125000005429 oxyalkyl group Chemical group 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- 239000000758 substrate Substances 0.000 description 7
- 102000004190 Enzymes Human genes 0.000 description 6
- 108090000790 Enzymes Proteins 0.000 description 6
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 6
- 241000699666 Mus <mouse, genus> Species 0.000 description 6
- 125000004390 alkyl sulfonyl group Chemical group 0.000 description 6
- 230000027455 binding Effects 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 230000000903 blocking effect Effects 0.000 description 6
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 6
- 230000004663 cell proliferation Effects 0.000 description 6
- 229940104302 cytosine Drugs 0.000 description 6
- 239000000975 dye Substances 0.000 description 6
- 229940088598 enzyme Drugs 0.000 description 6
- 102000054908 human HDAC6 Human genes 0.000 description 6
- FPOHNWQLNRZRFC-ZHACJKMWSA-N panobinostat Chemical compound CC=1NC2=CC=CC=C2C=1CCNCC1=CC=C(\C=C\C(=O)NO)C=C1 FPOHNWQLNRZRFC-ZHACJKMWSA-N 0.000 description 6
- 230000003334 potential effect Effects 0.000 description 6
- 238000011160 research Methods 0.000 description 6
- 230000008685 targeting Effects 0.000 description 6
- 238000013519 translation Methods 0.000 description 6
- 230000004075 alteration Effects 0.000 description 5
- 125000004391 aryl sulfonyl group Chemical group 0.000 description 5
- 230000008901 benefit Effects 0.000 description 5
- 125000002619 bicyclic group Chemical group 0.000 description 5
- 229960002685 biotin Drugs 0.000 description 5
- 239000011616 biotin Substances 0.000 description 5
- 238000003776 cleavage reaction Methods 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 230000007423 decrease Effects 0.000 description 5
- 238000000132 electrospray ionisation Methods 0.000 description 5
- 238000010369 molecular cloning Methods 0.000 description 5
- 239000012071 phase Substances 0.000 description 5
- 238000009520 phase I clinical trial Methods 0.000 description 5
- 239000013612 plasmid Substances 0.000 description 5
- 230000002265 prevention Effects 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 230000007017 scission Effects 0.000 description 5
- 238000007619 statistical method Methods 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- 125000000472 sulfonyl group Chemical group *S(*)(=O)=O 0.000 description 5
- 208000024891 symptom Diseases 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 238000002560 therapeutic procedure Methods 0.000 description 5
- 206010051066 Gastrointestinal stromal tumour Diseases 0.000 description 4
- 241000238631 Hexapoda Species 0.000 description 4
- 108010033040 Histones Proteins 0.000 description 4
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 4
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 4
- 208000033776 Myeloid Acute Leukemia Diseases 0.000 description 4
- 229930012538 Paclitaxel Natural products 0.000 description 4
- 238000011529 RT qPCR Methods 0.000 description 4
- 241000700159 Rattus Species 0.000 description 4
- 108020004511 Recombinant DNA Proteins 0.000 description 4
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 4
- 230000021736 acetylation Effects 0.000 description 4
- 230000002378 acidificating effect Effects 0.000 description 4
- 125000005119 alkyl cycloalkyl group Chemical group 0.000 description 4
- 230000001028 anti-proliverative effect Effects 0.000 description 4
- 238000003491 array Methods 0.000 description 4
- 235000020958 biotin Nutrition 0.000 description 4
- 210000001124 body fluid Anatomy 0.000 description 4
- 230000004069 differentiation Effects 0.000 description 4
- 239000000262 estrogen Substances 0.000 description 4
- 210000003527 eukaryotic cell Anatomy 0.000 description 4
- 201000011243 gastrointestinal stromal tumor Diseases 0.000 description 4
- 229940121372 histone deacetylase inhibitor Drugs 0.000 description 4
- KTUFNOKKBVMGRW-UHFFFAOYSA-N imatinib Chemical compound C1CN(C)CCN1CC1=CC=C(C(=O)NC=2C=C(NC=3N=C(C=CN=3)C=3C=NC=CC=3)C(C)=CC=2)C=C1 KTUFNOKKBVMGRW-UHFFFAOYSA-N 0.000 description 4
- 230000001965 increasing effect Effects 0.000 description 4
- 238000002372 labelling Methods 0.000 description 4
- 238000001840 matrix-assisted laser desorption--ionisation time-of-flight mass spectrometry Methods 0.000 description 4
- 125000004433 nitrogen atom Chemical group N* 0.000 description 4
- 229960001592 paclitaxel Drugs 0.000 description 4
- 210000001236 prokaryotic cell Anatomy 0.000 description 4
- 230000000069 prophylactic effect Effects 0.000 description 4
- 238000002864 sequence alignment Methods 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 150000003384 small molecules Chemical class 0.000 description 4
- 241000894007 species Species 0.000 description 4
- RCINICONZNJXQF-MZXODVADSA-N taxol Chemical compound O([C@@H]1[C@@]2(C[C@@H](C(C)=C(C2(C)C)[C@H](C([C@]2(C)[C@@H](O)C[C@H]3OC[C@]3([C@H]21)OC(C)=O)=O)OC(=O)C)OC(=O)[C@H](O)[C@@H](NC(=O)C=1C=CC=CC=1)C=1C=CC=CC=1)O)C(=O)C1=CC=CC=C1 RCINICONZNJXQF-MZXODVADSA-N 0.000 description 4
- 238000001890 transfection Methods 0.000 description 4
- 230000009261 transgenic effect Effects 0.000 description 4
- 238000011144 upstream manufacturing Methods 0.000 description 4
- 244000105975 Antidesma platyphyllum Species 0.000 description 3
- 108020005544 Antisense RNA Proteins 0.000 description 3
- 102000006947 Histones Human genes 0.000 description 3
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 3
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 3
- 241000102542 Kara Species 0.000 description 3
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 3
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 3
- 108091092878 Microsatellite Proteins 0.000 description 3
- 241001529936 Murinae Species 0.000 description 3
- 101100230662 Mus musculus Hdac6 gene Proteins 0.000 description 3
- 241000699670 Mus sp. Species 0.000 description 3
- 102100020718 Receptor-type tyrosine-protein kinase FLT3 Human genes 0.000 description 3
- 108091081021 Sense strand Proteins 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 108010090804 Streptavidin Proteins 0.000 description 3
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 3
- 239000004473 Threonine Substances 0.000 description 3
- 102000009484 Vascular Endothelial Growth Factor Receptors Human genes 0.000 description 3
- 238000006640 acetylation reaction Methods 0.000 description 3
- 230000002411 adverse Effects 0.000 description 3
- 125000004397 aminosulfonyl group Chemical group NS(=O)(=O)* 0.000 description 3
- 239000012491 analyte Substances 0.000 description 3
- 238000010171 animal model Methods 0.000 description 3
- 239000000427 antigen Substances 0.000 description 3
- 108091007433 antigens Proteins 0.000 description 3
- 102000036639 antigens Human genes 0.000 description 3
- 238000013528 artificial neural network Methods 0.000 description 3
- 239000010839 body fluid Substances 0.000 description 3
- 238000004422 calculation algorithm Methods 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 3
- 150000001720 carbohydrates Chemical class 0.000 description 3
- 235000014633 carbohydrates Nutrition 0.000 description 3
- 125000004181 carboxyalkyl group Chemical group 0.000 description 3
- 238000000423 cell based assay Methods 0.000 description 3
- 230000010261 cell growth Effects 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 238000012512 characterization method Methods 0.000 description 3
- 108010041758 cleavase Proteins 0.000 description 3
- 239000003184 complementary RNA Substances 0.000 description 3
- 229940127089 cytotoxic agent Drugs 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- 208000035475 disorder Diseases 0.000 description 3
- 230000001747 exhibiting effect Effects 0.000 description 3
- 239000012530 fluid Substances 0.000 description 3
- 238000002866 fluorescence resonance energy transfer Methods 0.000 description 3
- 125000000524 functional group Chemical group 0.000 description 3
- 239000011521 glass Substances 0.000 description 3
- 235000009424 haa Nutrition 0.000 description 3
- 230000036541 health Effects 0.000 description 3
- 230000006195 histone acetylation Effects 0.000 description 3
- 210000004408 hybridoma Anatomy 0.000 description 3
- 125000004356 hydroxy functional group Chemical group O* 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- 150000002484 inorganic compounds Chemical class 0.000 description 3
- 229910010272 inorganic material Inorganic materials 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 208000032839 leukemia Diseases 0.000 description 3
- 238000007834 ligase chain reaction Methods 0.000 description 3
- 150000002632 lipids Chemical class 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 238000002493 microarray Methods 0.000 description 3
- 238000003032 molecular docking Methods 0.000 description 3
- 150000002825 nitriles Chemical class 0.000 description 3
- 238000011275 oncology therapy Methods 0.000 description 3
- 150000002894 organic compounds Chemical class 0.000 description 3
- 230000037361 pathway Effects 0.000 description 3
- 230000002974 pharmacogenomic effect Effects 0.000 description 3
- 230000004481 post-translational protein modification Effects 0.000 description 3
- 230000003389 potentiating effect Effects 0.000 description 3
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 125000006239 protecting group Chemical group 0.000 description 3
- 108020001580 protein domains Proteins 0.000 description 3
- 230000009145 protein modification Effects 0.000 description 3
- 108700031431 rat HDAC6 Proteins 0.000 description 3
- 108091008598 receptor tyrosine kinases Proteins 0.000 description 3
- 102000027426 receptor tyrosine kinases Human genes 0.000 description 3
- 230000006798 recombination Effects 0.000 description 3
- 238000005215 recombination Methods 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 238000012552 review Methods 0.000 description 3
- 230000019491 signal transduction Effects 0.000 description 3
- 230000011664 signaling Effects 0.000 description 3
- 239000002904 solvent Substances 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 125000003396 thiol group Chemical group [H]S* 0.000 description 3
- 230000002103 transcriptional effect Effects 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 3
- 239000005483 tyrosine kinase inhibitor Substances 0.000 description 3
- 229940121358 tyrosine kinase inhibitor Drugs 0.000 description 3
- 150000004917 tyrosine kinase inhibitor derivatives Chemical class 0.000 description 3
- 241000701447 unidentified baculovirus Species 0.000 description 3
- 230000003612 virological effect Effects 0.000 description 3
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- LBUJPTNKIBCYBY-UHFFFAOYSA-N 1,2,3,4-tetrahydroquinoline Chemical compound C1=CC=C2CCCNC2=C1 LBUJPTNKIBCYBY-UHFFFAOYSA-N 0.000 description 2
- FCEHBMOGCRZNNI-UHFFFAOYSA-N 1-benzothiophene Chemical compound C1=CC=C2SC=CC2=C1 FCEHBMOGCRZNNI-UHFFFAOYSA-N 0.000 description 2
- BRARRAHGNDUELT-UHFFFAOYSA-N 3-hydroxypicolinic acid Chemical compound OC(=O)C1=NC=CC=C1O BRARRAHGNDUELT-UHFFFAOYSA-N 0.000 description 2
- 229940122815 Aromatase inhibitor Drugs 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 206010065553 Bone marrow failure Diseases 0.000 description 2
- 101100539164 Caenorhabditis elegans ubc-9 gene Proteins 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- 108020004635 Complementary DNA Proteins 0.000 description 2
- 102000003915 DNA Topoisomerases Human genes 0.000 description 2
- 108090000323 DNA Topoisomerases Proteins 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- 108010002156 Depsipeptides Proteins 0.000 description 2
- 229920002307 Dextran Polymers 0.000 description 2
- 238000004252 FT/ICR mass spectrometry Methods 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- 102000005720 Glutathione transferase Human genes 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- 102000002068 Glycopeptides Human genes 0.000 description 2
- 108010015899 Glycopeptides Proteins 0.000 description 2
- 102100039999 Histone deacetylase 2 Human genes 0.000 description 2
- 102100021454 Histone deacetylase 4 Human genes 0.000 description 2
- 102100021453 Histone deacetylase 5 Human genes 0.000 description 2
- 102100038720 Histone deacetylase 9 Human genes 0.000 description 2
- 101001035011 Homo sapiens Histone deacetylase 2 Proteins 0.000 description 2
- 101000899259 Homo sapiens Histone deacetylase 4 Proteins 0.000 description 2
- 101000899255 Homo sapiens Histone deacetylase 5 Proteins 0.000 description 2
- 101000932478 Homo sapiens Receptor-type tyrosine-protein kinase FLT3 Proteins 0.000 description 2
- 101000851007 Homo sapiens Vascular endothelial growth factor receptor 2 Proteins 0.000 description 2
- SIKJAQJRHWYJAI-UHFFFAOYSA-N Indole Chemical compound C1=CC=C2NC=CC2=C1 SIKJAQJRHWYJAI-UHFFFAOYSA-N 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- 108091027974 Mature messenger RNA Proteins 0.000 description 2
- 229940119336 Microtubule stabilizer Drugs 0.000 description 2
- 108010021466 Mutant Proteins Proteins 0.000 description 2
- 102000008300 Mutant Proteins Human genes 0.000 description 2
- JCXJVPUVTGWSNB-UHFFFAOYSA-N Nitrogen dioxide Chemical compound O=[N]=O JCXJVPUVTGWSNB-UHFFFAOYSA-N 0.000 description 2
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 2
- 239000004698 Polyethylene Substances 0.000 description 2
- KYQCOXFCLRTKLS-UHFFFAOYSA-N Pyrazine Chemical compound C1=CN=CC=N1 KYQCOXFCLRTKLS-UHFFFAOYSA-N 0.000 description 2
- JUJWROOIHBZHMG-UHFFFAOYSA-N Pyridine Chemical compound C1=CC=NC=C1 JUJWROOIHBZHMG-UHFFFAOYSA-N 0.000 description 2
- KAESVJOAVNADME-UHFFFAOYSA-N Pyrrole Chemical compound C=1C=CNC=1 KAESVJOAVNADME-UHFFFAOYSA-N 0.000 description 2
- SMWDFEZZVXVKRB-UHFFFAOYSA-N Quinoline Chemical compound N1=CC=CC2=CC=CC=C21 SMWDFEZZVXVKRB-UHFFFAOYSA-N 0.000 description 2
- 238000002123 RNA extraction Methods 0.000 description 2
- 102000006382 Ribonucleases Human genes 0.000 description 2
- 108010083644 Ribonucleases Proteins 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 102000005157 Somatostatin Human genes 0.000 description 2
- 108010056088 Somatostatin Proteins 0.000 description 2
- 102000013530 TOR Serine-Threonine Kinases Human genes 0.000 description 2
- 108010065917 TOR Serine-Threonine Kinases Proteins 0.000 description 2
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 2
- 108091008605 VEGF receptors Proteins 0.000 description 2
- 102100033177 Vascular endothelial growth factor receptor 2 Human genes 0.000 description 2
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 2
- DGYIJVNZSDYBOE-UHFFFAOYSA-N [CH2]C1=CC=NC=C1 Chemical group [CH2]C1=CC=NC=C1 DGYIJVNZSDYBOE-UHFFFAOYSA-N 0.000 description 2
- 230000005856 abnormality Effects 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 238000001261 affinity purification Methods 0.000 description 2
- 125000001931 aliphatic group Chemical group 0.000 description 2
- 125000003277 amino group Chemical group 0.000 description 2
- 239000005557 antagonist Substances 0.000 description 2
- 230000000259 anti-tumor effect Effects 0.000 description 2
- 238000003782 apoptosis assay Methods 0.000 description 2
- 239000003886 aromatase inhibitor Substances 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 238000003766 bioinformatics method Methods 0.000 description 2
- 229920001222 biopolymer Polymers 0.000 description 2
- 239000001506 calcium phosphate Substances 0.000 description 2
- 229910000389 calcium phosphate Inorganic materials 0.000 description 2
- 235000011010 calcium phosphates Nutrition 0.000 description 2
- 230000005907 cancer growth Effects 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 230000022131 cell cycle Effects 0.000 description 2
- 230000024245 cell differentiation Effects 0.000 description 2
- 230000032823 cell division Effects 0.000 description 2
- 229920002678 cellulose Polymers 0.000 description 2
- 235000010980 cellulose Nutrition 0.000 description 2
- 238000002512 chemotherapy Methods 0.000 description 2
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 2
- 125000001309 chloro group Chemical group Cl* 0.000 description 2
- 238000000975 co-precipitation Methods 0.000 description 2
- 230000001268 conjugating effect Effects 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 238000004132 cross linking Methods 0.000 description 2
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 2
- 230000006196 deacetylation Effects 0.000 description 2
- 238000003381 deacetylation reaction Methods 0.000 description 2
- NNBZCPXTIHJBJL-UHFFFAOYSA-N decalin Chemical compound C1CCCC2CCCCC21 NNBZCPXTIHJBJL-UHFFFAOYSA-N 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 235000011180 diphosphates Nutrition 0.000 description 2
- 231100000371 dose-limiting toxicity Toxicity 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 210000002889 endothelial cell Anatomy 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- NIHNNTQXNPWCJQ-UHFFFAOYSA-N fluorene Chemical compound C1=CC=C2CC3=CC=CC=C3C2=C1 NIHNNTQXNPWCJQ-UHFFFAOYSA-N 0.000 description 2
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 2
- 239000007850 fluorescent dye Substances 0.000 description 2
- 125000001153 fluoro group Chemical group F* 0.000 description 2
- 210000004602 germ cell Anatomy 0.000 description 2
- 229940080856 gleevec Drugs 0.000 description 2
- 230000009036 growth inhibition Effects 0.000 description 2
- 230000002489 hematologic effect Effects 0.000 description 2
- 229960002411 imatinib Drugs 0.000 description 2
- 230000028993 immune response Effects 0.000 description 2
- 238000003018 immunoassay Methods 0.000 description 2
- 238000010166 immunofluorescence Methods 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 238000005040 ion trap Methods 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- AWJUIBRHMBBTKR-UHFFFAOYSA-N isoquinoline Chemical compound C1=NC=CC2=CC=CC=C21 AWJUIBRHMBBTKR-UHFFFAOYSA-N 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 230000033001 locomotion Effects 0.000 description 2
- 239000002207 metabolite Substances 0.000 description 2
- 230000011278 mitosis Effects 0.000 description 2
- 230000036457 multidrug resistance Effects 0.000 description 2
- 238000002887 multiple sequence alignment Methods 0.000 description 2
- 125000001624 naphthyl group Chemical group 0.000 description 2
- 239000002547 new drug Substances 0.000 description 2
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 2
- 238000007899 nucleic acid hybridization Methods 0.000 description 2
- 239000002751 oligonucleotide probe Substances 0.000 description 2
- 238000002515 oligonucleotide synthesis Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 239000003960 organic solvent Substances 0.000 description 2
- 230000008520 organization Effects 0.000 description 2
- 230000003285 pharmacodynamic effect Effects 0.000 description 2
- 238000009521 phase II clinical trial Methods 0.000 description 2
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 2
- 210000004214 philadelphia chromosome Anatomy 0.000 description 2
- 238000000206 photolithography Methods 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 230000005522 programmed cell death Effects 0.000 description 2
- 230000035755 proliferation Effects 0.000 description 2
- 230000004952 protein activity Effects 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 125000006850 spacer group Chemical group 0.000 description 2
- 238000004611 spectroscopical analysis Methods 0.000 description 2
- NHXLMOGPVYXJNR-UHFFFAOYSA-N srif Chemical compound N1C(=O)C(C(C)O)NC(=O)C(CCCCN)NC(=O)C(CC=2C3=CC=CC=C3NC=2)NC(=O)C(CC=2C=CC=CC=2)NC(=O)C(CC=2C=CC=CC=2)NC(=O)C(CC(N)=O)NC(=O)C(CCCCN)NC(=O)C(NC(=O)CNC(=O)C(C)N)CSSCC(C(O)=O)NC(=O)C(CO)NC(=O)C(C(O)C)NC(=O)C1CC1=CC=CC=C1 NHXLMOGPVYXJNR-UHFFFAOYSA-N 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- CXWXQJXEFPUFDZ-UHFFFAOYSA-N tetralin Chemical compound C1=CC=C2CCCCC2=C1 CXWXQJXEFPUFDZ-UHFFFAOYSA-N 0.000 description 2
- 238000001269 time-of-flight mass spectrometry Methods 0.000 description 2
- 230000002110 toxicologic effect Effects 0.000 description 2
- 231100000027 toxicology Toxicity 0.000 description 2
- 108010060597 trapoxin A Proteins 0.000 description 2
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 2
- 238000000539 two dimensional gel electrophoresis Methods 0.000 description 2
- 210000003606 umbilical vein Anatomy 0.000 description 2
- 210000002700 urine Anatomy 0.000 description 2
- 239000013603 viral vector Substances 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- 239000011701 zinc Substances 0.000 description 2
- 229910052725 zinc Inorganic materials 0.000 description 2
- ASWBNKHCZGQVJV-UHFFFAOYSA-N (3-hexadecanoyloxy-2-hydroxypropyl) 2-(trimethylazaniumyl)ethyl phosphate Chemical compound CCCCCCCCCCCCCCCC(=O)OCC(O)COP([O-])(=O)OCC[N+](C)(C)C ASWBNKHCZGQVJV-UHFFFAOYSA-N 0.000 description 1
- 125000006763 (C3-C9) cycloalkyl group Chemical group 0.000 description 1
- GLUABPSZMHYCNO-UHFFFAOYSA-N 1,2,3,3a,4,5,6,6a-octahydropyrrolo[3,2-b]pyrrole Chemical compound N1CCC2NCCC21 GLUABPSZMHYCNO-UHFFFAOYSA-N 0.000 description 1
- PIHAUZGWAXLKCA-UHFFFAOYSA-N 1,2,3,4,4a,5,6,7,8,8a-decahydro-1,8-naphthyridine Chemical compound N1CCCC2CCCNC21 PIHAUZGWAXLKCA-UHFFFAOYSA-N 0.000 description 1
- QOBTUJBWKJKGCI-UHFFFAOYSA-N 1,2,3,4,6,11-hexahydropyrido[3,2-c][1,5]benzodiazepin-5-one Chemical compound O=C1NC2=CC=CC=C2NC2=C1CCCN2 QOBTUJBWKJKGCI-UHFFFAOYSA-N 0.000 description 1
- BCMCBBGGLRIHSE-UHFFFAOYSA-N 1,3-benzoxazole Chemical compound C1=CC=C2OC=NC2=C1 BCMCBBGGLRIHSE-UHFFFAOYSA-N 0.000 description 1
- ZNGWEEUXTBNKFR-UHFFFAOYSA-N 1,4-oxazepane Chemical compound C1CNCCOC1 ZNGWEEUXTBNKFR-UHFFFAOYSA-N 0.000 description 1
- JQBOAIAQGAIKBN-UHFFFAOYSA-N 1,5-dihydropyrido[2,3-b][1,4]diazepin-4-one Chemical compound N1C=CC(=O)NC2=NC=CC=C21 JQBOAIAQGAIKBN-UHFFFAOYSA-N 0.000 description 1
- 125000004343 1-phenylethyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])(*)C([H])([H])[H] 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- SVUOLADPCWQTTE-UHFFFAOYSA-N 1h-1,2-benzodiazepine Chemical compound N1N=CC=CC2=CC=CC=C12 SVUOLADPCWQTTE-UHFFFAOYSA-N 0.000 description 1
- HIYWOHBEPVGIQN-UHFFFAOYSA-N 1h-benzo[g]indole Chemical compound C1=CC=CC2=C(NC=C3)C3=CC=C21 HIYWOHBEPVGIQN-UHFFFAOYSA-N 0.000 description 1
- XWIYUCRMWCHYJR-UHFFFAOYSA-N 1h-pyrrolo[3,2-b]pyridine Chemical compound C1=CC=C2NC=CC2=N1 XWIYUCRMWCHYJR-UHFFFAOYSA-N 0.000 description 1
- IHKOXYNAMXNNDK-UHFFFAOYSA-N 2,3,3a,4,5,6a-hexahydrofuro[2,3-b]furan Chemical compound C1COC2OCCC21 IHKOXYNAMXNNDK-UHFFFAOYSA-N 0.000 description 1
- HCJXTKKLAOJVHJ-UHFFFAOYSA-N 2,3,3a,4a,5,6,7,7a,8,8a-decahydro-1h-dicyclopenta[2,1-b:2',1'-f]pyran Chemical compound C1C2CCCC2OC2CCCC21 HCJXTKKLAOJVHJ-UHFFFAOYSA-N 0.000 description 1
- URDIRPDCQWYRTI-UHFFFAOYSA-N 2,3,3a,5,6,6a-hexahydrothieno[3,2-b]thiophene Chemical compound S1CCC2SCCC21 URDIRPDCQWYRTI-UHFFFAOYSA-N 0.000 description 1
- GBCQLGDTLMHVHU-UHFFFAOYSA-N 2,3,4,4a,5,6,7,8,9,9a-decahydro-1h-benzo[7]annulene Chemical compound C1CCCCC2CCCCC21 GBCQLGDTLMHVHU-UHFFFAOYSA-N 0.000 description 1
- UFBJCMHMOXMLKC-UHFFFAOYSA-N 2,4-dinitrophenol Chemical compound OC1=CC=C([N+]([O-])=O)C=C1[N+]([O-])=O UFBJCMHMOXMLKC-UHFFFAOYSA-N 0.000 description 1
- FSPQCTGGIANIJZ-UHFFFAOYSA-N 2-[[(3,4-dimethoxyphenyl)-oxomethyl]amino]-4,5,6,7-tetrahydro-1-benzothiophene-3-carboxamide Chemical compound C1=C(OC)C(OC)=CC=C1C(=O)NC1=C(C(N)=O)C(CCCC2)=C2S1 FSPQCTGGIANIJZ-UHFFFAOYSA-N 0.000 description 1
- 125000006282 2-chlorobenzyl group Chemical group [H]C1=C([H])C(Cl)=C(C([H])=C1[H])C([H])([H])* 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- 125000000094 2-phenylethyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])([H])* 0.000 description 1
- MLLFHKLSIHJGEL-UHFFFAOYSA-N 3-amino-2-(2-nitrophenyl)propanoic acid Chemical compound NCC(C(O)=O)C1=CC=CC=C1[N+]([O-])=O MLLFHKLSIHJGEL-UHFFFAOYSA-N 0.000 description 1
- 125000003852 3-chlorobenzyl group Chemical group [H]C1=C([H])C(=C([H])C(Cl)=C1[H])C([H])([H])* 0.000 description 1
- 125000006283 4-chlorobenzyl group Chemical group [H]C1=C([H])C(=C([H])C([H])=C1Cl)C([H])([H])* 0.000 description 1
- 125000001255 4-fluorophenyl group Chemical group [H]C1=C([H])C(*)=C([H])C([H])=C1F 0.000 description 1
- 125000004172 4-methoxyphenyl group Chemical group [H]C1=C([H])C(OC([H])([H])[H])=C([H])C([H])=C1* 0.000 description 1
- OMGNOSZSCQGCGV-UHFFFAOYSA-N 6,11-dihydro-5h-benzo[b][1,4]benzodiazepine Chemical compound C1NC2=CC=CC=C2NC2=CC=CC=C12 OMGNOSZSCQGCGV-UHFFFAOYSA-N 0.000 description 1
- ZIUYHTQZEPDUCZ-UHFFFAOYSA-N 7h-pyrrolo[2,3-h]quinoline Chemical compound C1=CN=C2C(C=CN3)=C3C=CC2=C1 ZIUYHTQZEPDUCZ-UHFFFAOYSA-N 0.000 description 1
- 206010000830 Acute leukaemia Diseases 0.000 description 1
- 206010067484 Adverse reaction Diseases 0.000 description 1
- 102100027211 Albumin Human genes 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102100023635 Alpha-fetoprotein Human genes 0.000 description 1
- 208000024827 Alzheimer disease Diseases 0.000 description 1
- QGZKDVFQNNGYKY-UHFFFAOYSA-O Ammonium Chemical compound [NH4+] QGZKDVFQNNGYKY-UHFFFAOYSA-O 0.000 description 1
- 102000013142 Amylases Human genes 0.000 description 1
- 108010065511 Amylases Proteins 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 206010003445 Ascites Diseases 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 208000032800 BCR-ABL1 positive blast phase chronic myelogenous leukemia Diseases 0.000 description 1
- 208000004860 Blast Crisis Diseases 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 0 C*(C*)C(C)(*)*(C)C*(C)N(*(C)C/C(/C(C)(*)/C=C1)=C\C(C)(*)C=C1C(*)=CC(NO)=O)*=C Chemical compound C*(C*)C(C)(*)*(C)C*(C)N(*(C)C/C(/C(C)(*)/C=C1)=C\C(C)(*)C=C1C(*)=CC(NO)=O)*=C 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 240000001432 Calendula officinalis Species 0.000 description 1
- 235000005881 Calendula officinalis Nutrition 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 244000025254 Cannabis sativa Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 208000024172 Cardiovascular disease Diseases 0.000 description 1
- 102000052052 Casein Kinase II Human genes 0.000 description 1
- 108010010919 Casein Kinase II Proteins 0.000 description 1
- 102000005403 Casein Kinases Human genes 0.000 description 1
- 108010031425 Casein Kinases Proteins 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 241000700198 Cavia Species 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 238000002883 ClustalW sequence alignment Methods 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 206010052358 Colorectal cancer metastatic Diseases 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 239000004971 Cross linker Substances 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- 108020003215 DNA Probes Proteins 0.000 description 1
- 230000005778 DNA damage Effects 0.000 description 1
- 231100000277 DNA damage Toxicity 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 102000001477 Deubiquitinating Enzymes Human genes 0.000 description 1
- 108010093668 Deubiquitinating Enzymes Proteins 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- 241000255925 Diptera Species 0.000 description 1
- 101100506416 Drosophila melanogaster HDAC1 gene Proteins 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- 208000030453 Drug-Related Side Effects and Adverse reaction Diseases 0.000 description 1
- 108010059378 Endopeptidases Proteins 0.000 description 1
- 102000005593 Endopeptidases Human genes 0.000 description 1
- 108010013369 Enteropeptidase Proteins 0.000 description 1
- 102100029727 Enteropeptidase Human genes 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 208000031637 Erythroblastic Acute Leukemia Diseases 0.000 description 1
- 208000036566 Erythroleukaemia Diseases 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 241001481828 Glyptocephalus cynoglossus Species 0.000 description 1
- 108091027305 Heteroduplex Proteins 0.000 description 1
- 102100039869 Histone H2B type F-S Human genes 0.000 description 1
- 102100033636 Histone H3.2 Human genes 0.000 description 1
- 102000003893 Histone acetyltransferases Human genes 0.000 description 1
- 108090000246 Histone acetyltransferases Proteins 0.000 description 1
- 102100021455 Histone deacetylase 3 Human genes 0.000 description 1
- 102100038715 Histone deacetylase 8 Human genes 0.000 description 1
- 241001272567 Hominoidea Species 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101001035372 Homo sapiens Histone H2B type F-S Proteins 0.000 description 1
- 101000899282 Homo sapiens Histone deacetylase 3 Proteins 0.000 description 1
- 101001032113 Homo sapiens Histone deacetylase 7 Proteins 0.000 description 1
- 101001032118 Homo sapiens Histone deacetylase 8 Proteins 0.000 description 1
- 101001032092 Homo sapiens Histone deacetylase 9 Proteins 0.000 description 1
- 101000983850 Homo sapiens Phosphatidate phosphatase LPIN3 Proteins 0.000 description 1
- 101001012157 Homo sapiens Receptor tyrosine-protein kinase erbB-2 Proteins 0.000 description 1
- 108090000144 Human Proteins Proteins 0.000 description 1
- 102000003839 Human Proteins Human genes 0.000 description 1
- 241000701109 Human adenovirus 2 Species 0.000 description 1
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 1
- 208000013038 Hypocalcemia Diseases 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 102000009786 Immunoglobulin Constant Regions Human genes 0.000 description 1
- 108010009817 Immunoglobulin Constant Regions Proteins 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-N L-arginine Chemical compound OC(=O)[C@@H](N)CCCN=C(N)N ODKSFYDXXFIFQN-BYPYZUCNSA-N 0.000 description 1
- 125000000415 L-cysteinyl group Chemical group O=C([*])[C@@](N([H])[H])([H])C([H])([H])S[H] 0.000 description 1
- DWPCPZJAHOETAG-IMJSIDKUSA-N L-lanthionine Chemical group OC(=O)[C@@H](N)CSC[C@H](N)C(O)=O DWPCPZJAHOETAG-IMJSIDKUSA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 238000012563 MS-based analyses Methods 0.000 description 1
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 1
- PEEHTFAAVSWFBL-UHFFFAOYSA-N Maleimide Chemical compound O=C1NC(=O)C=C1 PEEHTFAAVSWFBL-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 206010027452 Metastases to bone Diseases 0.000 description 1
- 102000010645 MutS Proteins Human genes 0.000 description 1
- 108010038272 MutS Proteins Proteins 0.000 description 1
- 230000004988 N-glycosylation Effects 0.000 description 1
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- 229910002651 NO3 Inorganic materials 0.000 description 1
- 108091007491 NSP3 Papain-like protease domains Proteins 0.000 description 1
- 102000008763 Neurofilament Proteins Human genes 0.000 description 1
- 108010088373 Neurofilament Proteins Proteins 0.000 description 1
- VEQPNABPJHWNSG-UHFFFAOYSA-N Nickel(2+) Chemical compound [Ni+2] VEQPNABPJHWNSG-UHFFFAOYSA-N 0.000 description 1
- NHNBFGGVMKEFGY-UHFFFAOYSA-N Nitrate Chemical compound [O-][N+]([O-])=O NHNBFGGVMKEFGY-UHFFFAOYSA-N 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- BWDQBBCUWLSASG-MDZDMXLPSA-N OCCN(CCc1c[nH]c2c1cccc2)Cc1ccc(/C=C/C(NO)=O)cc1 Chemical compound OCCN(CCc1c[nH]c2c1cccc2)Cc1ccc(/C=C/C(NO)=O)cc1 BWDQBBCUWLSASG-MDZDMXLPSA-N 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- ZCQWOFVYLHDMMC-UHFFFAOYSA-N Oxazole Chemical compound C1=COC=N1 ZCQWOFVYLHDMMC-UHFFFAOYSA-N 0.000 description 1
- 238000002944 PCR assay Methods 0.000 description 1
- 241000282579 Pan Species 0.000 description 1
- 208000037273 Pathologic Processes Diseases 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- PCNDJXKNXGMECE-UHFFFAOYSA-N Phenazine Natural products C1=CC=CC2=NC3=CC=CC=C3N=C21 PCNDJXKNXGMECE-UHFFFAOYSA-N 0.000 description 1
- 102100025728 Phosphatidate phosphatase LPIN3 Human genes 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 239000004743 Polypropylene Substances 0.000 description 1
- 239000004793 Polystyrene Substances 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- 108010026552 Proteome Proteins 0.000 description 1
- WTKZEGDFNFYCGP-UHFFFAOYSA-N Pyrazole Chemical compound C=1C=NNC=1 WTKZEGDFNFYCGP-UHFFFAOYSA-N 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 102000009609 Pyrophosphatases Human genes 0.000 description 1
- 108010009413 Pyrophosphatases Proteins 0.000 description 1
- 108010066717 Q beta Replicase Proteins 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 208000035977 Rare disease Diseases 0.000 description 1
- 102100030086 Receptor tyrosine-protein kinase erbB-2 Human genes 0.000 description 1
- 101710151245 Receptor-type tyrosine-protein kinase FLT3 Proteins 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 102000051619 SUMO-1 Human genes 0.000 description 1
- 239000004113 Sepiolite Substances 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 101710081623 Small ubiquitin-related modifier 1 Proteins 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical group [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 1
- 108091008874 T cell receptors Proteins 0.000 description 1
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 1
- FZWLAAWBMGSTSO-UHFFFAOYSA-N Thiazole Chemical compound C1=CSC=N1 FZWLAAWBMGSTSO-UHFFFAOYSA-N 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- 206010070863 Toxicity to various agents Diseases 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 239000005862 Whey Substances 0.000 description 1
- 102000007544 Whey Proteins Human genes 0.000 description 1
- 108010046377 Whey Proteins Proteins 0.000 description 1
- XXFXTBNFFMQVKJ-UHFFFAOYSA-N [diphenyl(trityloxy)methyl]benzene Chemical group C=1C=CC=CC=1C(C=1C=CC=CC=1)(C=1C=CC=CC=1)OC(C=1C=CC=CC=1)(C=1C=CC=CC=1)C1=CC=CC=C1 XXFXTBNFFMQVKJ-UHFFFAOYSA-N 0.000 description 1
- 239000000370 acceptor Substances 0.000 description 1
- 239000008351 acetate buffer Substances 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 208000021841 acute erythroid leukemia Diseases 0.000 description 1
- 230000006838 adverse reaction Effects 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 150000003973 alkyl amines Chemical class 0.000 description 1
- 125000004947 alkyl aryl amino group Chemical group 0.000 description 1
- 238000007844 allele-specific PCR Methods 0.000 description 1
- 108010026331 alpha-Fetoproteins Proteins 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 229910021502 aluminium hydroxide Inorganic materials 0.000 description 1
- 150000001412 amines Chemical group 0.000 description 1
- 235000019418 amylase Nutrition 0.000 description 1
- 229940025131 amylases Drugs 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000001772 anti-angiogenic effect Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 230000009830 antibody antigen interaction Effects 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical group [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- RFRXIWQYSOIBDI-UHFFFAOYSA-N benzarone Chemical compound CCC=1OC2=CC=CC=C2C=1C(=O)C1=CC=C(O)C=C1 RFRXIWQYSOIBDI-UHFFFAOYSA-N 0.000 description 1
- 229940049706 benzodiazepine Drugs 0.000 description 1
- 229930187545 betaenone Natural products 0.000 description 1
- 239000013060 biological fluid Substances 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 239000000090 biomarker Substances 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 210000004204 blood vessel Anatomy 0.000 description 1
- 125000001246 bromo group Chemical group Br* 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 239000006172 buffering agent Substances 0.000 description 1
- OIMVCDGRHUIUQF-UHFFFAOYSA-N butanedioic acid phthalazine Chemical compound OC(=O)CCC(O)=O.c1ccc2cnncc2c1 OIMVCDGRHUIUQF-UHFFFAOYSA-N 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 125000004432 carbon atom Chemical group C* 0.000 description 1
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 1
- 238000010236 cell based technology Methods 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000025084 cell cycle arrest Effects 0.000 description 1
- 230000006721 cell death pathway Effects 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 230000030570 cellular localization Effects 0.000 description 1
- 230000033077 cellular process Effects 0.000 description 1
- 230000010094 cellular senescence Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 1
- 125000003636 chemical group Chemical group 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000003759 clinical diagnosis Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 238000000205 computational method Methods 0.000 description 1
- 230000003750 conditioning effect Effects 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 230000030944 contact inhibition Effects 0.000 description 1
- 239000013068 control sample Substances 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 125000001995 cyclobutyl group Chemical group [H]C1([H])C([H])([H])C([H])(*)C1([H])[H] 0.000 description 1
- 125000000113 cyclohexyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])(*)C([H])([H])C1([H])[H] 0.000 description 1
- 125000004210 cyclohexylmethyl group Chemical group [H]C([H])(*)C1([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 1
- 125000001511 cyclopentyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])(*)C1([H])[H] 0.000 description 1
- 125000001559 cyclopropyl group Chemical group [H]C1([H])C([H])([H])C1([H])* 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 230000006735 deficit Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000003935 denaturing gradient gel electrophoresis Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- PJQCANLCUDUPRF-UHFFFAOYSA-N dibenzocycloheptene Chemical compound C1CC2=CC=CC=C2CC2=CC=CC=C12 PJQCANLCUDUPRF-UHFFFAOYSA-N 0.000 description 1
- SLPJGDQJLTYWCI-UHFFFAOYSA-N dimethyl-(4,5,6,7-tetrabromo-1h-benzoimidazol-2-yl)-amine Chemical compound BrC1=C(Br)C(Br)=C2NC(N(C)C)=NC2=C1Br SLPJGDQJLTYWCI-UHFFFAOYSA-N 0.000 description 1
- 125000005982 diphenylmethyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])(*)C1=C([H])C([H])=C([H])C([H])=C1[H] 0.000 description 1
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 1
- 208000022602 disease susceptibility Diseases 0.000 description 1
- 208000037765 diseases and disorders Diseases 0.000 description 1
- 238000002651 drug therapy Methods 0.000 description 1
- 230000005684 electric field Effects 0.000 description 1
- 238000002330 electrospray ionisation mass spectrometry Methods 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 102000052116 epidermal growth factor receptor activity proteins Human genes 0.000 description 1
- 108700015053 epidermal growth factor receptor activity proteins Proteins 0.000 description 1
- CURLTUGMZLYLDI-UHFFFAOYSA-N epoxyketone group Chemical group O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 1
- DNJIEGIFACGWOD-UHFFFAOYSA-N ethyl mercaptane Natural products CCS DNJIEGIFACGWOD-UHFFFAOYSA-N 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000002550 fecal effect Effects 0.000 description 1
- 210000003754 fetus Anatomy 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- YRTCKZIKGWZNCU-UHFFFAOYSA-N furo[3,2-b]pyridine Chemical compound C1=CC=C2OC=CC2=N1 YRTCKZIKGWZNCU-UHFFFAOYSA-N 0.000 description 1
- 125000002541 furyl group Chemical group 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 229940083124 ganglion-blocking antiadrenergic secondary and tertiary amines Drugs 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 230000004545 gene duplication Effects 0.000 description 1
- 230000004077 genetic alteration Effects 0.000 description 1
- 231100000118 genetic alteration Toxicity 0.000 description 1
- 238000012252 genetic analysis Methods 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 230000007614 genetic variation Effects 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 238000011478 gradient descent method Methods 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 150000004820 halides Chemical class 0.000 description 1
- 125000005179 haloacetyl group Chemical group 0.000 description 1
- FBPFZTCFMRRESA-UHFFFAOYSA-N hexane-1,2,3,4,5,6-hexol Chemical compound OCC(O)C(O)C(O)C(O)CO FBPFZTCFMRRESA-UHFFFAOYSA-N 0.000 description 1
- 150000002402 hexoses Chemical class 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- OAKJQQAXSVQMHS-UHFFFAOYSA-O hydrazinium(1+) Chemical compound [NH3+]N OAKJQQAXSVQMHS-UHFFFAOYSA-O 0.000 description 1
- BNRNAKTVFSZAFA-UHFFFAOYSA-N hydrindane Chemical compound C1CCCC2CCCC21 BNRNAKTVFSZAFA-UHFFFAOYSA-N 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 125000001165 hydrophobic group Chemical group 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 229930005346 hydroxycinnamic acid Natural products 0.000 description 1
- 230000000705 hypocalcaemia Effects 0.000 description 1
- 210000001822 immobilized cell Anatomy 0.000 description 1
- 230000003100 immobilizing effect Effects 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 230000002055 immunohistochemical effect Effects 0.000 description 1
- 238000003364 immunohistochemistry Methods 0.000 description 1
- 101150030475 impact gene Proteins 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 238000000126 in silico method Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000009399 inbreeding Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- PZOUSPYUWWUPPK-UHFFFAOYSA-N indole Natural products CC1=CC=CC2=C1C=CN2 PZOUSPYUWWUPPK-UHFFFAOYSA-N 0.000 description 1
- RKJUIXBNRJVNHR-UHFFFAOYSA-N indolenine Natural products C1=CC=C2CC=NC2=C1 RKJUIXBNRJVNHR-UHFFFAOYSA-N 0.000 description 1
- LPAGFVYQRIESJQ-UHFFFAOYSA-N indoline Chemical compound C1=CC=C2NCCC2=C1 LPAGFVYQRIESJQ-UHFFFAOYSA-N 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 239000003049 inorganic solvent Substances 0.000 description 1
- 229910001867 inorganic solvent Inorganic materials 0.000 description 1
- 125000002346 iodo group Chemical group I* 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- SZVJSHCCFOBDDC-UHFFFAOYSA-N iron(II,III) oxide Inorganic materials O=[Fe]O[Fe]O[Fe]=O SZVJSHCCFOBDDC-UHFFFAOYSA-N 0.000 description 1
- 238000001155 isoelectric focusing Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 125000000842 isoxazolyl group Chemical group 0.000 description 1
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 1
- 238000009533 lab test Methods 0.000 description 1
- 125000005524 levulinyl group Chemical group 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 229920006008 lipopolysaccharide Polymers 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 210000002751 lymph Anatomy 0.000 description 1
- 230000035168 lymphangiogenesis Effects 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 230000005291 magnetic effect Effects 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000036210 malignancy Effects 0.000 description 1
- 230000003211 malignant effect Effects 0.000 description 1
- 210000005075 mammary gland Anatomy 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000001819 mass spectrum Methods 0.000 description 1
- 238000013178 mathematical model Methods 0.000 description 1
- 238000001906 matrix-assisted laser desorption--ionisation mass spectrometry Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- TWXDDNPPQUTEOV-FVGYRXGTSA-N methamphetamine hydrochloride Chemical compound Cl.CN[C@@H](C)CC1=CC=CC=C1 TWXDDNPPQUTEOV-FVGYRXGTSA-N 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 125000004170 methylsulfonyl group Chemical group [H]C([H])([H])S(*)(=O)=O 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 235000013336 milk Nutrition 0.000 description 1
- 239000008267 milk Substances 0.000 description 1
- 210000004080 milk Anatomy 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 230000007498 myristoylation Effects 0.000 description 1
- YOHYSYJDKVYCJI-UHFFFAOYSA-N n-[3-[[6-[3-(trifluoromethyl)anilino]pyrimidin-4-yl]amino]phenyl]cyclopropanecarboxamide Chemical compound FC(F)(F)C1=CC=CC(NC=2N=CN=C(NC=3C=C(NC(=O)C4CC4)C=CC=3)C=2)=C1 YOHYSYJDKVYCJI-UHFFFAOYSA-N 0.000 description 1
- 125000004108 n-butyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 239000006225 natural substrate Substances 0.000 description 1
- 210000005044 neurofilament Anatomy 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 229910001453 nickel ion Inorganic materials 0.000 description 1
- 125000000449 nitro group Chemical group [O-][N+](*)=O 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 230000036963 noncompetitive effect Effects 0.000 description 1
- 238000003499 nucleic acid array Methods 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 238000011330 nucleic acid test Methods 0.000 description 1
- 230000000269 nucleophilic effect Effects 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 239000003921 oil Substances 0.000 description 1
- 238000002966 oligonucleotide array Methods 0.000 description 1
- 230000005959 oncogenic signaling Effects 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 125000001181 organosilyl group Chemical group [SiH3]* 0.000 description 1
- 239000001301 oxygen Chemical group 0.000 description 1
- 125000003854 p-chlorophenyl group Chemical group [H]C1=C([H])C(*)=C([H])C([H])=C1Cl 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 239000000123 paper Substances 0.000 description 1
- 230000005298 paramagnetic effect Effects 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 230000009054 pathological process Effects 0.000 description 1
- 150000002972 pentoses Chemical class 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 239000000816 peptidomimetic Substances 0.000 description 1
- 238000009522 phase III clinical trial Methods 0.000 description 1
- 125000003170 phenylsulfonyl group Chemical group C1(=CC=CC=C1)S(=O)(=O)* 0.000 description 1
- 125000005936 piperidyl group Chemical group 0.000 description 1
- 239000000902 placebo Substances 0.000 description 1
- 229940068196 placebo Drugs 0.000 description 1
- 210000002381 plasma Anatomy 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 230000010287 polarization Effects 0.000 description 1
- 229920001983 poloxamer Polymers 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920000447 polyanionic polymer Polymers 0.000 description 1
- 229920000573 polyethylene Polymers 0.000 description 1
- 229920002704 polyhistidine Polymers 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 229920001155 polypropylene Polymers 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 229920002223 polystyrene Polymers 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 230000002335 preservative effect Effects 0.000 description 1
- 238000009117 preventive therapy Methods 0.000 description 1
- 150000003141 primary amines Chemical class 0.000 description 1
- 239000002987 primer (paints) Substances 0.000 description 1
- 238000000159 protein binding assay Methods 0.000 description 1
- 238000000164 protein isolation Methods 0.000 description 1
- 230000009822 protein phosphorylation Effects 0.000 description 1
- 230000006337 proteolytic cleavage Effects 0.000 description 1
- UMJSCPRVCHMLSP-UHFFFAOYSA-N pyridine Natural products COC1=CC=CN=C1 UMJSCPRVCHMLSP-UHFFFAOYSA-N 0.000 description 1
- YEYHFKBVNARCNE-UHFFFAOYSA-N pyrido[2,3-b]pyrazine Chemical compound N1=CC=NC2=CC=CN=C21 YEYHFKBVNARCNE-UHFFFAOYSA-N 0.000 description 1
- 125000004076 pyridyl group Chemical group 0.000 description 1
- 238000012175 pyrosequencing Methods 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000022983 regulation of cell cycle Effects 0.000 description 1
- 230000025053 regulation of cell proliferation Effects 0.000 description 1
- 230000014493 regulation of gene expression Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000008943 replicative senescence Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 238000005096 rolling process Methods 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 150000003839 salts Chemical group 0.000 description 1
- 238000003118 sandwich ELISA Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 125000002914 sec-butyl group Chemical group [H]C([H])([H])C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 238000011519 second-line treatment Methods 0.000 description 1
- 210000000582 semen Anatomy 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000009589 serological test Methods 0.000 description 1
- 239000010703 silicon Substances 0.000 description 1
- 229910052710 silicon Inorganic materials 0.000 description 1
- 238000002922 simulated annealing Methods 0.000 description 1
- QFJCIRLUMZQUOT-HPLJOQBZSA-N sirolimus Chemical class C1C[C@@H](O)[C@H](OC)C[C@@H]1C[C@@H](C)[C@H]1OC(=O)[C@@H]2CCCCN2C(=O)C(=O)[C@](O)(O2)[C@H](C)CC[C@H]2C[C@H](OC)/C(C)=C/C=C/C=C/[C@@H](C)C[C@@H](C)C(=O)[C@H](OC)[C@H](O)/C(C)=C/[C@@H](C)C(=O)C1 QFJCIRLUMZQUOT-HPLJOQBZSA-N 0.000 description 1
- 210000000329 smooth muscle myocyte Anatomy 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- NHXLMOGPVYXJNR-ATOGVRKGSA-N somatostatin Chemical class C([C@H]1C(=O)N[C@H](C(N[C@@H](CO)C(=O)N[C@@H](CSSC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N1)[C@@H](C)O)NC(=O)CNC(=O)[C@H](C)N)C(O)=O)=O)[C@H](O)C)C1=CC=CC=C1 NHXLMOGPVYXJNR-ATOGVRKGSA-N 0.000 description 1
- 229940075620 somatostatin analogue Drugs 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 238000003153 stable transfection Methods 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- KXRZBTAEDBELFD-UHFFFAOYSA-N sulfamethopyrazine Chemical compound COC1=NC=CN=C1NS(=O)(=O)C1=CC=C(N)C=C1 KXRZBTAEDBELFD-UHFFFAOYSA-N 0.000 description 1
- 239000011593 sulfur Chemical group 0.000 description 1
- 210000004243 sweat Anatomy 0.000 description 1
- 210000001138 tear Anatomy 0.000 description 1
- 125000000999 tert-butyl group Chemical group [H]C([H])([H])C(*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 125000003718 tetrahydrofuranyl group Chemical group 0.000 description 1
- 125000001412 tetrahydropyranyl group Chemical group 0.000 description 1
- WROMPOXWARCANT-UHFFFAOYSA-N tfa trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F.OC(=O)C(F)(F)F WROMPOXWARCANT-UHFFFAOYSA-N 0.000 description 1
- 125000001544 thienyl group Chemical group 0.000 description 1
- 150000003573 thiols Chemical group 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 125000002088 tosyl group Chemical group [H]C1=C([H])C(=C([H])C([H])=C1C([H])([H])[H])S(*)(=O)=O 0.000 description 1
- 238000007056 transamidation reaction Methods 0.000 description 1
- 108091006105 transcriptional corepressors Proteins 0.000 description 1
- 108091008023 transcriptional regulators Proteins 0.000 description 1
- 230000037426 transcriptional repression Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 150000003852 triazoles Chemical class 0.000 description 1
- 125000002023 trifluoromethyl group Chemical group FC(F)(F)* 0.000 description 1
- 125000002221 trityl group Chemical group [H]C1=C([H])C([H])=C([H])C([H])=C1C([*])(C1=C(C(=C(C(=C1[H])[H])[H])[H])[H])C1=C([H])C([H])=C([H])C([H])=C1[H] 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 230000004614 tumor growth Effects 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- MSRILKIQRXUYCT-UHFFFAOYSA-M valproate semisodium Chemical compound [Na+].CCCC(C(O)=O)CCC.CCCC(C([O-])=O)CCC MSRILKIQRXUYCT-UHFFFAOYSA-M 0.000 description 1
- 238000009834 vaporization Methods 0.000 description 1
- 230000008016 vaporization Effects 0.000 description 1
- PXXNTAGJWPJAGM-UHFFFAOYSA-N vertaline Natural products C1C2C=3C=C(OC)C(OC)=CC=3OC(C=C3)=CC=C3CCC(=O)OC1CC1N2CCCC1 PXXNTAGJWPJAGM-UHFFFAOYSA-N 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K31/00—Medicinal preparations containing organic active ingredients
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
- A61K38/04—Peptides having up to 20 amino acids in a fully defined sequence; Derivatives thereof
- A61K38/15—Depsipeptides; Derivatives thereof
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
Definitions
- This invention relates generally to the analytical testing of tissue samples in vitro, and more particularly to aspects of genetic mutations and polymorphisms of HDAC6.
- Theranostic tests are also useful to select subjects for treatments that are particularly likely to benefit from the treatment or to provide an early and objective indication of treatment efficacy in individual subjects, so that the treatment can be altered with a minimum of delay.
- Theranostics are useful in clinical diagnosis and management of a variety of diseases and disorders, which include, but are not limited to, e.g., cardiovascular disease, cancer, infectious diseases, Alzheimer's disease and the prediction of drug toxicity or drug resistance.
- Theranostic tests may be developed in any suitable diagnostic testing format, which include, but is not limited to, e.g., immunohistochemical tests, clinical chemistry, immunoassay, cell-based technologies, and nucleic acid tests.
- SNPs single nucleotide polymorphisms
- HDACs Distinct classes of HDACs have been identified in mammalian cells.
- Class I members of the HDAC family such as HDACl, HDAC2, HDAC3, and HDAC8, are enzymatic transcriptional corepressors homologous to yeast Rpd3.
- Class II members of the HDAC family including HDAC4, HDAC5, HDAC6, HDAC7, and HDAC9, possess domains similar to the 1 deacetylase domain of yeast Hdal. Bertos et al, Biochem. Cell Biol, 79(3):243-52 (2001).
- HDACs are known to play key roles in the regulation of cell proliferation. Consequently, inhibition of HDACs has become an important approach for anti-cancer therapy.
- Histone acetylation modifiers have been described to participate as cofactors in mammalian transcriptional complexes involved in the regulation of cellular proliferation and differentiation. Mahlknecht et al, Biochem. Biophys. Res. Commun., 263(2):482-90 (1999). Indeed, increasing evidence suggests a connection between histone acetylation and the development of cancer and leukemia.
- Cancer cells exhibit a set of unique properties that distinguish them from their normal counterparts, e.g., increased growth rates, loss of differentiation, escape from cell death pathways, evasion of antiproliferative signals, a decreased reliance on exogenous growth factors and escape from replicative senescence. Wade PA., Hum. MoI Genet., 10(7):693-8 (2001). Acquisition of these cellular features by malignant cells is typified by impairment of normal cellular control mechanisms. The failure to deacetylate and thus repress transcription by the Class I histone deacetylases HDACl and HDAC2 due to disruption of the Rb family of proteins has been firmly established as a mechanism leading to increases in growth rate and cellular proliferation.
- Trichostatin A is a Streptomyces metabolite that causes differentiation of murine erythroleukemia cells as well as specific inhibition of the cell cycle of some lower eukaryotes and mammalian cells.
- the targeted molecule of TSA has been shown by genetic and biochemical analyses to be histone deacetylases (HDACs).
- HDACs histone deacetylases
- HDAC6 a novel cinnamic hydroxamic acid analog
- CML-BC chronic myeloid leukemia blast crisis
- the invention provides for the use of an HDAC6 modulating agent in the manufacture of a medicament for the treatment of cancer in a selected patient population.
- the patient population is selected on the basis of the genotype of the patients at an HDAC6 genetic locus indicative of efficacy of the HDAC6 modulating agent in treating cancer.
- the cancer can be acute myeloid leukemia (a.k.a., acute myelogenous leukemia).
- the invention also provides an isolated polynucleotide having a sequence encoding an HDAC6 mutation.
- the HDAC6 mutations are the previously- unidentified mutations listed in TABLE 1. Accordingly, the invention provides vectors and organisms containing the HDAC6 mutations of the invention and polypeptides encoded by polynucleotides containing the HDAC6 mutations of the invention.
- the invention further provides a method for treating cancer in a subject.
- the genotype or haplotype of a subject is obtained at an HDAC6 gene locus, so that the genotype and/or haplotype is indicative of a propensity of the cancer to respond to the drug.
- an anticancer therapy is administered to the subject.
- the invention provides a method for diagnosing cancer in a subject and a method for choosing subjects for inclusion in a clinical trial for determining efficacy of an HDAC6 modulating agent; in both these methods the genotype and/or haplotype of a subject is interrogated at an HDAC6 gene locus. Also provided by the invention are kits for use in determining a treatment strategy for cancer.
- the invention also provides for the use of each of the mutations of the inventions as a drug target.
- the invention provides methods for modulating pathways and cascades associated with HDAC and HDAC mutant molecules.
- the various aspects of the present invention relate to polynucleotides encoding HDAC6 mutations and polymorphisms of the invention, expression vectors encoding the HDAC6 mutant polypeptides of the invention and organisms that express the HDAC6 mutant/polymorphic polynucleotides and/or HDAC6 mutant/polymorphic polypeptides of the invention.
- the various aspects of the present invention further relate to diagnostic/theranostic methods and kits that use the HDAC6 mutations and/or polymorphisms of the invention to identify individuals predisposed to disease or to classify individuals and tumours with regard to drug responsiveness, side effects, or optimal drug dose.
- the invention provides methods for compound validation and a computer system for storing and analyzing data related to the HDAC6 mutations and polymorphisms of the invention. Accordingly, various particular embodiments that illustrate these aspects follow. [12] Definitions. The definitions of certain terms as used in this specification are provided below. Definitions of other terms may be found in the glossary provided by the U.S. Department of Energy, Office of Science, Human Genome Project ( ⁇ ttp://vv ⁇ vw.ornl.gov/sci/techresources/Human_Genome/glossary/).
- allele means a particular form of a gene or DNA sequence at a specific chromosomal location (locus).
- the term “antibody” includes, but is not limited to, polyclonal antibodies, monoclonal antibodies, humanized or chimeric antibodies and biologically functional antibody fragments sufficient for binding of the antibody fragment to the protein.
- the term “clinical response” means any or all of the following: a quantitative measure of the response, no response, and adverse response (i.e., side effects).
- the term “clinical trial” means any research study designed to collect clinical data on responses to a particular treatment, and includes but is not limited to phase I, phase II and phase III clinical trials. Standard methods are used to define the patient population and to enrol subjects.
- the phrase "disease associated with a HDAC6" mutation refers to any disease or disorder arising from a mutation in at least one position a gene encoding histone deacetylase 6 (HDAC6).
- the mutation refers to any alteration of the nucleic acid sequence encoding HDAC6 that inactivates the functionality of the protein produced by that gene.
- Such mutations can include, but are not limited to, an amino acid substitution wherein a native amino acid is replaced with another amino acid residue.
- diseases include, but are not limited to, acute myeloid leukaemia (AML), a chronic myeloid leukemia (CML), hyperplasia, glioblastoma, melanoma, cholangioma, breast cancer, genitourinary cancer, lung cancer, non-small-cell lung cancer (NSCLC), gastrointestinal cancer, epidermoid cancer, melanoma, ovarian cancer, pancreas cancer, neuroblastoma, head cancer, neck cancer, bladder cancer, renal cancer, brain cancer, gastric cancer, prostate cancer, colorectal cancer, multiple myeloma and lymphomas.
- AML acute myeloid leukaemia
- CML chronic myeloid leukemia
- hyperplasia glioblastoma, melanoma
- cholangioma breast cancer
- lung cancer non-small-cell lung cancer
- gastrointestinal cancer epidermoid cancer
- melanoma ovarian cancer
- pancreas cancer neuroblastom
- the term "effective amount" of a compound is a quantity sufficient to achieve a desired pharmacodynamic, toxicologic, therapeutic and/or prophylactic effect, for example, an amount which results in the prevention of or a decrease in the symptoms associated with a disease that is being treated, e.g., the diseases associated with HDAC6 mutant polypeptides and HDAC6 mutant polynucleotides identified herein.
- the amount of compound administered to the subject will depend on the type and severity of the disease and on the characteristics of the individual, such as general health, age, sex, body weight and tolerance to drugs. It will also depend on the degree, severity and type of disease. The skilled artisan will be able to determine appropriate dosages depending on these and other factors.
- an effective amount of the compounds of the present invention sufficient for achieving a therapeutic or prophylactic effect, range from about 0.000001 mg per kilogram body weight per day to about 10,000 mg per kilogram body weight per day.
- the dosage ranges are from about 0.0001 mg per kilogram body weight per day to about 100 mg per kilogram body weight per day.
- the compounds of the present invention can also be administered in combination with each other, or with one or more additional therapeutic compounds.
- Glivec® (Gleevec®; imatinib) is a medication for chronic myeloid leukaemia (CML) and certain stages of gastrointestinal stromal tumours (GIST). It targets and interferes with the molecular abnormalities that drive the growth of cancer cells.
- CML chronic myeloid leukaemia
- GIST gastrointestinal stromal tumours
- Glivec® has potential as an anticancer therapy for several types of cancer, including leukaemia and solid tumours.
- the aromatase inhibitor FEMARA is a treatment for advanced breast cancer in postmenopausal women. It blocks the use of oestrogen by certain types of breast cancer that require oestrogen to grow. Janicke F, Breast 13 Suppl l:S10-8 (December 2004); Mouridsen
- Sandostatin® LAR® is used to treat patients with acromegaly and to control symptoms, such as severe diarrhoea and flushing, in patients with functional gastro-entero- pancreatic (GEP) tumours (e.g., metastatic carcinoid tumours and vasoactive intestinal peptide-secreting tumours [Vipomas]).
- GEP gastro-entero- pancreatic
- Sandostatin® LAR® regulates hormones in the body to help manage diseases and their symptoms.
- ZOMETA® is a treatment for hypocalcaemia of malignancy (HCM)I and for the treatment of bone metastases across a broad range of tumour types. These tumours include multiple myeloma, prostrate cancer, breast cancer, lung cancer, renal cancer and other solid tumours. Rosen LS et al., Cancer 100(12):2613-21 (June 15, 2004).
- Vatalanib (l-[4-chloroanilino]-4-[4-pyridylmethyl] phthalazine succinate) is a multi- VEGF receptor (VEGF) inhibitor that may block the creation of new blood vessels to prevent tumour growth.
- VEGF VEGF receptor
- This compound inhibits all known VEGF receptor tyrosine kinases, blocking angiogenesis and lymphangiogenesis. Drevs J et al., Cancer Res. 60:4819-4824 (2000); Wood JM et al., Cancer Res. 60:2178-2189 (2000).
- Vatalanib is being studied in two large, multinational, randomized, phase III, placebo-controlled trials in combination with FOLFOX- 4 in first-line and second-line treatment of patients with metastatic colorectal cancer.
- everolimus inhibits oncogenic signalling in tumour cells.
- mTOR mammalian target of rapamycin
- everolimus exhibits broad antiproliferative activity in tumour cell lines and animal models of cancer. Boulay A et al., Cancer Res. 64:252-261 (2004).
- everolimus also potently inhibited the proliferation of human umbilical vein endothelial cells directly indicating an involvement in angiogenesis.
- everolimus may provide a clinical benefit to patients with cancer.
- Everolimus is being investigated for its antitumour properties in a number of clinical studies in patients with haematological and solid tumours. Huang S & Houghton PJ, Curr. Opin. Investig. Drugs 3:295-304 (2002).
- Gimatecan is a novel oral inhibitor of topoisomerase I (topo I). Gimatecan blocks cell division in cells that divide rapidly, such as cancer cells, which activates apoptosis. Preclinical data indicate that gimatecan is not a substrate for multidrug resistance pumps, and that it increases the drug-target interaction. De Cesare M et al, Cancer Res. 61:7189-7195 (2001). Phase I clinical studies indicate that the dose-limiting toxicity of gimatecan is myelosuppression.
- Patupilone is a microtubule stabilizer.
- Altmann K-H Curr. Opin. Chem. Biol. 5:424- 431 (2001); Altmann K-H et al., Biochim Biophys Acta. 470:M79-M91 (2000); O'Neill V et al., 36th Annual Meeting of the American Society of Clinical Oncology; May 19-23, 2000; New La, LA, Abstract 829; Calvert PM et al. Proceedings of the 11th National Cancer Institute-European Organization for Research and Treatment of Cancer/American Association for Cancer Research Symposium on New Drugs in Cancer Therapy; November 7- 10, 2000; Amsterdam, The Netherlands, Abstract 575.
- Patupilone blocked mitosis and induced apoptosis greater than the frequently used anticancer drug paclitaxel. Also, patupilone retained full activity against human cancer cells that were resistant to paclitaxel and other chemotherapeutic agents.
- Midostaurin is an inhibitor of multiple signalling proteins. By targeting specific receptor tyrosine kinases and components of several signal transduction pathways, midostaurin impacts several targets involved in cell growth (e.g., KIT 5 PDGFR, PKC), leukaemic cell proliferation (e.g., FLT3), and angiogenesis (e.g., VEGFR2).
- KIT 5 PDGFR PDGFR
- PKC leukaemic cell proliferation
- angiogenesis e.g., VEGFR2
- midostaurin showed broad antiproliferative activity against various tumour cell lines, including those that were resistant to several other chemotherapeutic agents.
- the somatostatin analogue pasireotide is a stable cyclohexapeptide with broad somatotropin release inhibiting factor (SRIF) receptor binding.
- SRIF broad somatotropin release inhibiting factor
- LBH589 is a histone deacetylase (HDAC) inhibitor. By blocking the deacetylase activity of HDAC, HDAC inhibitors activate gene transcription of critical genes that cause apoptosis (programmed cell death). By triggering apoptosis, LBH589 induces growth inhibition and regression in tumour cell lines. LBH589 is being tested in phase I clinical trials as an anticancer agent. See also, George P et al, Blood 105(4): 1768-76 (February 15, 2005).
- AMNl 07 is an oral tyrosine kinase inhibitor that targets Bcr-Abl, KIT, and PDGFR.
- AMNl 07 is highly potent and has high selectivity for Bcr-Abl, KIT, and PDGFR.
- AMN 107 also shows activity against mutated variants of Bcr-Abl.
- AMN 107 is currently being studied in phase I clinical trials.
- AMNl 07 is an oral tyrosine kinase inhibitor that targets Bcr-Abl, KIT, and PDGFR.
- AMNl 07 is highly potent and has high selectivity for Bcr-Abl. KIT, and PDGFR.
- AMN 107 also shows activity against mutated variants of Bcr-Abl.
- AMN107 is currently being studied in phase I clinical trials.
- HDAC6 modulating agent is any compound that alters ⁇ e.g., increases or decreases).
- HDAC6 modulating agent can be a small molecule, antibody, polypeptide, carbohydrate, lipid, nucleotide, or combination thereof.
- the HDAC6 modulating agent can be an organic compound or an inorganic compound.
- the HDAC6 modulating agents is a histone deacetylase inhibitors (HDAI).
- HDAI are described in e.g. Monneret C, European Journal of Medicinal Chemistry, 40, (2005), 1—13, the contents of witch is herewith incorporated by reference.
- HDAIs include for example sodium butyrate, phenylacetate, phenylbutyrate, valproic acid, tributyrinpivaloyloxymethyl butyrate, pivanex®, trichostatinA (TSA), trichostatin C, trapoxins A and B, depudecin, cyclic hydroxamic-acid containing peptide (CHAPs), apicidin or OSI-2040, suberoylanilide hydroxamic acid (SAHA), oxamflatindepsipeptide, FK228, scriptaid, biarylhydroxamate inhibitor, A-161906, JNJ16241199, PDX 101, MS-275, CI-994.
- TSAHA cyclic hydroxamic-acid containing peptide
- SAHA suberoylanilide hydroxamic acid
- biarylhydroxamate inhibitor A-161906, JNJ16241199, PDX 101, MS-275, CI
- HDAI compounds of particular interest are hydroxamate compounds described by the formula I:
- Ri is H, halo, or a straight chain C J -C 6 alkyl (especially methyl, ethyl or n-propyl, which methyl, ethyl and n-propyl substituents are unsubstituted or substituted by one or more substituents described below for alkyl substituents);
- R2 is selected from H 5 Ci-Cio alkyl, (preferably Ci-C ⁇ alkyl, e.g.
- R5 is selected from H, C 1 -CO alkyl, C 4 - C 9 cycloalkyl, C 4 — C 9 heterocycloalkyl, acyl, aryl, heteroaryl, arylalkyl (e.g. benzyl), heteroarylalkyl (e.g.
- n, ni, n 2 and n 3 are the same or different and independently selected from 0 — 6, when nj is 1- 6, each carbon atom can be optionally and independently substituted with R 3 and/or R 4 ;
- X and Y are the same or different and independently selected from H, halo, Cj-C 4 alkyl, such as CH 3 and CF 3 , NO 2 , C(O)Ri, OR 9 , SR 9 , CN, and NRi 0 Ri 1 ;
- R 6 is selected from H, Ci-C ⁇ alkyl, C 4 - C 9 cycloalkyl, C 4 - C 9 heterocycloalkyl, cycloalkylalkyl (e.g., cyclopropylmethyl), aryl, heteroaryl, arylalkyl (e.g., benzyl, 2- phenylethenyl), heteroarylalkyl (e.g., pyridylmethyl), OR 12 , and NR 13 R 14 ;
- R 7 is selected from ORj 5 , SRi 5 , S(O)Ri 6 , SO 2 Ri 75 NRi 3 Ri 4 , and NRj 2 SO 2 R 6 ;
- R 8 is selected from H, ORi 5 , NR J3 R J4 , C r C 6 alkyl, C 4 - C 9 cycloalkyl, C 4 - C 9 heterocycloalkyl, aryl, heteroaryl, arylalkyl (e.g., benzyl), and heteroarylalkyl (e.g., pyridylmethyl);
- R 9 is selected from Cj - C 4 alkyl, for example, CH 3 and CF 3 , C(O)-alkyl, for example C(O)CH 3 , and C(O)CF 3 ;
- Rio and Rn are the same or different and independently selected from H, C t -C 4 alkyl, and -C(O)-alkyl;
- Ri 2 is selected from H, Ci -Ce alkyl, C 4 — C 9 cycloalkyl, C 4 - C9 heterocycloalkyl, C 4 — C 9 heterocycloalkylalkyl, aryl, mixed aryl and non-aryl polycycle, heteroaryl, arylalkyl (e.g., benzyl), and heteroarylalkyl (e.g., pyridylmethyl);
- Rn and R 14 are the same or different and independently selected from H, Q-C 6 alkyl, C 4 — C9 cycloalkyl, C 4 — C 9 heterocycloalkyl, aryl, heteroaryl, arylalkyl (e.g., benzyl), heteroarylalkyl (e.g., pyridylmethyl), amino acyl, or R 13 and R 14 together with the nitrogen to which they are bound are C4 — C 9 heterocycloalkyl, heteroaryl, polyheteroaryl, non-aromatic polyheterocycle or mixed aryl and non-aryl polyheterocycle;
- R 1 S is selected from H, Cj-C ⁇ alkyl, C 4 — C 9 cycloalkyl, C 4 - C 9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl and (CH 2 ) m ZRi 2 ;
- Ri 6 is selected from Ci-C 6 alkyl, C 4 — C 9 cycloalkyl, C 4 — C 9 heterocycloalkyl, aryl, heteroaryl, polyheteroaryl, arylalkyl, heteroarylalkyl and (CHa) 1n ZRi 2 ;
- Ri 7 is selected from Ci-Ce alkyl, C 4 - C 9 cycloalkyl, C 4 — C9 heterocycloalkyl, aryl, aromatic polycycles, heteroaryl, arylalkyl, heteroarylalkyl, polyheteroaryl and NR ⁇ R ⁇ ; m is an integer selected from 0 to 6; and
- Z is selected from O, NRj 3 , S and S(O).
- unsubstituted means that there is no substituent or that the only substituents are hydrogen.
- Halo substituents are selected from fluoro, chloro, bromo and iodo, preferably fluoro or chloro.
- Alkyl substituents include straight and branched Ci-C ⁇ alkyl, unless otherwise noted.
- suitable straight and branched Ci-C ⁇ alkyl substituents include methyl, ethyl, n- propyl, 2-propyl, n-butyl, sec-butyl, t-butyl, and the like.
- the alkyl substituents include both unsubstituted alkyl groups and alkyl groups that are substituted by one or more suitable substituents, including unsaturation (i.e.
- alkyl groups there are one or more double or triple C-C bonds), acyl, cycloalkyl, halo, oxyalkyl, alkylamino, aminoalkyl, acylamino and OR1 5 , for example, alkoxy.
- Preferred substituents for alkyl groups include halo, hydroxy, alkoxy, oxyalkyl, alkylamino, and aminoalkyl.
- Cycloalkyl substituents include C 3 -C 9 cycloalkyl groups, such as cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl and the like, unless otherwise specified. Unless otherwise noted, cycloalkyl substituents include both unsubstituted cycloalkyl groups and cycloalkyl groups that are substituted by one or more suitable substituents, including Ci-C 6 alkyl, halo, hydroxy, aminoalkyl, oxyalkyl, alkylamino, and ORi 5 , such as alkoxy.
- Preferred substituents for cycloalkyl groups include halo, hydroxy, alkoxy, oxyalkyl, alkylamino and aminoalkyl.
- alkyl and cycloalkyl substituents also applies to the alkyl portions of other substituents, such as without limitation, alkoxy, alkyl amines, alkyl ketones, arylalkyl, heteroarylalkyl, alkylsulfonyl and alkyl ester substituents and the like.
- Heterocycloalkyl substituents include 3 to 9 membered aliphatic rings, such as 4 to 7 membered aliphatic rings, containing from one to three heteroatoms selected from nitrogen, sulfur, oxygen.
- heterocycloalkyl substituents include pyrrolidyl, tetrahydrofuryl, tetrahydrothiofuranyl, piperidyl, piperazyl, tetrahydropyranyl, morphilino, 1,3-diazapane, 1,4-diazapane, 1 ,4-oxazepane, and 1,4-oxathiapane.
- the rings are unsubstituted or substituted on the carbon atoms by one or more suitable substituents, including C]-C 6 alkyl, C4 - C 9 cycloalkyl, aryl, heteroaryl, arylalkyl (e.g., benzyl), and heteroarylalkyl (e.g., pyridylmethyl), halo, amino, alkyl amino and OR 1S , for example alkoxy.
- suitable substituents including C]-C 6 alkyl, C4 - C 9 cycloalkyl, aryl, heteroaryl, arylalkyl (e.g., benzyl), and heteroarylalkyl (e.g., pyridylmethyl), halo, amino, alkyl amino and OR 1S , for example alkoxy.
- nitrogen heteroatoms are unsubstituted or substituted by H, C 1 -C 4 alkyl, arylalkyl (e.g., benzyl), and heteroarylalkyl (e.g., pyridylmethyl), acyl, aminoacyl, alkylsulfonyl, and arylsulfonyl.
- Cycloalkylalkyl substituents include compounds of the formula -(CH 2 ) n5 -cycloalkyl wherein n5 is a number from 1-6.
- Suitable alkylcycloalkyl substituents include cyclopentylmethyl-, cyclopentylethyl, cyclohexylmethyl and the like. Such substituents are unsubstituted or substituted in the alkyl portion or in the cycloalkyl portion by a suitable substituent, including those listed above for alkyl and cycloalkyl.
- Aryl substituents include unsubstituted phenyl and phenyl substituted by one or more suitable substituents, including C 1 -C 6 alkyl, cycloalkylalkyl (e.g., cyclopropylmethyl), O(CO)alkyl, oxyalkyl, halo, nitro, amino, alkylamino, aminoalkyl, alkyl ketones, nitrile, carboxyalkyl, alkylsulfonyl, aminosulfonyl, arylsulfonyl, and OR1 5 , such as alkoxy.
- suitable substituents including C 1 -C 6 alkyl, cycloalkylalkyl (e.g., cyclopropylmethyl), O(CO)alkyl, oxyalkyl, halo, nitro, amino, alkylamino, aminoalkyl, alkyl ketones, nitrile, carboxyalkyl, alkylsulf
- Preferred substituents include including Ci-C 6 alkyl, cycloalkyl (e.g., cyclopropylmethyl), alkoxy, oxyalkyl, halo, nitro, amino, alkylamino, aminoalkyl, alkyl ketones, nitrile, carboxyalkyl, alkylsulfonyl, arylsulfonyl, and aminosulfonyl.
- Suitable aryl groups include Ci- C 4 alkylphenyl, Ci-C 4 alkoxyphenyl, trifluoromethylphenyl, methoxyphenyl, hydroxyethylphenyl, dimethylaminophenyl, aminopropylphenyl, carbethoxyphenyl, methanesulfonylphenyl and tolylsulfonylphenyl.
- Aromatic polycycles include naphthyl, and naphthyl substituted by one or more suitable substituents, including Ci-C 6 alkyl, alkylcycloalkyl (e.g., cyclopropylmethyl), oxyalkyl, halo, nitro, amino, alkylamino, aminoalkyl, alkyl ketones, nitrile, carboxyalkyl, alkylsulfonyl, arylsulfonyl, aminosulfonyl and ORi 5 , such as alkoxy.
- suitable substituents including Ci-C 6 alkyl, alkylcycloalkyl (e.g., cyclopropylmethyl), oxyalkyl, halo, nitro, amino, alkylamino, aminoalkyl, alkyl ketones, nitrile, carboxyalkyl, alkylsulfonyl, arylsulfonyl, aminosulfonyl and ORi 5
- Heteroaryl substituents include compounds with a 5 to 7 member aromatic ring containing one or more heteroatoms, for example from 1 to 4 heteroatoms, selected from N, O and S.
- Typical heteroaryl substituents include furyl, thienyl, pyrrole, pyrazole, triazole, thiazole, oxazole, pyridine, pyrimidine, isoxazolyl, pyrazine and the like.
- heteroaryl substituents are unsubstituted or substituted on a carbon atom by one or more suitable substituents, including alkyl, the alkyl substituents identified above, and another heteroaryl substituent.
- Nitrogen atoms are unsubstituted or substituted, for example by R1 3 ; especially useful N substituents include H, C1 — C4 alkyl, acyl, aminoacyl, and sulfonyl.
- Arylalkyl substituents include groups of the formula -(CH 2 ) n5 -aryl, -(CH 2 ) n5 -i-(CH- aryl)-(CH 2 ) n5 -aryl or -(CH 2 )n5-iCH(aryl)(aryl) wherein aryl and n5 are defined above.
- Such arylalkyl substituents include benzyl, 2-phenylethyl.
- Arylalkyl substituents are unsubstituted or substituted in the alkyl moiety or the aryl moiety or both as described above for alkyl and aryl substituents.
- Heteroarylalkyl substituents include groups of the formula — (CH 2 ) n5 -heteroaryl wherein heteroaryl and n5 are defined above and the bridging group is linked to a carbon or a nitrogen of the heteroaryl portion, such as 2-, 3- or 4-pyridylmethyl, imidazolylmethyl, quinolylethyl, and pyrrolylbutyl. Heteroaryl substituents are unsubstituted or substituted as discussed above for heteroaryl and alkyl substituents.
- Amino acyl substituents include groups of the formula -C(O)-(CH 2 ) n -C(H)(NRi 3 Ri 4 )- (CH2) n -R5 wherein n, Rn, R14 and R 5 are described above.
- Suitable aminoacyl substituents include natural and non-natural amino acids such as glycinyl, D-tryptophanyl, L-lysinyl, D- or L-homoserinyl, 4-aminobutryic acyl, ⁇ -3-amin-4-hexenoyl.
- Non-aromatic polycycle substituents include bicyclic and tricyclic fused ring systems where each ring can be 4-9 membered and each ring can contain zero, 1 or more double and/or triple bonds.
- Suitable examples of non-aromatic polycycles include decalin, octahydroindene, perhydrobenzocycloheptene, perhydrobenzo-[/]-azulene. Such substituents are unsubstituted or substituted as described above for cycloalkyl groups.
- Mixed aryl and non-aryl polycycle substituents include bicyclic and tricyclic fused ring systems where each ring can be 4 — 9 membered and at least one ring is aromatic.
- Suitable examples of mixed aryl and non-aryl polycycles include methylenedioxyphenyl, bis- methylenedioxyphenyl, 1,2,3,4-tetrahydronaphthalene, dibenzosuberane, dihdydroanthracene, 9H-fluorene. Such substituents are unsubstituted or substituted by nitro or as described above for cycloalkyl groups.
- Polyheteroaryl substituents include bicyclic and tricyclic fused ring systems where each ring can independently be 5 or 6 membered and contain one or more heteroatom, for example, 1, 2, 3, or 4 heteroatoms, chosen from O, N or S such that the fused ring system is aromatic.
- Suitable examples of polyheteroaryl ring systems include quinoline, isoquinoline, pyridopyrazine, pyrrolopyridine, furopyridine, indole, benzofuran, benzothiofuran, benzindole, benzoxazole, pyrroloquinoline, and the like.
- Nitrogen atoms are unsubstituted or substituted, for example by Rj 3 ; especially useful N substituents include H, Ci — C 4 alkyl, acyl, aminoacyl, and sulfonyl.
- Non-aromatic polyheterocyclic substituents include bicyclic and tricyclic fused ring systems where each ring can be 4 — 9 membered, contain one or more heteroatom, for example, 1, 2, 3, or 4 heteroatoms, chosen from O, N or S and contain zero or one or more C- C double or triple bonds.
- non-aromatic polyheterocycles include hexitol, cis-perhydro-cyclohepta[b]pyridinyl, decahydro-benzo[f][l,4]oxazepinyl, 2,8- dioxabicyclo[3.3.0]octane, hexahydro-thieno[3,2-b]thiophene, perhydropyrrolo[3,2-b]pyrrole, perhydronaphthyridine, perhydro-lH-dicyclopenta[b,e]pyran.
- non- aromatic polyheterocyclic substituents are unsubstituted or substituted on a carbon atom by one or more substituents, including alkyl and the alkyl substituents identified above.
- Nitrogen atoms are unsubstituted or substituted, for example, by R 13 ; especially useful N substituents include H, C 1 - C 4 alkyl, acyl, aminoacyl, and sulfonyl.
- Mixed aryl and non-aryl polyheterocycles substituents include bicyclic and tricyclic fused ring systems where each ring can be 4 - 9 membered, contain one or more heteroatom chosen from O, N or S, and at least one of the rings must be aromatic.
- Suitable examples of mixed aryl and non-aryl polyheterocycles include 2,3-dihydroindole, 1,2,3,4- tetrahydroquinoline,5, 11 -dihydro- 10H-dibenz[b,e] [ 1 ,4]diazepine,5H- dibenzo[b,e] [1 ,4]diazepine, 1 ,2-dihydropyrrolo[3,4-b][l ,5]benzodiazepine, 1,5-dihydro- pyrido[2,3-b] [1 ,4]diazepin-4-one, 1 ,2,3,4,6, 11 -hexahydro-benzo[b]pyrido[2,3-e] [1 ,4]diazepin- 5-one.
- Nitrogen atoms are unsubstituted or substituted, for example, by R 13 ; especially useful N substituents include H, Ci - C4 alkyl, acyl, aminoacyl, and sulfonyl.
- Amino substituents include primary, secondary and tertiary amines and in salt form, quaternary amines.
- Examples of amino substituents include mono- and di-alkylamino, mono- and di-aryl amino, mono- and di-arylalkyl amino, aryl-arylalkylamino, alkyl-arylamino, alkyl- arylalkylamino and the like.
- Sulfonyl substituents include alkylsulfonyl and arylsulfonyl, for example methane sulfonyl, benzene sulfonyl, tosyl and the like.
- Acyl substituents include groups of formula -C(O)-W, -OC(O)-W, -C(O)-O-W or - C(O)NRi 3 Ri 4 , where W is R 16 , H or cycloalkylalkyl.
- Acylamino substituents include substituents of the formula -N(R ⁇ )C(O)-W, — N(R 12 )C(O)-O-W, and -N(Ri 2 )C(O)-NHOH and Ri 2 and W are defined above.
- Preferences for each of the substituents include the following: Ri is H, halo, or a straight chain Ci -C 4 alkyl;
- R 2 is selected from H, C 1 -C 6 alkyl, C 4 — C 9 cycloalkyl, C 4 — C 9 heterocycloalkyl, alkylcycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl, -(CH 2 ) n C(O)R 6 , amino acyl, and -(CHa) n R 7 ;
- Rs is selected from H, C 1 -CO alkyl, C 4 — C 9 cycloalkyl, C 4 — C 9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl, a aromatic polycycle, a non-aromatic polycycle, a mixed aryl and non-aryl polycycle, polyheteroaryl, a non-aromatic polyheterocycle, and a mixed aryl and non-aryl polyheterocycle; n, ni, ri 2 and n 3 are the same or different and independently selected from 0 — 6, when ni is 1-6, each carbon atom is unsubstituted or independently substituted with R 3 and/or R 4 ;
- X and Y are the same or different and independently selected from H 5 halo, C 1 -C 4 alkyl, CF 3 , NO 2 , C(O)R 1 , OR 9 , SR 9 , CN, and NRi 0 Rn;
- Re is selected from H, Ci-C 6 alkyl, C 4 - C9 cycloalkyl, C 4 - C 9 heterocycloalkyl, alkylcycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl, OR 12 , and NR 13 R 14 ;
- R 7 is selected from OR 15 , SR 15 , S(O)Ri 6 , SO 2 Rn, NR J3 RH, and NR 12 SO 2 R 6 ;
- R 8 is selected from H, ORi 5 , NR 13 Rj 4 , C r C 6 alkyl, C 4 - C 9 cycloalkyl, C 4 - C 9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, and heteroarylalkyl;
- R 9 is selected from C 1 - C 4 alkyl and C(O)-alkyl
- Rio and Rj 1 are the same or different and independently selected from H, Cj-C 4 alkyl, and -C(O)-alkyl;
- R12 is selected from H, Ci-Ce alkyl, C 4 — C 9 cycloalkyl, C 4 - C 9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, and heteroarylalkyl;
- R 13 and Ri 4 are the same or different and independently selected from H, Cj-C 6 alkyl, C 4 — C 9 cycloalkyl, C 4 — C 9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl and amino acyl;
- Ris is selected from H, Ci-C 6 alkyl, C 4 - C 9 cycloalkyl, C 4 - C 9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl and (CH2) m ZRi 2 ;
- R16 is selected from Ci-Ce alkyl, C 4 - C 9 cycloalkyl, C 4 - C 9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl and (CH 2 ) m ZRi 2 ;
- R 17 is selected from C 1 -C 6 alkyl, C 4 — C 9 cycloalkyl, C 4 - C 9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl and NRi 3 Rj 4 ; m is an integer selected from 0 to 6; and
- Z is selected from O, NRj 3 , S, S(O).
- Useful compounds of the formula (I) include those wherein each of Ri, X, Y, R 3 , and R 4 is H, including those wherein one of n 2 and n 3 is zero and the other is 1, especially those wherein R 2 is H Or-CH 2 -CH 2 -OH.
- One suitable genus of hydroxamate compounds are those of formula Ia:
- R 2 is selected from H, Cj-C 6 alkyl, C 4 — C 9 cycloalkyl, C 4 - C 9 heterocycloalkyl, alkylcycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl, -(CHa) n C(O)R ⁇ , amino acyl and - (CHz) n R 7 ;
- R 5 ' is heteroaryl, heteroarylalkyl (e.g., pyridylmethyl), aromatic polycycles, non-aromatic polycycles, mixed aryl and non-aryl polycycles, polyheteroaryl, or mixed aryl and non-aryl polyheterocycles.
- R 5 ' is aryl, arylalkyl, aromatic polycycles, non-aromatic polycycles, and mixed aryl and non-aryl polycycles; especially aryl, such as p-fluorophenyl, p-chlorophenyl. p-O-Ci-C4-alkylphenyl, such as p-methoxyphenyl, and p-Ci-C 4 -alkylphenyl; and arylalkyl.
- the term "effective amount" of a compound is a quantity sufficient to achieve a desired pharmacodynamic, toxicologic, therapeutic and/or prophylactic effect, for example, an amount which results in the prevention of or a decrease in the symptoms associated with a disease that is being treated, e.g., the diseases associated with HDAC6 mutant polypeptides and HDAC6 mutant polynucleotides identified herein.
- the amount of compound administered to the subject will depend on the type and severity of the disease and on the characteristics of the individual, such as general health, age, sex, body weight and tolerance to drugs. It will also depend on the degree, severity and type of disease. The skilled artisan will be able to determine appropriate dosages depending on these and other factors.
- an effective amount of the compounds of the present invention sufficient for achieving a therapeutic or prophylactic effect, range from about 0.000001 mg per kilogram body weight per day to about 10,000 mg per kilogram body weight per day.
- the dosage ranges are from about 0.0001 mg per kilogram body weight per day to about 100 mg per kilogram body weight per day.
- the compounds of the present invention can also be administered in combination with each other, or with one or more additional therapeutic compounds.
- Glivec® (Gleevec®; imatinib) is a medication for chronic myeloid leukaemia (CML) and certain stages of gastrointestinal stromal tumours (GIST). It targets and interferes with the molecular abnormalities that drive the growth of cancer cells. Corless CL et al, J. Clin. Oncol. 22(18):3813-25 (September 15, 2004); Verweij J et al, Lancet 364(9440): 1127-34 (September 25, 2004); Kantarjian HM et al, Blood 104(7): 1979-88 (October 1, 2004). By inhibiting multiple targets, Glivec® has potential as an anticancer therapy for several types of cancer, including leukaemia and solid tumours.
- the aromatase inhibitor FEMARA ® is a treatment for advanced breast cancer in postmenopausal women. It blocks the use of oestrogen by certain types of breast cancer that require oestrogen to grow. Janicke F, Breast 13 Suppl l:S10-8 (December 2004); Mouridsen H et al., Oncologist 9(5):489-96 (2004).
- Sandostatin® LAR® is used to treat patients with acromegaly and to control symptoms, such as severe diarrhoea and flushing, in patients with functional gastro-entero- pancreatic (GEP) tumours (e.g., metastatic carcinoid tumours and vasoactive intestinal peptide-secreting tumours [VIPomas]).
- GEP gastro-entero- pancreatic
- Sandostatin® LAR® regulates hormones in the body to help manage diseases and their symptoms.
- ZOMET A® is a treatment for hypocalcaemia of malignancy (HCM)I and for the treatment of bone metastases across a broad range of tumour types. These tumours include multiple myeloma, prostrate cancer, breast cancer, lung cancer, renal cancer and other solid tumours. Rosen LS et al, Cancer 100(12):2613-21 (June 15, 2004).
- Vatalanib (l-[4-chloroanilino]-4-[4-pyridylmethyl] phthalazine succinate) is a multi- VEGF receptor (VEGF) inhibitor that may block the creation of new blood vessels to prevent tumour growth.
- VEGF VEGF receptor
- This compound inhibits all known VEGF receptor tyrosine kinases, blocking angiogenesis and lymphangiogenesis. Drevs J et al, Cancer Res. 60:4819-4824 (2000); Wood JM et al, Cancer Res. 60:2178-2189 (2000).
- Vatalanib is being studied in two large, multinational, randomized, phase III, placebo-controlled trials in combination with FOLFOX- 4 in first-line and second-line treatment of patients with metastatic colorectal cancer.
- rapamycin derivative everolirnus inhibits oncogenic signalling in tumour cells.
- mTOR mammalian target of rapamycin
- everolimus exhibits broad antiproliferative activity in tumour cell lines and animal models of cancer. Boulay A et ah, Cancer Res. 64:252-261 (2004).
- everolimus also potently inhibited the proliferation of human umbilical vein endothelial cells directly indicating an involvement in angiogenesis.
- everolimus may provide a clinical benefit to patients with cancer.
- Everolimus is being investigated for its antitumour properties in a number of clinical studies in patients with haematological and solid tumours. Huang S & Houghton PJ, Curr. Opin. Investig. Drugs 3:295-304 (2002).
- Gimatecan is a novel oral inhibitor of topoisomerase I (topo I). Gimatecan blocks cell division in cells that divide rapidly, such as cancer cells, which activates apoptosis. Preclinical data indicate that gimatecan is not a substrate for multidrug resistance pumps, and that it increases the drug-target interaction. De Cesare M et ah, Cancer Res. 61:7189-7195 (2001). Phase I clinical studies indicate that the dose-limiting toxicity of gimatecan is myelosuppression.
- Patupilone is a microtubule stabilizer.
- Altmann K-H Curr. Opin. Chem. Biol. 5:424- 431 (2001); Altmann K-H et al, Biochim Biophys Acta. 470:M79-M91 (2000); O'Neill V et al., 36th Annual Meeting of the American Society of Clinical Oncology; May 19-23, 2000; New La, LA, Abstract 829; Calvert PM et al. Proceedings of the 11th National Cancer Institute-European Organization for Research and Treatment of Cancer/American Association for Cancer Research Symposium on New Drugs in Cancer Therapy; November 7-10, 2000; Amsterdam, The Netherlands, Abstract 575.
- Patupilone blocked mitosis and induced apoptosis greater than the frequently used anticancer drug paclitaxel. Also, patupilone retained full activity against human cancer cells that were resistant to paclitaxel and other chemotherapeutic agents.
- Midostaurin is an inhibitor of multiple signalling proteins. By targeting specific receptor tyrosine kinases and components of several signal transduction pathways, midostaurin impacts several targets involved in cell growth (e.g., KIT, PDGFR, PKC), leukaemic cell proliferation (e.g., FLT3), and angiogenesis (e.g., VEGFR2).
- KIT KIT
- PDGFR PDGFR
- PKC leukaemic cell proliferation
- angiogenesis e.g., VEGFR2
- midostaurin showed broad antiproliferative activity against various tumour cell lines, including those that were resistant to several other chemotherapeutic agents.
- the somatostatin analogue pasireotide is a stable cyclohexapeptide with broad somatotropin release inhibiting factor (SRIF) receptor binding.
- SRIF broad somatotropin release inhibiting factor
- LBH589 is a histone deacetylase (HDAC) inhibitor. By blocking the deacetylase activity of HDAC, HDAC inhibitors activate gene transcription of critical genes that cause apoptosis (programmed cell death). By triggering apoptosis, LBH589 induces growth inhibition and regression in tumour cell lines. LBH589 is being tested in phase I clinical trials as an anticancer agent. See also, George P et al., Blood 105(4): 1768-76 (February 15, 2005). [72] AEE788 inhibits multiple receptor tyrosine kinases including EGFR, HER2, and VEGFR 5 which stimulate tumour cell growth and angiogenesis. Traxler P et al., Cancer Res.
- HDAC histone deacetylase
- AEE788 showed high target specificity and demonstrated antiproliferative effects against tumour cell lines and in animal models of cancer. AEE788 also exhibited direct antiangiogenic activity. AEE788 is currently in phase I clinical development.
- AMN 107 is an oral tyrosine kinase inhibitor that targets Bcr-Abl, ICIT, and PDGFR.
- Preclinical studies have shown in cellular assays using Philadelphia chromosome-positive (Ph+) CML cells that AMN 107 is highly potent and has high selectivity for Bcr-Abl, KIT, and PDGFR.
- Ph+ Philadelphia chromosome-positive
- AMN 107 also shows activity against mutated variants of Bcr-Abl.
- AMNl 07 is currently being studied in phase I clinical trials.
- HDAC6 modulating agent is any compound that alters ⁇ e.g., increases or decreases).
- the expression level or biological activity level of HDAC6 polypeptide compared to the expression level or biological activity level of HDAC6 polypeptide in the absence of the HDAC6 modulating agent.
- HDAC6 modulating agent can be a small molecule, antibody, polypeptide, carbohydrate, lipid, nucleotide, or combination thereof.
- the HDAC6 modulating agent can be an organic compound or an inorganic compound.
- expression includes but is not limited to one or more of the following: transcription of the gene into precursor mRNA; splicing and other processing of the precursor mRNA to produce mature mRNA; mRNA stability; translation of the mature mRNA into protein (including codon usage and tRNA availability); and glycosylation and/or other modifications of the translation product, if required for proper expression and function.
- gene means a segment of DNA that contains all the information for the regulated biosynthesis of an RNA product, including promoters, exons, introns, and other untranslated regions that control expression.
- genotype means an unphased 5' to 3' sequence of nucleotide pairs found at one or more polymorphic or mutant sites in a locus on a pair of homologous chromosomes in an individual.
- genotype includes a full- genotype and/or a sub-genotype.
- locus means a location on a chromosome or DNA molecule corresponding to a gene or a physical or phenotypic feature.
- mutant means any heritable or acquired variation from the wild-type that alters the nucleotide sequence thereby changing the protein sequence.
- mutant is used interchangeably with the terms “marker”, “biomarker”, and “target” throughout the specification.
- the term "medical condition” includes, but is not limited to, any condition or disease manifested as one or more physical and/or psychological symptoms for which treatment and/or prevention is desirable, and includes previously and newly identified diseases and other disorders.
- nucleotide pair means the two nucleotides bound to each other between the two nucleotide strands.
- polymorphic site means a position within a locus at which at least two alternative sequences are found in a population, the most frequent of which has a frequency of no more than 99%.
- polymorphism means any sequence variant present at a frequency of >1% in a population.
- the sequence variant may be present at a frequency significantly greater than 1% such as 5% or 10% or more.
- the term may be used to refer to the sequence variation observed in an individual at a polymorphic site.
- Polymorphisms include nucleotide substitutions, insertions, deletions and microsatellites and may, but need not, result in detectable differences in gene expression or protein function.
- polynucleotide means any RNA or DNA, which may be unmodified or modified RNA or DNA.
- Polynucleotides include, without limitation, single- and double-stranded DNA, DNA that is a mixture of single- and double-stranded regions, single- and double-stranded RNA 5 RNA that is mixture of single- and double-stranded regions, and hybrid molecules comprising DNA and RNA that may be single-stranded or, more typically, double-stranded or a mixture of single- and double-stranded regions.
- polynucleotide refers to triple-stranded regions comprising RNA or DNA or both RNA and DNA.
- the term polynucleotide also includes DNAs or RNAs containing one or more modified bases and DNAs or RNAs with backbones modified for stability or for other reasons.
- the polynucleotide contains polynucleotide sequences from the HDAC6 gene.
- polypeptide means any polypeptide comprising two or more amino acids joined to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres.
- Polypeptide refers to both short chains, commonly referred to as peptides, glycopeptides or oligomers, and to longer chains, generally referred to as proteins.
- Polypeptides may contain amino acids other than the 20 gene-encoded amino acids.
- Polypeptides include amino acid sequences modified either by natural processes, such as post- translational processing, or by chemical modification techniques that are well known in the art. Such modifications are well described in basic texts and in more detailed monographs, as well as in a voluminous research literature.
- the polypeptide contains polypeptide sequences from the HDAC6 protein.
- small molecule means a composition that has a molecular weight of less than about 5 kDa and more preferably less than about 2 kDa.
- Small molecules can be, e.g., nucleic acids, peptides, polypeptides, glycopeptides, peptidomimetics, carbohydrates, lipids, lipopolysaccharides, combinations of these, or other organic or inorganic molecules.
- mutant nucleic acid means a nucleic acid sequence, which comprises a nucleotide that is variable within an otherwise identical nucleotide sequence between individuals or groups of individuals, thus, existing as alleles. Such mutant nucleic acids are preferably from about 15 to about 500 nucleotides in length.
- the mutant nucleic acids may be part of a chromosome, or they may be an exact copy of a part of a chromosome, e.g., by amplification of such a part of a chromosome through PCR or through cloning.
- the mutant probes according to the invention are oligonucleotides that are complementary to a mutant nucleic acid.
- SNP nucleic acid means a nucleic acid sequence, which comprises a nucleotide that is variable within an otherwise identical nucleotide sequence between individuals or groups of individuals, thus, existing as alleles. Such SNP nucleic acids are preferably from about 15 to about 500 nucleotides in length.
- the SNP nucleic acids may be part of a chromosome, or they may be an exact copy of a part of a chromosome, e.g., by amplification of such a part of a chromosome through PCR or through cloning.
- the SNP nucleic acids are referred to hereafter simply as "SNPs".
- the SNP probes according to the invention are oligonucleotides that are complementary to a SNP nucleic acid. In a particular embodiment, the SNP is in the HDAC6 gene.
- the term "subject" as used herein refers to any living organism capable of eliciting an immune response.
- the term subject includes, but is not limited to, humans, nonhuman primates such as chimpanzees and other apes and monkey species; farm animals such as cattle, sheep, pigs, goats and horses; domestic mammals such as dogs and cats; laboratory animals including rodents such as mice, rats and guinea pigs, and the like.
- the term does not denote a particular age or sex. Thus, adult and newborn subjects, as well as fetuses, whether male or female, are intended to be covered.
- the administration of an agent or drug to a subject or patient includes self-administration and the administration by another. It is also to be appreciated that the various modes of treatment or prevention of medical conditions as described are intended to mean “substantial”, which includes total but also less than total treatment or prevention, and wherein some biologically or medically relevant result is achieved.
- HDAC6 Mutations and Polymorphisms of the Invention To investigate HDAC6 mutations in association with cancer, e.g., acute myeloid leukemia (AML; a.ka., acute myelogenous leukemia), DHPLC analysis was conducted on blood samples from 15 AML patients and tumor tissue from 30 breast cancer samples.
- AML acute myeloid leukemia
- AML a.ka., acute myelogenous leukemia
- DHPLC analysis was conducted on blood samples from 15 AML patients and tumor tissue from 30 breast cancer samples.
- the two (2) HDAC6 missense mutations identified in AML tissue are summarized below in TABLE 1 and TABLE 2.
- the missense mutations were located in the human HDAC6 gene (NP_006035).
- HDAC6 missense mutations were identified in the present invention.
- Example 1 Bioinformatics analysis of the HDAC6 mutations of the invention are further detailed in Example 1. This mutation may be used to identify patient responsiveness or resistantance to HDAC inhibitors.
- Sequence variation in the human germline consists primarily of SNPs, the remainder being short tandem repeats (including micro-satellites), long tandem repeats (mini-satellites), and other insertions and deletions.
- a SNP is the occurrence of nucleotide variability at a single position in the genome, in which two alternative bases occur at appreciable frequency (i.e., >1%) in the human population.
- a SNP may occur within a gene or within intergenic regions of the genome.
- SNPs have the potential to be important tools for locating genes that are involved in human disease conditions. See e.g., Wang et al, Science 280: 1077-1082 (1998)).
- An association between SNP's and/or mutations and a particular phenotype ⁇ e.g., cancer type) does not necessarily indicate or require that the SNP or mutation is causative of the phenotype. Instead, an association with a SNP may merely be due to genome proximity between a SNP and those genetic factors actually responsible for a given phenotype, such that the SNP and said genetic factors are closely linked. That is, a SNP may be in linkage disequilibrium ("LD") with the "true" functional variant. LD exists when alleles at two distinct locations of the genome are more highly associated than expected. Thus, a SNP may serve as a marker that has value by virtue of its proximity to a mutation or other DNA alteration (e.g., gene duplication) that causes a particular phenotype.
- LD linkage disequilibrium
- SNPs and mutations that are associated with disorders may also have a direct effect on the function of the genes in which they are located.
- a sequence variant e.g., SNP
- SNP may result in an amino acid change or may alter exon-intron splicing, thereby directly modifying the relevant protein, or it may exist in a regulatory region, altering the cycle of expression or the stability of the mRNA (see, e.g., Nowotny et al., Current Opinions in Neurobiology, 11:637-641 (2001)).
- nucleic acid molecules containing the gene may be complementary double stranded molecules and thus reference to a particular site on the sense strand refers as well to the corresponding site on the complementary antisense strand. That is, reference may be made to the same polymorphic or mutant site on either strand and an oligonucleotide may be designed to hybridize specifically to either strand at a target region containing the polymorphic and/or mutant site.
- the invention also includes single-stranded polynucleotides and mutations that are complementary to the sense strand of the genomic variants described herein.
- SNPs and Mutations Many different techniques can be used to identify and characterize SNPs and mutations, including single- strand conformation polymorphism (SSCP) analysis, heteroduplex analysis by denaturing high-performance liquid chromatography (DHPLC) 5 direct DNA sequencing and computational methods (Shi et al, CHn Chem 47:164-172 (2001)).
- SSCP single- strand conformation polymorphism
- DPLC denaturing high-performance liquid chromatography
- the most ' common SNP-typing methods currently include hybridization, primer extension, and cleavage methods. Each of these methods must be connected to an appropriate detection system.
- Detection technologies include fluorescent polarization (Chan et al, Genome Res. 9:492-499 (1999)), luminometric detection of pyrophosphate release (pyrosequencing) (Ahmadiian et ah, Anal. Biochem. 280:103-10 (2000)), fluorescence resonance energy transfer (FRET)-based cleavage assays, DHPLC, and mass spectrometry (Shi, Clin Chem 47:164-172 (2001); U.S. Pat. No. 6,300,076 Bl). Other methods of detecting and characterizing SNPs and mutations are those disclosed in U.S. Pat. Nos. 6,297,018 Bl and 6,300,063 Bl.
- the detection of polymorphisms and mutations is detected using INVADERTM technology (available from Third Wave Technologies Inc. Madison, Wisconsin USA).
- INVADERTM technology available from Third Wave Technologies Inc. Madison, Wisconsin USA.
- a specific upstream "invader” oligonucleotide and a partially overlapping downstream probe together form a specific structure when bound to complementary DNA template.
- This structure is recognized and cut at a specific site by the Cleavase enzyme, resulting in the release of the 5' flap of the probe oligonucleotide.
- This fragment then serves as the "invader” oligonucleotide with respect to synthetic secondary targets and secondary fluorescently labelled signal probes contained in the reaction mixture. This results in specific cleavage of the secondary signal probes by the Cleavase enzyme.
- Fluorescent signal is generated when this secondary probe (labelled with dye molecules capable of fluorescence resonance energy transfer) is cleaved.
- Cleavases have stringent requirements relative to the structure formed by the overlapping DNA sequences or flaps and can, therefore, be used to specifically detect single base pair mismatches immediately upstream of the cleavage site on the downstream DNA strand.
- Ryan D et al Molecular Diagnosis 4(2): 135-144 (1999) and Lyamichev V et al Nature Biotechnology 17: 292-296 (1999), see also U.S. Pat. Nos. 5,846,717 and 6,001,567.
- the identity of polymorphisms and mutations may also be determined using a mismatch detection technique including, but not limited to, the RNase protection method using riboprobes (Winter et al, Proc. Natl. Acad. ScL USA 82:7575 (1985); Meyers et al, Science 230:1242 (1985)) and proteins which recognize nucleotide mismatches, such as the E. coli mutS protein (Modrich P, Ann Rev Genet 25:229-253 (1991)).
- variant alleles can be identified by single strand conformation polymorphism (SSCP) analysis (Orita et al., Genomics 5:874-879 (1989); Humphries et al, in Molecular Diagnosis of Genetic Diseases, Elles R, ed. (1996) pp. 321-340) or denaturing gradient gel electrophoresis (DGGE) (Wartell et al, Nucl. Acids Res. 18:2699-2706 (1990); Sheffield et al, Proc. Natl. Acad. Sci. USA 86: 232-236 (1989)).
- SSCP single strand conformation polymorphism
- DGGE denaturing gradient gel electrophoresis
- a polymerase-mediated primer extension method may also be used to identify the polymorphisms/mutations.
- multiple polymorphic and/or mutant sites may be investigated by simultaneously amplifying multiple regions of the nucleic acid using sets of allele-specific primers as described in WO 89/10414.
- the invention provides methods and compositions for haplotyping and/or genotyping the genetic polymorphisms (and possibly mutations) in an individual.
- the terms "genotype” and “haplotype” mean the genotype or haplotype containing the nucleotide pair or nucleotide, respectively, that is present at one or more of the novel polymorphic (or mutant) sites described herein and may optionally also include the nucleotide pair or nucleotide present at one or more additional polymorphic (or mutant) sites in the gene.
- the additional polymorphic (and mutant) sites may be currently known polymorphic/mutant sites or sites that are subsequently discovered.
- compositions contain oligonucleotide probes and primers designed to specifically hybridize to one or more target regions containing, or that are adjacent to, a polymorphic or mutant site.
- Oligonucleotide compositions of the invention are useful in methods for genotyping and/or haplotyping a gene in an individual.
- the methods and compositions for establishing the genotype or haplotype of an individual at the novel polymorphic/mutant sites described herein are useful for studying the effect of the polymorphisms and mutations in the aetiology of diseases affected by the expression and function of the protein, studying the efficacy of drugs targeting, predicting individual susceptibility to diseases affected by the expression and function of the protein and predicting individual responsiveness to drugs targeting the gene product.
- Genotyping oligonucleotides of the invention may be immobilized on or synthesized on a solid surface such as a microchip, bead, or glass slide (see, e.g., WO 98/20020 and WO 98/20019).
- Immobilized genotyping oligonucleotides may be used in a variety of polymorphism and mutation detection assays, including but not limited to probe hybridization and polymerase extension assays.
- Immobilized genotyping oligonucleotides of the invention may comprise an ordered array of oligonucleotides designed to rapidly screen a DNA sample for polymorphisms and mutations in multiple genes at the same time.
- An allele-specific oligonucleotide primer of the invention has a 3' terminal nucleotide, or preferably a 3' penultimate nucleotide, that is complementary to only one nucleotide of a particular SNP, thereby acting as a primer for polymerase-mediated extension only if the allele containing that nucleotide is present.
- Allele-specific oligonucleotide (ASO) primers hybridizing to either the coding or noncoding strand are contemplated by the invention.
- An ASO primer for detecting gene polymorphisms and mutations can be developed using techniques known to those of skill in the art.
- genotyping oligonucleotides of the invention hybridize to a target region located one to several nucleotides downstream of one of the novel polymorphic or mutant sites identified herein. Such oligonucleotides are useful in polymerase-mediated primer extension methods for detecting one of the novel polymorphisms or mutations described herein and therefore such genotyping oligonucleotides are referred to herein as "primer-extension oligonucleotides”.
- the 3 '-terminus of a primer-extension oligonucleotide is a deoxynucleotide complementary to the nucleotide located immediately adjacent to the polymorphic/mutant site.
- a genotyping method of the invention involves isolating from an individual a nucleic acid mixture comprising at least one copy of the gene of interest and/or a fragment or flanking regions thereof, and determining the identity of the nucleotide pair at one or more of the polymorphic/mutant sites in the nucleic acid mixture.
- the two "copies" of a germline gene in an individual may be the same on each allele or may be different on each allele.
- the genotyping method comprises determining the identity of the nucleotide pair at each polymorphic and mutant site.
- the nucleic acid mixture is isolated from a biological sample taken from the individual, such as a blood sample, tumour or tissue sample.
- tissue samples include whole blood, tumour or as part of any tissue type, semen, saliva, tears, urine, fecal material, sweat, buccal smears, skin and hair.
- the nucleic acid mixture may be comprised of genomic DNA, mRNA, or cDNA and, in the latter two cases, the biological sample must be obtained from an organ in which the gene may be expressed.
- mRNA or cDNA preparations would not be used to detect polymorphisms or mutations located in introns or in 5' and 3' nontranscribed regions.
- a gene fragment If a gene fragment is isolated, it must usually contain the polymorphic and/or mutant sites to be genotyped. Exceptions can include mutations leading to truncation of the gene where a specific polymorphism may be lost. In these cases, the specific DNA alterations are determined by assessing the flanking sequences of the gene and underscore the need to specifically look for both polymorphisms and mutations.
- One embodiment of the haplotyping method of the invention comprises isolating from an individual a nucleic acid molecule containing only one of the two copies of a gene of interest, or a fragment thereof, and determining the identity of the nucleotide at one or more of the polymorphic or mutant sites in that copy.
- the nucleic acid may be isolated using any method capable of separating the two copies of the gene or fragment.
- any individual clone will only provide haplotype information on one of the two gene copies present in an individual. If haplotype information is desired for the individual's other copy, additional clones will need to be examined. Typically, at least five clones should be examined to have more than a 90% probability of haplotyping both copies of the gene in an individual.
- the nucleotide at each polymorphic or mutant site is identified.
- a haplotype pair is determined for an individual by identifying the phased sequence of nucleotides at one or more of the polymorphic/mutant sites in each copy of the gene that is present in the individual.
- the haplotyping method comprises identifying the phased sequence of nucleotides at each polymorphic/mutant site in each copy of the gene.
- the identifying step is preferably performed with each copy of the gene being placed in separate containers. However, if the two copies are labelled with different tags, or are otherwise separately distinguishable or identifiable, it is possible in some cases to perform the method in the same container.
- first and second copies of the gene are labelled with different first and second fluorescent dyes, respectively, and an allele-specific oligonucleotide labelled with yet a third different fluorescent dye is used to assay the polymorphic/mutant sites, then detecting a combination of the first and third dyes would identify the polymorphism or mutation in the first gene copy, while detecting a combination of the second and third dyes would identify the polymorphism or mutation in the second gene copy.
- the identity of a nucleotide (or nucleotide pair) at a polymorphic and/or mutant site may be determined by amplifying a target region containing the polymorphic and/or mutant sites directly from one or both copies of the gene, or fragments thereof, and sequencing the amplified regions by conventional methods. It will be readily appreciated by the skilled artisan that only one nucleotide will be detected at a polymorphic or mutant site in individuals who are homozygous at that site, while two different nucleotides will be detected if the individual is heterozygous for that site.
- the polymorphism or mutation may be identified directly, known as positive-type identification, or by inference, referred to as negative-type identification.
- a site may be positively determined to be either guanine or cytosine for all individuals homozygous at that site, or both guanine and cytosine, if the individual is heterozygous at that site.
- the site may be negatively determined to be not guanine (and thus cytosine/cytosine) or not cytosine (and thus guanine/guanine).
- Polymorphic and mutant sites in linkage disequilibrium with the polymorphic or mutant sites of the invention may be located in regions of the same gene or in other genomic regions. Genotyping of a polymorphic/mutant site in linkage disequilibrium with the novel polymorphic/mutant sites described herein may be performed by, but is not limited to, any of the above-mentioned methods for detecting the identity of the allele at a polymorphic/mutant site.
- the target regions may be amplified using any oligonucleotide-directed amplification method, including but not limited to polymerase chain reaction (PCR).
- PCR polymerase chain reaction
- LCR ligase chain reaction
- OLA oligonucleotide ligation assay
- Oligonucleotides useful as primers or probes in such methods should specifically hybridize to a region of the nucleic acid that contains or is adjacent to the polymorphic/mutant site.
- the oligonucleotides are between 10 and 35 nucleotides in length and preferably, between 15 and 30 nucleotides in length. Most preferably, the oligonucleotides are 20 to 25 nucleotides long. The exact length of the oligonucleotide will depend on many factors that are routinely considered and practiced by the skilled artisan.
- nucleic acid amplification procedures may be used to amplify the target region including transcription-based amplification systems (U.S. Pat. No. 5,130,238; EP 329,822; U.S. Pat. No. 5,169,766, published PCT patent application WO 89/06700) and isothermal methods (Walker et al, Proc. Natl. Acad. Sci. USA 89: 392-396 (1992)).
- a polymorphism or mutation in the target region may be assayed before or after amplification using one of several hybridization-based methods known in the art. Typically, allele-specif ⁇ c oligonucleotides are utilized in performing such methods.
- the allele-specific oligonucleotides may be used as differently labelled probe pairs, with one member of the pair showing a perfect match to one variant of a target sequence and the other member showing a perfect match to a different variant.
- more than one polymorphic/mutant site may be detected at once using a set of allele-specific oligonucleotides or oligonucleotide pairs.
- the members of the set have melting temperatures within 5°C, and more preferably within 2°C, of each other when hybridizing to each of the polymorphic or mutant sites being detected.
- Hybridizing Allele-Specific Oligonucleotide to a Target Gene Hybridization of an allele-specific oligonucleotide to a target polynucleotide may be performed with both entities in solution, or such hybridization may be performed when either the oligonucleotide or the target polynucleotide is covalently or noncovalently affixed to a solid support. Attachment may be mediated, for example, by antibody-antigen interactions, poly-L-Lys, streptavidin or avidin-biotin, salt bridges, hydrophobic interactions, chemical linkages, UV cross-linking, baking, etc.
- Allele-specific oligonucleotide may be synthesized directly on the solid support or attached to the solid support subsequent to synthesis.
- Solid-supports suitable for use in detection methods of the invention include substrates made of silicon, glass, plastic, paper and the like, which may be formed, for example, into wells (as in 96-well plates), slides, sheets, membranes, fibres, chips, dishes, and beads.
- the solid support may be treated, coated or derivatised to facilitate the immobilization of the allele-specific oligonucleotide or target nucleic acid.
- the genotype or haplotype for the gene of an individual may also be determined by hybridization of a nucleic sample containing one or both copies of the gene to nucleic acid arrays and subarrays such as described in WO 95/11995.
- the arrays would contain a battery of allele-specific oligonucleotides representing each of the polymorphic or mutant sites to be ⁇ included in the genotype or haplotype.
- the present invention provides a method for determining the frequency of a genotype or haplotype in a population.
- the method comprises determining the genotype or the haplotype for a gene present in each member of the population, wherein the genotype or haplotype comprises the nucleotide pair or nucleotide detected at one or more of the polymorphic sites in the gene and mutations identified in the region, and calculating the frequency at which the genotype or haplotype is found in the population.
- the population may be a reference population, a family population, a same sex population, a population group, or a trait population (e.g., a group of individuals exhibiting a trait of interest such as a medical condition or response to a therapeutic treatment).
- a trait population e.g., a group of individuals exhibiting a trait of interest such as a medical condition or response to a therapeutic treatment.
- frequency data for genotypes and/or haplotypes found in a reference population are used in a method for identifying an association between a trait and a genotype or a haplotype.
- the trait may be any detectable phenotype, including but not limited to cancer, susceptibility to a disease or response to a treatment.
- the method involves obtaining data on the frequency of the genotypes or haplotypes of interest in a reference population and comparing the data to the frequency of the genotypes or haplotypes in a population exhibiting the trait.
- Frequency data for one or both of the reference and trait populations may be obtained by genotyping or haplotyping each individual in the populations using one of the methods described above.
- the haplotypes for the trait population may be determined directly or, alternatively, by the predictive genotype to haplotype approach described above.
- the trait is susceptibility to a disease, severity of a disease, the staging of a disease or response to a drug.
- Such methods have applicability in developing diagnostic tests and therapeutic treatments for all pharmacogenetic applications where there is the potential for an association between a genotype and a treatment outcome, including efficacy measurements, PD measurements, PK measurements and side effect measurements.
- the frequency data for the reference and/or trait populations are obtained by accessing previously determined frequency data, which may be in written or electronic form.
- the frequency data may be present in a database that is accessible by a computer. Once the frequency data are obtained, the frequencies of the genotypes or haplotypes of interest in the reference and trait populations are compared.
- the frequencies of all genotypes and/or haplotypes observed in the populations are compared. If a particular genotype or haplotype for the gene is more frequent in the trait population than in the reference population at a statistically significant amount, then the trait is predicted to be associated with that genotype or haplotype.
- the haplotype frequency data for different ethnogeographic groups are examined to determine whether they are consistent with Hardy-Weinberg equilibrium. Hartl DL et al., Principles of Population Genomics, 3rd Ed. (Sinauer Associates, Sunderland, MA, 1997).
- a statistically significant difference between the observed and expected haplotype frequencies could be due to one or more factors including significant inbreeding in the population group, strong selective pressure on the gene, sampling bias, and/or errors in the genotyping process. If large deviations from ⁇ ardy-Weinberg equilibrium are observed in an ethnogeographic group, the number of individuals in that group can be increased to see if the deviation is due to a sampling bias.
- the assigning step involves performing the following analysis. First, each of the possible haplotype pairs is compared to the haplotype pairs in the reference population.
- the individual is preferably haplotyped using a direct molecular haplotyping method such as, for example, those discussed supra.
- statistical analysis is performed by the use of standard ANOVA tests with a Bonferoni correction and/or a bootstrapping method that simulates the genotype phenotype correlation many times and calculates a significance value.
- a calculation may be performed to correct for a significant association that might be found by chance.
- For statistical methods useful in the methods of the present invention see Bailey NTJ, Statistical Methods in Biology, 3 rd Edition (Cambridge Univ. Press, Cambridge, 1997); Waterman MS, Introduction to Computational Biology (CRC Press, 2000) and Bioinformatics, Baxevanis AD & Ouellette BFF, eds.
- the trait of interest is a clinical response exhibited by a patient to some therapeutic treatment, for example, response to a drug targeting or to a therapeutic treatment for a medical condition.
- a detectable genotype or haplotype that is in linkage disequilibrium with a genotype or haplotype of interest may be used as a surrogate marker.
- a genotype that is in linkage disequilibrium with another genotype is indicated where a particular genotype or haplotype for a given gene is more frequent in the population that also demonstrates the potential surrogate marker genotype than in the reference population. If the frequency is statistically significant, then the marker genotype is predictive of that genotype or haplotype, and can be used as a surrogate marker.
- genotype or haplotype data is obtained on the clinical responses exhibited by a population of individuals who received the treatment, hereinafter the "clinical population". This clinical data may be obtained by analyzing the results of a clinical trial that has already been previously conducted and/or by designing and carrying out one or more new clinical trials.
- the individuals included in the clinical population be graded for the existence of the medical condition of interest. This grading of potential patients could employ a standard physical exam or one or more lab tests. Alternatively, grading of patients could use genotyping or haplotyping for situations where there is a strong correlation between haplotype pair and disease susceptibility or severity.
- the therapeutic treatment of interest is administered to each individual in the trial population, and each individual's response to the treatment is measured using one or more predetermined criteria. It is contemplated that in many cases, the trial population will exhibit a range of responses, and that the investigator may choose more than one responder groups (e.g., low, medium, high) made up by the various responses. In addition, the gene for each individual in the trial population is genotyped and/or haplotyped, which may be done before or after administering the treatment.
- a second method for finding correlations between genotype and haplotype content and clinical responses uses predictive models based on error-minimizing optimization algorithms, one of which is a genetic algorithm.
- Judson R Genetic Algorithms and Their Uses in Chemistry, in Reviews in Computational Chemistry, Vol. 10, Lipkowitz KB & Boyd DB 5 eds. (VCH Publishers, New York, 1997) pp. 1-73.
- Simulated annealing Press et al, Numerical Recipes in C: The Art of Scientific Computing, Ch. 10 (Cambridge University Press, Cambridge, 1992)
- neural networks (Rich E & Knight K, Artificial Intelligence, 2nd Edition, Ch.
- Correlations may also be analyzed using analysis of variation (ANOVA) techniques to determine how much of the variation in the clinical data is explained by different subsets of the polymorphic and mutant sites in the gene.
- ANOVA is used to test hypotheses about whether a response variable is caused by or correlates with one or more traits or variables that can be measured (Fisher & vanBelle, supra, Ch. 10).
- correlations between individual response and genotype or haplotype content are created. Correlations may be produced in several ways. In one method, individuals are grouped by their genotype or haplotype (or haplotype pair) (also referred to as a polymorphism/mutation group), and then the averages and standard deviations of clinical responses exhibited by the members of each polymorphism/mutation group are calculated.
- the identification of an association between a clinical response and a genotype or haplotype (or haplotype pair) for the gene may be the basis for designing a diagnostic method to determine those individuals who will or will not respond to the treatment, or alternatively, will respond at a lower level and thus may require more treatment, i.e., a greater dose of a drug or suffer an adverse reaction.
- the diagnostic method may take one of several forms: for example, a direct DNA test (i.e., genotyping or haplotyping one or more of the polymorphic/mutant sites in the gene), a serological test, or a physical exam measurement. The only requirement is that there be a good correlation between the diagnostic test results and the underlying genotype or haplotype. In a preferred embodiment, this diagnostic method uses the predictive genotyping/haplotyping method described above.
- Genotypes and haplotypes that correlate with efficacious drug responses will be used to select patients for therapy of existing diseases.
- Genotypes and haplotypes that correlate with adverse consequences will be used to either modify how the drug is administered ⁇ e.g., dose, schedule or in combination with other drugs) or eliminated as an option.
- the invention also provides a computer system for storing and displaying polymorphism and ⁇ mutation data determined for the gene.
- the computer system comprises a computer processing unit, a display, and a database containing the polymorphism/mutation data.
- the polymorphism/mutation data includes the polymorphisms, mutations, the genotypes and the haplotypes identified for a given gene in a reference population.
- the computer system is capable of producing a display showing haplotypes organized according to their evolutionary relationships.
- a computer may implement any or all analytical and mathematical operations involved in practicing the methods of the present invention.
- the computer may execute a program that generates views (or screens) displayed on a display device and with which the user can interact to view and analyze large amounts of information relating to the gene and its genomic variation, including chromosome location, gene structure, and gene family, gene expression data, polymorphism data, mutation data, genetic sequence data, and clinical population data (e.g., data on ethnogeographic origin, clinical responses, genotypes, and haplotypes for one or more populations).
- the polymorphism and mutation data described herein may be stored as part of a relational database (e.g., an instance of an Oracle database or a set of ASCII flat files). These polymorphism and mutation data may be stored on the computer's hard drive or may, for example, be stored on a CD-ROM or on one or more other storage devices accessible by the computer. For example, the data may be stored on one or more databases in communication with the computer via a network.
- a relational database e.g., an instance of an Oracle database or a set of ASCII flat files.
- the invention provides SNP and mutation probes, which are useful in classifying subjects according to their types of genetic variation.
- the SNP and mutation probes according to the invention are oligonucleotides, which discriminate between SNPs or mutations and the wild-type sequence in conventional allelic discrimination assays.
- the oligonucleotides according to this aspect of the invention are complementary to one allele of the SNP/mutant nucleic acid, but not to any other allele of the SNP/Mutant nucleic acid. Oligonucleotides according to this embodiment of the invention can discriminate between SNPs and mutations in various ways.
- an oligonucleotide of appropriate length will hybridize to one SNP or mutation, but not to any other.
- the oligonucleotide may be labelled using a radiolabel or a fluorescent molecular tag.
- an oligonucleotide of appropriate length can be used as a primer for PCR, wherein the 3' terminal nucleotide is complementary to one allele containing a SNP or mutation, but not to any other allele.
- the presence or absence of amplification by PCR determines the haplotype of the SNP or the specific mutation.
- Genomic and cDNA fragments of the invention comprise at least one novel polymorphic site or mutation identified herein, have a length of at least 10 nucleotides, and may range up to the full length of the gene.
- a fragment according to the present invention is between 100 and 3000 nucleotides in length, and more preferably between 200 and 2000 nucleotides in length, and most preferably between 500 and 1000 nucleotides in length.
- kits of the Invention provides nucleic acid and polypeptide detection kits useful for haplotyping and/or genotyping the genes in an individual. Such kits are useful for classifying individuals for the purpose of classifying individuals. Specifically, the invention encompasses kits for detecting the presence of a polypeptide or nucleic acid corresponding to a marker of the invention in a biological sample, e.g., any tissue or bodily fluid including, but not limited to, serum, plasma, lymph, cystic fluid, urine, stool, cerebrospinal fluid, ascites fluid or blood, and including biopsy samples of body tissue.
- a biological sample e.g., any tissue or bodily fluid including, but not limited to, serum, plasma, lymph, cystic fluid, urine, stool, cerebrospinal fluid, ascites fluid or blood, and including biopsy samples of body tissue.
- the kit can comprise a labelled compound or agent capable of detecting a polypeptide or an mRNA encoding a polypeptide corresponding to a marker of the invention in a biological sample and means for determining the amount of the polypeptide or mRNA in the sample, e.g., an antibody which binds the polypeptide or an oligonucleotide probe which binds to DNA or mRNA encoding the polypeptide.
- Kits can also include instructions for interpreting the results obtained using the kit.
- the invention provides a kit comprising at least two genotyping oligonucleotides packaged in separate containers.
- the kit may also contain other components such as hybridization buffer (where the oligonucleotides are to be used as a probe) packaged in a separate container.
- the kit may contain, packaged in separate containers, a polymerase and a reaction buffer optimized for primer extension mediated by the polymerase, such as in the case of PCR.
- such kit may further comprise a DNA sample collecting means.
- the genotyping primer composition may comprise at least two sets of allele specific primer pairs.
- the two genotyping oligonucleotides are packaged in separate containers.
- the kit can comprise, e.g., (1) a first antibody, e.g., attached to a solid support, which binds to a polypeptide corresponding to a marker or the invention; and, optionally; (2) a second, different antibody which binds to either the polypeptide or the first antibody and is conjugated to a detectable label.
- the kit can comprise, e.g., (1) an oligonucleotide, e.g., a detectably-labelled oligonucleotide, which hybridizes to a nucleic acid sequence encoding a polypeptide corresponding to a marker of the invention; or (2) a pair of primers useful for amplifying a nucleic acid molecule corresponding to a marker of the invention.
- the kit can also comprise, e.g., a buffering agent, a preservative or a protein- stabilizing agent.
- the kit can further comprise components necessary for detecting the detectable-label, e.g., an enzyme or a substrate.
- the kit can also contain a control sample or a series of control samples, which can be assayed and compared to the test sample.
- Each component of the kit can be enclosed within an individual container and all of the various containers can be within a single package, along with instructions for interpreting the results of the assays performed using the kit.
- the present invention includes one or more polynucleotides encoding mutant or polymorphic polypeptides, including degenerate variants thereof.
- the invention also encompasses allelic variants of the same, that is, naturally occurring alternative forms of the isolated polynucleotides that encode mutant polypeptides that are identical, homologous or related to those encoded by the polynucleotides.
- non-naturally occurring variants may be produced by mutagenesis techniques or by direct synthesis techniques well known in the art. Accordingly, nucleic acid sequences capable of hybridizing at low stringency with any nucleic acid sequences encoding mutant polypeptide of the present invention are considered to be within the scope of the invention.
- a typical prehybridization, hybridization, and wash protocol is as follows: (1) prehybridization: incubate nitrocellulose filters containing the denatured target DNA for 3-4 hours at 55°C in 5xDenhardt's solution, 6xSSC (2OxSSC consists of 175 g NaCl, 88.2 g sodium citrate in 800 ml H 2 O adjusted to pH.
- Recombinant Expression Vectors Another aspect of the invention includes vectors containing one or more nucleic acid sequences encoding a mutant or polymorphic polypeptide.
- many conventional techniques in molecular biology, microbiology and recombinant DNA are used. These techniques are well known and are explained in, e.g., Current Protocols in Molecular Biology, VoIs. I-III, Ausubel, ed. (1997); Sambrook et al., Molecular Cloning: A Laboratory Manual, 2 nd Edition. (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989); Glover DN, DNA Cloning: A Practical Approach, VoIs.
- the nucleic acid containing all or a portion of the nucleotide sequence encoding the polypeptide is inserted into an appropriate cloning vector, or an expression vector (i.e., a vector that contains the necessary elements for the transcription and translation of the inserted polypeptide coding sequence) by recombinant DNA techniques well known in the art and as detailed below.
- an expression vector i.e., a vector that contains the necessary elements for the transcription and translation of the inserted polypeptide coding sequence
- expression vectors useful in recombinant DNA techniques are often in the form of plasmids.
- plasmid and “vector” can be used interchangeably as the plasmid is the most commonly used form of vector.
- the invention is intended to include such other forms of expression vectors that are not technically plasmids, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent functions.
- viral vectors e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses
- Such viral vectors permit infection of a subject and expression in that subject of a compound. Becker et ah, Meth. Cell Biol. A3: 161 89 (1994).
- the recombinant expression vectors of the invention comprise a nucleic acid encoding a mutant or polymorphic polypeptide in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory sequences, selected on the basis of the host cells to be used for expression that is operatively linked to the nucleic acid sequence to be expressed.
- "operably linked" is intended to mean that the nucleotide sequence of interest is linked to the regulatory sequences in a manner that allows for expression of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell).
- regulatory sequence is intended to include promoters, enhancers and other expression control elements (e.g., polyadenylation signals). Such regulatory sequences are described, for example, in Goeddel, Gene Expression Technology: Methods In Enzymology (Academic Press, San Diego, Calif., 1990). Regulatory sequences include those that direct constitutive expression of a nucleotide sequence in many types of host cell and those that direct expression of the nucleotide sequence only in certain host cells (e.g., tissue specific regulatory sequences). It will be appreciated by those skilled in the art that the design of the expression vector can depend on such factors as the choice of the host cell to be transformed, the level of expression of polypeptide desired, etc.
- the expression vectors of the invention can be introduced into host cells to thereby produce polypeptides or peptides, including fusion polypeptides, encoded by nucleic acids as described herein (e.g., mutant polypeptides and mutant-derived fusion polypeptides, etc.).
- mutant and Polymorphic Polypeptide-Expressing Host Cells Another aspect of the invention pertains to mutant and polymorphic polypeptide-expressing host cells, which contain a nucleic acid encoding one or more mutant/polymorphic polypeptides of the invention.
- the desired isogene may be introduced into a host cell in a vector such that the isogene remains extrachromosomal. In such a situation, the gene will be expressed by the cell from the extrachromosomal location.
- the isogene is introduced into a cell in such a way that it recombines with the endogenous gene present in the cell.
- Such recombination requires the occurrence of a double recombination event, thereby resulting in the desired gene polymorphism or mutation.
- Vectors for the introduction of genes both for recombination and for extrachromosomal maintenance are known in the art, and any suitable vector or vector construct may be used in the invention. Methods such as electroporation, particle bombardment, calcium phosphate co-precipitation and viral transduction for introducing DNA into cells are known in the art; therefore, the choice of method may lie with the competence and preference of the skilled practitioner.
- the recombinant expression vectors of the invention can be designed for expression of mutant polypeptides in prokaryotic or eukaryotic cells.
- mutant/polymorphic polypeptides can be expressed in bacterial cells such as Escherichia coli (E. col ⁇ ), insect cells (using baculovirus expression vectors), fungal cells, e.g., yeast, yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, Gene Expression Technology: Methods In Enzymology (Academic Press, San Diego, Calif., 1990).
- the recombinant expression vector can be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.
- the SMP2 promoter is useful in the expression of polypeptides in smooth muscle cells, Qian et al., Endocrinology 140(4): 1826 (1999).
- Fusion vectors add a number of amino acids to a polypeptide encoded therein, usually to the amino terminus of the recombinant polypeptide.
- Such fusion vectors typically serve three purposes: (i) to increase expression of recombinant polypeptide; (ii) to increase the solubility of the recombinant polypeptide; and (iii) to aid in the purification of the recombinant polypeptide by acting as a ligand in affinity purification.
- a proteolytic cleavage site is introduced at the junction of the fusion moiety and the recombinant polypeptide to enable separation of the recombinant polypeptide from the fusion moiety subsequent to purification of the fusion polypeptide.
- enzymes, and their cognate recognition sequences include Factor Xa, thrombin and enterokinase.
- Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and Johnson, Gene 67: 31 40 (1988)), pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia, Piscataway, NJ.) that fuse glutathione S transferase (GST), maltose E binding polypeptide, or polypeptide A, respectively, to the target recombinant polypeptide.
- GST glutathione S transferase
- maltose E binding polypeptide or polypeptide A, respectively, to the target recombinant polypeptide.
- coli expression vectors include pTrc (Amrann et al, Gene 69:301 315 (1988)) and pET Hd (Stud ⁇ er et al, Gene Expression Technology: Methods In Enzymology (Academic Press, San Diego, Calif., 1990) pp.60-89).
- One strategy to maximize recombinant polypeptide expression in E. coli is to express the polypeptide in host bacteria with an impaired capacity to proteolytically cleave the recombinant polypeptide. See, e.g., Gottesman, Gene Expression Technology: Methods In Enzymology (Academic Press, San Diego, Calif., 1990) 119 128.
- Another strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an expression vector so that the individual codons for each amino acid are those preferentially utilized in the expression host, e.g., E. coli (see, e.g., Wada et al, Nucl. Acids Res. 20: 2111-2118 (1992)). Such alteration of nucleic acid sequences of the invention can be carried out by standard DNA synthesis techniques.
- the mutant/polymorphic polypeptide expression vector is a yeast expression vector.
- yeast expression vectors for expression in yeast Saccharamyces cerivisiae include pYepSecl (Baldari et al, EMBOJ.
- mutant polypeptide can be expressed in insect cells using baculovirus expression vectors.
- Baculovirus vectors available for expression of polypeptides in cultured insect cells include the pAc series (Smith et al, MoI. Cell. Biol.
- a nucleic acid of the invention is expressed in mammalian cells using a mammalian expression vector.
- mammalian expression vectors include pCDM8 (Seed, Nature 329: 842 846 (1987)) and pMT2PC (Kaufman et al, EMBO J. 6: 187 195 (1987)).
- the expression vector's control functions are often provided by viral regulatory elements.
- promoters are derived from polyoma, adenovirus 2, cytomegalovirus, and simian virus 40.
- suitable expression systems for both prokaryotic and eukaryotic cells see, e.g., Chapters 16 and 17 of Sambrook, et al, Molecular Cloning: A Laboratory Manual, 2nd Ed.(Co ⁇ Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989).
- the recombinant mammalian expression vector is capable of directing expression of the nucleic acid preferentially in a particular cell type (e.g., tissue specific regulatory elements are used to express the nucleic acid).
- tissue specific regulatory elements are known in the art.
- suitable tissue specific promoters include the albumin promoter (liver specific; Pinkert, et al, Genes Dev. 1: 268 277 (1987)), lymphoid specific promoters (Calame & Eaton, Adv. Immunol. 43: 235 275 (1988)), in particular promoters of T cell receptors (Winoto & Baltimore, EMBO J.
- the invention further provides a recombinant expression vector comprising a DNA molecule of the invention cloned into the expression vector in an antisense orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in a manner that allows for expression (by transcription of the DNA molecule) of an RNA molecule that is antisense to a mutant polypeptide mRNA.
- Regulatory sequences operatively linked to a nucleic acid cloned in the antisense orientation can be chosen that direct the continuous expression of the antisense RNA molecule in a variety of cell types, for instance viral promoters and/or enhancers, or regulatory sequences can be chosen that direct constitutive, tissue specific or cell type specific expression of antisense RNA.
- the antisense expression vector can be in the form of a recombinant plasmid, phagemid or attenuated virus in which antisense nucleic acids are produced under the control of a high efficiency regulatory region, the activity of which can be determined by the cell type into which the vector is introduced.
- Another aspect of the invention pertains to host cells into which a recombinant expression vector of the invention has been introduced.
- the terms "host cell” and “recombinant host cell” are used interchangeably herein. It is understood that such terms refer not only to the particular subject cell but also to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein.
- a host cell can be any prokaryotic or eukaryotic cell.
- mutant polypeptide can be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells).
- bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells).
- mammalian cells such as Chinese hamster ovary cells (CHO) or COS cells.
- Other suitable host cells are known to those skilled in the art.
- Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques.
- transformation and “transfection” are intended to refer to a variety of art recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium chloride co precipitation, DEAE dextran mediated transfection, lipofection, or electroporation. Suitable methods for transforming or transfecting host cells can be found in Sambrook, et al., Molecular Cloning: A Laboratory Manual, 2nd ed. (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989), and other laboratory manuals.
- a gene that encodes a selectable marker (e.g., resistance to antibiotics) is generally introduced into the host cells along with the gene of interest.
- selectable markers include those that confer resistance to drugs, such as G418, hygromycin and methotrexate.
- Nucleic acid encoding a selectable marker can be introduced into a host cell on the same vector as that encoding mutant polypeptide or can be introduced on a separate vector.
- a host cell that includes a compound of the invention such as a prokaryotic or eukaryotic host cell in culture, can be used to produce (i.e., express) recombinant mutant/polymorphic polypeptide.
- the method comprises culturing the host cell of invention (into which a recombinant expression vector encoding mutant/polymorphic polypeptide has been introduced) in a suitable medium such that mutant polypeptide is produced.
- the method further comprises the step of isolating mutant/polymorphic polypeptide from the medium or the host cell.
- Purification of recombinant polypeptides is well known in the art and includes ion exchange purification techniques, or affinity purification techniques, for example with an antibody to the compound. Methods of creating antibodies to the compounds of the present invention are discussed below.
- Transgenic Animals Recombinant organisms, i.e., transgenic animals, expressing a variant gene of the invention are prepared using standard procedures known in the art. Transgenic animals carrying the constructs of the invention can be made by several methods known to those having skill in the art. See, e.g., U.S. Pat. No. 5,610,053 and "The Introduction of Foreign Genes into Mice" and the cited references therein, in: Recombinant DNA, Watson JD, Gilman M, Witkowski J & Zoller M, eds. (W.H. Freeman and Company, New York) pp. 254-272.
- Transgenic animals stably expressing a human isogene and producing human protein can be used as biological models for studying diseases related to abnorraal expression and/or activity, and for screening and assaying various candidate drugs, compounds, and treatment regimens to reduce the symptoms or effects of these diseases.
- Characterizing Gene Expression Level Methods to detect and measure mRNA levels (i.e., gene transcription level) and levels of polypeptide gene expression products (i.e., gene translation level) are well-known in the art and include the use of nucleotide microarrays and polypeptide detection methods involving mass spectrometers, reverse-transcription and amplification and/or antibody detection and quantification techniques. See also, Strachan T & Read A, Human Molecular Genetics, 2 nd Edition. (John Wiley and Sons, Inc. Publication, New York, 1999)).
- RNA isolation technique that does not select against the isolation of mRNA can be utilized for the purification of RNA from cells. See, e.g., Ausubel et at, Ed., Curr. Prot. MoI. Biol. (John Wiley & Sons, New York, 1987-1999).
- the level of the mRNA expression product of the target gene is determined.
- Methods to measure the level of a specific mRNA are well-known in the art and include Northern blot analysis, reverse transcription PCR and real time quantitative PCR or by hybridization to a oligonucleotide array or microarray.
- the determination of the level of expression may be performed by determination of the level of the protein or polypeptide expression product of the gene in body fluids or tissue samples including but not limited to blood or serum. Large numbers of tissue samples can readily be processed using techniques well-known to those of skill in the art, such as, e.g., the single-step RNA isolation process of U.S. Pat. No. 4,843,155.
- the isolated mRNA can be used in hybridization or amplification assays that include, but are not limited to, Southern or Northern analyses, PCR analyses and probe arrays.
- One preferred diagnostic method for the detection of mRNA levels involves contacting the isolated mRNA with a nucleic acid molecule (probe) that can hybridize to the mRNA encoded by the gene being detected.
- the nucleic acid probe can be, e.g., a full-length cDNA, or a portion thereof, such as an oligonucleotide of at least 7, 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under stringent conditions to an mRNA or genomic DNA encoding a marker of the present invention.
- Other suitable probes for use in the diagnostic assays of the invention are described herein. Hybridization of an mRNA with the probe indicates that the marker in question is being expressed.
- the probes are immobilized on a solid surface and the mRNA is contacted with the probes, for example, in an Affymetrix gene chip array (Affymetrix, Calif. USA).
- a skilled artisan can readily adapt known mRNA detection methods for use in detecting the level of mRNA encoded by the markers of the present invention.
- An alternative method for determining the level of mRNA corresponding to a marker of the present invention in a sample involves the process of nucleic acid amplification, e.g., by RT-PCR (the experimental embodiment set forth in U.S. Pat. No. 4,683,202); ligase chain reaction (Barany et ah, Proc. Natl. Acad.
- amplification primers are defined as being a pair of nucleic acid molecules that can anneal to 5' or 3' regions of a gene (plus and minus strands, respectively, or vice- versa) and contain a short region in between. In general, amplification primers are from about 10-30 nucleotides in length and flank a region from about 50-200 nucleotides in length.
- RT-PCR Real-time quantitative PCR
- the RT-PCR assay utilizes an RNA reverse transcriptase to catalyze the synthesis of a DNA strand from an RNA strand, including an mRNA strand.
- the resultant DNA may be specifically detected and quantified and this process may be used to determine the levels of specific species of mRNA.
- TAQMAN® PE Applied Biosystems, Foster City, Calif., USA
- AMPLITAQ GOLDTM DNA polymerase exploits the 5' nuclease activity of AMPLITAQ GOLDTM DNA polymerase to cleave a specific form of probe during a PCR reaction.
- TAQMANTM probe See Luthra et ah, Am. J. Pathol. 153: 63-68 (1998); Kuimelis et ah, Nucl. Acids Symp. Ser. 37: 255-256 (1997); and Mullah et al, Nucl. Acids Res. 26(4): 1026- 1031 (1998)).
- cleavage of the probe separates a reporter dye and a quencher dye, resulting in increased fluorescence of the reporter.
- the accumulation of PCR products is detected directly by monitoring the increase in fluorescence of the reporter dye. Heid et ah, Genome Res. 6(6): 986-994 (1996)). The higher the starting copy number of nucleic acid target, the sooner a significant increase in fluorescence is observed. See Gibson, Heid & Williams et ah, Genome Res. 6: 995-1001 (1996).
- Detection of Polypeptides can be detected by a probe which is detectably labelled, or which can be subsequently labelled.
- the term "labelled", with regard to the probe or antibody is intended to encompass direct-labelling of the probe or antibody by coupling, i.e., physically linking, a detectable substance to the probe or antibody, as well as indirect- labelling of the probe or antibody by reactivity with another reagent that is directly-labelled. Examples of indirect labelling include detection of a primary antibody using a fluorescently- labelled secondary antibody and end-labelling of a DNA probe with biotin such that it can be detected with fluorescently-labelled streptavidin.
- the probe is an antibody that recognizes the expressed protein.
- a variety of formats can be employed to determine whether a sample contains a target protein that binds to a given antibody.
- Immunoassay methods useful in the detection of target polypeptides of the present invention include, but are not limited to, e.g., dot blotting, western blotting, protein chips, competitive and noncompetitive protein binding assays, enzyme-linked immunosorbant assays (ELISA), immunohistochemistry. fluorescence activated cell sorting (FACS) 5 and others commonly used and widely-described in scientific and patent literature, and many employed commercially.
- a skilled artisan can readily adapt known protein/antibody detection methods for use in determining whether cells express a marker of the present invention and the relative concentration of that specific polypeptide expression product in blood or other body tissues.
- Proteins from individuals can be isolated using techniques that are well-known to those of skill in the art. The protein isolation methods employed can, e.g., be such as those described in Harlow & Lane, Antibodies: A Laboratory Manual (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1988)).
- various host animals may be immunized by injection with the polypeptide, or a portion thereof.
- host animals may include, but are not limited to, rabbits, mice and rats.
- Various adjuvants may be used to increase the immunological response, depending on the host species including, but not limited to, Freund's (complete and incomplete), mineral gels, such as aluminium hydroxide; surface active substances, such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin and dinitrophenol; and potentially useful human adjuvants, such as bacille Camette-Guerin (BCG) and Coryne bacterium parvum.
- BCG Bacille Camette-Guerin
- Monoclonal antibodies which are homogeneous populations of antibodies to a particular antigen, may be obtained by any technique that provides for the production of antibody molecules by continuous cell lines in culture. These include, but are not limited to, the hybridoma technique of Kohler & Milstein, Nature 256: 495-497 (1975); and U.S. Pat. No. 4,376,110; the human B-cell hybridoma technique of Kosbor et ah, Immunol. Today 4: 72 (1983); Cole et al, Proc. Natl. Acad. ScL USA 80: 2026-2030 (1983); and the EBV- hybridoma technique of Cole et al., Monoclonal Antibodies and Cancer Therapy (Alan R. Liss, Inc., 1985) pp. 77-96.
- chimeric antibodies are a molecule in which different portions are derived from different animal species, such as those having a variable or hypervariable region derived form a murine mAb and a human immunoglobulin constant region.
- Suitable solid phase supports or carriers include any support capable of binding an antigen or an antibody.
- Well-known supports or carriers include glass, polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, polyacrylamides, gabbros and magnetite.
- a useful method for ease of detection, is the sandwich ELISA, of which a number of variations exist, all of which are intended to be used in the methods and assays of the present invention.
- sandwich assay is intended to encompass all variations on the basic two-site technique. Immunofluorescence and EIA techniques are both very well- established in the art. However, other reporter molecules, such as radioisotopes, chemiluminescent or bioluminescent molecules may also be employed. It will be readily apparent to the skilled artisan how to vary the procedure to suit the required use.
- Whole genome monitoring of protein i.e., the "proteome” can be carried out by constructing a microarray in which binding sites comprise immobilized, preferably monoclonal, antibodies specific to a plurality of protein species encoded by the cell genome.
- binding sites comprise immobilized, preferably monoclonal, antibodies specific to a plurality of protein species encoded by the cell genome.
- antibodies are present for a substantial fraction of the encoded proteins, or at least for those proteins relevant to testing or confirming a biological network model of interest.
- methods for making monoclonal antibodies are well-known. See, e.g., Harlow & Lane, Antibodies: A Laboratory ManuaV (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1988)).
- monoclonal antibodies are raised against synthetic peptide fragments designed based on genomic sequence of the cell. With such an antibody array, proteins from the cell are contacted to the array and their binding is measured with assays known in the art.
- Two-dimensional gel electrophoresis is well-known in the art and typically involves isoelectric focusing along a first dimension followed by SDS-PAGE electrophoresis along a second dimension. See, e.g., Hames et al, Gel Electrophoresis of Proteins: A Practical Approach (IRL Press, New York, 1990); Shevchenko et al, Proc. Natl. Acad. ScL USA 93: 14440-14445 (1996); Sagliocco et al., Yeast 12: 1519-1533 (1996); and Lander, Science TJA: 536-539 (1996)).
- MS-based analysis methodology is useful for analysis of isolated target polypeptide as well as analysis of target polypeptide in a biological sample.
- MS formats for use in analyzing a target polypeptide include ionization (I) techniques, such as, but not limited to, matrix assisted laser des ⁇ rption (MALDI), continuous or pulsed electrospray ionization (ESI) and related methods, such as ionspray or thermospray, and massive cluster impact (MCI).
- I ionization
- MALDI matrix assisted laser des ⁇ rption
- ESI electrospray ionization
- MCI massive cluster impact
- Such ion sources can be matched with detection formats, including linear or non-linear reflectron time of flight (TOF), single or multiple quadrupole, single or multiple magnetic sector Fourier transform ion cyclotron resonance (FTICR), ion trap and combinations thereof such as ion-trap/TOF.
- TOF linear or non-linear reflectron time of flight
- FTICR magnetic sector Fourier transform ion cyclotron resonance
- ion trap and combinations thereof such as ion-trap/TOF.
- numerous matrix/wavelength combinations e.g., matrix assisted laser desorption (MALDI)
- solvent combinations e.g., ESI
- the target polypeptide can be solubilised in an appropriate solution or reagent system.
- a solution or reagent system e.g., an organic or inorganic solvent
- MS of peptides also is described, e.g., in International PCT Application No. WO 93/24834 and U.S.
- a solvent is selected that minimizes the risk that the target polypeptide will be decomposed by the energy introduced for the vaporization process.
- a reduced risk of target polypeptide decomposition can be achieved, e.g., by embedding the sample in a matrix.
- a suitable matrix can be an organic compound such as a sugar, e.g., a pentose or hexose, or a polysaccharide such as cellulose. Such compounds are decomposed thermolytically into CO 2 and H 2 O such that no residues are formed that can lead to chemical reactions.
- the matrix also can be an inorganic compound, such as nitrate of ammonium, which is decomposed essentially without leaving any residue.
- Electrospray MS has been described by Fenn et al., J. Phys. Chem. 88: 4451-4459 (1984); and PCT Application No. WO 90/14148; and current applications are summarized in review articles. See Smith et al, Anal. Chem. 62: 882-89 (1990); and Ardrey, Spectroscopy 4: 10-18 (1992).
- the mass of a target polypeptide determined by MS can be compared to the mass of a corresponding known polypeptide.
- the corresponding known polypeptide can be the corresponding non-mutant protein, e.g., wild-type protein.
- ESI the determination of molecular weights in femtomole amounts of sample is very accurate due to the presence of multiple ion peaks, all of which can be used for mass calculation.
- Sub-attomole levels of protein have been detected, e.g., using ESI MS (Valaskovic et al, Science 273: 1199-1202 (1996)) and MALDI MS (Li et al, J. Am. Chem. Soc. 118: 1662-1663 (1996)).
- Matrix Assisted Laser Desorption The level of the target protein in a biological sample, e.g., body fluid or tissue sample, may be measured by means of mass spectrometric (MS) methods including, but not limited to, those techniques known in the art as matrix-assisted laser desorption/ionization, time-of-flight mass spectrometry (MALDI- TOF-MS) and surfaces enhanced for laser desorption/ionization, time-of-flight mass spectrometry (SELDI-TOF-MS) as further detailed below.
- MS mass spectrometric
- Methods for performing MALDI are well-known to those of skill in the art. See, e.g., Juhasz et al, Analysis, Anal. Chem.
- MALDI-TOF-MS has been described by Hillenkamp et al., Biological Mass Spectrometry, Burlingame & McCloskey, eds. (Elsevier Science Publ., Amsterdam, 1990) pp. 49-60. [191] A variety of techniques for marker detection using mass spectroscopy can be used. See Bordeaux Mass Spectrometry Conference Report, Hillenkamp, Ed., pp.
- MS techniques allow the successful volatilization of high molecular weight biopolymers, without fragmentation, and have enabled a wide variety of biological macromolecules to be analyzed by mass spectrometry.
- SMDI Surfaces Enhanced for Laser Desorption/Ionization
- Other techniques are used which employ new MS probe element compositions with surfaces that allow the probe element to actively participate in the capture and docking of specific analytes, described as Affinity Mass Spectrometry (AMS). See SELDI patents U.S. Pat. Nos. 5,719,060; 5,894,063; 6,020,208; 6,027,942; 6,124,137; and U.S. Patent application No. U.S. 2003/0003465.
- SEAC probe elements have been designed with Surfaces Enhanced for Affinity Capture (SEAC). See Hutchens & Yip, Rapid Commun. Mass Spectrom. 7: 576-580 (1993).
- SEAC probe elements have been used successfully to retrieve and tether different classes of biopolymers, particularly proteins, by exploiting what is known about protein surface structures and biospecific molecular recognition.
- the immobilized affinity capture devices on the MS probe element surface, i.e., SEAC determines the location and affinity (specificity) of the analyte for the probe surface, therefore the subsequent analytical MS process is efficient.
- SELDI Surfaces Enhanced for Neat Desorption
- the probe element surfaces i.e., sample presenting means
- SEAC Energy Absorbing Molecules
- the probe element surfaces, i.e., sample presenting means are designed to contain chemically defined and/or biologically defined affinity capture devices to facilitate either the specific or non-specific attachment or adsorption (so-called docking or tethering) of analytes to the probe surface, by a variety of mechanisms (mostly non-covalent)
- SEPAR Surfaces Enhanced for Photolabile Attachment and Release
- the chemical specificities determining the type and number of the photolabile molecule attachment points between the SEPAR sample presenting means ⁇ i.e., probe element surface) and the analyte (e.g., protein) may involve any one or more of a number of different residues or chemical structures in the analyte ⁇ e.g., His, Lys, Arg, Tyr, Phe and Cys residues in the case of proteins and peptides).
- a polypeptide of interest also can be modified to facilitate conjugation to a solid support.
- a chemical or physical moiety can be incorporate into the polypeptide at an appropriate position.
- a polypeptide of interest can be modified by adding an appropriate functional group to the carboxyl terminus or amino terminus of the polypeptide, or to an amino acid in the peptide, ⁇ e.g., to a reactive side chain, or to the peptide backbone.
- a naturally-occurring amino acid normally present in the polypeptide also can contain a functional group suitable for conjugating the polypeptide to the solid support.
- a cysteine residue present in the polypeptide can be used to conjugate the polypeptide to a support containing a sulfhydryl group through a disulfide linkage, e.g., a support having cysteine residues attached thereto.
- bonds that can be formed between two amino acids include, but are not limited to, e.g., monosulfide bonds between two lanthionine residues, which are non- naturally-occurring amino acids that can be incorporated into a polypeptide; a lactam bond formed by a transamidation reaction between the side chains of an acidic amino acid and a basic amino acid, such as between the y-carboxyl group of GIu (or alpha carboxyl group of Asp) and the amino group of Lys; or a lactone bond produced, e.g., by a crosslink between the hydroxy group of Ser and the carboxyl group of GIu (or alpha carboxyl group of Asp).
- a solid support can be modified to contain a desired amino acid residue, e.g., a GIu residue, and a polypeptide having a Ser residue, particularly a Ser residue at the N-terminus or C- terminus, can be conjugated to the solid support through the formation of a lactone bond.
- the support need not be modified to contain the particular amino acid, e.g., GIu, where it is desired to form a lactone-like bond with a Ser in the polypeptide, but can be modified, instead, to contain an accessible carboxyl group, thus providing a function corresponding to the alpha carboxyl group of GIu.
- a thiol-reactive functionality is particularly useful for conjugating a polypeptide to a solid support.
- a thiol-reactive functionality is a chemical group that can rapidly react with a nucleophilic thiol moiety to produce a covalent bond, e.g., a disulfide bond or a thioether bond.
- thiol-reactive functionalities include, e.g., haloacetyls, such as iodoacetyl; diazoketones; epoxy ketones, alpha- and beta-unsaturated carbonyls, such as alpha-enones and beta-enones; and other reactive Michael acceptors, such as maleimide; acid halides; benzyl halides; and the like. See Greene & Wuts, Protective Groups in Organic Synthesis, 2 nd Edition (John Wiley & Sons, 1991).
- the thiol groups can be blocked with a photocleavable protecting group, which then can be selectively cleaved, e.g., by photolithography, to provide portions of a surface activated for immobilization of a polypeptide of interest.
- Photocleavable protecting groups are known in the art (see, e.g., published International PCT Application No. WO 92/10092; and McCray et al, Ann. Rev. Biophys. Biophys. Chem. 18: 239-270 (1989)) and can be selectively de-blocked by irradiation of selected areas of the surface using, e.g., a photolithography mask.
- Linkers A polypeptide of interest can be attached directly to a support via a linker. Any linkers known to those of skill in the art to be suitable for linking peptides or amino acids to supports, either directly or via a spacer, may be used. For example, the polypeptide can be conjugated to a support, such as a bead, through means of a variable spacer.
- Linkers include, Rink amide linkers (see, e.g., Rink, Tetrahedron Lett. 28: 3787 (1976)); trityl chloride linkers (see, e.g., Leznoff, Ace Chem. Res.
- linkers see, e.g., Bodansky et al, Peptide Synthesis, 2 nd Edition (Academic Press, New York, 1976)
- trityl linkers are known. See, e.g., U.S. Pat. Nos. 5,410,068 and 5,612,474.
- Amino trityl linkers are also known. See, e.g., U.S. Pat. No. 5,198,531.
- Other linkers include those that can be incorporated into fusion proteins and expressed in a host cell. Such linkers may be selected amino acids, enzyme substrates or any suitable peptide.
- the linker may be made, e.g., by appropriate selection of primers when isolating the nucleic acid. Alternatively, they may be added by post-translational modification of the protein of interest.
- Linkers that are suitable for chemically linking peptides to supports include disulfide bonds, thioether bonds, hindered disulfide bonds and covalent bonds between free reactive groups, such as amine and thiol groups.
- Cleavable Linkers A linker can provide a reversible linkage such that it is cleaved under the select conditions.
- selectively cleavable linkers including photocleavable linkers (see U.S. Pat. No. 5,643,722), acid cleavable linkers (see Fattom et al, Infect. Immun. 60: 584-589 (1992)), acid-labile linkers (see Welh ⁇ ner et al, J. Biol. Chem. 266: 4309-4314 (1991)) and heat sensitive linkers are useful.
- a linkage can be, e.g., a disulfide bond, which is chemically cleavable by mercaptoethanol or dithioerythrol; a biotin/streptavidin linkage, which can be photocleavable; a heterobifunctional derivative of a trityl ether group, which can be cleaved by exposure to acidic conditions or under conditions of MS (see K ⁇ ster et al, Tetrahedron Lett.
- a levulinyl-mediated linkage which can be cleaved under almost neutral conditions with a hydrazinium/acetate buffer; an arginine-arginine or a lysine-lysine bond, either of which can be cleaved by an endopeptidase, such as trypsin; a pyrophosphate bond, which can be cleaved by a pyrophosphatase; or a ribonucleotide bond, which can be cleaved using a ribonuclease or by exposure to alkali condition.
- an endopeptidase such as trypsin
- a pyrophosphate bond which can be cleaved by a pyrophosphatase
- a ribonucleotide bond which can be cleaved using a ribonuclease or by exposure to alkali condition.
- a photolabile cross-linker such as 3-amino-(2-nitrophenyl)propionic acid can be employed as a means for cleaving a polypeptide from a solid support.
- Other linkers include RNA linkers that are cleavable by ribozymes and other RNA enzymes and linkers, such as the various domains, such as CHi, CH 2 and CH 3 , from the constant region of human IgGl. See, Batra et al., Mol Immunol 30: 379-396 (1993).
- linker that is cleavable under MS conditions, such as a silyl linkage or photocleavable linkage
- a linker such as an avidin biotin linkage
- Acid-labile linkers are particularly useful chemically cleavable linkers for mass spectrometry, especially for MALDI-TOF, because the acid labile bond is cleaved during conditioning of the target polypeptide upon addition of a 3-HPA matrix solution.
- the acid labile bond can be introduced as a separate linker group, e.g., an acid labile trityl group, or can be incorporated in a synthetic linker by introducing one or more silyl bridges using diisopropylysilyl, thereby forming a diisopropylysilyl linkage between the polypeptide and the solid support.
- the diisopropylysilyl linkage can be cleaved using mildly acidic conditions, such as 1.5% trifluoroacetic acid (TFA) or 3-HPA/l% TFA MALDI-TOF matrix solution.
- TFA trifluoroacetic acid
- Methods for the preparation of diisopropylysilyl linkages and analogues thereof are well-known in the art. See, e.g., Saha. etal., J. Org. Chem. 58: 7827-7831 (1993).
- Pin tools include those disclosed herein or otherwise known in the art. See, e.g., U.S. Application Serial Nos. 08/786,988 and 08/787,639; and International PCT Application No. WO 98/20166.
- a pin tool in an array can be applied to wells containing polypeptides of interest.
- the pin tool has a functional group attached to each pin tip, or a solid support, e.g., functionalized beads or paramagnetic beads are attached to each pin, the polypeptides in a well can be captured (1 pmol capacity).
- the pins can be kept in motion (vertical, 1-2 mm travel) to increase the efficiency of the capture.
- a reaction such as an in vitro transcription is being performed in the wells
- movement of the pins can increase efficiency of the reaction. Further immobilization can result by applying an electrical field to the pin tool.
- the pin tool (with or without voltage) can be modified to have conjugated thereto a reagent specific for the polypeptide of interest, such that only the polypeptides of interest are bound by the pins.
- the pins can have nickel ions attached, such that only polypeptides containing a polyhistidine sequence are bound.
- the pins can have antibodies specific for a target polypeptide attached thereto, or to beads that, in turn, are attached to the pins, such that only the target polypeptides, which contain the epitope recognized by the antibody, are bound by the pins.
- Captured polypeptides can be analyzed by a variety of means including, e.g., spectrometry techniques, such as UV/VIS, IR, fluorescence, chemiluminescence, NMR spectroscopy, MS or other methods known in the art, or combinations thereof. If conditions preclude direct analysis of captured polypeptides, the polypeptides can be released or transferred from the pins, under conditions such that the advantages of sample concentration are not lost. Accordingly, the polypeptides can be removed from the pins using a minimal volume of eluent, and without any loss of sample. Where the polypeptides are bound to the beads attached to the pins, the beads containing the polypeptides can be removed from the pins and measurements made directly from the beads.
- spectrometry techniques such as UV/VIS, IR, fluorescence, chemiluminescence, NMR spectroscopy, MS or other methods known in the art, or combinations thereof. If conditions preclude direct analysis of captured polypeptides, the polypeptides can be
- Pin tools can be useful for immobilizing polypeptides of interest in spatially addressable manner on an array. Such spatially addressable or pre-addressable arrays are useful in a variety of processes, including, for example, quality control and amino acid sequencing diagnostics.
- the pin tools described in the U.S. Application Nos. 08/786,988 and 08/787,639 and International PCT Application No. WO 98/20166 are serial and parallel dispensing tools that can be employed to generate multi-element arrays of polypeptides on a surface of the solid support.
- the array surface can be flat, with beads or geometrically altered to include wells, which can contain beads.
- MS geometries can be adapted for accommodating a pin tool apparatus.
- aspects of the biological activity state, or mixed aspects can be measured in order to obtain drug and pathway responses.
- the activities of proteins relevant to the characterization of cell function can be measured, and embodiments of this invention can be based on such measurements.
- Activity measurements can be performed by any functional, biochemical or physical means appropriate to the particular activity being characterized. Where the activity involves a chemical transformation, the cellular protein can be contacted with natural substrates, and the rate of transformation measured. Where the activity involves association in multimeric units, e.g., association of an activated DNA binding complex with DNA, the amount of associated protein or secondary consequences of the association, such as amounts of mRNA transcribed, can be measured.
- response data may be formed of mixed aspects of the biological state of a cell.
- Response data can be constructed from, e.g., changes in certain mRNA abundances, changes in certain protein abundances and changes in certain protein activities.
- HDAC6 mutation identified in human cancer was analyzed using computational analysis tools to determine the effect(s) of these mutations on HDAC6 function.
- SNPs single nucleotide polymorphisms
- cSNPs single nucleotide polymorphisms
- TABLE 4 The known HDAC6 missense mutations and single nucleotide polymorphisms (SNPs), e.g., synonymous SNPs and non-synonymous SNPs, are summarized in TABLE 4 below.
- SNPs in coding regions (cSNPs) are of interest for the study of human phenotypic variation and disease gene mapping. Such polymorphisms are likely to impact gene function directly, and to be in strong linkage disequilibrium with rare disease-causing mutations in genes.
- Two (2) cSNPs of HDAC6 have been reported within the dbSNP database (http://www.ncbi.nlm.nih.gov/SNP/index.html). Neither of the mutations in TABLE 1 matches known non-synonymous SNPs.
- HDAC6 mutations of the present invention were analyzed using the Pfam computational analysis tool.
- Pfam is a large collection of multiple sequence alignments and hidden Markov models covering many common protein families based on the Swissprot 44.5 and SP-TrEMBL 27.5 protein sequence databases. Bateman et al, Nucl. Acids Res., Database Issue 32:D138-D141 (2004).
- a search using Pfam showed that HDAC6 contains two (2) HDAC domains and a Zn-finger domain also found in ubiquitin-hydrolases and other protein.
- HDAC6 85 GTGLVLDEQLNEFH CLWDDSFPEGPERLHAIKEQLIQEGLLDRC 128 eivtnsqeprkatdeelllvHsedyveflesvsptnlekldkgtNeielk ++ +1 1 i i i i i i +I++ +
- HDAC6 129 VSFQ ARFAEKEELMLVHSLEYIDLMETTQYMNEGELRV LA 168
- HDAC6 316 DADYIAAFLHVLLPVALEFQPQLVLVAAGFDALQGDP KGEMAATPAG 362 (SEQIDNO: 6) ygeivrflkslaqkhcdgpllvVlEGGYtlraiasavarcwialtggllg ++ +++
- HDAC6 363 FAQLTHLLMGLA GGKLILSLEGGYNLR ALAEGVSASLHTLLG 404
- HDAC6 480 RTGLVYDQNMMNHC NLwdSHHPEVPQRILRIMCRLEELGLAG 521
- HDAC6 522 RCLTLT PRPATEAELLTCHSAEYVGHLRATEKMKTRELHR E 562
- HDAC6 563 SSNF DSIYICPSTFACAQLATGAACRLVEAVLSGEVLNGAAWRPPG 609
- HDAC6 610 HHAEQDAACGFCFFNSVAVAARHAQTIS-GHAIRILIVDWDVHHGNGTQH 658
- HDAC6 756 PEGYAHLTHLLMGLA ⁇ GRIILILEGGYNLT SISESMAACTRS 797
- HDAC6 798 LLG 800 (SEQ ID NO: 8)
- HDAC6 1133 C-GDCGTIQENWVCLSCYQVYCGRY INGHMLQHHGNSGHPL 1172
- the two mutations identified in this inventation are location in between the second HDAC domain and the z-finger domain. Amino acid change in these positions may alter the protein structure and hence the protein function.
- HDAC6 mutation position G 1064 is located in between two putative serine phosphorylation sites, S 1062 and S 1066.
- the alteration of HDAC6 phosporylation pattern may affect (i.e., increase or decrease) HDAC6 biological activity when compared to the biological activity of wild-type HDAC6 polypeptide.
- Small ubiquitin-related modifier (SUMO) family proteins (a.k.a., PICl, UBLl, Sentrin, GMPl, and Smt3) are covalently attached to select target proteins in post- translational modification. Johnson, E.S., Ann. Rev. Biochem., 73: 355-382 (2004).
- SUMO is a member of a ubiquitin-like protein family that is ligated to lysine residues in a variety of target proteins, modulating their functions.
- SUMO modification is reversible, and does not appear to target proteins for degradation but rather alters the target protein function through changes in cellular localization, biochemical activation, or through protection from ubiquitin- dependent degradation.
- HDAC6 sumoylation sites were identified by computational analysis using the SUMOPlot computational analysis tool. Hinsley et ah, Protein Sd., 13: 2588 - 2599 (2004); Van Dyck et al, J. Biol. Chem., 279: 36121 - 36131 (2004).
- SUMOplotTM predicts the probability for the SUMO consensus sequence (SUMO-CS) to be engaged in SUMO attachment. That is, most SUMO-modif ⁇ ed proteins contain the tetrapeptide motif B-K-x-D/E where B is a hydrophobic residue, K is the lysine conjugated to SUMO, x is any amino acid (aa), and D or E is an acidic residue.
- Substrate specificity appears to be derived directly from Ubc9 and the respective substrate motif.
- the SUMOplotTM score system is based on two criteria: 1) direct amino acid match to the SUMO-CS observed and shown to bind Ubc9, and 2) substitution of the consensus amino acid residues with amino acid residues exhibiting similar hydrophobicity.
- HDAC6 wild-type polypeptide are summarized below in TABLE 9 and TABLE 10.
- HDAC6 wild-type polypeptide SEQ ID NO:11
- TABLE 9 The amino acid sequence of HDAC6 wild-type polypeptide.
- Bold text with light gray shading designates a sumoylation motif.
- Bold and underlined text with dark gray shading designates a sumoylation motif of high probability.
- HDAC6 sumoylation sites identified in HDAC6 are summarized below in TABLE 10. A score above 0.5 indicates a high sumoylation potential.
- PROSITE is a database of protein families and domains. It consists of biologically significant sites, patterns and profiles that help to reliably identify to which known protein family (if any) a new sequence belongs as well as to identify potential sites for protein modification.
- Other potential sites for protein modification of HDAC6 polypeptide as predicted by ProSite analysis are summarized below in TABLE 11. Amino acid positions highlighted in bold text represent possible interference by mutations.
- PKC phosphorylation 10-12 30-32; 31-33; 527-529; 551-553; PS00005 771-773; 833-835; 847-849; 852-854; 868-
- G930 and G 1064 were observed to be close to putative N-myristoylation sites.
- the HDAC6 mutation site G930 was observed close to a putative phosphorylation site by casein kinase IL Mutation in these positions may alter the modification patterns of HDAC6 and hence the protein function.
- HDAC6 HDAC6 biological function
- Known HDAC sequences of various organisms including, e.g., rat HDAC6 (XP_228753), mouse HDAC6 (NP_034543), human HDAC6 (NP__006035), human HDAClO (NP_114408), mouse HDAClO (NP_954668), fly HDAC6 (NP_727842), putative mosquito HDAC (XP_565819), putative worm HDAC sequences (NP_500787 & NP_500788) were obtained from GenBank and aligned using ClustalW.
- ClustalW is a general purpose multiple sequence alignment program for DNA or proteins. It produces biologically meaningful multiple sequence alignments of divergent sequences. It calculates the best match for the selected sequences, and lines them up so that the identities, similarities and differences can be seen.
- HDAClO mouse HDAClO mouse HEDMTATRLLWDDPECEIECPERLTAALDGLRQRGLEERCLCLSACEASEEE-LGLVHSP HDAC6_fly DBSMSQHCCLWDKE—HYECPERFTRVLERCRELNLTERCLELPSRSATKDEILRLHTEE HDAC_mosqu-.to
- HDAC_worm KQTQNEHENTVCPT HPESSDRILKIKEALTKTKILEKCTVLTNFLEIDDADLEVTHDK
- HDAC worm2 NQTQNEHENTVCPT HPESSDRILKIKEALTKTKILEKCTVLTNFLEIDDADLEVTHDK
- HDAC_mosquito L V ES VW SGT V Q N G M HDAC_worm SMVKDLMESEKKTQEDINSQCEKYDSVFMTENSMKVAKDGVACVRDLTNRIMANEASNGF HDAC worm2 SMVKDLMESEKKTQEDINSQCEKYD ⁇ VFMTENSMKVAKDGVACVRDLTNRIMANEASNGF
- HDAC_worm AWRPPGHHADSVSPCGFCLFNNVAQAAEEAF-FSGAERILIVDLDVHHGHGTQRIFYDD
- HDAC worm2 AVVRPPGHHADSVSPCGFCLFNNVAQAAEEAF- FSGAERILIVDLDVHHGHGTQRIFYDD
- HDAC6_rat LPVA FEPQLVLVAAGFDALHGDPKGEMSATPAGFAHLTHFLMGLAGGKLILSLEGGYN
- HDAC10_human PPLLPG GPVCKAAASAPSSLLDQPCLCPAP SVRTAVALT-TPDITLVLPP
- HDAC10_mouse LRLLGE SPTCAVAEDSLSPLLDQLCLRPAP PICTAVAST-VPGAALCLPP
- HDAC6_rat DSELLTCHSAEYVERLRATEKMKTRDLHREGA-NFESIYICPSTFACAQLATGAACRLVE
- HDAC_mosquito TEELCLVHGPEHVRLIGEVC QSSTMKQTAD-QYNSIYFHPSTDASARMATGSVLTVVE
- HDAC_worm2 NEEIRLVHTKKMLEHLRTTETMKDEELMEEAEKEFNSIYLTRDTLKVARKAVGAVLQSVD : . : * : :
- HDAC worm EIFEKDAGQRNALVIVRPPGHHASASKSSGFCIFNNVAVAAKYAQRRHKAKR-VLILDWD
- HDAC worm2 EIFEKDRGQRNALVIVRPPGHHASaSKSSGFCIFNNVAVAAKYAQRRHKAKR-VI 1 ILDWD
- HDAC worm2 REVCAVQQKYWSILKGFQVTPSNYGLDIDDEAYD
- HDAC6_mouse NFPNQTTS-VEALGETEP TPPASHTNKQTTGASPLQGVTAQQSLQ—LGVLSTLELSR
- HDAC_mosquito LSE RGGTDQQQPQQTTSTPSSSSSATTAGSSGGTAERGN HDAC_worm ⁇ VLN ⁇ KLINKNGKSSAAT LKVKGKAATDP HDAC worm2 DDSIDMADQSSSSGSSSSSTRESHNLEIMDSGPAHA
- HDAC6_rat EAQEVQES EEGLLGEAAGGQDMNSLMLTQGFGDFNTQDVFYAVTPLSWCFHLMAVC
- HDAC6_mouse EAEEAHDS EEGLLGEAAGGQDMNSLMLTQGFGDFNTQDVFYAVTPLSWCPHLMAVC HDAC6_human ESQGASESQAPGEENLLGEAAGGQDMADSMLMQGSRGLTDQAIFYAVTPLPWCPHLVAVC HDAC10_human HDAC10_mouse
- HDAC6_rat P-IPAAGLDVSQPCKTCGSVQENWVCLTCYQVYCSRYVKAHMVCH-HEASEHPLVLSCVD
- HDAC6_raouse P-IPAAGLDVSQPCKTCGTVQENWVCLTCYQVYCSRYVNAHMVCH-HEASEHPLVLSCVD
- HDAC worm2 LSVWCYPCDSYVHNPALIGAKSAAHESKFGETMP ⁇ HDAC6_rat (SEQ ID NO: 19)
- HDAC10_human SEQ ID NO: 22
- HDAC10_mouse SEQ ID NO: 23
- HDAC6_fly DNGAGSSICLRLERNN SEQ ID NO: 24
- HDAC_i ⁇ osquito SEQ ID NO: 25
- HDAC_worm SEQ ID NO: 26
- HDAC_WOrm2 (SEQ ID NO : 27)
- an agent that modulates HDAC6 biological activity i.e., HDAC6 modulating agent, e.g., HDAC6 antagonist
- HDAC6 modulating agent e.g., HDAC6 antagonist
- the SNP is selected from the group consisting of the
- HDAC6 mutations summarized in TABLE 1 and TABLE 2.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Veterinary Medicine (AREA)
- Chemical & Material Sciences (AREA)
- Public Health (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Epidemiology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Immunology (AREA)
- Engineering & Computer Science (AREA)
- Gastroenterology & Hepatology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Organic Chemistry (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
Abstract
This invention relates generally to the analytical testing of tissue samples in vitro, and more particularly to aspects of genetic polymorphisms and mutations of the HDAC6 gene. The invention provides new HDAC6 mutations and SNPs5 useful in the diagnosis and treatment of subjects in need thereof. Accordingly, the various aspects of the present invention relate to polynucleotides encoding the HDAC6 mutations of the invention, expression vectors encoding the HDAC6 mutant polypeptides of the invention and organisms that express the HDAC6 mutant and polymorphic ' polynucleotides and/or HDAC6 mutant/polymorphic polypeptides of the invention. The various aspects of the present invention further relate to diagnostic/theranostic methods and kits that use the HDAC6 mutations and polymorphisms of the invention to identify individuals predisposed to disease or to classify individuals with regard to drug responsiveness, side effects, or optimal drug dose.
Description
MUTATIONS AND POLYMORPHISMS OF HDAC6
FIELD OF THE INVENTION
[01] This invention relates generally to the analytical testing of tissue samples in vitro, and more particularly to aspects of genetic mutations and polymorphisms of HDAC6.
BACKGROUND OF THE INVENTION
[02] Conventional medical approaches to diagnosis and treatment of disease is based on clinical data alone, or made in conjunction with a diagnostic test. Such traditional practices often lead to therapeutic choices that are not optimal for the efficacy of the prescribed drug therapy or to minimize the likelihood of side effects for an individual subject Therapy specific diagnostics (a.k.a., theranostics) is an emerging medical technology field, which provides tests useful to diagnose a disease, choose the correct treatment regime and monitor a subject's response. That is, theranostics are useful to predict and assess drug response in individual subjects, i.e., individualized medicine. Theranostic tests are also useful to select subjects for treatments that are particularly likely to benefit from the treatment or to provide an early and objective indication of treatment efficacy in individual subjects, so that the treatment can be altered with a minimum of delay. Theranostics are useful in clinical diagnosis and management of a variety of diseases and disorders, which include, but are not limited to, e.g., cardiovascular disease, cancer, infectious diseases, Alzheimer's disease and the prediction of drug toxicity or drug resistance. Theranostic tests may be developed in any suitable diagnostic testing format, which include, but is not limited to, e.g., immunohistochemical tests, clinical chemistry, immunoassay, cell-based technologies, and nucleic acid tests. Progress in pharmacogenomics and pharmacogenetics, which establishes correlations between responses to specific drugs and the genetic profile of individual patients and/or their tumours, is foundational to the development of new theranostic approaches. As such, there is a need in the art for the evaluation of patient-to-patient variations and tumour mutations in gene sequence and gene expression. A common form of genetic profiling relies on the identification of DNA sequence variations called single nucleotide polymorphisms ("SNPs"), which are one type of genetic alteration leading to patient-to-patient variation in individual drug response. In addition, it is well established in the art that acquired DNA changes (mutations) are responsible, alone or in part, for pathological processes. It follows that, there is a need art to identify and characterize genetic mutations and SNPs, which are
useful to identify the genotypes of subjects and their tumours associated with drug responsiveness, side effects, or optimal dose. Acetylation of histories plays an important role in regulating various cellular processes. Gene expression is, in part, controlled by the acetylation state of nucleosomal histones. Grozinger et al, Proc. Natl. Acad. Sci. U.S.A., 96(9):4868-73 (1999). The dynamic control of protein acetylation levels in vivo occurs through the opposing actions of histone acetyltransferases and histone deacetylases (HDACs). Bertos et al, Biochem. Cell Biol, 79(3):243-52 (2001). Distinct classes of HDACs have been identified in mammalian cells. Class I members of the HDAC family, such as HDACl, HDAC2, HDAC3, and HDAC8, are enzymatic transcriptional corepressors homologous to yeast Rpd3. Class II members of the HDAC family, including HDAC4, HDAC5, HDAC6, HDAC7, and HDAC9, possess domains similar to the1 deacetylase domain of yeast Hdal. Bertos et al, Biochem. Cell Biol, 79(3):243-52 (2001).
[03] HDACs are known to play key roles in the regulation of cell proliferation. Consequently, inhibition of HDACs has become an important approach for anti-cancer therapy. Histone acetylation modifiers have been described to participate as cofactors in mammalian transcriptional complexes involved in the regulation of cellular proliferation and differentiation. Mahlknecht et al, Biochem. Biophys. Res. Commun., 263(2):482-90 (1999). Indeed, increasing evidence suggests a connection between histone acetylation and the development of cancer and leukemia.
[04] Cancer cells exhibit a set of unique properties that distinguish them from their normal counterparts, e.g., increased growth rates, loss of differentiation, escape from cell death pathways, evasion of antiproliferative signals, a decreased reliance on exogenous growth factors and escape from replicative senescence. Wade PA., Hum. MoI Genet., 10(7):693-8 (2001). Acquisition of these cellular features by malignant cells is typified by impairment of normal cellular control mechanisms. The failure to deacetylate and thus repress transcription by the Class I histone deacetylases HDACl and HDAC2 due to disruption of the Rb family of proteins has been firmly established as a mechanism leading to increases in growth rate and cellular proliferation. Recent data suggest that this regulatory circuit also executes G(I) checkpoint arrest downstream of DNA damage, cellular senescence and contact inhibition. In contrast to this failure to deacetylate, it now seems probable that changes in differentiation status may result in part from inappropriate deacetylation and concomitant transcriptional
repression mediated by the Class II histone deacetylases. This inappropriate deacetylation by HDAC4, HDAC5 and HDAC6 follows their relocalization from the cytoplasm to the nucleus. [05] Histone acetylation is a key modification to control transcription, and HDACs are profoundly involved in pathogenesis of cancer through removing acetyl groups from histones and other transcriptional regulators. Trichostatin A (TSA) is a Streptomyces metabolite that causes differentiation of murine erythroleukemia cells as well as specific inhibition of the cell cycle of some lower eukaryotes and mammalian cells. The targeted molecule of TSA has been shown by genetic and biochemical analyses to be histone deacetylases (HDACs). Yoshida et al, Curr Med Chem., 10(22):2351-8 (2003). Trapoxin (TPX) and FK228 (also known as FR901228 and depsipeptide because FK228 = FR901228 = depsipeptide), structurally unrelated microbial metabolites, were also shown to inhibit HDACs. Yoshida et al, Curr Med Chem., 10(22):2351-8 (2003). These HDAC inhibitors cause cell cycle arrest, differentiation and/or apoptosis of many tumors, suggesting their usefulness for chemotherapy and differentiation therapy. Also, inhibition of HDAC function, e.g., HDAC6, with HDAC antagonist LBH589, a novel cinnamic hydroxamic acid analog, induces acetylation of histone H3 and H4 and plays a role in the induction of cell-cycle G(I) phase accumulation and apoptosis of the human chronic myeloid leukemia blast crisis (CML-BC) K562 cells and acute leukemia MV4-11 cells with the activating length mutation of FLT-3. George et al, Blood, 105(4): 1768-76 (2005). Accordingly, there is a need in the art for additional information about the relationship between HDAC6 mutations and cancer.
SUMMARY OF THE INVENTION
[06] The invention provides for the use of an HDAC6 modulating agent in the manufacture of a medicament for the treatment of cancer in a selected patient population. The patient population is selected on the basis of the genotype of the patients at an HDAC6 genetic locus indicative of efficacy of the HDAC6 modulating agent in treating cancer. In several embodiments, the cancer can be acute myeloid leukemia (a.k.a., acute myelogenous leukemia).
[07] The invention also provides an isolated polynucleotide having a sequence encoding an HDAC6 mutation. In several embodiments, the HDAC6 mutations are the previously- unidentified mutations listed in TABLE 1. Accordingly, the invention provides vectors and
organisms containing the HDAC6 mutations of the invention and polypeptides encoded by polynucleotides containing the HDAC6 mutations of the invention.
[08] The invention further provides a method for treating cancer in a subject. The genotype or haplotype of a subject is obtained at an HDAC6 gene locus, so that the genotype and/or haplotype is indicative of a propensity of the cancer to respond to the drug. Then, an anticancer therapy is administered to the subject.
[09] The invention provides a method for diagnosing cancer in a subject and a method for choosing subjects for inclusion in a clinical trial for determining efficacy of an HDAC6 modulating agent; in both these methods the genotype and/or haplotype of a subject is interrogated at an HDAC6 gene locus. Also provided by the invention are kits for use in determining a treatment strategy for cancer.
[10] The invention also provides for the use of each of the mutations of the inventions as a drug target. In other aspects, the invention provides methods for modulating pathways and cascades associated with HDAC and HDAC mutant molecules.
DETAILED DESCRIPTION OF THE INVENTION
[11] It is to be appreciated that certain aspects, modes, embodiments, variations and features of the invention are described below in various levels of detail in order to provide a substantial understanding of the present invention. In general, such disclosure provides new HDAC6 mutations and SNPs that may be useful, alone or in combination, in the diagnosis and treatment of subjects in need thereof. Accordingly, the various aspects of the present invention relate to polynucleotides encoding HDAC6 mutations and polymorphisms of the invention, expression vectors encoding the HDAC6 mutant polypeptides of the invention and organisms that express the HDAC6 mutant/polymorphic polynucleotides and/or HDAC6 mutant/polymorphic polypeptides of the invention. The various aspects of the present invention further relate to diagnostic/theranostic methods and kits that use the HDAC6 mutations and/or polymorphisms of the invention to identify individuals predisposed to disease or to classify individuals and tumours with regard to drug responsiveness, side effects, or optimal drug dose. In other aspects, the invention provides methods for compound validation and a computer system for storing and analyzing data related to the HDAC6 mutations and polymorphisms of the invention. Accordingly, various particular embodiments that illustrate these aspects follow.
[12] Definitions. The definitions of certain terms as used in this specification are provided below. Definitions of other terms may be found in the glossary provided by the U.S. Department of Energy, Office of Science, Human Genome Project (^ttp://vvΛvw.ornl.gov/sci/techresources/Human_Genome/glossary/).
[13] As used herein, the term "allele" means a particular form of a gene or DNA sequence at a specific chromosomal location (locus).
[14] As used herein, the term "antibody" includes, but is not limited to, polyclonal antibodies, monoclonal antibodies, humanized or chimeric antibodies and biologically functional antibody fragments sufficient for binding of the antibody fragment to the protein. [15] As used herein, the term "clinical response" means any or all of the following: a quantitative measure of the response, no response, and adverse response (i.e., side effects). [16] As used herein, the term "clinical trial" means any research study designed to collect clinical data on responses to a particular treatment, and includes but is not limited to phase I, phase II and phase III clinical trials. Standard methods are used to define the patient population and to enrol subjects.
[17] As used herein, the phrase "disease associated with a HDAC6" mutation refers to any disease or disorder arising from a mutation in at least one position a gene encoding histone deacetylase 6 (HDAC6). The mutation refers to any alteration of the nucleic acid sequence encoding HDAC6 that inactivates the functionality of the protein produced by that gene. Such mutations can include, but are not limited to, an amino acid substitution wherein a native amino acid is replaced with another amino acid residue. Examples of diseases include, but are not limited to, acute myeloid leukaemia (AML), a chronic myeloid leukemia (CML), hyperplasia, glioblastoma, melanoma, cholangioma, breast cancer, genitourinary cancer, lung cancer, non-small-cell lung cancer (NSCLC), gastrointestinal cancer, epidermoid cancer, melanoma, ovarian cancer, pancreas cancer, neuroblastoma, head cancer, neck cancer, bladder cancer, renal cancer, brain cancer, gastric cancer, prostate cancer, colorectal cancer, multiple myeloma and lymphomas.
[18] As used herein, the term "effective amount" of a compound is a quantity sufficient to achieve a desired pharmacodynamic, toxicologic, therapeutic and/or prophylactic effect, for example, an amount which results in the prevention of or a decrease in the symptoms associated with a disease that is being treated, e.g., the diseases associated with HDAC6 mutant polypeptides and HDAC6 mutant polynucleotides identified herein. The amount of
compound administered to the subject will depend on the type and severity of the disease and on the characteristics of the individual, such as general health, age, sex, body weight and tolerance to drugs. It will also depend on the degree, severity and type of disease. The skilled artisan will be able to determine appropriate dosages depending on these and other factors. Typically, an effective amount of the compounds of the present invention, sufficient for achieving a therapeutic or prophylactic effect, range from about 0.000001 mg per kilogram body weight per day to about 10,000 mg per kilogram body weight per day. Preferably, the dosage ranges are from about 0.0001 mg per kilogram body weight per day to about 100 mg per kilogram body weight per day. The compounds of the present invention can also be administered in combination with each other, or with one or more additional therapeutic compounds.
[19] Glivec® (Gleevec®; imatinib) is a medication for chronic myeloid leukaemia (CML) and certain stages of gastrointestinal stromal tumours (GIST). It targets and interferes with the molecular abnormalities that drive the growth of cancer cells. Corless CL et ah, J. Clin.
Oncol. 22(18):3813-25 (September 15, 2004); Verweij J et al, Lancet 364(9440): 1127-34
(September 25, 2004); Kantarjian HM et al, Blood 104(7): 1979-88 (October 1, 2004). By inhibiting multiple targets, Glivec® has potential as an anticancer therapy for several types of cancer, including leukaemia and solid tumours.
[20] The aromatase inhibitor FEMARA is a treatment for advanced breast cancer in postmenopausal women. It blocks the use of oestrogen by certain types of breast cancer that require oestrogen to grow. Janicke F, Breast 13 Suppl l:S10-8 (December 2004); Mouridsen
H et al, Oncologist 9(5):489-96 (2004).
[21] Sandostatin® LAR® is used to treat patients with acromegaly and to control symptoms, such as severe diarrhoea and flushing, in patients with functional gastro-entero- pancreatic (GEP) tumours (e.g., metastatic carcinoid tumours and vasoactive intestinal peptide-secreting tumours [Vipomas]). Oberg K, Chemotherapy 47 Suppl 2:40-53 (2001);
Raderer M et al, Oncology 60(2):141-5 (2001); Aparicio T et al, Eur. J. Cancer 37(8):1014-
9 (May 2001). Sandostatin® LAR® regulates hormones in the body to help manage diseases and their symptoms.
[22] ZOMETA® is a treatment for hypocalcaemia of malignancy (HCM)I and for the treatment of bone metastases across a broad range of tumour types. These tumours include
multiple myeloma, prostrate cancer, breast cancer, lung cancer, renal cancer and other solid tumours. Rosen LS et al., Cancer 100(12):2613-21 (June 15, 2004).
[23] Vatalanib (l-[4-chloroanilino]-4-[4-pyridylmethyl] phthalazine succinate) is a multi- VEGF receptor (VEGF) inhibitor that may block the creation of new blood vessels to prevent tumour growth. This compound inhibits all known VEGF receptor tyrosine kinases, blocking angiogenesis and lymphangiogenesis. Drevs J et al., Cancer Res. 60:4819-4824 (2000); Wood JM et al., Cancer Res. 60:2178-2189 (2000). Vatalanib is being studied in two large, multinational, randomized, phase III, placebo-controlled trials in combination with FOLFOX- 4 in first-line and second-line treatment of patients with metastatic colorectal cancer. Thomas A et al., 37th Annual Meeting of the American Society of Clinical Oncology, San Francisco, CA, Abstract 279 (May 12-15, 2001).
[24] The orally bioavailable rapamycin derivative everolimus inhibits oncogenic signalling in tumour cells. By blocking the mammalian target of rapamycin (mTOR)-mediated signalling, everolimus exhibits broad antiproliferative activity in tumour cell lines and animal models of cancer. Boulay A et al., Cancer Res. 64:252-261 (2004). In preclinical studies, everolimus also potently inhibited the proliferation of human umbilical vein endothelial cells directly indicating an involvement in angiogenesis. By blocking tumour cell proliferation and angiogenesis, everolimus may provide a clinical benefit to patients with cancer. Everolimus is being investigated for its antitumour properties in a number of clinical studies in patients with haematological and solid tumours. Huang S & Houghton PJ, Curr. Opin. Investig. Drugs 3:295-304 (2002).
[25] Gimatecan is a novel oral inhibitor of topoisomerase I (topo I). Gimatecan blocks cell division in cells that divide rapidly, such as cancer cells, which activates apoptosis. Preclinical data indicate that gimatecan is not a substrate for multidrug resistance pumps, and that it increases the drug-target interaction. De Cesare M et al, Cancer Res. 61:7189-7195 (2001). Phase I clinical studies indicate that the dose-limiting toxicity of gimatecan is myelosuppression.
[26] Patupilone is a microtubule stabilizer. Altmann K-H, Curr. Opin. Chem. Biol. 5:424- 431 (2001); Altmann K-H et al., Biochim Biophys Acta. 470:M79-M91 (2000); O'Neill V et al., 36th Annual Meeting of the American Society of Clinical Oncology; May 19-23, 2000; New Orleans, LA, Abstract 829; Calvert PM et al. Proceedings of the 11th National Cancer Institute-European Organization for Research and Treatment of Cancer/American
Association for Cancer Research Symposium on New Drugs in Cancer Therapy; November 7- 10, 2000; Amsterdam, The Netherlands, Abstract 575. Patupilone blocked mitosis and induced apoptosis greater than the frequently used anticancer drug paclitaxel. Also, patupilone retained full activity against human cancer cells that were resistant to paclitaxel and other chemotherapeutic agents.
[27] Midostaurin is an inhibitor of multiple signalling proteins. By targeting specific receptor tyrosine kinases and components of several signal transduction pathways, midostaurin impacts several targets involved in cell growth (e.g., KIT5 PDGFR, PKC), leukaemic cell proliferation (e.g., FLT3), and angiogenesis (e.g., VEGFR2). Weisberg E et al. Cancer Cell 1 :433-443 (2002); Fabbro D et al., Anticancer Drug Des. 15:17-28 (2000). In preclinical studies, midostaurin showed broad antiproliferative activity against various tumour cell lines, including those that were resistant to several other chemotherapeutic agents. [28] The somatostatin analogue pasireotide is a stable cyclohexapeptide with broad somatotropin release inhibiting factor (SRIF) receptor binding. Brans C et al., Eur. J. Endocrinol. 146(5):707-16 (May 2002); Weckbecker G et al, Endocrinology 143(10):4123- 30 (October 2002); Oberg K, Chemotherapy 47 Suppl 2:40-53 (2001).
[29] LBH589 is a histone deacetylase (HDAC) inhibitor. By blocking the deacetylase activity of HDAC, HDAC inhibitors activate gene transcription of critical genes that cause apoptosis (programmed cell death). By triggering apoptosis, LBH589 induces growth inhibition and regression in tumour cell lines. LBH589 is being tested in phase I clinical trials as an anticancer agent. See also, George P et al, Blood 105(4): 1768-76 (February 15, 2005). [30] AMNl 07 is an oral tyrosine kinase inhibitor that targets Bcr-Abl, KIT, and PDGFR. Preclinical studies have shown in cellular assays using Philadelphia chromosome-positive (Ph+) CML cells that AMNl 07 is highly potent and has high selectivity for Bcr-Abl, KIT, and PDGFR. Weisberg E et al, Cancer Cell 7(2): 129-41 (February 2005); OΗare T et al, Cancer Cell 7(2): 117-9 (February 2005). AMN 107 also shows activity against mutated variants of Bcr-Abl. AMN 107 is currently being studied in phase I clinical trials. [0001] AMNl 07 is an oral tyrosine kinase inhibitor that targets Bcr-Abl, KIT, and PDGFR. Preclinical studies have shown in cellular assays using Philadelphia chromosome— positive (Ph+) CML cells that AMNl 07 is highly potent and has high selectivity for Bcr-Abl. KIT, and PDGFR. Weisberg E et al, Cancer Cell 7(2):129-41 (February 2005); OΗare T et al,
Cancer Cell 7(2): 117-9 (February 2005). AMN 107 also shows activity against mutated variants of Bcr-Abl. AMN107 is currently being studied in phase I clinical trials. [31] As used herein, the term "HDAC6 modulating agent" is any compound that alters {e.g., increases or decreases). The expression level or biological activity level of HDAC6 polypeptide compared to the expression level or biological activity level of HDAC6 polypeptide in the absence of the HDAC6 modulating agent. HDAC6 modulating agent can be a small molecule, antibody, polypeptide, carbohydrate, lipid, nucleotide, or combination thereof. The HDAC6 modulating agent can be an organic compound or an inorganic compound.In a preferred embodiment, the HDAC6 modulating agents is a histone deacetylase inhibitors (HDAI). HDAI are described in e.g. Monneret C, European Journal of Medicinal Chemistry, 40, (2005), 1—13, the contents of witch is herewith incorporated by reference. HDAIs include for example sodium butyrate, phenylacetate, phenylbutyrate, valproic acid, tributyrinpivaloyloxymethyl butyrate, pivanex®, trichostatinA (TSA), trichostatin C, trapoxins A and B, depudecin, cyclic hydroxamic-acid containing peptide (CHAPs), apicidin or OSI-2040, suberoylanilide hydroxamic acid (SAHA), oxamflatindepsipeptide, FK228, scriptaid, biarylhydroxamate inhibitor, A-161906, JNJ16241199, PDX 101, MS-275, CI-994. The structure and synthesis of HDAIs are known in the art and can for instance be found in Monneret et al., supra and references therein.
[32] HDAI compounds of particular interest are hydroxamate compounds described by the formula I:
wherein
Ri is H, halo, or a straight chain C J-C6 alkyl (especially methyl, ethyl or n-propyl, which methyl, ethyl and n-propyl substituents are unsubstituted or substituted by one or more substituents described below for alkyl substituents);
R2 is selected from H5 Ci-Cio alkyl, (preferably Ci-Cβ alkyl, e.g. methyl, ethyl or - CH2CH2-OH), C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, C4 - C9 heterocycloalkylalkyl, cycloalkylalkyl (e.g., cyclopropylmethyl), aryl, heteroaryl, arylalkyl (e.g. benzyl), heteroarylalkyl (e.g. pyridylmethyl), -(CH2)nC(O)R6, -(CH2)n0C(O)R6, amino acyl, HON- C(O)-CH=C(Ri )-aryl-alkyl- and -(CH2)nR7;
R3 and R4 are the same or different and independently H, Ci-C6 alkyl, acyl or acylamino, or R3 and R4 together with the carbon to which they are bound represent C=O, C=S, or C=NRg, or R2 together with the nitrogen to which it is bound and R3 together with the carbon to which it is bound can form a C4 — C9 heterocycloalkyl, a heteroaryl, a polyheteroaryl, a non-aromatic polyheterocycle, or a mixed aryl and non-aryl polyheterocycle ring;
R5 is selected from H, C1-CO alkyl, C4 - C9 cycloalkyl, C4 — C9 heterocycloalkyl, acyl, aryl, heteroaryl, arylalkyl (e.g. benzyl), heteroarylalkyl (e.g. pyridylmethyl), aromatic polycycles, non-aromatic polycycles, mixed aryl and non-aryl polycycles, polyheteroaryl, non-aromatic polyheterocycles, and mixed aryl and non-aryl polyheterocycles; n, ni, n2 and n3 are the same or different and independently selected from 0 — 6, when nj is 1- 6, each carbon atom can be optionally and independently substituted with R3 and/or R4; X and Y are the same or different and independently selected from H, halo, Cj-C4 alkyl, such as CH3 and CF3, NO2, C(O)Ri, OR9, SR9, CN, and NRi0Ri 1;
R6 is selected from H, Ci-Cβ alkyl, C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, cycloalkylalkyl (e.g., cyclopropylmethyl), aryl, heteroaryl, arylalkyl (e.g., benzyl, 2- phenylethenyl), heteroarylalkyl (e.g., pyridylmethyl), OR12, and NR13R14;
R7 is selected from ORj5, SRi5, S(O)Ri6, SO2Ri75 NRi3Ri4, and NRj2SO2R6;
R8 is selected from H, ORi5, NRJ3RJ4, CrC6 alkyl, C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, aryl, heteroaryl, arylalkyl (e.g., benzyl), and heteroarylalkyl (e.g., pyridylmethyl);
R9 is selected from Cj - C4 alkyl, for example, CH3 and CF3, C(O)-alkyl, for example C(O)CH3, and C(O)CF3;
Rio and Rn are the same or different and independently selected from H, Ct-C4 alkyl, and -C(O)-alkyl;
Ri2 is selected from H, Ci -Ce alkyl, C4 — C9 cycloalkyl, C4 - C9 heterocycloalkyl, C4 — C9 heterocycloalkylalkyl, aryl, mixed aryl and non-aryl polycycle, heteroaryl, arylalkyl (e.g., benzyl), and heteroarylalkyl (e.g., pyridylmethyl);
Rn and R14 are the same or different and independently selected from H, Q-C6 alkyl, C4 — C9 cycloalkyl, C4 — C9 heterocycloalkyl, aryl, heteroaryl, arylalkyl (e.g., benzyl), heteroarylalkyl (e.g., pyridylmethyl), amino acyl, or R13 and R14 together with the nitrogen to which they are bound are C4 — C9 heterocycloalkyl, heteroaryl, polyheteroaryl, non-aromatic polyheterocycle or mixed aryl and non-aryl polyheterocycle;
R1S is selected from H, Cj-Cβ alkyl, C4 — C9 cycloalkyl, C4 - C9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl and (CH2)mZRi2;
Ri6 is selected from Ci-C6 alkyl, C4 — C9 cycloalkyl, C4 — C9 heterocycloalkyl, aryl, heteroaryl, polyheteroaryl, arylalkyl, heteroarylalkyl and (CHa)1nZRi2;
Ri7 is selected from Ci-Ce alkyl, C4 - C9 cycloalkyl, C4 — C9 heterocycloalkyl, aryl, aromatic polycycles, heteroaryl, arylalkyl, heteroarylalkyl, polyheteroaryl and NRπRπ; m is an integer selected from 0 to 6; and
Z is selected from O, NRj3, S and S(O).
[33] As appropriate, unsubstituted means that there is no substituent or that the only substituents are hydrogen.
[34] Halo substituents are selected from fluoro, chloro, bromo and iodo, preferably fluoro or chloro.
[35] Alkyl substituents include straight and branched Ci-Cβalkyl, unless otherwise noted. Examples of suitable straight and branched Ci-Cβalkyl substituents include methyl, ethyl, n- propyl, 2-propyl, n-butyl, sec-butyl, t-butyl, and the like. Unless otherwise noted, the alkyl substituents include both unsubstituted alkyl groups and alkyl groups that are substituted by one or more suitable substituents, including unsaturation (i.e. there are one or more double or triple C-C bonds), acyl, cycloalkyl, halo, oxyalkyl, alkylamino, aminoalkyl, acylamino and OR15, for example, alkoxy. Preferred substituents for alkyl groups include halo, hydroxy, alkoxy, oxyalkyl, alkylamino, and aminoalkyl.
[36] Cycloalkyl substituents include C3-C9 cycloalkyl groups, such as cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl and the like, unless otherwise specified. Unless otherwise noted, cycloalkyl substituents include both unsubstituted cycloalkyl groups and cycloalkyl groups that are substituted by one or more suitable substituents, including Ci-C6 alkyl, halo,
hydroxy, aminoalkyl, oxyalkyl, alkylamino, and ORi 5, such as alkoxy. Preferred substituents for cycloalkyl groups include halo, hydroxy, alkoxy, oxyalkyl, alkylamino and aminoalkyl. [37] The above discussion of alkyl and cycloalkyl substituents also applies to the alkyl portions of other substituents, such as without limitation, alkoxy, alkyl amines, alkyl ketones, arylalkyl, heteroarylalkyl, alkylsulfonyl and alkyl ester substituents and the like. [38] Heterocycloalkyl substituents include 3 to 9 membered aliphatic rings, such as 4 to 7 membered aliphatic rings, containing from one to three heteroatoms selected from nitrogen, sulfur, oxygen. Examples of suitable heterocycloalkyl substituents include pyrrolidyl, tetrahydrofuryl, tetrahydrothiofuranyl, piperidyl, piperazyl, tetrahydropyranyl, morphilino, 1,3-diazapane, 1,4-diazapane, 1 ,4-oxazepane, and 1,4-oxathiapane. Unless otherwise noted, the rings are unsubstituted or substituted on the carbon atoms by one or more suitable substituents, including C]-C6 alkyl, C4 - C9 cycloalkyl, aryl, heteroaryl, arylalkyl (e.g., benzyl), and heteroarylalkyl (e.g., pyridylmethyl), halo, amino, alkyl amino and OR1S, for example alkoxy. Unless otherwise noted, nitrogen heteroatoms are unsubstituted or substituted by H, C1-C4 alkyl, arylalkyl (e.g., benzyl), and heteroarylalkyl (e.g., pyridylmethyl), acyl, aminoacyl, alkylsulfonyl, and arylsulfonyl.
[39] Cycloalkylalkyl substituents include compounds of the formula -(CH2)n5-cycloalkyl wherein n5 is a number from 1-6. Suitable alkylcycloalkyl substituents include cyclopentylmethyl-, cyclopentylethyl, cyclohexylmethyl and the like. Such substituents are unsubstituted or substituted in the alkyl portion or in the cycloalkyl portion by a suitable substituent, including those listed above for alkyl and cycloalkyl.
[40] Aryl substituents include unsubstituted phenyl and phenyl substituted by one or more suitable substituents, including C1-C6 alkyl, cycloalkylalkyl (e.g., cyclopropylmethyl), O(CO)alkyl, oxyalkyl, halo, nitro, amino, alkylamino, aminoalkyl, alkyl ketones, nitrile, carboxyalkyl, alkylsulfonyl, aminosulfonyl, arylsulfonyl, and OR15, such as alkoxy. Preferred substituents include including Ci-C6 alkyl, cycloalkyl (e.g., cyclopropylmethyl), alkoxy, oxyalkyl, halo, nitro, amino, alkylamino, aminoalkyl, alkyl ketones, nitrile, carboxyalkyl, alkylsulfonyl, arylsulfonyl, and aminosulfonyl. Examples of suitable aryl groups include Ci- C4alkylphenyl, Ci-C4alkoxyphenyl, trifluoromethylphenyl, methoxyphenyl, hydroxyethylphenyl, dimethylaminophenyl, aminopropylphenyl, carbethoxyphenyl, methanesulfonylphenyl and tolylsulfonylphenyl.
[41] Aromatic polycycles include naphthyl, and naphthyl substituted by one or more suitable substituents, including Ci-C6 alkyl, alkylcycloalkyl (e.g., cyclopropylmethyl), oxyalkyl, halo, nitro, amino, alkylamino, aminoalkyl, alkyl ketones, nitrile, carboxyalkyl, alkylsulfonyl, arylsulfonyl, aminosulfonyl and ORi 5, such as alkoxy.
[42] Heteroaryl substituents include compounds with a 5 to 7 member aromatic ring containing one or more heteroatoms, for example from 1 to 4 heteroatoms, selected from N, O and S. Typical heteroaryl substituents include furyl, thienyl, pyrrole, pyrazole, triazole, thiazole, oxazole, pyridine, pyrimidine, isoxazolyl, pyrazine and the like. Unless otherwise noted, heteroaryl substituents are unsubstituted or substituted on a carbon atom by one or more suitable substituents, including alkyl, the alkyl substituents identified above, and another heteroaryl substituent. Nitrogen atoms are unsubstituted or substituted, for example by R13; especially useful N substituents include H, C1 — C4 alkyl, acyl, aminoacyl, and sulfonyl. [43] Arylalkyl substituents include groups of the formula -(CH2)n5-aryl, -(CH2)n5-i-(CH- aryl)-(CH2)n5-aryl or -(CH2)n5-iCH(aryl)(aryl) wherein aryl and n5 are defined above. Such arylalkyl substituents include benzyl, 2-phenylethyl. 1-phenylethyl, tolyl-3 -propyl, 2- phenylpropyl, diphenylmethyl, 2-diphenylethyl, 5,5-dimethyl-3-phenylpentyl and the like. Arylalkyl substituents are unsubstituted or substituted in the alkyl moiety or the aryl moiety or both as described above for alkyl and aryl substituents.
[44] Heteroarylalkyl substituents include groups of the formula — (CH2)n5-heteroaryl wherein heteroaryl and n5 are defined above and the bridging group is linked to a carbon or a nitrogen of the heteroaryl portion, such as 2-, 3- or 4-pyridylmethyl, imidazolylmethyl, quinolylethyl, and pyrrolylbutyl. Heteroaryl substituents are unsubstituted or substituted as discussed above for heteroaryl and alkyl substituents.
[45] Amino acyl substituents include groups of the formula -C(O)-(CH2)n-C(H)(NRi3Ri4)- (CH2)n-R5 wherein n, Rn, R14 and R5 are described above. Suitable aminoacyl substituents include natural and non-natural amino acids such as glycinyl, D-tryptophanyl, L-lysinyl, D- or L-homoserinyl, 4-aminobutryic acyl, ±-3-amin-4-hexenoyl.
[46] Non-aromatic polycycle substituents include bicyclic and tricyclic fused ring systems where each ring can be 4-9 membered and each ring can contain zero, 1 or more double and/or triple bonds. Suitable examples of non-aromatic polycycles include decalin, octahydroindene, perhydrobenzocycloheptene, perhydrobenzo-[/]-azulene. Such substituents are unsubstituted or substituted as described above for cycloalkyl groups.
[47] Mixed aryl and non-aryl polycycle substituents include bicyclic and tricyclic fused ring systems where each ring can be 4 — 9 membered and at least one ring is aromatic. Suitable examples of mixed aryl and non-aryl polycycles include methylenedioxyphenyl, bis- methylenedioxyphenyl, 1,2,3,4-tetrahydronaphthalene, dibenzosuberane, dihdydroanthracene, 9H-fluorene. Such substituents are unsubstituted or substituted by nitro or as described above for cycloalkyl groups.
[48] Polyheteroaryl substituents include bicyclic and tricyclic fused ring systems where each ring can independently be 5 or 6 membered and contain one or more heteroatom, for example, 1, 2, 3, or 4 heteroatoms, chosen from O, N or S such that the fused ring system is aromatic. Suitable examples of polyheteroaryl ring systems include quinoline, isoquinoline, pyridopyrazine, pyrrolopyridine, furopyridine, indole, benzofuran, benzothiofuran, benzindole, benzoxazole, pyrroloquinoline, and the like. Unless otherwise noted, polyheteroaryl substituents are unsubstituted or substituted on a carbon atom by one or more suitable substituents, including alkyl, the alkyl substituents identified above and a substituent of the formula -0-(CH2CH=CH(CH3) (CH2)) 1-3H. Nitrogen atoms are unsubstituted or substituted, for example by Rj3; especially useful N substituents include H, Ci — C4 alkyl, acyl, aminoacyl, and sulfonyl.
[49] Non-aromatic polyheterocyclic substituents include bicyclic and tricyclic fused ring systems where each ring can be 4 — 9 membered, contain one or more heteroatom, for example, 1, 2, 3, or 4 heteroatoms, chosen from O, N or S and contain zero or one or more C- C double or triple bonds. Suitable examples of non-aromatic polyheterocycles include hexitol, cis-perhydro-cyclohepta[b]pyridinyl, decahydro-benzo[f][l,4]oxazepinyl, 2,8- dioxabicyclo[3.3.0]octane, hexahydro-thieno[3,2-b]thiophene, perhydropyrrolo[3,2-b]pyrrole, perhydronaphthyridine, perhydro-lH-dicyclopenta[b,e]pyran. Unless otherwise noted, non- aromatic polyheterocyclic substituents are unsubstituted or substituted on a carbon atom by one or more substituents, including alkyl and the alkyl substituents identified above. Nitrogen atoms are unsubstituted or substituted, for example, by R13; especially useful N substituents include H, C1 - C4 alkyl, acyl, aminoacyl, and sulfonyl.
[50] Mixed aryl and non-aryl polyheterocycles substituents include bicyclic and tricyclic fused ring systems where each ring can be 4 - 9 membered, contain one or more heteroatom chosen from O, N or S, and at least one of the rings must be aromatic. Suitable examples of mixed aryl and non-aryl polyheterocycles include 2,3-dihydroindole, 1,2,3,4-
tetrahydroquinoline,5, 11 -dihydro- 10H-dibenz[b,e] [ 1 ,4]diazepine,5H- dibenzo[b,e] [1 ,4]diazepine, 1 ,2-dihydropyrrolo[3,4-b][l ,5]benzodiazepine, 1,5-dihydro- pyrido[2,3-b] [1 ,4]diazepin-4-one, 1 ,2,3,4,6, 11 -hexahydro-benzo[b]pyrido[2,3-e] [1 ,4]diazepin- 5-one. Unless otherwise noted, mixed aryl and non-aryl polyheterocyclic substituents are unsubstituted or substituted on a carbon atom by one or more suitable substituents, including, -N-OH, =N-OH, alkyl and the alkyl substituents identified above. Nitrogen atoms are unsubstituted or substituted, for example, by R13; especially useful N substituents include H, Ci - C4 alkyl, acyl, aminoacyl, and sulfonyl.
[51] Amino substituents include primary, secondary and tertiary amines and in salt form, quaternary amines. Examples of amino substituents include mono- and di-alkylamino, mono- and di-aryl amino, mono- and di-arylalkyl amino, aryl-arylalkylamino, alkyl-arylamino, alkyl- arylalkylamino and the like.
[52] Sulfonyl substituents include alkylsulfonyl and arylsulfonyl, for example methane sulfonyl, benzene sulfonyl, tosyl and the like.
[53] Acyl substituents include groups of formula -C(O)-W, -OC(O)-W, -C(O)-O-W or - C(O)NRi3Ri4, where W is R16, H or cycloalkylalkyl.
[54] Acylamino substituents include substituents of the formula -N(R^)C(O)-W, — N(R12)C(O)-O-W, and -N(Ri2)C(O)-NHOH and Ri2 and W are defined above. [55] The R2 substituent HON-C(O)-CH=C(Ri)-aryl-alkyl- is a group of the formula
Preferences for each of the substituents include the following: Ri is H, halo, or a straight chain Ci -C4 alkyl;
R2 is selected from H, C1-C6 alkyl, C4 — C9 cycloalkyl, C4 — C9 heterocycloalkyl, alkylcycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl, -(CH2)nC(O)R6, amino acyl, and -(CHa)nR7;
R3 and R4 are the same or different and independently selected from H, and C1-C6 alkyl, or R3 and R4 together with the carbon to which they are bound represent C=O, C=S, or C=NR8;
Rs is selected from H, C1-CO alkyl, C4 — C9 cycloalkyl, C4 — C9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl, a aromatic polycycle, a non-aromatic polycycle, a mixed aryl and non-aryl polycycle, polyheteroaryl, a non-aromatic polyheterocycle, and a mixed aryl and non-aryl polyheterocycle; n, ni, ri2 and n3 are the same or different and independently selected from 0 — 6, when ni is 1-6, each carbon atom is unsubstituted or independently substituted with R3 and/or R4;
X and Y are the same or different and independently selected from H5 halo, C1-C4 alkyl, CF3, NO2, C(O)R1, OR9, SR9, CN, and NRi0Rn;
Re is selected from H, Ci-C6 alkyl, C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, alkylcycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl, OR12, and NR13R14;
R7 is selected from OR15, SR15, S(O)Ri6, SO2Rn, NRJ3RH, and NR12SO2R6;
R8 is selected from H, ORi5, NR13Rj4, CrC6 alkyl, C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, and heteroarylalkyl;
R9 is selected from C1 - C4 alkyl and C(O)-alkyl;
Rio and Rj 1 are the same or different and independently selected from H, Cj-C4 alkyl, and -C(O)-alkyl;
R12 is selected from H, Ci-Ce alkyl, C4 — C9 cycloalkyl, C4 - C9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, and heteroarylalkyl;
R13 and Ri4 are the same or different and independently selected from H, Cj-C6 alkyl, C4 — C9 cycloalkyl, C4 — C9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl and amino acyl;
Ris is selected from H, Ci-C6 alkyl, C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl and (CH2)mZRi2;
R16 is selected from Ci-Ce alkyl, C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl and (CH2)mZRi2;
R17 is selected from C1-C6 alkyl, C4 — C9 cycloalkyl, C4 - C9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl and NRi3Rj4; m is an integer selected from 0 to 6; and
Z is selected from O, NRj3, S, S(O).
[56] Useful compounds of the formula (I) include those wherein each of Ri, X, Y, R3, and R4 is H, including those wherein one of n2 and n3 is zero and the other is 1, especially those wherein R2 is H Or-CH2-CH2-OH.
[57] One suitable genus of hydroxamate compounds are those of formula Ia:
R2 is selected from H, Cj-C6 alkyl, C4 — C9 cycloalkyl, C4 - C9 heterocycloalkyl, alkylcycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl, -(CHa)nC(O)R^, amino acyl and - (CHz)nR7;
[58] hi one embodiment, R5' is heteroaryl, heteroarylalkyl (e.g., pyridylmethyl), aromatic polycycles, non-aromatic polycycles, mixed aryl and non-aryl polycycles, polyheteroaryl, or mixed aryl and non-aryl polyheterocycles.
[59] In another embodiment, R5' is aryl, arylalkyl, aromatic polycycles, non-aromatic polycycles, and mixed aryl and non-aryl polycycles; especially aryl, such as p-fluorophenyl, p-chlorophenyl. p-O-Ci-C4-alkylphenyl, such as p-methoxyphenyl, and p-Ci-C4-alkylphenyl; and arylalkyl. such as benzyl, ortho, meta or />αrø-fluorobenzyl, ortho, meta or para- chlorobenzyl, ortho, meta or para-mono, di or tri-O-Ci-C4-alkylbenzyl, such as ortho, meta or /?αrø-methoxybenzyl, røyo-diethoxybenzyl, o,/w,/>-triimethoxybenzyl , and ortho, meta or para- mono, di or tri Ci-C4-alkylphenyl, such asp-methyl, m,m-diethylphenyl. [60] As used herein, the term "effective amount" of a compound is a quantity sufficient to achieve a desired pharmacodynamic, toxicologic, therapeutic and/or prophylactic effect, for example, an amount which results in the prevention of or a decrease in the symptoms associated with a disease that is being treated, e.g., the diseases associated with HDAC6 mutant polypeptides and HDAC6 mutant polynucleotides identified herein. The amount of compound administered to the subject will depend on the type and severity of the disease and on the characteristics of the individual, such as general health, age, sex, body weight and tolerance to drugs. It will also depend on the degree, severity and type of disease. The skilled artisan will be able to determine appropriate dosages depending on these and other factors. Typically, an effective amount of the compounds of the present invention, sufficient for achieving a therapeutic or prophylactic effect, range from about 0.000001 mg per kilogram body weight per day to about 10,000 mg per kilogram body weight per day. Preferably, the dosage ranges are from about 0.0001 mg per kilogram body weight per day to about 100 mg
per kilogram body weight per day. The compounds of the present invention can also be administered in combination with each other, or with one or more additional therapeutic compounds.
[61] Glivec® (Gleevec®; imatinib) is a medication for chronic myeloid leukaemia (CML) and certain stages of gastrointestinal stromal tumours (GIST). It targets and interferes with the molecular abnormalities that drive the growth of cancer cells. Corless CL et al, J. Clin. Oncol. 22(18):3813-25 (September 15, 2004); Verweij J et al, Lancet 364(9440): 1127-34 (September 25, 2004); Kantarjian HM et al, Blood 104(7): 1979-88 (October 1, 2004). By inhibiting multiple targets, Glivec® has potential as an anticancer therapy for several types of cancer, including leukaemia and solid tumours.
[62] The aromatase inhibitor FEMARA® is a treatment for advanced breast cancer in postmenopausal women. It blocks the use of oestrogen by certain types of breast cancer that require oestrogen to grow. Janicke F, Breast 13 Suppl l:S10-8 (December 2004); Mouridsen H et al., Oncologist 9(5):489-96 (2004).
[63] Sandostatin® LAR® is used to treat patients with acromegaly and to control symptoms, such as severe diarrhoea and flushing, in patients with functional gastro-entero- pancreatic (GEP) tumours (e.g., metastatic carcinoid tumours and vasoactive intestinal peptide-secreting tumours [VIPomas]). Oberg K, Chemotherapy 47 Suppl 2:40-53 (2001); Raderer M et al., Oncology 60(2):141-5 (2001); Aparicio T et al, Eur. J. Cancer 37(8):1014- 9 (May 2001). Sandostatin® LAR® regulates hormones in the body to help manage diseases and their symptoms.
[64] ZOMET A® is a treatment for hypocalcaemia of malignancy (HCM)I and for the treatment of bone metastases across a broad range of tumour types. These tumours include multiple myeloma, prostrate cancer, breast cancer, lung cancer, renal cancer and other solid tumours. Rosen LS et al, Cancer 100(12):2613-21 (June 15, 2004).
[65] Vatalanib (l-[4-chloroanilino]-4-[4-pyridylmethyl] phthalazine succinate) is a multi- VEGF receptor (VEGF) inhibitor that may block the creation of new blood vessels to prevent tumour growth. This compound inhibits all known VEGF receptor tyrosine kinases, blocking angiogenesis and lymphangiogenesis. Drevs J et al, Cancer Res. 60:4819-4824 (2000); Wood JM et al, Cancer Res. 60:2178-2189 (2000). Vatalanib is being studied in two large, multinational, randomized, phase III, placebo-controlled trials in combination with FOLFOX- 4 in first-line and second-line treatment of patients with metastatic colorectal cancer. Thomas
A et al., 37th Annual Meeting of the American Society of Clinical Oncology, San Francisco, CA, Abstract 279 (May 12-15, 2001).
[66] The orally bioavailable rapamycin derivative everolirnus inhibits oncogenic signalling in tumour cells. By blocking the mammalian target of rapamycin (mTOR)-mediated signalling, everolimus exhibits broad antiproliferative activity in tumour cell lines and animal models of cancer. Boulay A et ah, Cancer Res. 64:252-261 (2004). In preclinical studies, everolimus also potently inhibited the proliferation of human umbilical vein endothelial cells directly indicating an involvement in angiogenesis. By blocking tumour cell proliferation and angiogenesis, everolimus may provide a clinical benefit to patients with cancer. Everolimus is being investigated for its antitumour properties in a number of clinical studies in patients with haematological and solid tumours. Huang S & Houghton PJ, Curr. Opin. Investig. Drugs 3:295-304 (2002).
[67] Gimatecan is a novel oral inhibitor of topoisomerase I (topo I). Gimatecan blocks cell division in cells that divide rapidly, such as cancer cells, which activates apoptosis. Preclinical data indicate that gimatecan is not a substrate for multidrug resistance pumps, and that it increases the drug-target interaction. De Cesare M et ah, Cancer Res. 61:7189-7195 (2001). Phase I clinical studies indicate that the dose-limiting toxicity of gimatecan is myelosuppression.
[68] Patupilone is a microtubule stabilizer. Altmann K-H, Curr. Opin. Chem. Biol. 5:424- 431 (2001); Altmann K-H et al, Biochim Biophys Acta. 470:M79-M91 (2000); O'Neill V et al., 36th Annual Meeting of the American Society of Clinical Oncology; May 19-23, 2000; New Orleans, LA, Abstract 829; Calvert PM et al. Proceedings of the 11th National Cancer Institute-European Organization for Research and Treatment of Cancer/American Association for Cancer Research Symposium on New Drugs in Cancer Therapy; November 7-10, 2000; Amsterdam, The Netherlands, Abstract 575. Patupilone blocked mitosis and induced apoptosis greater than the frequently used anticancer drug paclitaxel. Also, patupilone retained full activity against human cancer cells that were resistant to paclitaxel and other chemotherapeutic agents.
[69] Midostaurin is an inhibitor of multiple signalling proteins. By targeting specific receptor tyrosine kinases and components of several signal transduction pathways, midostaurin impacts several targets involved in cell growth (e.g., KIT, PDGFR, PKC), leukaemic cell proliferation (e.g., FLT3), and angiogenesis (e.g., VEGFR2). Weisberg E et
al Cancer Cell 1:433-443 (2002); Fabbro D et al, Anticancer Drug Des. 15:17-28 (2000). In preclinical studies, midostaurin showed broad antiproliferative activity against various tumour cell lines, including those that were resistant to several other chemotherapeutic agents. [70] The somatostatin analogue pasireotide is a stable cyclohexapeptide with broad somatotropin release inhibiting factor (SRIF) receptor binding. Bruns C et al., Eur. J. Endocrinol. 146(5):707-16 (May 2002); Weckbecker G et al, Endocrinology 143(10):4123- 30 (October 2002); Oberg K, Chemotherapy 47 Suppl 2:40-53 (2001).
[71] LBH589 is a histone deacetylase (HDAC) inhibitor. By blocking the deacetylase activity of HDAC, HDAC inhibitors activate gene transcription of critical genes that cause apoptosis (programmed cell death). By triggering apoptosis, LBH589 induces growth inhibition and regression in tumour cell lines. LBH589 is being tested in phase I clinical trials as an anticancer agent. See also, George P et al., Blood 105(4): 1768-76 (February 15, 2005). [72] AEE788 inhibits multiple receptor tyrosine kinases including EGFR, HER2, and VEGFR5 which stimulate tumour cell growth and angiogenesis. Traxler P et al., Cancer Res. 64:4931-4941 (2004). In preclinical studies, AEE788 showed high target specificity and demonstrated antiproliferative effects against tumour cell lines and in animal models of cancer. AEE788 also exhibited direct antiangiogenic activity. AEE788 is currently in phase I clinical development.
[73] AMN 107 is an oral tyrosine kinase inhibitor that targets Bcr-Abl, ICIT, and PDGFR. Preclinical studies have shown in cellular assays using Philadelphia chromosome-positive (Ph+) CML cells that AMN 107 is highly potent and has high selectivity for Bcr-Abl, KIT, and PDGFR. Weisberg E et al, Cancer Cell 7(2): 129-41 (February 2005); O'Hare T et al, Cancer Cell 7(2): 117-9 (February 2005). AMN 107 also shows activity against mutated variants of Bcr-Abl. AMNl 07 is currently being studied in phase I clinical trials. [74] As used herein, the term "HDAC6 modulating agent" is any compound that alters {e.g., increases or decreases). The expression level or biological activity level of HDAC6 polypeptide compared to the expression level or biological activity level of HDAC6 polypeptide in the absence of the HDAC6 modulating agent. HDAC6 modulating agent can be a small molecule, antibody, polypeptide, carbohydrate, lipid, nucleotide, or combination thereof. The HDAC6 modulating agent can be an organic compound or an inorganic compound.
[75] As used herein, "expression" includes but is not limited to one or more of the following: transcription of the gene into precursor mRNA; splicing and other processing of the precursor mRNA to produce mature mRNA; mRNA stability; translation of the mature mRNA into protein (including codon usage and tRNA availability); and glycosylation and/or other modifications of the translation product, if required for proper expression and function. [76] As used herein, the term "gene" means a segment of DNA that contains all the information for the regulated biosynthesis of an RNA product, including promoters, exons, introns, and other untranslated regions that control expression.
[77] As used herein, the term "genotype" means an unphased 5' to 3' sequence of nucleotide pairs found at one or more polymorphic or mutant sites in a locus on a pair of homologous chromosomes in an individual. As used herein, genotype includes a full- genotype and/or a sub-genotype.
[78] As used herein, the term "locus" means a location on a chromosome or DNA molecule corresponding to a gene or a physical or phenotypic feature.
[79] As used herein, the term "mutant" means any heritable or acquired variation from the wild-type that alters the nucleotide sequence thereby changing the protein sequence. The term "mutant" is used interchangeably with the terms "marker", "biomarker", and "target" throughout the specification.
[80] As used herein, the term "medical condition" includes, but is not limited to, any condition or disease manifested as one or more physical and/or psychological symptoms for which treatment and/or prevention is desirable, and includes previously and newly identified diseases and other disorders.
[81] As used herein, the term "nucleotide pair" means the two nucleotides bound to each other between the two nucleotide strands.
[82] As used herein, the term "polymorphic site" means a position within a locus at which at least two alternative sequences are found in a population, the most frequent of which has a frequency of no more than 99%.
[83] As used herein, the term "polymorphism" means any sequence variant present at a frequency of >1% in a population. The sequence variant may be present at a frequency significantly greater than 1% such as 5% or 10% or more. Also, the term may be used to refer to the sequence variation observed in an individual at a polymorphic site. Polymorphisms
include nucleotide substitutions, insertions, deletions and microsatellites and may, but need not, result in detectable differences in gene expression or protein function. [84] As used herein, the term "polynucleotide" means any RNA or DNA, which may be unmodified or modified RNA or DNA. Polynucleotides include, without limitation, single- and double-stranded DNA, DNA that is a mixture of single- and double-stranded regions, single- and double-stranded RNA5 RNA that is mixture of single- and double-stranded regions, and hybrid molecules comprising DNA and RNA that may be single-stranded or, more typically, double-stranded or a mixture of single- and double-stranded regions. In addition, polynucleotide refers to triple-stranded regions comprising RNA or DNA or both RNA and DNA. The term polynucleotide also includes DNAs or RNAs containing one or more modified bases and DNAs or RNAs with backbones modified for stability or for other reasons. In a particular embodiment, the polynucleotide contains polynucleotide sequences from the HDAC6 gene.
[85] As used herein, the term "polypeptide" means any polypeptide comprising two or more amino acids joined to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres. Polypeptide refers to both short chains, commonly referred to as peptides, glycopeptides or oligomers, and to longer chains, generally referred to as proteins. Polypeptides may contain amino acids other than the 20 gene-encoded amino acids. Polypeptides include amino acid sequences modified either by natural processes, such as post- translational processing, or by chemical modification techniques that are well known in the art. Such modifications are well described in basic texts and in more detailed monographs, as well as in a voluminous research literature. In a particular embodiment, the polypeptide contains polypeptide sequences from the HDAC6 protein.
[86] As used herein, the term "small molecule" means a composition that has a molecular weight of less than about 5 kDa and more preferably less than about 2 kDa. Small molecules can be, e.g., nucleic acids, peptides, polypeptides, glycopeptides, peptidomimetics, carbohydrates, lipids, lipopolysaccharides, combinations of these, or other organic or inorganic molecules.
[87] As used herein, the term "mutant nucleic acid" means a nucleic acid sequence, which comprises a nucleotide that is variable within an otherwise identical nucleotide sequence between individuals or groups of individuals, thus, existing as alleles. Such mutant nucleic acids are preferably from about 15 to about 500 nucleotides in length. The mutant nucleic
acids may be part of a chromosome, or they may be an exact copy of a part of a chromosome, e.g., by amplification of such a part of a chromosome through PCR or through cloning. The mutant probes according to the invention are oligonucleotides that are complementary to a mutant nucleic acid.
[88] As used herein, the term "SNP nucleic acid" means a nucleic acid sequence, which comprises a nucleotide that is variable within an otherwise identical nucleotide sequence between individuals or groups of individuals, thus, existing as alleles. Such SNP nucleic acids are preferably from about 15 to about 500 nucleotides in length. The SNP nucleic acids may be part of a chromosome, or they may be an exact copy of a part of a chromosome, e.g., by amplification of such a part of a chromosome through PCR or through cloning. The SNP nucleic acids are referred to hereafter simply as "SNPs". The SNP probes according to the invention are oligonucleotides that are complementary to a SNP nucleic acid. In a particular embodiment, the SNP is in the HDAC6 gene.
[89] As used herein, the term "subject" as used herein refers to any living organism capable of eliciting an immune response. The term subject includes, but is not limited to, humans, nonhuman primates such as chimpanzees and other apes and monkey species; farm animals such as cattle, sheep, pigs, goats and horses; domestic mammals such as dogs and cats; laboratory animals including rodents such as mice, rats and guinea pigs, and the like. The term does not denote a particular age or sex. Thus, adult and newborn subjects, as well as fetuses, whether male or female, are intended to be covered.
[90] As used herein, the administration of an agent or drug to a subject or patient includes self-administration and the administration by another. It is also to be appreciated that the various modes of treatment or prevention of medical conditions as described are intended to mean "substantial", which includes total but also less than total treatment or prevention, and wherein some biologically or medically relevant result is achieved.
[91] The details of one or more embodiments of the invention are set forth in the accompanying description below. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, the preferred methods and materials are now described. Other features, objects, and advantages of the invention will be apparent from the description and the claims. In the specification and the appended claims, the singular forms include plural referents unless the context clearly dictates otherwise. Unless defined otherwise, all technical and scientific terms used herein
have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. AU references cited herein are incorporated herein by reference in their entireties and for all purposes to the same extent as if each individual publication, patent, or patent application was specifically and individually incorporated by reference in its entirety for all purposes.
HDAC6 Mutations and Polymorphisms of the Invention. To investigate HDAC6 mutations in association with cancer, e.g., acute myeloid leukemia (AML; a.ka., acute myelogenous leukemia), DHPLC analysis was conducted on blood samples from 15 AML patients and tumor tissue from 30 breast cancer samples. Lilleberg S.L., Curr. Opin. Drug. Discov. DeveL, 6(2):237-52 (2003). The two (2) HDAC6 missense mutations identified in AML tissue are summarized below in TABLE 1 and TABLE 2. The missense mutations were located in the human HDAC6 gene (NP_006035).
TABLE l
HDAC6 Mutations in AML Patients
Exon Mutation/SNP Allelic Unmutated Mutated Fraction Sequence Sequence
GGOCGG GAGGCAGCCATTGG GAGGCAGCCATTCG Exon 25 G930R 0.5 GGGAGCCATGC GGGAGCCATGC
SEQ ID NOrI SEQ ID NO:2
GGOCGG GGGGCCTCAGAATCT CGGGCCTCAGAATC Exon 26 G1064A 0.5 GAGGC GAGGC
SEQ ID NO:3 SEQ ID NO:4
[92] As shown above in TABLE 1 and further summarized below in TABLE 2, two (2)
HDAC6 missense mutations were identified in the present invention.
TABLE 2 HDAC6 Mutations in AML Patients
Gene Cancer NT change Mutation/SNP AUe. Frac Obs.
HDAC6 AML GGG>CGG G930R 0.5 1
NP006035 GGG>CGG G1064A 0.5 1
[93] Bioinformatics analysis of the HDAC6 mutations of the invention are further detailed in Example 1. This mutation may be used to identify patient responsiveness or resistantance to HDAC inhibitors.
[94] Identification of HDAC6 Mutations and Polymorphisms of the Invention in Human Cancers. Identification and Characterization of Gene Sequence Variation. Sequence variation in the human germline consists primarily of SNPs, the remainder being short tandem
repeats (including micro-satellites), long tandem repeats (mini-satellites), and other insertions and deletions. A SNP is the occurrence of nucleotide variability at a single position in the genome, in which two alternative bases occur at appreciable frequency (i.e., >1%) in the human population. A SNP may occur within a gene or within intergenic regions of the genome.
[95] Due to their prevalence and widespread nature, SNPs have the potential to be important tools for locating genes that are involved in human disease conditions. See e.g., Wang et al, Science 280: 1077-1082 (1998)).
[96] An association between SNP's and/or mutations and a particular phenotype {e.g., cancer type) does not necessarily indicate or require that the SNP or mutation is causative of the phenotype. Instead, an association with a SNP may merely be due to genome proximity between a SNP and those genetic factors actually responsible for a given phenotype, such that the SNP and said genetic factors are closely linked. That is, a SNP may be in linkage disequilibrium ("LD") with the "true" functional variant. LD exists when alleles at two distinct locations of the genome are more highly associated than expected. Thus, a SNP may serve as a marker that has value by virtue of its proximity to a mutation or other DNA alteration (e.g., gene duplication) that causes a particular phenotype.
[97] SNPs and mutations that are associated with disorders may also have a direct effect on the function of the genes in which they are located. For example, a sequence variant (e.g., SNP) may result in an amino acid change or may alter exon-intron splicing, thereby directly modifying the relevant protein, or it may exist in a regulatory region, altering the cycle of expression or the stability of the mRNA (see, e.g., Nowotny et al., Current Opinions in Neurobiology, 11:637-641 (2001)).
[98] In describing the polymorphic and mutant sites of the invention, reference is made to the sense strand of the gene for convenience. As recognized by the skilled artisan, however, nucleic acid molecules containing the gene may be complementary double stranded molecules and thus reference to a particular site on the sense strand refers as well to the corresponding site on the complementary antisense strand. That is, reference may be made to the same polymorphic or mutant site on either strand and an oligonucleotide may be designed to hybridize specifically to either strand at a target region containing the polymorphic and/or mutant site. Thus, the invention also includes single-stranded polynucleotides and mutations that are complementary to the sense strand of the genomic variants described herein.
[99] Identification and Characterization of SNPs and Mutations. Many different techniques can be used to identify and characterize SNPs and mutations, including single- strand conformation polymorphism (SSCP) analysis, heteroduplex analysis by denaturing high-performance liquid chromatography (DHPLC)5 direct DNA sequencing and computational methods (Shi et al, CHn Chem 47:164-172 (2001)). There is a wealth of sequence information in public databases; computational tools useful to identify SNPs in silico by aligning independently submitted sequences for a given gene (either cDNA or genomic sequences). The most ' common SNP-typing methods currently include hybridization, primer extension, and cleavage methods. Each of these methods must be connected to an appropriate detection system. Detection technologies include fluorescent polarization (Chan et al, Genome Res. 9:492-499 (1999)), luminometric detection of pyrophosphate release (pyrosequencing) (Ahmadiian et ah, Anal. Biochem. 280:103-10 (2000)), fluorescence resonance energy transfer (FRET)-based cleavage assays, DHPLC, and mass spectrometry (Shi, Clin Chem 47:164-172 (2001); U.S. Pat. No. 6,300,076 Bl). Other methods of detecting and characterizing SNPs and mutations are those disclosed in U.S. Pat. Nos. 6,297,018 Bl and 6,300,063 Bl.
[100] In a particularly preferred embodiment, the detection of polymorphisms and mutations is detected using INVADER™ technology (available from Third Wave Technologies Inc. Madison, Wisconsin USA). In this assay, a specific upstream "invader" oligonucleotide and a partially overlapping downstream probe together form a specific structure when bound to complementary DNA template. This structure is recognized and cut at a specific site by the Cleavase enzyme, resulting in the release of the 5' flap of the probe oligonucleotide. This fragment then serves as the "invader" oligonucleotide with respect to synthetic secondary targets and secondary fluorescently labelled signal probes contained in the reaction mixture. This results in specific cleavage of the secondary signal probes by the Cleavase enzyme. Fluorescent signal is generated when this secondary probe (labelled with dye molecules capable of fluorescence resonance energy transfer) is cleaved. Cleavases have stringent requirements relative to the structure formed by the overlapping DNA sequences or flaps and can, therefore, be used to specifically detect single base pair mismatches immediately upstream of the cleavage site on the downstream DNA strand. Ryan D et al, Molecular Diagnosis 4(2): 135-144 (1999) and Lyamichev V et al Nature Biotechnology 17: 292-296 (1999), see also U.S. Pat. Nos. 5,846,717 and 6,001,567.
[101] The identity of polymorphisms and mutations may also be determined using a mismatch detection technique including, but not limited to, the RNase protection method using riboprobes (Winter et al, Proc. Natl. Acad. ScL USA 82:7575 (1985); Meyers et al, Science 230:1242 (1985)) and proteins which recognize nucleotide mismatches, such as the E. coli mutS protein (Modrich P, Ann Rev Genet 25:229-253 (1991)). Alternatively, variant alleles can be identified by single strand conformation polymorphism (SSCP) analysis (Orita et al., Genomics 5:874-879 (1989); Humphries et al, in Molecular Diagnosis of Genetic Diseases, Elles R, ed. (1996) pp. 321-340) or denaturing gradient gel electrophoresis (DGGE) (Wartell et al, Nucl. Acids Res. 18:2699-2706 (1990); Sheffield et al, Proc. Natl. Acad. Sci. USA 86: 232-236 (1989)). A polymerase-mediated primer extension method may also be used to identify the polymorphisms/mutations. Several such methods have been described in the patent and scientific literature and include the "Genetic Bit Analysis" method (WO 92/15712) and the ligase/polymerase mediated genetic bit analysis (U.S. Pat. No. 5,679,524). Related methods are disclosed in WO 91/02087, WO 90/09455, WO 95/17676, and U.S. Pat. Nos. 5,302,509 and 5,945,283. Extended primers containing a polymorphism or mutation may be detected by mass spectrometry as described in U.S. Pat. No. 5,605,798. Another primer extension method is allele-specific PCR. Ruafio et al, Nucl Acids Res. 17: 8392 (1989); Ruafio et al, Nucl Acids Res. 19: 6877-6882 (1991); WO 93/22456; Turki et al, J. Clin. Invest. 95: 1635-1641 (1995). In addition, multiple polymorphic and/or mutant sites may be investigated by simultaneously amplifying multiple regions of the nucleic acid using sets of allele-specific primers as described in WO 89/10414.
[102] Haplotyping and Genotyping Oligonucleotides. The invention provides methods and compositions for haplotyping and/or genotyping the genetic polymorphisms (and possibly mutations) in an individual. As used herein, the terms "genotype" and "haplotype" mean the genotype or haplotype containing the nucleotide pair or nucleotide, respectively, that is present at one or more of the novel polymorphic (or mutant) sites described herein and may optionally also include the nucleotide pair or nucleotide present at one or more additional polymorphic (or mutant) sites in the gene. The additional polymorphic (and mutant) sites may be currently known polymorphic/mutant sites or sites that are subsequently discovered. [103] The compositions contain oligonucleotide probes and primers designed to specifically hybridize to one or more target regions containing, or that are adjacent to, a polymorphic or mutant site. Oligonucleotide compositions of the invention are useful in methods for
genotyping and/or haplotyping a gene in an individual. The methods and compositions for establishing the genotype or haplotype of an individual at the novel polymorphic/mutant sites described herein are useful for studying the effect of the polymorphisms and mutations in the aetiology of diseases affected by the expression and function of the protein, studying the efficacy of drugs targeting, predicting individual susceptibility to diseases affected by the expression and function of the protein and predicting individual responsiveness to drugs targeting the gene product.
[104] Some embodiments of the invention contain two or more differently labelled genotyping oligonucleotides, for simultaneously probing the identity of nucleotides at two or more polymorphic or mutant sites. It is also contemplated that primer compositions may contain two or more sets of allele-specific primer pairs to allow simultaneous targeting and amplification of two or more regions containing a polymorphic or mutant site. [105] Genotyping oligonucleotides of the invention may be immobilized on or synthesized on a solid surface such as a microchip, bead, or glass slide (see, e.g., WO 98/20020 and WO 98/20019). Such immobilized genotyping oligonucleotides may be used in a variety of polymorphism and mutation detection assays, including but not limited to probe hybridization and polymerase extension assays. Immobilized genotyping oligonucleotides of the invention may comprise an ordered array of oligonucleotides designed to rapidly screen a DNA sample for polymorphisms and mutations in multiple genes at the same time.
[106] An allele-specific oligonucleotide primer of the invention has a 3' terminal nucleotide, or preferably a 3' penultimate nucleotide, that is complementary to only one nucleotide of a particular SNP, thereby acting as a primer for polymerase-mediated extension only if the allele containing that nucleotide is present. Allele-specific oligonucleotide (ASO) primers hybridizing to either the coding or noncoding strand are contemplated by the invention. An ASO primer for detecting gene polymorphisms and mutations can be developed using techniques known to those of skill in the art.
Other genotyping oligonucleotides of the invention hybridize to a target region located one to several nucleotides downstream of one of the novel polymorphic or mutant sites identified herein. Such oligonucleotides are useful in polymerase-mediated primer extension methods for detecting one of the novel polymorphisms or mutations described herein and therefore such genotyping oligonucleotides are referred to herein as "primer-extension oligonucleotides". In a preferred embodiment, the 3 '-terminus of a primer-extension
oligonucleotide is a deoxynucleotide complementary to the nucleotide located immediately adjacent to the polymorphic/mutant site.
[107] Direct Genotyping Method of the Invention. One embodiment of a genotyping method of the invention involves isolating from an individual a nucleic acid mixture comprising at least one copy of the gene of interest and/or a fragment or flanking regions thereof, and determining the identity of the nucleotide pair at one or more of the polymorphic/mutant sites in the nucleic acid mixture. As will be readily understood by the skilled artisan, the two "copies" of a germline gene in an individual may be the same on each allele or may be different on each allele. In a particularly preferred embodiment, the genotyping method comprises determining the identity of the nucleotide pair at each polymorphic and mutant site. [108] Typically, the nucleic acid mixture is isolated from a biological sample taken from the individual, such as a blood sample, tumour or tissue sample. Suitable tissue samples include whole blood, tumour or as part of any tissue type, semen, saliva, tears, urine, fecal material, sweat, buccal smears, skin and hair. The nucleic acid mixture may be comprised of genomic DNA, mRNA, or cDNA and, in the latter two cases, the biological sample must be obtained from an organ in which the gene may be expressed. Furthermore, it will be understood by the skilled artisan that mRNA or cDNA preparations would not be used to detect polymorphisms or mutations located in introns or in 5' and 3' nontranscribed regions. If a gene fragment is isolated, it must usually contain the polymorphic and/or mutant sites to be genotyped. Exceptions can include mutations leading to truncation of the gene where a specific polymorphism may be lost. In these cases, the specific DNA alterations are determined by assessing the flanking sequences of the gene and underscore the need to specifically look for both polymorphisms and mutations.
[109] Direct Haplotyping Method of the Invention. One embodiment of the haplotyping method of the invention comprises isolating from an individual a nucleic acid molecule containing only one of the two copies of a gene of interest, or a fragment thereof, and determining the identity of the nucleotide at one or more of the polymorphic or mutant sites in that copy. The nucleic acid may be isolated using any method capable of separating the two copies of the gene or fragment. As will be readily appreciated by those skilled in the art, any individual clone will only provide haplotype information on one of the two gene copies present in an individual. If haplotype information is desired for the individual's other copy, additional clones will need to be examined. Typically, at least five clones should be
examined to have more than a 90% probability of haplotyping both copies of the gene in an individual. In a particularly preferred embodiment, the nucleotide at each polymorphic or mutant site is identified.
[110] In a preferred embodiment, a haplotype pair is determined for an individual by identifying the phased sequence of nucleotides at one or more of the polymorphic/mutant sites in each copy of the gene that is present in the individual. In a particularly preferred embodiment, the haplotyping method comprises identifying the phased sequence of nucleotides at each polymorphic/mutant site in each copy of the gene. When haplotyping both copies of the gene, the identifying step is preferably performed with each copy of the gene being placed in separate containers. However, if the two copies are labelled with different tags, or are otherwise separately distinguishable or identifiable, it is possible in some cases to perform the method in the same container. For example, if the first and second copies of the gene are labelled with different first and second fluorescent dyes, respectively, and an allele-specific oligonucleotide labelled with yet a third different fluorescent dye is used to assay the polymorphic/mutant sites, then detecting a combination of the first and third dyes would identify the polymorphism or mutation in the first gene copy, while detecting a combination of the second and third dyes would identify the polymorphism or mutation in the second gene copy.
[Ill] In both the genotyping and haplotyping methods, the identity of a nucleotide (or nucleotide pair) at a polymorphic and/or mutant site may be determined by amplifying a target region containing the polymorphic and/or mutant sites directly from one or both copies of the gene, or fragments thereof, and sequencing the amplified regions by conventional methods. It will be readily appreciated by the skilled artisan that only one nucleotide will be detected at a polymorphic or mutant site in individuals who are homozygous at that site, while two different nucleotides will be detected if the individual is heterozygous for that site. The polymorphism or mutation may be identified directly, known as positive-type identification, or by inference, referred to as negative-type identification. For example, where a SNP is known to be guanine and cytosine in a reference population, a site may be positively determined to be either guanine or cytosine for all individuals homozygous at that site, or both guanine and cytosine, if the individual is heterozygous at that site. Alternatively, the site may be negatively determined to be not guanine (and thus cytosine/cytosine) or not cytosine (and thus guanine/guanine).
[112] Indirect Genotyping Method using Polymorphic and Mutation Sites in Linkage Disequilibrium with a Target Polymorphism or Mutation. In addition, the identity of the alleles present at any of the novel polymorphic/mutant sites of the invention may be indirectly determined by genotyping other polymorphic/mutant sites in linkage disequilibrium with those sites of interest. As described supra, two sites are said to be in linkage disequilibrium if the presence of a particular variant (polymorphism or mutation) at one site is indicative of the presence of another variant at a second site. See, Stevens JC, MoL Diag. 4:309-317 (1999). Polymorphic and mutant sites in linkage disequilibrium with the polymorphic or mutant sites of the invention may be located in regions of the same gene or in other genomic regions. Genotyping of a polymorphic/mutant site in linkage disequilibrium with the novel polymorphic/mutant sites described herein may be performed by, but is not limited to, any of the above-mentioned methods for detecting the identity of the allele at a polymorphic/mutant site.
[113] Amplifying a Target Gene Region. The target regions may be amplified using any oligonucleotide-directed amplification method, including but not limited to polymerase chain reaction (PCR). (U.S. Pat. No. 4,965,188), ligase chain reaction (LCR) (Barany et ah, Proc. Natl. Acad. Set USA 88:189-193 (1991); published PCT patent application WO 90/01069), and oligonucleotide ligation assay (OLA) (Landegren et al., Science 241: 1077-1080 (1988)). Oligonucleotides useful as primers or probes in such methods should specifically hybridize to a region of the nucleic acid that contains or is adjacent to the polymorphic/mutant site. Typically, the oligonucleotides are between 10 and 35 nucleotides in length and preferably, between 15 and 30 nucleotides in length. Most preferably, the oligonucleotides are 20 to 25 nucleotides long. The exact length of the oligonucleotide will depend on many factors that are routinely considered and practiced by the skilled artisan.
[114] Other known nucleic acid amplification procedures may be used to amplify the target region including transcription-based amplification systems (U.S. Pat. No. 5,130,238; EP 329,822; U.S. Pat. No. 5,169,766, published PCT patent application WO 89/06700) and isothermal methods (Walker et al, Proc. Natl. Acad. Sci. USA 89: 392-396 (1992)). [115] A polymorphism or mutation in the target region may be assayed before or after amplification using one of several hybridization-based methods known in the art. Typically, allele-specifϊc oligonucleotides are utilized in performing such methods. The allele-specific oligonucleotides may be used as differently labelled probe pairs, with one member of the pair
showing a perfect match to one variant of a target sequence and the other member showing a perfect match to a different variant. In some embodiments, more than one polymorphic/mutant site may be detected at once using a set of allele-specific oligonucleotides or oligonucleotide pairs. Preferably, the members of the set have melting temperatures within 5°C, and more preferably within 2°C, of each other when hybridizing to each of the polymorphic or mutant sites being detected.
[116] Hybridizing Allele-Specific Oligonucleotide to a Target Gene. Hybridization of an allele-specific oligonucleotide to a target polynucleotide may be performed with both entities in solution, or such hybridization may be performed when either the oligonucleotide or the target polynucleotide is covalently or noncovalently affixed to a solid support. Attachment may be mediated, for example, by antibody-antigen interactions, poly-L-Lys, streptavidin or avidin-biotin, salt bridges, hydrophobic interactions, chemical linkages, UV cross-linking, baking, etc. Allele-specific oligonucleotide may be synthesized directly on the solid support or attached to the solid support subsequent to synthesis. Solid-supports suitable for use in detection methods of the invention include substrates made of silicon, glass, plastic, paper and the like, which may be formed, for example, into wells (as in 96-well plates), slides, sheets, membranes, fibres, chips, dishes, and beads. The solid support may be treated, coated or derivatised to facilitate the immobilization of the allele-specific oligonucleotide or target nucleic acid.
[117] The genotype or haplotype for the gene of an individual may also be determined by hybridization of a nucleic sample containing one or both copies of the gene to nucleic acid arrays and subarrays such as described in WO 95/11995. The arrays would contain a battery of allele-specific oligonucleotides representing each of the polymorphic or mutant sites to be ■ included in the genotype or haplotype.
[118] Determining Population Genotypes and Haplotypes and Correlating them with a Trait. The present invention provides a method for determining the frequency of a genotype or haplotype in a population. The method comprises determining the genotype or the haplotype for a gene present in each member of the population, wherein the genotype or haplotype comprises the nucleotide pair or nucleotide detected at one or more of the polymorphic sites in the gene and mutations identified in the region, and calculating the frequency at which the genotype or haplotype is found in the population. The population may be a reference population, a family population, a same sex population, a population group, or
a trait population (e.g., a group of individuals exhibiting a trait of interest such as a medical condition or response to a therapeutic treatment).
[119] In another aspect of the invention, frequency data for genotypes and/or haplotypes found in a reference population are used in a method for identifying an association between a trait and a genotype or a haplotype. The trait may be any detectable phenotype, including but not limited to cancer, susceptibility to a disease or response to a treatment. The method involves obtaining data on the frequency of the genotypes or haplotypes of interest in a reference population and comparing the data to the frequency of the genotypes or haplotypes in a population exhibiting the trait. Frequency data for one or both of the reference and trait populations may be obtained by genotyping or haplotyping each individual in the populations using one of the methods described above. The haplotypes for the trait population may be determined directly or, alternatively, by the predictive genotype to haplotype approach described above.
[120] In preferred embodiments, the trait is susceptibility to a disease, severity of a disease, the staging of a disease or response to a drug. Such methods have applicability in developing diagnostic tests and therapeutic treatments for all pharmacogenetic applications where there is the potential for an association between a genotype and a treatment outcome, including efficacy measurements, PD measurements, PK measurements and side effect measurements. [121] In another embodiment, the frequency data for the reference and/or trait populations are obtained by accessing previously determined frequency data, which may be in written or electronic form. For example, the frequency data may be present in a database that is accessible by a computer. Once the frequency data are obtained, the frequencies of the genotypes or haplotypes of interest in the reference and trait populations are compared. In a preferred embodiment, the frequencies of all genotypes and/or haplotypes observed in the populations are compared. If a particular genotype or haplotype for the gene is more frequent in the trait population than in the reference population at a statistically significant amount, then the trait is predicted to be associated with that genotype or haplotype. [122] In a preferred embodiment, the haplotype frequency data for different ethnogeographic groups are examined to determine whether they are consistent with Hardy-Weinberg equilibrium. Hartl DL et al., Principles of Population Genomics, 3rd Ed. (Sinauer Associates, Sunderland, MA, 1997). Hardy-Weinberg equilibrium postulates that the frequency of finding the haplotype pair H\/H2 is equal to PH-w (HiZH2) = 2p(H{)p (H2) if Hx ≠ H2 and PH-w
(HiIHi) =p (Hi)p (H.) if Hi = H2- A statistically significant difference between the observed and expected haplotype frequencies could be due to one or more factors including significant inbreeding in the population group, strong selective pressure on the gene, sampling bias, and/or errors in the genotyping process. If large deviations from Ηardy-Weinberg equilibrium are observed in an ethnogeographic group, the number of individuals in that group can be increased to see if the deviation is due to a sampling bias. If a larger sample size does not reduce the difference between observed and expected haplotype pair frequencies, then one may wish to consider haplotyping the individual using a direct haplotyping method such as, for example, CLASPER System™ technology (U.S. Pat. No. 5,866,404), SMD, or allele- specific long-range PCR (Michalotos-Beloin et ah, Nucl. Acids Res. 24: 4841-4843 (1996)). [123] In one embodiment of this method for predicting a haplotype pair, the assigning step involves performing the following analysis. First, each of the possible haplotype pairs is compared to the haplotype pairs in the reference population. Generally, only one of the haplotype pairs in the reference population matches a possible haplotype pair and that pair is assigned to the individual. Occasionally, only one haplotype represented in the reference haplotype pairs is consistent with a possible haplotype pair for an individual, and in such cases the individual is assigned a haplotype pair containing this known haplotype and a new haplotype derived by subtracting the known haplotype from the possible haplotype pair. In rare cases, either no haplotypes in the reference population are consistent with the possible haplotype pairs, or alternatively, multiple reference haplotype pairs are consistent with the possible haplotype pairs. In such cases, the individual is preferably haplotyped using a direct molecular haplotyping method such as, for example, those discussed supra. [124] In a preferred embodiment, statistical analysis is performed by the use of standard ANOVA tests with a Bonferoni correction and/or a bootstrapping method that simulates the genotype phenotype correlation many times and calculates a significance value. When many polymorphisms and/or mutations are being analyzed, a calculation may be performed to correct for a significant association that might be found by chance. For statistical methods useful in the methods of the present invention, see Bailey NTJ, Statistical Methods in Biology, 3rd Edition (Cambridge Univ. Press, Cambridge, 1997); Waterman MS, Introduction to Computational Biology (CRC Press, 2000) and Bioinformatics, Baxevanis AD & Ouellette BFF, eds. (John Wiley & Sons, Inc., 2001).
[125] In a preferred embodiment of the method, the trait of interest is a clinical response exhibited by a patient to some therapeutic treatment, for example, response to a drug targeting or to a therapeutic treatment for a medical condition.
[126] In another embodiment of the invention, a detectable genotype or haplotype that is in linkage disequilibrium with a genotype or haplotype of interest may be used as a surrogate marker. A genotype that is in linkage disequilibrium with another genotype is indicated where a particular genotype or haplotype for a given gene is more frequent in the population that also demonstrates the potential surrogate marker genotype than in the reference population. If the frequency is statistically significant, then the marker genotype is predictive of that genotype or haplotype, and can be used as a surrogate marker.
[127] Correlating Subject Genotype or Haplotype to Treatment Response. In order to deduce a correlation between a clinical response to a treatment and a genotype or haplotype, genotype or haplotype data is obtained on the clinical responses exhibited by a population of individuals who received the treatment, hereinafter the "clinical population". This clinical data may be obtained by analyzing the results of a clinical trial that has already been previously conducted and/or by designing and carrying out one or more new clinical trials. [128] It is preferred that the individuals included in the clinical population be graded for the existence of the medical condition of interest. This grading of potential patients could employ a standard physical exam or one or more lab tests. Alternatively, grading of patients could use genotyping or haplotyping for situations where there is a strong correlation between haplotype pair and disease susceptibility or severity.
[129] The therapeutic treatment of interest is administered to each individual in the trial population, and each individual's response to the treatment is measured using one or more predetermined criteria. It is contemplated that in many cases, the trial population will exhibit a range of responses, and that the investigator may choose more than one responder groups (e.g., low, medium, high) made up by the various responses. In addition, the gene for each individual in the trial population is genotyped and/or haplotyped, which may be done before or after administering the treatment.
[130] These results are then analyzed to determine if any observed variation in clinical response between polymorphism/mutation groups is statistically significant. Statistical analysis methods, which may be used, are described in Fisher LD & vanBelle G, Biostatistics: A Methodology for the Health Sciences (Wiley-lnterscience, New York, 1993). This analysis
may also include a regression calculation of which polymorphic/mutation sites in the gene contribute most significantly to the differences in phenotype.
[131] A second method for finding correlations between genotype and haplotype content and clinical responses uses predictive models based on error-minimizing optimization algorithms, one of which is a genetic algorithm. Judson R, Genetic Algorithms and Their Uses in Chemistry, in Reviews in Computational Chemistry, Vol. 10, Lipkowitz KB & Boyd DB5 eds. (VCH Publishers, New York, 1997) pp. 1-73. Simulated annealing (Press et al, Numerical Recipes in C: The Art of Scientific Computing, Ch. 10 (Cambridge University Press, Cambridge, 1992)), neural networks (Rich E & Knight K, Artificial Intelligence, 2nd Edition, Ch. 10 (McGraw-Hill, New York, 1991), standard gradient descent methods (Press et al, Numerical Recipes in C: The Art of Scientific Computing, Ch. 10 (Cambridge University Press, Cambridge, 1992), or other global or local optimization approaches (see discussion in Judson, supra) can also be used.
[132] Correlations may also be analyzed using analysis of variation (ANOVA) techniques to determine how much of the variation in the clinical data is explained by different subsets of the polymorphic and mutant sites in the gene. ANOVA is used to test hypotheses about whether a response variable is caused by or correlates with one or more traits or variables that can be measured (Fisher & vanBelle, supra, Ch. 10).
[133] After the clinical, mutation and polymorphism data have been obtained, correlations between individual response and genotype or haplotype content are created. Correlations may be produced in several ways. In one method, individuals are grouped by their genotype or haplotype (or haplotype pair) (also referred to as a polymorphism/mutation group), and then the averages and standard deviations of clinical responses exhibited by the members of each polymorphism/mutation group are calculated.
[134] From the analyses described above, the skilled artisan that predicts clinical response as a function of genotype or haplotype content may readily construct a mathematical model. The identification of an association between a clinical response and a genotype or haplotype (or haplotype pair) for the gene may be the basis for designing a diagnostic method to determine those individuals who will or will not respond to the treatment, or alternatively, will respond at a lower level and thus may require more treatment, i.e., a greater dose of a drug or suffer an adverse reaction. The diagnostic method may take one of several forms: for example, a direct DNA test (i.e., genotyping or haplotyping one or more of the polymorphic/mutant sites in the
gene), a serological test, or a physical exam measurement. The only requirement is that there be a good correlation between the diagnostic test results and the underlying genotype or haplotype. In a preferred embodiment, this diagnostic method uses the predictive genotyping/haplotyping method described above.
[135] Patient Selection for Therapy Based Upon Polymorphisms and/or Mutations. The application of genotypes and/or haplotypes that correlate with efficacious drug responses will be used to select patients for therapy of existing diseases. Genotypes and haplotypes that correlate with adverse consequences will be used to either modify how the drug is administered {e.g., dose, schedule or in combination with other drugs) or eliminated as an option.
[136] Patient Selection for Prophylactic Therapy Based Upon Polymorphisms and/or Mutations. The application of genotypes and/or haplotypes that correlate with a predisposition for disease will be used to select patients for preventative therapy. [137] Computer System for Storing or Displaying Polymorphism and Mutation Data. The invention also provides a computer system for storing and displaying polymorphism and χ mutation data determined for the gene. The computer system comprises a computer processing unit, a display, and a database containing the polymorphism/mutation data. The polymorphism/mutation data includes the polymorphisms, mutations, the genotypes and the haplotypes identified for a given gene in a reference population. In a preferred embodiment, the computer system is capable of producing a display showing haplotypes organized according to their evolutionary relationships. A computer may implement any or all analytical and mathematical operations involved in practicing the methods of the present invention. In addition, the computer may execute a program that generates views (or screens) displayed on a display device and with which the user can interact to view and analyze large amounts of information relating to the gene and its genomic variation, including chromosome location, gene structure, and gene family, gene expression data, polymorphism data, mutation data, genetic sequence data, and clinical population data (e.g., data on ethnogeographic origin, clinical responses, genotypes, and haplotypes for one or more populations). The polymorphism and mutation data described herein may be stored as part of a relational database (e.g., an instance of an Oracle database or a set of ASCII flat files). These polymorphism and mutation data may be stored on the computer's hard drive or may, for example, be stored on a CD-ROM or on one or more other storage devices accessible by the
computer. For example, the data may be stored on one or more databases in communication with the computer via a network.
[138] Nucleic Acid-based Diagnostics. In another aspect, the invention provides SNP and mutation probes, which are useful in classifying subjects according to their types of genetic variation. The SNP and mutation probes according to the invention are oligonucleotides, which discriminate between SNPs or mutations and the wild-type sequence in conventional allelic discrimination assays. In certain preferred embodiments, the oligonucleotides according to this aspect of the invention are complementary to one allele of the SNP/mutant nucleic acid, but not to any other allele of the SNP/Mutant nucleic acid. Oligonucleotides according to this embodiment of the invention can discriminate between SNPs and mutations in various ways. For example, under stringent hybridization conditions, an oligonucleotide of appropriate length will hybridize to one SNP or mutation, but not to any other. The oligonucleotide may be labelled using a radiolabel or a fluorescent molecular tag. Alternatively, an oligonucleotide of appropriate length can be used as a primer for PCR, wherein the 3' terminal nucleotide is complementary to one allele containing a SNP or mutation, but not to any other allele. In this embodiment, the presence or absence of amplification by PCR determines the haplotype of the SNP or the specific mutation. [139] Genomic and cDNA fragments of the invention comprise at least one novel polymorphic site or mutation identified herein, have a length of at least 10 nucleotides, and may range up to the full length of the gene. Preferably, a fragment according to the present invention is between 100 and 3000 nucleotides in length, and more preferably between 200 and 2000 nucleotides in length, and most preferably between 500 and 1000 nucleotides in length.
[140] Kits of the Invention. The invention provides nucleic acid and polypeptide detection kits useful for haplotyping and/or genotyping the genes in an individual. Such kits are useful for classifying individuals for the purpose of classifying individuals. Specifically, the invention encompasses kits for detecting the presence of a polypeptide or nucleic acid corresponding to a marker of the invention in a biological sample, e.g., any tissue or bodily fluid including, but not limited to, serum, plasma, lymph, cystic fluid, urine, stool, cerebrospinal fluid, ascites fluid or blood, and including biopsy samples of body tissue. For example, the kit can comprise a labelled compound or agent capable of detecting a polypeptide or an mRNA encoding a polypeptide corresponding to a marker of the invention
in a biological sample and means for determining the amount of the polypeptide or mRNA in the sample, e.g., an antibody which binds the polypeptide or an oligonucleotide probe which binds to DNA or mRNA encoding the polypeptide. Kits can also include instructions for interpreting the results obtained using the kit.
[141] In another embodiment, the invention provides a kit comprising at least two genotyping oligonucleotides packaged in separate containers. The kit may also contain other components such as hybridization buffer (where the oligonucleotides are to be used as a probe) packaged in a separate container. Alternatively, where the oligonucleotides are to be used to amplify a target region, the kit may contain, packaged in separate containers, a polymerase and a reaction buffer optimized for primer extension mediated by the polymerase, such as in the case of PCR.
[142] In a preferred embodiment, such kit may further comprise a DNA sample collecting means. In particular, the genotyping primer composition may comprise at least two sets of allele specific primer pairs. Preferably, the two genotyping oligonucleotides are packaged in separate containers.
[143] For antibody-based kits, the kit can comprise, e.g., (1) a first antibody, e.g., attached to a solid support, which binds to a polypeptide corresponding to a marker or the invention; and, optionally; (2) a second, different antibody which binds to either the polypeptide or the first antibody and is conjugated to a detectable label.
[144] For oligonucleotide-based kits, the kit can comprise, e.g., (1) an oligonucleotide, e.g., a detectably-labelled oligonucleotide, which hybridizes to a nucleic acid sequence encoding a polypeptide corresponding to a marker of the invention; or (2) a pair of primers useful for amplifying a nucleic acid molecule corresponding to a marker of the invention. [145] The kit can also comprise, e.g., a buffering agent, a preservative or a protein- stabilizing agent. The kit can further comprise components necessary for detecting the detectable-label, e.g., an enzyme or a substrate. The kit can also contain a control sample or a series of control samples, which can be assayed and compared to the test sample. Each component of the kit can be enclosed within an individual container and all of the various containers can be within a single package, along with instructions for interpreting the results of the assays performed using the kit.
[146] Making Polymorphisms and Mutations of the Invention. Effects of the polymorphisms and mutations identified herein on gene expression may be investigated by preparing
recombinant cells and/or organisms, preferably recombinant animals, containing a polymorphic variant and/or mutation of the gene.
[147] In one aspect, the present invention includes one or more polynucleotides encoding mutant or polymorphic polypeptides, including degenerate variants thereof. The invention also encompasses allelic variants of the same, that is, naturally occurring alternative forms of the isolated polynucleotides that encode mutant polypeptides that are identical, homologous or related to those encoded by the polynucleotides. Alternatively, non-naturally occurring variants may be produced by mutagenesis techniques or by direct synthesis techniques well known in the art. Accordingly, nucleic acid sequences capable of hybridizing at low stringency with any nucleic acid sequences encoding mutant polypeptide of the present invention are considered to be within the scope of the invention. For example, for a nucleic acid sequence of about 20-40 bases, a typical prehybridization, hybridization, and wash protocol is as follows: (1) prehybridization: incubate nitrocellulose filters containing the denatured target DNA for 3-4 hours at 55°C in 5xDenhardt's solution, 6xSSC (2OxSSC consists of 175 g NaCl, 88.2 g sodium citrate in 800 ml H2O adjusted to pH. 7.0 with 10 N NaOH), 0.1% SDS, and 100 mg/ml denatured salmon sperm DNA, (2) hybridization: incubate filters in prehybridization solution plus probe at 42°C for 14-48 hours, (3) wash; three 15 minutes washes in 6xSSC and 0.1% SDS at room temperature, followed by a final 1-1.5 minutes wash in 6xSSC and 0.1% SDS at 55°C. Other equivalent procedures, e.g., employing organic solvents such as formamide, are well known in the art. Standard stringency conditions are well characterized in standard molecular biology cloning texts. See, for example, Sambrook, Fritsch, & Maniatis, Molecular Cloning A Laboratory Manual, 2nd Ed., (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989); Glover DN, DNA Cloning, Volumes I and II , (1985); Oligonucleotide Synthesis, Gait MJ, ed. (1984); Nucleic Acid Hybridization, Hames BD & Higgins SJ, eds. (1984).
[148] Recombinant Expression Vectors. Another aspect of the invention includes vectors containing one or more nucleic acid sequences encoding a mutant or polymorphic polypeptide. In practicing the present invention, many conventional techniques in molecular biology, microbiology and recombinant DNA are used. These techniques are well known and are explained in, e.g., Current Protocols in Molecular Biology, VoIs. I-III, Ausubel, ed. (1997); Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Edition. (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989); Glover DN, DNA Cloning:
A Practical Approach, VoIs. I and II (1985); Oligonucleotide Synthesis, Gait, Ed. (1984); Nucleic Acid Hybridization, Hames & Higgins, eds. (1985); Transcription and Translation, Hames & Higgins, Eds. (1984); Animal Cell Culture, Freshney, ed. (1986); Immobilized Cells and Enzymes (IRL Press, 1986); Perbal, A Practical Guide to Molecular Cloning; the series Methods in EnzymoL, (Academic Press, Inc., 1984); Gene Transfer Vectors for Mammalian Cells, Miller & Calos, eds. (Cold Spring Harbor Press, Cold Spring Harbor Laboratory, New York, 1987); and Methods in Enzymology, VoIs. 154 and 155, Wu & Grossman, and Wu, Eds., respectively.
[149] For recombinant expression of one or more the polypeptides of the invention, the nucleic acid containing all or a portion of the nucleotide sequence encoding the polypeptide is inserted into an appropriate cloning vector, or an expression vector (i.e., a vector that contains the necessary elements for the transcription and translation of the inserted polypeptide coding sequence) by recombinant DNA techniques well known in the art and as detailed below. [150] In general, expression vectors useful in recombinant DNA techniques are often in the form of plasmids. In the present specification, "plasmid" and "vector" can be used interchangeably as the plasmid is the most commonly used form of vector. However, the invention is intended to include such other forms of expression vectors that are not technically plasmids, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent functions. Such viral vectors permit infection of a subject and expression in that subject of a compound. Becker et ah, Meth. Cell Biol. A3: 161 89 (1994).
[151] The recombinant expression vectors of the invention comprise a nucleic acid encoding a mutant or polymorphic polypeptide in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory sequences, selected on the basis of the host cells to be used for expression that is operatively linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, "operably linked" is intended to mean that the nucleotide sequence of interest is linked to the regulatory sequences in a manner that allows for expression of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell).
[152] The term "regulatory sequence" is intended to include promoters, enhancers and other expression control elements (e.g., polyadenylation signals). Such regulatory sequences are
described, for example, in Goeddel, Gene Expression Technology: Methods In Enzymology (Academic Press, San Diego, Calif., 1990). Regulatory sequences include those that direct constitutive expression of a nucleotide sequence in many types of host cell and those that direct expression of the nucleotide sequence only in certain host cells (e.g., tissue specific regulatory sequences). It will be appreciated by those skilled in the art that the design of the expression vector can depend on such factors as the choice of the host cell to be transformed, the level of expression of polypeptide desired, etc. The expression vectors of the invention can be introduced into host cells to thereby produce polypeptides or peptides, including fusion polypeptides, encoded by nucleic acids as described herein (e.g., mutant polypeptides and mutant-derived fusion polypeptides, etc.).
[153] Mutant and Polymorphic Polypeptide-Expressing Host Cells. Another aspect of the invention pertains to mutant and polymorphic polypeptide-expressing host cells, which contain a nucleic acid encoding one or more mutant/polymorphic polypeptides of the invention. To prepare a recombinant cell of the invention, the desired isogene may be introduced into a host cell in a vector such that the isogene remains extrachromosomal. In such a situation, the gene will be expressed by the cell from the extrachromosomal location. In a preferred embodiment, the isogene is introduced into a cell in such a way that it recombines with the endogenous gene present in the cell. Such recombination requires the occurrence of a double recombination event, thereby resulting in the desired gene polymorphism or mutation. Vectors for the introduction of genes both for recombination and for extrachromosomal maintenance are known in the art, and any suitable vector or vector construct may be used in the invention. Methods such as electroporation, particle bombardment, calcium phosphate co-precipitation and viral transduction for introducing DNA into cells are known in the art; therefore, the choice of method may lie with the competence and preference of the skilled practitioner.
[154] The recombinant expression vectors of the invention can be designed for expression of mutant polypeptides in prokaryotic or eukaryotic cells. For example, mutant/polymorphic polypeptides can be expressed in bacterial cells such as Escherichia coli (E. colϊ), insect cells (using baculovirus expression vectors), fungal cells, e.g., yeast, yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, Gene Expression Technology: Methods In Enzymology (Academic Press, San Diego, Calif., 1990). Alternatively, the recombinant expression vector can be transcribed and translated in vitro, for example using
T7 promoter regulatory sequences and T7 polymerase. The SMP2 promoter is useful in the expression of polypeptides in smooth muscle cells, Qian et al., Endocrinology 140(4): 1826 (1999).
[155] Expression of polypeptides in prokaryotes is most often carried out in E. coli with vectors containing constitutive or inducible promoters directing the expression of either fusion or non fusion polypeptides. Fusion vectors add a number of amino acids to a polypeptide encoded therein, usually to the amino terminus of the recombinant polypeptide. Such fusion vectors typically serve three purposes: (i) to increase expression of recombinant polypeptide; (ii) to increase the solubility of the recombinant polypeptide; and (iii) to aid in the purification of the recombinant polypeptide by acting as a ligand in affinity purification. Often, in fusion expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion moiety and the recombinant polypeptide to enable separation of the recombinant polypeptide from the fusion moiety subsequent to purification of the fusion polypeptide. Such enzymes, and their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and Johnson, Gene 67: 31 40 (1988)), pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia, Piscataway, NJ.) that fuse glutathione S transferase (GST), maltose E binding polypeptide, or polypeptide A, respectively, to the target recombinant polypeptide. [156] Examples of suitable inducible non fusion E. coli expression vectors include pTrc (Amrann et al, Gene 69:301 315 (1988)) and pET Hd (Studϊer et al, Gene Expression Technology: Methods In Enzymology (Academic Press, San Diego, Calif., 1990) pp.60-89). [157] One strategy to maximize recombinant polypeptide expression in E. coli is to express the polypeptide in host bacteria with an impaired capacity to proteolytically cleave the recombinant polypeptide. See, e.g., Gottesman, Gene Expression Technology: Methods In Enzymology (Academic Press, San Diego, Calif., 1990) 119 128. Another strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an expression vector so that the individual codons for each amino acid are those preferentially utilized in the expression host, e.g., E. coli (see, e.g., Wada et al, Nucl. Acids Res. 20: 2111-2118 (1992)). Such alteration of nucleic acid sequences of the invention can be carried out by standard DNA synthesis techniques. In another embodiment, the mutant/polymorphic polypeptide expression vector is a yeast expression vector.
[158] Examples of vectors for expression in yeast Saccharamyces cerivisiae include pYepSecl (Baldari et al, EMBOJ. 6: 229 234 (1987)), pMFa (Kurjan & Herskowitz, Cell 30: 933 943 (1982)), pJRY88 (Schultz et al, Gene 54: 113 123 (1987)), pYES2 (InVitrogen Corporation, San Diego, Calif., USA), and picZ (InVitrogen Corp, San Diego, Calif., USA). Alternatively, mutant polypeptide can be expressed in insect cells using baculovirus expression vectors. Baculovirus vectors available for expression of polypeptides in cultured insect cells (e.g., SF9 cells) include the pAc series (Smith et al, MoI. Cell. Biol. 3: 21562165 (1983)) and the pVL series (Lucklow & Summers, Virology 170: 31 39 (1989)). [159] In yet another embodiment, a nucleic acid of the invention is expressed in mammalian cells using a mammalian expression vector. Examples of mammalian expression vectors include pCDM8 (Seed, Nature 329: 842 846 (1987)) and pMT2PC (Kaufman et al, EMBO J. 6: 187 195 (1987)). When used in mammalian cells, the expression vector's control functions are often provided by viral regulatory elements. For example, commonly used promoters are derived from polyoma, adenovirus 2, cytomegalovirus, and simian virus 40. For other suitable expression systems for both prokaryotic and eukaryotic cells see, e.g., Chapters 16 and 17 of Sambrook, et al, Molecular Cloning: A Laboratory Manual, 2nd Ed.(Co\ά Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989).
[160] In another embodiment, the recombinant mammalian expression vector is capable of directing expression of the nucleic acid preferentially in a particular cell type (e.g., tissue specific regulatory elements are used to express the nucleic acid). Tissue specific regulatory elements are known in the art. Nonlimiting examples of suitable tissue specific promoters include the albumin promoter (liver specific; Pinkert, et al, Genes Dev. 1: 268 277 (1987)), lymphoid specific promoters (Calame & Eaton, Adv. Immunol. 43: 235 275 (1988)), in particular promoters of T cell receptors (Winoto & Baltimore, EMBO J. 8: 729 733 (1989)) and immunoglobulins (Banerji et al, Cell 33: 729 740 (1983); Queen & Baltimore, Cell 33: 741 748 (1983)), neuron specific promoters (e.g., the neurofilament promoter; Byrne & Ruddle, Proc. Natl. Acad. ScL USA 86: 5473 5477 (1989)), pancreas specific promoters (Edlund et al, Science 230: 912 916 (1985)), and mammary gland specific promoters (e.g., milk whey promoter; U.S. Pat. No. 4,873,316 and European Application Publication No. 264,166). Developmentally regulated promoters are also encompassed, e.g., the murine hox promoters (Kessel & Grass, Science 249: 374 379 (1990)) and the α-fetoprotein promoter (Campes & Tilghman, Genes Dev. 3: 537 546 (1989)).
[161] The invention further provides a recombinant expression vector comprising a DNA molecule of the invention cloned into the expression vector in an antisense orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in a manner that allows for expression (by transcription of the DNA molecule) of an RNA molecule that is antisense to a mutant polypeptide mRNA. Regulatory sequences operatively linked to a nucleic acid cloned in the antisense orientation can be chosen that direct the continuous expression of the antisense RNA molecule in a variety of cell types, for instance viral promoters and/or enhancers, or regulatory sequences can be chosen that direct constitutive, tissue specific or cell type specific expression of antisense RNA. The antisense expression vector can be in the form of a recombinant plasmid, phagemid or attenuated virus in which antisense nucleic acids are produced under the control of a high efficiency regulatory region, the activity of which can be determined by the cell type into which the vector is introduced. For a discussion of the regulation of gene expression using antisense genes see, e.g., Weintraub et al., "Antisense RNA as a molecular tool for genetic analysis," Reviews Trends in Genetics, Vol. 1(1) (1986). [162] Another aspect of the invention pertains to host cells into which a recombinant expression vector of the invention has been introduced. The terms "host cell" and "recombinant host cell" are used interchangeably herein. It is understood that such terms refer not only to the particular subject cell but also to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein.
[163] A host cell can be any prokaryotic or eukaryotic cell. For example, mutant polypeptide can be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to those skilled in the art.
[164] Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques. As used herein, the terms "transformation" and "transfection" are intended to refer to a variety of art recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium chloride co precipitation, DEAE dextran mediated transfection, lipofection, or electroporation. Suitable methods for transforming or transfecting host cells can be found in Sambrook, et al.,
Molecular Cloning: A Laboratory Manual, 2nd ed. (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989), and other laboratory manuals.
[165] For stable transfection of mammalian cells, it is known that, depending upon the expression vector and transfection technique used, only a small fraction of cells may integrate the foreign DNA into their genome. In order to identify and select these integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is generally introduced into the host cells along with the gene of interest. Various selectable markers include those that confer resistance to drugs, such as G418, hygromycin and methotrexate. Nucleic acid encoding a selectable marker can be introduced into a host cell on the same vector as that encoding mutant polypeptide or can be introduced on a separate vector. Cells stably transfected with the introduced nucleic acid can be identified by drug selection (e.g., cells that have incorporated the selectable marker gene will survive, while the other cells die). [166] A host cell that includes a compound of the invention, such as a prokaryotic or eukaryotic host cell in culture, can be used to produce (i.e., express) recombinant mutant/polymorphic polypeptide. In one embodiment, the method comprises culturing the host cell of invention (into which a recombinant expression vector encoding mutant/polymorphic polypeptide has been introduced) in a suitable medium such that mutant polypeptide is produced. In another embodiment, the method further comprises the step of isolating mutant/polymorphic polypeptide from the medium or the host cell. Purification of recombinant polypeptides is well known in the art and includes ion exchange purification techniques, or affinity purification techniques, for example with an antibody to the compound. Methods of creating antibodies to the compounds of the present invention are discussed below.
[167] Transgenic Animals. Recombinant organisms, i.e., transgenic animals, expressing a variant gene of the invention are prepared using standard procedures known in the art. Transgenic animals carrying the constructs of the invention can be made by several methods known to those having skill in the art. See, e.g., U.S. Pat. No. 5,610,053 and "The Introduction of Foreign Genes into Mice" and the cited references therein, in: Recombinant DNA, Watson JD, Gilman M, Witkowski J & Zoller M, eds. (W.H. Freeman and Company, New York) pp. 254-272. Transgenic animals stably expressing a human isogene and producing human protein can be used as biological models for studying diseases related to
abnorraal expression and/or activity, and for screening and assaying various candidate drugs, compounds, and treatment regimens to reduce the symptoms or effects of these diseases. [168] Characterizing Gene Expression Level. Methods to detect and measure mRNA levels (i.e., gene transcription level) and levels of polypeptide gene expression products (i.e., gene translation level) are well-known in the art and include the use of nucleotide microarrays and polypeptide detection methods involving mass spectrometers, reverse-transcription and amplification and/or antibody detection and quantification techniques. See also, Strachan T & Read A, Human Molecular Genetics, 2nd Edition. (John Wiley and Sons, Inc. Publication, New York, 1999)).
[169] Determination of Target Gene Transcription. The determination of the level of the expression product of the gene in a biological sample, e.g., the tissue or body fluids of an individual, may be performed in a variety of ways. The term "biological sample" is intended to include tissues, cells, biological fluids and isolates thereof, isolated from a subject, as well as tissues, cells and fluids present within a subject. Many expression detection methods use isolated RNA. For in vitro methods, any RNA isolation technique that does not select against the isolation of mRNA can be utilized for the purification of RNA from cells. See, e.g., Ausubel et at, Ed., Curr. Prot. MoI. Biol. (John Wiley & Sons, New York, 1987-1999). [170] In one embodiment, the level of the mRNA expression product of the target gene is determined. Methods to measure the level of a specific mRNA are well-known in the art and include Northern blot analysis, reverse transcription PCR and real time quantitative PCR or by hybridization to a oligonucleotide array or microarray. In other more preferred embodiments, the determination of the level of expression may be performed by determination of the level of the protein or polypeptide expression product of the gene in body fluids or tissue samples including but not limited to blood or serum. Large numbers of tissue samples can readily be processed using techniques well-known to those of skill in the art, such as, e.g., the single-step RNA isolation process of U.S. Pat. No. 4,843,155. [171] The isolated mRNA can be used in hybridization or amplification assays that include, but are not limited to, Southern or Northern analyses, PCR analyses and probe arrays. One preferred diagnostic method for the detection of mRNA levels involves contacting the isolated mRNA with a nucleic acid molecule (probe) that can hybridize to the mRNA encoded by the gene being detected. The nucleic acid probe can be, e.g., a full-length cDNA, or a portion
thereof, such as an oligonucleotide of at least 7, 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under stringent conditions to an mRNA or genomic DNA encoding a marker of the present invention. Other suitable probes for use in the diagnostic assays of the invention are described herein. Hybridization of an mRNA with the probe indicates that the marker in question is being expressed.
[172] In one format, the probes are immobilized on a solid surface and the mRNA is contacted with the probes, for example, in an Affymetrix gene chip array (Affymetrix, Calif. USA). A skilled artisan can readily adapt known mRNA detection methods for use in detecting the level of mRNA encoded by the markers of the present invention. [173] An alternative method for determining the level of mRNA corresponding to a marker of the present invention in a sample involves the process of nucleic acid amplification, e.g., by RT-PCR (the experimental embodiment set forth in U.S. Pat. No. 4,683,202); ligase chain reaction (Barany et ah, Proc. Natl. Acad. ScL USA 88:189-193 (1991)) self-sustained sequence replication (Guatelli et ah, Proc. Natl. Acad. ScL USA 87: 1874-1878 (1990)); transcriptional amplification system (Kwoh et ah, Proc. Natl. Acad. ScL USA 86: 1173-1177 (1989)); Q-Beta Replicase (Lizardi et ah, Biol. Technology 6: 1197 (1988)); rolling circle replication (U.S. Pat. No. 5,854,033); or any other nucleic acid amplification method, followed by the detection of the amplified molecules using techniques well-known to those of skill in the art. These detection schemes are especially useful for the detection of the nucleic acid molecules if such molecules are present in very low numbers. As used herein, "amplification primers" are defined as being a pair of nucleic acid molecules that can anneal to 5' or 3' regions of a gene (plus and minus strands, respectively, or vice- versa) and contain a short region in between. In general, amplification primers are from about 10-30 nucleotides in length and flank a region from about 50-200 nucleotides in length.
[174] Real-time quantitative PCR (RT-PCR) is one way to assess gene expression levels, e.g., of genes of the invention, e.g., those containing SNPs and mutations of interest. The RT- PCR assay utilizes an RNA reverse transcriptase to catalyze the synthesis of a DNA strand from an RNA strand, including an mRNA strand. The resultant DNA may be specifically detected and quantified and this process may be used to determine the levels of specific species of mRNA. One method for doing this is TAQMAN® (PE Applied Biosystems, Foster City, Calif., USA) and exploits the 5' nuclease activity of AMPLITAQ GOLD™ DNA polymerase to cleave a specific form of probe during a PCR reaction. This is referred to as a
TAQMAN™ probe. See Luthra et ah, Am. J. Pathol. 153: 63-68 (1998); Kuimelis et ah, Nucl. Acids Symp. Ser. 37: 255-256 (1997); and Mullah et al, Nucl. Acids Res. 26(4): 1026- 1031 (1998)). During the reaction, cleavage of the probe separates a reporter dye and a quencher dye, resulting in increased fluorescence of the reporter. The accumulation of PCR products is detected directly by monitoring the increase in fluorescence of the reporter dye. Heid et ah, Genome Res. 6(6): 986-994 (1996)). The higher the starting copy number of nucleic acid target, the sooner a significant increase in fluorescence is observed. See Gibson, Heid & Williams et ah, Genome Res. 6: 995-1001 (1996).
[175] Other technologies for measuring the transcriptional state of a cell produce pools of restriction fragments of limited complexity for electrophoretic analysis, such as methods combining double restriction enzyme digestion with phasing primers (see, e.g., EP 0 534858 Al), or methods selecting restriction fragments with sites closest to a defined mRNA end. (See, e.g., Prashar & Weissman, Proc. Natl. Acad. Sci. USA 93(2) 659-663 (1996)). [176] Other methods statistically sample cDNA pools, such as by sequencing sufficient bases, e.g., 20-50 bases, in each of multiple cDNAs to identify each cDNA, or by sequencing short tags, e.g., 9-10 bases, which are generated at known positions relative to a defined mRNA end pathway pattern. See, e.g., Velculescu, Science 270: 484-487 (1995). The cDNA levels in the samples are quantified and the mean, average and standard deviation of each cDNA is determined using by standard statistical means well-known to those of skill in the art. Norman TJ. Bailey, Statistical Methods In Biology, 3rd Edition (Cambridge University Press, 1995).
[177] Detection of Polypeptides. Immunological Detection Methods. Expression of the protein encoded by the genes of the invention can be detected by a probe which is detectably labelled, or which can be subsequently labelled. The term "labelled", with regard to the probe or antibody, is intended to encompass direct-labelling of the probe or antibody by coupling, i.e., physically linking, a detectable substance to the probe or antibody, as well as indirect- labelling of the probe or antibody by reactivity with another reagent that is directly-labelled. Examples of indirect labelling include detection of a primary antibody using a fluorescently- labelled secondary antibody and end-labelling of a DNA probe with biotin such that it can be detected with fluorescently-labelled streptavidin. Generally, the probe is an antibody that recognizes the expressed protein. A variety of formats can be employed to determine whether a sample contains a target protein that binds to a given antibody. Immunoassay methods
useful in the detection of target polypeptides of the present invention include, but are not limited to, e.g., dot blotting, western blotting, protein chips, competitive and noncompetitive protein binding assays, enzyme-linked immunosorbant assays (ELISA), immunohistochemistry. fluorescence activated cell sorting (FACS)5 and others commonly used and widely-described in scientific and patent literature, and many employed commercially. A skilled artisan can readily adapt known protein/antibody detection methods for use in determining whether cells express a marker of the present invention and the relative concentration of that specific polypeptide expression product in blood or other body tissues. Proteins from individuals can be isolated using techniques that are well-known to those of skill in the art. The protein isolation methods employed can, e.g., be such as those described in Harlow & Lane, Antibodies: A Laboratory Manual (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1988)).
[178] For the production of antibodies to a protein encoded by one of the disclosed genes, various host animals may be immunized by injection with the polypeptide, or a portion thereof. Such host animals may include, but are not limited to, rabbits, mice and rats. Various adjuvants may be used to increase the immunological response, depending on the host species including, but not limited to, Freund's (complete and incomplete), mineral gels, such as aluminium hydroxide; surface active substances, such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin and dinitrophenol; and potentially useful human adjuvants, such as bacille Camette-Guerin (BCG) and Coryne bacterium parvum.
[179] Monoclonal antibodies (mAbs), which are homogeneous populations of antibodies to a particular antigen, may be obtained by any technique that provides for the production of antibody molecules by continuous cell lines in culture. These include, but are not limited to, the hybridoma technique of Kohler & Milstein, Nature 256: 495-497 (1975); and U.S. Pat. No. 4,376,110; the human B-cell hybridoma technique of Kosbor et ah, Immunol. Today 4: 72 (1983); Cole et al, Proc. Natl. Acad. ScL USA 80: 2026-2030 (1983); and the EBV- hybridoma technique of Cole et al., Monoclonal Antibodies and Cancer Therapy (Alan R. Liss, Inc., 1985) pp. 77-96.
[180] In addition, techniques developed for the production of "chimeric antibodies" (see Morrison et al, Proc. Natl. Acad. Sci. USA 81: 6851-6855 (1984); Neuberger et al, Nature 312: 604-608 (1984); and Takeda et al, Nature 314: 452-454 (1985)), by splicing the genes
from a mouse antibody molecule of appropriate antigen specificity together with genes from a human antibody molecule of appropriate biological activity can be used. A chimeric antibody is a molecule in which different portions are derived from different animal species, such as those having a variable or hypervariable region derived form a murine mAb and a human immunoglobulin constant region.
[181] Alternatively, techniques described for the production of single chain antibodies (U.S. Pat. No. 4,946,778; Bird, Science 242: 423-426 (1988); Huston et al, Proc. Natl. Acad. Sci. USA 85: 5879-5883 (1988); and Ward et al, Nature 334: 544-546 (1989)) can be adapted to produce differentially expressed gene single-chain antibodies.
[182] Techniques useful for the production of "humanized antibodies" can be adapted to produce antibodies to the proteins, fragments or derivatives thereof. Such techniques are disclosed in U.S. Pat. Nos. 5,932,448; 5,693,762; 5,693,761; 5,585,089; 5,530,101; 5,569,825; 5,625,126; 5,633,425; 5,789,650; 5,661,016; and 5,770,429. [183] Antibodies or antibody fragments can be used in methods, such as Western blots or immunofluorescence techniques, to detect the expressed proteins. In such uses, it is generally preferable to immobilize either the antibody or proteins on a solid support. Suitable solid phase supports or carriers include any support capable of binding an antigen or an antibody. Well-known supports or carriers include glass, polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, polyacrylamides, gabbros and magnetite.
[184] A useful method, for ease of detection, is the sandwich ELISA, of which a number of variations exist, all of which are intended to be used in the methods and assays of the present invention. As used herein, "sandwich assay" is intended to encompass all variations on the basic two-site technique. Immunofluorescence and EIA techniques are both very well- established in the art. However, other reporter molecules, such as radioisotopes, chemiluminescent or bioluminescent molecules may also be employed. It will be readily apparent to the skilled artisan how to vary the procedure to suit the required use. [185] Whole genome monitoring of protein, i.e., the "proteome," can be carried out by constructing a microarray in which binding sites comprise immobilized, preferably monoclonal, antibodies specific to a plurality of protein species encoded by the cell genome. Preferably, antibodies are present for a substantial fraction of the encoded proteins, or at least for those proteins relevant to testing or confirming a biological network model of interest. As
noted above, methods for making monoclonal antibodies are well-known. See, e.g., Harlow & Lane, Antibodies: A Laboratory ManuaV (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1988)). In a preferred embodiment, monoclonal antibodies are raised against synthetic peptide fragments designed based on genomic sequence of the cell. With such an antibody array, proteins from the cell are contacted to the array and their binding is measured with assays known in the art.
Detection of Polypeptides. Two-Dimensional Gel Electrophoresis
[186] Two-dimensional gel electrophoresis is well-known in the art and typically involves isoelectric focusing along a first dimension followed by SDS-PAGE electrophoresis along a second dimension. See, e.g., Hames et al, Gel Electrophoresis of Proteins: A Practical Approach (IRL Press, New York, 1990); Shevchenko et al, Proc. Natl. Acad. ScL USA 93: 14440-14445 (1996); Sagliocco et al., Yeast 12: 1519-1533 (1996); and Lander, Science TJA: 536-539 (1996)).
[187] Detection of Polypeptides. Mass Spectroscopy. The identity as well as expression level of target polypeptide can be determined using mass spectrocopy technique (MS). MS- based analysis methodology is useful for analysis of isolated target polypeptide as well as analysis of target polypeptide in a biological sample. MS formats for use in analyzing a target polypeptide include ionization (I) techniques, such as, but not limited to, matrix assisted laser desσrption (MALDI), continuous or pulsed electrospray ionization (ESI) and related methods, such as ionspray or thermospray, and massive cluster impact (MCI). Such ion sources can be matched with detection formats, including linear or non-linear reflectron time of flight (TOF), single or multiple quadrupole, single or multiple magnetic sector Fourier transform ion cyclotron resonance (FTICR), ion trap and combinations thereof such as ion-trap/TOF. For ionization, numerous matrix/wavelength combinations (e.g., matrix assisted laser desorption (MALDI)) or solvent combinations (e.g., ESI) can be employed.
[188] For mass spectroscopy (MS) analysis, the target polypeptide can be solubilised in an appropriate solution or reagent system. The selection of a solution or reagent system, e.g., an organic or inorganic solvent, will depend on the properties of the target polypeptide and the type of MS performed, and is based on methods well-known in the art. See, e.g., Vorm et al., Anal Chem. 61: 3281 (1994) for MALDI; and Valaskovic et al, Anal. Chem. 61: 3802 (1995), for ESI. MS of peptides also is described, e.g., in International PCT Application No.
WO 93/24834 and U.S. Pat. No. 5,792,664. A solvent is selected that minimizes the risk that the target polypeptide will be decomposed by the energy introduced for the vaporization process. A reduced risk of target polypeptide decomposition can be achieved, e.g., by embedding the sample in a matrix. A suitable matrix can be an organic compound such as a sugar, e.g., a pentose or hexose, or a polysaccharide such as cellulose. Such compounds are decomposed thermolytically into CO2 and H2O such that no residues are formed that can lead to chemical reactions. The matrix also can be an inorganic compound, such as nitrate of ammonium, which is decomposed essentially without leaving any residue. Use of these and other solvents is known to those of skill in the art. See, e.g., U.S. Pat. No. 5,062,935. Electrospray MS has been described by Fenn et al., J. Phys. Chem. 88: 4451-4459 (1984); and PCT Application No. WO 90/14148; and current applications are summarized in review articles. See Smith et al, Anal. Chem. 62: 882-89 (1990); and Ardrey, Spectroscopy 4: 10-18 (1992).
[189] The mass of a target polypeptide determined by MS can be compared to the mass of a corresponding known polypeptide. For example, where the target polypeptide is a mutant protein, the corresponding known polypeptide can be the corresponding non-mutant protein, e.g., wild-type protein. With ESI, the determination of molecular weights in femtomole amounts of sample is very accurate due to the presence of multiple ion peaks, all of which can be used for mass calculation. Sub-attomole levels of protein have been detected, e.g., using ESI MS (Valaskovic et al, Science 273: 1199-1202 (1996)) and MALDI MS (Li et al, J. Am. Chem. Soc. 118: 1662-1663 (1996)).
[190] Matrix Assisted Laser Desorption (MALDI) The level of the target protein in a biological sample, e.g., body fluid or tissue sample, may be measured by means of mass spectrometric (MS) methods including, but not limited to, those techniques known in the art as matrix-assisted laser desorption/ionization, time-of-flight mass spectrometry (MALDI- TOF-MS) and surfaces enhanced for laser desorption/ionization, time-of-flight mass spectrometry (SELDI-TOF-MS) as further detailed below. Methods for performing MALDI are well-known to those of skill in the art. See, e.g., Juhasz et al, Analysis, Anal. Chem. 68: 941-946 (1996), and see also, e.g., U.S. Pat. Nos. 5,777,325; 5,742,049; 5,654,545; 5,641,959; 5,654,545 and 5,760,393 for descriptions of MALDI and delayed extraction protocols. Numerous methods for improving resolution are also known. MALDI-TOF-MS
has been described by Hillenkamp et al., Biological Mass Spectrometry, Burlingame & McCloskey, eds. (Elsevier Science Publ., Amsterdam, 1990) pp. 49-60. [191] A variety of techniques for marker detection using mass spectroscopy can be used. See Bordeaux Mass Spectrometry Conference Report, Hillenkamp, Ed., pp. 354-362 (1988); Bordeaux Mass Spectrometry Conference Report, Karas & Hillenkamp, Eds., pp. 416-417 (1988); Karas & Hillenkamp, Anal. Chem. 60: 2299-2301 (1988); and Karas et al, Biomed. Environ. Mass Spectrum 18: 841-843 (1989). The use of laser beams in TOF-MS is shown, e.g., in U.S. Patent Nos. 4,694,167; 4,686,366, 4,295,046 and 5,045,694, which are incorporated herein by reference in their entireties. Other MS techniques allow the successful volatilization of high molecular weight biopolymers, without fragmentation, and have enabled a wide variety of biological macromolecules to be analyzed by mass spectrometry. [192] Surfaces Enhanced for Laser Desorption/Ionization (SELDI) Other techniques are used which employ new MS probe element compositions with surfaces that allow the probe element to actively participate in the capture and docking of specific analytes, described as Affinity Mass Spectrometry (AMS). See SELDI patents U.S. Pat. Nos. 5,719,060; 5,894,063; 6,020,208; 6,027,942; 6,124,137; and U.S. Patent application No. U.S. 2003/0003465. Several types of new MS probe elements have been designed with Surfaces Enhanced for Affinity Capture (SEAC). See Hutchens & Yip, Rapid Commun. Mass Spectrom. 7: 576-580 (1993). SEAC probe elements have been used successfully to retrieve and tether different classes of biopolymers, particularly proteins, by exploiting what is known about protein surface structures and biospecific molecular recognition. The immobilized affinity capture devices on the MS probe element surface, i.e., SEAC, determines the location and affinity (specificity) of the analyte for the probe surface, therefore the subsequent analytical MS process is efficient.
[193] Within the general category of SELDI are three separate subcategories: (1) Surfaces Enhanced for Neat Desorption (SEND), where the probe element surfaces, i.e., sample presenting means, are designed to contain Energy Absorbing Molecules (EAM) instead of "matrix" to facilitate desorption/ionizations of analytes added directly (neat) to the surface; (2) SEAC, where the probe element surfaces, i.e., sample presenting means, are designed to contain chemically defined and/or biologically defined affinity capture devices to facilitate either the specific or non-specific attachment or adsorption (so-called docking or tethering) of analytes to the probe surface, by a variety of mechanisms (mostly non-covalent); and
(3) Surfaces Enhanced for Photolabile Attachment and Release (SEPAR), where the probe element surfaces, i.e., sample presenting means, are designed or modified to contain one or more types of chemically defined cross-linking molecules to serve as covalent docking devices. The chemical specificities determining the type and number of the photolabile molecule attachment points between the SEPAR sample presenting means {i.e., probe element surface) and the analyte (e.g., protein) may involve any one or more of a number of different residues or chemical structures in the analyte {e.g., His, Lys, Arg, Tyr, Phe and Cys residues in the case of proteins and peptides).
[194] Functionalizing Polypeptides A polypeptide of interest also can be modified to facilitate conjugation to a solid support. A chemical or physical moiety can be incorporate into the polypeptide at an appropriate position. For example, a polypeptide of interest can be modified by adding an appropriate functional group to the carboxyl terminus or amino terminus of the polypeptide, or to an amino acid in the peptide, {e.g., to a reactive side chain, or to the peptide backbone. The artisan will recognize, however, that such a modification, e.g., the incorporation of a biotin moiety, can affect the ability of a particular reagent to interact specifically with the polypeptide and, accordingly, will consider this factor, if relevant, in selecting how best to modify a polypeptide of interest. A naturally-occurring amino acid normally present in the polypeptide also can contain a functional group suitable for conjugating the polypeptide to the solid support. For example, a cysteine residue present in the polypeptide can be used to conjugate the polypeptide to a support containing a sulfhydryl group through a disulfide linkage, e.g., a support having cysteine residues attached thereto. Other bonds that can be formed between two amino acids, include, but are not limited to, e.g., monosulfide bonds between two lanthionine residues, which are non- naturally-occurring amino acids that can be incorporated into a polypeptide; a lactam bond formed by a transamidation reaction between the side chains of an acidic amino acid and a basic amino acid, such as between the y-carboxyl group of GIu (or alpha carboxyl group of Asp) and the amino group of Lys; or a lactone bond produced, e.g., by a crosslink between the hydroxy group of Ser and the carboxyl group of GIu (or alpha carboxyl group of Asp). Thus, a solid support can be modified to contain a desired amino acid residue, e.g., a GIu residue, and a polypeptide having a Ser residue, particularly a Ser residue at the N-terminus or C- terminus, can be conjugated to the solid support through the formation of a lactone bond. The support need not be modified to contain the particular amino acid, e.g., GIu, where it is
desired to form a lactone-like bond with a Ser in the polypeptide, but can be modified, instead, to contain an accessible carboxyl group, thus providing a function corresponding to the alpha carboxyl group of GIu.
[195] Thiol-Reactive Functionalities A thiol-reactive functionality is particularly useful for conjugating a polypeptide to a solid support. A thiol-reactive functionality is a chemical group that can rapidly react with a nucleophilic thiol moiety to produce a covalent bond, e.g., a disulfide bond or a thioether bond. A variety of thiol-reactive functionalities are known in the art, including, e.g., haloacetyls, such as iodoacetyl; diazoketones; epoxy ketones, alpha- and beta-unsaturated carbonyls, such as alpha-enones and beta-enones; and other reactive Michael acceptors, such as maleimide; acid halides; benzyl halides; and the like. See Greene & Wuts, Protective Groups in Organic Synthesis, 2nd Edition (John Wiley & Sons, 1991). [196] If desired, the thiol groups can be blocked with a photocleavable protecting group, which then can be selectively cleaved, e.g., by photolithography, to provide portions of a surface activated for immobilization of a polypeptide of interest. Photocleavable protecting groups are known in the art (see, e.g., published International PCT Application No. WO 92/10092; and McCray et al, Ann. Rev. Biophys. Biophys. Chem. 18: 239-270 (1989)) and can be selectively de-blocked by irradiation of selected areas of the surface using, e.g., a photolithography mask.
[197] Linkers. A polypeptide of interest can be attached directly to a support via a linker. Any linkers known to those of skill in the art to be suitable for linking peptides or amino acids to supports, either directly or via a spacer, may be used. For example, the polypeptide can be conjugated to a support, such as a bead, through means of a variable spacer. Linkers, include, Rink amide linkers (see, e.g., Rink, Tetrahedron Lett. 28: 3787 (1976)); trityl chloride linkers (see, e.g., Leznoff, Ace Chem. Res. 11: 327 (1978)); and Merrifield linkers (see, e.g., Bodansky et al, Peptide Synthesis, 2nd Edition (Academic Press, New York, 1976)). For example, trityl linkers are known. See, e.g., U.S. Pat. Nos. 5,410,068 and 5,612,474. Amino trityl linkers are also known. See, e.g., U.S. Pat. No. 5,198,531. Other linkers include those that can be incorporated into fusion proteins and expressed in a host cell. Such linkers may be selected amino acids, enzyme substrates or any suitable peptide. The linker may be made, e.g., by appropriate selection of primers when isolating the nucleic acid. Alternatively, they may be added by post-translational modification of the protein of interest. Linkers that are suitable for chemically linking peptides to supports, include disulfide bonds, thioether bonds,
hindered disulfide bonds and covalent bonds between free reactive groups, such as amine and thiol groups.
[198] Cleavable Linkers A linker can provide a reversible linkage such that it is cleaved under the select conditions. In particular, selectively cleavable linkers, including photocleavable linkers (see U.S. Pat. No. 5,643,722), acid cleavable linkers (see Fattom et al, Infect. Immun. 60: 584-589 (1992)), acid-labile linkers (see Welhδner et al, J. Biol. Chem. 266: 4309-4314 (1991)) and heat sensitive linkers are useful. A linkage can be, e.g., a disulfide bond, which is chemically cleavable by mercaptoethanol or dithioerythrol; a biotin/streptavidin linkage, which can be photocleavable; a heterobifunctional derivative of a trityl ether group, which can be cleaved by exposure to acidic conditions or under conditions of MS (see Kόster et al, Tetrahedron Lett. 31 : 7095 (1990)); a levulinyl-mediated linkage, which can be cleaved under almost neutral conditions with a hydrazinium/acetate buffer; an arginine-arginine or a lysine-lysine bond, either of which can be cleaved by an endopeptidase, such as trypsin; a pyrophosphate bond, which can be cleaved by a pyrophosphatase; or a ribonucleotide bond, which can be cleaved using a ribonuclease or by exposure to alkali condition. A photolabile cross-linker, such as 3-amino-(2-nitrophenyl)propionic acid can be employed as a means for cleaving a polypeptide from a solid support. Brown et al, MoI Divers, pp. 4-12 (1995); Rothschild et al, Nucl Acids. Res. 24: 351-66 (1996); and U.S. Pat. No. 5,643,722. Other linkers include RNA linkers that are cleavable by ribozymes and other RNA enzymes and linkers, such as the various domains, such as CHi, CH2 and CH3, from the constant region of human IgGl. See, Batra et al., Mol Immunol 30: 379-396 (1993). [199] Combinations of any linkers are also contemplated herein. For example, a linker that is cleavable under MS conditions, such as a silyl linkage or photocleavable linkage, can be combined with a linker, such as an avidin biotin linkage, that is not cleaved under these conditions, but may be cleaved under other conditions. Acid-labile linkers are particularly useful chemically cleavable linkers for mass spectrometry, especially for MALDI-TOF, because the acid labile bond is cleaved during conditioning of the target polypeptide upon addition of a 3-HPA matrix solution. The acid labile bond can be introduced as a separate linker group, e.g., an acid labile trityl group, or can be incorporated in a synthetic linker by introducing one or more silyl bridges using diisopropylysilyl, thereby forming a diisopropylysilyl linkage between the polypeptide and the solid support. The diisopropylysilyl linkage can be cleaved using mildly acidic conditions, such as 1.5%
trifluoroacetic acid (TFA) or 3-HPA/l% TFA MALDI-TOF matrix solution. Methods for the preparation of diisopropylysilyl linkages and analogues thereof are well-known in the art. See, e.g., Saha. etal., J. Org. Chem. 58: 7827-7831 (1993).
[200] Use of a Pin Tool to Immobilize a Polypeptide The immobilization of a polypeptide of interest to a solid support using a pin tool can be particularly advantageous. Pin tools include those disclosed herein or otherwise known in the art. See, e.g., U.S. Application Serial Nos. 08/786,988 and 08/787,639; and International PCT Application No. WO 98/20166.
[201] A pin tool in an array, e.g., a 4 * 4 array, can be applied to wells containing polypeptides of interest. Where the pin tool has a functional group attached to each pin tip, or a solid support, e.g., functionalized beads or paramagnetic beads are attached to each pin, the polypeptides in a well can be captured (1 pmol capacity). During the capture step, the pins can be kept in motion (vertical, 1-2 mm travel) to increase the efficiency of the capture. Where a reaction, such as an in vitro transcription is being performed in the wells, movement of the pins can increase efficiency of the reaction. Further immobilization can result by applying an electrical field to the pin tool. When a voltage is applied to the pin tool, the polypeptides are attracted to the anode or the cathode, depending on their net charge. [202] For more specificity, the pin tool (with or without voltage) can be modified to have conjugated thereto a reagent specific for the polypeptide of interest, such that only the polypeptides of interest are bound by the pins. For example, the pins can have nickel ions attached, such that only polypeptides containing a polyhistidine sequence are bound. Similarly, the pins can have antibodies specific for a target polypeptide attached thereto, or to beads that, in turn, are attached to the pins, such that only the target polypeptides, which contain the epitope recognized by the antibody, are bound by the pins.
[203] Captured polypeptides can be analyzed by a variety of means including, e.g., spectrometry techniques, such as UV/VIS, IR, fluorescence, chemiluminescence, NMR spectroscopy, MS or other methods known in the art, or combinations thereof. If conditions preclude direct analysis of captured polypeptides, the polypeptides can be released or transferred from the pins, under conditions such that the advantages of sample concentration are not lost. Accordingly, the polypeptides can be removed from the pins using a minimal volume of eluent, and without any loss of sample. Where the polypeptides are bound to the
beads attached to the pins, the beads containing the polypeptides can be removed from the pins and measurements made directly from the beads.
[204] Pin tools can be useful for immobilizing polypeptides of interest in spatially addressable manner on an array. Such spatially addressable or pre-addressable arrays are useful in a variety of processes, including, for example, quality control and amino acid sequencing diagnostics. The pin tools described in the U.S. Application Nos. 08/786,988 and 08/787,639 and International PCT Application No. WO 98/20166 are serial and parallel dispensing tools that can be employed to generate multi-element arrays of polypeptides on a surface of the solid support. The array surface can be flat, with beads or geometrically altered to include wells, which can contain beads. In addition, MS geometries can be adapted for accommodating a pin tool apparatus.
[205] Other Aspects of the Biological State hi various embodiments of the invention, aspects of the biological activity state, or mixed aspects can be measured in order to obtain drug and pathway responses. The activities of proteins relevant to the characterization of cell function can be measured, and embodiments of this invention can be based on such measurements. Activity measurements can be performed by any functional, biochemical or physical means appropriate to the particular activity being characterized. Where the activity involves a chemical transformation, the cellular protein can be contacted with natural substrates, and the rate of transformation measured. Where the activity involves association in multimeric units, e.g., association of an activated DNA binding complex with DNA, the amount of associated protein or secondary consequences of the association, such as amounts of mRNA transcribed, can be measured. Also, where only a functional activity is known, e.g., as in cell cycle control, performance of the function can be observed. However known and measured, the changes in protein activities form the response data analyzed by the methods of this invention. In alternative and non-limiting embodiments, response data may be formed of mixed aspects of the biological state of a cell. Response data can be constructed from, e.g., changes in certain mRNA abundances, changes in certain protein abundances and changes in certain protein activities.
[206] The following EXAMPLES are presented in order to more fully illustrate the preferred embodiments of the invention. These EXAMPLES should in no way be construed as limiting the scope of the invention, as defined by the appended claims.
EXAMPLE l
BIOINFORMATICS ANALYSIS OF HDAC6 MUTATION. IDENTIFICATION OF
HDAC6 MUTATIONS IN HUMAN CANCER
[207] To determine HDAC6 mutations in association with cancer, DHPLC analysis was conducted on test samples derived from human tissues, e.g., AML, and breast cancer as summarized in TABLE 3 below. Lilleberg SL, Curr. Opin. Drug Discov. Devel., 6(2): 237-
52 (2003). Specifically, fifteen (15) blood tissue samples were analyzed from AML patents.
Two (2) HDAC6 missense mutations were identified as detailed below in TABLE 3 (see also,
TABLE 1 and TABLE 2, supra).
TABLE 3
HDAC6 Mutations in AML
Amino Acid Amino Acid
Mutation Conversion Position Identifier
GGG>CGG G>R 930 G930R
GGG>CGG G>A 1064 G1064A
Computational Analysis of the HDAC6 Mutation
[208] The HDAC6 mutation identified in human cancer was analyzed using computational analysis tools to determine the effect(s) of these mutations on HDAC6 function.
A. Comparison of Known HDAC6 Mutations and SNPs with the HDAC6 Mutation of the Present Invention
[209] The known HDAC6 missense mutations and single nucleotide polymorphisms (SNPs), e.g., synonymous SNPs and non-synonymous SNPs, are summarized in TABLE 4 below. SNPs in coding regions (cSNPs) are of interest for the study of human phenotypic variation and disease gene mapping. Such polymorphisms are likely to impact gene function directly, and to be in strong linkage disequilibrium with rare disease-causing mutations in genes. Two (2) cSNPs of HDAC6 have been reported within the dbSNP database (http://www.ncbi.nlm.nih.gov/SNP/index.html). Neither of the mutations in TABLE 1 matches known non-synonymous SNPs.
TABLE 4 Known HDAC6 Mutations and SNPs
cSNP Allele 1 frequency Allele 2 frequency Ref SNP
V701A T C rs 1467379
T994I C T rsl7144210
B. Analysis of the Effect of HDAC6 Mutation on HDAC6 Protein Domain Structure and Function i. Pfam Analysis of the Potential Effect of the HDAC6 mutation on
HDAC6 Protein Domain Structure
[210] The effect of the HDAC6 mutations of the present invention on the protein domain structure of HDAC6 was analyzed using the Pfam computational analysis tool. Pfam is a large collection of multiple sequence alignments and hidden Markov models covering many common protein families based on the Swissprot 44.5 and SP-TrEMBL 27.5 protein sequence databases. Bateman et al, Nucl. Acids Res., Database Issue 32:D138-D141 (2004). A search using Pfam showed that HDAC6 contains two (2) HDAC domains and a Zn-finger domain also found in ubiquitin-hydrolases and other protein.
[211] Sequence alignment of the wild-type human HDAC6 polypeptide sequence with the Pfam HDAC domain polypeptide sequence is summarized below in TABLE 5 and TABLE 6. The first HDAC domain was located in the wild-type HDAC6 polypeptide at amino acid position 85 to 404 (TABLE 5; HDAC domain; score 315.9, E=3e-92).
TABLE 5
Sequence Alignment Comparison of Human HDAC6 with Pfam Model of HDAC Domain
Located from AA 85 to AA 404
*->tvayvydpevlnyhfktisygagHpenpeRlrlihelLleygllkkm
++ I + I +++ ++ I + + +Il 1 I I I + I + I + I + + I I I +++
HDAC6 85 GTGLVLDEQLNEFH CLWDDSFPEGPERLHAIKEQLIQEGLLDRC 128 eivtnsqeprkatdeelllvHsedyveflesvsptnlekldkgtNeielk ++ +1 1 i i i i i i i +I++ +| + +|+ +|+ I
HDAC6 129 VSFQ ARFAEKEELMLVHSLEYIDLMETTQYMNEGELRV LA 168
yfnvgdDtpvfaglyeaaqlsvGgaldaadrllegeldiainwagGPgHH ++ +)++++ +++|+ I |+ I +I+++I++I I ++ + +++| I I I HDAC6 169 DT—YDSVYLHPNSYSCACLASGSVLRLVDAVLGAEIRNGMAIIRPPGHH 216
AkkgeasGFCylNniAiAIlellkkyPyveRvliiDiDvHHGDGTQeiFY I ++ l+l +| +I+I++++++I +IIII+I+IIII1 111+ 1+ HDAC6 217 AQHSLMDGYCMFNHVAVAARYAQQKH-RIRRVLIVDWDVHHGQGTQFTFD 265
nddrVltvSfHkygkGefFPGTieGditeiGkgkGkgytlNiPLledGtd +|+ II++I+I+I+ ! |+|+ ++ +| l+l MI+I+I++ I++ HDAC6 266 QDPSVLYFSIHRYEQGRFWPHLKASNWSTTGFGQGQGYTINVPWNQVGMR 315
DesYleafktiiepvleqFkPdaiwsaGfDalegDptmqLgsfnLTieg (SEQIDNO:5) I++I++I I +++ I |+ +1 + 1++++1 i I i M i in |+++ |+ I
HDAC6 316 DADYIAAFLHVLLPVALEFQPQLVLVAAGFDALQGDP KGEMAATPAG 362 (SEQIDNO: 6) ygeivrflkslaqkhcdgpllvVlEGGYtlraiasavarcwialtggllg ++ +++|+ M +I+I++ i i i I i+i i I+1 + | + IU
HDAC6 363 FAQLTHLLMGLA GGKLILSLEGGYNLR ALAEGVSASLHTLLG 404
[212] The second HDAC domain was located in the wild-type HDAC6 polypeptide at amino acid position 480 to 800 (TABLE 6; HDAC domain; score 355.9, E=5.5e-107).
TABLE 6
Sequence Alignment Comparison of Human HDAC6 with Pfam Model of HDAC Domain
Located at AA 480 to AA 800
*->tvayvydpevlnyhfktisyg..agHpenpeRlrlihelLleygllk +++ I I |+++ |+ + + M l I + I+++I+ I I I | +
HDAC6 480 RTGLVYDQNMMNHC NLwdSHHPEVPQRILRIMCRLEELGLAG 521
kmeivtnsqeprkatdeelllvHsedyveflesvsptnlekldkgtNeie
+++ +1 II + II++MI I l +11' 1+ +++++ +t + I
HDAC6 522 RCLTLT PRPATEAELLTCHSAEYVGHLRATEKMKTRELHR E 562
lkyfnvgdDtpvfaglyeaaqlsvGgaldaadrllegeldiainwagGPg
++| I ++++ + +++ I I I++I + I ++++++I + I |+ ++ +++++I I
HDAC6 563 SSNF DSIYICPSTFACAQLATGAACRLVEAVLSGEVLNGAAWRPPG 609
HHAkkgeasGFCylNniAiAIlellkkyPyve . RvIuDiDvHHGDGTQe 111+ + I + I I I ++ 1 +1 + 1 ++++++ +++ I + I I + I + 1 I I I I + I I I
HDAC6 610 HHAEQDAACGFCFFNSVAVAARHAQTIS-GHAIRILIVDWDVHHGNGTQH 658
iFYnddrVltvSfHkygkGefFPGTieGditeiGkgkGkgytlNiPLled I++I+ I l + l 1 + I + I++I + I I I Il +11 +1 I+Ϊ+I+ ++++ HDAC6 659 MFEDDPSVLYVSLHRYDHGTFFPMGDEGASSQIGRAAGTGFTVNVAWNGP 708
GtdDesYleafktiiepvleqFkPdaivvsaGfDalegDptmqLgsfnLT ++ I ++ I I + I + + I ++ + I + I ++++ I I I I I I I + I I I I l +++++ HDAC6 709 RMGDADYLAAWHRLVLPIAYEFNPELVLVSAGFDAARGDP LGGCQVS 755
iegygeivrflkslaqkhcdgplIvVlEGGYtIraiasavarcwialtgg
+ I I I++ +++I+ | |+ |++++ M I I l + l ++ + +| | +
HDAC6 756 PEGYAHLTHLLMGLAΞ GRIILILEGGYNLT SISESMAACTRS 797
llgo* (SEQ ID NO: 7) I I I
HDAC6 798 LLG 800 (SEQ ID NO: 8)
[213] The zinc finger domain was located in wild-type HDAC6 polypeptide at amino acid position 1133 to position 1204 (TABLE 7; zf-UBP; score from 1133 to 1204; score 106.3, E = 7.7e-29).
TABLE 7
Sequence Alignment Comparison of Human HDAC6 with Pfam Model of Zinc Finger Domain Located from AA 1133 to AA 1204
*->CvstCgltenlWlCLtCGqvgCGryqydgdGgngHaleHyretgHpl I + I 1 + +++ I + I l + l + l l + l I I I + 1 I 1 + 1 + 1 + ++ I I I I
HDAC6 1133 C-GDCGTIQENWVCLSCYQVYCGRY INGHMLQHHGNSGHPL 1172
avklktqstwcyaadvYvhqeddsedaldgkylvd-c-* (SEQ ID NO : 9)
++++ ++ I I ] I + + + + I I | ++ +++ +++ +++ HDAC6 1173 VLSYIDLSAWCYYCQAYVHHQALLD — VKNI-AHQ 1204 (SEQ ID NO r IO)
[214] The two mutations identified in this inventation are location in between the second HDAC domain and the z-finger domain. Amino acid change in these positions may alter the protein structure and hence the protein function.
C. Analysis of the Potential Effect of HDAC6 mutation on HDAC6 Protein Regulatory Sites i. NetPhos Analysis of the Effect of HDAC6 mutation on HDAC6 Protein Phosphorylation
[215] Phosphorylation plays an important role in regulating HDAC function. Chang S, Bezprozvannaya S5 Li S5 Olson EN. Proc Natl Acad Sci U S A. ;102(23):8120-5 (June 7, 2005). Potential HDAC6 phosphorylation sites were identified by computational analysis using the NetPhos computational analysis tool. NetPhos produces neural network predictions for serine, threonine and tyrosine phosphorylation sites in eukaryotic proteins. Blom et ah, J. MoI. Biol., 294(5): 1351-1362 (1999). Potential phosphorylation predicted by NetPhos analysis are summarized below in TABLE 8. To be considered a potential phosphorylation site a threshold score of 0.5 was required . The predicted phosphorylation sites marked in bold text represent possible mutation interference.
[216] HDAC6 mutation position G 1064 is located in between two putative serine phosphorylation sites, S 1062 and S 1066. The alteration of HDAC6 phosporylation pattern may affect (i.e., increase or decrease) HDAC6 biological activity when compared to the biological activity of wild-type HDAC6 polypeptide.
TABLE 8 HDAC6 Phosphorylation Sites Predicted by NetPhos
Phosphorylation Amino Acid Position
Serine 16,22,27,28,31,173,180,220,274,381,423,441,458,563,564,
670,689,755,786,819,846,847,884,911 ,1009, 1025,1026, 1035, 1045, 1060,1062,1066, 1175
Threonine 10,30,156,359,442,525,527,532,551,635,868,869,892,896, 923,937,938,947,951,965,966,979,980,993,994,1007,1008, 1021 Tyrosine 149,158,171,175,225,319,570,668,759,1156,1189
ii. Analysis of the Potential Effect of HDAC6 mutation on SUMO HDAC6 Protein Regulatory Sites
[217] Small ubiquitin-related modifier (SUMO) family proteins (a.k.a., PICl, UBLl, Sentrin, GMPl, and Smt3) are covalently attached to select target proteins in post- translational modification. Johnson, E.S., Ann. Rev. Biochem., 73: 355-382 (2004). SUMO is a member of a ubiquitin-like protein family that is ligated to lysine residues in a variety of target proteins, modulating their functions. SUMO modification is reversible, and does not appear to target proteins for degradation but rather alters the target protein function through changes in cellular localization, biochemical activation, or through protection from ubiquitin- dependent degradation. Posttranslational modification via sumoylation influences numerous biological processes, including signal transduction, transcriptional regulation, and growth control. Shiio and Eisenmann have demonstrated that the DNA-binding histone proteins are subject to sumoylation. Shiio and Eisenmann, Proc. Natl Acad. ScL U.S.A., 100(23): 13225- 30 (2003).
[218] Potential HDAC6 sumoylation sites were identified by computational analysis using the SUMOPlot computational analysis tool. Hinsley et ah, Protein Sd., 13: 2588 - 2599 (2004); Van Dyck et al, J. Biol. Chem., 279: 36121 - 36131 (2004). SUMOplot™ predicts the probability for the SUMO consensus sequence (SUMO-CS) to be engaged in SUMO attachment. That is, most SUMO-modifϊed proteins contain the tetrapeptide motif B-K-x-D/E where B is a hydrophobic residue, K is the lysine conjugated to SUMO, x is any amino acid (aa), and D or E is an acidic residue. Substrate specificity appears to be derived directly from Ubc9 and the respective substrate motif. The SUMOplot™ score system is based on two criteria: 1) direct amino acid match to the SUMO-CS observed and shown to bind Ubc9, and
2) substitution of the consensus amino acid residues with amino acid residues exhibiting similar hydrophobicity. The seven (7) potential HDAC6 sumoylation sites identified in
HDAC6 wild-type polypeptide are summarized below in TABLE 9 and TABLE 10.
[219] The amino acid sequence of HDAC6 wild-type polypeptide (SEQ ID NO:11) is shown below in TABLE 9. Bold text with light gray shading designates a sumoylation motif. Bold and underlined text with dark gray shading designates a sumoylation motif of high probability.
. TABLE 9 Sumoylation Motifs in Wild-type HDAC6
1 MTSTGQDSTT TRQRRSRQNP QSPPQDSSVT SKRNJBgRqAV PRSIPNLAEV
51 KKKGKMKKΪiG QAMEEDLIVG LQGMDLNLEA EftLftGTGLVL DEQLNEFHCL
101 WDDSFPEGPE RLHAIKEQLI QEGLLDRCVS FQARFAgSIf LMLVHSLEYI
151 DLMETTQYMN EGELRVLADT YDSVYLHPNS YSCACLASGS VLRLVDAVLG
201 AEIRNGMAII RPPGHHAQHS LMDGYCMFNH VAVAARYAQQ KHRIRRVLIV
251 DWDVHHGQGT QFTFDQDPSV LYFSIHRYEQ GRFWPHIiKAS NWSTTGFGQG
301 QGYTINVPWN QVGMRDADYI AAFLHVLLPV ALEFQPQLVL VAAGFDALQG
351 DgKGBMAATP AGFAQLTHLL MGLAGGKLIL SLEGGYNLRA LAEGVSASLH
401 TLLGDPCPML ESPGAPCRSA QASVSCALEA LEPFWEVLVR STETVERDNM
451 EEDNVEESEE EGPWEPPVLP ILTWPVLQSR TGLVYDQNMM NHCNLWDSHH
501 PEVPQRILRI MCRLEELGLA GRCLTLTPRP ATEAELLTCH SAEYVGHLRA
551 TEKMKTRELH RESSNFDSIY ICPSTFACAQ LATGAACRLV EAVLSGEVLN
601 GAAVVRPPGH HAEQDAACGF CFFNSVAVAA RHAQTISGHA LRILIVDWDV
651 HHGNGTQHMF EDDPSVLYVS LHRYDHGTFF PMGDEGASSQ IGRAAGTGFT
701 VNVAWNGPRM GDADYLAAWH RLVLPIAYEF NPELVLVSAG FDAARGDPLG
751 GCQVSPEGYA HLTHLLMGLA SGRIILILEG GYNLTSISES MAACTRSLLG
801 DPPPLLTLPR PPLSGALASI TETIQVHRRY WRSLRVgϋg DREGPSSΞKL
851 VTjKgKgQPAK PRLAERMTTR EKKVLEAGMG KVTSASFGEE STPGQTNSET
901 AWALTQDQP SEAATGGATL AQTISEAAIG GAMLGQTTSE EAVGGATPDQ
951 TTSEETVGGA ILDQTTSEDA VGGATLGQTT SEEAVGGATL AQTTSEAAME
1001 GATLDQTTSE EAPGGTELIQ TPLASSTDHQ TPPTSPVQGT TPQISPSTLI
1051 GSLRTLELGS ESQGASESQA PGEENLLGEA AGGQDMADSM LMQGSRGLTD
1101 QAIFYAVTPL PWCPHLVAVC PIPAAGLDVT QPCGDCGTIQ ENWVCLSCYQ
1151 VYCGRYINGH MLQHHGNSGH PLVLSYIDLS AWCYYCQAYV HHQALLDVKN
1201 IAHQNKFGED MPHPH SEQ ID NO: 11
[220] The seven (7) potential HDAC6 sumoylation sites identified in HDAC6 and their relative score are summarized below in TABLE 10. A score above 0.5 indicates a high sumoylation potential.
TABLE 10 Potential Sumoylation Sites in HDAC6
HDAC6 Amino Acid Position Group SEO ID NO: Score
K838 RSLRVMKVEDREGP 12 0.80
K36 TSKRNIICKGAVPRS 13 0.77
K353 ALQGDPKGEMAATP 14 0.61
K 138 QARFAEKEELMLVH 15 0.50
K854 SKLVTKKAPQPAKP 16 0.37
K52 NLAEVKKKGKMKKL 17 0.31
K58 KKGKMKKI-GQAMEE 18 0.31
[221] The mutations identified in this invention are not predicted to change the sumoylation pattern of HD AC6.
ii. PROSITE Analysis of the Potential Effect of HDAC6 mutation on Other HDAC6 Protein Regulatory Sites
[222] The effect of the HDAC6 mutations on other protein regulatory sites was analyzed using the PROSITE computational analysis tool. PROSITE is a database of protein families and domains. It consists of biologically significant sites, patterns and profiles that help to reliably identify to which known protein family (if any) a new sequence belongs as well as to identify potential sites for protein modification. HuIo N. et al, Nucl. Acids. Res., 32:D134- D 137 (2004); Sigrist C.J.A. et al, Brief Bioinform., 3:265-274 (2002); Gattiker A. et al, Applied Bioinformatics, 1:107-108 (2002). Other potential sites for protein modification of HDAC6 polypeptide as predicted by ProSite analysis are summarized below in TABLE 11. Amino acid positions highlighted in bold text represent possible interference by mutations.
TABLE 11 Potential HDAC6 protein modification sites predicted bv PROSITE
Modification Amino Acid Positions Threonine-rich region 937 - 1041 PS50325
Casein kinase II phosphorylation 4.7;iO4 - 107; 220 - 223; 458 - 461; 532 - PS00006 535; 564 - 567; 786 - 789; 819 - 822; 868 -
871; 886 - 889; 896 - 899; 923 -926; 937 -
940; 938 - 941; 951 - 954; 952 - 955; 965 -
968; 966 - 969; 979 - 982; 980 - 983; 993 -
996; 1007 - 1010; 1008 - 1011; 1025 -
1028; 1138 -1141; 1175- 1178
N-myristoylation site 5-10; 73-78; PS00008 259-264; 298-303; 300-305; 313-318; 372-
377; 394-399; 696-701; 768-773; 815-820;
894-899; 917-922; 931-936; 935-940; 973-
978; 977-982; 987-992; 1051-1056; 1064-
1069; 1083-1088; 1126-1131
PKC phosphorylation 10-12; 30-32; 31-33; 527-529; 551-553; PS00005 771-773; 833-835; 847-849; 852-854; 868-
870; 1052-1054
N-glycosylation 291-294; 654-657; 783-786
PSOOOOl
Tyrosine phosphorylation 709-715
PS00007
Cell Attachment Sequence 745-747
PSOOOl6
[223] G930 and G 1064 were observed to be close to putative N-myristoylation sites. The HDAC6 mutation site G930 was observed close to a putative phosphorylation site by casein kinase IL Mutation in these positions may alter the modification patterns of HDAC6 and hence the protein function.
D. ClustalW Polypeptide Alignment and Sequence Analysis to Estimate the Potential Effect of HDAC6 mutation on HDAC6 Function
[224] ClustalW polypeptide alignment and sequence analysis was used to estimate the effect of HDAC6 mutation on HDAC6 biological function. Known HDAC sequences of various organisms including, e.g., rat HDAC6 (XP_228753), mouse HDAC6 (NP_034543), human HDAC6 (NP__006035), human HDAClO (NP_114408), mouse HDAClO (NP_954668), fly HDAC6 (NP_727842), putative mosquito HDAC (XP_565819), putative worm HDAC sequences (NP_500787 & NP_500788) were obtained from GenBank and aligned using
ClustalW. Chenna et al, Nucleic Acids Res., 31 (13):3497-500 (2003). ClustalW is a general purpose multiple sequence alignment program for DNA or proteins. It produces biologically meaningful multiple sequence alignments of divergent sequences. It calculates the best match for the selected sequences, and lines them up so that the identities, similarities and differences can be seen.
[225] For the position with the mutation reported, the mutated residue was inspected for its occurrence in organisms other than human. It was hypothesized that if the mutated residue was present in the wild-type sequence of another species in the corresponding position, the amino acid change may not have any adverse effect on the protein function. The results of the Clustal W comparison analysis are summarized below in TABLE 12. The mutated amino acid residues identified in the present studies is highlighted in bold, underlined text.
TABLE 12 ClustalW Sequence Alignment of HDAC Sequences from Multiple Organisms
HDAC6 rat -MTSTGQDSS-TRQRKSRHNPQSPLQDSSAT
HDAC6_raouse -MTSTGQDSS-TRQRKSRHNPQSPLQESSAT
HDAC6_human -MTSTGQDSTTTRQRRSRQNPQSPPQDSSVT HDAC10_human HDAC10_mouse
HDAC6_fly MLTKDLNSFSQSPPIVTRRSAQQAKIQTRAMASKSKTGTTGAATAAGGTGSSSGSISGGF HDAC_mosquito HDAC_worm HDAC worm2
HDAC6_rat LKRGGKKGAVPHSSPNLAEVKKKGKMKKLSQPAEEDLIVGLQGLDLNSETRVPVGTGLVF
HDAC6_mouse LKRGGKKCAVPHSSPNLAEVKKKGKMKKLSQPAEEDLVVGLQGLDLNPETRVPVGTGLVF
HDAC6_human SKRNIKKGAVPRSIPNLAEVKKKGKMKKLGQAMEEDLIVGLQGMDLNLEAEALAGTGLVL
HDAC10_human -MGTALVY
HDAC10_mouse MGTALVY HDAC6_fly NAADPRKSNKPNAALLEAKRRARNNMLKNQGCGSQESVTDIFQNAVNSKSLVRKPTALIY HDAC_mosquito
HDAC_worm -MLFEDRQKNRS- -TGPTLIGF HDAC worm2 -MLFEDRQKNRS- -TGPTLIGF
HDAC6_rat DEQLNDFHCLWDDS—FPENPERLHAIKEQLILEGLLGRCVSFQARFAEKEE-LMLVHSL
HDAC6_mouse DEQLNDFHCLWDDS—FPESPERLHAIREQLILEGLLGRCVSFQAWFAEKEE-LMLVHSL
HDAC6_human DEQLNEFHCLWDDS—FPEGPERLHAIKEQLIQEGLLDRCVSFQARFAEKEE-LMLVHSL
HDAC10_human HEDMTATRLLWDDPECEIERPERLTAALDRLRQRGLEQRCLRLSAREASEEE-LGLVHSP
HDAClO mouse HEDMTATRLLWDDPECEIECPERLTAALDGLRQRGLEERCLCLSACEASEEE-LGLVHSP
HDAC6_fly DBSMSQHCCLWDKE—HYECPERFTRVLERCRELNLTERCLELPSRSATKDEILRLHTEE HDAC_mosqu-.to
HDAC_worm KQTQNEHENTVCPT—HPESSDRILKIKEALTKTKILEKCTVLTNFLEIDDADLEVTHDK HDAC worm2 NQTQNEHENTVCPT—HPESSDRILKIKEALTKTKILEKCTVLTNFLEIDDADLEVTHDK
HDAC6_rat EYIDLMETTQYMNEGELRVLAGTYDSVYLHPNSYSCACLATGSVLRLVDAVMGAEIRNGM HDAC6_mouse EYIDIMETTQYMNEGELRVLAETYDSVYLHPNSYSCACLATGSVLRLVDALMGAEIRNGM HDAC6 human EYIDLMETTQYMNEGELRVLADTYDSVYLHPNSYSCACLASGSVLRLVDAVLGAEIRNGM HDAC10_human EYVSLVRETQVLGKEELQALSGQFDAIYFHPSTFHCARLAAGAGLQLVDAVLTGAVQNGL HDACIOjmouse EYIALVQKTQTLDKEELHALSKQYDAVYFHPDTFHCARLAAGAALQLVDAVLTGAVHNGL
HDAC6_fly HFERLKETSGIRDDERMEELSSRYDSIYIHPSTFELSLLASGSTIELVDHLVAGKAQNGM
HDAC_mosquito LVESVWSGTVQNGM HDAC_worm SMVKDLMESEKKTQEDINSQCEKYDSVFMTENSMKVAKDGVACVRDLTNRIMANEASNGF HDAC worm2 SMVKDLMESEKKTQEDINSQCEKYDΞVFMTENSMKVAKDGVACVRDLTNRIMANEASNGF
HDAC6_rat AVIRPPGHHAQRSLMDGYCMFNHLAVAARYAQKKHRIQRILIVDWDVHHGQGTQFIFDQD
HDAC6_mouse AVIRPPGHHAQHNLMDGYCMFNHLAVAARYAQKKHRIQRVLIVDWDVHHGQGTQFIFDQD
HDAC6_human AIIRPPGHHAQHSLMDGYCMFNHVAVAARYAQQKHRIRRVLIVDWDVHHGQGTQFTFDQD
HDAC10_human ALVRPPGHHGQRAAANGFCVFNNVAIAAAHAKQKHGLHRILWDWDVHHGQGIQYLFEDD
HDAC10_mouse ALVRPPGHHSQRAAANGFCVFNNVALAAKHAKQKYGLQRILIVDWDVHHGQGIQYIFNDD
HDAC6_fly AiiRppGHHAMKAEYNGYCFFNNVALATQHALDVHKLQRiLiiDYDVHHGQGTQRFFYND
HDAC_mosquxto ArLRPPGHHAMTAEYNGYCFFNNVAIAAQHALDHLGTQKILIVDWDVHHGQGTQRMFYSD
HDAC_worm AWRPPGHHADSVSPCGFCLFNNVAQAAEEAF-FSGAERILIVDLDVHHGHGTQRIFYDD HDAC worm2 AVVRPPGHHADSVSPCGFCLFNNVAQAAEEAF- FSGAERILIVDLDVHHGHGTQRIFYDD
HDAC6_rat PSVLYFSIHRYEHGRFWPHLKASNWSTTGFGQGQGYTINVPWNQVGMRDADYIAAFLHIL
HDAC6_mouse PSVLYFSIHRYEHGRFWPHLKASNWSTIGFGQGQGYTINVPWNQTGMRDADYIAAFLHIL
HDAC6_huinan PSVLYFSIHRYEQGRFWPHLKASNWSTTGFGQGQGYTINVPWNQVGMRDADYIAAFLHVL
HDAC10_human PSVLYFSWHRYEHGRFWPFLRESDADAVGRGQGLGFTVNLPWNQVGMGNADYVAAFLHLL
HDAC10_mouse PSVLYFSWHRYEHGSFWPFLPESDADAVGQGQGQGFTVNLPWNQVGMGNADYLAAFLHVL
HDAC6_fly E1RVVYFSIHRFEHGSFWPHLHESDYHAIGSGAGTGYNFNV-1LNATGMTNGDYLAIFQQLL
HDAC_mosquito PRVLYFSIHRYEHGTFWPNLRESDFDYVGTGKGEGYNFNVPLNKTGMTNGDYLAIWQQLL
HDAC_worm KRVLYFSIHRHEHGLFWPHLPESDFDHIGSGKGLGYNANLALNETGCTDSDYLSIIFHVL
HDAC W0rm2 KRVLYFSIHRHEHGLFWPHLPESDFDHIGSGKGLGYNANLALNETGCTDSDYLSIIFHVL
HDAC6_rat LPVA—FEPQLVLVAAGFDALHGDPKGEMSATPAGFAHLTHFLMGLAGGKLILSLEGGYN
HDAC6_mouse LPVASEFQPQLVLVAAGFDALHGDPKGEMAATPAGFAHLTHLLMGLAGGKLILSLEGGYN
HDAC6_human LPVALEFQPQLVLVAAGFDALQGDPKGEMAATPAGFAQLTHLLMGLAGGKLILSLEGGYN
HDAC10_human LPLAFEFDPELVLVSAGFDSAIGDPEGQMQATPECFAHLTQLLQVLAGGRVCAVLEGGYH
HDAC10_mouse LPLAFEFDPELVLVSAGFDSAIGDPEGQMQATPECFAHLTQLLQVLAGGRICAVLEGGYH
HDAC6_fly LPVALEFQPELIIVSAGYDAALGCPEGEMEVTPACYPHLLNPLLRLADARVAVVLEGGYC
HDAC_mosquito LPVATEFQPELIIVSAGYDAAYGCPEGCMET-PAFYSHLLSPLLLMAQGRVAVVLEGGYC HDAC_worm LPLATQFDPHFVIISAGFDALLGDPLGGMCLTPDGYSHILYHLKSLAQGRMLVVLEGGYN HDAC worπι2 LPLATQFDPHFVIISAGFDALLGDPLGGMCLTPDGYSHILYHLKSLAQGRMLWLEGGYN
HDAC6_rat LHALAKGVSGSLHTLLGDPCPMLESPV-APCASAQTSISCTLEALEPFWEVLERSVEPQD HDAC6_mouse LRALAKGISASLHTLLGDPCPMLESCV-VPCASAQISIYCTLEALEPFWEVLERSVETQE HDAC6_human LRALAEGVSASLHTLLGDPCPMLESPG-APCRSAQASVSCALEALEPFWEVLVRSTETVE HDAC10_human LESLAESVCMTVQTLLGDPAPPLSGPM-APCQSALESIQΞARAAQAPHWKSLQQQDVTAV
HDAC10_mouse LESLAQSVCMMVQTLLGDPTPPLLGLM-VPCQSALESIQSVQTAQTPYWTSLQQN—VAP
HDAC6_fly LDSLAEGAALTLRSLLGDPCPPLVETVPLPRAELAQALLSCIAVHRPHWRCLQLQQTYDC
HDAC_mosquito LESLAEGCALTLKTLLGDPAPRLAEALQPPSESMQASILNCIYSHRKYWKCLQLYELYDL
HDAC worm HQISAVAVQRCVRVLLG-YAPFSIELNEAPKESTVDSCVSLVSVLRHHWNCFDYFPSRTS
HDAC worm2 HQISAVAVQRCVRVLLG-YAPFSIELNEAPKESTVDSCVSLVSVLRHHWNCFDYFPSRTS
HDAC6_rat EDEVEG DMLEDEEEEGHWEATALPMDT
HDAC6_πiouse EDEVEE AVLEEEEEEGGWEATALPMDT
HDAC6_human RDNMEE DNVEESEEEGPWEPPVLPILT
HDAC10_human PMSPSS HSPEGR P
HDAC10_mouse VLSSST HSPEER S
HDAC6_fly VELQDRDKEEDLHTVLRHWIGGPPPMD—RYPTRDTAIPLPPEKLTSNAARLQVLRAETK HDAC_mosquito EDYNNTNPQDNFHKVIKCYVPPEPPAQPGRYETRNCYPVQSADERARIQDRLARLHLATD
HDAC_worm LRLAQWPIVN—TKVIYNYDPTTRRAD 1 TGEIIQDELASTEFTASDV HDAC worm2 LRLAQWPIVN—TKVIYNYDPTTRRAD TGEIIQDELASTEFTASDV
HDAC6_rat WPLLQNRTGLVYDERMMSHCNLWDNHHPETPQRILRIMCHLEEVG-LAARCLILPARPAL
HDAC6_mouse WPLLQNRTGLVYDEKMMSHCNLWDNHHPETPQRILRIMCHLEEVG-LAARCLILPARPAL
HDAC6_human WPVLQSRTGLVYDQNMMNHCNLWDSHHPEVPQRILRIMCRLEELG-LAGRCLTLTPRPAT
HDAC10_human PPLLPG—GPVCKAAASAPSSLLDQPCLCPAP SVRTAVALT-TPDITLVLPP
HDAC10_mouse LRLLGE—SPTCAVAEDSLSPLLDQLCLRPAP PICTAVAST-VPGAALCLPP
HDAC6_fly LSVPSFKVCYAYDAQMLLHCNLNDTGHPEQPSRIQHIHK-MHDDYGLLKQMKQLSPRAAT HDAC_mosqua.to LSFPPDRVCYVYDELMLEHRNSAEKWHPEQPERIAKIMQRLTGDYQLAQRMKRLPGRLAT HDAC_worm IPTENMETLIYFNEGDDAHFDLEEDNHPEKPARTRRILKTLRESGVLEKCVDRNCERIAT HDAC worm2 IPTENMETLIYFNEGDDAHFDLEEDNHPEKPARTRRILKTLRESGVLEKCVDRNCERIAT
HDAC6_rat DSELLTCHSAEYVERLRATEKMKTRDLHREGA-NFESIYICPSTFACAQLATGAACRLVE
HDAC6_mouse GSELLTCHSAEYVEHLRTTEKMKTRDLHREGA-NFDSIYICPSTFACAKLATGAACRLVE
HDAC6_human EAELLTCHSAEYVGHLRATEKMKTRELHRESS-NFDSIYICPSTFACAQLATGAACRLVE
HDAC10_human —DVIQQEASALREETEAWARPHESLAREE ALTALGKLLYLLD
HDAC10_mouse —GVLHQEGSVLREETEAWARLHKSRFQDE DLATLGKILCLLD
HDAC6_fly TDEVCLAHTRAHVNTVRRLLGREPKELHDAAG-IYNSVYLHPRTFDCATLAAGLVLQAVD
HDAC_mosquito TEELCLVHGPEHVRLIGEVC—QSSTMKQTAD-QYNSIYFHPSTDASARMATGSVLTVVE
HDAC_worm NEEIRLVHTKKMLEHLRTTETMKDEELMEEAEKEFNSIYLTRDTLKVARKAVGAVLQSVD
HDAC_worm2 NEEIRLVHTKKMLEHLRTTETMKDEELMEEAEKEFNSIYLTRDTLKVARKAVGAVLQSVD : . : * : :
SEQ ID NO: 48
HDAC6_rat AVLSGEV—LNGIAIVRPPGHHAEPDAACGFCFFNSVAVAARHAQVIAGRALRILIVDWD
HDAC6_mouse AVLSGEV—LNGIAWRPPGHHAEPNAACGFCFFNSVAVAARHAQIIAGRALRILIVDWD
HDAC6_human AVLSGEV—LNGAAWRPPGHHAEQDAACGFCFFNSVAVAARHAQTISGHALRILIVDWD
HDAC10_human GMLDGQV—NSGIAATPASAAAATLDVAVRRGLS HGAQRLLCVALGQLDRPPD
HDAC10_mouse GIMDGQI—RNAIATTTALATAATLDVLIQRCLA RRAQRVLCVALGQLDRPLD
HDAC6_fly SVLRGES—RSGICNVRPPGHHAEQDHPHGFCIFNNVAIAAQYAIRDFGLER-VLIVDWD
HDAC_mσsquito DVLEGRT—RSGVCVVRPPGHHAEADTPHGFCIYNSIAVAARHAVKRYGLRR-VLIVDWD
HDAC worm EIFEKDAGQRNALVIVRPPGHHASASKSSGFCIFNNVAVAAKYAQRRHKAKR-VLILDWD
HDAC worm2 EIFEKDRGQRNALVIVRPPGHHASaSKSSGFCIFNNVAVAAKYAQRRHKAKR-VI1ILDWD
HDAC6_rat VHHGNGTQHIFEEDPSVLYVSLHRYDRGTFFPMGDEGASSQVGRAAGTGFTVNVPWNGPR
HDAC6_mouse VHHGNGTQHIFEDDPSVLYVSLHRYDRGTFFPMGDEGASSQVGRDAGIGFTVNVPWNGPR
HDAC6_human VHHGNGTQHMFEDDPSVLYVSLHRYDHGTFFPMGDEGASSQIGRAAGTGFTVNVAWNGPR
HDACl0_human LAH- -DGRSLWLNIR- -GKEAAALSMFHVSTPLPV
HDAC10_mouse LAD DGRILWLNIR GKDAAIQSMFHFSTPLPQ HDAC6_fly VHHGNGTQHIFESNPKVLYISLHRYEHGSFFPKGPDGNFDVVGKGAGRGFNVNIPWNKKG HDAC_mosqU-.to VHHGNGTQHIFEQDPHVLYVSVHRYDNGAFFPRSTDADYTWGSGPGRGFNVNIPWNRKG HDAC_worm VHHGNGTQEIFYEDSNVMYMSIHRHDKGNFYPIGEPKDYSDVGEGAGEGMSVNVPFSGVQ HDAC worra2 VHHGNGTQEIFYEDSNVMYMSIHRHDKGNFYPIGEPKDYSDVGEGAGEGMSVNVPFSGVQ
HDAC6_rat MGDADYLATWHRLVLPIAYEFNPELVLISAGFDAAQGDPLGGCQVTPEGYAHLTHLLMGL
HDAC6_mouse MGDADYLAAWHRLVIiPIAYEFNPELVLISAGFDAAQGDPLGGCQVTPEGYAHLTHLLMGL
HDAC6_huiαan MGDADYLAAWHRLVLPIAYEFNPELVLVSAGFDAARGDPLGGCQVSPEGYAHLTHI1LMGL
H DAC 10_human M-TGGFLSCILGLVLPLAYGFQPDLVLVALG- -PGHGLQG—PHAALLAAMLRGL
HDAClOjmouse T-TGGFLSLILGLVLPLAYGFQPDMVLMALG PAHGLQN — AQAALLAAMLRSP HDAC6_fly MGDLEYALAFQQLIMPIAYEFN PQLVLVSAGFDAAIGDPLGGCKVTAEGYGMLTHWLΞAL HDAC_mosquαto MGDPEYVAAFHSVIIIPIAYEYEPELVLVSAGFDAAIGDPLGGCRVTPEAYGHFTQWLSVL HDAC_worm MGDNEYQMAFQRVIMPIAYQFNPDLVLISAGFDAAVDDPLGEYKVTPETFALMTYQLSSL HDAC worm2 MGDNEYQMAFQRVIMPIAYQFNPDLVLISAGFDAAVDDPLGEYKVTPETFALMTYQLSSL
HDAC6_rat AGGRIILILEGGYNLTSISESMAACTHSLLG DPPPQLTSLRPPQSGALASI H DAC 6_inou S e AGGRIILILEGGYNLASISESMAACTHSLLG DPPPQLTLLRPPQSGALVSI HDAC6_human ASGRIILILEGGYNLTSISESMAACTRSLLG DPPPLLTLPRPPLSGALASI H DAC 10_human AGGRVLALLEE NSTPQLAGILARVLNG EAPPSLGPSSVASPEDVQAL H DAC 10_mou S e VGGRILAWEE ESIRLLARSLAQALHG ETPPSLGPFSKATPEEIQAL HDAC6_fly ASGRIIVCLEGGYNVNSIS-YAMTMCTKTLLGDPVPTPQLGATALQKPPTVAFQSCVESL
HDAC_mosquito AGGRLVLCLEGGYNVNSIS-HAMAMCAKALLGDPLP-MLLPVSGTARTPPATHASCCETL HDAC_worm AGGRIITVLEGGYNLTSISNSAQAVCEVLQNRSMLRRLREEKEQFATKPQKIESSCIKTI HDAC worm2 AGGRIITVLEGGYNLTSISNSAQAVCEVLQNRSMLRRLREEKEQFATKPQKIESSCIKTI
HDAC6_rat SEVIQVHRKYWRSLRLMKMEDKEERSSSRLVIKKLPQSASPVSAKGMTTPKGKVLEAGMR
HDAC6_mouse SEVIQVHRKYWRSLRLSKMEDKEECSSSRLVVKKLPPTASPVSAKEMTTPKGKVPEESVR
HDAC6_human TETIQVHRRYWRSLRVMKVEDREGPSSSKLVTKKAPQPAKPRLAERMTTREKKVLEAGMG
HDAC10_human MYLRGQLEPQWKMLQCHPHLVA
HDAC10_mouse MFLKARLEARWKLLQVAAPPP
HDAC6_fly QQCLQVQRNHWRSLEFVGRRLPRDPVVGENNNEDF
HDAC_mosquito SNVLSVQRLYWRSLCFNKKLPSSEPLLEETGSE
HDAC_worm REVCAVQQKYWSILKGFQIIIG
HDAC worm2 REVCAVQQKYWSILKGFQVTPSNYGLDIDDEAYD
HDAC6_rat KPTAALPTKESTLGQAKAKTAKALLAQGQ
HDAC6_mouse KTIAALPGKESTLGQAKSKMAKAVLAQGQ
HDACδ_human KVTSASFGEESTPGQTNSETAVVALTQDQPSEAATGGATLAQTISEAAIGGAMLGQTTSE
HDAC10_huπιan
HDAC10_mouse
HDAC6_fly
HDAC_mosquito
HDAC_worm
HDAC worm2
HDAC6 rat -SSEQAAKGTTLDLATSKDTVGGATTDQCASVAATE
HDAC6_mouse SSEQAAKGTTLDLATSKETVGGATTDLWASAAAPE HDAC6_human EAVGGATPDQTTSEETVGGAILDQTTSΞDAVGGATLGQTTSEEAVGGATLAQTTSEAAME HDAC10_huinan HDAC10_mouse
HDAC6_fly -LTASLRHLN
HDAC_mosquito -LIEALDNMT HDAC_worm HDAC worm2
HDAC6_rat NSANQTTSGEEASGETEΞFGTSPSSNASKQTTGASPLHGAAAQQSPE—LGLSSTLELSS
HDAC6_mouse NFPNQTTS-VEALGETEP—TPPASHTNKQTTGASPLQGVTAQQSLQ—LGVLSTLELSR HDAC6_human GATLDQTTSEEAPGGTELIQTPLASSTDHQTPPTSPVQGTTPQISPΞTLIGSLRTLELGS HDACl0_human HDAC10_mouse HDAC6_fly ISNDDATAAAGGLAGDRPDCGDERPSGSKPKVKVKTLSDYLAENKEALEQΞEM-
HDAC_mosquito LSE RGGTDQQQPQQTTSTPSSSSSATTAGSSGGTAERGN HDAC_worm ΞVLNΞKLINKNGKSSAAT LKVKGKAATDP HDAC worm2 DDSIDMADQSSSSGSSSSSTRESHNLEIMDSGPAHA
HDAC6_rat EAQEVQES EEGLLGEAAGGQDMNSLMLTQGFGDFNTQDVFYAVTPLSWCFHLMAVC
HDAC6_mouse EAEEAHDS EEGLLGEAAGGQDMNSLMLTQGFGDFNTQDVFYAVTPLSWCPHLMAVC HDAC6_human ESQGASESQAPGEENLLGEAAGGQDMADSMLMQGSRGLTDQAIFYAVTPLPWCPHLVAVC HDAC10_human HDAC10_mouse
HDAC6_fly -FAVYPLKTCPHLRLLR HDAC_mosquito -WIPRRDCPHLKLLR HDAC_worm VKQADDS HDAC worm2 VVPLATCPHLKEVK
HDAC6_rat P-IPAAGLDVSQPCKTCGSVQENWVCLTCYQVYCSRYVKAHMVCH-HEASEHPLVLSCVD HDAC6_raouse P-IPAAGLDVSQPCKTCGTVQENWVCLTCYQVYCSRYVNAHMVCH-HEASEHPLVLSCVD
HDAC6_human P-IPAAGLDVTQPCGDCGTIQENWVCLSCYQVYCGRYINGHMLQH-HGNSGHPLVLSYID HDAC10_human HDAC10_raouse
HDAC6_fly PEEAPRSLDSGAECSVCGSTGENWVCLSCRHVACGRYVNAHMEQH-SVEEQHPLAMSTAD HDAC_mosquxto PETAPEAIDTRTPCSDCGAEVENWICLLCFGVYCGRYVNEHMLRHGTGAADHPLALSFAD
HDAC_worm RRYNTRRRRSANDEVEDVMEKLENMKL HDAC worm2 P-LPPAKINARTACSECQIGAEVWTCLTCYKYNCGRFVNEHAMMH-HLSSSHPMALSMAD
HDAC6_rat LSTWCYLCQAYVHHEDLQDVKNAAHQNKFGEGMPHLQ-
HDAC6_mouse LSTWCYVCQAYVHQDDLQDVKNAAHQNKFGEDMPHSH-
HDAC6_human LSAWCYYCQAYVHHQALLDVKNIAHQNKFGEDMPHPH-
HDAC10_human
HDAC10_mouse
HDAC6_fly LSVWCYACSAYVDHPRLYAYLNPLHEDKFQEPMAWTHGCAWREDGCYATGPDGRDEDDDD
HDAC_mosquito LSVWCYGCDAYIDHPALHHYKNLVHQDKFHEPL
HDAC_worm
HDAC worm2 LSVWCYPCDSYVHNPALIGAKSAAHESKFGETMPΞ
HDAC6_rat (SEQ ID NO: 19)
HDAC6~mouse <SEQ ID N0: 20)
HDΛC6~human (SEQ ID NO: 21)
HDAC10_human (SEQ ID NO: 22)
HDAC10_mouse (SEQ ID NO: 23)
HDAC6_fly DNGAGSSICLRLERNN (SEQ ID NO: 24) "
HDAC_iήosquito (SEQ ID NO: 25)
HDAC_worm (SEQ ID NO: 26)
HDAC_WOrm2 (SEQ ID NO : 27)
[226] The region from P910 to T965 of human HDAC6 is missing in both mouse and rat HDAC6. The matching positions for Gl 064 in mouse and rat HDAC6 both have E instead of G.
E. Analysis of the Potential Effect of HDAC6 mutation on HDAC6 Protein Secondary Structure i. The Effect of HD AC6 Mutation on Amino Acid Property
[227] The change of amino acid property observed by HDAC6 mutation is summarized in TABLE 13. Valdar WS, Proteins 48(2): 227-41 (2002).
TABLE 13 Influence of HDAC6 Mutations on HDAC6 Amino Acid Property
Mutation Property change
G930R Tiny -> positive
G1064A No Change Predicted
ii. nnPredict Method Analysis of the Wild-type HDAC6 Secondary
Structure
[228] Secondary structure predictions of wild-type HDAC6 (TABLE 15) and mutant HDAC6 polypeptides (TABLE 17 and TABLE 19) were performed by nnPredict. The basis of the prediction is a two-layer, feed-forward neural network. The network weights were determined by a separate program — a modification of the Parallel Distributed Programming suite of McClelland and Rumelhart (MIT Press, Cambridge MA.1, Vol. 3, pp 318-362 (1988)). Complete details of the determination of the network weights is found in Kneller et. al (J. MoI. Biol., (214): 171-182 (1990)). The output is a secondary structure prediction for each position in the sequence.
[229] All TABLES (e.g., TABLES 15, 17, and 19) in this section that summarize the HDAC6 protein secondary structure as predicted by nnPredict use "H", "E" and a dash "-" as identifiers, which are defined as follows. A helix element is designated by the letter "H". A strand element is designated by the letter "E". No prediction is designated by a dash ("-")• Gray shading represents polypeptide regions where mutation was identified. [230] The amino acid sequence of wild-type HDAC6 polypeptide (SEQ ID NO: 11) is shown below in TABLE 14.
TABLE 14
MTSTGQDSTTTRQRRSRQNPQSPPQDSSVTSKRNIKKGAVPRSIPNLAEVKKKGKMKKLG
QAMEEDLIVGLQGMDLNLEAEALAGTGLVLDEQLNEFHCLWDDSFPEGPERLHAIKEQLI QEGLLDRCVSFQARFAEKEELMLVHSLEYIDLMETTQYMNEGELRVLADTYDSVYLHPNS YSCACLASGSVLRLVDAVLGAEIRNGMAIIRPPGHHAQHSLMDGYCMFNHVAVAARYAQQ KHRIRRVLIVDWDVHHGQGTQFTFDQDPSVLYFSIHRYEQGRFWPHLKASNWSTTGFGQG QGYTINVPWNQVGMRDADYIAAFLHVLLPVALEFQPQLVLVAAGFDALQGDPKGEMAATP AGFAQLTHLLMGLAGGKLILSLEGGYNLRALAEGVSASLHTLLGDPCPMLESPGAPCRSA QASVSCALEALEPFWEVLVRSTETVERDNMEEDNVEESEEEGPWEPPVLPILTWPVLQSR TGLVYDQNMMNHCNLWDSHHPEVPQRILRIMCRLEELGLAGRCLTLTPRPATEAELLTCH SAEYVGHLRATEKMKTRELHRESSNFDSIYICPSTFACAQLATGAACRLVEAVLSGEVLN GAAVVRPPGHHAEQDAACGFCFFNSVAVAARHAQTISGHALRILIVDWDVHHGNGTQHMF EDDPSVLYVSLHRYDHGTFFPMGDEGASSQIGRAAGTGFTVNVAWNGPRMGDADYLAAWH RLVLPIAYEFNPELVLVSAGFDAARGDPLGGCQVSPEGYAHLTHLLMGLASGRIILILEG GYNLTSISESMAACTRSLLGDPPPLLTLPRPPLSGALASITETIQVHRRYWRSLRVMKVE
DREGPSSSKLVTKKAPQPAKPRLAERMTTREKKVLEAGMGKVTSASFGEESTPGQTNSET AVVALTQDQPSEAATGGATLAQTISEAAIGGAMLGQTTSEEAVGGATPDQTTSEETVGGA ILDQTTSEDAVGGATLGQTTSEEAVGGATLAQTTSEAAMEGATLDQTTSEEAPGGTELIQ TPLASSTDHQTPPTSPVQGTTPQISPSTLIGSLRTLELGSESQGASESQAPGEENLLGEA AGGQDMADSMLMQGSRGLTDQAIFYAVTPLPWCPHLVAVCPIPAAGLDVTQPCGDCGTIQ ENWVCLSCYQVYCGRYINGHMLQHHGNSGHPLVLSYIDLSAWCYYCQAYVHHQALLDVKN IAHQNKFGEDMPHPH SEQ ID NO: 11
[231] A schematic representation of the secondary structure of wild-type HDAC6 polypeptide (SEQ ID NO:11) predicted using nnPredict analysis is shown below in TABLE 15. The position of the mutated amino acid residues are identified with an asterisk.
. TABLE 15 HHHHHH HHHH-
HHHHHHHHH HHHHHHHHH EHHHHHHHHHH HHHHHHHHH
HH HHHHHHH-HHHHHHHHHHH HE-H-H H-HHEEEH EEE
— EEE HHHHHHHHH-HHHHH — EEE HHHHHHHHHHHHHHHH EEE HHHHHHHHHHH—H-HH HHHEEHH
—HHHHHHHHHHHH EEEEH HHHHHHHHHHHHHHE HHHHHHHH-HHHHHEE E HHHHHHHHHHHHH-HH HHHHHH-
—HHE HHHHHHHHHHHH EEE HHHHHHHHHHHHHHHHHH—HHH- E H—HEE—HHHHHHHHHHHHH-HHEEEEEE HEEEEE HHEHE EEEE HHHHHH j-l£^_—.______——■""■XiEJEE~*ii.——————————__—_—: — ""HHHHHHHHH *~EEEEE— EEE—HHHHHHH H-HHHHHHHHHHHHHHH-H HEH HHHHHHHHHHHHHHH-H—EE H
HEEE H HHHHHHHHHHHH*—EE HHE EE fcjlit"———__—fj———————————HHHH"~"~EE*"HHHHHHHHHH""«*•————•——'•"•-*"■"—""-"""""-EEE EEE HEE * HHHHHH HHHHHHH HEEE
—EE-E EEE HEEE EEEEE-E HHHH-HHHHHHHHHHHHH t JT-iJXLlJ__-_—_-_.__.— --—- ___—___
[232] The amino acid sequence of HDAC6 mutant polypeptide G930R (SEQ ID NO:28) is shown below in TABLE 16. The position of the mutated amino acid residue is highlighted in bold underlined text.
TABLE 16
MTSTGQDSTTTRQRRSRQNPQSPPQDSSVTSKRNIKKGAVPRSIPNLAEVKKKGKMKKLG QAMEEDLIVGLQGMDLNLEAEALAGTGLVLDEQLNEFHCLWDDSFPEGPERLHAIKEQLI QEGLLDRCVSFQARFAEKEELMLVHSLEYIDLMETTQYMNEGELRVLADTYDSVYLHPNS YSCACLASGSVLRLVDAVLGAEIRNGMAIIRPPGHHAQHSLMDGYCMFNHVAVAARYAQQ KHRIRRVLIVDWDVHHGQGTQFTFDQDPSVLYFSIHRYEQGRFWPHLKASNWSTTGFGQG QGYTINVPWNQVGMRDADYIAAFLHVLLPVALEFQPQLVLVAAGFDALQGDPKGEMAATP AGFAQLTHLLMGLAGGKLILSLEGGYNLRALAEGVSASLHTLLGDPCPMLESPGAPCRSA QASVSCALEALEPFWEVLVRSTETVERDNMEEDNVEESEEEGPWEPPVLPILTWPVLQSR TGLVYDQNMMNHCNLWDSHHPEVPQRILRIMCRLEELGLAGRCLTLTPRPATEAELLTCH SAEYVGHLRATEKMKTRELHRESSNFDSIYICPSTFACAQLATGAACRLVEAVLSGEVLN GAAVVRPPGHHAEQDAACGFCFFNSVAVAARHAQTISGHALRILIVDWDVHHGNGTQHMF EDDPSVLYVSLHRYDHGTFFPMGDEGASSQIGRAAGTGFTVNVAWNGPRMGDADYLAAWH RLVLPIAYEFNPELVLVSAGFDAARGDPLGGCQVSPEGYAHLTHLLMGLASGRIILILEG GYNLTSISESMAACTRSLLGDPPPLLTLPRPPLSGALASITETIQVHRRYWRSLRVMKVE DREGPSSSKLVTKKAPQPAKPRLAERMTTREKKVLEAGMGKVTSASFGEESTPGQTNSET AVVALTQDQPSEAATGGATLAQTISEAAIRGAMLGQTTSEEAVGGATPDQTTSEETVGGA ILDQTTSEDAVGGATLGQTTSEEAVGGATLAQTTSEAAMEGATLDQTTSEEAPGGTELIQ TPLASSTDHQTPPTSPVQGTTPQISPSTLIGSLRTLELGSESQGASESQAPGEENLLGEA AGGQDMADSMLMQGSRGLTDQAIFYAVTPLPWCPHLVAVCPIPAΆGLDVTQPCGDCGTIQ ENWVCLSCYQVYCGRYINGHMLQHHGNSGHPLVLSYIDLSAWCYYCQAYVHHQALLDVKN IAHQNKFGEDMPHPH SEQ ID NO: 28
[233] A schematic representation of the secondary structure of HDAC6 mutant polypeptide G930R (SEQ ID NO:28) predicted using nnPredict analysis is shown below in TABLE 17. The position of the mutated amino acid residue is identified by bold underlined text.
TABLE 17
— —_—__—_ —.— —— iinnnrin nnnn
HHHHHHHHH HHHHHHHHH EHHHHHHHHHH HHHHHHHHH
HH HHHHHHH-HHHHHHHHHHH HE-H-H H-HHEEEH EEE
—EEE HHHHHHHHH-HHHHH—EEE HHHHHHHHHHHHHHHH
—H-EEEEEE EEE EEEEE EEE HHHHHHHHHHH--H-HH HHHEEHH
—HHHHHHHHHHHH EEEEH HHHHHHHHHHHHHHE HHHHHHHH-HHHHHEE E HHHHHHHHHHHHH-HH HHHHHH-
—HHE HHHHHHHHHHHH EEE HHHHHHHHHHHHHHHHHH—HHH- E H—HEE—HHHHHHHHHHHHH-HHEEEEEE HEEEEE HHEHE EEEE HHHHHH
HH HEEE-H HHHHHHHHH EEEEE-- EEE—HHHHHHH H-HHHHHHHHHHHHHHH-H HEH HHHHHHHHHHHHHHH-H—EE H
HEEE H HHHHHHHHHHHHJSHHE HHE EE '
EE H HHHH EE-HHHHHHHHHH EEE
——————————— _—_—__—__.-___— EEE HEE""""""-""*""""'""""""""""""HHHHHH HHHHHHH HEEE
—EE-E EEE HEEE EEEEE-E HHHH-HHHHHHHHHHHHH
HH
[234] The amino acid sequence of HDAC6 mutant polypeptide G1064A (SEQ ID NO:29) is shown below in TABLE 18. The position of the mutated amino acid residue is highlighted in bold underlined text.
TABLE 18
MTSTGQDSTTTRQRRSRQNPQSPPQDSSVTSKRNIKKGAVPRSIPNLAEVKKKGKMKKLG QAMEEDLIVGLQGMDLNLEAEALAGTGLVLDEQLNEFHCLWDDSFPEGPERLHAIKEQLI QEGLLDRCVSFQARFAEKEELMLVHSLEYIDLMETTQYMNEGELRVLADTYDSVYLHPNS YSCACLASGSVLRLVDAVLGAEIRNGMAIIRPPGHHAQHSLMDGYCMFNHVAVAARYAQQ KHRIRRVLIVDWDVHHGQGTQFTFDQDPSVLYFSIHRYEQGRFWPHLKAΞNWSTTGFGQG QGYTINVPWNQVGMRDADYIAAFLHVLLPVALEFQPQLVLVAAGFDALQGDPKGEMAATP AGFAQLTHLLMGLAGGKLILSLEGGYNLRALAEGVSASLHTLLGDPCPMLESPGAPCRSA QASVSCALEALEPFWEVLVRSTETVERDNMEEDNVEESEEEGPWEPPVLPILTWPVLQSR TGLVYDQNMMNHCNLWDSHHPEVPQRILRIMCRLEELGLAGRCLTLTPRPATEAELLTCH SAEYVGHLRATEKMKTRELHRESSNFDSIYICPSTFACAQLATGAACRLVEAVLSGEVLN GAAWRPPGHHAEQDAACGFCFFNSVAVAARHAQTISGHALRILIVDWDVHHGNGTQHMF EDDPSVLYVΞLHRYDHGTFFPMGDEGASSQIGRAAGTGFTVNVAWNGPRMGDADYLAAWH RLVLPIAYEFNPELVLVSAGFDAARGDPLGGCQVSPEGYAHLTHLLMGLASGRIILILEG GYNLTSISESMAACTRSLLGDPPPLLTLPRPPLSGALASITETIQVHRRYWRSLRVMKVE
DREGPSSSKLVTKKAPQPAKPRLAERMTTREKKVLEAGMGKVTSASFGEESTPGQTNSET AVVALTQDQPSEAATGGATLAQTISEAAIGGAMLGQTTSEEAVGGATPDQTTSEETVGGA ILDQTTSEDAVGGATLGQTTSEEAVGGATLAQTTSEAAMEGATLDQTTSEEAPGGTELIQ TPLASSTDHQTPPTSPVQGTTPQISPSTLIGSLRTLELGSESQAASESQAPGEENLLGEA
AGGQDMADSMLMQGSRGLTDQAIFYAVTPLPWCPHLVAVCPIPAAGLDVTQPCGDCGTIQ ENWVCLSCYQVYCGRYINGHMLQHHGNSGHPLVLSYIDLSAWCYYCQAYVHHQALLDVKN IAHQNKFGEDMPHPH SEQ ID NO: 29
[235] A schematic representation of the secondary structure of HDAC6 mutant polypeptide G1064A (SEQ ID NO:29) predicted using nnPredict analysis is shown below in TABLE 19. The position of the mutated amino acid residue is identified with an asterisk.
TABLE 19
_,__.«._„______________________ ————„——_—_—___—_—_— _—_—--._.-——nπunπHnuπtiπH-.-._nuπcmπnu—
HHHHHHHHH HHHHHHHHH EHHHHHHHHHH HHHHHHHHH
HH HHHHHHH-HHHHHHHHHHH HE-H-H H-HHEEEH EEE
—EEE HHHHHHHHH-HHHHH—EEE HHHHHHHHHHHHHHHH EEE HHHHHHHHHHH—H-HH HHHEEHH
—HHHHHHHHHHHH EEEEH HHHHHHHHHHHHHHE HHHHHHHH-HHHHHEE E HHHHHHHHHHHHH-HH HHHHHH-
—HHE HHHHHHHHHHHH EEE HHHHHHHHHHHHHHHHHH—HHH- E H—HEE—HHHHHHHHHHHHH-HHEEEEEE HEEEEE HHEHE EEEE HHHHHH
HH HEEE-H HHHHHHHHH EEEEE— EEE—HHHHHHH H-HHHHHHHHHHHHHHH-H HEH HHHHHHHHHHHHHHH-H—EE H
HEEE H HHHHHHHHHHHH EE HHE EE
EE H HHHH EE-HHHHHHHHHH EEE
HiHiXl) XlHiHt *" XlXiAAXlXl HHHHHHH HEEE
—EE-E EEE HEEE EEEEE-E HHHH-HHHHHHHHHHHHH
HH
[236] The influence of HDAC6 mutations on HDAC6 protein secondary structure is summarized below in TABLE 20.
TABLE 20 Influence of HDAC6 Mutations on HDAC6 Protein Secondary Structure
Mutation Predicted Protein Secondary Structure Change G930R Extend upstream helix by three amino acids G 1064 A No change
[237] General Overview Analysis. A summary of the results of computational analysis of the effect of the HDAC6 mutation identified in the present invention on select features of wild-type HDAC6 is provided below in TABLE 21. Two mutations of HDAC6 were identified in AML patients. Both positions, G930 and G 1064, were close to putative N- myristoylation sites. G 1064 was located between two putative serine phosphorylation sites S 1062 and S 1066. G930R introduced positive charge and may cause secondary structure change by extending a upstream helix by three amino acids.
TABLE 21 Evaluation of the HDAC6 Mutation by Sequence Features
Mutation Protein Phospho- SUMO Other AA AA Secondary domain rylation modification modification conservation property Structure change
G930R + G1064A +
+: the effect of mutation on protein function is low -H-: the effect of mutation on protein function is medium +++: the effect of mutation on protein function is high '
EXAMPLE 2
ANALYSIS OF HDAC6 MUTATION FOR THERANOSTIC CANCER TREATMENT IN
A SUBJECT
[238] In this invention, an agent that modulates HDAC6 biological activity (i.e., HDAC6 modulating agent, e.g., HDAC6 antagonist) is administered to a patient with cancer, e.g.,
AML, when the patient has a SNP pattern indicative of an HDAC6 mutation that correlates with the disease. In one embodiment, the SNP is selected from the group consisting of the
HDAC6 mutations summarized in TABLE 1 and TABLE 2.
EQUIVALENTS
[239] The present invention is not to be limited in terms of the particular embodiments described in this application, which are intended as single illustrations of individual aspects of the invention. Many modifications and variations of this invention can be made without departing from its spirit and scope, as will be apparent to those skilled in the art. Functionally equivalent methods and apparatuses within the scope of the invention, in addition to those enumerated herein, will be apparent to those skilled in the art from the foregoing descriptions. Such modifications and variations are intended to fall within the scope of the appended claims. The present invention is to be limited only by the terms of the appended claims, along with the full scope of equivalents to which such claims are entitled.
Claims
We claim:
1. The use of an HDAC6 modulating agent in the manufacture of a medicament for the treatment of a disease associated with a HDAC6 mutation in a selected patient population, wherein the patient population is selected on the basis of the genotype of the patients at a HDAC6 genetic locus indicative of a propensity for having a disease associated with a HDAC6 mutation.
2. The use of an HDAC6 modulating agent according to claim 1 , wherein the HDAC6 modulating agent is LBH589.
3. The use of an HDAC6 modulating agent according to claim 1 or 2, wherein the HDAC6 modulating agent is selected from the group consisting of: sodium butyrate, phenylacetate, phenylbutyrate, valproic acid, tributyrinpivaloyloxymethyl butyrate, pivanex®, trichostatinA (TSA)5 trichostatin C, trapoxins A and B, depudecin, cyclic hydroxamic-acid containing peptide (CHAPs), apicidin, or OSI-2040, suberoylanilide hydroxamic acid (SAHA), oxamflatindepsipeptide, FK228, scriptaid, biarylhydroxamate inhibitor, A-161906, JNJ16241199, PDX 101, MS-275, CI-994.
The use of an HDAC6 modulating agent according to claim 1 or 2, wherein the HDAC6 modulating agent is a compound is of the following structureformula (I):
wherein
Ri is H, halo, or a straight chain Ci-Ce alkyl (especially methyl, ethyl or «-propyl, which methyl, ethyl and n-propyl substituents are unsubstituted or substituted by one or more substituents described below for alkyl substituents);
R2 is selected from H, Ci-C10 alkyl, (e.g. methyl, ethyl or -CH2CH2-OH), C4 - C9 cycloalkyl, C4 — C9 heterocycloalkyl, C4 — C9 heterocycloalkylalkyl, cycloalkylalkyl (e.g., cyclopropylmethyl), aryl, heteroaryl, arylalkyl (e.g. benzyl), heteroarylalkyl (e.g. pyridylmethyl), -(CHa)nC(O)R6, -(CH2)nOC(O)R6, amino acyl, HON-C(O)- CH=C(Ri)-aryl-alkyl- and -(CH2)nR7;
R3 and R4 are the same or different and independently H, Cj -Ce alkyl, acyl or acylamino, or R3 and R» together with the carbon to which they are bound represent C=O, C=S, or C=NRg, or R2 together with the nitrogen to which it is bound and R3 together with the carbon to which it is bound can form a C4 — C9 heterocycloalkyl, a heteroaryl, a polyheteroaryl, a non-aromatic polyheterocycle, or a mixed aryl and non- aryl polyheterocycle ring;
Rs is selected from H, Ci-Cβ alkyl, C4 — C9 cycloalkyl, C4 - C9 heterocycloalkyl, acyl, aryl, heteroaryl, arylalkyl (e.g. benzyl), heteroarylalkyl (e.g. pyridylmethyl), aromatic polycycles, non-aromatic polycycles, mixed aryl and non-aryl polycycles, polyheteroaryl, non-aromatic polyheterocycles, and mixed aryl and non-aryl polyheterocycles; n, n.i, n2 and n3 are the same or different and independently selected from 0 — 6, when ni is 1-6, each carbon atom can be optionally and independently substituted with
X and Y are the same or different and independently selected from H, halo, Ci-C4 alkyl, such as CH3 and CF3, NO2, C(O)Ri, OR9, SR9, CN, and NRi0Rn;
R$ is selected from H, CpC6 alkyl, C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, cycloalkylalkyl (e.g., cyclopropylmethyl), aryl, heteroaryl, arylalkyl (e.g., benzyl, 2- phenylethenyl), heteroarylalkyl (e.g., pyridylmethyl), ORj2, and NRnR]4;
R7 is selected from ORj5, SRi5, S(O)Ri6, SO2Ri7, NRi3Ri4, and NRi2SO2R6;
R8 is selected from H, ORi5, NRi3Ri4, Ci-C6 alkyl, C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, aryl, heteroaryl, arylalkyl (e.g., benzyl), and heteroarylalkyl (e.g., pyridylmethyl);
R9 is selected from Ci - C4 alkyl, for example, CH3 and CF3, C(O)-alkyl, for example C(O)CH3, and C(O)CF3;
R10 and Ri 1 are the same or different and independently selected from H, C1-C4 alkyl, and -C(O)-alkyl;
R12 is selected from H9 C1-C6 alkyl, C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, C4 - C9 heterocycloalkylalkyl, aryl, mixed aryl and non-aryl polycycle, heteroaryl, arylalkyl (e.g., benzyl), and heteroarylalkyl (e.g., pyridylmethyl);
Ri3 and R14 are the same or different and independently selected from H, Ci-C6 alkyl, C4 — C9 cycloalkyl, C4 — C9 heterocycloalkyl, aryl, heteroaryl, arylalkyl (e.g., benzyl), heteroarylalkyl (e.g., pyridylmethyl), amino acyl, or R13 and R14 together with the nitrogen to which they are bound are C4 - C9 heterocycloalkyl, heteroaryl, polyheteroaryl, non-aromatic polyheterocycle or mixed aryl and non-aryl polyheterocycle;
Ri5 is selected from H, Ci-C6 alkyl, C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl and (CH2)InZRi2;
R16 is selected from Ci-C6 alkyl, C4 — C9 cycloalkyl, C4 — C9 heterocycloalkyl, aryl, heteroaryl, polyheteroaryl, arylalkyl, heteroarylalkyl and (CH2)mZRi2;
Rn is selected from C]-C6 alkyl, C4 - C9 cycloalkyl, C4 — C9 heterocycloalkyl, aryl, aromatic polycycles, heteroaryl, arylalkyl, heteroarylalkyl, polyheteroaryl and NR13Ri4; m is an integer selected from 0 to 6; and
Z is selected from O, NR13, S and S(O).
5. The use of an HDAC6 modulating agent according to claim 4, wherein the HDAC6 modulating agent is a compound selected from the group consisting of formula II
6. The use of an HDAC6 modulating agent according to any one of claims 1 to 5, wherein the disease associated with a HDAC6 mutation is a cancer selected from the group consisting of: acute myeloid leukaemia (AML), a chronic myeloid leukemia (CML), hyperplasia, glioblastoma, melanoma, cholangioma, breast cancer, genitourinary cancer, lung cancer, non-small-cell lung cancer (NSCLC)5 gastrointestinal cancer, epidermoid cancer, melanoma, ovarian cancer, pancreas cancer, neuroblastoma, head cancer, neck cancer, bladder cancer, renal cancer, brain cancer, gastric cancer, prostate cancer, colorectal cancer, multiple myeloma and lymphomas.
The use of an HDAC6 modulating agent according to any one of claims 1 to 5, wherein the disease associated with a HDAC6 mutation is a tumour selected from the group consisting of a breast tumour, an epidermoid tumour, a neck tumour, a mouth tumour, a lung tumour, a gastrointestinal tumour, a colorectal tumour, and a genitourinary tumour.
8. An isolated polynucleotide having a sequence encoding an HDAC6 mutation listed in TABLE 1.
9. A vector comprising a polynucleotide of claim 8.
10. An organism containing a polynucleotide of claim 8.
11. The polynucleotide of claim 8, further comprising a polynucleotide sequence encoding an HDAC6 polypeptide having a sequence selected from the group consisting of SEQ ID NO: 28; and SEQ ID NO:29.
12. An isolated polypeptide having a sequence selected from the group consisting of SEQ ID NO:28; and SEQ ID NO:29.
13. The isolated polypeptide of claim 12, further comprising a sequence selected from the group consisting of SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, and SEQ ID NO: 18.
14. The use of a polynucleotide having a sequence encoding an HDAC6 mutation listed in TABLE 1 as a drug target.
15. A method for treating cancer in a subject, comprising the steps of:
(a) obtaining the genotype or haplotype of a subject at a HDAC6 gene locus, wherein the genotype and/or haplotype is indicative of a propensity for having cancer; and
(b) administering an anti-cancer therapy to the subject.
16. The method of claim 15, wherein the anti-cancer therapy is selected from the group consisting of Glivec®, FEMARA®, Sandostatin® LAR® , ZOMETA®, vatalanib, everolimus, gimatecan, patupilone, midostaurin, pasireotide, LBH589, AEE788 and AMNl 07.
17. The method of claim 15, wherein the cancer is selected from the group consisting ofacute myeloid leukaemia (AML), a chronic myeloid leukemia (CML), hyperplasia, glioblastoma, melanoma, cholangioma, breast cancer, genitourinary cancer, lung cancer, non-small-cell lung cancer (NSCLC), gastrointestinal cancer, epidermoid cancer, melanoma, ovarian cancer, pancreas cancer, neuroblastoma, head cancer, neck cancer, bladder cancer, renal cancer, brain cancer, gastric cancer, prostate cancer, colorectal cancer, multiple myeloma and lymphomas.
18. The method of claim 15, wherein the genotype is heterozygous with at least one of the alleles containing an HDAC6 polymorphism or mutation of TABLE 1.
19. The method of claim 19, wherein the genotype is homozygous with at least one of the alleles containing an HDAC6 mutation or polymorphism of TABLE 1.
20. The method of claim 15, wherein the anti-cancer therapy is the administration of a therapeutically effective amount of an HDAC6 modulating agent.
21. A method for diagnosing a propensity for having cancer in a subject, comprising the steps of:
(a) obtaining the genotype or haplotype of a subject at a HD AC6 gene locus, wherein the genotype and/or haplotype is indicative of a propensity of an anticancer drug to respond to the drug; and
(b) identifying the subject as having a propensity for having cancer.
22. A method for choosing subjects for inclusion in a clinical trial for determining efficacy of an HDAC6 modulating agent, comprising the steps of:
(a) interrogating the genotype and/or haplotype of a subject at an HDAC6 gene locus;
(b) then:
(i) including the subject in the trial if the genotype is indicative of a propensity to cancer by the subject;
(ii) excluding the subject from the trial if the genotype is not indicative of a propensity to cancer by the subject; or (iii) both (i) and (ii).
23. The method of claim 222, wherein the cancer is selected from the group consisting of acute myeloid leukaemia (AML)5 a chronic myeloid leukemia (CML), hyperplasia, glioblastoma, melanoma, cholangioma, breast cancer, genitourinary cancer, lung cancer, non- small-cell lung cancer (NSCLC), gastrointestinal cancer, epidermoid cancer, melanoma, ovarian cancer, pancreas cancer, neuroblastoma, head cancer, neck cancer, bladder cancer, renal cancer, brain cancer, gastric cancer, prostate cancer, colorectal cancer, multiple myeloma and lymphomas.
24. A kit for use in determining a treatment strategy for cancer, comprising:
(a) a reagent for detecting a polynucleotide encoding an HDAC6 mutation and/or polymorphism of TABLE 1;
(b) a container for the reagent; and
(c) a written product on, or in, the container describing the use of the polynucleotide in determining a treatment strategy for the cancer.
25. The kit of claim 24, wherein the cancer is selected from the group consisting of acute myeloid leukaemia (AML), a chronic myeloid leukemia (CML), hyperplasia, glioblastoma, melanoma, cholangioma, breast cancer, genitourinary cancer, lung cancer, non-small-cell lung cancer (NSCLC), gastrointestinal cancer, epidermoid cancer, melanoma, ovarian cancer, pancreas cancer, neuroblastoma, head cancer, neck cancer, bladder cancer, renal cancer, brain cancer, gastric cancer, prostate cancer, colorectal cancer, multiple myeloma and lymphomas.
26. The kit of claim 24, wherein the reagent for detecting the polynucleotide encoding an HDAC6 mutation of TABLE 1 is a set of primer pairs that hybridize to a polynucleotide on either side of the HDAC6 mutations and polymorphisms of TABLE 1.
27. A method of inducing growth arrest in a cell having a HDAC6 mutation, comprising: contacting the cell with a HDAC6 modulating agent in an amount effective to induce growth arrest, and monitoring the growth arrest of the cell.
28. The method of claim 27, wherein the cell is associated with a cancer selected from the group consisting of acute myeloid leukaemia (AML), a chronic myeloid leukemia (CML), hyperplasia, glioblastoma, melanoma, cholangioma, breast cancer, genitourinary cancer, lung cancer, non-small-cell lung cancer (NSCLC), gastrointestinal cancer, epidermoid cancer, melanoma, ovarian cancer, pancreas cancer, neuroblastoma, head cancer, neck cancer, bladder cancer, renal cancer, brain cancer, gastric cancer, prostate cancer, colorectal cancer, multiple myeloma and lymphomas.
29. The method of claim 27, wherein the cell is associated with a tumour selected from the group consisting of selected from the group consisting of a breast tumour, an epidermoid tumour, a neck tumour, a mouth tumour, a lung tumour, a gastrointestinal tumour, a colorectal tumour, and a genitourinary tumour.
30. The method of claim 27, wherein the HDAC6 modulating agent is LBH589.
31. The method of claim 27, wherein the HDAC6 modulating agent is selected from the group consisting sodium butyrate, phenylacetate, phenylbutyrate, valproic acid, tributyrinpivaloyloxymethyl butyrate, pivanex®, trichostatinA (TSA), trichostatin C, trapoxins A and B, depudecin, cyclic hydroxamic-acid containing peptide (CHAPs), apicidin, OSI-2040, suberoylanilide hydroxamic acid (SAHA), oxamflatindepsipeptide, FK228, scriptaid, biarylhydroxamate inhibitor, A-161906, JNJ16241199, PDX 101, MS-275, CI-994.
32. The method of claim 27, wherein the HDAC6 modulating agent is a compound having formula (I):
Ri is H, halo, or a straight chain Ci-C6 alkyl (especially methyl, ethyl or n-propyl, which methyl, ethyl and n-propyl substituents are unsubstituted or substituted by one or more substituents described below for alkyl substituents);
R2 is selected from H, Ci-Ci0 alkyl, (e.g. methyl, ethyl or -CH2CH2-OH), C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, C4 - C9 heterocycloalkylalkyl, cycloalkylalkyl (e.g., cyclopropylmethyl), aryl, heteroaryl, arylalkyl (e.g. benzyl), heteroarylalkyl (e.g. pyridylmethyl), -(CH2)nC(O)R6, -(CHz)nOC(O)R6, amino acyl, HON-C(O)- CH=C(Ri)-aryl-alkyl- and -(CHz)nR7;
R3 and R4 are the same or different and independently H, Ci-Ce alkyl, acyl or acylamino, or R3 and R4 together with the carbon to which they are bound represent C=O, C=S, or C=NR8, or R2 together with the nitrogen to which it is bound and R3 together with the carbon to which it is bound can form a C4 - C9 heterocycloalkyl, a heteroaryl, a polyheteroaryl, a non-aromatic polyheterocycle, or a mixed aryl and non- aryl polyheterocycle ring;
R5 is selected from H, Ci-C6 alkyl, C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, acyl, aryl, heteroaryl, arylalkyl (e.g. benzyl), heteroarylalkyl (e.g. pyridylmethyl), aromatic polycycles, non-aromatic polycycles, mixed aryl and non-aryl polycycles, polyheteroaryl, non-aromatic polyheterocycles, and mixed aryl and non-aryl polyheterocycles; n, nj. n2 and n3 are the same or different and independently selected from 0 — 6, when nj is 1-6, each carbon atom can be optionally and independently substituted with R3 and/or R4;
X and Y are the same or different and independently selected from H, halo, C1-C4 alkyl, such as CH3 and CF3, NO2, C(O)Ri, OR9, SR9, CN, and NR10Rn;
Re is selected from H, C]-C6 alkyl, C4 - C9 cycloalkyl, C4 — C9 heterocycloalkyl, cycloalkylalkyl (e.g., cyclopropylmethyl), aryl, heteroaryl, arylalkyl (e.g., benzyl, 2- phenylethenyl), heteroarylalkyl (e.g., pyridylmethyl), ORi 2, and NRi3Ri4;
R7 is selected from OR15, SRi5, S(O)Ri6, SO2Ri7, NR13Ri4, and NRi2SO2Re;
R8 is selected from H, ORi5, NRi3Ri4, Cj-C6 alkyl, C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, aryl, heteroaryl, arylalkyl (e.g., benzyl), and heteroarylalkyl (e.g., pyridylmethyl);
R9 is selected from Ci - C4 alkyl, for example, CH3 and CF3, C(O)-alkyl, for example C(O)CH3, and C(O)CF3;
Rio and Rj \ are the same or different and independently selected from H, Cj-C4 alkyl, and -C(O)-alkyl;
Ri2 is selected from H, Ci-C6 alkyl, C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, C4 — C9 heterocycloalkylalkyl, aryl, mixed aryl and non-aryl polycycle, heteroaryl, arylalkyl (e.g., benzyl), and heteroarylalkyl (e.g., pyridylmethyl);
R13 and Ri4 are the same or different and independently selected from H, Ci-C6 alkyl, C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, aryl, heteroaryl. arylalkyl (e.g., benzyl), heteroarylalkyl (e.g., pyridylmethyl), amino acyl, or Rj3 and Rj4 together with the nitrogen to which they are bound are C4 - C9 heterocycloalkyl, heteroaryl, polyheteroaryl, non-aromatic polyheterocycle or mixed aryl and non-aryl polyheterocycle;
R15 is selected from H, Ci-C6 alkyl, C4 - C9 cycloalkyl, C4 - C9 heterocycloalkyl, aryl, heteroaryl, arylalkyl, heteroarylalkyl and (CH2)mZRi2;
R16 is selected from Ci-C6 alkyl, C4 - C9 cycloalkyl, C4 — C9 heterocycloalkyl, aryl, heteroaryl, polyheteroaryl, arylalkyl, heteroarylalkyl and (CH2)mZR|2;
Ri7 is selected from Cj-C6 alkyl, C4 — C9 cycloalkyl, C4 — C9 heterocycloalkyl, aryl, aromatic polycycles, heteroaryl, arylalkyl, heteroarylalkyl, polyheteroaryl and
NR13Ri4; m is an integer selected from 0 to 6; and Z is selected from O, NRi3, S and S(O).
33. The method of claim 32, wherein the HDAC6 modulating agent is a compound selected from the group consisting of formula II
34. The method of claim 27, wherein the modulating agent interacts directly with the mutant HDAC6 polypeptide.
35. The method of claim 27, wherein the modulating agent interacts indirectly with the mutant HDAC6 polypeptide by interacting with a step involving the HDAC6 mutant polypeptide.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US73645505P | 2005-11-14 | 2005-11-14 | |
US60/736,455 | 2005-11-14 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2007058992A2 true WO2007058992A2 (en) | 2007-05-24 |
WO2007058992A3 WO2007058992A3 (en) | 2007-07-12 |
Family
ID=37947953
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2006/043899 WO2007058992A2 (en) | 2005-11-14 | 2006-11-13 | Mutations and polymorphisms of hdac6 |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2007058992A2 (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010045224A3 (en) * | 2008-10-14 | 2010-08-19 | Oregon Health & Science University | Methods for the selection of therapeutic treatments for cancer |
US20120190874A1 (en) * | 2009-07-16 | 2012-07-26 | Royal College Of Surgeons In Ireland | Metal complexes having dual histone deacetylase inhibitory and dna-binding activity |
WO2013041407A1 (en) * | 2011-09-19 | 2013-03-28 | Cellzome Ag | Hydroxamic acids as hdac6 inhibitors |
US20220168300A1 (en) * | 2019-02-12 | 2022-06-02 | Emory University | Methods of Managing Vascular Conditions and Diabetic Peripheral Neuropathies |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
PE20020354A1 (en) * | 2000-09-01 | 2002-06-12 | Novartis Ag | HYDROXAMATE COMPOUNDS AS HISTONE-DESACETILASE (HDA) INHIBITORS |
-
2006
- 2006-11-13 WO PCT/US2006/043899 patent/WO2007058992A2/en active Application Filing
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010045224A3 (en) * | 2008-10-14 | 2010-08-19 | Oregon Health & Science University | Methods for the selection of therapeutic treatments for cancer |
US20120190874A1 (en) * | 2009-07-16 | 2012-07-26 | Royal College Of Surgeons In Ireland | Metal complexes having dual histone deacetylase inhibitory and dna-binding activity |
US8901337B2 (en) * | 2009-07-16 | 2014-12-02 | Royal College Of Surgeons In Ireland | Metal complexes having dual histone deacetylase inhibitory and DNA-binding activity |
WO2013041407A1 (en) * | 2011-09-19 | 2013-03-28 | Cellzome Ag | Hydroxamic acids as hdac6 inhibitors |
US20220168300A1 (en) * | 2019-02-12 | 2022-06-02 | Emory University | Methods of Managing Vascular Conditions and Diabetic Peripheral Neuropathies |
Also Published As
Publication number | Publication date |
---|---|
WO2007058992A3 (en) | 2007-07-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2007016532A2 (en) | Mutations and polymorphisms of hdac4 | |
Wasko et al. | Tumour-selective activity of RAS-GTP inhibition in pancreatic cancer | |
WO2006130527A2 (en) | Mutations and polymorphisms of fibroblast growth factor receptor 1 | |
US20100035251A1 (en) | BioMarkers for the Progression of Alzheimer's Disease | |
WO2007095038A2 (en) | Mutations and polymorphisms of erbb2 | |
WO2006104812A2 (en) | Biomarkers for pharmacogenetic diagnosis of type 2 diabetes | |
JP2014534178A (en) | Cancer treatment methods | |
JP2010529418A (en) | Method for detecting binding between MDM2 and proteasome | |
US20100249107A1 (en) | Biomarkers for Alzheimer's Disease Progression | |
WO2007022041A2 (en) | Mutations and polymorphisms of hdac3 | |
WO2006110478A2 (en) | Mutations and polymorphisms of epidermal growth factor receptor | |
WO2007058992A2 (en) | Mutations and polymorphisms of hdac6 | |
WO2007030455A2 (en) | Mutations and polymorphisms of hdac10 | |
WO2006060429A2 (en) | Identification of variants in histone deacetylase 1 (hdac1) to predict drug response | |
WO2007030454A2 (en) | Mutations and polymorphisms of hdac9 | |
WO2007038073A2 (en) | Mutations and polymorphisms of hdac11 | |
US20100240760A1 (en) | Biomarkers for Efficacy of Aliskiren as a Hypertensive Agent | |
EP1835909A2 (en) | Biomarkers for identifying efficacy of tegaserod in patients with chronic constipation | |
WO2007053502A2 (en) | Mutations and polymorphisms of hdac5 | |
WO2007047998A2 (en) | Mutations and polymorphisms of hdac2 | |
WO2007002217A2 (en) | Mutations and polymorphisms of bcl-2 | |
WO2007109183A2 (en) | Mutations and polymorphisms of fms-related tyrosine kinase 1 | |
WO2007095032A2 (en) | Mutations and polymorphisms of ptk2b | |
WO2007109515A2 (en) | Mutations and polymorphisms of knockdown resistance polypeptide | |
WO2007058991A2 (en) | Mutations and polymorphisms of c-abl |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 06837391 Country of ref document: EP Kind code of ref document: A2 |