Clozapine-induced agranulocytosis is associated with rare HLA-DQB1 and HLA-B alleles

Goldstein, Jacqueline I.; Fredrik Jarskog, L.; Hilliard, Chris; Alfirevic, Ana; Duncan, Laramie; Fourches, Denis; Huang, Hailiang; Lek, Monkol; Neale, Benjamin M.; Ripke, Stephan; Shianna, Kevin; Szatkiewicz, Jin P.; Tropsha, Alexander; van den Oord, Edwin JCG; Cascorbi, Ingolf; Dettling, Michael; Gazit, Ephraim; Goff, Donald C.; Holden, Arthur L.; Kelly, Deanna L.; Malhotra, Anil K.; Nielsen, Jimmi; Pirmohamed, Munir; Rujescu, Dan; Werge, Thomas; Levy, Deborah L.; Josiassen, Richard C.; Kennedy, James L.; Lieberman, Jeffrey A.; Daly, Mark J.; Sullivan, Patrick F.

doi:10.1038/ncomms5757

Article
Published: 04 September 2014

Nature Communications volume 5, Article number: 4757 (2014)

Abstract

Clozapine is a particularly effective antipsychotic medication but its use is curtailed by the risk of clozapine-induced agranulocytosis/granulocytopenia (CIAG), a severe adverse drug reaction occurring in up to 1% of treated individuals. Identifying genetic risk factors for CIAG could enable safer and more widespread use of clozapine. Here we perform the largest and most comprehensive genetic study of CIAG to date by interrogating 163 cases using genome-wide genotyping and whole-exome sequencing. We find that two loci in the major histocompatibility complex are independently associated with CIAG: a single amino acid in HLA-DQB1 (126Q) (P=4.7 × 10⁻¹⁴, odds ratio (OR)=0.19, 95% confidence interval (CI)=0.12–0.29) and an amino acid change in the extracellular binding pocket of HLA-B (158T) (P=6.4 × 10⁻¹⁰, OR=3.3, 95% CI=2.3–4.9). These associations dovetail with the roles of these genes in immunogenetic phenotypes and adverse drug responses for other medications, and provide insight into the pathophysiology of CIAG.

Introduction

Clozapine-induced agranulocytosis/granulocytopenia (CIAG) is a rare adverse drug event that occurs in 1% of treated individuals and can be fatal if not detected early¹. As a result, clozapine is underused despite its superior efficacy and particular utility in treatment-resistant schizophrenia². Our goal is to identify genetic risk factors for CIAG that could enable safer and increased use of clozapine.

Because of the specific nature of this adverse drug response and the role of the major histocompatibility complex (MHC) in adverse drug responses, candidate gene studies have focused on human leukocyte antigen (HLA) alleles in CIAG^3,4,5,6,7,8. A small exome sequencing study of CIAG was unrevealing⁹. While nominally significant HLA associations have been reported, no study to our knowledge has reported unequivocal statistical significance or the dissection of the complex linkage disequilibrium (LD) relationships in the MHC.

Using multiple ascertainment schemes, we assemble the largest CIAG cohort to date (163 cases, 54.0% never previously reported, Supplementary Table 1). We conduct a genome survey using whole-exome sequencing and genome-wide genotyping to test for association with CIAG (Fig. 1, Supplementary Table 2). We show that CIAG is associated with two independent amino acid changes in HLA-B (158T) and HLA-DQB1 (126Q). However, the odds ratios (ORs) for these risk factors do not suggest immediate clinical application in predictive testing.

Results

Genome-wide common variant association study

We performed a genome-wide association study of 8,809,853 genotyped and imputed common variants in 161 CIAG cases (two additional cases had exome sequencing or exome array data) and 1,196 controls of European ancestry. After controlling for ancestry (Supplementary Fig. 1), no loci exceeded genome-wide significance (P<5 × 10⁻⁸, Fig. 2). Our sample size was powered to detect common genetic variants with effect sizes of clinical utility (for example, power was >99% for an OR>4 and MAF >10%; Supplementary Fig. 2). The single-nucleotide polymorphism (SNP) with the best evidence for association was in an intron in HLA-B (rs41549217, P=2.07 × 10⁻⁷, OR=4.66, 95% confidence interval (CI)=2.6–8.3, Supplementary Data 1). We specifically examined the distribution of P values of SNPs previously shown to be associated with immune-mediated diseases, but did not detect any enrichment (Supplementary Fig. 3). Evaluation of copy number variation did not reveal any associations with CIAG. No protein encoded by a gene implicated by genome-wide association study (GWAS) is known to bind clozapine¹⁰.

Exome-wide rare variant association study

We tested protein-coding variants (MAF <5%) for association using whole-exome sequencing data from 67 CIAG cases and 376 untreated population controls. For variants that were also genotyped on the exome array, we merged the sequencing data with exome array data from 81 CIAG cases and 3,294 controls of European ancestry. We also used raw count data from ~4,300 samples of European ancestry from the NHLBI Exome Sequencing Project as an additional control reference to increase statistical power¹¹. The top finding is a low frequency missense variant in BTNL2 annotated as benign by PolyPhen-2 (rs28362679, P=4.14 × 10⁻⁷, OR=3.9, 95% CI=1.2–12.9, Supplementary Data 2).

Next, we used a burden test to evaluate whether cases were more likely to be carriers of a functional variant in a gene compared with controls in the exome-sequencing data. We divided rare variants into different functional categories and MAF ranges and found that BTNL2 (P=7.0 × 10⁻⁸) was the only gene to exceed a conservative significance threshold of P=2.5 × 10⁻⁶, corresponding to a Bonferroni correction of 20,000 genes tested¹² (Fig. 3). None of the protein products of these genes is known to bind clozapine¹⁰. The BTNL2 signal is driven by two independent variants: the top SNP noted above (rs28362679) and a conditionally independent missense variant annotated as benign by PolyPhen-2 (rs143211074; P=6.0 × 10⁻⁶, OR=4.4, 95% CI=1.10–17.8). These associations replicated using the exome array and NHLBI Exome Sequencing Project data (Supplementary Fig. 4). BTNL2 is in the MHC region and is in strong LD with HLA-DQB1 and HLA-DRB1, which have been associated with autoimmune diseases and adverse drug reactions including CIAG^7,13. BTNL2 codes for the butyrophilin-like protein 2, a member of the immunoglobulin gene superfamily with a role in regulating T-cell activation¹⁴. Given its location in a region of very high LD, the BTNL2 association could implicate it directly or reflect indirect associations with classical HLA alleles.
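The gene-level threshold quoted above can be checked with a few lines of arithmetic. This is a minimal sketch that assumes a family-wise error rate of 0.05, which the text does not state explicitly but which reproduces the quoted value; the function name is illustrative only.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


# Assumption: family-wise error rate of 0.05 spread over 20,000 genes.
gene_threshold = bonferroni_threshold(0.05, 20_000)

# Burden-test P value reported in the text for BTNL2.
btnl2_p = 7.0e-8
btnl2_passes = btnl2_p < gene_threshold
```

Under that assumption the threshold evaluates to 2.5 × 10⁻⁶, and the reported BTNL2 P value falls below it.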

Classical HLA allele imputation

We imputed four-digit classical HLA alleles and amino acid sequences in 162 CIAG cases and 4,319 controls of European ancestry from a reference panel of 5,225 individuals using SNP2HLA¹⁵. The cases, mainly of northwestern European and Ashkenazi Jewish ancestry, were well-matched to controls (Supplementary Fig. 5). Two independent loci are associated with CIAG after multiple-testing correction (Fig. 4). The top signal is an amino acid change in HLA-DQB1 (126Q) that is in strong LD with HLA-DQB1*05:02 (P=4.7 × 10⁻¹⁴, OR=0.19, 95% CI=0.12–0.29). After conditioning on HLA-DQB1 126Q, a genome-wide significant signal in HLA-B remains (alanine to threonine at residue 158, P_cond=1.54 × 10⁻⁸, OR=3.11, 95% CI=2.1–4.6). No other statistically significant loci remain after conditioning on both HLA-DQB1 126Q and HLA-B 158T (Supplementary Data 3).

**Figure 4: Manhattan plot of the MHC region from HLA imputation.**

HLA-DQB1 and HLA-B explain the MHC signal better than BTNL2

To determine whether the BTNL2 signal is independent of the HLA-B 158T and HLA-DQB1 126Q signals, we analysed samples with imputed HLA alleles and genotyped BTNL2 variants (146 CIAG cases and 3,364 controls). In conditional analyses, we observed that the most-associated BTNL2 variant (rs28362679) was in high LD with HLA-DQB1 126Q (R²=0.618) and the other BTNL2 variant (rs143211074) had a low correlation with HLA-B 158T (R²=0.042) (Supplementary Fig. 6). To determine whether the association in the region was better explained by the two variants in BTNL2 or the two HLA alleles, we compared the likelihood of a model in which HLA-DQB1 126Q and HLA-B 158T predict CIAG with that of a model containing the two BTNL2 variants. The combination of HLA-DQB1 126Q and HLA-B 158T is 27,000 times more likely to explain the MHC association with CIAG than the two BTNL2 variants. We also examined whether a model including one or both BTNL2 variants in addition to HLA-DQB1 126Q and HLA-B 158T explained the association better than HLA-B and HLA-DQB1 alone. We found that a model including rs143211074 was 13 times more likely to explain the association than the two-component model of HLA-B and HLA-DQB1, but there was no improvement when we added rs28362679. These results indicate that rs143211074 could be independently associated with CIAG, but the more likely explanation is that rs143211074 is tagging an HLA haplotype that is not genome-wide significant on its own (given that rs143211074 and HLA-DRB1*04:02 are highly correlated with R²=0.77).

HLA sequence alignment

We used the IMGT/HLA database¹⁶ (http://www.ebi.ac.uk/ipd/imgt/hla) to determine which classical alleles correspond to HLA-DQB1 126Q and HLA-B 158T and to align these results with previous candidate gene studies of CIAG. Since having a glutamine at position 126 in HLA-DQB1 is protective for CIAG, the most common high-risk allele is HLA-DQB1*05:02 which has a histidine at position 126. This is consistent with prior reports of an association of CIAG with HLA-DQB1 6672G>C (a variant in strong LD with HLA-DQB1*05:02)^4,5. HLA-B*38, HLA-B*39 and HLA-B*67 are the most common alleles with a threonine at position 158 in the protein sequence (consistent with previous reports although none met contemporary significance levels)^3,4,8. Our results cannot differentiate whether the amino acids we identified are causal or whether the overall configuration of these genetically related isoforms confers risk for CIAG.

CIAG severity by HLA-B and HLA-DQB1 carrier status

Finally, we tested for differences in CIAG severity (as indexed by minimum absolute neutrophil count, ANC) according to risk allele status of HLA-DQB1 and HLA-B among 149 cases with these data. Cases who were carriers of HLA-B 158T demonstrated significantly lower ANC than noncarriers (P<0.01, Fig. 5). However, as nearly all cases with substantial Ashkenazi Jewish ancestry carried the HLA-B 158T allele (Supplementary Fig. 7), it is not possible to determine conclusively whether some of this signal could be driven by an unexplored feature of ascertainment or population difference. Hence we repeated the analysis after excluding cases with Ashkenazi Jewish ancestry (PC1>0.02) and subsequently correcting the phenotype for ancestry and sample cohort and observed a consistent effect but of reduced significance (P=0.049) owing to reduced power because of the number of 158T alleles excluded (Supplementary Fig. 8). Similar analyses of time to onset of CIAG were not significant (data not shown).

**Figure 5: Absolute neutrophil count stratified by HLA risk allele carrier status.**

Modelling of clozapine binding to HLA-B

The associations of HLA-DQB1 and HLA-B with CIAG are biologically plausible. HLA-DQB1 is part of the MHC Class II antigen-presentation pathway and has been associated with autoimmune diseases (for example, rheumatoid arthritis)¹⁷ and a range of immunogenetic phenomena (for example, allergic sensitization, human papillomavirus seropositivity and chronic hepatitis B infection)^18,19,20. HLA-B is an important component of the MHC Class I antigen-presentation pathway and a ‘hot spot’ for serious adverse drug reactions including flucloxacillin-induced liver injury²¹, carbamazepine-induced Stevens–Johnson syndrome²², and abacavir hypersensitivity syndrome²³. For several of these adverse drug reactions, the mechanism appears to be off-target binding of a drug molecule to the antigen-binding cleft of the HLA-B protein²³.

Similar to what has been done for abacavir²³, we used a molecular docking approach to determine that clozapine binds with relatively high affinity to the putative HLA-B*39 antigen-presenting peptide groove when there is a threonine at position 158 (Fig. 6). Docking simulations also suggested a potential for clozapine to bind in the pockets of HLA-B*57:01 and HLA-B*35:08 with predicted affinities much higher than that for multiple other HLA-B variants (Supplementary Fig. 9). As a negative control, docking studies showed that clozapine had low binding affinities for the antigen-presenting pockets of HLA-A proteins.

**Figure 6: Predicted binding pose of clozapine in the antigen-presenting pocket of HLA-B*39.**

Discussion

In conclusion, we identified two independent loci in the MHC region that are associated with CIAG: HLA-DQB1 and HLA-B. These findings are consistent with previous results but the evidence is now much stronger in this comprehensive analysis. Moreover, we localized the signal to two amino acids (HLA-DQB1 126Q and HLA-B 158T), but cannot distinguish whether these amino acids are causal or if the overall configuration of the alleles confers risk for CIAG. Although these data greatly clarify the contributions of HLA variation to CIAG, the ORs do not immediately suggest clinical application in screening. If we consider a genetic test that was positive if an individual carried either risk factor, the sensitivity and specificity of such a test would be 0.36 and 0.89. Because more than half of the cases do not carry these two relatively infrequent risk factors, a predictive test based solely on HLA-DQB1 and HLA-B could not define a truly ‘safe’ group for clozapine use. We note that seven CIAG cases (4.3%) carried two risk alleles, suggesting that this combination had particularly high risk compared with the population expectation of 0.3–0.4%. These genetic insights could further help in understanding the biological processes underlying CIAG and, as other non-genetic and genetic risk factors for CIAG are identified, constitute an important component of tests to screen patients for the safer use of clozapine.
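To make the screening argument concrete, the reported sensitivity and specificity can be converted into predictive values with Bayes' rule. The sketch below is illustrative only: it assumes a CIAG prevalence of 1% among treated individuals (the upper bound quoted in the abstract), and the function name is hypothetical.

```python
def predictive_values(sensitivity: float, specificity: float, prevalence: float):
    """Return (PPV, NPV) for a binary test from Bayes' rule."""
    true_pos = sensitivity * prevalence
    false_neg = (1 - sensitivity) * prevalence
    false_pos = (1 - specificity) * (1 - prevalence)
    true_neg = specificity * (1 - prevalence)
    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv


# Sensitivity and specificity are from the text; the 1% prevalence is an assumption.
ppv, npv = predictive_values(sensitivity=0.36, specificity=0.89, prevalence=0.01)
residual_risk = 1 - npv  # risk of CIAG among test-negative individuals
```

Under these assumptions roughly 3% of test-positive individuals would develop CIAG, while test-negative individuals would retain a risk of about 0.7%, which is in line with the statement that such a test could not define a truly safe group.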

Methods

Ethical statement

The study was approved by Institutional Review Boards at the University of North Carolina (http://research.unc.edu/offices/human-research-ethics) and at Massachusetts General Hospital (http://healthcare.partners.org/phsirb). Research was carried out in accordance with the Declaration of Helsinki. Written informed consent was obtained from all the participants (or legal guardians, if applicable).

Sample description

We formed the Clozapine-Induced Agranulocytosis Consortium (CIAC) to bring together nearly all investigators in the area and to ascertain new cases. CIAG (clozapine-induced agranulocytosis/granulocytopenia) cases developed agranulocytosis (ANC<500 mm⁻³) or granulocytopenia (500≤ANC≤1,000 mm⁻³) while taking clozapine and without a known medical condition that increases risk for agranulocytosis (for example, haematological disorder or autoimmune disease). There were 98 cases with agranulocytosis (60.1%), 61 cases with granulocytopenia (37.4%) and four cases (2.5%) included on the basis of clinical judgment due to a precipitous drop in ANC to <1,500 mm⁻³ while being intensively monitored. We allowed any primary psychiatric diagnosis (although most subjects had schizophrenia or schizoaffective disorder) and any reported ancestry. Heterogeneity due to ANC threshold, diagnosis and ancestry was addressed analytically.
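The ANC thresholds in the case definition above can be written as a small classifier. This is a sketch of the threshold logic only; the function name and the label for counts above 1,000 mm⁻³ are illustrative, and the four cases included on clinical judgment are not captured by it.

```python
def classify_min_anc(min_anc_per_mm3: float) -> str:
    """Classify a case by its lowest absolute neutrophil count (cells per mm^3)."""
    if min_anc_per_mm3 < 0:
        raise ValueError("ANC cannot be negative")
    if min_anc_per_mm3 < 500:
        return "agranulocytosis"
    if min_anc_per_mm3 <= 1000:
        return "granulocytopenia"
    # Above 1,000: not a case by threshold; inclusion would need clinical judgment.
    return "above threshold"


labels = [classify_min_anc(anc) for anc in (120, 500, 1000, 1400)]
```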

Subjects were identified in two ways. Opportunistic ascertainment was via a broad survey of colleagues to identify all worldwide samples in existing CIAG collections that allowed rapid and efficient identification of existing CIAG cases and controls exposed to clozapine without CIAG. This included cases from the International Serious Adverse Events Consortium (iSAEC, http://www.saeconsortium.org). Systematic assessment included identification of additional cases from the Novartis and Teva clozapine registries (https://www.clozarilcare.com/care, https://www.clozapineregistry.com), and the electronic pharmaceutical record from the Pennsylvania state hospital system. Briefly, we contacted the physician of record to confirm an episode of CIAG, the physician then obtained permission from the potential subject to release contact information, and we then contacted the subject to attempt recruitment into this study.

We attempted to obtain and review medical records for all subjects. Using a standardized review form, we abstracted demographic data, clinical diagnosis, clozapine dosage, exposure duration and the lowest ANC. Two senior CIAC clinicians evaluated all available data to verify CIAG case status. Most DNA samples were extracted from peripheral venous blood; a subset of DNA samples were from lymphoblastoid cell lines or oral sources. Most samples were subjected to standard quality control procedures (DNA quantification, electrophoresis to assess degradation and genotyping a ‘fingerprint’ of 25 common SNPs using a Sequenom iPLEX panel). Genotype data confirmed that all subjects were independent and of European ancestry (see below).

A combination of clozapine-treated and untreated controls was used to increase statistical power. Clozapine-treated controls had received clozapine for over 1 year with no documented ANC<1,500 mm⁻³ and no medical condition with an increased risk of agranulocytosis. Ascertainment of clozapine-treated controls was performed in the same manner as for the cases. Untreated controls were broadly consented population controls ascertained in unrelated disease and cohort studies. Allele frequency data from the NHLBI Exome Sequencing Project (ESP) were also used.

Generation of genome-wide array data

A total of 384 samples from CIAC (135 cases and 249 treated controls) were genotyped on the Illumina OmniExpress array at Duke University. We supplemented the CIAC data with 839 untreated population controls from the United Kingdom Blood Service cohort who were genotyped on the Illumina OmniExpress platform at the Broad Institute. In addition, 26 case samples from the International Serious Adverse Events Consortium (iSAEC, http://www.saeconsortium.org) were genotyped at Duke University on the Illumina Human610-Quadv1_B platform. These data were merged with 108 untreated population controls from the NIMH repository genotyped on the Illumina 550K array at the University of Michigan. Genotype calls were generated using the GenomeStudio software from Illumina. All samples had a call rate >98% and an inbreeding coefficient between −0.2 and 0.2. SNPs were included if they had a call rate >98%, were in Hardy–Weinberg equilibrium (P_control>1 × 10⁻⁶, P_case>1 × 10⁻¹⁰) and had a call rate difference between cases and controls of <0.02. Samples were confirmed to be unrelated and of European ancestry via multidimensional scaling²⁴. We used SHAPEIT^25,26 and IMPUTE2 (ref. 27) to impute genotypes for all SNPs using 1000 Genomes Project data as a reference panel. The imputation reference set was 2,186 phased haplotypes from the 1000 Genomes Project (August 2012, 30,069,288 variants, release ‘v3.macGT1’, http://mathgen.stats.ox.ac.uk/impute/data_download_1000G_phase1_integrated.html). Imputation was done for each platform separately (OmniExpress, I550K/I610K) to avoid platform-based batch effects in imputation.

Generation of whole-exome sequencing data

We sequenced 41 CIAC cases and 26 iSAEC cases at Duke University. Exome sequence from CIAC cases was captured using Agilent SureSelect 50 MB kits (Santa Clara, CA) and sequenced using Illumina HiSeq 2000 (San Diego, CA) for paired-end 100 bp reads. The iSAEC samples were sequenced several years earlier using Agilent SureSelect kits (Santa Clara, CA) and Illumina GAII for paired-end reads (San Diego, CA). For controls, we used data from 376 phenotypically normal controls who had been exome sequenced at the Broad Institute²⁸. All raw FASTQ data were processed using the same Picard pipeline^29,30 and genotype calls were generated for all data jointly using the GATK Unified Genotyper tool³¹ to avoid batch effects.

Individual genotype calls were filtered by requiring read depth ≥10, genotype quality score ≥30 and meeting specified allelic balance criteria (based on genotype: for 0/0, 0–0.3; for 0/1, 0.3–0.7; and for 1/1, 0.7–1.0). All variants were required to have a GATK PASS rating, to be biallelic, not to have a significant difference in call rate between cohorts (P>0.01, which eliminates biases due to a variant being sequenced in one cohort but not in the other), and not to be an insertion/deletion variant. Samples were required to have a call rate >95%, <600 singleton calls and a Ti:Tv ratio between 3.1 and 3.4. The first 10 principal components (PCs) were computed using PLINK with 14,466 independent variants with an MAF >5% and not located in the MHC region.
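The per-genotype filters described above can be sketched as a single predicate. This is an illustration rather than the pipeline actually used: it assumes that allelic balance means the fraction of reads supporting the alternate allele and that the range boundaries are inclusive, neither of which the text specifies, and all names are hypothetical.

```python
# Allowed alternate-allele read fraction per genotype, as listed in the text.
ALLELIC_BALANCE = {"0/0": (0.0, 0.3), "0/1": (0.3, 0.7), "1/1": (0.7, 1.0)}


def passes_genotype_filters(genotype: str, depth: int,
                            genotype_quality: float, alt_reads: int) -> bool:
    """Apply the depth, genotype-quality and allelic-balance filters to one call."""
    if genotype not in ALLELIC_BALANCE:
        return False  # only biallelic diploid calls are handled here
    if depth < 10 or genotype_quality < 30:
        return False
    low, high = ALLELIC_BALANCE[genotype]
    alt_fraction = alt_reads / depth  # assumption: balance = alt reads / total reads
    return low <= alt_fraction <= high


balanced_het = passes_genotype_filters("0/1", depth=20, genotype_quality=50, alt_reads=10)
skewed_het = passes_genotype_filters("0/1", depth=20, genotype_quality=50, alt_reads=3)
```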

Generation of exome array data

We genotyped deleterious exonic variation and a comprehensive set of MHC tag SNPs using the Illumina HumanExome array (v1.0, http://genome.sph.umich.edu/wiki/Exome_Chip_Design). Initial genotyping was conducted at Duke University and genotype calls were made using GenomeStudio (v2010.3, genotyping module v1.8.4) using the default Illumina cluster file. Samples were removed for missingness >1% (based on common SNPs with MAF >5%) or genome-wide heterozygosity outside the range of 0.1–1.2% (based on rarer SNPs with MAF <5%) using the original GenomeStudio calls. Cases that failed quality control (N=23) were re-genotyped at the Broad Institute on the Illumina HumanExome array (v1.1); genotype calling and quality control were the same as for the Duke data except that a custom cluster file and a missingness threshold >2% were used. We then used zCall to recover rare genotypes³². After removing poorly performing samples, individual SNPs from the Duke data were included if they had a call rate >97% from the original GenomeStudio calls, a HWE P-value >1 × 10⁻⁵ and a call rate >99.5% calculated from the zCalls. For the Broad data, more lenient call rate thresholds were used due to the small number of samples (GenomeStudio calls >75%, zCalls >95%).

The passing exome array data were merged with population controls of European ancestry broadly consented for biomedical studies. These controls included samples from the NIMH repository (n=1,037)³³, POPRES (n=703)²¹, the Prospective Registry in IBD Study (n=254)³⁴, the 1000 Genomes Project (n=449)³⁵ and controls ascertained for an age-related macular degeneration study (n=704)³⁶. Quality control for the shared controls was similar to the steps for the CIAC data, except that a call rate of 99% was used³⁶. The final data set included only SNPs passing quality control in all cohorts. We removed one member of putative relative pairs (>0.2) and used EIGENSOFT³⁷ to calculate the first 10 PCs based on 16,621 SNPs (MAF >5%, excluding the extended MHC region (chr6:25–35 Mb), and following LD pruning). Two hundred and thirteen out of 3,366 controls were ascertained for not having CIAG.

HLA allele imputation

HLA classical alleles and amino acids were imputed using BEAGLE³⁸ for exome array data or IMPUTE2 (ref. 27) for genome array data with a reference set of 5,225 samples from the Type 1 Diabetes Genetics Consortium¹⁵. The imputation accuracy using this reference panel is extremely high. For example, the SNP2HLA authors found that the imputation accuracy was 99.8% for HLA-B*38:01, 99.8% for HLA-B*39:01 and 99.9% for HLA-B*39:06 using the same set of MHC tag SNPs present on the Illumina HumanExome array compared with direct genotyping of these variants¹⁵. We also confirmed the accuracy of imputation of HLA-DQB1*05:02 by SNP2HLA using existing data for 19 samples that had PCR-based sequencing of HLA-DQB1 6672G>C, which is in strong LD with HLA-DQB1*05:02 (ref. 7): 37 out of 38 imputed HLA-DQB1*05:02 calls were concordant with the PCR-based sequencing results.

Genome-wide common variant association study

PCs were calculated using EIGENSOFT with the merged OmniExpress and I550K/I610K imputed data sets. SNPs used to calculate the PCs were independent (LD pruned) and did not include the MHC (chr6:25–35 Mb). We tested the first 20 PCs for phenotype association (using logistic regression) and evaluated their impact on the genome-wide test statistics using λ, a measure of how much the observed distribution of P values deviates from the expected distribution of P values. Five PCs (1–4 and 12) were included in all association analyses. The first four PCs were selected because it is standard practice to include them in analyses to account for ancestry and, of PC5–PC20, PC12 was the only PC associated with case–control status (P<0.05). We performed a mega-analysis between the OmniExpress data and the I550K/I610K imputed dosage data using PLINK with PCs (described above) as covariates for logistic regression. SNPs with an INFO score ≥0.6 (metric of imputation quality for a SNP) and MAF ≥1% in controls were considered passing. We did not observe any batch effects due to multiple platforms or confounding due to population ancestry as indicated by a value of λ=1.047 and no false positive results (Supplementary Fig. 1).
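For readers unfamiliar with λ, one common way to compute it is as the median of the observed 1-degree-of-freedom chi-square statistics divided by the median expected under the null. The sketch below uses that definition as an assumption, since the text does not say how λ was calculated; it converts two-sided P values back to chi-square statistics using only the Python standard library.

```python
from statistics import NormalDist, median

# Median of a chi-square distribution with 1 degree of freedom.
CHI2_1DF_MEDIAN = 0.4549364


def genomic_inflation(p_values):
    """Genomic inflation factor from two-sided P values (each in (0, 1])."""
    normal = NormalDist()
    chi2_stats = []
    for p in p_values:
        if not 0 < p <= 1:
            raise ValueError("P values must lie in (0, 1]")
        z = normal.inv_cdf(p / 2)  # lower-tail quantile; the sign is irrelevant
        chi2_stats.append(z * z)
    return median(chi2_stats) / CHI2_1DF_MEDIAN


# Evenly spaced P values mimic a well-calibrated null distribution.
null_like = [(i + 0.5) / 1000 for i in range(1000)]
lambda_null = genomic_inflation(null_like)
```

A value close to 1, such as the λ=1.047 reported here, indicates little systematic inflation of the test statistics.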

Immune-mediated SNP Enrichment

To determine whether our GWAS results show any indication of signal in loci known to be associated with autoimmune diseases, we randomly drew P values from the GWAS results above to create an expected null distribution. To make this comparison more representative in terms of power per SNP, random draws were only taken from the subset of variants with MAF >0.05 and INFO >0.8 (6,752,294 SNPs). We drew 100 sets of 838 SNPs and compared the distribution of P values to those observed in a list of 838 associations with immune-mediated and autoimmune diseases curated from the NHGRI GWAS catalogue. The comparison demonstrated that CIAG association to known immune-mediated SNPs was not distinguishable from random draws of SNPs.
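The random-draw comparison can be sketched as follows. The text does not say which summary of the P-value distribution was compared, so this example assumes the median P value of each set as the statistic; the function names, the seed and the synthetic background are all illustrative.

```python
import random
from statistics import median


def null_set_medians(background_p, n_sets=100, set_size=838, seed=0):
    """Median P value of each randomly drawn SNP set (sampling without replacement)."""
    rng = random.Random(seed)
    return [median(rng.sample(background_p, set_size)) for _ in range(n_sets)]


def empirical_p(observed_median, null_medians):
    """Share of null sets whose median is at least as small as the observed one."""
    n_extreme = sum(1 for m in null_medians if m <= observed_median)
    return (n_extreme + 1) / (len(null_medians) + 1)


# Synthetic, uniformly spread background P values stand in for the GWAS results.
background = [(i + 0.5) / 10_000 for i in range(10_000)]
null_medians = null_set_medians(background)
```

An observed median that sits well inside the range of the null medians, as the text reports for the immune-mediated SNP list, gives an empirical P value far from significance.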

Exome-wide rare variant association study

Because we had data from different platforms for rare exonic variants (exome sequencing, array and allele counts from the ESP¹¹, available at http://evs.gs.washington.edu/EVS), we used the following procedure to combine allele counts. If a variant was assayed on multiple platforms, we ensured that the MAFs between the controls did not differ (P>0.001). If the MAFs in controls were not compatible, we removed data in the following order: ESP, exome array and exome sequencing. For each variant, we added allele counts for each technology in both cases and controls and used Fisher’s exact test on the combined sample to test for significance.
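The pooling step can be illustrated with a small pure-Python Fisher's exact test applied to summed allele counts. This is a sketch under stated assumptions: the two-sided P value sums the probabilities of all tables no more likely than the observed one (a common convention that the text does not specify), and the allele counts in the example are hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    total_tables = comb(row1 + row2, col1)

    def table_prob(x: int) -> float:
        # Hypergeometric probability of x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / total_tables

    p_observed = table_prob(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    p_value = sum(
        p for p in (table_prob(x) for x in range(lowest, highest + 1))
        if p <= p_observed * (1 + 1e-7)  # small tolerance for float ties
    )
    return min(1.0, p_value)


def pooled_allele_test(case_counts, control_counts) -> float:
    """Sum (minor, major) allele counts across platforms, then test once."""
    case_minor = sum(minor for minor, _ in case_counts)
    case_major = sum(major for _, major in case_counts)
    ctrl_minor = sum(minor for minor, _ in control_counts)
    ctrl_major = sum(major for _, major in control_counts)
    return fisher_exact_two_sided(case_minor, case_major, ctrl_minor, ctrl_major)


# Hypothetical (minor, major) counts for two platforms; not taken from the study.
pooled_p = pooled_allele_test([(3, 131), (4, 158)], [(6, 746), (40, 6548)])
```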

Variants were functionally annotated using the Variant Effect Predictor tool from Ensembl³⁹. We then divided the annotations into three categories. The most restrictive grouping included all nonsense and splice-region variants. The intermediate grouping included all nonsense, splice-region and missense variants annotated as probably damaging by PolyPhen-2 (ref. 40). The least restrictive grouping included all nonsense, splice-region, missense and untranslated region variants. To test for an excess burden of rare and functional variants, we used Fisher’s exact test where the variable of interest is the number of carriers of a functional variant. We used the number of carriers of a functional variant instead of the total minor allele count as the testing variable to account for LD between variants.

MHC association analysis

We used imputed HLA classical alleles from all the samples that were genotyped on the exome array. For the remaining samples that were not genotyped on the exome array, we imputed HLA classical alleles from either OmniExpress or I550K. PCs were calculated from 6,000 independent markers that were present on all three platforms to obtain one consistent set of PCs for all samples using PLINK (Supplementary Fig. 5). We merged the imputed dosage data from all three platforms to do a mega-analysis and used logistic regression to calculate association test statistics for each imputed allele using the first 10 PCs as covariates in PLINK to account for any confounding due to population structure. For conditional analyses, we used the dosage of the conditioning marker as an additional covariate in the analysis.

Model comparison

We used R to compute the log-likelihood of a logistic regression model consisting of either the top two BTNL2 missense variants or HLA-DQB1 126Q and HLA-B 158T dosages with the first 10 PCs as covariates in 144 cases and 3,355 controls. We also explored whether adding one or both BTNL2 variants improved the model consisting of HLA-DQB1 126Q and HLA-B 158T.
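A statement such as "27,000 times more likely" can be read as a ratio of model likelihoods, which on the log scale is a simple difference. The sketch below assumes that reading; the log-likelihood values are hypothetical because the fitted values are not reported, and the function name is illustrative.

```python
from math import exp, log


def relative_likelihood(loglik_a: float, loglik_b: float) -> float:
    """How many times more likely the data are under model A than under model B."""
    return exp(loglik_a - loglik_b)


# Hypothetical log-likelihoods chosen so that the ratio equals 27,000.
loglik_hla = -500.0
loglik_btnl2 = loglik_hla - log(27_000)
ratio = relative_likelihood(loglik_hla, loglik_btnl2)

# Log-likelihood differences implied by the two ratios quoted in the Results.
delta_two_locus = log(27_000)  # HLA pair versus BTNL2 pair
delta_extra_variant = log(13)  # adding rs143211074 to the HLA pair
```

Under this reading, the ratios of 27,000 and 13 correspond to log-likelihood differences of roughly 10.2 and 2.6. Because the HLA and BTNL2 models each contain two variants plus the same covariates, the first comparison involves no difference in the number of parameters.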

LD calculations

We used PLINK to calculate the R² between each pair of variants in both cases and controls who had BTNL2 and HLA imputation data (n=146 cases and 3,364 controls). Haploview⁴¹ was used to plot the LD between the top BTNL2 variants, HLA-B 158T, HLA-DQB1 126Q alleles and the top HLA-DRB1 alleles (*04:02 and *16:01).
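As an illustration of the quantity being reported, R² between two variants can be approximated as the squared Pearson correlation of their per-sample allele dosages. This is a simplified stand-in and not PLINK's implementation; the function name and the toy dosage vectors are hypothetical.

```python
def r_squared(dosages_x, dosages_y) -> float:
    """Squared Pearson correlation between two per-sample dosage vectors."""
    if len(dosages_x) != len(dosages_y) or len(dosages_x) < 2:
        raise ValueError("need two equal-length vectors with at least 2 samples")
    n = len(dosages_x)
    mean_x = sum(dosages_x) / n
    mean_y = sum(dosages_y) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(dosages_x, dosages_y))
    var_x = sum((x - mean_x) ** 2 for x in dosages_x)
    var_y = sum((y - mean_y) ** 2 for y in dosages_y)
    if var_x == 0 or var_y == 0:
        raise ValueError("a monomorphic variant has undefined correlation")
    return cov * cov / (var_x * var_y)


# Toy dosages (0, 1 or 2 copies of an allele) for four samples.
perfect_ld = r_squared([0, 1, 2, 1], [2, 1, 0, 1])  # perfectly anti-correlated
no_ld = r_squared([0, 0, 1, 1], [0, 1, 0, 1])       # uncorrelated
```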

Wilcoxon rank-sum tests

We used a Wilcoxon Rank-Sum test in R to determine whether the ANC distribution between carriers and noncarriers of HLA risk alleles was different in cases who had a lowest recorded ANC ≤1,000 (n=149). To correct for covariates, we used the residuals of the regression between ANC and the PCs and cohort indicator as the quantitative variable for the Wilcoxon rank-sum test (Supplementary Fig. 8). For samples with dosage, treatment duration and minimum ANC available (n=92), we calculated the correlation between these variables. ANC is moderately correlated with treatment duration (R²=0.165) and weakly correlated with clozapine dosage (R²=0.035). Treatment duration is moderately correlated with dosage (R²=0.229). Carrier status was determined using the best guess genotype for the HLA-B 158T and HLA-DQB1 126Q alleles.

Copy number variation analysis

To generate CNV genotypes, we applied PennCNV to the Illumina OmniExpress data. We removed any samples that had a log R ratio standard deviation >0.3 or that were above the 95th percentile for the number of CNV calls leaving 134 cases and 242 controls. CNVs included for analysis were required to have >10 SNP probes, >20 kb in length, <50% overlap with large genome gaps and with MAF <1% in controls. We used PLINK to test for CIAG association using the ‘--cnv-test-2sided’ command with significance determined using 100,000 permutations. The family-wise corrected P values were examined. No rare CNV was significantly associated with CIAG (data not shown).

Molecular docking

We used the Maestro 2012 software suite (Schrödinger, http://www.schrodinger.com) to build a homology model of HLA-B*39 using the protein structures of HLA-B*08 (Protein Data Bank, PDB, http://www.wwpdb.org, code=3SPV, resolution 1.30 Å) and HLA-B*41 (PDB code=3LN4, resolution 1.30 Å) as templates. We also used a multiple sequence alignment encompassing HLA-B*08, HLA-B*41 and HLA-B*38. Clozapine was docked into the antigen-presenting site of the predicted structure of HLA-B*39 using the Glide program in extra-precision docking mode (Glide XP). Ionizable groups were first converted to their most probable charged forms at pH 7.4 using the LIGPREP software within Maestro. The binding region was defined by a 10 × 10 × 10 Å box centred on the HLA-B*39 pocket. A scaling factor of 0.8 was applied to the van der Waals radii. Default settings were used for all the remaining parameters. The docking poses were energy-minimized with Macromodel in the OPLS2005 force field with flexible ligand and rigid receptor. The refined poses were re-ranked on the basis of the calculated interaction energy. The lowest-energy pose was selected and rescored in the active site using GlideScore XP. Clozapine was also docked in the antigen-presenting sites of HLA-B*57:01 (PDB code=3VRI), HLA-B*35:08 (PDB code=3BWA) and the binding pockets of other HLA-A and HLA-B proteins.

Additional information

How to cite this article: Goldstein, J. I. et al. Clozapine-induced agranulocytosis is associated with rare HLA-DQB1 and HLA-B alleles. Nat. Commun. 5:4757 doi: 10.1038/ncomms5757 (2014).

Acknowledgements

We thank Drs Thomas Lehner (NIMH) and David Goldstein (Duke University) for their help on this project. We acknowledge the assistance of Dr Michael Karukin (Teva Pharmaceuticals) and Drs Rafael Muniz and Vinod Kumar (Novartis) for help in accessing US national clozapine registries. We thank Drs Stanton Gerson (Department of Hematology, Case Western Reserve), Armond Goldman (Department of Immunology, University of Texas Galveston), and Nancy Berliner (Department of Hematology, Harvard University) for providing input on mechanisms of agranulocytosis. M.P. is an NIHR Senior Investigator. This project was funded by R01 MH080403 (P.F.S., E.J.C.G.V.D.O.) from the US National Institute of Mental Health via the American Recovery and Reinvestment Act of 2009. D.F. and A.T. acknowledge support from National Science Foundation grant ABI 10-567.

Author information

Mark J. Daly and Patrick F. Sullivan: These authors contributed equally to the work

Authors and Affiliations

Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, 02114, Massachusetts, USA
Jacqueline I. Goldstein, Laramie Duncan, Hailiang Huang, Monkol Lek, Benjamin M. Neale, Stephan Ripke & Mark J. Daly
Medical and Population Genetics Program, Broad Institute of MIT and Harvard, Cambridge, 02142, Massachusetts, USA
Jacqueline I. Goldstein, Laramie Duncan, Benjamin M. Neale & Mark J. Daly
Department of Psychiatry, University of North Carolina, Chapel Hill, 27599, North Carolina, USA
L. Fredrik Jarskog, Chris Hilliard & Patrick F. Sullivan
Department of Molecular and Clinical Pharmacology, University of Liverpool, Liverpool, L69 3BX, UK
Ana Alfirevic & Munir Pirmohamed
Division of Chemical Biology and Medicinal Chemistry, Laboratory for Molecular Modeling, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, 27599, North Carolina, USA
Denis Fourches & Alexander Tropsha
Psychiatric and Neurodevelopmental Genetics Unit, Massachusetts General Hospital, Boston, 02114, Massachusetts, USA
Benjamin M. Neale
Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, 02142, Massachusetts, USA
Benjamin M. Neale & Stephan Ripke
New York Genome Center, New York, 10013, New York, USA
Kevin Shianna
Department of Genetics, University of North Carolina, Chapel Hill, 27599, North Carolina, USA
Jin P. Szatkiewicz & Patrick F. Sullivan
Center for Biomarker Research and Personalized Medicine, Virginia Commonwealth University, Richmond, 23298, Virginia, USA
Edwin JCG van den Oord
Institute of Experimental and Clinical Pharmacology, University Hospital Schleswig-Holstein, Kiel, 24105, Germany
Ingolf Cascorbi
Department of Psychiatry and Psychotherapy, Charité-University Medicine, Berlin, 14050, Germany
Michael Dettling
Tel Aviv University, Ramat Aviv, Tissue Typing Lab, Sheba Medical Center, 6997801, Israel
Ephraim Gazit
Department of Psychiatry, New York University, New York, 10016, New York, USA
Donald C. Goff
International SAE Consortium, Ltd., Chicago, 60631, Illinois, USA
Arthur L. Holden
Maryland Psychiatric Research Center, University of Maryland, Baltimore, 21228, Maryland, USA
Deanna L. Kelly
The Feinstein Institute for Medical Research, Manhasset, 11030, New York, USA
Anil K. Malhotra
The Hofstra NS-LIJ School of Medicine, Hempstead, 11549, New York, USA
Anil K. Malhotra
The Zucker Hillside Hospital, Glen Oaks, 11004, New York, USA
Anil K. Malhotra
Aalborg University Hospital, Psychiatry, Aalborg, 9000, Denmark
Jimmi Nielsen
Department of Clinical Medicine, Aalborg University, Aalborg, 9000, Denmark
Jimmi Nielsen
Department of Psychiatry, University of Halle, Halle, 06112, Germany
Dan Rujescu
Department of Psychiatry, University of Munich, Munich, 80336, Germany
Dan Rujescu
Department of Clinical Medicine, University of Copenhagen, Copenhagen, 1165, Denmark
Thomas Werge
Institute of Biological Psychiatry, MHC Sct. Hans, Mental Health Services Copenhagen, Copenhagen, 4000, Denmark
Thomas Werge
The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Roskilde, 8000, Denmark
Thomas Werge
Department of Psychiatry, Harvard Medical School, Boston, 02115, Massachusetts, USA
Deborah L. Levy
Psychology Research Laboratory, McLean Hospital, Belmont, 02478, Massachusetts, USA
Deborah L. Levy
Department of Psychiatry, Drexel University, Philadelphia, 19104, Pennsylvania, USA
Richard C. Josiassen
Center for Addiction and Mental Health, Toronto, M5T 1R8, Canada
James L. Kennedy
Department of Psychiatry, Columbia University and the New York State Psychiatric Institute, New York, 10032, New York, USA
Jeffrey A. Lieberman
Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, SE-171 77, Sweden
Patrick F. Sullivan

Contributions

The study was designed by L.F.J., J.L.K., J.A.L., M.J.D. and P.F.S. with additional input by E.J.C.G.V.D.O. Cases and controls were ascertained and phenotyped by L.F.J., I.C., M.D., E.G., D.C.G., A.L.H., D.L.K., A.K.M., J.N., M.P., D.R., T.W., D.L.L. and R.C.J. Most genomic assays were done under the supervision of K.S. with some done by C.H. Data analysis was conducted by J.I.G., L.D., H.H., M.L., B.M.N., S.R. and J.P.S. Molecular docking modeling was done by D.F. and A.T. All authors discussed the results and implications and commented on the manuscript at all stages.

Corresponding authors

Correspondence to Mark J. Daly or Patrick F. Sullivan.

Ethics declarations

Competing interests

L.F.J. has received grant support from Genentech, Amgen, and Sunovion in the past 3 years, and is a member of a Data Monitoring Committee for Janssen. A.L.H. is an employee of iSAEC, a 501(c)(3) non-profit research organization. J.A.L. serves on the Advisory Board of EnVivo and Intracellular Therapies and does not receive direct financial compensation or salary support for his participation. He receives grant support from Biomarin, EnVivo, Genentech, Novartis, Psychogenics and Sunovion; is a member of the Advisory Board of Pear Therapeutics and holds a financial interest; and holds a patent from Repligen. The remaining authors declare no competing financial interests.

Supplementary information

Supplementary Figures and Supplementary Tables

Supplementary Figures 1-9 and Supplementary Tables 1-2 (PDF 1010 kb)

Supplementary Data 1

Top LD-pruned SNPs from the GWAS scan sorted by P-value. Columns are: SNP=name of variant; CHR=chromosome; BP=base pair position (hg19); P=P-value; OR=odds ratio for allele 1; SE=standard error; A1A2=allele 1 and allele 2; FRQ_A=frequency of allele 1 in affected cases; FRQ_U=frequency of allele 1 in unaffected controls; INFO=imputation info score; ngt=number of studies in which this variant was genotyped (vs. imputed); "friends(.1).p0.001"=list of all variants with LD-r² > 0.1 to index SNP, in brackets LD-r² and distance in kb, sorted by LD-r²; range.left=left margin of region (defined by LD friends); range.right=right margin of region (defined by LD friends); span(kb)=right margin - left margin (in kb); "friends(.6).p0.001", range.left.6, range.right.6, and span.6(kb)=as before but with LD-r² of 0.6; gwas_catalog_span.6=list of entries in NHGRI GWAS catalogue among entries in column friends(.6); "genes.6.50kb(dist2index)"=list of genes within the region of friends.6 (±50 kb), in brackets distance to index SNP in kb. (XLSX 150 kb)

Supplementary Data 2

Best rare variant association study results sorted by p-value. Columns are: CHR=chromosome; BP=base pair position (hg19); SNP=name of variant; A1=minor allele; A2=major allele; P=combined P-value from all data; OR=odds ratio; DATA_USED=the source of data that went into the combined p-value; GENE=gene name; FUNCTION=function as annotated by Variant Effect Predictor; POLYPHEN2=PolyPhen-2 annotation for missense variants; SEQ_MAC_AFF=sequencing minor allele count in cases; SEQ_MAC_UNAFF=sequencing minor allele count in controls; SEQ_MAJ_AFF=sequencing major allele count in cases; SEQ_MAJ_UNAFF=sequencing major allele count in controls; SEQ_MAF_AFF=sequencing minor allele frequency in cases; SEQ_MAF_UNAFF=sequencing minor allele frequency in controls; SEQ_P=p-value from sequencing data only; SEQ_OR=odds ratio from sequencing data only; EC_SNP=exome chip SNP name; EC_MAC_AFF=exome chip minor allele count in cases; EC_MAC_UNAFF=exome chip minor allele count in controls; EC_MAJ_AFF=exome chip major allele count in cases; EC_MAJ_UNAFF=exome chip major allele count in controls; EC_MAF_AFF=exome chip minor allele frequency in cases; EC_MAF_UNAFF=exome chip minor allele frequency in controls; EC_P=p-value in exome chip data only; EC_OR=odds ratio in exome chip data only; ESP_MAC=minor allele count in ESP data; ESP_MAJ=major allele count in ESP data; ESP_MAF=ESP minor allele frequency. (XLSX 86 kb)

Supplementary Data 3

Data used to make Figure 4. The first tab has no conditioning SNPs, the second tab conditions on HLA-DQB1 126Q, and the third tab conditions on HLA-DQB1 126Q and HLA-B 158T. Columns are: SNP=name from SNP2HLA, where amino acids are labeled as AA_GENE_AAPOS_hg18basepair_residue (see SNP2HLA documentation for more information); A1=allele with respect to which the odds ratio is calculated (P=present, A=absent); A2=alternate allele; FRQA=A1 frequency in cases based on dosages; FRQU=A1 frequency in controls based on dosages; INFO=imputation info score; OR=odds ratio; SE=natural log of standard error of odds ratio; P=P-value. (XLSX 1736 kb)

About this article

Cite this article

Goldstein, J., Fredrik Jarskog, L., Hilliard, C. et al. Clozapine-induced agranulocytosis is associated with rare HLA-DQB1 and HLA-B alleles. Nat Commun 5, 4757 (2014). https://doi.org/10.1038/ncomms5757

Received: 02 April 2014
Accepted: 18 July 2014
Published: 04 September 2014
DOI: https://doi.org/10.1038/ncomms5757
