Abstract
Heart failure (HF) is a major contributor to global morbidity and mortality. While distinct clinical subtypes, defined by etiology and left ventricular ejection fraction, are well recognized, their genetic determinants remain inadequately understood. In this study, we report a genome-wide association study of HF and its subtypes in a sample of 1.9 million individuals. A total of 153,174 individuals had HF, of whom 44,012 had a nonischemic etiology (ni-HF). A subset of patients with ni-HF were stratified based on left ventricular systolic function, where data were available, identifying 5,406 individuals with reduced ejection fraction and 3,841 with preserved ejection fraction. We identify 66 genetic loci associated with HF and its subtypes, 37 of which have not previously been reported. Using functionally informed gene prioritization methods, we predict effector genes for each identified locus, and map these to etiologic disease clusters through phenome-wide association analysis, network analysis and colocalization. Through heritability enrichment analysis, we highlight the role of extracardiac tissues in disease etiology. We then examine the differential associations of upstream risk factors with HF subtypes using Mendelian randomization. These findings extend our understanding of the mechanisms underlying HF etiology and may inform future approaches to prevention and treatment.
Similar content being viewed by others
Main
Genome-wide association studies (GWAS) of HF have provided valuable translational insights, yet existing studies are often limited by phenotypic heterogeneity and a lack of stratification by etiology1,2,3. HF often develops following significant risk factor exposures, such as myocardial infarction and valvular heart disease4. We hypothesized that a study of HF occurring without major antecedent risk factors (specifically, myocardial infarction, coronary revascularization or marked structural or valvular heart diseases) could better capture genetic factors directly influencing HF susceptibility5 (Extended Data Fig. 1 and Supplementary Fig. 1). We refer to this form of HF as ni-HF. Where data permitted, ni-HF cases were further stratified by left ventricular systolic dysfunction, defining those with left ventricular ejection fraction (LVEF) above or below 50% as ni-HFpEF or ni-HFrEF, respectively. We performed GWAS meta-analyses to identify genomic loci, prioritize putative effector genes and characterize the key pathways, tissues and cell types involved in etiology. We further investigated etiologic clusters through a pleiotropy scan and used Mendelian randomization (MR) to assess the genetic evidence supporting risk factor associations. Our study advances the understanding of the genetic architecture of HF and its subtypes.
Results
GWAS meta-analysis
We conducted a GWAS meta-analysis of HF and its subtypes across 42 studies participating in the Heart Failure Molecular Epidemiology for Therapeutic Targets (HERMES) consortium. The study population comprised 1,946,349 individuals, including 153,174 diagnosed cases of HF (hereafter termed HFall), of which 44,012 had a nonischemic etiology (ni-HF). Among these, 5,406 were classified as ni-HFrEF (LVEF < 50%) and 3,841 were classified as ni-HFpEF (LVEF ≥ 50%; Supplementary Figs. 1 and 2). Antecedent myocardial infarction was reported for 29% of the HFall cases and <1% of the ni-HF phenotypes (Supplementary Table 1). The identified HF cases encompassed five major ancestry groups: 139,533 (91%) European; 9,413 (6.2%) East Asian; 3,292 (2.2%) African; 779 (0.5%) South Asian; and 157 (0.1%) admixed American (Supplementary Table 2).
We performed a cross-ancestry, fixed-effect inverse-variance-weighted (IVW) meta-analysis to investigate the associations of 10,199,961 genetic variants (minor allele frequency (MAF) >1%) with the risk of HF and its subtypes. We identified 59 conditionally independent (sentinel) genetic variants associated with HFall at P < 5 × 10−8, at 56 nonoverlapping genomic loci (distance > 500 kb; Fig. 1 and Supplementary Table 3). We did not observe heterogeneity of effects between ancestry groups, except for one sentinel variant at the LPA locus (Phet-ancestry < 0.05/59; Supplementary Fig. 3 and Supplementary Table 4). Ten additional variants were associated with ni-HF subtypes but not with HFall. In total, 66 independent genomic loci were identified across the HF phenotypes at P < 5 × 10−8, of which 46 (70%) loci passed a multiplicity-adjusted threshold of P < 5 × 10−8/4 phenotypes. Of the 66 identified loci, 46 (70%) were associated with at least one nonischemic subtype at adjusted P < 0.05/66 (hereafter termed ni-HF loci). The remaining 20 loci were associated with HFall but not with the nonischemic subtypes (hereafter termed other HF loci; Fig. 2c). We identified 37 variants not previously reported and successfully replicated 76 of 87 (87%) previously reported HF variants available in our study (P < 0.05/87; Supplementary Fig. 4 and Supplementary Table 5)1,2,3,6,7.
a, Phenotyping schema. b, Manhattan plots of four HF subtypes showing −log10(P) for genetic associations from GWAS meta-analysis (y axis) across genetic variants ordered by chromosome and base pair positions (x axis). c, Summary of conditionally independent lead variants effect across HF phenotypes. Lead variants are denoted using chromosome, base pair position according to the GRCh37 assembly, risk-increasing allele in HF phenotype with the lowest P value for association and the other allele. Loci are categorized by the strength of genetic associations across HF phenotypes, with labels on the right edge of the plot. The presented P values are derived from two-sided association tests as described in Methods. Chr1, chromosome 1.
a, Circular heatmap showing predictor score and Boolean classifier scores for 142 candidate genes across HF susceptibility loci. Blue tiles indicate a 'true' value and prioritized genes within each locus are highlighted in red. Coloc refers to colocalization evidence due to shared causal variants with gene expression level in the most relevant tissue; Mendel indicates overlap with Mendelian disease genes; ABC refers to the ABC measure for enhancer-gene activity of overlapping lead variants and their proxies or fine-mapped variants; overall refers to the weighted average of PoPS, TWAS statistics and V2G scaled predictor score (with 2:1:2 weight ratio). b, Schematic diagram of locus categorization by genetic associations across HF subtypes. c, Upset plot showing number of loci per phenotype category. d, Schematic diagram of effector gene prioritization.
To investigate sex as a potential modifier of variant effects, we conducted a meta-regression analysis of sex ratio (proportion of male sex) and additive effects of risk-increasing alleles for 66 identified HF sentinel variants across the participating studies. Although limited by statistical power, we observed nominally significant evidence of effect modification by sex at 3 of the 66 loci (P < 0.05; Supplementary Table 6).
Genetic architecture and heritability
The genetic architecture of HF was found to be highly polygenic, evidenced by an elevated genomic inflation factor (λGC = 1.22) in the absence of population stratification (linkage disequilibrium score (LDSC) regression intercept = 1.01; Supplementary Figs. 5 and 6 and Supplementary Table 7). We observed an exponential relationship between allele frequency and effect size for the associated variants (Extended Data Fig. 2). The estimated proportion of variance in disease liability explained by common genetic variants under an additive model, or SNP-based heritability (h2g), was 5.4 ± 0.2% for HFall, 6.1 ± 0.5% for ni-HF, 11.8 ± 2.6% for ni-HFrEF and 1.8 ± 1.3% for ni-HFpEF (Extended Data Fig. 3)8. Positive genetic correlations were observed across between all HF subtype pairs, with estimates ranging from 0.42 (s.e. = 0.18) between ni-HFrEF and ni-HFpEF to 0.93 (s.e. = 0.15) between ni-HF and ni-HFpEF (Supplementary Fig. 7). We derived a polygenic score (PGSHF) from the HFall GWAS, excluding UK Biobank (UKB) participants, and evaluated its association with HF in UKB. Among 347,235 UKB European participants (13,793 HFall cases), the PGSHF was associated with HFall (odds ratio (OR) per PGS s.d. = 1.37 (95% confidence interval (CI) = 1.35–1.39), P < 2 × 10−16), after adjusting for sex, age and first ten genetic principal components (PCs). At a 5% false positive rate, 19% of the cases were identified. Individuals in the top decile of PGSHF had 1.70-fold higher odds of developing heart failure compared to those in the fifth decile (OR = 1.70, 95% CI = 1.59–1.82, P = 1.7 × 10−142) and 2.89-fold compared to those in the bottom decile (OR = 2.89, 95% CI = 2.66–3.14, P = 2 × 10−51; Supplementary Table 8 and Extended Data Fig. 4).
Prioritization of effector variants, genes and pathways
We then examined the functional properties of variants and genes within each GWAS locus. Through functionally informed fine mapping, we identified 70 credible sets containing 547 putative causal variants at 47/66 HF loci (cumulative posterior inclusion probability (PIP) >0.95)9. Eleven fine-mapped variants, including exonic variants in established dilated cardiomyopathy (DCM) genes FLNC, BAG3 and HSPB7, had high predicted deleteriousness (Combined Annotation-Dependent Depletion (CADD) Phred score > 20)10,11,12,13 (Extended Data Fig. 5 and Supplementary Table 9). Next, we used a two-stage approach to identify potential effector genes by evaluating 758 protein-coding genes located within heart failure loci. In the first stage, we identified candidate genes ranked highest by one or more of the following gene prioritization methods: (1) variant-to-gene (V2G) scoring of fine-mapped variants14, (2) gene-level polygenic priority scores (PoPS)15 and (3) association of predicted gene expression with HFall in multitissue transcriptome-wide association study (TWAS)16. In the second stage, we nominated a single best-prioritized gene for each of the 66 loci based on the following: (1) predicted enhancer-gene activity using the activity-by-contact (ABC) model17,18, (2) colocalization of gene expression in a relevant tissue19 and (3) association with a phenotypically relevant Mendelian disorder20 (Fig. 2 and Supplementary Table 10). Results for regional genetic associations, gene prioritization scores, cross-trait association and study-level estimates are provided in Supplementary Data 1.
The prioritized genes in ni-HF loci included those implicated in cardiac development and cardiomyopathy (ACTN2, BAG3, CAMKD2 (ref. 21), CAND2 (ref. 22), FLNC, ACTN2 and HSPB7 (refs. 11,12,13,23), CAMKD2 (ref. 21), NKX2-5 and STRN24), cardiac hypertrophy (CAMKD2 (ref. 21) and CAND2 (ref. 22)) and cardiac arrhythmia (PITX2, KLF12 and ATP1B1 (refs. 25,26,27)). Genes with an established role in atherosclerosis (LDLR28, LPL29, ABCG5 (refs. 30,31) and LPA32) were notable among loci associated with HFall but not with ni-HF.
We also performed an exploratory enrichment analysis of the candidate gene sets to identify additional pathways and cellular components associated with HFall and ni-HF (insufficient genes were identified for ni-HFrEF and ni-HFrEF; Extended Data Fig. 6 and Supplementary Table 11). Cellular growth and senescence pathways were highly enriched for both HFall and ni-HF. Notably, a key senescence-associated secretory phenotype (SASP) gene, insulin-like growth factor binding protein 7 (IGFBP7), was prioritized at one of the ni-HFpEF loci. IGFBP7 encodes a circulating anti-angiogenic factor produced by the vasculature, which has recently been implicated in cardiomyocyte senescence and cardiac remodeling33. The glucocorticoid pathway was enriched across all HF phenotypes, and genes associated with the sarcomeric Z-disk were overrepresented in both HFall and ni-HF.
Identifying organs, tissues and cells involved in etiology
To identify the biological components contributing to HF etiology, we performed a heritability enrichment analysis across 206 tissues and cell types, representing 12 organs and systems (Fig. 3 and Supplementary Table 12). We observed enrichment of 46 unique tissues and cell types (one-sided P < 0.05), highlighting the diverse range of organ systems involved in HF etiology. Overall, cardiac tissues were the most frequently enriched (15 enrichments), followed by musculoskeletal/connective tissues (12 enrichments) and nervous system tissues (8 enrichments). Cardiac tissues showed the highest enrichment for HFall, ni-HF and ni-HFrEF, whereas for niHFpEF, the kidney and pancreas showed the highest enrichment. We then investigated the contribution of 14 cardiac cell populations using single-nucleus transcriptomics derived from 185,185 nuclei from 16 nonfailing human hearts. Heritability enrichment and gene-based association enrichment using MAGMA34 revealed cardiomyocytes as the major effector cell type for ni-HF. Additionally, vascular smooth muscle cells were also enriched for HFall but not for ni-HF (Supplementary Fig. 8 and Supplementary Table 13).
Estimates for heritability enrichment of 206 tissues and cell types in 12 system/organ categories across four HF phenotypes are presented. Top five tissues/cell types per phenotype are highlighted with labels. The presented −log10(P) is derived from a one-sided statistical test of heritability enrichment using LDSC-SEG as described in Methods.
To investigate whether genes prioritized at the identified HF loci were differentially regulated in failing hearts, we compared the cell-type transcriptomic profile of the nonfailing heart samples with those from 28 DCM and hypertrophic cardiomyopathy (HCM) patients (n nuclei = 344,797). We found that 30/95 (30%) of ni-HF genes were differentially regulated, predominantly within fibroblasts and cardiomyocytes (Extended Data Fig. 7 and Supplementary Table 14). Notably, nuclear receptor subfamily 3 group C member 1 (NR3C1), which encodes glucocorticoid receptors, was upregulated in 12 of 14 cardiac cell types from failing hearts. This upregulation aligns with the glucocorticoid pathway enrichment among the prioritized genes identified at HF GWAS loci. These findings suggest that glucocorticoid signaling may have a cardioprotective role, potentially through the inhibition of pathological mineralocorticoid receptor signaling35, which could help explain, in part, the beneficial effects of mineralocorticoid receptor antagonist drugs in HF36. Another pro-survival HF gene, KLF12, was upregulated in both cardiomyocytes and cardiac fibroblasts. KLF12 is a transcription factor that upregulates p53, which in turn leads to upregulation of p21 (CDKN1A)37. Activation of p53 has been implicated as an important mechanism in LMNA cardiomyopathy38. In contrast to prior reports, we did not observe differential expression of IGFBP7 in cardiomyocytes from failing hearts33.
Mapping genetic loci to etiologic clusters
To explore potential mechanisms by which HF genetic loci influence disease risk, we conducted a pleiotropy scan to examine associations between the identified loci and 294 clinical traits and diseases systematically ascertained from UKB39. We identified 207 pleiotropic associations at false discovery rate (FDR) < 1% (corresponding to P < 0.001) between 46/66 (70%) HF loci and 79 (27%) of the phenotypes tested (Extended Data Fig. 8, Supplementary Fig. 9 and Supplementary Table 15). To visualize these associations, we constructed a pleiotropy graph network by representing loci and diseases as nodes and locus–phenotype associations as edges (Extended Data Fig. 9a). Using a community detection algorithm, we identified 18 etiological clusters of locus–disease associations (Fig. 4, Extended Data Fig. 9b and Supplementary Table 16). The largest cluster, cluster 1, included traits and diseases related to atherosclerosis, with loci that were not associated with ni-HF subtypes. Cluster 2 encompassed cardiac arrhythmias and DCM and loci associated with ni-HF. Cluster 3 centered on the pleiotropic effects of a locus prioritizing ALDH2, which encodes an enzyme involved in removing toxic acetaldehydes, with deficiencies in ALDH2 being linked to increased risks of cancer and cardiovascular diseases40. The sentinel variant at the corresponding locus has been associated with reduced ALDH2 expression in blood41. Cluster 4 was centered around hypertension, a major risk factor for all HF subtypes. Finally, cluster 5 comprised ni-HF and ni-HFpEF loci associated with adiposity, diabetes and carpal tunnel syndrome, which are notable clinical risk factors for HFpEF.
The network is constructed from 207 genotype–phenotype associations across 79 unique diseases and 46 HF susceptibility loci identified from phenome-wide association (PheWAS) analysis in UKB at FDR <1%. Nodes represent genetic loci labeled by the prioritized gene (solid background with bold–italic label, colored by categorical association across HF phenotypes) and phenotypes (translucent background, colored by phenotype category), sized proportionally to centrality measure. Edges (arrows connecting locus nodes to phenotype nodes) represent the association, with thickness representing the strength of association measured by absolute z score. The full pleiotropy network and phenotype category color codes are presented in Extended Data Fig. 9. Phenotype abbreviations, phenotype categories and phenome-wide association results are presented in Supplementary Table 15 and Extended Data Fig. 8. Cluster membership and centrality measures of nodes are presented in Supplementary Table 26 and Supplementary Fig. 13. CHD, coronary heart disease; NOS, not-otherwise-specified (nonspecific cause), AF, atrial fibrillation; AV block, atrioventricular block; LRTI, lower respiratory tract infection; COPD, chronic obstructive pulmonary disease; LBBB, left bundle branch block; BPH, benign prostatic hyperplasia.
We then investigated the genetic associations of sentinel variants across 66 HF loci with 24 related diseases, lifestyle factors and quantitative traits related to cardiovascular physiology and cardiac structure and function. This analysis revealed 207 pleiotropic associations at FDR < 1% (Extended Data Fig. 10 and Supplementary Table 17). Notably, 24 of the 66 HF loci (36%) were not associated with any of the traits investigated (P ≥ 5 × 10−8). Cross-trait colocalization analysis revealed evidence for shared causal variants for 105 locus–phenotype pairs (posterior probability of shared causal variants > 0.8; Fig. 5 and Supplementary Table 18). For example, the BTB domain and CNC homolog 1 (BACH1) ni-HFrEF locus colocalized with both coronary artery disease (CAD) and DCM. BACH1, a transcription factor induced by mechanical stress in cardiac fibroblasts and endothelial cells, regulates Yes1-associated transcriptional regulator (YAP) expression by binding to its promoter42. The BACH1–YAP transcriptional network plays a crucial role in cardiac regeneration by modulating proliferative programs in cardiomyocytes43. Additionally, six HF loci colocalized with either estimated glomerular filtration rate (eGFR) or chronic kidney disease (CKD), highlighting the key role of the cardio–renal axis in HF. The prioritized gene at one of these loci, PFDN1, encodes prefoldin subunit 1, a molecular chaperone involved in protein folding. This gene has been associated with mortality and cardiovascular phenotypes in mouse knockout models44,45.
a, Chord diagram showing connections between 22 (of 24 tested) related phenotypes across 42 (of 66) HF susceptibility loci with posterior probability of shared causal variants (PPcoloc H4) > 0.8. Each band connects a locus to a phenotype, representing the sharing of causal genetic variants (colocalization) between the tested phenotype and HF at the locus. b, Total number of colocalized phenotypes across HF susceptibility loci. c, Total number of HF susceptibility loci with genetic colocalization across tested phenotypes. DBP, diastolic blood pressure; FVC, forced vital capacity; FEV1, forced expiratory volume in 1 s; LVEDV, left ventricular end diastolic volume; PDSR-r, peak diastolic strain rate radial; PDSR-l, peak diastolic strain rate longitudinal; RVEF, right ventricular ejection fraction; RVEDV, right ventricular end diastolic volume; LAVi, left atrial volume index.
Estimation of risk factor effects on heart failure
Previous genetic studies have identified certain exposures as potential causal factors for HF; however, these studies have been limited in the range of traits examined and characterization of their association with HF subtypes1,2,3. To address this gap, we investigated the potential role of 24 related phenotypes (used in the etiologic cluster mapping above) in the etiology of HF and its subtypes. First, we estimated the genetic correlation (rg) between the HF phenotypes and the exposures of interest using bivariate LDSC. We found evidence of shared additive genetic effects for 21/24 phenotypes, with rg values ranging from −0.25 to 0.67 (Fig. 6 and Supplementary Table 19). Next, we estimated the associations using MR with an IVW fixed effects model and performed sensitivity analyses using MR–Egger and the weighted median estimator (WME). Overall, 18 of the 24 of the exposures were associated with at least one HF subtype (FDR < 1% of IVW estimate, with directional concordance in sensitivity analyses; Fig. 6, Supplementary Figs. 10–12 and Supplementary Table 20).
Asterisks (*) indicate binary traits. MR effect estimates are reported as OR (ORMR) per doubling prevalence for binary traits or per s.d. increase for quantitative traits. Estimates that were robust to multiple-testing adjustment at FDR <1% and sensitivity analyses were indicated by light blue shade (for rg < 0 and ORMR < 1) or light red shade (for rg > 0 and ORMR > 1). The heatmaps represent two-sided P values for associations from different MR models, color-coded with the direction of MR estimates and strength of associations. Diamonds and whiskers on the forest plots represent point estimates and 95% CIs for rg (left) and ORMR (right). Missing diamonds/point whiskers with arrows represent point estimates/95% CIs outside the scale. HR, heart rate; RA-area, right atrial area.
The estimated effects of risk factors on HFall and ni-HF outcomes were similar, with the exception of CAD. Consistent with prior studies, liability to CAD was associated with HFall (MR OR per doubling prevalence with IVW estimator (ORMR–IVW) = 1.20, 95% CIs = 1.18–1.22), but not with ni-HF (ORMR–IVW = 1.02, 1.00–1.04)1. We found weak evidence supporting a causal effect of liability to type 2 diabetes (T2D) on HFall (ORMR−IVW = 1.04, 1.03–1.05 per doubling prevalence) as previously reported1,46; however, evidence of horizontal pleiotropy was observed (PMR–Egger intercept = 1 × 10−6), and the association was attenuated when estimated using pleiotropy-robust methods (ORMR–Egger = 0.98, 0.96–1.00, ORMR–WME = 1.01, 1.00–1.02 per doubling prevalence). We did not find evidence supporting a causal T2D effect on ni-HF risk (ORMR–IVW = 1.01, 0.99–1.02 per doubling prevalence).
Despite the kidney being prioritized in heritability enrichment analyses and colocalization of HF phenotypes with renal traits, neither lower eGFR or CKD were associated with ni-HFpEF in MR analysis (Figs. 3 and 5) and there was limited genetic correlation between CKD and HFall (rg = 0.17, P = 0.0002)47,48. In MR analysis, genetically predicted high systolic blood pressure (SBP) and higher body mass index (BMI) were associated with risk of all HF phenotypes, with the effect estimates differing between ni-HF subtypes stratified by LVEF; SBP had the largest effect on ni-HFrEF (ORMR–IVW = 1.92, 1.70–2.17 per s.d., Pinteraction versus ni-HFpEF estimate = 5.7 × 10−7), while BMI had the greatest magnitude of effect on ni-HFpEF (ORMR–IVW = 1.87, 1.70–2.06 per s.d., Pinteraction versus ni-HFrEF estimate = 0.002).
We identified potential causal effects of liability to chronic obstructive pulmonary disease (ORMR–IVW = 1.10, 95% CI = 1.06–1.14 per doubling prevalence) and lower forced expiratory volume on risk of HF (ORMR–IVW = 0.91, 95% CI = 0.88–0.95 per s.d.) and risk of ni-HF (ORMR–IVW = 0.91, 95% CI = 0.88–0.95 per s.d.). We also found evidence for effects of smoking behavior on HF (ORMR–IVW = 1.26, 95% CI = 1.19–1.33 per s.d. pack-years of smoking). Given the prioritization of ALDH2 at an identified GWAS locus for HF and ni-HF, we estimated the effects of higher alcohol consumption on HF risk. We found evidence of a positive association between higher alcohol consumption and risk of both HF (ORMR–IVW = 1.17, 95% CI = 1.06–1.29 per s.d. drinks per week) and ni-HFrEF (ORMR–IVW = 1.87, 95% CI = 1.37–2.55 per s.d. drinks per week), consistent with clinical studies that identify alcohol as a risk factor for dilated cariomyopathy49.
In MR analyses, lower LVEF, a trait linked to impaired myocardial contractility, was associated with a higher risk of all HF phenotypes except for ni-HFpEF (Fig. 6 and Supplementary Fig. 12).The strongest association was observed with ni-HFrEF (ORMR–IVW = 0.18, 0.12–0.27 per s.d. LVEF; ORMR–IVW = 1.83, 1.64–2.03 per doubling prevalence of DCM). Conversely, a higher liability to hypertrophic cardiomyopathy, a condition linked to increased myocardial contractility, was protective against ni-HFrEF (ORMR–IVW = 0.88, 0.85–0.91 per doubling prevalence of HCM).These findings are consistent with the reported opposing genetic associations between dilated and HCM and with observational studies reporting divergent association between lower and higher LVEF and the risk of HFrEF and HFpEF, respectively50,51. Despite the widely held hypothesis that diastolic dysfunction is the key driver for HFpEF, we did not find evidence supporting a causal association (ORMR–IVW for ni-HFpEF = 1.16, 0.91–1.48 per s.d. peak diastolic strain rate radial), although the limited set of diastolic parameters investigated are not established as clinical indices of diastolic dysfunction52.
Discussion
We report a GWAS of heart failure in 1.9 million individuals, including over 150,000 cases, with a focus on HF subtypes defined by etiology and LVEF. Our analysis identifies 66 distinct genetic loci associated with HF, including 37 new loci, with specific associations identified for ni-HFpEF. Through a comprehensive framework integrating functional genomic data at the variant, locus and genome-wide levels, we uncover new genes and pathways underlying HF and its subtypes. We leverage genetic association data to map the organs, tissues and cells contributing to HF pathogenesis and estimate subtype-specific effects of key risk factors.
Our study focusses on HF occuring in the absence of major secondary causes, specifically, previous myocardial infarction, revascularization, congenital heart disease and significant valvular heart disease. By focusing on cases without these common upstream drivers, our findings highlight mechanisms intrinsic to HF, including cardio–renal mechanisms and primary cardiomyopathies53. The ni-HFrEF phenotype captures both HF with mildly reduced (LVEF = 41–49%) and reduced (LVEF < 40%)54 ejection fraction and is enriched for DCM. Among the subtypes analyzed, this phenotype showed the highest heritability. In contrast, ni-HFpEF had low heritability but significant enrichment for renal tissues, reflecting a more heterogeneous phenotype consistent with a multisystem disorder.
Our comprehensive appraisal of the genetic and phenotypic etiology of HF and its subtypes offers key insights with tangible translational implications. First, by integrating functional genomic data to identify likely causal variants, genes and pathways, we identify new loci and genes, as well as prioritizing alternative candidate genes at previously reported HF loci. Molecular mechanisms regulating growth emerged as potentially important, including modulation of BACH1–YAP42 and KLF12–CDKN1A37 activity. Notably, we prioritized ALDH2 at a previously known HF locus, a gene that has been associated with increased cardiovascular risk via its effect on alcohol metabolism. We also found enrichment of the SASP pathway and identified IGFBP7 as a potential HFpEF gene. IGFBP7 is a well-established clinical biomarker of acute renal failure and a driver of cell cycle arrest, fibrosis and vascular rarefaction, presenting a potentially promising target for new therapeutics33,55,56. Second, our findings underscore the significance of noncardiac tissues in HF etiology. In particular, the kidney, vascular system and metabolic tissues were identified as important for HFpEF, although we did not find evidence of associations between CKD and eGFR in our MR analysis57. Third, our appraisal of cardiovascular risk factor effects across HF subtypes yielded several findings with potential implications for preventive strategies. For example, the weak evidence of the causal effect of T2D on HF suggests that observational associations between these conditions might partially stem from shared upstream mechanisms. Notably, certain diabetes treatments, such as SGLT2 inhibitors, have demonstrated substantial benefits in patients with HF without diabetes. This observation may reflect the drugs’ effects on common upstream pathways influencing both HF and diabetes58. We also found no evidence supporting the causal effects of CAD on liability to HF in the absence of previous myocardial infarction or revascularization (ni-HF).Finally, we provide new evidence implicating modifiable lifestyle factors, including smoking and alcohol consumption, in HF risk, highlighting potentially actionable targets for prevention59,60.
Our study has important limitations that suggest directions for future research. Despite efforts to include data from diverse populations, participants of non-European ancestry were underrepresented in our analysis. Additionally, only a minority of HF cases could be stratified by LVEF due to limited data availability, and key measures of ventricular size and diastolic function were largely unavailable. Large prospective studies with detailed phenotypic measurements are particularly needed for HFpEF to enable a more precise characterization of the genetic basis underlying HF subtypes. The variants and genes identified in the discovery analysis require functional validation, and studies with larger sample sizes are needed to improve the precision of effect size estimates. Our meta-regression analysis of sex ratio likely underestimates genetic effect modification by sex due to the limited statistical power and potential confounding by sex-related factors. To address this, sex-stratified genetic association studies are needed to uncover sex-specific effects. In our cross-trait genetic analyses, we note a minor sample overlap between samples in the current GWAS and those used in GWAS of CAD and atrial fibrillation; however, this overlap is unlikely to have significantly biased the MR estimates due to the limited scope of the overlap and the strength of the exposure instruments. Nevertheless, further studies are needed to model relationships between individual risk factors more effectively, incorporating multivariable and mediation analysis.
In summary, we used human genetic analysis to map the etiologies underlying HF and its subtypes, classified by etiology and LVEF. Our findings provide new evidence supporting the concept of HF as a multisystem clinical syndrome and provide insights into the underlying biological mechanisms. These insights may guide the development of more effective strategies for prevention and treatment.
Methods
Ethics statement
This study complies with the ethical regulations provided by the University College London Research Ethics Committee. All participating studies were ethically approved by local committees, and all study participants provided written informed consent (Supplementary Table 21).
Phenotype definition
The present meta-analysis included 1,946,349 individuals from 42 studies (Supplementary Table 21). Each study defined up to four HF phenotypes, which are as follows: (1) overall HF (HFall), (2) ni-HF, (3) ni-HFrEF and (4) ni-HFpEF. The HFall phenotype includes any diagnosis of HF based on a physician’s adjudication, hospital record review or diagnosis codes. ni-HF was defined by excluding antecedent ischemic, valvular and congenital heart diseases. ni-HFrEF was defined as ni-HF with LVEF < 50% based on cardiac imaging or diagnosis of LVSD at any point. ni-HFpEF was defined as ni-HF with LVEF ≥ 50% based on cardiac imaging without a record of LVSD at any point. Phenotyping was performed separately in each participating study (Supplementary Table 22), using a harmonized multimodal phenotyping framework (Supplementary Methods).
Genome-wide association analysis
Study-level GWAS was performed using logistic regression (for prevalent cases) or Cox proportional hazard (for incident cases) assuming additive genetic effect with adjustment for sex, age at DNA draw, genetic PCs and study-specific covariates (Supplementary Table 23). Study-level GWAS results were meta-analyzed using a fixed-effect IVW model implemented in METAL61. As study-level estimates include both results from logistic regression and Cox proportional hazard regression, we report the meta-analysis effect estimates as relative risk per additional risk allele. Sample genotyping and quality control of study-level and meta-analysis GWAS summary statistics are detailed in Supplementary Methods and Supplementary Data 2.
Identification of genetic susceptibility loci
To identify genetic susceptibility loci for HF, we performed a chromosome-wide stepwise conditional-joint analysis in each heart failure phenotype using Genome-wide Complex Trait Analysis (GCTA) software62. Conditionally independent variants across HF phenotype with joint P < 5 × 10−8 that are physically located within 500 kb of each other were aggregated into one locus set. A genomic locus was then defined as the genomic region within 500 kb upstream and downstream of the farthest variants in each aggregated set.
The identified genomic loci were labeled with an incremental one-based integer sequence based on phenotype order (HFall, ni-HF, ni-HFrEF and ni-HFpEF), chromosome and base pair positions. A genomic locus was declared as new if all conditionally independent variants within the locus and any of the sentinel variants reported at P < 5 × 10−8 in previous GWAS of HF1,2,3,6,7 were physically located more than 250 kb away and not in linkage disequilibrium (LD; r2 < 0.2). Analysis was performed using reference genotype from randomly sampled 10,000 UKB participants with proportionally matched ancestry composition (admixed American ancestry was not included due to unavailability in UKB and the small proportion of this ancestry in the overall sample).
Furthermore, genetic associations were categorized based on P values at cutoff values of P < 5 × 10−8 (genome-wide significant), P < 0.05/number of identified loci (replicated at a Bonferroni-adjusted threshold), P < 0.05 (nominally significant) and P ≥ 0.05 (no evidence of association). Loci associated with any ni-HF subtype at Bonferroni-adjusted or genome-wide significance thresholds were labeled as ni-HF loci, and the remaining were labeled as other HF loci. ni-HF loci that are associated with ni-HFrEF at Bonferroni-adjusted or genome-wide significance thresholds were labeled as ni-HFrEF loci, and the remaining were labeled as ni-HFpEF loci. Sentinel variants for HF were defined as conditionally independent variants with the lowest P value for association across HF phenotypes within a locus. For loci associated with both HFall and ni-HF phenotypes, sentinel variants identified in the HFall analysis were prioritized.
Cross-ancestry allelic effect heterogeneity
To assess the heterogeneity of allelic effects across ancestries, we performed a meta-analysis accounting for ancestry-specific allelic effects with a meta-regression framework implemented in MR-MEGA63. Using this technique, we estimated the P value for heterogeneity correlated with ancestry, the P value for residual heterogeneity and the concordance of genetic association P values from the fixed-effect meta-analysis across sentinel variants for the HFall phenotype (Supplementary Fig. 3).
Genetic effect modification by sex
We estimated potential genetic effect modification by sex across sentinel genetic variants for HF through a meta-regression analysis using the R meta v7.0.0 package64,65. For each sentinel variant, we extracted study-specific effect estimates across studies with available sex ratio information (Supplementary Table 1) and aligned the effect alleles to risk-increasing alleles in the GWAS meta-analysis. We then performed a meta-regression of study-level effect estimates on the proportion of males, weighted by the inverse variance of the effect size. We report the regression coefficient and the corresponding P value, whereby a positive value represents a positive correlation between the estimated additive genetic effect of risk-increasing alleles and a larger proportion of males.
Genetic architecture assessment
To assess genetic architecture and polygenicity across HF phenotypes, we compared quantiles of the expected and observed genome-wide genetic association P values using quantile–quantile plot and calculated the genomic inflation coefficient (λGC). To distinguish between polygenicity and confounding by population stratification, we calculated λGC assuming 1,000 participants (λGC-1,000), and estimated the LD score regression slope using LDSC66. To minimize bias, LDSC regression was performed using the European ancestry meta-analysis subset with reference genotypes from 10,000 random UKB European participants. We investigated the relationship between allelic effect on HF and allele frequency by plotting the relative risk per additional minor allele as a function of MAF and fitted two separate local polynomial regressions with locally estimated scatter-plot smoothing for groups of conditionally independent variants associated with increased and decreased risk of HF. To increase precision, we fitted the regression using conditionally independent variants associated with HFall at an FDR < 1%, estimated using the q value package in R67.
SNP-based heritability estimation
The proportion of variance in HF risk explained by common SNPs, that is, SNP-based heritability (h2SNP), was estimated from GWAS meta-analysis summary statistics using Linkage-Disequilibrium-Adjusted Kinships (LDAK) SumHer software68 with LDAK-Thin and BLD-LDAK heritability models69 on a liability scale. To minimize bias, we used the European ancestry meta-analysis subset with precomputed tagging files derived from 2,000 UKB European participants68. The conversion to liability scale was calculated using population prevalence derived from the meta-analysis sample and sample prevalence derived using correction for effective sample size assuming an equal number of cases and controls70.
Polygenic risk score analysis
We explored the association between PGS of HFall (PGSHF) and HFall risk in 347,235 UKB European participants, including 13,793 HFall cases. The PGSHF was constructed as a weighted sum of the allelic count of 1,012,059 genetic variants selected using the LDpred2-auto model71, with weights derived from the present GWAS meta-analysis of HF excluding UKB and reference genotype from 362,320 UKB European participants provided by LDpred2 authors. OR of HFall per s.d. of PGSHF was estimated using logistic regression with binary HFall status as response variable, standardized PGSHF as predictor and sex, age and first ten genetic PCs as covariates. To estimate the risks of HF in individuals with high PGSHF, we grouped participants into deciles of PGSHF and calculated ORs of the top decile as compared against the fifth and the first (bottom) decile using logistic regression.
Fine mapping and functional consequences of causal variants
We performed functionally informed fine mapping using PolyFun9 and sum of single effects (SuSiE)72 to identify causal variants within HF genetic susceptibility loci. We used precomputed, functionally informed prior causal probabilities of 19 million genetic variants based on a meta-analysis of 15 UKB traits9 and genome-wide association estimates from the current analysis to calculate prior causal probability proportional to per-SNP heritability of phenotype under analysis. The resulting estimates were used to calculate per-SNP PIP and to construct 95% credible sets of likely causal variants using the sum-of-single-effects fine-mapping model implemented in SuSiE, assuming at most five causal variants per locus. To minimize bias, we used effect estimates from the European ancestry meta-analysis subset and reference genotypes from 10,000 random UKB European participants. To assess the functional consequences of fine-mapped variants, we extracted variant-level information on the nearest gene(s), genic functions and CADD10 Phred score from ANNOVAR73 and OpenTargets Genetics14.
Prioritization of effector genes
To identify effector genes for HF, we implemented a two-step prioritization approach. In step 1, we identified candidate genes using a combination of the following three predictors: (1) PoPS15, (2) V2G—OpenTargets V2G score14,74 and (3) TWAS—multitissue TWAS using S-MulTiXcan16. Genes with the highest PoPS, highest V2G score or lowest TWAS P value within a locus were considered as candidate effector genes for HF.
In step 2, we further prioritized these genes using the following three Boolean (true/false) classifiers: (1) ABC—overlap with enhancer as predicted by ABC17,18 score > 0.02, (2) Mendel—association with enriched Mendelian disease term estimated using MendelVar20 and (3) Coloc—sharing of causal variants (colocalization) between gene expression in relevant tissue with at least one HF phenotype under study at posterior probability >0.8 estimated using R coloc package19 using gene expression data from Genotype-Tissue Expression project (GTEx, v.8)75.
In addition, we derived an overall predictor score based on weighted average of PoPS, highest V2G and −log10(P) MulTiXcan values with 2:2:1 weight ratio, scaled to 0–100 value using a quantile transformation with uniform output distribution as implemented in the Python scikit-learn76 library. Finally, for each locus, we ranked genes based on total classifier score (sum of true values) and the overall predictor score. Genes that are top-ranked or have a total classifier score ≥2 were prioritized as effector genes for HF. More details are provided in Supplementary Methods.
Tissue- and cell-type enrichment
To identify tissues and cell types involved in HF pathology, we estimated heritability enrichment of specifically expressed gene sets across 206 tissues and cell types77,78 within 12 organ/system categories (Supplementary Table 24). For each set, we performed a one-sided enrichment test for per-SNP heritability attributed to the given set, conditional on the set that includes all genes and a baseline model consisting of 52 genomic annotations using LDSC applied to specifically expressed genes (LDSC-SEG)79. For this analysis, we used the European ancestry meta-analysis subset with precomputed LD score weights derived from the 1000 Genomes European reference panel80.
Based on the observation that heart tissues are the major contributors to SNP heritability across HF phenotypes, we extended this analysis to 15 cell types derived from single-nucleus RNA-sequencing of 185,185 nuclei from 16 nonfailing human heart donors43 (Supplementary Table 25). Identification of specifically expressed genes across cardiac cell types is described in Supplementary Methods.
As sensitivity analysis, we further performed a gene-property enrichment analysis using MAGMA81,82. We first assigned SNPs to 20,260 coding genes from Ensembl v.92 with a 1-kb window and computed gene-level association statistics using the SNP-wise mean model. We then computed average gene expression per cell type and provided this as gene covariates to MAGMA to calculate a one-sided P value for enrichment of a given cell type conditioned on the average expression of other cell types.
Cellular expression of heart failure effector genes
To assess changes in the expression of HF effector genes in failing heart, we performed single-nucleus differential gene expression analysis of candidate effector genes within heart failure loci by comparing transcription levels in cell nuclei from nonfailing heart described above with cell nuclei from 28 failing heart donors diagnosed with end-stage DCM (ndonors = 12 and nnuclei = 142,490) or HCM (ndonors = 16 and nnuclei = 202,307) from Myocardial Applied Genetic Network34,83. Samples are processed using CellBender84 and Cell Ranger85. Differential gene expression was calculated using the limma–voom86,87 model adjusting for age and sex (see more details in Supplementary Methods).
Differential expression was defined using the following criteria: two-sided adjusted P < 0.01 (following Benjamini–Hochberg adjustment for multiple testing), sign concordance between estimates derived from CellBender and Cell Ranger quantifications and no background contamination in CellBender. To test whether differentially expressed genes in each cell type are overrepresented among prioritized HF effector genes, we performed a one-sided Fisher’s exact test implemented in R, using all genes tested in the given cell type as the background set.
Pathway enrichment
To identify biological pathways relevant to HF, we tested for overrepresentation of biological terms in HF effector gene set using g:Profiler88. To account for etiological differences and uncertainty in gene prioritization, we tested six gene sets constructed from combinations of three phenotypic classifications of loci (HF, ni-HF and other HF) and two gene prioritization categories (candidate and prioritized effector genes). For each gene set, we performed an unordered enrichment analysis with a one-sided Fisher’s test for biological terms in the Kyoto Encyclopedia of Genes and Genomes89, Reactome90, Wiki Pathways91 and Gene Ontology (GO)92,93, excluding terms with more than 2,000 genes. Multiple-testing adjustment was performed by multiplying the enrichment P values by the ratio of the approximate threshold given the number of genes in the set and the initial experiment-wide type-I error rate of 0.05, using the g:SCS algorithm in g:Profiler. Driver GO terms were identified using the g:Profiler clustering algorithm to represent groups of more specific terms.
Phenome-wide association of heart failure sentinel variants
To characterize pleiotropic effects of HF genetic loci across human diseases, we performed a phenome-wide association study (PheWAS) in 408,480 UKB European participants. We tested the associations between sentinel variants at 66 HF loci with 294 disease phenotypes derived using a curated phenotype definition and category39. For each phenotype, participants were classified as cases if at least one event was recorded in linked hospital admission, death certificate, primary care visit, self-reported cancer diagnosis, noncancer diagnosis or procedure history. For each phenotype–sentinel variant pair with at least 100 cases and 100 noncases (controls), we ran a case–control additive genetic association test in PLINK2 with a logistic regression model adjusted for sex, genotyping array and first ten genetic PCs. For presentation, we report effects per additional HF risk-increasing allele.
Pleiotropy network analysis
Using the PheWAS results, we performed a network analysis to evaluate the connections between HF genetic loci based on their pleiotropic effects across human diseases. First, we constructed a network dataset with nodes representing loci or phenotypes and edges representing unidirectional associations of the locus sentinel variant with a phenotype. Eigenvector centrality measure was calculated to estimate node influence within the network (Supplementary Fig. 13 and Supplementary Table 26). Furthermore, we performed a community detection analysis using a walktrap algorithm94 weighted by the absolute z score of the edges to identify distinct etiologic clusters underlying HF. The network analysis and visualization were performed using igraph, tidygraph and ggraph package in R.
Locus-specific pleiotropy assessment
We investigated the local pleiotropic effects of sentinel variants in 66 HF genetic loci on 24 related GWAS traits (Supplementary Table 27). Estimates for sentinel variants that were not reported in a target GWAS were substituted with those of a proxy variant with the highest LD r2 (requiring at least LD r2 > 0.8), identified using PLINK95 with LD reference panel derived from 10,000 random samples of UKB European participants. For presentation (Extended Data Fig. 10), effect estimates were converted to z scores derived from the regression coefficients (β) divided by their s.e., with direction of effect aligned to reflect the additive effect of HF risk-increasing allele. Loci and GWAS traits were ordered using hierarchical agglomerative clustering (implemented with the hclust function in R 4.2.0.) with average linkage method based on the Euclidean distance measure derived from absolute z scores.
Cross-trait colocalization
To test whether HFe and related traits share causal genetic variants, we performed pairwise cross-trait colocalization analysis using the R coloc package19. Analyses were performed across pairwise combinations of credible sets corresponding to the tested traits identified using SuSiE regression72. We used the default marginal prior probabilities for association with trait 1 (p1) and trait 2 (p2) of 10−4, as well as a prior probability of joint causal association (p12) of 10−5. A posterior probability of common causal variants (PPcoloc H4) > 0.8 in at least one pair of credible sets was considered as evidence of colocalization. The colocalization analysis was performed for each identified HF locus using an overlapping set of variants available in both GWAS of heart failure and trait under analysis situated within the locus. Genetic association estimates for HF were extracted from GWAS of the HFall phenotype for loci with PHF < 5 × 10−8 (locus 1–56) or otherwise from GWAS of ni-HF phenotypes for subtype-specific loci—ni-HF for locus 57–61, ni-HFrEF for locus 62–64 and ni-HFpEF for locus 65–66. Genetic association estimates for tested traits are extracted from the source GWAS (Supplementary Table 27).
Genetic correlation and MR
We tested genetic correlation across pairs of four HF phenotypes under analysis and 24 related GWAS traits using bivariate LDSC regression96. We further estimated the causal effects of each tested trait on four HF phenotypes under analysis using two-sample MR as implemented in the MendelianRandomization package97. MR instruments for the exposure traits were selected from overlapping genetic variants available in both GWAS of exposure and outcome traits using an LD-based clumping algorithm with a P value threshold of 5 × 10−8 and LD r2 threshold of 0.05 implemented in PLINK95. For each exposure trait, we estimated the causal association with each HF phenotype using the IVW MR estimator and performed sensitivity analyses with MR–Egger and WME98,99. An adjusted P < 0.01 following the Benjamini–Hochberg correction in IVW analysis with the consistent direction of effect in sensitivity analyses was considered supportive evidence of causality. Reference genotype from 10,000 random UKB European participants was used to perform these analyses, given the majority of European ancestry in the tested GWAS traits. Details of the tested traits, source GWAS and model parameters are provided in Supplementary Table 27.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
GWAS summary statistics are available to download from the Cardiovascular Disease Knowledge Portal for the multi-ancestry meta-analysis (https://api.kpndataregistry.org/api/d/6Ls5Wu) and for the European ancestry subset meta-analysis (https://api.kpndataregistry.org/api/d/6eJqWn). Study-level summary statistics can be requested by contacting individual studies, as described in Supplementary Table 21. Single-cell data for cell-type enrichment and differential gene expression analyses are available at https://singlecell.broadinstitute.org/single_cell/study/SCP1303/. Data from UKB can be requested from the UKB Access Management System. GTEx data can be accessed at https://gtexportal.org. A summary of regional genetic associations, gene prioritization scores, cross-trait association and study-level estimates across identified GWAS loci is provided in Supplementary Data 1 and online at https://hermes2-supp-note.netlify.app/locus_desc.html. A summary of study-level GWAS summary statistics quality control is provided in Supplementary Data 2 and online at https://hermes2-supp-note.netlify.app/hf_subtypes_qc.html.
Code availability
GWAS meta-analysis was performed centrally using METAL (v2020-05-05). SNP-based heritability was calculated using LDAK SumHer software (v5.2). Genetic correlation between traits was performed using LDSC (v1.0.1). Conditionally independent variants were identified using GCTA software (v1.92.4). Causal variant fine mapping was performed using PolyFun (v2020-11-14) and SuSiE (v0.11.92). Polygenic prediction score (PoPS, v0.1), OpenTargets Variant2Gene (v1.1), S-MultiXcan (v0.7.3), MendelVar (v2023-12-05), coloc R package (v5.2.3), ABC-Max (v2021-04-08) and Python scikit-learn (v1.5.1) were used for effector gene prioritization. Pathway enrichment of prioritized genes was performed using g:Profiler (v0.2.3) package in R. Tissue-based and cell-type enrichment was performed using S-LDSC (v1.0.1) and MAGMA (v1.10). PGSs were derived using the LDpred2-auto model implemented in bigsnpr R package (v1.12.16), with individual PGS scores generated using PLINK (v1.9). PheWAS of sentinel variants was performed using PLINK (v2.0). Network analysis was performed using tidygraph (v1.2.3) and visualized using ggraph (v2.0.0) in R. Custom code to perform the main analyses is available at Zenodo100 (https://doi.org/10.5281/zenodo.11204854).
References
Shah, S. et al. Genome-wide association and Mendelian randomisation analysis provide insights into the pathogenesis of heart failure. Nat. Commun. 11, 163 (2020).
Levin, M. G. et al. Genome-wide association and multi-trait analyses characterize the common genetic architecture of heart failure. Nat. Commun. 13, 6914 (2022).
Joseph, J. et al. Genetic architecture of heart failure with preserved versus reduced ejection fraction. Nat. Commun. 13, 7753 (2022).
Roger, V. L. Epidemiology of heart failure. Circ. Res. 113, 646–659 (2013).
Seferović, P. M. et al. Heart failure in cardiomyopathies: a position paper from the Heart Failure Association of the European Society of Cardiology. Eur. J. Heart Fail. 21, 553–576 (2019).
Arvanitis, M. et al. Genome-wide association and multi-omic analyses reveal ACTN2 as a gene linked to heart failure. Nat. Commun. 11, 1122 (2020).
Zhou, W. et al. Global biobank meta-analysis initiative: powering genetic discovery across human disease. Cell Genom. 2, 100192 (2022).
Garnier, S. et al. Genome-wide association analysis in dilated cardiomyopathy reveals two new players in systolic heart failure on chromosomes 3p25.1 and 22q11.23. Eur. Heart J. 42, 2000–2011 (2021).
Weissbrod, O. et al. Functionally informed fine-mapping and polygenic localization of complex trait heritability. Nat. Genet. 52, 1355–1363 (2020).
Rentzsch, P., Witten, D., Cooper, G. M., Shendure, J. & Kircher, M. CADD: predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res. 47, D886–D894 (2019).
Ortiz-Genga, M. F. et al. Truncating FLNC mutations are associated with high-risk dilated and arrhythmogenic cardiomyopathies. J. Am. Coll. Cardiol. 68, 2440–2451 (2016).
Domínguez, F. et al. Dilated cardiomyopathy due to BLC2-associated athanogene 3 (BAG3) mutations. J. Am. Coll. Cardiol. 72, 2471–2481 (2018).
Wu, T. et al. HSPB7 is indispensable for heart development by modulating actin filament assembly. Proc. Natl Acad. Sci. USA 114, 11956–11961 (2017).
Ghoussaini, M. et al. Open Targets Genetics: systematic identification of trait-associated genes using large-scale genetics and functional genomics. Nucleic Acids Res. 49, D1311–D1320 (2021).
Weeks, E. M. et al. Leveraging polygenic enrichments of gene features to predict genes underlying complex traits and diseases. Nat. Genet. 55, 1267–1276 (2023).
Barbeira, A. N. et al. Integrating predicted transcriptome from multiple tissues improves association detection. PLoS Genet. 15, e1007889 (2019).
Fulco, C. P. et al. Activity-by-contact model of enhancer–promoter regulation from thousands of CRISPR perturbations. Nat. Genet. 51, 1664–1669 (2019).
Nasser, J. et al. Genome-wide enhancer maps link risk variants to disease genes. Nature 593, 238–243 (2021).
Wallace, C. A more accurate method for colocalisation analysis allowing for multiple causal variants. PLoS Genet. 17, e1009440 (2021).
Sobczyk, M. K., Gaunt, T. R. & Paternoster, L. MendelVar: gene prioritization at GWAS loci using phenotypic enrichment of Mendelian disease genes. Bioinformatics 37, 1–8 (2021).
Ling, H. et al. Requirement for Ca2+/calmodulin-dependent kinase II in the transition from pressure overload-induced cardiac hypertrophy to heart failure in mice. J. Clin. Invest. 119, 1230–1240 (2009).
Górska, A. A. et al. Muscle-specific Cand2 is translationally upregulated by mTORC1 and promotes adverse cardiac remodeling. EMBO Rep. 22, e52170 (2021).
Chiu, C. et al. Mutations in α-actinin-2 cause hypertrophic cardiomyopathy: a genome-wide analysis. J. Am. Coll. Cardiol. 55, 1127–1135 (2010).
Meurs, K. M. et al. Association of dilated cardiomyopathy with the striatin mutation genotype in boxer dogs. J. Vet. Intern. Med. 27, 1437–1440 (2013).
Wang, J. et al. Pitx2 prevents susceptibility to atrial arrhythmias by inhibiting left-sided pacemaker specification. Proc. Natl Acad. Sci. USA 107, 9753–9758 (2010).
Sotoodehnia, N. et al. Common variants in 22 loci are associated with QRS duration and cardiac ventricular conduction. Nat. Genet. 42, 1068–1076 (2010).
Pfeufer, A. et al. Common variants at ten loci modulate the QT interval duration in the QTSCD study. Nat. Genet. 41, 407–414 (2009).
Sanan, D. A. et al. Low density lipoprotein receptor-negative mice expressing human apolipoprotein B-100 develop complex atherosclerotic lesions on a chow diet: no accentuation by apolipoprotein(a). Proc. Natl Acad. Sci. USA 95, 4544–4549 (1998).
Steinberg, D., Parthasarathy, S., Carew, T. E., Khoo, J. C. & Witztum, J. L. Beyond cholesterol. Modifications of low-density lipoprotein that increase its atherogenicity. N. Engl. J. Med. 320, 915–924 (1989).
Berge, K. E. et al. Accumulation of dietary cholesterol in sitosterolemia caused by mutations in adjacent ABC transporters. Science 290, 1771–1775 (2000).
Lee, M. H. et al. Identification of a gene, ABCG5, important in the regulation of dietary cholesterol absorption. Nat. Genet. 27, 79–83 (2001).
Kronenberg, F. et al. Lipoprotein(a) in atherosclerotic cardiovascular disease and aortic stenosis: a European Atherosclerosis Society consensus statement. Eur. Heart J. 43, 3925–3946 (2022).
Zhang, L. et al. Insulin-like growth factor-binding protein-7 (IGFBP7) links senescence to heart failure. Nat. Cardiovasc. Res. 1, 1195–1214 (2022).
Chaffin, M. et al. Single-nucleus profiling of human dilated and hypertrophic cardiomyopathy. Nature 608, 174–180 (2022).
Oakley, R. H. et al. Cardiomyocyte glucocorticoid and mineralocorticoid receptors directly and antagonistically regulate heart disease in mice. Sci. Signal. 12, eaau9685 (2019).
Rossignol, P. et al. Eplerenone survival benefits in heart failure patients post-myocardial infarction are independent from its diuretic and potassium-sparing effects. Insights from an EPHESUS (Eplerenone Post-Acute Myocardial Infarction Heart Failure Efficacy and Survival Study) substudy. J. Am. Coll. Cardiol. 58, 1958–1966 (2011).
Li, Y. et al. KLF12 promotes the proliferation of breast cancer cells by reducing the transcription of p21 in a p53-dependent and p53-independent manner. Cell Death Dis. 14, 313 (2023).
Cheedipudi, S. M. et al. Genomic reorganization of Lamin-associated domains in cardiac myocytes is associated with differential gene expression and DNA methylation in human dilated cardiomyopathy. Circ. Res. 124, 1198–1213 (2019).
Kuan, V. et al. A chronological map of 308 physical and mental health conditions from 4 million individuals in the English National Health Service. Lancet Digit. Health 1, e63–e77 (2019).
Zhang, J. et al. The role of aldehyde dehydrogenase 2 in cardiovascular disease. Nat. Rev. Cardiol. 20, 495–509 (2023).
Võsa, U. et al. Large-scale cis- and trans-eQTL analyses identify thousands of genetic loci and polygenic scores that regulate blood gene expression. Nat. Genet. 53, 1300–1310 (2021).
Jia, M. et al. Deletion of BACH1 attenuates atherosclerosis by reducing endothelial inflammation. Circ. Res. 130, 1038–1055 (2022).
Xin, M. et al. Hippo pathway effector Yap promotes cardiac regeneration. Proc. Natl Acad. Sci. USA 110, 13839–13844 (2013).
Blake, J. A. et al. Mouse Genome Database (MGD): knowledgebase for mouse-human comparative biology. Nucleic Acids Res. 49, D981–D987 (2021).
Ochoa, D. et al. The next-generation Open Targets Platform: reimagined, redesigned, rebuilt. Nucleic Acids Res. 51, D1353–D1359 (2023).
Mordi, I. R. et al. Type 2 diabetes, metabolic traits, and risk of heart failure: a Mendelian randomization study. Diabetes Care 44, 1699–1705 (2021).
Charoen, P. et al. Mendelian randomisation study of the influence of eGFR on coronary heart disease. Sci. Rep. 6, 28514 (2016).
Morris, A. P. et al. Trans-ethnic kidney function association study reveals putative causal genes and effects on kidney-specific disease aetiologies. Nat. Commun. 10, 29 (2019).
Ware, J. S. et al. Genetic etiology for alcohol-induced cardiac toxicity. J. Am. Coll. Cardiol. 71, 2293–2302 (2018).
Wehner, G. J. et al. Routinely reported ejection fraction and mortality in clinical practice: where does the nadir of risk lie? Eur. Heart J. 41, 1249–1257 (2020).
Tadros, R. et al. Shared genetic pathways contribute to risk of hypertrophic and dilated cardiomyopathies with opposite directions of effect. Nat. Genet. 53, 128–134 (2021).
Pan, J., Ng, S. M., Neubauer, S. & Rider, O. J. Phenotyping heart failure by cardiac magnetic resonance imaging of cardiac macro- and microscopic structure: state of the art review. Eur. Heart J. Cardiovasc. Imaging 24, 1302–1317 (2023).
Arbelo, E. et al. 2023 ESC guidelines for the management of cardiomyopathies. Eur. Heart J. 44, 3503–3626 (2023).
McDonagh, T. A. et al. 2021 ESC guidelines for the diagnosis and treatment of acute and chronic heart failure. Eur. Heart J. 42, 3599–3726 (2021).
Bracun, V. et al. Insulin-like growth factor binding protein 7 (IGFBP7), a link between heart failure and senescence. ESC Heart Fail. 9, 4167–4176 (2022).
Di Micco, R., Krizhanovsky, V., Baker, D. & d’Adda di Fagagna, F. Cellular senescence in ageing: from mechanisms to therapeutic opportunities. Nat. Rev. Mol. Cell Biol. 22, 75–95 (2021).
Shah, S. J. et al. Research priorities for heart failure with preserved ejection fraction: National Heart, Lung, and Blood Institute working group summary. Circulation 141, 1001–1026 (2020).
Vaduganathan, M. et al. SGLT2 inhibitors in patients with heart failure: a comprehensive meta-analysis of five randomised controlled trials. Lancet 400, 757–767 (2022).
Maron, D. J. et al. Initial invasive or conservative strategy for stable coronary disease. N. Engl. J. Med. 382, 1395–1407 (2020).
Carson, P. et al. The STICH trial (Surgical Treatment for Ischemic Heart Failure): mode-of-death results. JACC Heart Fail. 1, 400–408 (2013).
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Yang, J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 44, 369–375 (2012).
Mägi, R. et al. Trans-ethnic meta-regression of genome-wide association studies accounting for ancestry increases power for discovery and improves fine-mapping resolution. Hum. Mol. Genet. 26, 3639–3650 (2017).
Schwarzer, G., Carpenter, J. R. & Rücker, G. Meta-Analysis With R (Springer International Publishing, 2015).
Harrer, M., Cuijpers, P., Furukawa, T. A. & Ebert, D. D. Doing Meta-Analysis With R: A Hands-On Guide (Chapman & Hall/CRC Press, 2021).
Bulik-Sullivan, B. K. et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Storey, J. D., Taylor, J. E. & Siegmund, D. Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach. J. R. Stat. Soc. Series B Stat. Methodol. 66, 187–205 (2004).
Speed, D. & Balding, D. J. SumHer better estimates the SNP heritability of complex traits from summary statistics. Nat. Genet. 51, 277–284 (2019).
Speed, D., Holmes, J. & Balding, D. J. Evaluating and improving heritability models using summary statistics. Nat. Genet. 52, 458–462 (2020).
Grotzinger, A. D., Fuente, Jdela, Privé, F., Nivard, M. G. & Tucker-Drob, E. M. Pervasive downward bias in estimates of liability-scale heritability in genome-wide association study meta-analysis: a simple solution. Biol. Psychiatry 93, 29–36 (2023).
Privé, F., Arbel, J. & Vilhjálmsson, B. J. LDpred2: better, faster, stronger. Bioinformatics 36, 5424–5431 (2020).
Wang, G., Sarkar, A., Carbonetto, P. & Stephens, M. A simple new approach to variable selection in regression, with application to genetic fine mapping. J. R. Stat. Soc. Series B Stat. Methodol. 82, 1273–1300 (2020).
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164 (2010).
Mountjoy, E. et al. An open approach to systematically prioritize causal variants and genes at all published human GWAS trait-associated loci. Nat. Genet. 53, 1527–1533 (2021).
The GTEx Consortium. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369, 1318–1330 (2020).
Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Pers, T. H. et al. Biological interpretation of genome-wide association studies using predicted gene functions. Nat. Commun. 6, 5890 (2015).
Fehrmann, R. S. N. et al. Gene expression analysis identifies global gene dosage sensitivity in cancer. Nat. Genet. 47, 115–125 (2015).
Finucane, H. K. et al. Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types. Nat. Genet. 50, 621–629 (2018).
Finucane, H. K. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat. Genet. 47, 1228–1235 (2015).
De Leeuw, C. A., Mooij, J. M., Heskes, T. & Posthuma, D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput. Biol. 11, e1004219 (2015).
Watanabe, K., Umićević Mirkov, M., de Leeuw, C. A., van den Heuvel, M. P. & Posthuma, D. Genetic mapping of cell type specificity for complex traits. Nat. Commun. 10, 3222 (2019).
Tucker, N. R. et al. Transcriptional and cellular diversity of the human heart. Circulation 142, 466–482 (2020).
Fleming, S. J. et al. Unsupervised removal of systematic background noise from droplet-based single-cell experiments using CellBender. Nat. Methods 20, 1323–1335 (2023).
Zheng, G. X. Y. et al. Massively parallel digital transcriptional profiling of single cells. Nat. Commun. 8, 14049 (2017).
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47 (2015).
Law, C. W., Chen, Y., Shi, W. & Smyth, G. K. voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 15, R29 (2014).
Raudvere, U. et al. g:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update). Nucleic Acids Res. 47, W191–W198 (2019).
Kanehisa, M., Sato, Y., Kawashima, M., Furumichi, M. & Tanabe, M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 44, D457–D462 (2016).
Gillespie, M. et al. The reactome pathway knowledgebase 2022. Nucleic Acids Res. 50, D687–D692 (2022).
Slenter, D. N. et al. WikiPathways: a multifaceted pathway database bridging metabolomics to other omics research. Nucleic Acids Res. 46, D661–D667 (2018).
Ashburner, M. et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 25, 25–29 (2000).
Gene Ontology Consortium. The Gene Ontology resource: enriching a GOld mine. Nucleic Acids Res. 49, D325–D334 (2021).
Yolum, P., Güngör, T., Gürgen, F. & Özturan, C. (eds). Computing communities in large networks using random walks. Proc. Computer and Information Sciences—ISCIS 2005 pp. 284–293 (Springer, 2005).
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).
Yavorska, O. O. & Burgess, S. MendelianRandomization: an R package for performing Mendelian randomization analyses using summarized data. Int. J. Epidemiol. 46, 1734–1739 (2017).
Bowden, J., Davey Smith, G. & Burgess, S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int. J. Epidemiol. 44, 512–525 (2015).
Burgess, S., Dudbridge, F. & Thompson, S. G. Combining information on multiple instrumental variables in Mendelian randomization: comparison of allele score and summarized data methods. Stat. Med. 35, 1880–1906 (2016).
Henry, A. Ihi-Comp-Med/Hermes2-Gwas: manuscript release. Zenodo https://doi.org/10.5281/ZENODO.11204854 (2024).
Acknowledgements
A. Henry was supported by the British Heart Foundation Cardiovascular Biomedicine PhD studentship (FS/18/65/34186). R.T.L. and A. Henry were partly supported by a Pfizer Innovative Targets Exploration Network Grant. The project was additionally supported by the BigData@Heart Consortium, funded by the Innovative Medicines Initiative-2 Joint Undertaking (grant agreement 116074), the UCL British Heart Foundation Accelerator (AA/18/6/34223), the National Institute for Health Research University College London Hospitals Biomedical Research Centre (NIHR203328) and Health Data Research UK (MR/S003754/1). Analyses using the UKB resource presented in this work were conducted under applications 9922, 15422, 12113, 12505 and 47602. Additional study-level acknowledgements are provided in Supplementary Table 21. We thank all research participants included in this study. The views expressed in this work are those of the authors and not necessarily those of the funders.
Author information
Authors and Affiliations
Consortia
Contributions
R.S.V., J.G.S., H. Holm, Sonia Shah, P.T.E., A.D.H., Q.W. and R.T.L. conceptualized the study. A. Henry, M.D.C., D.S. and Sonia Shah developed the methodology. A. Henry, H.I., D.M. and R.T.L. managed project administration. A. Henry and X.M. carried out formal analysis. A. Henry, C.F., D.S., S.C. and A.F.S. conducted software analysis. A. Henry and H.I. curated the data. A. Henry and R.T.L. wrote the original draft of the manuscript. A. Henry handled visualization. C.F., M.D.C., S.D., J. Gratton, S.C., J.G.S., H. Holm, P.T.E. and A.D.H. arranged the resources. M.D.C., I.B., C.R. and D.M. conducted the investigation. F.W.A., T.P.C., M.-P.D., M.E.D., C.C.L., N.J.S., R.S.V., J.G.S., H. Holm, Sonia Shah, P.T.E., A.D.H., Q.W. and R.T.L. provided supervision. R.T.L. secured funding. F.W.A., T.P.C., M.-P.D., M.E.D., C.C.L., N.J.S., Svati Shah, R.S.V., J.G.S., H. Holm, P.T.E., A.D.H., Q.W. and R.T.L. are members of the HERMES Executive Committee who provided additional supervision of the work. J.S.W., S.L.Z., A.M., E.A., L.A., C.A., K.G.A., J.Ä., G.A., A.A.R., J.D.B., T.M.B., K.J.B., M.L.B., H.L.B., E.B., J.B., M.R.B., S.B., M.T.B., L.B., H.B., D.J.C., D.I.C., X.C., J.P.C., T.C., S.d.D., A.D., G.E.D., A.S.D., M.D., J.D., S.C.D., G.E., C.E., T.E., E.H.F.-E., S.B.F, S.F., I.F., M.G., S. Ghasemi, J. Ghouse, V.G., F.G., J.S.G., S. Gross, D.F.G., H.G., R.G., S.H., C.M.H., Ä.K.H., A. Helgadottir, H. Hemingway, H. Hillege, C.L.H., B.A.J., J.W.J., I.K., R.K., M.K., J.R.K., M.E.K., L.K., A.K., K.K., Y.-P.L., D.L., C.L., H.L., L.L., C.M.L., P.P.L., B.L., B.D.L., J.L., S.A.L., P.M., K.B.M., N.A.M., H.M., W.M., O.M., I.R.M., M.P.M., A.P.M., A.C.M., L.M., M.W.N., C.P.N., A.N., T.N., R.N., C.N., M.L.O., S.R.O., A.T.O, C.N.A.P., G.P., O.B.P., M. Perola, M. Pigeyre, B.M.P., K.M.R., P.M.R., S.P.R.R., J.I.R., C.T.R., M.S.S., N. Sallah, V.S., N. Sattar, A.A.S., A.S., D.T.S., N.L.S., E.S., S. Srinivasan, K.S., G.S., P.S., M.-L.T., J.-C.T., M.T.-L., A.T., G.T., U.T., C.T.-P., V.T., S.T., A.G.U., H.U., P.v.d.H., D.v.H., J.v.S., M.v.V., A.V., M.V., N.V., C.R.V., U.V., A.A.V., L.W., Y.W., P.E.W., K.L.W., L.K.W., Y.Y., B.Y., F.Z. and C.Z. contributed to data generation, funding acquisition, formal analysis and supervision at individual study level. All authors reviewed and approved the final version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
R.T.L. received institutional research support from Pfizer. R.T.L. has served as a paid consultant for Health Lumen and FITFILE. J.S.W. has acted as a consultant for MyoKardia, Pfizer, Foresite Labs and Health Lumen and received institutional support from Bristol Myers Squibb and Pfizer. S.d.D. was supported through grants from AstraZeneca and Roche Molecular Science/DalCor. J.R.K. declares stock ownership in AbbVie, Abbott, Bristol Myers Squibb, Johnson & Johnson, Medtronic, Merck and Pfizer. N.A.M. received speaking honoraria from Amgen and is involved in clinical trials with Ionis, Amgen, Pfizer and Novartis. B.M.P. serves on the Steering Committee of the Yale Open Data Access Project funded by Johnson & Johnson. C.T.R. received honoraria for scientific advisory boards and consulting from Anthos, Bayer, Bristol Myers Squibb, Daiichi Sankyo, Janssen and Pfizer and received institutional research grants from Anthos, AstraZeneca, Daiichi Sankyo, Janssen and Novartis. M.S.S. received substantial research grant support from Abbott Laboratories, Amgen, AstraZeneca, Bayer, Critical Diagnostics, Daiichi Sankyo, Eisai, Genzyme, Gilead, GlaxoSmithKline, Intarcia, Janssen Research and Development, The Medicines Company, MedImmune, Merck, Novartis, Poxel, Pfizer, Quark Pharmaceuticals, Roche Diagnostics and Takeda and has received consulting fees from Alnylam, AstraZeneca, Bristol Myers Squibb, CVS and Amgen. A.A.V. received consultancy fees and/or research support from AnaCardia, AstraZeneca, Bayer, BMS, Boehringer Ingelheim, Corteria, Cytokinetics, Eli Lilly, Moderna, Novartis, Novo Nordisk and Roche Diagnostics. M.-P.D. declares holding equity in Dalcor Pharmaceuticals, unrelated to this work. Members of the TIMI Study Group (ENGAGE, FOURIER, PEGASUS, SAVOR and SOLID) have received institutional research grant support through Brigham and Women’s Hospital from Abbott, Amgen, Anthos Therapeutics, ARCA Biopharma, AstraZeneca, Bayer HealthCare Pharmaceuticals, Daiichi Sankyo, Eisai, Intarcia, Ionis Pharmaceuticals, Janssen Research and Development, MedImmune, Merck, Novartis, Pfizer, Quark Pharmaceuticals, Regeneron Pharmaceuticals, Roche, Siemens Healthcare Diagnostics, Softcell Medical Limited, The Medicines Company, Zora Biosciences, Caremark, Dyrnamix, Esperon, IFM Pharmaceuticals and MyoKardia. The authors who are affiliated with deCODE genetics/Amgen and the authors affiliated with Pfizer declare competing financial interests as employees. The remaining authors declare no competing interests.
Peer review
Peer review information
Nature Genetics thanks Shoa Clarke and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Study summary. Each panel describes key analyses that are undertaken in the study. The discovery of genomic loci panel describe four categories of genomic loci identified in this study based on patterns of associations across the four phenotypes, where nonwhite colour in the heat map represents significant genetic association at a multiple-testing-corrected threshold of two-sided P < 0.05 / 66.
GWAS, genome-wide association study; HF, heart failure; HFall, overall heart failure; ni-HF, non-ischemic heart failure; ni-HFrEF, non-ischemic heart failure with reduced ejection fraction; ni-HFpEF, non-ischemic heart failure with preserved ejection fraction.
Extended Data Fig. 2 Allelic architecture of genetic variants associated with the overall heart failure phenotype.
The presented data points are conditionally independent variants that passed two-sided P < 5 × 10−8 (red colored points) and two-sided P less than equivalent type-I error rate (α) at a false discovery rate <1% as estimated using q value package in R (gray colored points). Green lines represent local polynomial regression with locally estimated scatterplot smoothing (LOESS), fitted separately for variants with risk ratio >1 and risk ratio <1 per additional minor allele. Green error bands represent the corresponding 95% confidence intervals of the regression lines.
Extended Data Fig. 3 SNP heritability across heart failure phenotypes.
The presented estimates represent SNP heritability on a liability scale (h2SNP-liability) estimated using 66-parameter BLD-LDAK and LDAK-thin heritability models using GWAS meta-analysis of European ancestry cohorts. The BLD-LDAK heritability estimate for non-ischemic HFpEF was unavailable due to limited sample size. Point estimates and confidence intervals for h2SNP-liability are represented by the bar height and error bars, with numerical estimates in percentage displayed on the top of the bars.
Extended Data Fig. 4 Association between polygenic score and risk of heart failure in UK Biobank.
a, Distribution of polygenic score for the overall heart failure phenotype (PGSHF) among 13,824 individuals with at least one documented event in linked electronic health record (case) and 332,843 individuals without any documented event (control) in UK Biobank. b, Odds ratios (OR) and 95% confidence intervals (represented as bullet points and error bars) for overall heart failure across deciles of PGSHF in UK Biobank with the fifth decile group as reference. Sample size and the numerical estimate for each decile are presented in Supplementary Table 8.
Extended Data Fig. 5 Functional consequences of fine-mapped genetic variants across genetic susceptibility loci for heart failure.
a, Predicted deleteriousness of 547 fine-mapped variants within 95% credible sets across 66 independent genomic loci for heart failure as measured by Combined Annotation-Dependent Depletion (CADD) Phred score. Nearest genes of fine-mapped variants with CADD Phred score >20 and fine-mapped exonic variants with CADD Phred score >10 are labeled. b, Histogram of CADD Phred score distribution among fine-mapped variants stratified by function. c,d, Locus-wide (c) and genome-wide (d) counts of fine-mapped variants stratified by variant function.
Extended Data Fig. 6 Enriched pathways within the effector heart failure gene set.
The presented terms are derived from Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), Human Phenotype Ontology (HP), Reactome (REAC) and Wiki Pathways (WP) at a Penrichment < 0.05. Up to top 5 enriched terms per source in each heart failure phenotype gene set are labeled. Penrichment is calculated from enrichment analysis of candidate and prioritized gene sets with one-sided Fisher’s test, adjusted for multiple testing by multiplying the original P values to the ratio of the approximate threshold given the number of genes in the set and the initial experiment-wide type-I error rate of 0.05 as described in Methods. For display, P values are presented in −log10 scale, and terms that are enriched in both candidate and prioritized gene sets are collapsed by presenting the median values.
Extended Data Fig. 7 Differential gene expression of candidate heart failure effector genes across cardiac cell types from failing and non-failing hearts.
a, Differential gene expression profile for 51 candidate heart failure GWAS genes that are differentially expressed (DE) in at least one cardiac cell type, defined as survived multiple-testing correction at adjusted P value < 0.01 and showed a concordant differential expression sign in both CellBender and Cell Ranger quantifications and had no background contamination as estimated in CellBender (Methods). Red/blue colors represent higher/lower expression in cells from failing heart (compared to non-failing heart); gray color represents undetectable expression; yellow dots represent differential expression as defined above. b, Volcano plots showing differential gene expression between cells (fibroblasts and cardiomyocytes) from failing and non-failing heart samples. Candidate and prioritized heart failure (HF) genes (the latter is prefixed with *) that are differentially expressed (DE) are highlighted. The presented P values are derived from two-sided statistical tests and adjusted for multiple testing using the Benjamini–Hochberg procedure.
Extended Data Fig. 8 Phenome-wide association of heart failure sentinel genetic variants.
Association estimates are presented in absolute z score (vertical axis) for 294 disease phenotypes and 66 sentinel genetic variants across heart failure (HF) susceptibility loci (horizontal axis). Estimates which survived multiple-testing correction at false discovery rate (FDR) < 1% are highlighted, with top associated phenotype (largest absolute z score) per locus labeled. Loci are labeled by the prioritized gene and grayed out if no FDR-passing association is identified. Phenotype abbreviations and the numerical estimates are presented in Supplementary Table 15.
Extended Data Fig. 9 Pleiotropy network and etiologic clusters of heart failure.
The network is constructed from 207 genotype–phenotype associations across 79 unique diseases and 46 heart failure susceptibility loci identified from phenome-wide association (PheWAS) analysis at false discovery rate <1%. Nodes represent genetic loci labeled by the prioritized gene (solid background with bold–italic label, colored by categorical association across heart failure phenotypes) and phenotypes (translucent background, colored by phenotype category), sized proportionally to centrality measure. Edges (arrows connecting locus nodes to phenotype nodes) represent association, with thickness representing strength of association measured by absolute z score. a, Full network constructed using Davidson–Harel layout with edge bundling. b, Annotated network showing 18 etiological clusters identified using walktrap community detection algorithm. Phenotype abbreviations, phenotype categories and phenome-wide association results are presented in Supplementary Table 15 and Extended Data Fig. 8. Individual etiological clusters are presented in Fig. 4. Cluster membership and centrality measures of nodes are presented in Supplementary Table 26 and Supplementary Fig. 13.
Extended Data Fig. 10 Associations between heart failure sentinel genetic variants and related traits.
Circle data points represent association estimates in absolute Z score between 66 sentinel genetic variants within heart failure susceptibility loci (rows) and 24 related traits (columns). Non-transparent circles represent associations passing false discovery rate <1%. Solid circles with full color represent associations at P < 5 × 10−8, tested using two-sided statistical tests for association in the corresponding GWAS. Loci and traits were arranged based on hierarchical agglomerative clustering results, represented by dendrograms on the edges of the plot. Trait description and source for GWAS summary statistics are provided in Supplementary Table 27.
Supplementary information
Supplementary Information
Supplementary Figs. 1–13, Supplementary Methods, Supplementary Note and Appendices 1–6.
Supplementary Data 1
Regional genetic associations, gene prioritization scores, cross-trait association and study-level estimates across identified GWAS loci.
Supplementary Data 2
Study-level GWAS summary statistics quality control procedure and results.
Supplementary Tables
Supplementary Tables 1–27.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Henry, A., Mo, X., Finan, C. et al. Genome-wide association study meta-analysis provides insights into the etiology of heart failure and its subtypes. Nat Genet 57, 815–828 (2025). https://doi.org/10.1038/s41588-024-02064-3
Received:
Accepted:
Published:
Version of record:
Issue date:
DOI: https://doi.org/10.1038/s41588-024-02064-3
This article is cited by
-
A genetic map of human metabolism across the allele frequency spectrum
Nature Genetics (2025)