Genome-wide association study reveals mechanisms underlying dilated cardiomyopathy and myocardial resilience

Jurgens, Sean J.; Rämö, Joel T.; Kramarenko, Daria R.; Wijdeveld, Leonoor F. J. M.; Haas, Jan; Chaffin, Mark D.; Garnier, Sophie; Gaziano, Liam; Weng, Lu-Chen; Lipov, Alex; Zheng, Sean L.; Henry, Albert; Huffman, Jennifer E.; Challa, Saketh; Rühle, Frank; Verdugo, Carmen Diaz; Krijger Juárez, Christian; Kany, Shinwan; van Orsouw, Constance A.; Biddinger, Kiran; Poel, Edwin; Elliott, Amanda L.; Wang, Xin; Francis, Catherine; Ruan, Richard; Koyama, Satoshi; Beekman, Leander; Zimmerman, Dominic S.; Deleuze, Jean-François; Villard, Eric; Trégouët, David-Alexandre; Isnard, Richard; Boomsma, Dorret I.; de Geus, Eco J. C.; Tadros, Rafik; Pinto, Yigal M.; Wilde, Arthur A. M.; Hottenga, Jouke-Jan; Sinisalo, Juha; Niiranen, Teemu; Walsh, Roddy; Schmidt, Amand F.; Choi, Seung Hoan; Chang, Kyong-Mi; Tsao, Philip S.; Matthews, Paul M.; Ware, James S.; Lumbers, R. Thomas; van der Crabben, Saskia; Laukkanen, Jari; Palotie, Aarno; Amin, Ahmad S.; Charron, Philippe; Meder, Benjamin; Ellinor, Patrick T.; Daly, Mark; Aragam, Krishna G.; Bezzina, Connie R.

doi:10.1038/s41588-024-01975-5


Letter
Open access
Published: 21 November 2024


Nature Genetics volume 56, pages 2636–2645 (2024)


A Publisher Correction to this article was published on 04 December 2024

This article has been updated

Abstract

Dilated cardiomyopathy (DCM) is a heart muscle disease that represents an important cause of morbidity and mortality, yet causal mechanisms remain largely elusive. Here, we perform a large-scale genome-wide association study and multitrait analysis for DCM using 9,365 cases and 946,368 controls. We identify 70 genome-wide significant loci, which show broad replication in independent samples and map to 63 prioritized genes. Tissue, cell type and pathway enrichment analyses highlight the central role of the cardiomyocyte and contractile apparatus in DCM pathogenesis. Polygenic risk scores constructed from our genome-wide association study predict DCM across different ancestry groups, show differing contributions to DCM depending on rare pathogenic variant status and associate with systolic heart failure across various clinical settings. Mendelian randomization analyses reveal actionable potential causes of DCM, including higher bodyweight and higher systolic blood pressure. Our findings provide insights into the genetic architecture and mechanisms underlying DCM and myocardial function more broadly.


Main

DCM is a disease of the cardiac muscle characterized by increased left ventricular (LV) dimensions and decreased contractile function, which is not explained by abnormal loading conditions or coronary artery disease (CAD)^1,2,3,4,5. DCM represents a main cause of morbidity and mortality, as it predisposes to heart failure (HF) and lethal arrhythmias^3,4. While causal rare genetic variants are found in up to 25% of probands, most cases do not harbor a known monogenic cause of disease^6,7. Furthermore, actionable disease mechanisms remain elusive, with few preventative therapeutics⁴. Genome-wide association studies (GWAS) have recently demonstrated a polygenic contribution to DCM^8,9,10,11, opening an avenue for new mechanistic discovery, although these smaller studies were limited in power and identified only a handful of significant loci.

Here, we set out to assemble a large-scale GWAS meta-analysis using six datasets, comprising clinical DCM case–control and biobank sets. We included a total of 4,343 clinically ascertained DCM cases from three datasets (Fig. 1 and Supplementary Tables 1 and 2), including two published DCM datasets^8,10 (one reanalyzed; Supplementary Note) and a new clinical dataset from Amsterdam UMC (with one significant locus at BAG3; Supplementary Note, Supplementary Table 3 and Supplementary Figs. 1 and 2). We also performed harmonized GWAS of a strict, billing-code-based phenotype of nonischemic DCM (NI-DCM) in three biobank datasets. Substantial yield was afforded by the FinnGen study¹² (n = 3,350 cases; 14 loci; most significantly at BAG3 and HSPB7), with additional contributions from the United Kingdom (UK) Biobank (UKB; one locus at BAG3)¹³ and Mass General Brigham Biobank (MGB)^13,14 (Supplementary Tables 1 and 2 and Supplementary Figs. 1 and 2). We found strong genetic support for the strict biobank-based DCM construct (Supplementary Tables 4 and 5 and Supplementary Note). For comparison, we explored a broader definition of nonischemic cardiomyopathy (NICM)^15,16, which gave a lower discovery yield despite substantially larger case numbers (Extended Data Fig. 1, Supplementary Note and Supplementary Figs. 2 and 3). Therefore, we proceeded with the strict NI-DCM phenotype and performed a GWAS meta-analysis across all biobank and clinical DCM datasets, hereafter ‘GWAS-DCM.’

GWAS-DCM comprised 9,365 cases and 946,368 controls and covered 12,600,235 common variants (minor allele frequency (MAF) > 0.5%) after quality control (Fig. 1). The meta-analysis showed some genomic inflation (λ_GC,LDSC = 1.19; λ_GC, genomic inflation factor; LDSC, linkage disequilibrium score regression), which could largely be resolved as polygenic signal (LDSC intercept = 1.06; Supplementary Table 4 and Extended Data Fig. 2). At conventional genome-wide significance (P < 5 × 10⁻⁸) we uncovered 38 distinct loci, 27 of which had not been previously described for DCM (Fig. 2a, Supplementary Tables 6–8 and Supplementary Note).
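As a rough illustration of the genomic inflation factor mentioned above, the sketch below computes a plain median-based λ_GC from a list of two-sided P values. It is a minimal example under stated assumptions: the function name `genomic_inflation` is hypothetical, and it does not reproduce the LDSC pipeline or its intercept estimate, which separates confounding from polygenic signal.

```python
from statistics import NormalDist, median

# Median of a chi-square distribution with 1 degree of freedom (about 0.4549),
# derived from the 75th percentile of the standard normal distribution.
CHI2_1DF_MEDIAN = NormalDist().inv_cdf(0.75) ** 2


def genomic_inflation(p_values):
    """Return lambda_GC: median observed chi-square over its expected null median.

    p_values must be two-sided P values strictly greater than 0 and at most 1.
    """
    norm = NormalDist()
    # A two-sided P value p corresponds to z = inv_cdf(p / 2); chi-square = z squared.
    chi2 = [norm.inv_cdf(p / 2.0) ** 2 for p in p_values]
    return median(chi2) / CHI2_1DF_MEDIAN
```

Uniformly distributed P values give a value close to 1, while an excess of small P values pushes the ratio above 1. A value above 1 alone does not distinguish polygenicity from confounding, which is why the LDSC intercept is reported alongside it in the text.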

**Fig. 2: Locus and gene discovery for DCM.**

Most of the previously published DCM loci were recapitulated in GWAS-DCM^8,11 (Supplementary Table 6 and Extended Data Fig. 3). Furthermore, most loci overlapped with DCM loci from a recent preprint by Zheng et al.¹⁷ (Supplementary Note). GWAS-DCM signals showed strong pleiotropic effects on relevant cardiovascular traits, including cardiac magnetic resonance imaging (MRI) traits, electrocardiographic traits, blood pressure, HF and arrhythmia (Supplementary Note).

Previously published GWAS for DCM used multitrait analysis of GWAS (MTAG)¹⁸ to boost discovery power for new loci¹¹. We similarly aimed to maximize discovery using an MTAG approach, using GWAS of eight LV traits from 36,083 UKB participants¹⁹. We identified two clusters of genetically correlated traits that included endophenotypes with strong genetic correlation to GWAS-DCM (Supplementary Fig. 4 and Supplementary Table 9). Using the most strongly correlated trait from each cluster, namely global circumferential strain (Ecc; r_g = 0.75 with DCM) and LV end systolic volume (LVESVi; r_g = 0.7 with DCM), we performed an MTAG for DCM (‘MTAG-DCM’). MTAG-DCM identified 65 significant loci, 50 of which had not been published previously for DCM (Supplementary Tables 10–12; Extended Data Fig. 3 and Supplementary Note).

We then performed a replication analysis using independent samples from HERMES (Heart Failure Molecular Epidemiology for Therapeutic Targets), MVP (Million Veteran Program) and the ‘All of Us’²⁰ datasets, totaling up to 13,258 cases of NICM/DCM and 1,435,287 controls (Extended Data Fig. 4 and Supplementary Tables 13 and 14). Of 36 testable GWAS-DCM loci, all were concordant in effect direction and 92% replicated at P < 0.05. Of 64 testable MTAG-DCM loci, 88% replicated at P < 0.05 (81% for ‘MTAG-only’ loci; Supplementary Note). No loci showed meaningful heterogeneity in discovery (Supplementary Tables 15 and 16). These results confirm the robustness of our GWAS and MTAG approaches.

To identify cell types of relevance to DCM biology, we performed enrichment analyses using two published LV single nucleus RNA sequencing (snRNA-seq) datasets^21,22. Only cardiomyocyte-specific genes were significantly and robustly enriched for DCM heritability across datasets (P < 3 × 10⁻⁷ for enrichment coefficient; Supplementary Table 17, Extended Data Fig. 5 and Supplementary Fig. 5). Of note, Zheng et al. described enrichments for DCM heritability in other cardiac cell types¹⁷; this discrepancy is most probably due to technical differences, including use of a different enrichment statistic²³ (Supplementary Note). Taken together, our results highlight the central role of cardiomyocyte dysfunction in DCM pathogenesis.

We applied various approaches for variant-to-gene mapping^24,25,26 (Methods). In ten GWAS-DCM loci, a lead variant was linked to a protein-altering coding variant affecting a single gene (for example, BAG3, TTN, FHOD3, ADAMTS7, CAND2; Supplementary Tables 6 and 10). Among these, BAG3, TTN and FHOD3 represent known Mendelian cardiomyopathy genes^7,27,28. A well-imputed (INFO = 0.997) TUBA8 missense variant (22:18609493:G:A) was a lead variant in GWAS-DCM (Supplementary Fig. 6). TUBA8 is an α-tubulin predicted to be a component of myocyte cytoskeletons²⁹. The variant was testable only in FinnGen, reflecting an 18-fold enrichment in Finnish over non-Finnish Europeans³⁰.

Colocalization analyses with molecular traits—using expression quantitative trait loci (eQTLs) for LV from the genotype-tissue expression project (GTEx)³¹, eQTLs for blood from eQTLGen³² and protein quantitative trait loci (pQTLs) in blood from the UKB Pharma Proteomics Project (PPP)³³—helped prioritize genes and informed direction of effect in certain loci (Supplementary Table 18). We found 24 distinct transcripts/proteins associated with DCM at high posterior probability (PP4 > 70%). For instance, genetically predicted lower LV expression of TMEM182 (encoding a regulator of myoblast differentiation³⁴) and lower genetically predicted blood expression of FBXO32 (a recessive DCM gene^35,36) were associated with increased DCM risk. Higher predicted expressions of several genes, including MLF1, MMP1 and MAPT, were associated with increased DCM risk.

We found that the polygenic priority score method (PoPS) was a powerful tool to identify cardiomyopathy genes, as the top 100 genes from GWAS-DCM were enriched 119-fold (95% confidence interval (CI) 47–285; two-sided P < 2.6 × 10⁻¹⁶; Fisher exact test) for known Mendelian DCM and hypertrophic cardiomyopathy (HCM) genes (ClinGen genes at ≥moderate evidence; Supplementary Table 19). Therefore, PoPS was assigned high weight in our final prioritization score.

We synthesized the various prioritization approaches into one score to identify a list of prioritized genes (Fig. 2b and Supplementary Tables 20 and 21; Methods). Across prioritized GWAS-DCM genes (n = 35 genes with ≥2.5 points) and MTAG-DCM genes (n = 60 genes), we narrowed down to 63 unique prioritized genes (defined as ≥2.5 points and highest score within a locus in either GWAS-DCM or MTAG-DCM; Fig. 2b and Extended Data Fig. 6). Among these prioritized genes were, as expected, several Mendelian cardiomyopathy genes, but also several genes with unknown or lesser-known roles in the heart (for example, CRIM1, MLF1, HSPA4, ERBB4, MITF, MLIP, MAP3K7, NEDD4L, DNAJC18 and HSPB8). HSPB8, HSPA4 and DNAJC18 encode proteins from the heat shock family, along with HSPB7, a gene functionally validated in DCM biology after being identified initially through GWAS³⁷.

Accordingly, gene set enrichment analyses, using the 63 prioritized genes, identified several significant gene sets including ‘Cellular response to heat stress’ (Supplementary Tables 22 and 23 and Supplementary Fig. 7). Most remaining gene sets were related to (cardiac) muscle development and function. Other distinct pathways emerged including ERBB signaling²² and cytoskeletal organization^38,39, as well as ‘Apoptosis by doxorubicin’ and ‘Aberrant mitosis by docetaxel.’ Doxorubicin and docetaxel are chemotherapeutics that may induce DCM-like phenotypes⁴⁰.

To scrutinize the prioritized genes further, we queried published single-cell data of the human LV from three datasets^21,22,41—including data from 61 nonfailing donors and 81 DCM patients. We found that many of the prioritized genes showed high and/or preferential expression in cardiomyocytes (Fig. 3 and Supplementary Table 24). These genes underscore the role of the contractile apparatus in DCM pathogenesis⁴², through known cardiac sarcomeric genes (for example, TTN, OBSCN and ACTN2), but also lesser-described structural genes including SVIL (encoding an actin-binding protein recently implicated in HCM¹⁹) and PDLIM5 (encoding a cytoskeletal linker⁴³). Other genes with cardiomyocyte-specific expression included MITF (encoding a transcription factor implicated in cardiac hypertrophy in vitro⁴⁴) and MLIP (encoding a lamin-interacting protein associated with myocardial adaptation in mice⁴⁵). Several genes showed significant differential expression (DE) between DCM and nonfailing hearts (Fig. 3 and Supplementary Table 25). Notably, within cardiomyocytes, such genes included MAP3K7 (encoding a mitogen-activated protein implicated in cardiospondylofacial syndrome⁴⁶), ADAMTS7 (encoding a thrombospondin-regulating metalloprotease⁴⁷) and both PRKCA and CAMK2D (involved in calcium handling^48,49). Of note, several genes highlighted from both GWAS and single-cell data are being investigated as targets for other conditions (Supplementary Table 26). These results show how integration of GWAS and single-cell data—paired with appropriate cell type priors—may identify plausible gene candidates for cardiomyopathy and LV function.

**Fig. 3: Cell-type-specific expression and DE of the top prioritized genes for DCM from three single-cell LV datasets.**

We next used genetic data to identify potential causes and consequences of DCM through Mendelian randomization (MR)⁵⁰. We performed a bidirectional MR screen using the weighted median (WM) method, based on genetic instruments constructed from GWAS for 73 common diseases and quantitative traits (Methods). At Bonferroni significance, we identified five potential causal risk factors for DCM (weight, body mass index (BMI), atrial fibrillation (AF), systolic blood pressure (SBP) and height), and two potential consequences of DCM liability (HF and mean platelet (thrombocyte) volume; Fig. 4a, Supplementary Table 27 and Supplementary Note). Weight, systolic blood pressure and AF remained as independent risk factors for DCM in multivariable MR (Supplementary Table 28). While these results partially recapitulate previous descriptions of causal factors for general HF⁵¹, we did not observe evidence for a causal role of coronary disease (g = −0.09, P = 0.13) or diabetes (g = −0.05, P = 0.18) on DCM.

**Fig. 4: Bidirectional MR screen for DCM and 73 common diseases and quantitative traits.**

To scrutinize the potential causal associations further, we employed two additional methods. First, we used MR-Egger regression⁵⁰ and found that most of the signals survived filtering using this method (P_slope < 0.05 and P_intercept > 0.1; Fig. 4 and Supplementary Table 27). Second, we used CAUSE—an approach that models a pleiotropic pathway and tests whether a causal model is a better fit for the data than a sharing model⁵² (Supplementary Table 29 and Supplementary Figs. 8–10). CAUSE estimated that BMI, weight and SBP all conferred increased risk of DCM (Fig. 4b). The causal role of blood pressure is consistent with the main pharmacotherapeutic approach to DCM, which consists partly of blood-pressure-lowering medications⁴. Similarly, there is a growing body of observational evidence linking obesity to risk of HF, DCM and other cardiomyopathies^53,54,55. In summary, our data support that SBP and weight are reasonable parameters for action in (premorbid) DCM management.

**Fig. 5: DCM genetic liability as a predictor of systolic HF across a range of settings in All of Us.**

We then constructed polygenic risk scores (PRS) from our GWAS-DCM and MTAG-DCM summary statistics⁵⁶, and tested these in three datasets. PRS constructed from GWAS-DCM and MTAG-DCM were associated significantly and strongly with DCM (Fig. 5 and Supplementary Tables 30–32), with MTAG-DCM scores yielding the best predictive performance across all tested strata (Extended Data Figs. 7 and 8, Supplementary Fig. 12 and Supplementary Note). For instance, in the All of Us dataset, PRS was associated strongly with DCM among European (OR per s.d. = 1.73; P = 9.0 × 10⁻³⁷) and African ancestries (OR per s.d. = 1.61; P = 2.5 × 10⁻¹⁰), with a weaker but significant signal among Admixed-American ancestry (OR per s.d. = 1.34; P = 2.4 × 10⁻³).

In the Amsterdam UMC dataset, clinical DCM cases carrying rare disease-causing variants (‘genotype-positive’) had significantly lower PRS than genotype-negative DCM cases (P = 0.0015), and genotype-negative cases were enriched more strongly for higher PRS (Fig. 6). Nevertheless, DCM PRS was enriched significantly in both groups compared with controls. These results highlight that polygenic burden contributes to disease risk in carriers and in noncarriers of rare pathogenic alleles, although carriers might need less polygenic burden to reach disease state^57,58.

**Fig. 6: Distribution and association of PRS among DCM patients by rare variant genotype status in the Amsterdam cohort.**

Finally, we assessed whether DCM PRS may have value for prediction of systolic HF—a condition associated with substantial morbidity and healthcare costs^59,60. In All of Us, we found significant associations for DCM PRS with systolic HF (OR per s.d. = 1.30; P = 7.8 × 10⁻⁷³), which persisted after removal of NI-DCM and NICM cases (OR per s.d. = 1.24; P = 8.4 × 10⁻⁴³; Supplementary Table 33). Furthermore, the PRS was a predictor of systolic HF across a range of settings, including after AF diagnosis (P = 1.4 × 10⁻¹³), after hypertension diagnosis (P = 2.4 × 10⁻³⁹), after myocardial infarction (P = 4.4 × 10⁻⁴) and among carriers of pathogenic rare variants for DCM (P = 5.2 × 10⁻⁷; Fig. 5 and Extended Data Fig. 9). These findings support the notion that the DCM PRS captures liability to intrinsic myocardial dysfunction or structural weakness, which may determine the resilience of the LV upon experiencing adverse events or prolonged stress.

In summary, we performed a large-scale GWAS and MTAG for DCM—including 9,365 strict DCM cases—and identified 70 loci at genome-wide significance. Several main conclusions arise from our work. First, on a cell-type level, we found that the heritability of DCM is enriched predominantly for cardiomyocyte expression, highlighting the central role of cardiomyocyte dysfunction in DCM pathogenesis. Second, mapping of loci to genes using various methods identified 63 potential effector genes, which may inform on-target and off-target effects in therapeutics development. Third, MR analyses support a causal role of bodyweight and SBP in DCM risk, indicating that early blood pressure regulation and weight reduction may be considerations in DCM patients or at-risk people. Fourth, a PRS derived from our GWAS predicts DCM, with impactful—albeit potentially differing—contributions in carriers and noncarriers of rare pathogenic variants. Fifth, the genetic liability to DCM underlies systolic HF, and may modulate risk of systolic failure across a range of settings. Our results have implications for our understanding of the mechanisms underlying DCM and myocardial resilience.

Methods

GWAS for dilated cardiomyopathy

We collected data from three case–control datasets that ascertained clinical DCM patients, and data from three large biobank studies. The clinical DCM datasets included (1) a published GWAS by Garnier et al. that enrolled 2,651 DCM cases from France, Germany, Italy, the UK and the United States⁸; (2) a reanalyzed dataset of 909 DCM cases from Heidelberg, Germany¹⁰ (Supplementary Note) and (3) a new dataset of Dutch DCM cases from Amsterdam UMC. The Amsterdam cohort comprised DCM patients referred for genetic testing at Amsterdam UMC, who underwent chart review for DCM diagnosis and had evidence of hypocontractility on imaging; 978 DCM cases passed all genotype quality-control criteria, of which 783 homogeneous cases of patients of European ancestry were included in GWAS (Supplementary Note and Supplementary Table 3). Further details for the various cohorts are described in the Supplementary Note and are summarized in Supplementary Tables 1 and 2. All clinical DCM cohorts applied imaging criteria as part of case definition.

We further performed GWAS in three biobanks, namely FinnGen (freeze 11)¹², UKB¹³ and MGB¹⁴ (Supplementary Tables 1 and 2 and Supplementary Note). In these datasets, we defined two phenotypes using International Classification of Disease (ICD) coding. First, we defined an NICM phenotype as described previously¹⁵, using ICD10 code I42.0 ‘dilated cardiomyopathy’ and ICD codes for ‘left heart failure,’ with exclusion of—at minimum—antecedent acute coronary syndromes and revascularization procedures (Supplementary Table 1 and Supplementary Note). Across biobanks, the NICM definition totaled 13,478 cases. We also defined a strict NI-DCM phenotype using only I42.0 (again with minimum exclusion of antecedent acute coronary syndromes and/or revascularization procedures), totaling 5,022 cases (Supplementary Tables 1 and 2). In all biobank datasets, individuals with other HF codes—but not fulfilling the case criteria—were removed from the controls. In all biobanks, REGENIE⁶¹ was used for GWAS. Further details are presented in Supplementary Table 2 and Supplementary Note.

All study cohorts either collected informed consent from research participants, or received appropriate approval from ethical/review committees to waive the requirement of informed consent. All study protocols were approved by appropriate ethical/review committees; approval was granted as described in the original publications for published cohorts^8,10,11,12,13,14; the Amsterdam UMC study protocol, focused on GWAS for heritable cardiovascular diseases, was approved by the Amsterdam UMC Medical Ethical Review Committee.

GWAS meta-analyses

Stringent variant quality control was applied in each dataset. Variants were filtered to high imputation quality (INFO) ≥ 0.5 or R² ≥ 0.5; MAF ≥ 0.5%; INFO ≥ 0.8 if MAF < 1%; INFO × MAF × Ncases × 2 ≥ 5; and variants with nonambiguous alleles (Supplementary Table 2). Before meta-analysis, variants were aligned to genome build GRCh38, using the liftOver command line tool if not already on the correct genome build⁶². GWAS meta-analyses were then performed using an inverse-variance weighted fixed-effects approach implemented in METAL⁶³ (March 25, 2011 release). GWAS meta-analyses were performed combining the three clinical DCM datasets, combining the three NICM GWAS from the biobank datasets and combining the NI-DCM-GWAS from the biobanks. After meta-analyses, results were filtered to common variants (MAF > 0.5%). Variants were considered significant if reaching the conventional genome-wide significance level (P < 5 × 10⁻⁸). In all GWAS, hypothesis tests were two-sided.
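The variant filters and the fixed-effects meta-analysis described above can be sketched as follows. This is an illustrative reimplementation, not the authors' pipeline and not METAL itself: the function names are hypothetical, "nonambiguous alleles" is assumed to mean excluding strand-ambiguous A/T and C/G SNPs, and a single `info` argument stands in for either INFO or R².

```python
import math

# Strand-ambiguous (palindromic) allele pairs; treating these as the
# "ambiguous" alleles is an assumption made for this sketch.
AMBIGUOUS = {frozenset("AT"), frozenset("CG")}


def passes_qc(info, maf, n_cases, ref, alt):
    """Apply the per-variant filters listed in the text to one variant."""
    if info < 0.5 or maf < 0.005:
        return False
    # Rarer variants (MAF < 1%) need higher imputation quality.
    if maf < 0.01 and info < 0.8:
        return False
    # Quality-weighted expected minor-allele count among cases must be >= 5.
    if info * maf * n_cases * 2 < 5:
        return False
    return frozenset((ref.upper(), alt.upper())) not in AMBIGUOUS


def ivw_fixed_effects(betas, ses):
    """Inverse-variance weighted fixed-effects estimate across studies.

    Returns (pooled beta, pooled standard error, two-sided P value).
    """
    weights = [1.0 / se ** 2 for se in ses]
    total = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total
    se = math.sqrt(1.0 / total)
    # Two-sided normal P value: 2 * (1 - Phi(|z|)) = erfc(|z| / sqrt(2)).
    p = math.erfc(abs(beta / se) / math.sqrt(2.0))
    return beta, se, p
```

In practice the effect alleles must be aligned across studies before pooling; that harmonization step is omitted here.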

Heritability and genetic correlations

We used LDSC⁶⁴ (v.1.0.1) to estimate the heritability attributable to common single nucleotide polymorphism (SNP) variants (h²_SNP) for different meta-analyses. The European subset of the 1000Genomes⁶⁵ (v.3.5) dataset was used as a linkage disequilibrium (LD) reference panel, and analyses were subsetted to nonambiguous HapMap3 variants. Heritability values were transformed to the liability scale, assuming a population prevalence of 0.4% for DCM⁴ and 1.2% for NICM (based on UKB prevalence). We further used bivariate LDSC to estimate the genetic correlations (r_g) between the various meta-analyses^4,66. Hypothesis tests were performed using a null hypothesis of 0, using two-sided tests.
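The liability-scale transformation mentioned above is commonly done with the standard conversion for ascertained case–control data, which depends on the population prevalence and the proportion of cases in the sample. The sketch below assumes that standard conversion; the function name is hypothetical and no heritability estimates from the study are reproduced.

```python
from statistics import NormalDist


def observed_to_liability(h2_obs, sample_prev, pop_prev):
    """Convert observed-scale SNP heritability to the liability scale.

    sample_prev: proportion of cases in the GWAS sample (P).
    pop_prev:    assumed population prevalence of the disease (K).
    """
    norm = NormalDist()
    # Liability threshold above which individuals are affected.
    threshold = norm.inv_cdf(1.0 - pop_prev)
    # Height of the standard normal density at that threshold.
    z = norm.pdf(threshold)
    k, p = pop_prev, sample_prev
    return h2_obs * (k * (1.0 - k)) ** 2 / (p * (1.0 - p) * z ** 2)
```

For the DCM analysis, `pop_prev` would be 0.004 as stated in the text, and `sample_prev` could be approximated as 9,365 / (9,365 + 946,368); for meta-analyses with differing case fractions across cohorts, an effective sample prevalence may be more appropriate.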

The biobank NI-DCM meta-analysis showed a comparable h²_SNP and high r_g with the clinical DCM meta-analysis (see above), and therefore we proceeded with an overall meta-analysis combining the clinical DCM-GWAS with the biobank NI-DCM-GWAS, from here referred to as GWAS-DCM.

Multitrait analyses

MTAG leverages the genetic correlation between a target GWAS (for example, for DCM) and GWAS for related traits (for example, LV parameters) to increase the discovery power, while accounting for potential sample overlap. We used MTAG (v.1.0.8) to first estimate a genetic correlation matrix between GWAS-DCM, NICM, HCM¹⁹, and eight LV MRI traits from a previous GWAS (n = 36,083 participants from UKB)¹⁹. Per-SNP effective sample sizes (n_snp-eff) were computed from the s.e., using the formula

$$n_{\mathrm{snp\text{-}eff}} = \frac{1}{2 \times \mathrm{MAF} \times (1 - \mathrm{MAF}) \times \mathrm{s.e.}^{2}}$$
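The formula above translates directly into code. The function name `snp_effective_n` is a hypothetical label chosen for this sketch.

```python
def snp_effective_n(maf, se):
    """Per-SNP effective sample size from minor allele frequency and standard error."""
    return 1.0 / (2.0 * maf * (1.0 - maf) * se ** 2)
```

For example, a variant with MAF = 0.5 and s.e. = 0.01 has an effective sample size of about 20,000; a rarer variant with the same standard error implies a larger effective sample size, because less allelic variance is available per individual.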

MTAG developers recommend utilizing GWAS of traits that are strongly genetically correlated with the target GWAS (r_g > 0.7). We additionally aimed to reduce the number of included traits to limit potential false-positive findings. After computing an initial genetic correlation matrix (Supplementary Table 9), we identified two large clusters of MRI traits correlated strongly with GWAS-DCM. From the clusters of genetically correlated traits (a ‘contractility’ cluster and a ‘volumetric’ cluster; Supplementary Fig. 4), we identified two index traits with r_g > 0.7 (Ecc and LVESVi). We then ran MTAG—including GWAS-DCM, Ecc GWAS and LVESVi GWAS—using default parameters. MTAG estimated that the boosted summary statistics for DCM equated to an increase in effective sample size of approximately 73% (ref. ¹⁸). The maximum false-discovery rate computed by MTAG was 0.03, meaning that, under the most unfavorable distribution of trait-specific effect sizes, 3% of signals may represent false positives¹⁸. Imaging-based contractility and LV dimensions represent direct (diagnostic) endophenotypes of DCM^3,4,5,67. Therefore, the true false-discovery rate is probably even lower. The results from this analysis are referred to as ‘MTAG-DCM.’ Significance was determined at the conventional genome-wide level (P < 5 × 10⁻⁸). In all MTAG, hypothesis tests were two-sided.

Locus definitions, variant annotation and gene prioritization

Functional Mapping and Annotation (FUMA) processing and annotation

GWAS-DCM and MTAG-DCM were processed in Functional Mapping and Annotation (FUMA)⁶⁸ v.1.6.1. Lead variants were defined as variants at genome-wide significance (P < 5 × 10⁻⁸) and r² < 0.05 (using ‘1KG/Phase3 EUR’ as LD reference). Genomic loci were subsequently defined by merging over 1 Mb distances. FUMA utilizes Multi-marker Analysis of GenoMic Annotation (MAGMA) v.1.08 to perform gene-based testing⁶⁹; FUMA then uses the MAGMA genes for tissue enrichment analysis based on GTEx v.8 expression (GTEx/v8/gtex_v8_ts_general_avg_log2TPM)³¹. Variants, and their LD partners, were further annotated using ANNOVAR⁷⁰ (v.2017-07-17). Loci were considered new if none of the lead variants overlapped (at 1 Mb windows) known lead variants from previous DCM-GWAS and DCM MTAG^8,11, or were found associated with DCM according to GWAS Catalog⁷¹ or OpenTargets^24,72 (queried in October 2023).
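The locus-merging step can be illustrated with a simple interval merge over lead-variant positions. This is a simplified sketch, not FUMA's implementation: FUMA operates on LD blocks around independent variants, whereas this example merges lead variants by position only, and the inclusive 1 Mb boundary is an assumption.

```python
def merge_into_loci(lead_variants, max_gap=1_000_000):
    """Merge lead variants on the same chromosome that lie within max_gap bp.

    lead_variants: iterable of (chrom, pos) tuples, with chrom as a string.
    Returns a list of (chrom, start, end, n_lead_variants) tuples.
    """
    loci = []
    for chrom, pos in sorted(lead_variants):
        same_chrom = bool(loci) and loci[-1][0] == chrom
        if same_chrom and pos - loci[-1][2] <= max_gap:
            # Extend the current locus to include this lead variant.
            c, start, _, n = loci[-1]
            loci[-1] = (c, start, pos, n + 1)
        else:
            # Start a new locus.
            loci.append((chrom, pos, pos, 1))
    return loci
```

Because merging is chained, a run of lead variants each within 1 Mb of the previous one collapses into a single locus even if its total span exceeds 1 Mb.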

Protein-altering variation and closest protein-coding gene

For gene prioritization, we first assessed whether lead variants were in LD (r² > 0.65) with protein-coding protein-altering variants based on ANNOVAR annotations in FUMA. Second, we identified the closest protein-coding gene for lead variants, based on OpenTargets (22.10 update).

OpenTargets Variant2Function

Third, we used Variant2Function (V2F) from the OpenTargets platform²⁴ (22.10 update) to map variants to genes. V2F is a phenotype-agnostic machine-learning algorithm that identifies potential genes affected by genomic variants; we extracted the top three genes identified by V2F as being potentially affected by lead variants from GWAS-DCM and MTAG-DCM.

Polygenic priority score

Fourth, we used the PoPS method²⁵. PoPS uses gene-level associations—computed from GWAS summary statistics—to learn gene features associated with the trait in a joint model by polygenic enrichment; features consist of cell-type-specific gene expression, biological pathways and protein–protein interactions (PPIs). We first performed gene region based analysis with MAGMA⁶⁹ v.1.10 using the European subset of the 1000Genomes Phase 3 as a reference dataset. Based on gene-level results from MAGMA, we computed polygenic priority scores for 18,383 genes using the full set of features provided with PoPS v.0.2.

MR and colocalization for eQTLs and pQTLs

Fifth, we used MR of quantitative trait loci for expression (eQTLs) and protein abundance (pQTLs), followed by colocalization²⁶. As instruments for expression in the heart, we used cis-eQTLs for LV from GTEx³¹ v.8 (n = 386 left ventricular samples). As instruments for expression in whole blood, we used cis-eQTLs from the eQTLGen consortium³² (n = 31,684 samples; we used the 2019 dataset, downloaded from https://www.eqtlgen.org/cis-eqtls.html). As instruments for protein abundance, we used pQTLs derived from the UKB PPP, which used the Olink platform for proteomic profiling³³; we downloaded summary statistics for the ‘combined’ set (from https://www.synapse.org/#!Synapse:syn51364943/files/; n ≈ 34,000 samples) and defined cis-pQTLs as variants present within 1 Mb of the associated protein. All three datasets were subsequently processed the same way and harmonized with GWAS-DCM or MTAG-DCM summary statistics (Supplementary Note). We defined our instruments by clumping the cis-eQTL/cis-pQTL variants, using two-sided P < 5 × 10⁻⁸, r² < 0.0005 and window size of 10 Mb in PLINK2 (refs. ^32,73). The R-package TwoSampleMR (v.0.5.6) was used to perform two-sample MR, using Wald ratio tests for single-instrument exposures and the inverse-variance weighted approach for exposures with multiple instruments⁷⁴. P values from MR were all two-sided. Analyses were performed for both GWAS-DCM and MTAG-DCM; separate Bonferroni corrections were applied to each, and separate corrections were applied for eQTL and pQTL datasets. Significant hits were subsequently subjected to colocalization⁷⁵ using the R-package coloc (v.4.0.4) with strict priors (p1 = 1 × 10⁻⁴, p2 = 1 × 10⁻⁴, p12 = 1 × 10⁻⁶). A posterior probability for a shared causal variant (PP4) of >0.5 was considered some evidence of colocalization, while PP4 > 0.7 was considered strong colocalization.
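The two MR estimators named above can be sketched in a few lines. This is a minimal illustration rather than the TwoSampleMR implementation: the function names are hypothetical, the Wald ratio standard error uses the first-order approximation that ignores uncertainty in the exposure effect, and the inverse-variance weighted estimate is the plain fixed-effects form.

```python
import math


def wald_ratio(beta_exp, beta_out, se_out):
    """Single-instrument MR: ratio of outcome effect to exposure effect."""
    beta = beta_out / beta_exp
    # First-order standard error; uncertainty in beta_exp is ignored.
    se = se_out / abs(beta_exp)
    return beta, se


def ivw_mr(beta_exp, beta_out, se_out):
    """Multi-instrument MR: fixed-effects inverse-variance weighted estimate.

    Equivalent to a weighted regression of outcome effects on exposure
    effects through the origin, with weights 1 / se_out^2.
    """
    num = sum(bx * by / s ** 2 for bx, by, s in zip(beta_exp, beta_out, se_out))
    den = sum(bx ** 2 / s ** 2 for bx, s in zip(beta_exp, se_out))
    return num / den, math.sqrt(1.0 / den)


def two_sided_p(beta, se):
    """Two-sided normal P value for an estimate and its standard error."""
    return math.erfc(abs(beta / se) / math.sqrt(2.0))
```

With a single instrument, the inverse-variance weighted estimate reduces to the Wald ratio, which is a useful sanity check when harmonizing exposure and outcome effects.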

Omnibus gene prioritization score

We then assembled the information from the five prioritization methods into one score. Given that PoPS showed a marked enrichment of known Mendelian DCM and HCM genes genome-wide, this method was strongly weighted in the score. In summary:

- We assigned 1 point to a gene if it was the top gene prioritized by PoPS within a locus (defined as within ±500 kb from the lead variant, or ±1 Mb if fewer than two genes were within 500 kb), or 0.5 points if it was within the top three genes.
- We assigned an additional point to genes if they were also among the top 100 PoPS genes genome-wide, or 0.5 points if they ranked 101–200 genome-wide.
- We assigned 1 point to a gene if it was the nearest protein-coding gene to the lead variant.
- We assigned 1 point to a gene if it was affected by a protein-altering variant that was, or was in LD with, a lead variant, or 0.5 points if several genes in the locus were implicated by protein-altering variation.
- We assigned 1 point to the highest OpenTargets V2F gene for a lead variant, or 0.5 points for the second and third genes.
- We assigned 1 point to a gene within a locus if there was strong evidence from eQTL/pQTL colocalization (PP4 > 0.7), or 0.5 points if there was moderate evidence (PP4 > 0.5) and/or several genes were implicated in the locus by this approach.

In total, therefore, any given gene could attain between 0 and 6 points. For downstream analyses, we assigned the gene with the highest score across lead variants in the locus as the most highly prioritized gene for that locus. In case of ties, we first assessed whether the gene was convincingly prioritized in the locus based on the other discovery approach (that is, GWAS or MTAG); if not, then one was picked at random. From these genes, we further defined a final list of prioritized targets, using a prioritization score cutoff of ≥2.5 points.
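The scoring rules above can be sketched in Python as follows. This is an illustrative reading of the scheme, not the authors' code: the `GeneEvidence` container and its field names are hypothetical, the genome-wide PoPS bonus is assumed to be awarded independently of the within-locus rank, and the points for protein-altering variation and colocalization are passed in directly because they depend on locus-level context (for example, whether several genes were implicated).

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class GeneEvidence:
    pops_locus_rank: Optional[int] = None   # PoPS rank within the locus (1 = top gene)
    pops_genome_rank: Optional[int] = None  # PoPS rank genome-wide
    is_nearest_gene: bool = False           # nearest protein-coding gene to the lead variant
    coding_points: float = 0.0              # 1, 0.5 or 0, decided at the locus level
    v2f_rank: Optional[int] = None          # OpenTargets V2F rank for the lead variant
    coloc_points: float = 0.0               # 1, 0.5 or 0, decided at the locus level

def omnibus_score(ev: GeneEvidence) -> float:
    """Sum the six components; the result lies between 0 and 6."""
    score = 0.0
    if ev.pops_locus_rank is not None:
        if ev.pops_locus_rank == 1:
            score += 1.0
        elif ev.pops_locus_rank <= 3:
            score += 0.5
    if ev.pops_genome_rank is not None:
        if ev.pops_genome_rank <= 100:
            score += 1.0
        elif ev.pops_genome_rank <= 200:
            score += 0.5
    if ev.is_nearest_gene:
        score += 1.0
    score += ev.coding_points
    if ev.v2f_rank is not None:
        if ev.v2f_rank == 1:
            score += 1.0
        elif ev.v2f_rank <= 3:
            score += 0.5
    score += ev.coloc_points
    return score

def is_prioritized(score: float, cutoff: float = 2.5) -> bool:
    """Apply the final cutoff of at least 2.5 points."""
    return score >= cutoff
```

Tie-breaking between genes with equal scores (by the other discovery approach, then at random) is deliberately left out of this sketch.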

Gene set enrichment analyses

We used two platforms for gene set enrichment analyses. First, we used the FUMA Gene2Func function⁶⁸ (v.1.6.1) to perform enrichment analyses restricted to FUMA-curated pathways. As input we used the curated set of prioritized genes across GWAS-DCM and MTAG-DCM (n = 63 genes), with all Ensembl (v.102) protein-coding genes as background. We required at least two overlapping genes to identify a potential gene set, and we determined significance using a false-discovery-rate-adjusted one-sided P < 0.05 (two-step Benjamini–Krieger–Yekutieli method).

We additionally used the g:Profiler platform⁷⁶ (v. September 2023) to test for enrichment of gene sets from several predefined sources. The g:Profiler algorithm uses one-sided Fisher’s exact tests to test for enrichment of a prespecified list of genes across many gene sets, and subsequently adjusts one-sided P values for multiple testing while taking into account the correlation between gene sets (g:SCS method⁷⁶). Again the 63 prioritized genes were put forward for enrichment testing; g:Profiler used Ensembl v.110 as the background of protein-coding genes.
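For readers unfamiliar with the test, a one-sided Fisher's exact test for over-representation is equivalent to the upper tail of a hypergeometric distribution. The sketch below computes that tail with the Python standard library; the function and argument names are hypothetical, and the g:SCS multiple-testing adjustment applied by g:Profiler is not reproduced here.

```python
from math import comb

def enrichment_p(n_background, n_in_set, n_query, n_overlap):
    """One-sided P value: probability of at least n_overlap gene-set members
    among n_query genes drawn without replacement from n_background genes,
    of which n_in_set belong to the gene set."""
    total = comb(n_background, n_query)
    upper = min(n_in_set, n_query)
    tail = sum(
        comb(n_in_set, k) * comb(n_background - n_in_set, n_query - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total
```

In the setting described above, `n_query` would be the 63 prioritized genes and `n_background` the number of protein-coding genes in the chosen Ensembl release.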

Since our prioritized genes may have been preselected towards genes with high cardiac expression (that is, through gene features learnt by PoPS), we performed a sensitivity analysis using genes nominated by MAGMA⁶⁹—a method based only on association signals near gene regions.

Cardiac-cell-type enrichment

To identify causal cell types for GWAS-DCM and MTAG-DCM, we used stratified LDSC, as described in Finucane et al.²³. To this end, we utilized two published single-nucleus RNA sequencing (snRNA-seq) datasets, one from Chaffin et al.²¹ and another from Reichart et al.²². The Chaffin et al. dataset included LV expression data on 11 DCM hearts, 16 nonfailing hearts and 15 HCM hearts. The cardiomyopathy samples came from explanted hearts with end-stage disease. Chaffin et al. identified 17 main cell types, which were used to define cell-type-specific gene programs for enrichment testing (see Supplementary Note for detailed methods). The Reichart et al. dataset included data on 61 end-stage cardiomyopathy hearts (52 with DCM) and 18 nonfailing controls. Reichart et al. identified nine main cell types in the LV, which were used to define cell-type-specific gene programs for enrichment testing (see Supplementary Note for detailed methods). Finally, in addition to the ‘cell-type-specific’ expression annotations described above, we also explored ‘disease-dependent’ cell-type annotations. Disease-dependent programs were based on genes with significant DE between DCM samples and nonfailing samples, irrespective of their cell-type-specificity. The detailed methods for this analysis are described in the Supplementary Note. Of note, cell-type-enrichment analyses were not informed in any way by our GWAS/MTAG gene prioritization scheme.

Single-cell expression and DE

We then aimed to identify cell-type-expression patterns and cellular functions for the prioritized genes from our GWAS and MTAG. To this end, we used available snRNA-seq or scRNA-seq data from three published datasets, including Chaffin et al.²¹, Reichart et al.²² and Koenig et al.⁴¹. Koenig et al. performed snRNA-seq/scRNA-seq on 18 LVs from DCM patients and 27 LVs from control donors.

Using the processed AnnData/Seurat objects from each study, we first restricted to control/nonfailing samples from the LV, and then log-normalized the expression data with scale 10,000 (if not already normalized). To harmonize cell-type data across datasets, we then used the available cell-type and/or cell-state annotations to collapse or split cell types into ‘harmonized’ cell types (Supplementary Note). For genes with at least 0.5 points from our prioritization scheme in GWAS-DCM or MTAG-DCM, we then exported several expression measures from each dataset. These included (1) the mean normalized expression within harmonized cell types and pseudobulk data and (2) the percentage of nuclei/cells with nonzero expression for each harmonized cell type and in pseudobulk. We then combined data by taking the weighted average of expression values (weighted by the number of nuclei/cells contributing in each dataset). For plotting purposes, we then focused on the list of 63 prioritized genes and computed the scaled relative normalized expression of a given gene in a given cell type, as compared with all other cell types.
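A minimal sketch of two of the steps described above, assuming that "log-normalized with scale 10,000" means the common transformation log(1 + count / total counts × 10,000) applied per nucleus or cell, and that datasets are combined by a plain weighted mean. Function names are hypothetical.

```python
import math

def log_normalize(counts, scale=10_000):
    """Log-normalize the raw counts of one nucleus/cell across genes."""
    total = sum(counts)
    return [math.log1p(c / total * scale) for c in counts]

def weighted_mean_expression(values, n_cells):
    """Combine per-dataset mean expression values for one gene and one cell type,
    weighting each dataset by the number of nuclei/cells it contributes."""
    total = sum(n_cells)
    return sum(v * n for v, n in zip(values, n_cells)) / total
```

For example, `weighted_mean_expression([1.0, 3.0], [100, 300])` gives more weight to the second dataset because it contributes three times as many nuclei.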

We further aimed to identify genes differentially expressed between DCM and nonfailing hearts. To this end, we utilized results from cell-type-specific DE analysis for DCM versus nonfailing hearts, as described in Chaffin et al.²¹ and Koenig et al.⁴¹. For the published Chaffin et al. DE analysis, we considered results suggestive if reaching transcriptome-wide multiple-testing-adjusted two-sided P < 0.05 using CellBender-adjusted counts, without failing the ‘background contamination’ flag. For the published Koenig et al. DE analysis, we considered results suggestive if reaching transcriptome-wide multiple-testing-adjusted two-sided P < 0.05. In addition, we used the Reichart et al. dataset²² to perform a new DE analysis, comparing the 52 DCM LVs with 18 control LVs, using the same cell types that could be included for DE testing in their original publication (Supplementary Note). Again, a transcriptome-wide multiple-testing-adjusted two-sided P < 0.05 was considered suggestive. While we acknowledge that the cell types included in DE testing were not perfectly aligned across datasets, we approximately matched cell types to identify signals that were consistent across datasets (Supplementary Table 25). Finally, we declared significance for a gene if at least two of three datasets showed a suggestive result with concordant direction of effect within comparable cell types.
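The final decision rule can be sketched as follows for one gene within one set of comparable cell types. The input layout (a list of `(suggestive, log_fold_change)` pairs, one per dataset) and the function name are assumptions made for illustration.

```python
def is_de_significant(results):
    """results: one (suggestive, log_fold_change) pair per dataset for a given
    gene and matched cell type. Returns True if at least two datasets are
    suggestive with the same direction of effect."""
    up = sum(1 for suggestive, lfc in results if suggestive and lfc > 0)
    down = sum(1 for suggestive, lfc in results if suggestive and lfc < 0)
    return max(up, down) >= 2
```

Two suggestive results with opposite signs do not count as concordant, so such a gene would not be declared significant under this reading.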

MR for DCM

We used two-sample MR to identify potential causes and consequences of DCM using genetic data⁵⁰. To this end, we utilized the GWAS-DCM summary statistics and additionally collected published GWAS summary statistics for various common diseases and potential risk factors, including AF⁷⁷, CAD⁷⁸, type 2 diabetes⁷⁹, chronic kidney disease⁸⁰, HF⁵⁰, thyroid disease⁸¹, BMI⁸², alcohol use (drinks per day)⁸³, smoking (cigarettes per day)⁸³ and an additional 65 commonly measured quantitative traits (including blood pressure, anthropometry and laboratory values)⁸⁴. The GWAS summary statistics were chosen such that they were largely of European ancestry (and if European-only summary statistics were available, those were used; this was chosen to make the LD structure most comparable with the DCM-GWAS) and such that FinnGen was not included in the GWAS (to keep sample overlap to a reasonable minimum for two-sample MR).

We performed a bidirectional MR screen, where the above-mentioned traits were modeled as exposure and DCM modeled as outcome, and vice versa (DCM modeled as exposure). Harmonization of summary statistics is described in the Supplementary Note. For our discovery analysis, we used the WM method implemented in the R-package TwoSampleMR (v.0.5.6); the WM method may give more robust results than the inverse-variance-weighted approach in case of outliers⁵⁰. Results passing Bonferroni correction (two-sided P < 0.05/146 comparisons) were considered significant. As a secondary filter for significant results, we then used the MR-Egger method. MR-Egger has lower power but may better account for directional pleiotropy, and further provides an estimate of the regression intercept (which may flag implausible relationships between outcome and exposure effects due to correlated directional pleiotropy)⁵⁰. We required that signals persisted with Egger-slope two-sided P < 0.05 without a substantial Egger-intercept (two-sided P > 0.1).
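The two-stage filter described above can be written as a small predicate. This sketch assumes that the Bonferroni threshold is 0.05 divided by the 146 comparisons; the function and argument names are hypothetical, and the P values themselves would come from the MR software.

```python
def passes_mr_screen(p_wm, p_egger_slope, p_egger_intercept,
                     n_comparisons=146, alpha=0.05):
    """Discovery (weighted median, Bonferroni-corrected) followed by the
    MR-Egger secondary filter."""
    bonferroni = alpha / n_comparisons          # assumed threshold, about 3.4e-4
    discovery = p_wm < bonferroni
    egger_slope_ok = p_egger_slope < 0.05       # signal persists under MR-Egger
    egger_intercept_ok = p_egger_intercept > 0.1  # no substantial intercept
    return discovery and egger_slope_ok and egger_intercept_ok
```

Only exposure-outcome pairs for which this predicate is true would proceed to the CAUSE analysis.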

For any ‘exposure to DCM’ or ‘DCM to outcome’ pairs that remained after discovery and MR-Egger filtering, we then assessed the potential causal effect using CAUSE⁵² (v.1.2.0)—a recently proposed mixture approach that accounts for correlated and uncorrelated pleiotropy. In short, CAUSE assesses whether GWAS data for two traits are consistent with a causal effect, by fitting and comparing two nested models. These include a ‘sharing’ model that allows only a pleiotropic pathway, and a ‘causal’ model that additionally estimates a causal pathway. These models are compared using the expected log pointwise posterior density, and a one-sided P value is computed from a Z-test comparing the ‘causal’ model with the ‘sharing’ model⁵². For step 1 of CAUSE (estimating nuisance parameters), we used default parameters that include using 1 M random genome-wide markers for parameter estimation. For step 2 of CAUSE (estimating causal effects) we used filtered and pruned variants (two-sided P < 0.001 and r² < 0.0005 over 10 Mb windows) and otherwise default parameters.

PRS analyses

We then aimed to assess the performance of DCM PRS for prediction of DCM and systolic HF across ancestries and different clinical settings. To this end, we used the Amsterdam DCM cohort and the All of Us Research Program, as described below. In addition, we assessed the predictive capacity of the PRS in a third dataset, the UKB, as described in detail in the Supplementary Note.

Association with DCM and systolic HF in All of Us

All of Us is a cohort study enrolling participants from across the United States, with an emphasis on participants classically underrepresented in genetics research²⁰,⁸⁵. Whole genome sequencing data were available for over 245,000 participants, of whom 84% had complete electronic health record linkage. After quality control (Supplementary Note), we were left with 195,533 unrelated samples, of which 102,886 (52.6%) were of genetically defined European ancestry, and of which 928 had NI-DCM. Characteristics can be found in Supplementary Table 30.

From the GWAS-DCM and MTAG-DCM summary statistics, we created various PRS. Since MGB and All of Us have some overlapping samples⁸⁶, we reran our GWAS meta-analyses and MTAG omitting MGB for all PRS analyses described in All of Us. Using these updated summary statistics (GWAS-DCM (excluding MGB) and MTAG-DCM (excluding MGB)) we created genome-wide PRS using PRScs (v.2022-11 (ref. ⁵⁶)). We used the ‘auto’ function that learns the optimal shrinkage parameter directly from the GWAS summary statistics. Considering our discovery GWAS was of largely European ancestry, we used the ldblk_ukbb_eur files as LD reference. Participants in the All of Us dataset were subsequently scored using the ‘--score’ function in PLINK2 (ref. ⁷³). To account for ancestral differences in PRS distribution in this multi-ancestry dataset, we first regressed the first ten principal components (PCs) of ancestry out of the PRS values, and then standardized the residuals to mean 0 and unit variance.
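The final adjustment step (regressing ancestry PCs out of the PRS, then standardizing) can be illustrated with a dependency-free Python sketch. It residualizes the score by Gram-Schmidt orthogonalization of the centered covariates, which gives the same residuals as ordinary least squares with an intercept. The function names are hypothetical, and in practice a statistics package would normally be used instead.

```python
import math

def _center(v):
    m = sum(v) / len(v)
    return [x - m for x in v]

def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def residualize_and_standardize(prs, pcs):
    """prs: list of raw scores; pcs: list of covariate columns (for example ten
    PCs), each a list as long as prs. Returns residuals with mean 0, variance 1."""
    resid = _center(prs)
    basis = []
    for col in pcs:
        q = _center(col)
        # Orthogonalize this covariate against the covariates already kept.
        for b in basis:
            coef = _dot(q, b) / _dot(b, b)
            q = [x - coef * y for x, y in zip(q, b)]
        if _dot(q, q) > 1e-12:  # skip columns collinear with earlier ones
            basis.append(q)
    # Remove the projection of the score onto each orthogonal covariate.
    for b in basis:
        coef = _dot(resid, b) / _dot(b, b)
        resid = [x - coef * y for x, y in zip(resid, b)]
    sd = math.sqrt(_dot(resid, resid) / (len(resid) - 1))
    return [x / sd for x in resid]
```

After this step the adjusted score is uncorrelated with each supplied PC in the sample, so odds ratios are reported per standard deviation of the ancestry-adjusted PRS.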

We first tested the association of both PRS with NI-DCM, using logistic regression models adjusting for age, age², sex and PCs 1–10. We assessed the association of PRS in the entire multi-ancestry cohort, as well as within the three largest ancestral subgroups, namely European (n = 102,886), African (n = 40,496), and Admixed-American (n = 30,358) ancestry. Correcting for the number of tests, we considered results with P < 0.05/(2 × 4) = 0.00625 significant. In all PRS analyses, hypothesis tests were two-sided.

Using the best-performing PRS for DCM prediction (MTAG-DCM (excluding MGB)), we then assessed whether PRS could predict systolic HF. We used logistic regression models to predict systolic HF (defined using ICD10-CM code I50.2 and subcodes; Supplementary Note) using PRS, adjusting for age, age², sex and PCs 1–10. Additionally, we assessed whether the PRS could predict these outcomes across a range of clinical settings as a ‘second hit,’ namely after AF diagnosis, after hypertension diagnosis and after myocardial infarction. In these analyses, individuals with systolic HF coded before or concurrently with the initial event (for example, AF, hypertension, myocardial infarction) were removed from the respective analyses. Furthermore, we also assessed whether the PRS could predict systolic HF in carriers of likely pathogenic or pathogenic variants in high-confidence DCM genes (ClinGen strong/definitive; Supplementary Note). The significance cutoff was set to two-sided P < 0.05/6 = 0.0083. In all these models, we performed sensitivity analyses removing participants with NI-DCM and NICM to assess whether potential signals were driven by these hard phenotypes; we also performed analyses restricting to European ancestry participants to assess whether results were driven solely by continental ancestry.
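The exclusion rule for the ‘second hit’ analyses can be sketched as a simple date comparison. The function name and the use of one date per event are assumptions; the actual phenotype definitions are given in the Supplementary Note.

```python
from datetime import date

def eligible_for_second_hit(index_date, hf_date):
    """index_date: date of the initial event (for example AF diagnosis), or None.
    hf_date: date systolic HF was first coded, or None if never coded.
    Participants are kept only if they had the initial event and systolic HF
    was not coded before or on the same day as that event."""
    if index_date is None:
        return False  # not part of this clinical setting
    return hf_date is None or hf_date > index_date
```

For instance, `eligible_for_second_hit(date(2015, 3, 1), date(2015, 3, 1))` is `False`, because HF coded concurrently with the initial event is excluded.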

Cumulative contribution of rare and common variation to DCM in the Amsterdam cohort

We next assessed the distribution and discriminatory capacity of DCM PRS within the Amsterdam DCM cohort. The same general methodological framework from the All of Us cohort was applied to construct PRScs scores⁵⁶ in the Amsterdam (AUMC) dataset. Notably, however, we included MGB and omitted the Amsterdam cohort from GWAS-DCM and MTAG-DCM to prevent overfitting. As such, PRScs scores were created for GWAS-DCM (excluding AUMC) and MTAG-DCM (excluding AUMC). After scoring all individuals, the first ten PCs of ancestry were regressed out of the PRS values, and the residuals were scaled to mean 0 and variance 1 within the dataset.

We then tested whether the PRS based on GWAS-DCM (excluding AUMC) and MTAG-DCM (excluding AUMC) could discriminate between cases and controls, using logistic regression models adjusting for the first ten PCs of ancestry and sex. To assess performance in various subgroups, we assessed (1) all individuals, (2) individuals of European ancestry, (3) individuals of non-European ancestry, (4) male participants only and (5) female participants only. To determine significance, we used Bonferroni correction at two-sided P < 0.05/(2 scores × 5) = 0.005. We focused further analyses on the MTAG-DCM (excluding AUMC) PRS, which performed the best across groups (see above).

We then aimed to assess the cumulative contribution of common and rare genetic variation to clinical DCM, as described previously for rare arrhythmia syndromes⁵⁷,⁵⁸. We grouped DCM cases into ‘rare genotype-positive,’ ‘rare genotype-negative’ and ‘uncertain rare genotype,’ based on clinical genetic testing findings (Supplementary Note). We performed logistic regression analyses restricting to either ‘genotype-positive’ cases or ‘genotype-negative’ cases, comparing either with the general control group. We also assessed distributions of PRS using density plots across (1) controls, (2) all cases (n = 978), (3) genotype-positive cases (n = 193) and (4) genotype-negative cases (n = 294). To identify statistical difference between PRS distribution among genotype-positive and genotype-negative cases, we used linear regression analyses with PRS as outcome and rare variant status as predictor (adjusting for sex and PC 1–10; Supplementary Note). In sensitivity analyses, all above approaches were repeated, restricting to individuals of genetically determined European ancestry, to assess whether results were driven by continental ancestry.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Summary statistics for our GWAS meta-analyses have been made available for download through the Cardiovascular Disease Knowledge Portal (https://cvd.hugeamp.org/downloads.html); summary statistics for various meta-analyses, including clinical dataset-only and biobank dataset-only, are available (https://api.kpndataregistry.org/api/d/CQyqth). Our PRS scoring weights, for both GWAS and MTAG scores, have been deposited into the PGS Catalog (publication ID: PGP000672; score IDs: PGS004946–PGS004951) and into the Cardiovascular Disease Knowledge Portal (https://api.kpndataregistry.org/api/d/9jevLe). Individual-level data for the Meder et al. cohort, the Garnier et al. cohort, the Amsterdam UMC cohort and MGB will not be made publicly available at this time, due to the restrictive/sensitive nature of the genomic and/or phenotypic data in question. Access to individual-level UK Biobank data, both phenotypic and genetic, is available to bona fide researchers through application on the UK Biobank website (https://www.ukbiobank.ac.uk). Access to individual-level phenotypic and genetic data from the All of Us Research Program is currently available to bona fide researchers within the United States through the All of Us Researcher Workbench, a cloud-based computing platform (https://www.researchallofus.org/register/). The Finnish biobank data can be accessed through the Fingenious services (https://site.fingenious.fi/en/) managed by FINBB. Finnish Health register data can be applied for from Findata (https://findata.fi/en/data/). All processed snRNA-seq/scRNA-seq datasets used in the present study are publicly available: the Chaffin et al. dataset is available for download from the Broad Single Cell Portal (https://singlecell.broadinstitute.org/single_cell/study/SCP1303/single-nuclei-profiling-of-human-dilated-and-hypertrophic-cardiomyopathy); the Reichart et al. dataset was downloaded from GEO (https://www.ncbi.nlm.nih.gov/geo/download/?acc=GSE183852&format=file&file=GSE183852%5FDCM%5FIntegrated%2ERobj%2Egz); the Koenig et al. dataset was downloaded from CellxGene (https://datasets.cellxgene.cziscience.com/3716fb19-cedd-4fe5-abc4-5dbeb007fb65.rds). Other datasets include cis-eQTLs from the eQTLGen consortium (https://www.eqtlgen.org/cis-eqtls.html); cis-eQTLs from GTEx v.8 (https://www.gtexportal.org/home/downloads/adult-gtex#qtl) and tissue expression levels from GTEx v.8 (https://www.gtexportal.org/home/downloads/adult-gtex#bulk_tissue_expression); pQTLs derived from the UK Biobank PPP (summary statistics for the ‘combined’ set from https://www.synapse.org/#!Synapse:syn51364943/files/); the 22.10 update of the OpenTargets platform (https://genetics.opentargets.org/); GWAS Catalog queried in October 2023 (https://www.ebi.ac.uk/gwas/); ANNOVAR v.2017-07-17 (https://annovar.openbioinformatics.org/en/latest/); 1000Genomes project Phase 3 (https://www.internationalgenome.org/data/); gnomAD exomes v.2.1 (https://gnomad.broadinstitute.org/downloads); and the ClinVar database, accessed in April 2023 (https://www.ncbi.nlm.nih.gov/clinvar/).

Code availability

Processing of genotype data, quality control, imputation and genome-wide association analyses were performed with various software tools as described in Supplementary Table 2. Notably, in most of the datasets, various versions of PLINK were used for quality control (https://www.cog-genomics.org/plink/ and https://www.cog-genomics.org/plink/2.0/) and various versions of REGENIE were used for GWAS (https://github.com/rgcgithub/regenie). Meta-analysis of GWAS was performed using the 2011-03-25 release of METAL (https://github.com/statgen/METAL). Heritability and genetic correlation parameters were computed using LDSC v.1.0.1 (https://github.com/bulik/ldsc). Multitrait analysis of GWAS was performed using MTAG v.1.0.8 (https://github.com/JonJala/mtag). For Mendelian randomization analyses, we used R-packages TwoSampleMR v.0.5.6 (https://mrcieu.github.io/TwoSampleMR/), coloc v.4.0.4 (https://github.com/chr1swallace/coloc) and CAUSE v.1.2.0 (https://github.com/jean997/cause/tree/master), implemented in custom MR pipelines (https://github.com/seanjosephjurgens/MR_pipeline_sjj). Annotation of GWAS was performed using FUMA v.1.6.1 (https://fuma.ctglab.nl/) as well as MAGMA v.1.10 (https://ctg.cncr.nl/software/MAGMA/prog/magma_v1.10.zip) and PoPS v.0.2 (https://github.com/FinucaneLab/pops). Gene set enrichment analyses were performed using FUMA v.1.6.1 (https://fuma.ctglab.nl/) and g:Profiler v.September 20 2023 (https://biit.cs.ut.ee/gprofiler/). For cell-type-specific heritability analyses, we used R-packages edgeR v.3.22.3 (https://github.com/OliverVoogd/edgeR), DESeq2 v.1.20.0 (https://github.com/thelovelab/DESeq2) and limma v.3.36.2 (https://bioconductor.org/packages/release/bioc/html/limma.html) as well as stratified LDSC v.1.0.1 (https://github.com/bulik/ldsc). For wrangling of single-cell/nucleus data, we used R-package Seurat v.5.0 (https://github.com/satijalab/seurat). 
For polygenic scoring analyses, we used PRScs v.2022-11 (https://github.com/getian107/PRScs) and PLINK2 (https://www.cog-genomics.org/plink/2.0/; various versions from May 2020 release onwards). All analyses that were run in R, were run in R v.4.0.0.

Change history

04 December 2024
A Correction to this paper has been published: https://doi.org/10.1038/s41588-024-02047-4

References

Schultheiss, H.-P. et al. Dilated cardiomyopathy. Nat. Rev. Dis. Prim. 5, 32 (2019).
McNally, E. M. & Mestroni, L. Dilated cardiomyopathy: genetic determinants and mechanisms. Circ. Res. 121, 731–748 (2017).
McDonagh, T. A. et al. 2021 ESC Guidelines for the diagnosis and treatment of acute and chronic heart failure. Eur. Heart J. 42, 3599–3726 (2021).
Arbelo, E. et al. 2023 ESC Guidelines for the management of cardiomyopathies. Eur. Heart J. 44, 3503–3626 (2023).
Seferović, P. M. et al. Heart failure in cardiomyopathies: a position paper from the Heart Failure Association of the European Society of Cardiology. Eur. J. Heart Fail. 21, 553–576 (2019).
Dellefave-Castillo, L. M. et al. Assessment of the diagnostic yield of combined cardiomyopathy and arrhythmia genetic testing. JAMA Cardiol. 7, 966–974 (2022).
Mazzarotto, F. et al. Reevaluating the genetic contribution of monogenic dilated cardiomyopathy. Circulation 141, 387–398 (2020).
Garnier, S. et al. Genome-wide association analysis in dilated cardiomyopathy reveals two new players in systolic heart failure on chromosomes 3p25.1 and 22q11.23. Eur. Heart J. 42, 2000–2011 (2021).
Pirruccello, J. P. et al. Analysis of cardiac magnetic resonance imaging in 36,000 individuals yields genetic insights into dilated cardiomyopathy. Nat. Commun. 11, 2254 (2020).
Meder, B. et al. A genome-wide association study identifies 6p21 as novel risk locus for dilated cardiomyopathy. Eur. Heart J. 35, 1069–1077 (2014).
Tadros, R. et al. Shared genetic pathways contribute to risk of hypertrophic and dilated cardiomyopathies with opposite directions of effect. Nat. Genet. 53, 128–134 (2021).
Kurki, M. I. et al. FinnGen provides genetic insights from a well-phenotyped isolated population. Nature 613, 508–518 (2023).
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
Karlson, E. W., Boutin, N. T., Hoffnagle, A. G. & Allen, N. L. Building the partners healthcare biobank at partners personalized medicine: informed consent, return of research results, recruitment lessons and operational considerations. J. Pers. Med. 6, 2 (2016).
Aragam, K. G. et al. Phenotypic refinement of heart failure in a national biobank facilitates genetic discovery. Circulation 139, 489–501 (2019).
Levin, M. G. et al. Genome-wide association and multi-trait analyses characterize the common genetic architecture of heart failure. Nat. Commun. 13, 6914 (2022).
Zheng, S. L. et al. Genome-wide association analysis provides insights into the molecular etiology of dilated cardiomyopathy. Nat. Genet. https://doi.org/10.1038/s41588-024-01952-y (2024).
Turley, P. et al. Multi-trait analysis of genome-wide association summary statistics using MTAG. Nat. Genet. 50, 229–237 (2018).
Tadros, R. et al. Large scale genome-wide association analyses identify novel genetic loci and mechanisms in hypertrophic cardiomyopathy. Preprint at medRxiv https://doi.org/10.1101/2023.01.28.23285147 (2023).
Ramirez, A. H. et al. The All of Us research program: data quality, utility, and diversity. Patterns (N.Y.) 3, 100570 (2022).
Chaffin, M. et al. Single-nucleus profiling of human dilated and hypertrophic cardiomyopathy. Nature 608, 174–180 (2022).
Reichart, D. et al. Pathogenic variants damage cell composition and single cell transcription in cardiomyopathies. Science 377, eabo1984 (2022).
Finucane, H. K. et al. Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types. Nat. Genet. 50, 621–629 (2018).
Mountjoy, E. et al. An open approach to systematically prioritize causal variants and genes at all published human GWAS trait-associated loci. Nat. Genet. 53, 1527–1533 (2021).
Weeks, E. M. et al. Leveraging polygenic enrichments of gene features to predict genes underlying complex traits and diseases. Nat. Genet. 55, 1267–1276 (2023).
Gaziano, L. et al. Actionable druggable genome-wide Mendelian randomization identifies repurposing opportunities for COVID-19. Nat. Med. 27, 668–676 (2021).
Ochoa, J. P. et al. Formin homology 2 domain containing 3 (FHOD3) is a genetic basis for hypertrophic cardiomyopathy. J. Am. Coll. Cardiol. 72, 2457–2467 (2018).
Górska, A. A. et al. Muscle-specific Cand2 is translationally upregulated by mTORC1 and promotes adverse cardiac remodeling. EMBO Rep. 22, e52170 (2021).
Stanchi, F. et al. TUBA8: a new tissue-specific isoform of alpha-tubulin that is highly conserved in human and mouse. Biochem. Biophys. Res. Commun. 270, 1111–1118 (2000).
Karczewski, K. J. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443 (2020).
GTEx Consortium. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369, 1318–1330 (2020).
Võsa, U. et al. Large-scale cis- and trans-eQTL analyses identify thousands of genetic loci and polygenic scores that regulate blood gene expression. Nat. Genet. 53, 1300–1310 (2021).
Sun, B. B. et al. Plasma proteomic associations with genetics and health in the UK Biobank. Nature 622, 329–338 (2023).
Luo, W. et al. TMEM182 interacts with integrin beta 1 and regulates myoblast differentiation and muscle regeneration. J. Cachexia Sarcopenia Muscle 12, 1704–1723 (2021).
Al-Yacoub, N. et al. Mutation in FBXO32 causes dilated cardiomyopathy through up-regulation of ER-stress mediated apoptosis. Commun. Biol. 4, 884 (2021).
Lipov, A. et al. Exploring the complex spectrum of dominance and recessiveness in genetic cardiomyopathies. Nat. Cardiovasc. Res. 2, 1078–1094 (2023).
Wu, T. et al. HSPB7 is indispensable for heart development by modulating actin filament assembly. Proc. Natl Acad. Sci. USA 114, 11956–11961 (2017).
Pecorari, I., Mestroni, L. & Sbaizero, O. Current understanding of the role of cytoskeletal cross-linkers in the onset and development of cardiomyopathies. Int. J. Mol. Sci. 21, 5865 (2020).
Sequeira, V., Nijenkamp, L. L. A. M., Regan, J. A. & van der Velden, J. The physiological role of cardiac cytoskeleton and its alterations in heart failure. Biochim. Biophys. Acta 1838, 700–722 (2014).
Rawat, P. S., Jaiswal, A., Khurana, A., Bhatti, J. S. & Navik, U. Doxorubicin-induced cardiotoxicity: an update on the molecular mechanism and novel therapeutic strategies for effective management. Biomed. Pharmacother. 139, 111708 (2021).
Koenig, A. L. et al. Single-cell transcriptomics reveals cell-type-specific diversification in human heart failure. Nat. Cardiovasc Res 1, 263–280 (2022).
Vikhorev, P. G. & Vikhoreva, N. N. Cardiomyopathies and related changes in contractility of human heart muscle. Int. J. Mol. Sci. 19, 2234 (2018).
Huang, X., Qu, R., Ouyang, J., Zhong, S. & Dai, J. An overview of the cytoskeleton-associated role of PDLIM5. Front. Physiol. 11, 975 (2020).
Tshori, S. et al. Transcription factor MITF regulates cardiac growth and hypertrophy. J. Clin. Invest. 116, 2673–2681 (2006).
Cattin, M.-E. et al. Deletion of MLIP (muscle-enriched A-type lamin-interacting protein) leads to cardiac hyperactivation of Akt/mammalian target of rapamycin (mTOR) and impaired cardiac adaptation. J. Biol. Chem. 290, 26699–26714 (2015).
Le Goff, C. et al. Heterozygous mutations in MAP3K7, encoding TGF-β-activated kinase 1, cause cardiospondylocarpofacial syndrome. Am. J. Hum. Genet. 99, 407–413 (2016).
Kessler, T. et al. ADAMTS-7 inhibits re-endothelialization of injured arteries and promotes vascular remodeling through cleavage of thrombospondin-1. Circulation 131, 1191–1201 (2015).
Sutanto, H. et al. Cardiomyocyte calcium handling in health and disease: insights from in vitro and in silico studies. Prog. Biophys. Mol. Biol. 157, 54–75 (2020).
Braz, J. C. et al. PKC-alpha regulates cardiac contractility and propensity toward heart failure. Nat. Med. 10, 248–254 (2004).
Burgess, S. et al. Guidelines for performing Mendelian randomization investigations: update for summer 2023. Wellcome Open Res 4, 186 (2019).
Shah, S. et al. Genome-wide association and Mendelian randomisation analysis provide insights into the pathogenesis of heart failure. Nat. Commun. 11, 163 (2020).
Morrison, J., Knoblauch, N., Marcus, J. H., Stephens, M. & He, X. Mendelian randomization accounting for correlated and uncorrelated pleiotropic effects using genome-wide summary statistics. Nat. Genet. 52, 740–747 (2020).
Biddinger, K. J. et al. Rare and common genetic variation underlying the risk of hypertrophic cardiomyopathy in a national biobank. JAMA Cardiol. 7, 715–722 (2022).
Robertson, J. et al. Body mass index in young women and risk of cardiomyopathy: a long-term follow-up study in Sweden. Circulation 141, 520–529 (2020).
Robertson, J. et al. Higher body mass index in adolescence predicts cardiomyopathy risk in midlife. Circulation 140, 117–125 (2019).
Ge, T., Chen, C.-Y., Ni, Y., Feng, Y.-C. A. & Smoller, J. W. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat. Commun. 10, 1776 (2019).
Barc, J. et al. Genome-wide association analyses identify new Brugada syndrome risk loci and highlight a new mechanism of sodium channel regulation in disease susceptibility. Nat. Genet. 54, 232–239 (2022).
Lahrouchi, N. et al. Transethnic genome-wide association study provides insights in the genetic architecture and heritability of long QT syndrome. Circulation 142, 324–338 (2020).
Urbich, M. et al. A systematic review of medical costs associated with heart failure in the USA (2014–2020). Pharmacoeconomics 38, 1219–1236 (2020).
Heidenreich, P. A. et al. 2022 AHA/ACC/HFSA guideline for the management of heart failure: a report of the American College of Cardiology/American Heart Association Joint Committee on Clinical Practice Guidelines. Circulation 145, e895–e1032 (2022).
Mbatchou, J. et al. Computationally efficient whole-genome regression for quantitative and binary traits. Nat. Genet. 53, 1097–1103 (2021).
Luu, P.-L., Ong, P.-T., Dinh, T.-P. & Clark, S. J. Benchmark study comparing liftover tools for genome conversion of epigenome sequencing data. NAR Genom. Bioinform. 2, lqaa054 (2020).
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Bulik-Sullivan, B. K. et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
1000 Genomes Project Consortium. et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).
Pinto, Y. M. et al. Proposal for a revised definition of dilated cardiomyopathy, hypokinetic non-dilated cardiomyopathy, and its implications for clinical practice: a position statement of the ESC working group on myocardial and pericardial diseases. Eur. Heart J. 37, 1850–1858 (2016).
Watanabe, K., Taskesen, E., van Bochoven, A. & Posthuma, D. Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 8, 1826 (2017).
de Leeuw, C. A., Mooij, J. M., Heskes, T. & Posthuma, D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput. Biol. 11, e1004219 (2015).
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164 (2010).
Sollis, E. et al. The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res. 51, D977–D985 (2023).
Carvalho-Silva, D. et al. Open targets platform: new developments and updates two years on. Nucleic Acids Res. 47, D1056–D1065 (2019).
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
Hemani, G. et al. The MR-base platform supports systematic causal inference across the human phenome. eLife 7, e34408 (2018).
Giambartolomei, C. et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 10, e1004383 (2014).
Raudvere, U. et al. g:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update). Nucleic Acids Res. 47, W191–W198 (2019).
Roselli, C. et al. Multi-ethnic genome-wide association study for atrial fibrillation. Nat. Genet. 50, 1225–1233 (2018).
Nikpay, M. et al. A comprehensive 1,000 genomes-based genome-wide association meta-analysis of coronary artery disease. Nat. Genet. 47, 1121–1130 (2015).
Mahajan, A. et al. Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps. Nat. Genet. 50, 1505–1513 (2018).
Wuttke, M. et al. A catalog of genetic loci associated with kidney function from analyses of a million individuals. Nat. Genet. 51, 957–972 (2019).
Saevarsdottir, S. et al. FLT3 stop mutation increases FLT3 ligand level and risk of autoimmune thyroid disease. Nature 584, 619–623 (2020).
Pulit, S. L. et al. Meta-analysis of genome-wide association studies for body fat distribution in 694 649 individuals of European ancestry. Hum. Mol. Genet. 28, 166–174 (2019).
Liu, M. et al. Association studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use. Nat. Genet. 51, 237–244 (2019).
Jurgens, S. J. et al. Adjusting for common variant polygenic scores improves yield in rare variant association analyses. Nat. Genet. 55, 544–548 (2023).
Buniello, A. et al. The NHGRI-EBI GWAS catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005–D1012 (2019).
Jurgens, S. J., Wang, X. & Choi, S. H. et al. Rare coding variant analysis for human diseases across biobanks and ancestries. Nat. Genet. 56, 1811–1820 (2024).

Acknowledgements

We gratefully thank all research participants, as this work would not have been possible without their contributions. Relevant funding sources for the FinnGen, UKB and All of Us datasets are presented in the Supplementary Note. S.J.J. received research support through the Junior Clinical Scientist Fellowship from the Dutch Heart Foundation (03-007-2022-0035), as well as a doctoral fellowship from the Amsterdam UMC. J.T.R. was supported by a research grant from the Aarne Koskelo Foundation. S.K. was supported by the Walter Benjamin Fellowship from the Deutsche Forschungsgemeinschaft (521832260). L.F.J.M.W. was supported by the Amsterdam UMC YTF, Dutch Heart Foundation Student Grant and the AFIP foundation. P.T.E. was supported by funding from the National Institutes of Health (1RO1HL092577, 1R01HL157635, 5R01HL139731), by a grant from the American Heart Association (18SFRN34110082) and from the European Union (MAESTRIA 965286). This work was further supported by a grant from the National Institutes of Health (1K08HL153937) and a grant from the American Heart Association (862032) to K.G.A. C.R.B. was supported by funding from the Dutch Heart Foundation (CVON 2018-30 PREDICT2) and the Pathfinder Cardiogenomics programme of the European Innovation Council of the European Union (DCM-NEXT). A.A.M.W., D.R.K. and Y.M.P. were supported by funding from the PSIDER programme of the Netherlands Organisation for Health Research and Development (ZonMW; project 40-46800-98-018). T.N. was supported by grants from the Sigrid Jusélius Foundation, the Finnish Foundation for Cardiovascular Research and the Finnish Research Council (grants 321351 and 354447).
This work was supported by a grant from the GENMED Laboratory of Excellence on Medical Genomics (ANR-10-LABX-0013), a research program managed by the National Research Agency (ANR) as part of the French Investment for the Future; Aviesan-ITMO Genetique-Genomique-Bioinformatique (ResDiCard: Resolving diagnostic deadlock in cardiomyopathies) and the Société Française de Cardiologie/Fédération Française de Cardiologie; the SFB-TR19 registry was supported by the Deutsche Forschungsgemeinschaft (DFG). The Study of Health in Pomerania (SHIP) is part of the Community Medicine Research net of the University of Greifswald, Germany, funded by the Federal Ministry of Education and Research (grants 01ZZ9603, 01ZZ0103, and 01ZZ0403); the Ministry of Cultural Affairs and the Social Ministry of the Federal State of Mecklenburg-West Pomerania; and grants from the German Center for Cardiovascular Research (DZHK). The KORA study was initiated and financed by the Helmholtz Zentrum München (German Research Center for Environmental Health), funded by the German Federal Ministry of Education and Research (BMBF) and by the State of Bavaria. KORA research was supported at the Munich Center of Health Sciences (MC-Health), Ludwig-Maximilians-Universität, as part of LMUinnovativ. D.-A.T. is supported by the ‘EPIDEMIOM-VTE’ Senior Chair from the Initiative of Excellence of the University of Bordeaux. This research is based on data from the Million Veteran Program, Office of Research and Development, Veterans Health Administration, and was supported by award I01-BX003362 to P.S.T. and K.-M.C. This publication does not represent the views of the Department of Veterans Affairs or the United States Government.
This work was also supported by the Sir Jules Thorn Charitable Trust (21JTA), Medical Research Council (UK), British Heart Foundation (RE/18/4/34215, SP/17/11/32885), the NIHR Imperial College Biomedical Research Centre, Pathfinder Cardiogenomics programme of the European Innovation Council of the European Union (DCM-NEXT) (101115416) to J.S.W.

Author information

These authors contributed equally: Sean J. Jurgens, Joel T. Rämö, Daria R. Kramarenko, Leonoor F. J. M. Wijdeveld, Jan Haas, Mark D. Chaffin, Sophie Garnier, Liam Gaziano.
These authors jointly supervised this work: Jari Laukkanen, Aarno Palotie, Ahmad S. Amin, Philippe Charron, Benjamin Meder, Patrick T. Ellinor, Mark Daly, Krishna G. Aragam, Connie R. Bezzina.
Full lists of members and their affiliations appear in the Supplementary Information.

Authors and Affiliations

Department of Experimental Cardiology, Amsterdam Cardiovascular Sciences, Heart Failure & Arrhythmias, Amsterdam UMC location, University of Amsterdam, Amsterdam, the Netherlands
Sean J. Jurgens, Daria R. Kramarenko, Alex Lipov, Christian Krijger Juárez, Edwin Poel, Leander Beekman, Dominic S. Zimmerman, Rafik Tadros, Yigal M. Pinto, Arthur A. M. Wilde, Roddy Walsh, Ahmad S. Amin & Connie R. Bezzina
Cardiovascular Disease Initiative, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Sean J. Jurgens, Joel T. Rämö, Leonoor F. J. M. Wijdeveld, Mark D. Chaffin, Liam Gaziano, Lu-Chen Weng, Saketh Challa, Carmen Diaz Verdugo, Shinwan Kany, Kiran Biddinger, Xin Wang, Richard Ruan, Satoshi Koyama, Seung Hoan Choi, Patrick T. Ellinor & Krishna G. Aragam
Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
Sean J. Jurgens, Joel T. Rämö, Liam Gaziano, Lu-Chen Weng, Shinwan Kany, Satoshi Koyama, Patrick T. Ellinor & Krishna G. Aragam
Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
Joel T. Rämö, Amanda L. Elliott, Aarno Palotie & Mark Daly
European Reference Network for rare low prevalence and complex diseases of the heart: ERN GUARD-Heart, Amsterdam, the Netherlands
Daria R. Kramarenko, Yigal M. Pinto, Arthur A. M. Wilde, Ahmad S. Amin & Connie R. Bezzina
Department of Physiology, Amsterdam UMC location, Vrije Universiteit, Amsterdam, the Netherlands
Leonoor F. J. M. Wijdeveld
Department of Medicine III, Institute for Cardiomyopathies Heidelberg (ICH), University Hospital Heidelberg, Heidelberg, Germany
Jan Haas & Benjamin Meder
Site Heidelberg/Mannheim, DZHK, Heidelberg, Germany
Jan Haas & Benjamin Meder
Research Unit on Cardiovascular Disorders, Metabolism and Nutrition, Team Genomics and Pathophysiology of Cardiovascular Disease, Sorbonne Université, INSERM, Paris, France
Sophie Garnier, Eric Villard, Richard Isnard & Philippe Charron
ICAN Institute for Cardiometabolism and Nutrition, Paris, France
Sophie Garnier, Eric Villard, Richard Isnard & Philippe Charron
National Heart and Lung Institute, Imperial College London, London, UK
Sean L. Zheng, Catherine Francis, Paul M. Matthews & James S. Ware
MRC Laboratory of Medical Sciences, Imperial College London, London, UK
Sean L. Zheng, Paul M. Matthews & James S. Ware
Royal Brompton and Harefield Hospitals, Guy’s and St. Thomas’ NHS Foundation Trust, London, UK
Sean L. Zheng, Catherine Francis & James S. Ware
Institute of Cardiovascular Science, University College London, London, UK
Albert Henry
Institute of Health Informatics, University College London, London, UK
Albert Henry & R. Thomas Lumbers
Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), VA Boston Healthcare System, Boston, MA, USA
Jennifer E. Huffman
Palo Alto Veterans Institute for Research (PAVIR), Palo Alto Health Care System, Palo Alto, CA, USA
Jennifer E. Huffman
Harvard Medical School, Boston, MA, USA
Jennifer E. Huffman & Amanda L. Elliott
Bioinformatics Core Facility, Institute of Molecular Biology gGmbH (IMB), Mainz, Germany
Frank Rühle
Department of Genetic Epidemiology, Institute of Human Genetics, University of Münster, Münster, Germany
Frank Rühle
Department of Cardiology, University Heart and Vascular Center, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
Shinwan Kany
Department of Clinical Genetics, Amsterdam UMC location, University of Amsterdam, Amsterdam, the Netherlands
Constance A. van Orsouw & Saskia van der Crabben
Department of Psychiatry and Center for Genomic Medicine, Psychiatric and Neurodevelopmental Genetics Unit, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
Amanda L. Elliott
Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Amanda L. Elliott & James S. Ware
Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Amanda L. Elliott
CEA, Centre National de Recherche en Génomique Humaine, Université Paris-Saclay, Evry, France
Jean-François Deleuze
Laboratory of Excellence in Medical Genomics, GENMED, Evry, France
Jean-François Deleuze & David-Alexandre Trégouët
Fondation Jean Dausset, Centre d’Etude du Polymorphisme Humain, Paris, France
Jean-François Deleuze
Bordeaux Population Health Research Center, UMR 1219, University of Bordeaux, INSERM, Bordeaux, France
David-Alexandre Trégouët
APHP, Cardiology and Genetics Departments, Pitié-Salpêtrière Hospital, Paris, France
Richard Isnard & Philippe Charron
Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
Dorret I. Boomsma, Eco J. C. de Geus & Jouke-Jan Hottenga
Amsterdam Public Health Research Institute, Amsterdam UMC location, Vrije Universiteit, Amsterdam, the Netherlands
Eco J. C. de Geus
Cardiovascular Genetics Centre, Montreal Heart Institute, Montreal, QC, Canada
Rafik Tadros
Faculty of Medicine, Université de Montréal, Montreal, QC, Canada
Rafik Tadros
Department of Clinical Cardiology, Amsterdam Cardiovascular Sciences, Heart Failure and Arrhythmias, Amsterdam UMC location, University of Amsterdam, Amsterdam, the Netherlands
Yigal M. Pinto, Arthur A. M. Wilde, Amand F. Schmidt & Ahmad S. Amin
The Netherlands Twin Register, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
Jouke-Jan Hottenga
Department of Cardiology, Helsinki University Hospital, Helsinki, Finland
Juha Sinisalo
Heart and Lung Center, Helsinki University Hospital and Helsinki University, Helsinki, Finland
Juha Sinisalo
Department of Internal Medicine, University of Turku, Turku, Finland
Teemu Niiranen
Division of Medicine, Turku University Hospital, Turku, Finland
Teemu Niiranen
Finnish Institute for Health and Welfare (THL), Helsinki, Finland
Teemu Niiranen
Institute of Cardiovascular Science, Faculty of Population Health, University College London, London, UK
Amand F. Schmidt
University College London British Heart Foundation Research Accelerator, London, UK
Amand F. Schmidt
Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
Amand F. Schmidt
Department of Biostatistics, Boston University, Boston, MA, USA
Seung Hoan Choi
Corporal Michael J. Crescenz VA Medical Center, Philadelphia, PA, USA
Kyong-Mi Chang
Department of Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
Kyong-Mi Chang
Palo Alto Health Care System, Palo Alto, CA, USA
Philip S. Tsao
Department of Medicine and Cardiovascular Institute, Stanford University School of Medicine, Stanford, CA, USA
Philip S. Tsao
The National Institute for Health Research, University College London Hospitals Biomedical Research Centre, University College London, London, UK
R. Thomas Lumbers
Department of Medicine, Institute of Clinical Medicine, University of Eastern Finland, Kuopio, Finland
Jari Laukkanen
Central Finland Biobank, Central Finland Health Care District, Jyväskylä, Finland
Jari Laukkanen
Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Aarno Palotie & Mark Daly
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
Aarno Palotie & Mark Daly

Authors

Sean J. Jurgens
Joel T. Rämö
Daria R. Kramarenko
Leonoor F. J. M. Wijdeveld
Jan Haas
Mark D. Chaffin
Sophie Garnier
Liam Gaziano
Lu-Chen Weng
Alex Lipov
Sean L. Zheng
Albert Henry
Jennifer E. Huffman
Saketh Challa
Frank Rühle
Carmen Diaz Verdugo
Christian Krijger Juárez
Shinwan Kany
Constance A. van Orsouw
Kiran Biddinger
Edwin Poel
Amanda L. Elliott
Xin Wang
Catherine Francis
Richard Ruan
Satoshi Koyama
Leander Beekman
Dominic S. Zimmerman
Jean-François Deleuze
Eric Villard
David-Alexandre Trégouët
Richard Isnard
Dorret I. Boomsma
Eco J. C. de Geus
Rafik Tadros
Yigal M. Pinto
Arthur A. M. Wilde
Jouke-Jan Hottenga
Juha Sinisalo
Teemu Niiranen
Roddy Walsh
Amand F. Schmidt
Seung Hoan Choi
Kyong-Mi Chang
Philip S. Tsao
Paul M. Matthews
James S. Ware
R. Thomas Lumbers
Saskia van der Crabben
Jari Laukkanen
Aarno Palotie
Ahmad S. Amin
Philippe Charron
Benjamin Meder
Patrick T. Ellinor
Mark Daly
Krishna G. Aragam
Connie R. Bezzina

Consortia

FinnGen

Joel T. Rämö, Amanda L. Elliott, Juha Sinisalo, Teemu Niiranen, Jari Laukkanen, Aarno Palotie & Mark Daly

VA Million Veteran Program

Jennifer E. Huffman, Krishna G. Aragam, Kyong-Mi Chang & Philip S. Tsao

HERMES Consortium

Sean L. Zheng, Albert Henry, Kiran Biddinger, Patrick T. Ellinor, Krishna G. Aragam, James S. Ware & R. Thomas Lumbers

Contributions

S.J.J., K.G.A. and C.R.B. conceived the study. S.J.J., J.T.R., P.T.E., M.D., K.G.A. and C.R.B. were responsible for the overall study design. S.J.J., J.T.R., D.R.K., J.H. and S.G. contributed to the main discovery GWAS analyses. S.J.J., J.T.R. and L.G. performed the main gene prioritization analyses, while J.T.R., D.R.K. and L.F.J.M.W. performed the main polygenic risk score analyses. M.D.C. performed the main single-cell analyses pertaining to cell-type enrichment and differential expression, while S.J.J. harmonized single-cell data for gene-level expression patterns. S.J.J. and L.F.J.M.W. performed data visualization, with support from D.R.K., M.D.C. and A.L. Statistical and analytical support was provided by L.-C.W., A.L., F.R., S.K., K.B., A.L.E., X.W., S.K. and D.S.Z. GWAS data for cardiac MRI phenotypes were contributed by C.F. and P.M.M. At the Amsterdam UMC site, patient inclusions and database management saw contributions from D.R.K., C.K.J., C.A.v.O., E.P., Y.M.P., S.v.d.C., A.S.A. and C.R.B., while L.B. handled and processed patient samples. D.I.B., E.J.C.d.G. and J.-J.H. contributed control sample data for the Amsterdam dataset. J.S., T.N., J.L., A.P. and M.D. were responsible for phenotyping, analysis supervision and oversight within FinnGen. J.-F.D., E.V., D.-A.T., R.I. and P.C. were responsible for oversight, inclusions and analysis of French DCM patient samples, while B.M. was responsible for oversight of DCM-GWAS data from Heidelberg. The biological relevance of prioritized genes was assessed by C.D.V. and R.R. using in-house wet-laboratory data. S.L.Z., A.H., J.S.W. and T.L. were responsible for replication data from the HERMES dataset, while J.E.H., S.C., K.-M.C. and P.S.T. were responsible for replication data from the Million Veteran Program. S.J.J. performed replication analyses using the All of Us data, and performed the meta-analysis of the various replication datasets.
Important intellectual contributions were provided by R.T., Y.M.P., A.A.M.W., J.S., T.N., R.W., A.F.S., S.H.C., J.S.W., T.L. and S.v.d.C., while main supervision of the study was provided by J.L., A.P., A.S.A., P.C., B.M., P.T.E., M.D., K.G.A. and C.R.B. S.J.J., J.T.R., D.R.K., P.T.E., M.D., K.G.A. and C.R.B. wrote the manuscript. All authors reviewed and critically revised the manuscript.

Corresponding authors

Correspondence to Patrick T. Ellinor, Mark Daly, Krishna G. Aragam or Connie R. Bezzina.

Ethics declarations

Competing interests

P.T.E. has received sponsored research support from Bayer AG, IBM Health, Bristol Myers Squibb and Pfizer; he has consulted for Bayer AG, Novartis and MyoKardia. K.G.A. has received sponsored research support from Sarepta Therapeutics and Bayer AG, and reports a research collaboration with Novartis. Y.M.P. is involved in the development of therapies for DCM as an advisor to Forbion and Medical Director at ARMGO pharma and CMO at Phlox Therapeutics. P.C. reports personal fees for consultancies, outside the present work, for Amicus, OWKIN, Pfizer and SANOFI. J.S.W. has received research support from Bristol Myers Squibb, and has acted as a consultant for MyoKardia, Pfizer, Foresite Labs, Health Lumen and Tenaya Therapeutics. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Genetics thanks Matthias Heinig, Guillaume Paré and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Manhattan plots for biobank meta-analysis for NI-DCM and NICM.

Each panel shows a Manhattan plot for a GWAS meta-analysis of a phenotype across biobank datasets (FinnGen+UKB+MGB). The top plot shows the results for the strict NI-DCM phenotype (N = 5,022 cases; N = 932,941 controls), and the bottom plot shows results for the broader NICM phenotype (N = 13,478 cases; N = 932,873 controls). In both plots, each dot represents a single tested variant, the x-axis shows the genomic coordinates of those variants (chromosome and position on chromosome), and the y-axis shows the -log10 of the P-value from the GWAS. P-values are derived from inverse-variance-weighted meta-analysis of logistic regression models; reported P-values are two-sided and unadjusted for multiple testing. The red line indicates the conventional genome-wide significance level (alpha = 5 × 10⁻⁸). Loci exceeding the significance line are annotated with a gene name. For ease of comparison, the annotated gene is harmonized with the locus name from our main GWAS (i.e., the highest prioritized gene in the locus from GWAS-DCM/MTAG-DCM); in some cases an additional gene is highlighted to ease comparison with previously published GWAS; if a locus was not identified in GWAS-DCM/MTAG-DCM, the closest protein-coding gene is used. Note: GWAS, genome-wide association study; NICM, nonischemic cardiomyopathy; NI-DCM, nonischemic dilated cardiomyopathy.
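
For readers unfamiliar with the quantities plotted here, the following is a minimal sketch of a fixed-effect inverse-variance-weighted meta-analysis for a single variant, together with the -log10 transformation used on the y-axis. It is an illustration only and not the pipeline used in this study; the function names and the example per-cohort estimates are hypothetical.

```python
import math

GENOME_WIDE_ALPHA = 5e-8  # conventional genome-wide significance level


def ivw_meta(betas, ses):
    """Fixed-effect inverse-variance-weighted meta-analysis of per-cohort
    log-odds estimates (betas) and their standard errors (ses)."""
    weights = [1.0 / se ** 2 for se in ses]
    total = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total
    se = math.sqrt(1.0 / total)
    z = beta / se
    # Two-sided P-value from the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return beta, se, p


def neg_log10_p(p):
    """Value plotted on the y-axis of a Manhattan plot."""
    return -math.log10(p)


# Hypothetical estimates for one variant in two cohorts.
beta, se, p = ivw_meta([0.10, 0.10], [0.05, 0.05])
significant = p < GENOME_WIDE_ALPHA
```

In this sketch a variant would sit above the red line only if `neg_log10_p(p)` exceeds `neg_log10_p(GENOME_WIDE_ALPHA)`, which is about 7.3.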

Extended Data Fig. 2 Quantile-quantile plots for the final meta-analysis (GWAS-DCM) and the final MTAG analysis (MTAG-DCM).

The quantile-quantile plots show results for the GWAS meta-analysis of DCM (left) and for the MTAG analysis of DCM with cardiac MRI traits (right). In each quantile-quantile plot, the x-axis represents the expected -log10 of the P-value of variants under the null hypothesis, while the y-axis represents the observed -log10 of the P-value in the analysis. The top corner shows calibration statistics, namely (i) the inflation factor lambda, computed from all plotted variants as the median observed χ² statistic divided by its expected median under the null; (ii) the inflation factor computed by LDSC, which filters to high-confidence common genetic variants found in its internal reference; and (iii) the LDSC intercept, which quantifies the residual inflation due to biases (computed as the intercept in a regression of χ² statistics on linkage-disequilibrium scores⁶³). P-values are derived from inverse-variance-weighted meta-analysis of logistic regression models (GWAS-DCM) or from MTAG analysis of such results (MTAG-DCM); reported P-values are two-sided and unadjusted for multiple testing. Note: GWAS, genome-wide association study; MTAG, multi-trait analysis GWAS; DCM, dilated cardiomyopathy; LDSC, linkage-disequilibrium score regression.
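
As an illustration of the first calibration statistic, the sketch below computes the inflation factor lambda from a list of two-sided P-values using only the Python standard library. It assumes 1-degree-of-freedom test statistics and P-values strictly greater than 0; the function names are hypothetical, and this is not the code used to produce the figure.

```python
from statistics import NormalDist, median

_STD_NORMAL = NormalDist()


def chi2_from_p(p):
    """1-df chi-square statistic corresponding to a two-sided P-value.
    Requires 0 < p <= 1."""
    return _STD_NORMAL.inv_cdf(p / 2.0) ** 2


def genomic_inflation(p_values):
    """Median observed chi-square divided by the median expected under
    the null (about 0.455 for 1 degree of freedom)."""
    observed = median(chi2_from_p(p) for p in p_values)
    expected = chi2_from_p(0.5)
    return observed / expected


# Perfectly uniform P-values give lambda = 1 (no inflation).
uniform = [(i + 0.5) / 1001 for i in range(1001)]
lambda_null = genomic_inflation(uniform)
```

Values of lambda well above 1 can reflect either confounding or true polygenic signal, which is why the LDSC intercept is reported alongside it.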

Extended Data Fig. 3 Venn Diagram highlighting the loci identified in GWAS-DCM and MTAG-DCM.

This Venn diagram shows loci that were significantly associated in GWAS-DCM, MTAG-DCM, or both. The right ellipse shows results from GWAS-DCM, the left ellipse shows results from MTAG-DCM, and the overlapping area shows loci found in both. A genomic locus was defined based on distance, taking the top index variant in a region and merging it with other potential index variants within a 1 Mb window upstream or downstream (MTAG-DCM and GWAS-DCM loci were likewise merged based on distance). Loci are annotated with the most highly prioritized gene using our methodology (Methods); where MTAG-DCM and GWAS-DCM prioritized different genes for overlapping loci, one was chosen at random for annotation. Loci are also annotated with their genomic coordinates (chromosome:position in megabases) for GRCh37. Loci annotated in red were ‘novel’, defined as not within 1 Mb of a previously described locus from a peer-reviewed published genome-wide association study for DCM, or MTAG for DCM, based on querying the GWAS Catalog⁸⁵, Open Targets²³ and two previous larger studies⁸,¹¹. Note: GWAS, genome-wide association study; MTAG, multi-trait analysis GWAS; DCM, dilated cardiomyopathy.
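
The distance-based locus definition and the novelty rule described in this legend can be sketched as follows. This is a simplified greedy interpretation (take the most significant variant, absorb everything within 1 Mb, repeat) and is an assumption on our part, not the exact procedure used in the study; the function names and example coordinates are hypothetical.

```python
def define_loci(variants, window=1_000_000):
    """Greedy distance-based locus definition.

    variants: iterable of (chrom, pos, p_value) tuples for significant variants.
    Returns a list of loci, each with a lead variant and its merged members.
    """
    remaining = sorted(variants, key=lambda v: v[2])  # most significant first
    loci = []
    while remaining:
        lead = remaining[0]
        members = [
            v for v in remaining
            if v[0] == lead[0] and abs(v[1] - lead[1]) <= window
        ]
        loci.append({"lead": lead, "members": members})
        remaining = [v for v in remaining if v not in members]
    return loci


def is_novel(lead, known_loci, window=1_000_000):
    """A locus is 'novel' if no known locus (chrom, pos) lies within the window."""
    return not any(
        chrom == lead[0] and abs(pos - lead[1]) <= window
        for chrom, pos in known_loci
    )


# Hypothetical significant variants: (chromosome, position, P-value).
hits = [
    ("1", 1_000_000, 1e-10),
    ("1", 1_500_000, 1e-9),
    ("1", 3_000_000, 1e-12),
    ("2", 1_200_000, 1e-8),
]
loci = define_loci(hits)
```

With these inputs the two chromosome 1 variants that lie 0.5 Mb apart are merged into one locus, while the variant 1.5 Mb further along remains separate.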

Extended Data Fig. 4 Independent replication of GWAS-DCM and MTAG-DCM loci.

The figure summarizes the replication effort performed within the HERMES, Million Veteran Program (MVP) and All of Us (AoU) datasets. Part a shows the replication effort for GWAS-DCM loci, while part b shows results for replication of the MTAG-DCM loci. In both parts, data are restricted to loci passing quality control for replication, and to a single lead variant per locus (the lead variant with the strongest significance in discovery). The left panels show dot plots, with the effect sizes from discovery (i.e., GWAS-DCM or MTAG-DCM) on the x-axis and the estimated effect sizes from the replication set on the y-axis (a meta-analysis of independent cohorts/samples from HERMES, MVP and AoU, totalling up to 13,258 DCM/NICM cases and 1,435,287 controls; see Supplementary Note and Supplementary Tables 13 and 14). Data represent estimated beta coefficients ± standard errors. A trend line from linear regression is added to the plot, with the estimated beta coefficient and standard error from this regression shown at the top left of the panels. Genes showing substantial deviation from the line are annotated with their gene names. The right panels are bar charts that show the replication rate (i.e., the percentage of replicating loci) using different definitions of replication; the green bars (left) represent directional concordance, the light blue bars (middle) represent replication at nominal unadjusted one-sided P < 0.05, and the dark blue bars (right) represent replication at Bonferroni-adjusted significance (one-sided P < 0.05/number of loci), which yields cutoffs of P < 0.0014 and P < 0.002 in part a and cutoffs of P < 0.00078 and P < 0.0015 in part b.
Given the estimated attenuation of effect sizes for previously-established DCM loci, we computed ‘expected’ replication rates under the assumption that all loci are true and share the same degree of attenuation (Supplementary Note); the expected replication rates are added as light gray bars behind the colored bars. Note: OR, odds ratio.
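The three replication definitions in this legend (directional concordance, nominal one-sided P < 0.05 and Bonferroni-adjusted one-sided P < 0.05/# loci) can be illustrated with a short sketch. The function names and the use of a normal (Wald) approximation for the one-sided P value are assumptions made for the example; the actual replication analysis is described in the Supplementary Note.

```python
import math


def one_sided_p(beta_discovery, beta_replication, se_replication):
    """One-sided P value for an effect in the discovery direction.

    Assumes a normal (Wald) test statistic; illustrative only.
    """
    z = math.copysign(1.0, beta_discovery) * beta_replication / se_replication
    return 0.5 * math.erfc(z / math.sqrt(2))  # upper-tail normal probability


def replication_rates(loci):
    """loci: list of (beta_discovery, beta_replication, se_replication)."""
    n = len(loci)
    bonferroni = 0.05 / n  # one-sided P < 0.05 / number of tested loci
    concordant = nominal = strict = 0
    for b_disc, b_rep, se_rep in loci:
        p = one_sided_p(b_disc, b_rep, se_rep)
        concordant += (b_disc * b_rep) > 0   # same direction of effect
        nominal += p < 0.05                  # nominal replication
        strict += p < bonferroni             # Bonferroni-adjusted replication
    return {
        "directional": concordant / n,
        "nominal": nominal / n,
        "bonferroni": strict / n,
        "threshold": bonferroni,
    }
```

Because the Bonferroni cutoff depends only on the number of loci tested, the different cutoffs quoted in the legend reflect different numbers of loci passing quality-control for replication.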

Extended Data Fig. 5 Cardiac cell type enrichment of DCM heritability from two snRNA sequencing datasets.

Bar charts represent the -log10 of the P-value from the analysis testing for enrichment of cell type-specific gene programs in our GWAS/MTAG results. The x-axis shows different cell types from the respective snRNAseq datasets. Part a shows results from enrichment analysis using the Chaffin et al.²⁰ snRNAseq dataset, while part b shows results for the Reichart et al.²¹ snRNAseq dataset. The dotted lines represent the significance cutoffs within the panel, using a Bonferroni correction for the number of included cell types. The left panels show the results from testing for enrichment of GWAS-DCM heritability, while the right panels show results for testing for enrichment of MTAG-DCM heritability. P-values are derived from the Tau statistic from stratified LD score regression, and represent one-sided P-values that are unadjusted for multiple testing. Note: GWAS, genome-wide association study; MTAG, multi-trait analysis of GWAS; DCM, dilated cardiomyopathy.

Extended Data Fig. 6 Gene prioritization scores for top prioritized genes from GWAS-DCM.

The bottom of the figure shows a heatmap with different gene prioritization methods on the y-axis and highly-prioritized genes on the x-axis. The top of the figure shows the corresponding gene prioritization scores as bar charts, where each score is the sum of the individual components from the heatmap. Genes are ordered from left to right based on their priority score (high to low). In the heatmap, a very light blue panel indicates no points, a middle-blue panel indicates 0.5 points, while a dark blue panel indicates 1 point assigned to the given gene based on the given prioritization method. Highly-prioritized genes were defined as genes with 2.5 or more points that were also the most highly-prioritized genes in their respective loci. For the similar plot for MTAG-DCM, see Fig. 2b. Note: GWAS, genome-wide association study; DCM, dilated cardiomyopathy; MTAG, multi-trait analysis of GWAS; PoPs, polygenic priority score method; eQTL, expression quantitative trait locus; pQTL, protein quantitative trait locus.

Extended Data Fig. 7 Associations between polygenic risk score and DCM across three European ancestry datasets.

This forest plot shows association results for the PRS constructed from GWAS-DCM and MTAG-DCM with DCM status across three different datasets. In all cases, association data are shown in a European ancestry ‘testing set’ (dataset in which PRS is tested) that is made as independent as possible from the ‘training data’ (i.e., the base GWAS and MTAG data used to construct PRS). In the Amsterdam UMC (AUMC) dataset, AUMC samples were omitted from the PRS training data, and PRS was used to discriminate clinical DCM cases (N = 783) from referents (N = 6,978). In the All of Us (AoU) dataset, samples from Massachusetts General Hospital (MGB) were omitted from the PRS training data, and PRS was used to discriminate NI-DCM cases (N = 506) from controls (N = 95,510). In the UK Biobank (UKB) dataset, samples from UKB were omitted from the base GWAS, and participants were excluded from the testing set if they contributed to the MRI sub-study of UKB (first 45k); PRS was used to discriminate NI-DCM cases (N = 793) from controls (N = 325,313). All PRS were constructed using the PRScs algorithm (Methods). In the plot, the x-axis shows odds ratios per standard deviation of the PRS distribution, estimated from logistic regression (adjusted at least for ancestral principal components in all cases). Data are presented as estimated odds ratios with 95% confidence intervals. The first three rows with dark green color show results for PRS constructed from GWAS-DCM, while the bottom three rows in light green color show results for PRS constructed from MTAG-DCM. On the right of the plot we show the R^2 for each PRS in the respective dataset, where R^2 represents the residual variance explained by the PRS (computed as the improvement of model R^2 inclusive of PRS as compared to the model without PRS, divided by the proportion of residual variance); all R^2 values were computed on the liability-scale to allow better comparisons across datasets.
Note: Other performance metrics are presented in Supplementary Table 41. GWAS, genome-wide association study; DCM, dilated cardiomyopathy; NI-DCM, nonischemic dilated cardiomyopathy; MTAG, multi-trait analysis of GWAS; OR, odds ratio; 95%CI, 95% confidence interval; SD, standard deviation; R^2, variance explained.

Extended Data Fig. 8 Associations between DCM polygenic risk score and NI-DCM across different ancestries in the All of Us dataset.

This forest plot shows association results for the PRS constructed from GWAS-DCM and MTAG-DCM with NI-DCM in the All of Us dataset. PRS were constructed using the PRScs algorithm (Methods), with the x-axis showing odds ratios per standard deviation of the PRS distribution, estimated from logistic regression, adjusting for age, age^2, sex and ancestral principal components. Data are presented as estimated odds ratios with 95% confidence intervals. The figure shows results for all samples (N = 928 cases and 181,773 controls), European ancestry only (N = 506 cases and 95,510 controls), African ancestry only (N = 246 cases and 36,864 controls), and Admixed-American ancestry only (N = 107 cases and 28,784 controls). The top of the figure shows results for the PRS constructed from GWAS-DCM, while the bottom shows results for PRS constructed from MTAG-DCM. Reported P-values are two-sided and unadjusted for multiple testing. Note: Other performance metrics are presented in Supplementary Table 32. GWAS, genome-wide association study; NI-DCM, nonischemic dilated cardiomyopathy; MTAG, multi-trait analysis of GWAS; OR, odds ratio; 95%CI, 95% confidence interval; SD, standard deviation.

Extended Data Fig. 9 The additive contribution of PRS and rare pathogenic variants to NI-DCM risk in the All of Us dataset.

The figures show bar charts, where the x-axis shows different strata based on genetics, including three tertiles of PRS (tertile one [T1] in very-light blue, tertile two [T2] in light blue, and tertile three [T3] in dark blue) and two strata based on rare variant carrier status, that is non-carriers and carriers of rare pathogenic or likely pathogenic variants for DCM. The y-axis shows the estimated odds ratio for the given group as compared to a reference group; odds ratios were estimated using logistic regression analyses. Data are presented as estimated odds ratios with 95% confidence intervals. Part a shows results inclusive of all individuals passing our quality-control in All of Us (N = 928 cases and 181,773 controls), while part b is additionally restricted to samples with genetically-determined European ancestry (N = 506 cases and 95,510 controls). In both parts, the left panel shows results where the reference group is represented by individuals without rare variants in the second tertile of PRS; the right panel shows results where the reference group is represented by individuals without rare variants who are in the first tertile of PRS. Note: NI-DCM, nonischemic dilated cardiomyopathy; P/LP, likely pathogenic or pathogenic rare variants; CI, confidence interval; ALL, all individuals irrespective of ancestry; EUR, individuals of genetically-determined European ancestry.
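The stratification behind this figure (PRS tertiles crossed with rare variant carrier status, each stratum compared with a reference stratum) can be sketched as below. This is a deliberate simplification: the source estimates odds ratios with logistic regression, whereas the sketch computes an unadjusted odds ratio from raw counts with a Woolf (log-scale) confidence interval. The function names are hypothetical and any counts passed in are placeholders, not values from the study.

```python
import math


def tertile_labels(scores):
    """Assign T1/T2/T3 by rank so the three groups are near-equal in size."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    labels = [None] * len(scores)
    for rank, i in enumerate(order):
        labels[i] = "T" + str(rank * 3 // len(scores) + 1)
    return labels


def odds_ratio(cases, controls, ref_cases, ref_controls, z=1.96):
    """Unadjusted odds ratio versus a reference stratum with a 95% Woolf CI.

    Not the covariate-adjusted logistic regression used in the source;
    shown only to make the comparison against a reference group concrete.
    """
    estimate = (cases / controls) / (ref_cases / ref_controls)
    se_log = math.sqrt(1 / cases + 1 / controls + 1 / ref_cases + 1 / ref_controls)
    return estimate, estimate * math.exp(-z * se_log), estimate * math.exp(z * se_log)
```

Changing the reference stratum (for example, from non-carriers in T2 to non-carriers in T1) rescales every odds ratio by a common factor, which is why the left and right panels show the same pattern on different scales.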

Supplementary information

Supplementary Information

Supplementary Notes and Figs. 1–13.

Reporting Summary

Peer Review File

Supplementary Tables

Supplementary Tables 1–41.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Jurgens, S.J., Rämö, J.T., Kramarenko, D.R. et al. Genome-wide association study reveals mechanisms underlying dilated cardiomyopathy and myocardial resilience. Nat Genet 56, 2636–2645 (2024). https://doi.org/10.1038/s41588-024-01975-5

Received: 08 December 2023
Accepted: 08 October 2024
Published: 21 November 2024
Version of record: 21 November 2024
Issue date: December 2024
DOI: https://doi.org/10.1038/s41588-024-01975-5
