A statistical framework for multi-trait rare variant analysis in large-scale whole-genome sequencing studies

Li, Xihao; Chen, Han; Selvaraj, Margaret Sunitha; Van Buren, Eric; Zhou, Hufeng; Wang, Yuxuan; Sun, Ryan; McCaw, Zachary R.; Yu, Zhi; Jiang, Min-Zhi; DiCorpo, Daniel; Gaynor, Sheila M.; Dey, Rounak; Arnett, Donna K.; Benjamin, Emelia J.; Bis, Joshua C.; Blangero, John; Boerwinkle, Eric; Bowden, Donald W.; Brody, Jennifer A.; Cade, Brian E.; Carson, April P.; Carlson, Jenna C.; Chami, Nathalie; Chen, Yii-Der Ida; Curran, Joanne E.; de Vries, Paul S.; Fornage, Myriam; Franceschini, Nora; Freedman, Barry I.; Gu, Charles; Heard-Costa, Nancy L.; He, Jiang; Hou, Lifang; Hung, Yi-Jen; Irvin, Marguerite R.; Kaplan, Robert C.; Kardia, Sharon L. R.; Kelly, Tanika N.; Konigsberg, Iain; Kooperberg, Charles; Kral, Brian G.; Li, Changwei; Li, Yun; Lin, Honghuang; Liu, Ching-Ti; Loos, Ruth J. F.; Mahaney, Michael C.; Martin, Lisa W.; Mathias, Rasika A.; Mitchell, Braxton D.; Montasser, May E.; Morrison, Alanna C.; Naseri, Take; North, Kari E.; Palmer, Nicholette D.; Peyser, Patricia A.; Psaty, Bruce M.; Redline, Susan; Reiner, Alexander P.; Rich, Stephen S.; Sitlani, Colleen M.; Smith, Jennifer A.; Taylor, Kent D.; Tiwari, Hemant K.; Vasan, Ramachandran S.; Viali, Satupa’itea; Wang, Zhe; Wessel, Jennifer; Yanek, Lisa R.; Yu, Bing; Dupuis, Josée; Meigs, James B.; Auer, Paul L.; Raffield, Laura M.; Manning, Alisa K.; Rice, Kenneth M.; Rotter, Jerome I.; Peloso, Gina M.; Natarajan, Pradeep; Li, Zilin; Liu, Zhonghua; Lin, Xihong

doi:10.1038/s43588-024-00764-8

Article
Published: 07 February 2025

A statistical framework for multi-trait rare variant analysis in large-scale whole-genome sequencing studies

Xihao Li ORCID: orcid.org/0000-0001-8151-0106^1,2,
Han Chen ORCID: orcid.org/0000-0002-9510-4923³,
Margaret Sunitha Selvaraj^4,5,6,
Eric Van Buren⁷,
Hufeng Zhou⁷,
Yuxuan Wang ORCID: orcid.org/0000-0001-9117-0619⁸,
Ryan Sun⁹,
Zachary R. McCaw¹,
Zhi Yu ORCID: orcid.org/0000-0003-4810-3474^4,5,10,
Min-Zhi Jiang ORCID: orcid.org/0000-0001-5502-063X^2,11,
Daniel DiCorpo⁸,
Sheila M. Gaynor⁷,
Rounak Dey⁷,
Donna K. Arnett¹²,
Emelia J. Benjamin ORCID: orcid.org/0000-0003-4076-2336^13,14,15,
Joshua C. Bis ORCID: orcid.org/0000-0002-3409-1110¹⁶,
John Blangero ORCID: orcid.org/0000-0001-6250-5723¹⁷,
Eric Boerwinkle^3,18,
Donald W. Bowden¹⁹,
Jennifer A. Brody¹⁶,
Brian E. Cade^5,20,21,
April P. Carson ORCID: orcid.org/0000-0002-7970-6756²²,
Jenna C. Carlson²³,
Nathalie Chami ORCID: orcid.org/0000-0002-8547-6424^24,25,
Yii-Der Ida Chen²⁶,
Joanne E. Curran¹⁷,
Paul S. de Vries³,
Myriam Fornage^3,27,
Nora Franceschini²⁸,
Barry I. Freedman ORCID: orcid.org/0000-0003-0275-5530²⁹,
Charles Gu ORCID: orcid.org/0000-0002-8527-8145³⁰,
Nancy L. Heard-Costa ORCID: orcid.org/0000-0001-9730-0306^15,31,
Jiang He^32,33,
Lifang Hou³⁴,
Yi-Jen Hung³⁵,
Marguerite R. Irvin³⁶,
Robert C. Kaplan^37,38,
Sharon L. R. Kardia³⁹,
Tanika N. Kelly⁴⁰,
Iain Konigsberg⁴¹,
Charles Kooperberg⁴⁰,
Brian G. Kral⁴²,
Changwei Li^32,33,
Yun Li^1,2,
Honghuang Lin⁴³,
Ching-Ti Liu ORCID: orcid.org/0000-0002-0703-0742⁸,
Ruth J. F. Loos^24,44,
Michael C. Mahaney¹⁷,
Lisa W. Martin ORCID: orcid.org/0000-0003-4352-0914⁴⁵,
Rasika A. Mathias⁴²,
Braxton D. Mitchell ORCID: orcid.org/0000-0003-4920-4744⁴⁶,
May E. Montasser⁴⁶,
Alanna C. Morrison³,
Take Naseri^47,48,
Kari E. North ORCID: orcid.org/0000-0002-8903-0366²⁸,
Nicholette D. Palmer¹⁹,
Patricia A. Peyser³⁹,
Bruce M. Psaty^16,49,50,
Susan Redline^20,21,
Alexander P. Reiner^38,49,
Stephen S. Rich ORCID: orcid.org/0000-0003-3872-7793⁵¹,
Colleen M. Sitlani¹⁶,
Jennifer A. Smith³⁹,
Kent D. Taylor²⁶,
Hemant K. Tiwari⁵²,
Ramachandran S. Vasan^15,53,
Satupa’itea Viali ORCID: orcid.org/0000-0002-4829-8403^54,55,56,
Zhe Wang ORCID: orcid.org/0000-0002-8046-4969²⁴,
Jennifer Wessel^57,58,
Lisa R. Yanek⁴²,
Bing Yu³,
NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium,
Josée Dupuis ORCID: orcid.org/0000-0003-2871-3603^8,59,
James B. Meigs^5,6,60,
Paul L. Auer⁶¹,
Laura M. Raffield²,
Alisa K. Manning^6,62,63,
Kenneth M. Rice⁶⁴,
Jerome I. Rotter²⁶,
Gina M. Peloso⁸,
Pradeep Natarajan^4,5,6,
Zilin Li ORCID: orcid.org/0000-0003-1521-8945⁷,
Zhonghua Liu ORCID: orcid.org/0000-0003-3048-9823⁶⁵ &
…
Xihong Lin ORCID: orcid.org/0000-0001-7067-7752^5,7,66

Nature Computational Science volume 5, pages 125–143 (2025)Cite this article

4198 Accesses
3 Citations
16 Altmetric
Metrics details

Subjects

This article has been updated

A preprint version of the article is available at bioRxiv.

Abstract

Large-scale whole-genome sequencing (WGS) studies have improved our understanding of the contributions of coding and noncoding rare variants to complex human traits. Leveraging association effect sizes across multiple traits in WGS rare variant association analysis can improve statistical power over single-trait analysis, and also detect pleiotropic genes and regions. Existing multi-trait methods have limited ability to perform rare variant analysis of large-scale WGS data. We propose MultiSTAAR, a statistical framework and computationally scalable analytical pipeline for functionally informed multi-trait rare variant analysis in large-scale WGS studies. MultiSTAAR accounts for relatedness, population structure and correlation among phenotypes by jointly analyzing multiple traits, and further empowers rare variant association analysis by incorporating multiple functional annotations. We applied MultiSTAAR to jointly analyze three lipid traits in 61,838 multi-ethnic samples from the Trans-Omics for Precision Medicine (TOPMed) Program. We discovered and replicated new associations with lipid traits missed by single-trait analysis.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on SpringerLink
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 1: MultiSTAAR framework and pipeline.**

**Fig. 2: Manhattan plots and Q–Q plots for unconditional gene-centric coding, noncoding and ncRNA multi-trait analysis of LDL-C, HDL-C and TG using TOPMed data (n = 61,838).**

**Fig. 3: TOPMed genetic-region (2-kb sliding window) unconditional multi-trait analysis results for LDL-C, HDL-C and TG using TOPMed data (n = 61,838).**

Powerful, scalable and resource-efficient meta-analysis of rare variant associations in large whole genome sequencing studies

Article 23 December 2022

A framework for detecting noncoding rare-variant associations of large-scale whole-genome sequencing studies

Article 27 October 2022

Pleiotropic and sex-specific genetic mechanisms of circulating metabolic markers

Article Open access 28 May 2025

Data availability

This Article used TOPMed Freeze 8 WGS data and lipids phenotype data. Genotype and phenotype data are both available in the database of Genotypes and Phenotypes. The TOPMed WGS data were from the following 20 study cohorts (accession numbers provided in parentheses): Old Order Amish (phs000956.v1.p1), Atherosclerosis Risk in Communities Study (phs001211), Mt Sinai BioMe Biobank (phs001644), Coronary Artery Risk Development in Young Adults (phs001612), Cleveland Family Study (phs000954), Cardiovascular Health Study (phs001368), Diabetes Heart Study (phs001412), Framingham Heart Study (phs000974), Genetic Study of Atherosclerosis Risk (phs001218), Genetic Epidemiology Network of Arteriopathy (phs001345), Genetic Epidemiology Network of Salt Sensitivity (phs001217), Genetics of Lipid Lowering Drugs and Diet Network (phs001359), Hispanic Community Health Study—Study of Latinos (phs001395), Hypertension Genetic Epidemiology Network and Genetic Epidemiology Network of Arteriopathy (phs001293), Jackson Heart Study (phs000964), Multi-Ethnic Study of Atherosclerosis (phs001416), San Antonio Family Heart Study (phs001215), Genome-Wide Association Study of Adiposity in Samoans (phs000972), Taiwan Study of Hypertension using Rare Variants (phs001387) and Women’s Health Initiative (phs001237). The sample sizes, ancestry and phenotype summary statistics of these cohorts are provided in Supplementary Table 2. Source data for Figs. 2 and 3 and Extended Data Figs. 1 and 2 are available via Zenodo (https://doi.org/10.5281/zenodo.14213842)⁵⁸. The UK Biobank analyses were conducted using the UK Biobank resource under application 52008. The functional annotation data are publicly available and can be downloaded from the following links: GRCh38 CADD v1.4 (https://cadd.gs.washington.edu/download); ANNOVAR dbNSFP v3.3a (https://annovar.openbioinformatics.org/en/latest/user-guide/download); LINSIGHT (https://github.com/CshlSiepelLab/LINSIGHT); FATHMM-XF (http://fathmm.biocompute.org.uk/fathmm-xf); FANTOM5 CAGE (https://fantom.gsc.riken.jp/5/data); GeneCards (https://www.genecards.org; v4.7 for hg38); and Umap/Bismap (https://bismap.hoffmanlab.org; ‘before March 2020’ version). In addition, recombination rate and nucleotide diversity were obtained from ref. ⁵⁹. The whole-genome individual functional annotation data were assembled from a variety of sources, and the computed annotation PCs are available at the Functional Annotation of Variant-Online Resource (FAVOR) site (https://favor.genohub.org)⁶⁰ and the FAVOR database (https://doi.org/10.7910/DVN/1VGTJI)⁶¹.

Code availability

MultiSTAAR is implemented as an open-source R package available at https://github.com/xihaoli/MultiSTAAR and https://hsph.harvard.edu/research/lin-lab/software. Data analysis was performed in R (4.1.0). STAAR v0.9.7 and MultiSTAAR v0.9.7 were used in simulation and real data analysis and implemented as open-source R packages available at https://github.com/xihaoli/STAAR (ref. ⁵⁶) and https://github.com/xihaoli/MultiSTAAR (ref. ⁵⁴). The assembled functional annotation data were downloaded from FAVOR using Wget (https://www.gnu.org/software/wget/wget.html).

Change history

05 March 2025
In the version of the article initially published, DOIs were missing from refs. 54–58 and have now been added to the HTML and PDF versions of the article.

References

Taliun, D. et al. Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program. Nature 590, 290–299 (2021).
Google Scholar
The All of Us Research Program Investigators. The ‘All of Us’ research program. New Engl. J. Med. 381, 668–676 (2019).
Google Scholar
Halldorsson, B. V. et al. The sequences of 150,119 genomes in the UK Biobank. Nature 607, 732–740 (2022).
MATH Google Scholar
Lee, S., Abecasis, G. R., Boehnke, M. & Lin, X. Rare-variant association analysis: study designs and statistical tests. Am. J. Human Genet. 95, 5–23 (2014).
Google Scholar
Li, B. & Leal, S. M. Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. Am. J. Human Genet. 83, 311–321 (2008).
MATH Google Scholar
Madsen, B. E. & Browning, S. R. A groupwise association test for rare mutations using a weighted sum statistic. PLoS Genet. 5, e1000384 (2009).
Google Scholar
Morris, A. P. & Zeggini, E. An evaluation of statistical approaches to rare variant analysis in genetic association studies. Genet. Epidemiol. 34, 188–193 (2010).
MATH Google Scholar
Wu, M. C. et al. Rare-variant association testing for sequencing data with the sequence kernel association test. Am. J. Human Genet. 89, 82–93 (2011).
MATH Google Scholar
Liu, Y. et al. ACAT: a fast and powerful P value combination method for rare-variant analysis in sequencing studies. Am. J. Human Genet. 104, 410–421 (2019).
MATH Google Scholar
Solovieff, N., Cotsapas, C., Lee, P. H., Purcell, S. M. & Smoller, J. W. Pleiotropy in complex traits: challenges and strategies. Nat. Rev. Genet. 14, 483–495 (2013).
Google Scholar
Sivakumaran, S. et al. Abundant pleiotropy in human complex diseases and traits. Am. J. Human Genet. 89, 607–618 (2011).
MATH Google Scholar
Abdellaoui, A., Yengo, L., Verweij, K. J. H. & Visscher, P. M. 15 years of GWAS discovery: realizing the promise. Am. J. Human Genet 110, 179–194 (2023).
MATH Google Scholar
Watanabe, K. et al. A global overview of pleiotropy and genetic architecture in complex traits. Nat. Genet. 51, 1339–1348 (2019).
MATH Google Scholar
Wu, B. & Pankow, J. S. Sequence kernel association test of multiple continuous phenotypes. Genet. Epidemiol. 40, 91–100 (2016).
MATH Google Scholar
Dutta, D., Scott, L., Boehnke, M. & Lee, S. Multi-SKAT: general framework to test for rare-variant association with multiple phenotypes. Genet. Epidemiol. 43, 4–23 (2019).
Google Scholar
Luo, L. et al. Multi-trait analysis of rare-variant association summary statistics using MTAR. Nat. Commun. 11, 2850 (2020).
MATH Google Scholar
Broadaway, K. A. et al. A statistical approach for testing cross-phenotype effects of rare variants. Am. J. Human Genet. 98, 525–540 (2016).
MATH Google Scholar
Li, X. et al. Dynamic incorporation of multiple in silico functional annotations empowers rare variant association analysis of large whole-genome sequencing studies at scale. Nat. Genet. 52, 969–983 (2020).
MATH Google Scholar
Sammel, M., Lin, X. & Ryan, L. Multivariate linear mixed models for multiple outcomes. Stat. Med. 18, 2479–2492 (1999).
MATH Google Scholar
Conomos, M. P., Miller, M. B. & Thornton, T. A. Robust inference of population structure for ancestry prediction and correction of stratification in the presence of relatedness. Genet. Epidemiol. 39, 276–293 (2015).
MATH Google Scholar
Conomos, M. P., Reiner, A. P., Weir, B. S. & Thornton, T. A. Model-free estimation of recent genetic relatedness. Am. J. Human Genet. 98, 127–148 (2016).
MATH Google Scholar
Gogarten, S. M. et al. Genetic association testing using the GENESIS R/Bioconductor package. Bioinformatics 35, 5346–5348 (2019).
MATH Google Scholar
Lee, P. H. et al. Principles and methods of in silico prioritization of noncoding regulatory variants. Human Genet. 137, 15–30 (2018).
Li, Z. et al. A framework for detecting noncoding rare-variant associations of large-scale whole-genome sequencing studies. Nat. Methods 19, 1599–1611 (2022).
Morrison, A. C. et al. Practical approaches for whole-genome sequence analysis of heart-and blood-related traits. Am. J. Human Genet. 100, 205–215 (2017).
MATH Google Scholar
Selvaraj, M. S. et al. Whole genome sequence analysis of blood lipid levels in >66,000 individuals. Nat. Commun. 13, 5995 (2022).
MATH Google Scholar
Liu, Z. & Lin, X. Multiple phenotype association tests using summary statistics in genome-wide association studies. Biometrics 74, 165–175 (2018).
MathSciNet MATH Google Scholar
Teslovich, T. M. et al. Biological, clinical and population relevance of 95 loci for blood lipids. Nature 466, 707–713 (2010).
MATH Google Scholar
Schaffner, S. F. et al. Calibrating a coalescent simulation of human genome sequence variation. Genome Res. 15, 1576–1583 (2005).
MATH Google Scholar
Natarajan, P. et al. Deep-coverage whole genome sequences and blood lipids among 16,324 individuals. Nat. Commun. 9, 3391 (2018).
MATH Google Scholar
Stilp, A. M. et al. A system for phenotype harmonization in the National Heart, Lung and Blood Institute Trans-Omics for Precision Medicine (TOPMed) program. Am. J. Epidemiol. 190, 1977–1992 (2021).
MATH Google Scholar
Frankish, A. et al. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 47, D766–D773 (2019).
MATH Google Scholar
Dong, C. et al. Comparison and integration of deleteriousness prediction methods for non-synonymous SNVs in whole exome sequencing studies. Human Mol. Genet. 24, 2125–2137 (2014).
Kircher, M. et al. A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet. 46, 310–315 (2014).
MATH Google Scholar
Huang, Y.-F., Gulko, B. & Siepel, A. Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data. Nat. Genet. 49, 618–624 (2017).
Rogers, M. F. et al. FATHMM-XF: accurate prediction of pathogenic point mutations via extended features. Bioinformatics 34, 511–513 (2017).
MATH Google Scholar
Buniello, A. et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005–D1012 (2019).
Google Scholar
Klarin, D. et al. Genetics of blood lipids among ~300,000 multi-ethnic participants of the Million Veteran Program. Nat. Genet. 50, 1514–1523 (2018).
MATH Google Scholar
Forrest, A. R. et al. A promoter-level mammalian expression atlas. Nature 507, 462–470 (2014).
MATH Google Scholar
Abascal, F. et al. Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature 583, 699–710 (2020).
MATH Google Scholar
Andersson, R. et al. An atlas of active enhancers across human cell types and tissues. Nature 507, 455–461 (2014).
MATH Google Scholar
Fishilevich, S. et al. GeneHancer: genome-wide integration of enhancers and target genes in GeneCards. Database 2017, bax028 (2017).
Google Scholar
DiCorpo, D. et al. Whole genome sequence association analysis of fasting glucose and fasting insulin levels in diverse cohorts from the NHLBI TOPMed program. Commun. Biol. 5, 756 (2022).
Google Scholar
Jiang, M.-Z. et al. Whole genome sequencing based analysis of inflammation biomarkers in the Trans-Omics for Precision Medicine (TOPMed) consortium. Human Mol. Genet 33, 1429–1441 (2024).
MATH Google Scholar
Dijk, W. et al. Identification of a gain-of-function LIPC variant as a novel cause of familial combined hypocholesterolemia. Circulation 146, 724–739 (2022).
MATH Google Scholar
Ottensmann, L. et al. Genome-wide association analysis of plasma lipidome identifies 495 genetic associations. Nat. Commun. 14, 6934 (2023).
MATH Google Scholar
Guo, T. et al. Association between the DOCK7, PCSK9 and GALNT2 gene polymorphisms and serum lipid levels. Sci. Rep. 6, 19079 (2016).
Google Scholar
Li, Z. et al. Dynamic scan procedure for detecting rare-variant association regions in whole-genome sequencing studies. Am. J. Human Genet. 104, 802–814 (2019).
MATH Google Scholar
McCaw, Z. R., Gao, J., Lin, X. & Gronsbell, J. Synthetic surrogates improve power for genome-wide association studies of partially missing phenotypes in population biobanks. Nat. Genet. 56, 1527–1536 (2024).
Google Scholar
Li, X. et al. Powerful, scalable and resource-efficient meta-analysis of rare variant associations in large whole genome sequencing studies. Nat. Genet. 55, 154–164 (2023).
MATH Google Scholar
Chen, H. et al. Control for population structure and relatedness for binary traits in genetic association studies via logistic mixed models. Am. J. Human Genet. 98, 653–666 (2016).
MATH Google Scholar
Chen, H. et al. Efficient variant set mixed model association tests for continuous and binary traits in large-scale whole-genome sequencing studies. Am. J. Human Genet. 104, 260–274 (2019).
MATH Google Scholar
Liu, Y. & Xie, J. Cauchy combination test: a powerful test with analytic P-value calculation under arbitrary dependency structures. J. Am. Stat. Assoc. 115, 393–402 (2020).
MathSciNet MATH Google Scholar
Li, X. xihaoli/MultiSTAAR: MultiSTAAR_v0.9.7 (v0.9.7). Zenodo. https://doi.org/10.5281/zenodo.13955413 (2024).
Li, X. & Li, Z. xihaoli/STAARpipeline: STAARpipeline_v0.9.7 (v0.9.7). Zenodo. https://doi.org/10.5281/zenodo.10098313 (2023).
Li, X. xihaoli/STAAR: STAAR_v0.9.7 (v0.9.7). Zenodo. https://doi.org/10.5281/zenodo.10060210 (2023).
Li, X. & Li, Z. xihaoli/STAARpipelineSummary: STAARpipelineSummary_v0.9.7 (v0.9.7). Zenodo. https://doi.org/10.5281/zenodo.10113310 (2023).
Li, X. et al. Source data of the MultiSTAAR manuscript “A statistical framework for multi-trait rare variant analysis in large-scale whole-genome sequencing studies”. [Data set]. Zenodo. https://doi.org/10.5281/zenodo.14213842 (2024).
Gazal, S. et al. Linkage disequilibrium–dependent architecture of human complex traits shows action of negative selection. Nat. Genet. 49, 1421–1427 (2017).
MATH Google Scholar
Zhou, H. et al. FAVOR: functional annotation of variants online resource and annotator for variation across the human genome. Nucleic Acids Res. 51, D1300–D1311 (2023).
Google Scholar
Zhou, H., Arapoglou, T., Li, X., Li, Z. & Lin, X. FAVOR Essential Database. V1 edn (Harvard Dataverse, 2022).
Moors, J. et al. A Polynesian-specific missense CETP variant alters the lipid profile. Human Genet. Genomics Adv. 4, 100204 (2023).
MATH Google Scholar

Download references

Acknowledgements

This work was supported by grants R35-CA197449, U19-CA203654, U01-HG012064 and U01-HG009088 (X. Lin), NHLBI TOPMed Fellowship 75N92021F00229 (X. Li and M.S.S.), 1R01AG086379-01 (Z. Liu), R01-HL142711 and R01-HL127564 (P.N. and G.M.P.), R00HG012956-02 (Z.Y.), 75N92020D00001, HHSN268201500003I, N01-HC-95159, 75N92020D00005, N01-HC-95160, 75N92020D00002, N01-HC-95161, 75N92020D00003, N01-HC-95162, 75N92020D00006, N01-HC-95163, 75N92020D00004, N01-HC-95164, 75N92020D00007, N01-HC-95165, N01-HC-95166, N01-HC-95167, N01-HC-95168, N01-HC-95169, UL1-TR-000040, UL1-TR-001079, UL1-TR-001420, UL1-TR001881, DK063491, R01-HL071051, R01-HL071205, R01-HL071250, R01-HL071251, R01-HL071258, R01-HL071259 and UL1-RR033176 (J.I.R.), HHSN268201800001I and U01-HL137162 (K.M.R.), DK078616 and HL151855 (J.B.M.), 1R35-HL135818, R01-HL113338 and HL046389 (S.R.), HL105756 (B.M.P.), HHSN268201600018C, HHSN268201600001C, HHSN268201600002C, HHSN268201600003C and HHSN268201600004C (C.K.), R01-MD012765 and R01-DK117445 (N.F.), R01-HL153805 and R03-HL154284 (B.E.C.), HHSN268201700001I, HHSN268201700002I, HHSN268201700003I, HHSN268201700005I and HHSN268201700004I (E.B.), U01-HL072524, R01-HL104135-04S1, U01-HL054472, U01-HL054473, U01-HL054495, U01-HL054509 and R01-HL055673-18S1 (D.K.A.) and U01-HL72518, HL087698, HL49762, HL59684, HL58625, HL071025, HL112064, NR0224103 and M01-RR000052 (to the Johns Hopkins General Clinical Research Center). The Diabetes Heart Study (DHS) was supported by R01 HL92301, R01 HL67348, R01 NS058700, R01 AR48797, R01 DK071891 and R01 AG058921, the General Clinical Research Center of the Wake Forest University School of Medicine (M01 RR07122, F32 HL085989), the American Diabetes Association and a pilot grant from the Claude Pepper Older Americans Independence Center of Wake Forest University Health Sciences (P60 AG10484). The Framingham Heart Study (FHS) acknowledges the support of contracts NO1-HC-25195, HHSN268201500001I, 75N92019D00031, 1R01HL064753, R01HL076784 and 1R01AG028321 from the National Heart, Lung and Blood Institute and grant supplement R01 HL092577-06S1 for this research. We also acknowledge the dedication of the FHS study participants, without whom this research would not be possible. R.S.V. is supported in part by the Evans Medical Foundation and the Jay and Louis Coffman Endowment from the Department of Medicine, Boston University Chobanian & Avedisian School of Medicine. The Jackson Heart Study (JHS) is supported and conducted in collaboration with Jackson State University (HHSN268201800013I), Tougaloo College (HHSN268201800014I), the Mississippi State Department of Health (HHSN268201800015I) and the University of Mississippi Medical Center (HHSN268201800010I, HHSN268201800011I and HHSN268201800012I) contracts from the National Heart, Lung and Blood Institute (NHLBI) and the National Institute on Minority Health and Health Disparities (NIMHD). We also thank the staff and participants of the JHS. Support for GENOA was provided by the NHLBI (U01HL054457, U01HL054464, U01HL054481, R01HL119443 and R01HL087660) of the National Institutes of Health (NIH). Collection of the San Antonio Family Study data was supported in part by NIH grants P01 HL045522, MH078143, MH078111 and MH083824, and whole-genome sequencing of SAFS subjects was supported by U01 DK085524 and R01 HL113323. Molecular data for the Trans-Omics in Precision Medicine (TOPMed) program was supported by the NHLBI. Core support, including centralized genomic read mapping and genotype calling, along with variant quality metrics and filtering were provided by the TOPMed Informatics Research Center (3R01HL-117626-02S1; contract no. HHSN268201800002I). Core support, including phenotype harmonization, data management, sample-identity quality control and general program coordination were provided by the TOPMed Data Coordinating Center (R01HL-120393; U01HL-120393; contract no. HHSN268201800001I). We gratefully acknowledge the studies and participants who provided biological samples and data for TOPMed. The full study-specific acknowledgements are detailed in the Supplementary Information.

Author information

Authors and Affiliations

Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Xihao Li, Zachary R. McCaw & Yun Li
Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Xihao Li, Min-Zhi Jiang, Yun Li & Laura M. Raffield
Human Genetics Center, Department of Epidemiology, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Han Chen, Eric Boerwinkle, Paul S. de Vries, Myriam Fornage, Alanna C. Morrison & Bing Yu
Center for Genomic Medicine and Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
Margaret Sunitha Selvaraj, Zhi Yu & Pradeep Natarajan
Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Margaret Sunitha Selvaraj, Zhi Yu, Brian E. Cade, James B. Meigs, Pradeep Natarajan & Xihong Lin
Department of Medicine, Harvard Medical School, Boston, MA, USA
Margaret Sunitha Selvaraj, James B. Meigs, Alisa K. Manning & Pradeep Natarajan
Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Eric Van Buren, Hufeng Zhou, Sheila M. Gaynor, Rounak Dey, Zilin Li & Xihong Lin
Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
Yuxuan Wang, Daniel DiCorpo, Ching-Ti Liu, Josée Dupuis & Gina M. Peloso
Department of Biostatistics, University of Texas MD Anderson Cancer Center, Houston, TX, USA
Ryan Sun
Clinical and Translational Epidemiology Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
Zhi Yu
Department of Biostatistics, The Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
Min-Zhi Jiang
Provost Office, University of South Carolina, Columbia, SC, USA
Donna K. Arnett
Section of Cardiovascular Medicine, Boston Medical Center, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, USA
Emelia J. Benjamin
Department of Epidemiology, Boston University School of Public Health, Boston, MA, USA
Emelia J. Benjamin
Framingham Heart Study, Framingham, MA, USA
Emelia J. Benjamin, Nancy L. Heard-Costa & Ramachandran S. Vasan
Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
Joshua C. Bis, Jennifer A. Brody, Bruce M. Psaty & Colleen M. Sitlani
Department of Human Genetics and South Texas Diabetes and Obesity Institute, School of Medicine, The University of Texas Rio Grande Valley, Brownsville, TX, USA
John Blangero, Joanne E. Curran & Michael C. Mahaney
Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
Eric Boerwinkle, Adithya Balasubramanian, Huyen Dinh, Harsha Doddapaneni, Shannon Dugan-Perez, Jesse Farek, Richard Gibbs, Yi Han, Jianhong Hu, Ziad Khan, Sandra Lee, Vipin Menon, Ginger Metcalf, Zeineen Momin, Donna Muzny, Caitlin Nessner, Osuji Nkechinyere, Geoffrey Okwuonu, Mahitha Rajendran, Sejal Salvi, Jireh Santibanez & Jennifer Watt
Department of Biochemistry, Wake Forest University School of Medicine, Winston-Salem, NC, USA
Donald W. Bowden & Nicholette D. Palmer
Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, MA, USA
Brian E. Cade & Susan Redline
Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA
Brian E. Cade & Susan Redline
Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA
April P. Carson
Department of Human Genetics and Department of Biostatistics and Health Data Science, University of Pittsburgh, Pittsburgh, PA, USA
Jenna C. Carlson
The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Nathalie Chami, Ruth J. F. Loos & Zhe Wang
The Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Nathalie Chami
The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
Yii-Der Ida Chen, Kent D. Taylor & Jerome I. Rotter
Brown Foundation Institute of Molecular Medicine, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX, USA
Myriam Fornage
Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Nora Franceschini & Kari E. North
Department of Internal Medicine, Nephrology, Wake Forest University School of Medicine, Winston-Salem, NC, USA
Barry I. Freedman
Division of Biology & Biomedical Sciences, Washington University School of Medicine, St. Louis, MO, USA
Charles Gu
Department of Neurology, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, USA
Nancy L. Heard-Costa
Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, USA
Jiang He & Changwei Li
Translational Science Institute, Tulane University, New Orleans, LA, USA
Jiang He & Changwei Li
Department of Preventive Medicine, Northwestern University, Chicago, IL, USA
Lifang Hou
Department of Internal Medicine, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan
Yi-Jen Hung
Department of Epidemiology, School of Public Health, University of Alabama at Birmingham, Birmingham, AL, USA
Marguerite R. Irvin
Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
Robert C. Kaplan
Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
Robert C. Kaplan & Alexander P. Reiner
Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
Sharon L. R. Kardia, Patricia A. Peyser & Jennifer A. Smith
Department of Medicine, Division of Nephrology, University of Illinois Chicago, Chicago, IL, USA
Tanika N. Kelly & Charles Kooperberg
Department of Biomedical Informatics, University of Colorado, Aurora, CO, USA
Iain Konigsberg
Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Brian G. Kral, Rasika A. Mathias & Lisa R. Yanek
Department of Medicine, University of Massachusetts Chan Medical School, Worcester, MA, USA
Honghuang Lin
Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Ruth J. F. Loos
School of Medicine and Health Sciences, George Washington University, Washington, DC, USA
Lisa W. Martin
Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
Braxton D. Mitchell & May E. Montasser
Naseri & Associates Public Health Consultancy Firm and Family Health Clinic, Apia, Samoa
Take Naseri
Department of Epidemiology, Brown University, Providence, RI, USA
Take Naseri & Stephen McGarvey
Departments of Epidemiology, University of Washington, Seattle, WA, USA
Bruce M. Psaty & Alexander P. Reiner
Department of Health Systems and Population Health, University of Washington, Seattle, WA, USA
Bruce M. Psaty
Department of Genome Sciences, University of Virginia, Charlottesville, VA, USA
Stephen S. Rich
Department of Biostatistics, School of Public Health, University of Alabama at Birmingham, Birmingham, AL, USA
Hemant K. Tiwari
Department of Quantitative and Qualitative Health Sciences, UT Health San Antonio School of Public Health, San Antonia, TX, USA
Ramachandran S. Vasan
School of Medicine, National University of Samoa, Apia, Samoa
Satupa’itea Viali
Department of Chronic Disease Epidemiology, Yale University School of Public Health, New Haven, CT, USA
Satupa’itea Viali
Oceania University of Medicine, Apia, Samoa
Satupa’itea Viali
Department of Epidemiology, Fairbanks School of Public Health, Indiana University, Indianapolis, IN, USA
Jennifer Wessel
Diabetes Translational Research Center, Indiana University, Indianapolis, IN, USA
Jennifer Wessel
Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, QC, Canada
Josée Dupuis
Division of General Internal Medicine, Massachusetts General Hospital, Boston, MA, USA
James B. Meigs
Division of Biostatistics, Data Science Institute, and Cancer Center, Medical College of Wisconsin, Milwaukee, WI, USA
Paul L. Auer
Metabolism Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
Alisa K. Manning
Clinical and Translational Epidemiology Unit, Mongan Institute, Massachusetts General Hospital, Boston, MA, USA
Alisa K. Manning
Department of Biostatistics, University of Washington, Seattle, WA, USA
Erin Buth, Matthew Conomos, Ben Heavner, Susanne May, Caitlin McHugh, Sarah C. Nelson, Catherine Tong, Kayleen Williams & Kenneth M. Rice
Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, NY, USA
Zhonghua Liu
Department of Statistics, Harvard University, Cambridge, MA, USA
Xihong Lin
New York Genome Center, New York, NY, USA
Namiko Abe, Karen Bunting, Bo-Juen Chen, Soren Germer, Tanja Smith & Michael Zody
University of Michigan, Ann Arbor, MI, USA
Gonçalo Abecasis, Larry Bielak, Thomas Blackwell, Matthew Flickinger, Colin Gross, Jonathon LeFaive, Jacob Pleiness, Albert Vernon Smith, Daniel Taliun, Peter VandeHaar, Jiongming Wang, Ketian Yu & Sebastian Zoellner
Broad Institute, Cambridge, MA, USA
Francois Aguet, Kristin Ardlie, Mark Chaffin, Seung Hoan Choi, Stacey Gabriel, Namrata Gupta, Carolina Roselli & Seyedeh Maryam Zekavat
Cedars-Sinai Medical Center, Boston, MA, USA
Christine Albert
Children’s Hospital of Philadelphia, University of Pennsylvania, Philadelphia, PA, USA
Laura Almasy
Emory University, Atlanta, GA, USA
Alvaro Alonso, Rich Johnston, Lawrence S. Phillips & Zhaohui Qin
University of Maryland, Baltimore, MD, USA
Seth Ament, Amber Beitelshees, Christy Chang, Coleen Damcott, Scott Devine, Mao Fu, Da-Wei Gong, Yue Guan, Elliott Hong, Michael Kessler, Joshua Lewis, Patrick McArdle, Tim O’Connor, James Perry, Toni Pollin, Robert Reed, Kathleen Ryan, Amol Shetty, Elizabeth Streeten, Simeon Taylor & Huichun Xu
University of Washington, Seattle, WA, USA
Peter Anderson, Jai Broome, Colleen Davis, Leslie Emery, Chris Frazar, Stephanie M. Fullerton, Stephanie Gogarten, Deepti Jain, Craig Johnson, Alyna Khan, Cathy Laurie, Cecelia Laurie, David Levine, Sarah Ruuska, Josh Smith, Nona Sotoodehnia, Adrienne M. Stilp, Adam Szpiro, Timothy A. Thornton, David Tirschwell, Fei Fei Wang, Bruce Weir & Quenna Wong
University of Mississippi, Jackson, MS, USA
Pramod Anugu, Lynette Ekunwe, Yan Gao, Hao Mei & Nancy Min
National Institutes of Health, Bethesda, MD, USA
Deborah Applebaum-Bowden
Johns Hopkins University, Baltimore, MD, USA
Dan Arking, Dimitrios Avramopoulos, Emily Barron-Casella, Terri Beaty, Lewis Becker, James Casella, Kimberly Jones, Barry Make, Rakhi Naik, Ingo Ruczinski, Steven Salzberg, Margaret Taub & Dhananjay Vaidya
Duke University, Durham, NC, USA
Allison Ashley-Koch & Marilyn Telen
University of Alabama, Birmingham, AL, USA
Stella Aslibekyan & Bertha Hidalgo
Stanford University, Stanford, CA, USA
Tim Assimes, Chris Gignoux & Marco Perez
Department of Medicine, Providence Health Care, Vancouver, British Columbia, Canada
Najib Ayas
Cleveland Clinic, Cleveland, OH, USA
John Barnard & Mina Chung
Tempus, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
Kathleen Barnes
Columbia University, New York, NY, USA
R. Graham Barr & Danish Saleheen
The Emmes Corporation, LTRC, Rockville, MD, USA
Lucas Barwick
Cleveland Clinic, Quantitative Health Sciences, Cleveland, OH, USA
Gerald Beck
Department of Medicine, Johns Hopkins University, Baltimore, MD, USA
Diane Becker
National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
Rebecca Beer, Weiniu Gan, Cashell Jaquish, Andrew Johnson, Daniel Levy, James Luo, Julie Mikulla, George Papanicolaou & Pankaj Qasba
Department of Epidemiology, University of Florida, Gainesville, FL, USA
Takis Benos
Fundação de Hematologia e Hemoterapia de PernambucoHemope, Recife, Brazil
Marcos Bezerra
Department of Obstetrics and Gynecology, University of Utah, Salt Lake City, UT, USA
Nathan Blue
National Jewish Health, National Jewish Health, Denver, CO, USA
Russell Bowler
Department of Pediatrics, Medical College of Wisconsin, Milwaukee, WI, USA
Ulrich Broeckel
Department of Pediatrics, University of Texas Health at Houston, Houston, TX, USA
Deborah Brown
University of California at San Francisco, San Francisco, CA, USA
Esteban Burchard, Ryan Hernandez & Shannon Kelly
Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
Carlos Bustamante
University of Colorado at Denver, Denver, CO, USA
Jonathan Cardwell, Sameer Chavan, Michelle Daya, Shanshan Gao, Daniel Grine, John Hokanson, Ethan Lange, Susan Mathai, Bonnie Neltner, Meher Preethi Boorgula, Pamela Russell, David Schwartz, Aniket Shetty, Tarik Walker, Avram Walts & Ivana Yang
Brigham & Women’s Hospital, Boston, MA, USA
Vincent Carey, Juan P. Casas Romero, Michael Cho, Dawn DeMeo, Auyon Ghosh, Meryl LeBoff, Jiwon Lee, JoAnn Manson, Dandi Qiao, Edwin Silverman, Jody Sylvia & Carla Wilson
University of Montreal, Montreal, Canada
Julie Carrier
Washington State University at Pullman, Pullman, WA, USA
Cara Carty
University of California at Los Angeles, Los Angeles, CA, USA
Richard Casaburi, Carolyn Crandall & Karol Watson
Department of Medicine, Brigham & Women’s Hospital, Boston, MA, USA
Peter Castaldi & Matt Moll
National Taiwan University, Taipei, Taiwan
Yi-Cheng Chang
Division of Preventive Medicine, Brigham & Women’s Hospital, Boston, MA, USA
Daniel Chasman
University of Virginia, Charlottesville, VA, USA
Wei-Min Chen, Charles Farber, Ani Manichaikul, Josyf C. Mychaleckyj & Aakrosh Ratan
National Taiwan University Hospital, National Taiwan University, Taipei, Taiwan
Lee-Ming Chuang
National Health Research Institute Taiwan, Miaoli County, Taiwan
Ren-Hua Chung
Metabolomics Platform, Broad Institute, Cambridge, MA, USA
Clary Clish
Department of Immunity and Immunology, Cleveland Clinic, Cleveland, OH, USA
Suzy Comhair
University of Vermont, Burlington, VT, USA
Elaine Cornell
Department of Population Health Science, University of Mississippi, Jackson, MS, USA
Adolfo Correa
National Jewish Health, Denver, CO, USA
James Crapo & Snow Xueyan Zhao
Department of Biostatistics, Boston University, Boston, MA, USA
L. Adrienne Cupples
Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
Jeffrey Curtis & Cristen Willer
Vitalant Research Institute, San Francisco, CA, USA
Brian Custer
University of Illinois at Chicago, Chicago, IL, USA
Dawood Darbar
University of Chicago, Chicago, IL, USA
Sean David
Vanderbilt University, Nashville, TN, USA
Michael DeBaun
University of Cincinnati, Cincinnati, OH, USA
Ranjan Deka
University of North Carolina, Chapel Hill, NC, USA
Qing Duan
University of Texas Rio Grande Valley School of Medicine, Edinburg, TX, USA
Ravi Duggirala & Juan Manuel Peralta
Department of Pathology and Laboratory Medicine, University of Vermont, Burlington, VT, USA
Jon Peter Durda & Russell Tracy
Department of Genetics, Washington University in St Louis, St Louis, MO, USA
Susan K. Dutcher
Brown University, Providence, RI, USA
Charles Eaton
Channing Division of Network Medicine, Harvard University, Cambridge, MA, USA
Adel El Boueiz
Massachusetts General Hospital, Boston, MA, USA
Patrick T. Ellinor, Steven A. Lubitz & Lu-Chen Weng
Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
Serpil Erzurum
Center for Genes, Environment and Health, National Jewish Health, Denver, CO, USA
Tasha Fingerlin
Washington University in St Louis, St Louis, MO, USA
Lucinda Fulton, D. C. Rao, Karen Schwander & Yun Ju Sung
Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Margery Gass & Jeff Haessler
New York Genome Center, New York City, NY, USA
Heather Geiger, Melissa Marton, Catherine Reeves, Nicolas Robine, Alexi Runnels & Lara Winterkorn
Icahn School of Medicine at Mount Sinai, New York, NY, USA
Bruce Gelb, Eimear Kenny, Girish Nadkarni & Michael Preuss
University of Pittsburgh, Pittsburgh, PA, USA
Mark Geraci, Mark Gladwin, Ryan L. Minster & Frank Sciurba
Beth Israel Deaconess Medical Center, Boston, MA, USA
Robert Gerszten & Tamar Sofer
Department of Psychiatry, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA
David Glahn
University of Texas Rio Grande Valley School of Medicine, San Antonio, TX, USA
Harald Goring
University of Colorado Anschutz Medical Campus, Aurora, CO, USA
Sharon Graw, Luisa Mestroni & Matthew Taylor
Department of Obstetrics and Gynecology, Mass General Brigham, Boston, MA, USA
Kathryn J. Gray
Lundquist Institute, Torrance, CA, USA
Xiuqing Guo, Xiaohui Li & Henry Lin
Department of Cardiology, University of Mississippi, Jackson, MS, USA
Michael Hall
Department of Medicine, University of Calgary, Calgary, Canada
Patrick Hanly
Department of Genetics, University of Maryland, Philadelphia, PA, USA
Daniel Harris
Department of Chronic Disease Epidemiology, Yale University, New Haven, CT, USA
Nicola L. Hawley
Department of Epidemiology, University of Washington, Seattle, WA, USA
Susan Heckbert & Nicholas Smith
Wake Forest Baptist Health, Winston-Salem, NC, USA
David Herrington
Channing Division of Network Medicine, Brigham & Women’s Hospital, Boston, MA, USA
Craig Hersh
University of Texas Health at Houston, Houston, TX, USA
James Hixson
Regeneron Genetics Center, Boston, MA, USA
Brian Hobbs
University of Iowa, Iowa City, IO, USA
Karin Hoth & Robert Wallace
Institute of Population Health Sciences, National Health Research Institute Taiwan, Miaoli County, Taiwan
Chao Agnes Hsiung
Blood Works Northwest, Seattle, WA, USA
Haley Huston
Taichung Veterans General Hospital Taiwan, Taichung City, Taiwan
Chii Min Hwu, Wen-Jane Lee & Wayne Hui-Heng Sheu
Divisions of Endocrinology, Diabetes and Metabolism and Internal Medicine, Oklahoma State University Medical Center, Columbus, OH, USA
Rebecca Jackson
Department of Medicine, University of Washington, Seattle, WA, USA
Jill Johnsen
Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
Hyun Min Kang & Joshua Weinstock
Harvard University, Cambridge, MA, USA
Wonji Kim & Sean McFarland
McGill University, Montréal, Quebec, Canada
John Kimoff
Department of Epidemiology, University of Colorado at Denver, Aurora, CO, USA
Greg Kinney
Department of Medicine, Blood Works Northwest, Seattle, WA, USA
Barbara Konkle
Department of Public Health Sciences, Loyola University, Maywood, IL, USA
Holly Kramer
Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA
Christoph Lange
Department of Medicine, University of Colorado at Denver, Aurora, CO, USA
Leslie Lange
Department of Epidemiology and Medicine, Brown University, Providence, RI, USA
Simin Liu
Department of Cardiology, Duke University, Durham, NC, USA
Yongmei Liu
Cardiovascular Institute, Stanford University, Stanford, CA, USA
Yu Liu
Boston University, Boston, MA, USA
Kathryn L. Lunetta
Department of Critical Care and Sleep Medicine, Division of Pulmonary, The Ohio State University, Columbus, OH, USA
Ulysses Magalang
University of Alabama at Birmingham, Birmingham, AL, USA
Merry-Lynn McDonald
Department of Genome Sciences, University of Washington, Seattle, WA, USA
Daniel McGoldrick, Deborah Nickerson & Machiko Threlkeld
RTI International, Research Triangle Park, NC, USA
Becky McNeil
University of Arizona, Tucson, AZ, USA
Deborah A. Meyers
Center For Sleep Sciences and Medicine, Stanford University, Palo Alto, CA, USA
Emmanuel Mignot
National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD, USA
Mollie Minear
Department of Genes and Human Disease, Oklahoma Medical Research Foundation, Oklahoma City, OK, USA
Courtney Montgomery
Howard University, Washington, DC, USA
Sergei Nekhai
University of Maryland, Balitmore, MD, USA
Jeff O’Connell
University at Buffalo, Buffalo, NY, USA
Heather Ochs-Balcom
Division of Sleep Medicine/Department of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Allan Pack
Stanford Cardiovascular Institute, Stanford University, Stanford, CA, USA
David T. Paik & Joseph Wu
University of Minnesota, Minneapolis, MN, USA
James Pankow, Michael Tsai & Scott Vrieze
Biostatistics and Epidemiology Division, RTI International, Research Triangle Park, NC, USA
Cora Parker
Fred Hutchinson Cancer Research Center and UW Medicine, Seattle, WA, USA
Ulrike Peters
Departments of Cardiology and Medicine, Johns Hopkins University, Baltimore, MD, USA
Wendy Post
Department of Medicine, University of Colorado at Denver, Denver, CO, USA
Julia Powers Becker
Colorado Center for Personalized Medicine, University of Colorado at Denver, Denver, CO, USA
Nicholas Rafaels
Northwestern University, Chicago, IL, USA
Laura Rasmussen-Torvik
Department of Medicine, National Jewish Health, Denver, CO, USA
Elizabeth Regan
Lutia I Puava Ae Mapu I Fagalele, Apia, Samoa
Muagututi’a Sefuiva Reupena
Sleep Research Unit and Institute for Mental Health Research, University of Ottawa, Ottawa, Ontario, Canada
Rebecca Robillard
Departments of Medicine, Pharmacology and Biomedical Informatics, Vanderbilt University, Nashville, TN, USA
Dan Roden
Faculdade de Medicina, Universidade de Sao Paulo, Sao Paulo, Brazil
Ester Cerdeira Sabino
Department of Pathology, University of Maryland, Seattle, WA, USA
Shabnam Salimi
Lundquist Institute, TGPS, Torrance, CA, USA
Kevin Sandow
Division of Hematology and Oncology, Harvard University, Boston, MA, USA
Vijay G. Sankaran
Department of Genetics, Harvard Medical School, Boston, MA, USA
Christine Seidman
Harvard Medical School, Boston, MA, USA
Jonathan Seidman
Department of Pediatrics, Emory University, Atlanta, GA, USA
Vivien Sheehan
Department of Human Genetics, Emory University, Atlanta, GA, USA
Stephanie L. Sherman
Departments of Medicine and Cardiology, Vanderbilt University, Nashville, TN, USA
M. Benjamin Shoemaker
UMass Memorial Medical Center, Worcester, MA, USA
Brian Silver
University of Saskatchewan, Saskatoon, Saskatchewan, Canada
Robert Skomro
Albert Einstein College of Medicine, New York, NY, USA
Sylvia Smoller
Department of Biostatistical Sciences, Wake Forest Baptist Health, Winston-Salem, NC, USA
Beverly Snively
Department of Genetics, Stanford University, Stanford, CA, USA
Michael Snyder & Hua Tang
Department of Genomic Cardiology, University of Colorado at Denver, Aurora, CO, USA
Garrett Storm
Channing Department of Medicine, Brigham & Women’s Hospital, Boston, MA, USA
Jessica Lasky Su
Université Laval, Quebec City, Quebec, Canada
Frédéric Sériès
Cancer Prevention Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Lesley Tinker
Department of Genetics, University of Pennsylvania, Philadelphia, PA, USA
Sarah Tishkoff
USC Methylation Characterization Center, University of Southern California, Los Angeles, CA, USA
David Van Den Berg
Brigham & Women’s Hospital, Mass General Brigham, Boston, MA, USA
Heming Wang
Department of Human Genetics, University of Pittsburgh, Pittsburgh, PA, USA
Daniel E. Weeks
Department of Medicine and Channing Division of Network Medicine, Brigham & Women’s Hospital, Boston, MA, USA
Scott T. Weiss
Henry Ford Health System, Detroit, MI, USA
L. Keoki Williams
Case Western Reserve University, Cleveland, OH, USA
Scott Williams
Department of Cardiology, Beth Israel Deaconess Medical Center, Cambridge, MA, USA
James Wilson
Department of Medicine, Henry Ford Health System, Detroit, MA, USA
Baojun Wu
Department of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
Yingze Zhang
Department of Epidemiology, University of Michigan, Ann Arbor, MI, USA
Wei Zhao
Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, USA
Xiaofeng Zhu
Department of Medicine, University of California at San Francisco, San Francisco, CA, USA
Elad Ziv
Health Quantitative Sciences Research, Mayo Clinic, Rochester, MI, USA
Mariza de Andrade
Cardiovascular Division, Department of Medicine, Washington University in St Louis, St Louis, MO, USA
Lisa de las Fuentes

Authors

Xihao Li
View author publications
Search author on:PubMed Google Scholar
Han Chen
View author publications
Search author on:PubMed Google Scholar
Margaret Sunitha Selvaraj
View author publications
Search author on:PubMed Google Scholar
Eric Van Buren
View author publications
Search author on:PubMed Google Scholar
Hufeng Zhou
View author publications
Search author on:PubMed Google Scholar
Yuxuan Wang
View author publications
Search author on:PubMed Google Scholar
Ryan Sun
View author publications
Search author on:PubMed Google Scholar
Zachary R. McCaw
View author publications
Search author on:PubMed Google Scholar
Zhi Yu
View author publications
Search author on:PubMed Google Scholar
Min-Zhi Jiang
View author publications
Search author on:PubMed Google Scholar
Daniel DiCorpo
View author publications
Search author on:PubMed Google Scholar
Sheila M. Gaynor
View author publications
Search author on:PubMed Google Scholar
Rounak Dey
View author publications
Search author on:PubMed Google Scholar
Donna K. Arnett
View author publications
Search author on:PubMed Google Scholar
Emelia J. Benjamin
View author publications
Search author on:PubMed Google Scholar
Joshua C. Bis
View author publications
Search author on:PubMed Google Scholar
John Blangero
View author publications
Search author on:PubMed Google Scholar
Eric Boerwinkle
View author publications
Search author on:PubMed Google Scholar
Donald W. Bowden
View author publications
Search author on:PubMed Google Scholar
Jennifer A. Brody
View author publications
Search author on:PubMed Google Scholar
Brian E. Cade
View author publications
Search author on:PubMed Google Scholar
April P. Carson
View author publications
Search author on:PubMed Google Scholar
Jenna C. Carlson
View author publications
Search author on:PubMed Google Scholar
Nathalie Chami
View author publications
Search author on:PubMed Google Scholar
Yii-Der Ida Chen
View author publications
Search author on:PubMed Google Scholar
Joanne E. Curran
View author publications
Search author on:PubMed Google Scholar
Paul S. de Vries
View author publications
Search author on:PubMed Google Scholar
Myriam Fornage
View author publications
Search author on:PubMed Google Scholar
Nora Franceschini
View author publications
Search author on:PubMed Google Scholar
Barry I. Freedman
View author publications
Search author on:PubMed Google Scholar
Charles Gu
View author publications
Search author on:PubMed Google Scholar
Nancy L. Heard-Costa
View author publications
Search author on:PubMed Google Scholar
Jiang He
View author publications
Search author on:PubMed Google Scholar
Lifang Hou
View author publications
Search author on:PubMed Google Scholar
Yi-Jen Hung
View author publications
Search author on:PubMed Google Scholar
Marguerite R. Irvin
View author publications
Search author on:PubMed Google Scholar
Robert C. Kaplan
View author publications
Search author on:PubMed Google Scholar
Sharon L. R. Kardia
View author publications
Search author on:PubMed Google Scholar
Tanika N. Kelly
View author publications
Search author on:PubMed Google Scholar
Iain Konigsberg
View author publications
Search author on:PubMed Google Scholar
Charles Kooperberg
View author publications
Search author on:PubMed Google Scholar
Brian G. Kral
View author publications
Search author on:PubMed Google Scholar
Changwei Li
View author publications
Search author on:PubMed Google Scholar
Yun Li
View author publications
Search author on:PubMed Google Scholar
Honghuang Lin
View author publications
Search author on:PubMed Google Scholar
Ching-Ti Liu
View author publications
Search author on:PubMed Google Scholar
Ruth J. F. Loos
View author publications
Search author on:PubMed Google Scholar
Michael C. Mahaney
View author publications
Search author on:PubMed Google Scholar
Lisa W. Martin
View author publications
Search author on:PubMed Google Scholar
Rasika A. Mathias
View author publications
Search author on:PubMed Google Scholar
Braxton D. Mitchell
View author publications
Search author on:PubMed Google Scholar
May E. Montasser
View author publications
Search author on:PubMed Google Scholar
Alanna C. Morrison
View author publications
Search author on:PubMed Google Scholar
Take Naseri
View author publications
Search author on:PubMed Google Scholar
Kari E. North
View author publications
Search author on:PubMed Google Scholar
Nicholette D. Palmer
View author publications
Search author on:PubMed Google Scholar
Patricia A. Peyser
View author publications
Search author on:PubMed Google Scholar
Bruce M. Psaty
View author publications
Search author on:PubMed Google Scholar
Susan Redline
View author publications
Search author on:PubMed Google Scholar
Alexander P. Reiner
View author publications
Search author on:PubMed Google Scholar
Stephen S. Rich
View author publications
Search author on:PubMed Google Scholar
Colleen M. Sitlani
View author publications
Search author on:PubMed Google Scholar
Jennifer A. Smith
View author publications
Search author on:PubMed Google Scholar
Kent D. Taylor
View author publications
Search author on:PubMed Google Scholar
Hemant K. Tiwari
View author publications
Search author on:PubMed Google Scholar
Ramachandran S. Vasan
View author publications
Search author on:PubMed Google Scholar
Satupa’itea Viali
View author publications
Search author on:PubMed Google Scholar
Zhe Wang
View author publications
Search author on:PubMed Google Scholar
Jennifer Wessel
View author publications
Search author on:PubMed Google Scholar
Lisa R. Yanek
View author publications
Search author on:PubMed Google Scholar
Bing Yu
View author publications
Search author on:PubMed Google Scholar
Josée Dupuis
View author publications
Search author on:PubMed Google Scholar
James B. Meigs
View author publications
Search author on:PubMed Google Scholar
Paul L. Auer
View author publications
Search author on:PubMed Google Scholar
Laura M. Raffield
View author publications
Search author on:PubMed Google Scholar
Alisa K. Manning
View author publications
Search author on:PubMed Google Scholar
Kenneth M. Rice
View author publications
Search author on:PubMed Google Scholar
Jerome I. Rotter
View author publications
Search author on:PubMed Google Scholar
Gina M. Peloso
View author publications
Search author on:PubMed Google Scholar
Pradeep Natarajan
View author publications
Search author on:PubMed Google Scholar
Zilin Li
View author publications
Search author on:PubMed Google Scholar
Zhonghua Liu
View author publications
Search author on:PubMed Google Scholar
Xihong Lin
View author publications
Search author on:PubMed Google Scholar

Consortia

NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium

Namiko Abe
, Gonçalo Abecasis
, Francois Aguet
, Christine Albert
, Laura Almasy
, Alvaro Alonso
, Seth Ament
, Peter Anderson
, Pramod Anugu
, Deborah Applebaum-Bowden
, Kristin Ardlie
, Dan Arking
, Allison Ashley-Koch
, Stella Aslibekyan
, Tim Assimes
, Dimitrios Avramopoulos
, Najib Ayas
, Adithya Balasubramanian
, John Barnard
, Kathleen Barnes
, R. Graham Barr
, Emily Barron-Casella
, Lucas Barwick
, Terri Beaty
, Gerald Beck
, Diane Becker
, Lewis Becker
, Rebecca Beer
, Amber Beitelshees
, Takis Benos
, Marcos Bezerra
, Larry Bielak
, Thomas Blackwell
, Nathan Blue
, Russell Bowler
, Ulrich Broeckel
, Jai Broome
, Deborah Brown
, Karen Bunting
, Esteban Burchard
, Carlos Bustamante
, Erin Buth
, Jonathan Cardwell
, Vincent Carey
, Julie Carrier
, Cara Carty
, Richard Casaburi
, Juan P. Casas Romero
, James Casella
, Peter Castaldi
, Mark Chaffin
, Christy Chang
, Yi-Cheng Chang
, Daniel Chasman
, Sameer Chavan
, Bo-Juen Chen
, Wei-Min Chen
, Michael Cho
, Seung Hoan Choi
, Lee-Ming Chuang
, Mina Chung
, Ren-Hua Chung
, Clary Clish
, Suzy Comhair
, Matthew Conomos
, Elaine Cornell
, Adolfo Correa
, Carolyn Crandall
, James Crapo
, L. Adrienne Cupples
, Jeffrey Curtis
, Brian Custer
, Coleen Damcott
, Dawood Darbar
, Sean David
, Colleen Davis
, Michelle Daya
, Michael DeBaun
, Dawn DeMeo
, Ranjan Deka
, Scott Devine
, Huyen Dinh
, Harsha Doddapaneni
, Qing Duan
, Shannon Dugan-Perez
, Ravi Duggirala
, Jon Peter Durda
, Susan K. Dutcher
, Charles Eaton
, Lynette Ekunwe
, Adel El Boueiz
, Patrick T. Ellinor
, Leslie Emery
, Serpil Erzurum
, Charles Farber
, Jesse Farek
, Tasha Fingerlin
, Matthew Flickinger
, Chris Frazar
, Mao Fu
, Stephanie M. Fullerton
, Lucinda Fulton
, Stacey Gabriel
, Weiniu Gan
, Shanshan Gao
, Yan Gao
, Margery Gass
, Heather Geiger
, Bruce Gelb
, Mark Geraci
, Soren Germer
, Robert Gerszten
, Auyon Ghosh
, Richard Gibbs
, Chris Gignoux
, Mark Gladwin
, David Glahn
, Stephanie Gogarten
, Da-Wei Gong
, Harald Goring
, Sharon Graw
, Kathryn J. Gray
, Daniel Grine
, Colin Gross
, Yue Guan
, Xiuqing Guo
, Namrata Gupta
, Jeff Haessler
, Michael Hall
, Yi Han
, Patrick Hanly
, Daniel Harris
, Nicola L. Hawley
, Ben Heavner
, Susan Heckbert
, Ryan Hernandez
, David Herrington
, Craig Hersh
, Bertha Hidalgo
, James Hixson
, Brian Hobbs
, John Hokanson
, Elliott Hong
, Karin Hoth
, Chao Agnes Hsiung
, Jianhong Hu
, Haley Huston
, Chii Min Hwu
, Rebecca Jackson
, Deepti Jain
, Cashell Jaquish
, Jill Johnsen
, Andrew Johnson
, Craig Johnson
, Rich Johnston
, Kimberly Jones
, Hyun Min Kang
, Shannon Kelly
, Eimear Kenny
, Michael Kessler
, Alyna Khan
, Ziad Khan
, Wonji Kim
, John Kimoff
, Greg Kinney
, Barbara Konkle
, Holly Kramer
, Christoph Lange
, Ethan Lange
, Leslie Lange
, Cathy Laurie
, Cecelia Laurie
, Meryl LeBoff
, Jonathon LeFaive
, Jiwon Lee
, Sandra Lee
, Wen-Jane Lee
, David Levine
, Daniel Levy
, Joshua Lewis
, Xiaohui Li
, Henry Lin
, Simin Liu
, Yongmei Liu
, Yu Liu
, Steven A. Lubitz
, Kathryn L. Lunetta
, James Luo
, Ulysses Magalang
, Barry Make
, Ani Manichaikul
, JoAnn Manson
, Melissa Marton
, Susan Mathai
, Susanne May
, Patrick McArdle
, Merry-Lynn McDonald
, Sean McFarland
, Stephen McGarvey
, Daniel McGoldrick
, Caitlin McHugh
, Becky McNeil
, Hao Mei
, Vipin Menon
, Luisa Mestroni
, Ginger Metcalf
, Deborah A. Meyers
, Emmanuel Mignot
, Julie Mikulla
, Nancy Min
, Mollie Minear
, Ryan L. Minster
, Matt Moll
, Zeineen Momin
, Courtney Montgomery
, Donna Muzny
, Josyf C. Mychaleckyj
, Girish Nadkarni
, Rakhi Naik
, Sergei Nekhai
, Sarah C. Nelson
, Bonnie Neltner
, Caitlin Nessner
, Deborah Nickerson
, Osuji Nkechinyere
, Jeff O’Connell
, Tim O’Connor
, Heather Ochs-Balcom
, Geoffrey Okwuonu
, Allan Pack
, David T. Paik
, James Pankow
, George Papanicolaou
, Cora Parker
, Juan Manuel Peralta
, Marco Perez
, James Perry
, Ulrike Peters
, Lawrence S. Phillips
, Jacob Pleiness
, Toni Pollin
, Wendy Post
, Julia Powers Becker
, Meher Preethi Boorgula
, Michael Preuss
, Pankaj Qasba
, Dandi Qiao
, Zhaohui Qin
, Nicholas Rafaels
, Mahitha Rajendran
, D. C. Rao
, Laura Rasmussen-Torvik
, Aakrosh Ratan
, Robert Reed
, Catherine Reeves
, Elizabeth Regan
, Muagututi’a Sefuiva Reupena
, Rebecca Robillard
, Nicolas Robine
, Dan Roden
, Carolina Roselli
, Ingo Ruczinski
, Alexi Runnels
, Pamela Russell
, Sarah Ruuska
, Kathleen Ryan
, Ester Cerdeira Sabino
, Danish Saleheen
, Shabnam Salimi
, Sejal Salvi
, Steven Salzberg
, Kevin Sandow
, Vijay G. Sankaran
, Jireh Santibanez
, Karen Schwander
, David Schwartz
, Frank Sciurba
, Christine Seidman
, Jonathan Seidman
, Vivien Sheehan
, Stephanie L. Sherman
, Amol Shetty
, Aniket Shetty
, Wayne Hui-Heng Sheu
, M. Benjamin Shoemaker
, Brian Silver
, Edwin Silverman
, Robert Skomro
, Albert Vernon Smith
, Josh Smith
, Nicholas Smith
, Tanja Smith
, Sylvia Smoller
, Beverly Snively
, Michael Snyder
, Tamar Sofer
, Nona Sotoodehnia
, Adrienne M. Stilp
, Garrett Storm
, Elizabeth Streeten
, Jessica Lasky Su
, Yun Ju Sung
, Jody Sylvia
, Adam Szpiro
, Frédéric Sériès
, Daniel Taliun
, Hua Tang
, Margaret Taub
, Matthew Taylor
, Simeon Taylor
, Marilyn Telen
, Timothy A. Thornton
, Machiko Threlkeld
, Lesley Tinker
, David Tirschwell
, Sarah Tishkoff
, Catherine Tong
, Russell Tracy
, Michael Tsai
, Dhananjay Vaidya
, David Van Den Berg
, Peter VandeHaar
, Scott Vrieze
, Tarik Walker
, Robert Wallace
, Avram Walts
, Fei Fei Wang
, Heming Wang
, Jiongming Wang
, Karol Watson
, Jennifer Watt
, Daniel E. Weeks
, Joshua Weinstock
, Bruce Weir
, Scott T. Weiss
, Lu-Chen Weng
, Cristen Willer
, Kayleen Williams
, L. Keoki Williams
, Scott Williams
, Carla Wilson
, James Wilson
, Lara Winterkorn
, Quenna Wong
, Baojun Wu
, Joseph Wu
, Huichun Xu
, Ivana Yang
, Ketian Yu
, Seyedeh Maryam Zekavat
, Yingze Zhang
, Snow Xueyan Zhao
, Wei Zhao
, Xiaofeng Zhu
, Elad Ziv
, Michael Zody
, Sebastian Zoellner
, Mariza de Andrade
& Lisa de las Fuentes

Contributions

X. Li, H.C., Z. Li, Z. Liu and X. Lin designed the experiments. X. Li, H.C., Z. Li and X. Lin performed the experiments. X. Li, H.C., M.S.S., E.V.B., Y.W., R.S., Z.R.M., Z.Y., M.-Z.J., D.D., S.M.G., R.D., D.K.A., E.J.B., J.C.B., J.B., E.B., D.W.B., J.A.B., B.E.C., A.P.C., J.C.C., N.C., Y.-D.I.C., J.E.C., P.S.d.V., M.F., N.F., B.I.F., C.G., N.L.H.C., J.H., L.H., Y.-J.H., M.R.I., R.C.K., S.L.R.K., T.N.K., I.K., C.K., B.G.K., C.L., Y.L., H.L., C.-T.L. R.J.F.L., M.C.M., L.W.M., R.A.M., B.D.M., M.E.M., A.C.M., T.N., K.E.N., N.D.P., P.A.P., B.M.P., S.R., A.P.R., S.S.R., C.M.S., J.A.S., K.D.T., H.K.T., R.S.V., S.V., Z.W., J.W., L.R.Y., B.Y., J.D., J.B.M., P.L.A., L.M.R., A.K.M., K.M.R., J.I.R., G.M.P., P.N., Z. Li, H.Z., Z. Liu and X. Lin acquired, analyzed or interpreted data. G.M.P., P.N. and the NHLBI TOPMed Lipids Working Group provided administrative, technical or material support. X. Li, Z. Li, Z. Liu and X. Lin drafted the paper and revised it according to suggestions by the coauthors. All authors critically reviewed the paper, suggested revisions as needed, and approved the final version.

Corresponding authors

Correspondence to Zilin Li, Zhonghua Liu or Xihong Lin.

Ethics declarations

Competing interests

Z.R.M. and R.D. are employees of Insitro. S.M.G. is an employee of Regeneron Genetics Center. M.E.M. receives research funding from Regeneron Pharmaceutical Inc., unrelated to this project. B.M.P. serves on the Steering Committee of the Yale Open Data Access Project funded by Johnson & Johnson. L.M.R. and S.S.R. are consultants for the TOPMed Administrative Coordinating Center (via Westat). P.N. reports research grants from Allelica, Amgen, Apple, Boston Scientific, Genentech/Roche and Novartis, personal fees from Allelica, Apple, AstraZeneca, Blackstone Life Sciences, Creative Education Concepts, CRISPR Therapeutics, Eli Lilly & Co, Esperion Therapeutics, Foresite Capital, Foresite Labs, Genentech/Roche, GV, HeartFlow, Magnet Biomedicine, Merck, Novartis, TenSixteen Bio and Tourmaline Bio, equity in Bolt, Candela, Mercury, MyOme, Parameter Health, Preciseli and TenSixteen Bio, and spousal employment at Vertex Pharmaceuticals, all unrelated to the present work. X. Lin is a consultant of AbbVie Pharmaceuticals and Verily Life Sciences. The other authors declare no competing interests.

Peer review

Peer review information

Nature Computational Science thanks Yuehua Cui, Yukinori Okada and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Ananya Rastogi, in collaboration with the Nature Computational Science team. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Manhattan plots and Q-Q plots for unconditional gene-centric coding, noncoding and genetic region (2-kb sliding window) multi-trait analysis of fasting glucose (FG) and fasting insulin (FI) using TOPMed data (n = 21,731).

a, Manhattan plots for unconditional gene-centric coding analysis of protein-coding genes. The horizontal line indicates a genome-wide MultiSTAAR-O P value threshold of \(5.00\times {10}^{-7}\). The significant threshold is defined by multiple comparisons using the Bonferroni correction (\(0.05/\left(\mathrm{20,000}\times 5\right)=5.00\times {10}^{-7}\)). Different symbols represent the MultiSTAAR-O P value of the protein-coding gene using different functional categories (putative loss-of-function, putative loss-of-function and disruptive missense, missense, disruptive missense, synonymous). b, Quantile-quantile plots for unconditional gene-centric coding analysis of protein-coding genes. Different symbols represent the MultiSTAAR-O P-value of the gene using different functional categories. c, Manhattan plots for unconditional gene-centric noncoding analysis of protein-coding genes. The horizontal line indicates a genome-wide MultiSTAAR-O P value threshold of \(3.57\times {10}^{-7}\). The significant threshold is defined by multiple comparisons using the Bonferroni correction (\(0.05/\left(\mathrm{20,000}\times 7\right)=3.57\times {10}^{-7}\)). Different symbols represent the MultiSTAAR-O P value of the protein-coding gene using different functional categories (upstream, downstream, UTR, promoter_CAGE, promoter_DHS, enhancer_CAGE, enhancer_DHS). Promoter_CAGE and promoter_DHS are the promoters with overlap of Cap Analysis of Gene Expression (CAGE) sites and DNase hypersensitivity (DHS) sites for a given gene, respectively. Enhancer_CAGE and enhancer_DHS are the enhancers in GeneHancer-predicted regions with the overlap of CAGE sites and DHS sites for a given gene, respectively. d, Quantile-quantile plots for unconditional gene-centric noncoding analysis of protein-coding genes. Different symbols represent the MultiSTAAR-O P-value of the gene using different functional categories. e, Manhattan plot showing the associations of 2.68 million 2-kb sliding windows versus \(-{\log }_{10}(P)\) of MultiSTAAR-O. The horizontal line indicates a genome-wide P value threshold of \(1.86\times {10}^{-8}\). f, Quantile-quantile plot of 2-kb sliding window MultiSTAAR-O P values. In panels, a, c and e, the chromosome number are indicated by the colors of dots. In all panels, MultiSTAAR-O is a two-sided test.

Extended Data Fig. 2 Manhattan plots and Q-Q plots for unconditional gene-centric coding, noncoding and genetic region (2-kb sliding window) multi-trait analysis of C-reactive protein (CRP), interleukin-6 (IL-6), lipoprotein-associated phospholipase A2 (Lp-PLA2) activity, and lipoprotein-associated phospholipase A2 (Lp-PLA2) mass using TOPMed data (n = 9,380).

a, Manhattan plots for unconditional gene-centric coding analysis of protein-coding genes. The horizontal line indicates a genome-wide MultiSTAAR-O P value threshold of \(5.00\times {10}^{-7}\). The significant threshold is defined by multiple comparisons using the Bonferroni correction (\(0.05/\left(\mathrm{20,000}\times 5\right)=5.00\times {10}^{-7}\)). Different symbols represent the MultiSTAAR-O P value of the protein-coding gene using different functional categories (putative loss-of-function, putative loss-of-function and disruptive missense, missense, disruptive missense, synonymous). b, Quantile-quantile plots for unconditional gene-centric coding analysis of protein-coding genes. Different symbols represent the MultiSTAAR-O P-value of the gene using different functional categories. c, Manhattan plots for unconditional gene-centric noncoding analysis of protein-coding genes. The horizontal line indicates a genome-wide MultiSTAAR-O P value threshold of \(3.57\times {10}^{-7}\). The significant threshold is defined by multiple comparisons using the Bonferroni correction (\(0.05/\left(\mathrm{20,000}\times 7\right)=3.57\times {10}^{-7}\)). Different symbols represent the MultiSTAAR-O P value of the protein-coding gene using different functional categories (upstream, downstream, UTR, promoter_CAGE, promoter_DHS, enhancer_CAGE, enhancer_DHS). Promoter_CAGE and promoter_DHS are the promoters with overlap of Cap Analysis of Gene Expression (CAGE) sites and DNase hypersensitivity (DHS) sites for a given gene, respectively. Enhancer_CAGE and enhancer_DHS are the enhancers in GeneHancer predicted regions with the overlap of CAGE sites and DHS sites for a given gene, respectively. d, Quantile-quantile plots for unconditional gene-centric noncoding analysis of protein-coding genes. Different symbols represent the MultiSTAAR-O P-value of the gene using different functional categories. e, Manhattan plot showing the associations of 2.67 million 2-kb sliding windows versus \(-{\log }_{10}(P)\) of MultiSTAAR-O. The horizontal line indicates a genome-wide P value threshold of \(1.87\times {10}^{-8}\). f, Quantile-quantile plot of 2-kb sliding window MultiSTAAR-O P values. In panels, a, c and e, the chromosome number are indicated by the colors of dots. In all panels, MultiSTAAR-O is a two-sided test.

Supplementary information

Supplementary Information Table of Contents, Supplementary Figs. 1–38, Tables 1–12 and Note.

Reporting Summary

Peer Review File

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Li, X., Chen, H., Selvaraj, M.S. et al. A statistical framework for multi-trait rare variant analysis in large-scale whole-genome sequencing studies. Nat Comput Sci 5, 125–143 (2025). https://doi.org/10.1038/s43588-024-00764-8

Download citation

Received: 12 November 2023
Accepted: 20 December 2024
Published: 07 February 2025
Version of record: 07 February 2025
Issue date: February 2025
DOI: https://doi.org/10.1038/s43588-024-00764-8

This article is cited by

Genomics of drug target prioritization for complex diseases
- Robert Chen
- Áine Duffy
- Ron Do
Nature Reviews Genetics (2025)
Network construction using sparse Gaussian graphical model based on GWAS summary statistics
- Megh Subedi
- Xuewei Cao
- Qiuying Sha
Scientific Reports (2025)