Main

Physical activity (PA) has pervasive health benefits, including beneficial influences on chronic conditions like cancer, hypertension and type 2 diabetes1. The majority of these associations are supported by cross-sectional studies2,3,4,5, which have great utility but cannot establish causality or determine the direction of causal relationships.

An individual’s level of PA is influenced by genetic factors, with estimated heritability (h2) between 48% and 71%6. To identify genetic variants relevant to PA, genome-wide association studies (GWAS) have been conducted, initially focusing on PA during leisure time7,8,9,10. Identifying genetic associations with PA affords us a unique opportunity; because we are born with genetic variations before exposures to psychological, social and environmental factors that may affect activity levels, we can begin to decompose some of the complex interactions that underlie and potentially confound the study of this important health trait. There is also a wide range of interindividual differences in requirements for PA in daily life. Some jobs require much more physical exertion than others; some individuals must exert more effort in their households than others. These kinds of PA tend to differ phenomenologically from exercise taken at leisure, occurring more in concentrated bursts in the first two cases, and with the potential to be more sustained in the latter case; yet the level of activity in different contexts is often treated as if equivalent. There are, however, prior reports that there may not be comparable health benefits for activity at leisure versus activity in other contexts11,12, and genetic analyses may help shed light on such differences.

Two previous studies of PA used UK Biobank (UKB) data13,14. The first study (n = 91,105) identified three variants associated with PA, based on wrist-worn accelerometer data14. The second study (n ≤ 377,234) identified ten significant loci across five phenotypes, three based on self-report data (moderate-to-vigorous PA (MVPA), vigorous PA (VPA) and strenuous sport or other exercise (SSOE) during leisure time), and two on wrist-worn accelerometer data13. A multiancestry meta-analysis combining data from 703,901 individuals from 51 studies identified 11 associated loci for MVPA15. Combining this phenotype with two traits phenotypically contrary to PA (leisure screen time and sedentary behavior at work), 99 associated loci were found, summing results from PA (11 loci), leisure screen time (88 loci) and sedentary behavior at work (4 loci); that is, most loci were mapped for phenotypes correlated to a lack of PA15.

We sought to maximize power to investigate PA genetics using a quantitative trait definition, combining different intensities of PA by incorporating their frequencies, using data from the Million Veteran Program (MVP). We focused on PA per se, as opposed to sedentary behavior, with our primary quantitative trait, All-PA–leisure, focused exclusively on PA during leisure time (Methods) with data from European (EUR; n = 189,812), African (AFR; n = 27,044) and Latin American (AMR; n = 10,263) ancestries. We studied this trait in two meta-analyses (All-PA–leisure + SSOE), one of EUR ancestry (n = 511,310) and one cross-ancestry (EUR, AFR and AMR ancestries; n = 548,617), with SSOE data from UKB (neff = 4/(1/ncases + 1/ncontrols) = 321,498), also based on information regarding PA during leisure time (Supplementary Fig. 1 and Supplementary Table 1)13. In doing so, we substantially increased the number of discovered loci while also enhancing the diversity of the sample beyond EUR. This enabled us to investigate, from a genomic perspective, the relationship of PA to a range of health traits, and especially protective effects of PA against health outcomes including type 2 diabetes, gastroesophageal reflux disease, abdominal aortic aneurysm, heart failure, hospitalization caused by COVID-19, osteoarthritis, parental survival and an aging phenotype. We also investigated PA traits related to different activity contexts (home, work, and leisure), for which we found substantial genetic differences.

Results

GWAS of All-PA–leisure and All-PA–leisure + SSOE meta-analysis

All-PA–leisure and SSOE phenotypes

GWAS of All-PA–leisure in MVP EUR data identified 14 lead SNPs at 13 loci with a genome-wide significant (GWS) threshold P value of 5 × 10−8 (or 6 lead SNPs in 6 loci if considering a threshold P = 5 × 10−9; Supplementary Fig. 2 and Supplementary Table 2), whereas there were no significant GWAS findings for MVP AFR and AMR data. GWAS of SSOE in UKB identified 15 lead SNPs in 14 loci with a threshold P value of 5 × 10−8 (8 lead SNPs in 7 loci if considering a threshold P= 5 × 10−9; Supplementary Fig. 3 and Supplementary Table 3).

The two most significant lead SNPs in the GWAS for All-PA–leisure (EUR, rs761898, P = 1.63 × 10−11; rs7613360, P = 3.15 × 10−11) map to intergenic regions on chromosomes 14 and 3, respectively (Supplementary Table 3). Additional GWS loci included SLC39A8*rs13107325 (P = 1.57 × 10−10; solute carrier family 39 member 8). As previously described13, the most significant variant for GWAS of SSOE in UKB is CADM2*rs62253088 (P = 1.0 × 10−19 in this study; Cell Adhesion Molecule 2).

A sex-stratified analysis of the X chromosome in MVP (All-PA–leisure) showed no GWS variants (Supplementary Fig. 4).

All-PA–leisure + SSOE meta-analyses

We calculated SNP-based heritability (SNP-h2) in MVP data for All-PA–leisure (Table 1), and the genetic correlation between All-PA–leisure and traits from the previous UKB report13 (Supplementary Notes).

Table 1 Estimated SNP-heritability (h2) of the cohorts used in the study

The EUR All-PA–leisure + SSOE meta-analysis, including the UKB phenotype SSOE and the MVP phenotype All-PA–leisure in a sample-size-based meta-analysis, greatly increased discovery, yielding 67 lead SNPs in 55 loci (Supplementary Fig. 5 and Supplementary Table 4), with intercept of 1.038 ± 0.011 and an attenuation ratio of the linkage disequilibrium score regression (LDSC) of 0.057 ± 0.016 (ref. 16). The cross-ancestry meta-analysis provided 70 lead SNPs in 56 loci (Fig. 1 and Supplementary Table 5). Differences in the number of lead SNPs and associated loci considering a more stringent threshold P ≤ 5 × 109 are reported in Supplementary Table 6. In both the meta-analyses, the strongest association was with CADM2*rs62253088 (EUR meta-analysis P = 2.0 × 10−21; cross-ancestry meta-analysis P = 6.1 × 10−20). The second strongest associated lead SNP was also on chromosome 3: MST1R*rs3733134 (EUR meta-analysis P = 1.4 × 10−17; cross-ancestry meta-analysis P = 6.9 × 10−18).

Fig. 1: Cross-ancestry All-PA–leisure + SSOE meta-analysis.
figure 1

Annotated genes are the closest to each significant lead SNP. Different genes compared to EUR All-PA–leisure meta-analysis (Supplementary Fig. 6) are highlighted in green. The red horizontal line indicates GWS (P = 5 × 10−8) level, where P values were calculated using sample-size-based approach, two-sided tests.

Gene-based and gene-set analyses of All-PA–leisure + SSOE

Focusing on the EUR meta-analysis, the gene-based test using MAGMA identified 119 significant genes from 19,030 protein-coding genes (GWS defined at P = 2.6 × 10−6; Supplementary Table 7). CADM2 was the most significant gene (P = 3.7 × 10−25) followed by NPC1 (P = 4.8 × 10−16). The fastENLOC17,18 analysis identified 41 genes with gene-level colocalization probability (GLCP) ≥ 0.9 (Supplementary Table 8).

For the EUR All-PA–leisure + SSOE meta-analysis, we also conducted transcriptome-wide association study (TWAS) and fine-mapping (Supplementary Results; Supplementary Tables 911). These analyses, together with the previously mentioned gene-based approaches, highlighted two genes (CADM2 and AKAP10) shared by four approaches, and eight genes (RGS12, NCP1, GATAD2A, DLAT, CYP46A1, ANK3, AFF3 and ACTN1) shared by three approaches (discussion in ‘Supplementary Results—Genes detected by gene-based approaches’). We also conducted MTAG analysis with leisure screen time and liking of PA (‘Supplementary Results—MTAG analyses of EUR All-PA–leisure + SSOE’; Supplementary Tables 12 and 13).

The following four significant gene ontology (GO) terms resulted from the gene-set analysis: structural constituent of the presynapse, presynaptic active zone organization, maintenance of presynaptic active zone structure and structural constituent of the synapse (Supplementary Table 14).

Tissue and cell-type specificity of All-PA–leisure + SSOE

Tissue Expression Analysis showed 11 significant enrichments in brain tissues, with the two most significant represented in the cerebellar hemisphere and in the cerebellum (Supplementary Figs. 6 and 7). For gene-based association analyses and gene-set analyses, similar results were obtained for the cross-ancestry meta-analysis (Supplementary Notes; Supplementary Tables 15 and 16 and Supplementary Figs. 8 and 9).

Cell-type-enrichment analysis was performed with a collection of human cell types from brain, blood and pancreas (all available tissues). In the EUR meta-analysis of All-PA–leisure + SSOE, the only significant cell type was GABA+ neurons from the GSE76381 Linnarsson Human Midbrain dataset (adjusted P value per dataset after multiple testing correction = 2.4 × 105). In the cross-ancestry meta-analysis, no cell types were significantly enriched. The results likely reflect the limited number of kinds of tissue included in this analysis.

Functional enrichment analysis

Functional enrichment analysis with g:Profiler using significant genes from the gene-based test as input, identified three significant GO terms shared by the EUR meta-analysis and the cross-ancestry meta-analysis (Supplementary Tables 17 and 18)—presynaptic active zone cytoplasmic component, cell cortex region and synapse. A human phenotype ontology term, polyclonal elevation of IgM, was also shared by the EUR meta-analysis and cross-ancestry meta-analysis.

Genetic correlations

We estimated the genetic correlation of the All-PA–leisure + SSOE phenotype (EUR meta-analysis) with traits of interest covering a broad range of diseases and physiological traits, using the Benjamini–Hochberg false discovery rate correction (Methods). Results showed significant negative genetic correlations between All-PA–leisure + SSOE with all disease traits studied, with the strongest values for stroke (rg = −0.62 ± 0.11; P = 3.54 × 10−9) and gastroesophageal reflux disease (rg = −0.482 ± 0023; P = 9.9 × 10−97; Fig. 2a, Supplementary Table 19 and Supplementary Notes). Conditional analysis adjusting for income as a measure of socioeconomic status (Supplementary Fig. 10), highlighted a few changes compared to previous genetic correlations (Supplementary Table 20). Genetic correlations of All-PA–leisure + SSOE with height and executive functioning became nonsignificant, whereas multiple sclerosis became significant (rg = −0.184 ± 0.086; P = 0.032). Consistent results were obtained from a phenome-wide association analysis (PheWAS; Fig. 2b; Supplementary Notes; Supplementary Table 21).

Fig. 2: Traits genetically correlated with All-PA–leisure + SSOE.
figure 2

a, Genetic correlations of EUR meta-analysis for All-PA–leisure + SSOE phenotype with traits of interest. We display here the traits of interest that were significant after the Benjamini–Hochberg false discovery procedure. Tests were two-sided. Black error bars represent the standard error. b, PheWAS of PA from the polygenic score derived from the EUR All-PA–leisure + SSOE meta-analysis. The red line represents the Bonferroni threshold P = 3.99 × 10−5. Up-pointing and down-pointing triangles represent positive or negative associations, respectively, which were calculated using a logistic regression model.

Local genetic correlations

We next considered local correlations (LAVA19) as global genetic correlations may mask important regional differences. There were 91 significant local bivariate genetic correlations at 61 loci between the EUR All-PA–leisure + SSOE meta-analysis and 18 traits in the genetic correlation analyses (Supplementary Table 21). Of these, oral cancer was not flagged by the global genetic correlation analysis. The trait with the highest number of significant local genetic correlations with All-PA–leisure + SSOE was educational attainment (31 loci), followed by body mass index (BMI; 15 loci) and income (9 loci). For two traits, height and well being, we observed local genetic correlations with EUR All-PA–leisure+SSOE in opposite directions (Supplementary Table 22).

Mendelian randomization (MR) analyses of All-PA–leisure

We used MR analyses to investigate the causal relationship between All-PA–leisure and traits with which it had a significant genetic correlation. Since several statistics for target traits included UKB data, we used the GWAS of All-PA–leisure in EUR MVP (and not the meta-analysis) to avoid sample overlap. The following four methods were tested: MR-Egger, weighted median, inverse-variance weighted (IVW) and simple mode. Using a threshold P < 1 × 10−5 for instrumental variables, we observed significant bidirectional negative (protective) effects of All-PA–leisure with respect to BMI, type 2 diabetes, leisure screen time, cigarettes per day and gastroesophageal reflux disease (Tables 2 and 3). Unlike bidirectional effects, which show effects both caused by All-PA–leisure and on All-PA–leisure (that is, PA increases or decreases risk for the comparator trait, and that trait also increases or decreases PA), unidirectional effects show only one direction of causality. There were unidirectional negative (protective) effects of All-PA–leisure on abdominal aortic aneurysm, heart failure, COVID-19 hospitalization, triglycerides, phenotypic age acceleration (an age-adjusted biological age measure; Supplementary Methods) and osteoarthritis (Tables 2 and 3). We found significant bidirectional positive effects between All-PA–leisure and HDL-cholesterol (HDL-C), parental survival, liking of PA, household income, well being and educational attainment. MR analyses with a stronger threshold P = 5 × 10−8 for instrumental variables showed similar causal relationships, and also a possible protective effect of All-PA–leisure on asthma (Supplementary Notes; Supplementary Tables 2325).

Table 2 MR analysis of causal effects of PA during leisure time (All-PA–leisure) on traits of interest
Table 3 MR analysis of causal effects of traits of interest on PA during leisure time (All-PA–leisure)

MVMR analyses testing BMI as confounder

To understand the apparent protective effect of All-PA–leisure with respect to COVID-19 better, we considered a possible role of BMI as confounder, as high BMI leads to worse COVID-19 outcomes20. We performed a multivariable MR (MVMR) analysis, with the All-PA–leisure phenotype and BMI as exposures, and COVID-19 hospitalization as outcome. In this analysis, we still observed a significant protective effect of All-PA–leisure on the COVID-19 trait which considers hospitalized cases (effect = −0.067 ± 0.016, P = 2.8 × 10−5; Table 4). Considering the role that BMI has in many other health outcomes, we also conducted MVMR, including BMI for all other traits that had significant MR results. Despite the reduced power of the analysis due to a decrease in the number of instrumental variables suitable for all three traits, we found significant protective effects of All-PA–leisure on HDL-C, triglycerides, osteoarthritis and gastroesophageal reflux disease (Table 4 and Supplementary Table 26).

Table 4 MVMR analysis of All-PA–leisure setting BMI as exposures

All-PA–leisure and home and All-PA–leisure, home and work phenotypes

To evaluate the potential gain in information from combining data from PA during leisure time with PA performed at home and work, we conducted two additional GWAS in the EUR population. First, we summed All-PA–leisure with All-PA from PA at home (All-PA–leisure and home; Methods; Supplementary Fig. 11). We next added information from All-PA at work (All-PA–leisure, home and work; Methods; Supplementary Fig. 12). In both these GWASs, we observed a substantial reduction of the number of association peaks compared to All-PA–leisure.

GWAS of vigorous PA phenotypes (leisure, at home and at work)

We ran analyses on different contexts of vigorous PA to understand genetic differences and benefits related to the context in which PA was performed.

Kinds of vigorous PA and their genetic interrelationships

We ran independent MVP GWAS of all vigorous activity types depending on the context (Vig-PA–leisure, Vig-PA–home and Vig-PA–work) in EUR and investigated correlations between each set of two to study their genetic differences. For Vig-PA–leisure and Vig-PA–home, rg = 0.64 ± 0.04 (P = 5.0 × 10−59); Vig-PA–leisure and Vig-PA–work, rg = 0.46 ± 0.07 (P = 9.8 × 10−11). Vig-PA–home and Vig-PA–work were more strongly correlated with each other (rg = 0.97 ± 0.05, P = 1.2 × 10−73). SNP-h2 of vigorous PA phenotypes are reported in Supplementary Table 27.

For each vigorous PA trait, we ran GWAS, and for Vig-PA–leisure we also conducted X chromosome-wide association study (XWAS), gene-based testing, TWAS, fine-mapping, functional enrichment, mitochondria–SNV association and gene-based association analysis on the mitochondrial genome (Supplementary Results; Supplementary Tables 2835 and Supplementary Figs. 1317).

Genetic correlations of vigorous PA traits

To understand the differences between the vigorous activity traits better, we calculated the genetic correlations between Vig-PA–leisure, Vig-PA–work and Vig-PA–home individually with the previously analyzed traits of interest (Supplementary Tables 3638). We observed more statistically significant differences between Vig-PA–leisure and Vig-PA–work (for 15 traits), than when considering Vig-PA–leisure and Vig-PA–home (for 12 traits) or Vig-PA–home and Vig-PA–work (for 3 traits; Fig. 3a). Among these three pairs of vigorous activity traits, there were numerous significant differences regarding genetic correlations with income, leisure screen time and educational attainment, reporting better health benefits from Vig-PA–leisure compared to the other two vigorous traits. Analyses performed adjusting the vigorous PA traits with income using mtCOJO are described in Supplementary Results, Supplementary Table 39 and Supplementary Fig. 18.

Fig. 3: Genetic differences among vigorous PA types.
figure 3

a, Genetic correlation between each pair of vigorous PA contexts and other traits of interest. Significantly different correlations were defined as having P < 4.63 × 10−4 after multiple testing correction. An asterisk represents a significant difference between Vig-PA–leisure and Vig-PA–work, wedge represents a significant difference between Vig-PA–leisure and Vig-PA–home, and plus represents a significant difference between Vig-PA–home and Vig-PA–work. Tests were two-sided. Black error bars represent the s.e. The genetic correlation values of the vigorous PA with the other traits of interest are provided in Supplementary Tables 3638. b, Genomic-SEM of the two-factor model, showing the loadings for the inferred traits—their standardized estimates with the s.e. in parentheses. The CFA uses a chi-square test.

Genomic structural equation modeling (genomic-SEM) analysis

Considering the differences identified among the three vigorous PA traits, which had shown different benefits according to the context, we used genomic-SEM analysis21 to understand their relationship better, inferring the overall genetic architecture among them and two other PA traits in EUR, All-PA–leisure + SSOE and liking of PA. The exploratory factor analysis (EFA) indicated that a two-factor model fit the data best (Fig. 3b), accounting for 91% of the cumulative variance explained (factor 1, 50%; factor 2, 41%). Both factors had strong strength of sum of squares (SS) loadings (factor 1 SS = 2.50; factor 2 SS = 2.04). Confirmatory factor analysis (CFA) showed that two correlated factors model fit the data well, with comparative fit index = 0.97, chi-square = 109.03, Akaike information criterion = 129.03 and standardized root mean square residual = 0.069. Vig-PA–leisure (loading = 1.00 ± 0.03), All-PA–leisure + SSOE (loading = 0.89 ± 0.03) and liking of PA (loading = 0.79 ± 0.04) loaded most strongly on factor 1 (leisure/liking PA), while Vig-PA–home (loading = 1.02 ± 0.07) and Vig-PA–work (loading = 0.82 ± 0.08) loaded most strongly on factor 2 (occupational/domestic PA; Fig. 3b). The factor leisure/liking PA and factor occupational/domestic PA were correlated at 0.50 (0.04).

Survival analyses

Survival analyses showed a significant reduction of risk of all-cause death for Vig-PA–leisure (β = −0.1173 ± 0.0037; P ≤ 2 × 10−16) and for Vig-PA–work (β = −0.0061 ± 0.0029; P = 0.036). There was no statistically significant reduction of risk of death for Vig-PA–home, though the direction of effect and effect size were similar (β = −0.0052 ± 0.0030; P = 0.082).

Discussion

PA has long been appreciated to ameliorate the risk of illness. Increasing PA on a population level could greatly reduce morbidity and mortality—this is a major public health undertaking and very difficult to accomplish. Understanding genetic factors that influence the level of activity for individuals could provide pathways for interventions to support efforts to increase PA. We present here comprehensive information on the genetics of PA and investigate the relationships between PA and health. This work provides tools to dissect causal relationships between a range of other traits and PA, and to uncover previously unknown biology responsible in part for interindividual differences in PA. Those relationships were remarkably different depending on the context of the PA—in particular, whether the activity was associated with PA during leisure time (protective) rather than at work or around the house (often not protective). Our main focus was accordingly on leisure-PA. All-PA–leisure considered different levels of PA intensity and frequency. We completed a GWAS of All-PA–leisure for EUR, AFR and AMR ancestry in MVP. We combined these data with the UKB SSOE trait in two meta-analyses, one with EUR individuals and the other cross-ancestry.

The polygenic architecture of leisure-time PA showed that genetic liability to PA was protective with respect to numerous diseases. It was previously reported that PA has negative genetic correlation with type 2 diabetes13,15,22, triglyceride level13, rheumatoid arthritis15, coronary artery disease15 and lung cancer15. There is prior genetic evidence of the role of PA in preventing colorectal cancer using MR methods23, but we did not find a significant genetic correlation with colorectal cancer. Our analyses confirmed the negative genetic correlations with type 2 diabetes, coronary artery disease and lung cancer. Moreover, we found significant negative genetic correlations between PA and other medically relevant traits—thoracic and abdominal aortic aneurysm, heart failure, stroke, asthma, type 1 diabetes, Alzheimer’s disease, inflammatory bowel disease, osteoarthritis and gastroesophageal reflux disease. PA was also protective with respect to COVID-19 and COVID-19-associated hospitalization, and positively correlated with parental survival.

Whereas genetic correlations are an average of the shared association across the genome, local genetic correlations can detect regions of shared genetic associations, including finding signals in opposing directions that a genome-wide analysis would not capture. Local genetic correlation, like global correlation, supported a protective role of PA for multiple health traits, showing negative local genetic correlations for diabetes types 1 and 2, gastroesophageal reflux disease and osteoarthritis. We also found negative local genetic correlations between All-PA–leisure + SSOE and BMI, triglycerides, phenotypic age acceleration, leisure screen time, neuroticism and cigarettes per day. Positive local genetic correlations were found between All-PA–leisure + SSOE and bone mineral density, income, liking of PA, educational attainment, and, unexpectedly, oral cancer—there was a single locus showing positive local genetic correlation between PA and oral cancer; however, genome-wide genetic correlation between these traits calculated with LDSC was not significant (P = 0.418). We also found local genetic correlations in opposite directions—at different genomic locations—with All-PA–leisure + SSOE for height and well being.

Our MR results confirmed a protective effect of All-PA–leisure genetic liability against type 2 diabetes, abdominal aortic aneurysm, heart failure, COVID-19-associated hospitalization, triglyceride level, osteoarthritis and gastroesophageal reflux disease. Most of these causal effects were observed even when we tested for BMI as a confounder in the MR (MVMR; except for heart failure, which could not be tested due to sample overlap between the BMI and heart failure datasets). However, the low number of instrumental variables in our MVMR analyses could result in false negatives. We tested BMI as confounder since it is genetically correlated to numerous diseases and a previous study showed that it is likely a mediator of leisure time PA, as its causal effect on leisure time PA disappears when taking education into account24. These findings nevertheless confirm the utility of All-PA–leisure to decrease risk for numerous diseases25,26, independent of the role of BMI as either a mediator or a confounder. MR results also suggested a protective effect of All-PA–leisure on the biological age measure phenotypic age acceleration, and a bidirectional effect between All-PA–leisure and parental survival, a protective effect of genetic liability for this PA phenotype versus aging and mortality.

PheWAS analysis in BioVU27, a broad exploration of disease traits in an independent sample, showed a negative correlation of PA with several previously mentioned health traits, including diabetes, cardiovascular disease (for example, coronary artery disease and heart failure), gastroesophageal reflux disease, lung cancer and asthma. The strongest association was related to diabetes (P = 5.89 × 1036).

The EUR meta-analysis of All-PA–leisure + SSOE identified 67 lead SNPs, 59 of which were novel compared to the lead SNPs of the two GWASs, All-PA–leisure and SSOE, in the meta-analysis. Previous GWAS of self-report and device-measured PA identified 68 lead variants in total13,14,15. We replicated associations for 17 previously identified lead variants. Two (CADM2*rs62253088 and LOC642366*rs4865656) were still lead variants in both our meta-analyses of All-PA–leisure + SSOE (EUR ancestry and cross-ancestry); the other 15 were significantly associated (Supplementary Table 40). CADM2*rs62253088 (encoding Cell Adhesion Molecule 2), on chromosome 3, had the strongest association in both EUR and cross-ancestry All-PA–leisure + SSOE meta-analyses (EUR meta-analysis P = 2.0 × 1021; cross-ancestry (EUR, AFR and AMR) meta-analysis P = 6.1 × 1020), but was not significant in the GWAS of Vig-PA–leisure. However, considering Vig-PA–leisure, another variant mapping to CADM2 was a significant lead SNP in the GWAS (rs2326266; P = 4.1 × 108) and CADM2 also had the most significant P value in the gene-based test (2.5 × 108). CADM2 variants, including rs62253088, have previously been associated with PA, and also with decreased neuroticism and decreased self-reported nervous and anxious feelings13. CADM2 was also previously found to be associated with obesity and metabolic traits28, cannabis use29, smoking and alcohol related traits30, risk-taking behavior31, educational attainment32 and autism spectrum disorder33. CADM2 encodes a member of the synaptic CADM (SynCAM) family, which belongs to the immunoglobulin superfamily34. Through a previous summary-based MR35 analysis of eQTL signals, CADM2 was suggested to influence PA through effects on skeletal muscle15.

Significant variants also included MST1R*rs3733134 (EUR meta-analysis P = 1.4 × 1017; cross-ancestry meta-analysis P = 6.9 × 1018), EXD2*rs4899292 (EUR meta-analysis P = 1.7 × 10−16; cross-ancestry meta-analysis P = 3.8 × 1016), LONRF2 and SLC39A8. SLC39A8 was previously associated with several psychiatric disorders, including problematic alcohol use36, schizophrenia37 and opioid use disorder38 (Supplementary Discussion).

Additional analyses demonstrated that PA is related to brain structure and cellular architecture. From MAGMA tissue expression analyses, the cerebellum and cerebellar hemisphere showed significant enrichments. Cerebellar function is critical for skills relevant to PA, such as coordination and movement. These results were similar to the results of the previous GWAS of SSOE13. Cell-type-specificity analyses showed a significant association of PA with GABA+ neurons39. EUR All-PA–leisure + SSOE and cross-ancestry meta-analyses shared the following three significant gene sets in their MAGMA gene-set analysis results: structural constituent of presynapse, presynaptic active zone organization, and maintenance of presynaptic active zone structure. This finding highlights the role of presynaptic cells in PA and the importance of the cerebellum and GABAergic function in the biology of PA levels.

Vig-PA–leisure was, as noted, genetically different from Vig-PA–home and Vig-PA–work. We observed moderate genetic correlation between Vig-PA–leisure and Vig-PA–home (rg = 0.64 ± 0.04, P = 5.0 × 1059), but lower genetic correlation between Vig-PA–leisure and Vig-PA–work (rg = 0.46 ± 0.07, P = 9.8 × 1011). While there was high genetic correlation between Vig-PA–home and Vig-PA–work (rg = 0.97 ± 0.05, P = 1.2 × 1073), there were differences between these two traits in genetic correlation analyses with other traits of interest, in particular for income, educational attainment and leisure screen time. Previous work on dementia also showed genetic differences between PA during leisure versus work time40.

The genomic-SEM results also supported that context-defined PA traits are genetically heterogeneous. Traits more similar to voluntary PA and healthy lifestyle habits loaded on the leisure/liking PA Factor, while the occupational/domestic PA Factor captured traits more consistent with strenuous work and labor performed as part of work duties and responsibilities at home.

While the leisure/liking PA and occupational/domestic PA factors were moderately correlated (0.50), the nature of the respective contexts of PA identified key differences. Perhaps the most notable distinction is that strenuous occupational and domestic PA is often not health-enhancing, and can be detrimental to health via injury and long-term stress on bodily systems41,42. Individuals whose vocations are labor-intensive are also less likely to engage in health-promoting PA in leisure43, further compounding their risk for adverse health outcomes long-term.

When we adjusted for vigorous PA traits and income using mtCOJO, we still observed differences in genetic correlations with health traits, suggesting that these differences do not merely reflect differences in socioeconomic status. These findings together suggest that not all contexts of PA are equally beneficial, and some forms of vigorous activity may be more strongly linked than others to improved mental and physical health.

A further confirmation of differing benefits by PA context was provided by the survival analyses, which show a strong significant reduction of risk of death for Vig-PA–leisure (β = −0.1173 ± 0.0037, P ≤ 2 × 1016) compared to a weak effect for Vig-PA–work (β = −0.0061 ± 0.0029, P = 0.036), and none for for Vig-PA–home (β = −0.0052 ± 0.0030, P = 0.082). The small protective effect for Vig-PA–work contrasts with the opposing directions for risk for several other important health traits. The survival analyses may be confounded by health status (healthier individuals are able to exercise more vigorously) and competing risks (death). A previous study on different kinds of PA during leisure time—for example, weightlifting and aerobic exercise—showed, consistent with the present report, that PA in the leisure context can lead to a longer and healthier life44. We conclude that PA at leisure is the most protective with respect to deleterious medical outcomes.

This study has limitations, including a relative lack of females in the MVP dataset (approximately 9%, which nevertheless equates to >17,000 females for Vig-PA–leisure); this makes the interpretation of sex-specific associations infeasible. UKB, though, has excellent female representation. MVP is also exclusively based on military veterans, not fully representative of the general population. However, the high genetic correlation (rg = 0.76 ± 0.03, P = 4.7 × 10118) between MVP data (All-PA–leisure) and SSOE in UKB, a more representative cohort, may also be considered in this context. Moreover, while there are many non-EUR participants included, their numbers are still insufficient, so the cross-ancestry analysis mostly reflects findings in EUR. It would also be important to verify the tissue expression analyses, which highlight only significant associations with the brain, with more tissues and larger samples. Our results may reflect a lack of power for other tissues or our sample size—of the 53 tissue types contained in Genotype-Tissue Expression version 8 (GTEx v8), 13 are brain-specific.

In conclusion, we have made progress towards revealing the genetic and biological nature of PA, which can differ greatly depending on the context. In effect, the PA that one chooses to engage in in a leisure context has many positive effects with respect to health traits, while activity that one must do, in a work or household context, tends not to have these beneficial effects. This difference persisted even after we adjusted for income. Moreover, PA in the leisure context shows a much stronger decrease in the risk of death compared to the other contexts. This may reflect the differences in intensity and duration of leisure activities compared to work and home activities. We detected a protective role of PA genetic liability on several diseases, using global and local genetic correlation, PheWAS analyses and causal inference using MR. MR analyses showed significant protective effects of PA on several traits, including type 2 diabetes, abdominal aortic aneurysm, heart failure, triglyceride level, osteoarthritis, gastroesophageal reflux disease and COVID-19-associated hospitalization. These results provide genetic insights into the biological bases of PA and add to strong experimental, epidemiological and observational literature showing the value to health of PA.

Methods

MVP datasets

The U.S. Department of Veterans Affairs (VA) MVP is one of the largest and most diverse biobanks in the world, with genetic and electronic health record (EHR) data available45,46. Ethical approval for the MVP study was obtained from the Central VA Institutional Review Board and the site-specific institutional review boards. All relevant ethical regulations for work with human participants were followed in the conduct of the study, and informed consent was obtained from all participants.

Participants provided a blood sample for genomic analyses, granted access to medical records, and a subset agreed to complete two questionnaires, the MVP Baseline and Lifestyle Surveys. PA information used in this study was collected from the MVP Lifestyle Survey, and data were subdivided according to context (leisure, work and home), intensity (vigorous, moderate and light) and frequency (Supplementary Fig. 1). The three levels of activity were defined as follows:

  1. 1.

    Vigorous—activities that cause your heart to beat rapidly and you work up a good sweat, and are breathing heavily; performed at least 10 min at a time.

  2. 2.

    Moderate—activities that cause your heart rate to increase slightly and you typically work up a sweat, but are not physically exhausting; performed for at least 10 min at a time.

  3. 3.

    Light—activities that require little physical effort.

The levels of activity were considered together with frequency information, which could be ‘daily’, ‘several times per week’, ‘once per week’, ‘several times per month’, ‘once per month or less’ or ‘never’, when participants answered three questions regarding the place where PA was effected (Supplementary Methods). The three different levels of PA were summed considering All-PA phenotype frequency and weighting intensity to create a total score, reported in the Supplementary Methods.

From previous MVP analyses, after MVP genotyping performed using a customized Affymetrix Axiom Biobank Array and quality control47, EAGLE2 (ref. 48) was used for phasing chromosomes and Minimac3 was used for imputation49, using the 1000 Genomes Project reference panel, Phase 3, version 5 (ref. 50). Populations were defined using principal component analysis45.

For All-PA during leisure time (All-PA–leisure), we ran independent GWAS for each of the three ancestries (EUR, AFR and AMR; Supplementary Fig. 1). For EUR, we also ran GWAS for vigorous PA during leisure time (Vig-PA–leisure), work time (Vig-PA–work) and home time (Vig-PA–home). We did not have enough power to run GWAS for vigorous PA for AFR and AMR ancestries.

In the quality control procedure for GWAS conducted using PLINK 2.0 (ref. 51), we removed variants with imputation quality scores < 0.6, Hardy–Weinberg equilibrium P < 5 × 10−5, minor allele frequency <0.01, missing call rates for variants >0.1 and missing call rates for samples >0.1. Data were aligned to the GRCh37 reference genome. Considering EUR ancestry, after filtering, we had 6,660,206 variants for All-PA–leisure phenotype, and 6,548,052, 6,536,056, 6,537,158 variants for Vig-PA–leisure, Vig-PA–home and Vig-PA–work, respectively. For AFR and AMR ancestries, we kept 11,984,943 and 7,969,366 variants, respectively (All-PA–leisure phenotype).

To remove related individuals, we used a threshold of 0.0884 for the kinship coefficients calculated by KING52, resulting in the removal of individuals with a minimum of a second-degree relationship. Then, we implemented an algorithm to optimize retaining the more informative individuals, keeping the maximum number of individuals with the highest score (for All-PA–leisure phenotype) or with the highest frequency (for the vigorous activity phenotypes). If two individuals had the same score (or frequency), we removed the one with the highest number of relationships. For EUR, the final sample sizes were as follows: 189,812 individuals for the All-PA–leisure phenotype, 201,050 for Vig-PA–leisure, 203,430 for Vig-PA–home and 171,278 for Vig-PA–work. For AFR and AMR ancestries, we kept 27,044 and 10,263 individuals, respectively (All-PA–leisure phenotype). We ran GWAS analysis using a linear regression model implemented in PLINK 2.0, using sex, age and the first ten principal components as covariates.

For All-PA–leisure&home phenotype and for All-PA–leisure&home&work, we ran the same QC as described in the Methods, using EUR ancestry. After filtering, we had 6,678,873 variants for All-PA–leisure&home, and 6,761,070 variants for All-PA–leisure&home&work. For the final sample size, where we excluded individuals without full information on their PA, we retained 181,317 individuals for All-PA–leisure&home, and 146,929 for All-PA–leisure&home&work.

UKB summary statistics

UKB is a cohort study of ~500,000 adults aged 40–69 years in the United Kingdom (UK) and was recruited from 22 centers across the country53. All UKB participants provided written informed consent53. Ethical approval of the UKB study was given by the North West Multicentre Research Ethics Committee, the National Information Governance Board for Health and Social Care and the Community Health Index Advisory Group53.

From the UKB public databases, we downloaded GWAS summary statistics for the SSOE phenotype with the accession code GCST006100, for MVPA with the accession code GCST006097 and for VPA with the accession code GCST006098 (ref. 13). Self-reported information was reported via a touchscreen questionnaire. For SSOE, participants needed to answer the following question: ‘in the last 4 weeks, did you spend any time doing the following?’. Then, there were follow-up questions assessing the frequency and typical duration of ‘strenuous sports’ and of ‘other exercises’. Cases were defined as individuals spending 2–3 days per week or more doing strenuous sports or other exercises for a duration of 15–30 min or greater, and controls as those individuals who did not indicate spending any time in the last 4 weeks doing either SSOE, obtaining 124,842 cases (ncase) and 225,650 controls (ncontrol)13. We defined the population effective size as neff = 4/((1/ncase) + (1/ncontrol)).

The downloaded summary statistics had SNPs filtered according to—Hardy–Weinberg equilibrium P < 10−6, high missingness >1.5%, low minor allele frequency <0.1%13. We also filtered low imputation quality <0.6, yielding approximately 11.7 million available SNPs.

Meta-analyses

METAL54 was used for EUR ancestry meta-analysis, which included EUR MVP data for the All-PA–leisure phenotype and UKB summary statistics for the SSOE phenotype13, in an All-PA–leisure + SSOE meta-analysis. We also ran a cross-ancestry All-PA–leisure + SSOE meta-analysis, adding AFR and AMR summary statistics from MVP data to the EUR All-PA–leisure + SSOE meta-analysis (Supplementary Fig. 1). We conducted the meta-analyses using the sample-size method, which considers P value and direction of effect, weighted according to sample size. We used this method to combine summary statistics from a logistic regression model (UKB summary statistics for SSOE phenotype) and a linear regression model (MVP summary statistics for All-PA–leisure phenotype).

XWAS

Sex-stratified analysis of the X chromosome was conducted in the EUR MVP sample using XWAS 3.0 on hard-call genotypes55. We included 174,375 males and 15,130 females for the All-PA–leisure phenotype and 229,331 males and 17,072 females for Vig-PA–leisure. Following sex-stratified analysis, test statistics from males and females were combined using Stouffer’s method56. Variants with Hardy–Weinberg equilibrium P < 5 × 106 were filtered; 191,114 variants were included for the All-PA–leisure phenotype and 191,148 for Vig-PA–leisure. We used age and the first ten principal components as covariates.

Gene-based and gene-set analyses, functional enrichment

Using the tool MAGMA57, implemented in the FUMA web-based platform58, we annotated variants and ran gene-based tests, gene-set analyses and functional enrichment for each of the two meta-analyses (EUR ancestry and cross-ancestry) for the All-PA–leisure phenotype and for the GWAS of Vig-PA–leisure. For annotation, we used 1000 Genome Phase 3 EUR for EUR ancestry analyses and 1000 Genome Phase 3 ALL for the cross-ancestry All-PA–leisure+SSOE meta-analysis. The other parameters for the identification of lead SNPs were a maximum cutoff P < 0.05, r2 threshold to define independent significant SNPs >0.6, a second r2 threshold to define lead SNPs >0.1 and a maximum distance between linkage disequilibrium (LD) blocks to merge into a locus <250 kb.

The MAGMA gene-based test is based on a multiple regression approach to detect gene effects, considering SNP P values and linkage disequilibrium. We also ran a gene-based approach on our EUR All-PA–leisure + SSOE meta-analysis using fastENLOC17,18 (fast enrichment estimation aided colocalization analysis). This method was used to prioritize likely causal gene–trait associations with precomputed eQTL annotations from the 49 tissues in the GTEx v8 dataset. We kept as ‘noteworthy genes’ those with GLCP ≥ 0.9; if a gene was colocalizing among two or more tissues, we retained the tissue with the highest GLCP value.

Gene-set analyses are performed for curated gene sets and GO terms obtained from MsigDB, with terms having a Bonferroni-corrected P < 0.05 considered significant57. Gene property analyses were performed for tissue-specific gene expression using GTEx v8 dataset59.

Single-cell expression

FUMA also includes a tool to test cell-type-specificity analysis60, which was run using the following datasets of human samples: Allen Brain Atlas Cell Type61, DroNc62, GSE104276 (ref. 63), GSE67835 without fetal samples64, GSE81547 (ref. 65), GSE84133 (ref. 66), GSE89232 (ref. 67), GSE101601 (Linnarsson’s lab)68, GSE76381 (Linnarsson’s lab)69 and PsychENCODE70.

SNP-h 2 and genetic correlation

LDSC16 was used to calculate SNP-h2 for EUR ancestry and to calculate genetic correlation71 between the EUR meta-analysis of the All-PA–leisure phenotype and other traits, to quantify genetic similarity (Supplementary Methods ‘Traits studied for genetic correlation with PA’). To determine which traits were significant, we used the Benjamini–Hochberg procedure (Supplementary Table 15; ‘Benjamini–Hochberg false discovery procedure’). To estimate SNP-h2 for AFR and AMR cohorts, we calculated LD scores with cov-LDSC72 from 10,000 random independent individuals from MVP filtering the SNPs to keep only those that were previously identified in the HapMap Project73. We conducted multitrait conditional and joint analysis to evaluate the potential confounding of socioeconomic status with mtCOJO (Supplementary Notes).

Benjamini–Hochberg false discovery procedure

To determine which traits were significantly genetically correlated with PA, we used the Benjamini–Hochberg procedure (Supplementary Table 19)74. We ranked the traits according to their increasing P value, then we calculated the P value’s Benjamini–Hochberg critical value by multiplying each rank by 0.05 and dividing it by the total number of analyzed traits. Finally, the trait P values were compared to the critical values, considering a trait significant until the largest P value was smaller than its corresponding critical value.

Local genetic correlation

We ran LAVA19 to calculate local genetic correlations between the EUR All-PA–leisure + SSOE meta-analysis and the 41 traits (Supplementary Table 41) also studied for genome-wide (global) genetic correlation, to identify shared genetic bases in specific genomic regions. LAVA allows estimation of local genetic heritability and local genetic correlations. We considered 2,495 genomic loci, defined by partitioning the genome into blocks of around 1 Mb and minimizing the LD between the blocks19. Based on the number of genomic loci, we defined significant local genetic heritability as having a Bonferroni-corrected P < 0.05/2,495. We evaluated local genetic correlation at loci with pairs of traits that have already been shown to have significant local genetic heritability for both phenotypes. Thus, we conducted 3,088 local genetic correlation tests, defining significance as those with a Bonferroni-corrected P < 0.05/3,088.

MR

MR analyses enable the inference of causality between traits with genetic similarity. Two-sample MR was performed using the following four methods: MR-Egger, weighted median, IVW and simple mode75. For the PA trait, we used EUR ancestry MVP data of the All-PA–leisure phenotype to account for problems of population stratification and of overlapping samples. We filtered SNPs with P > 10−5 and clumping data using the default window of 10,000 kb and the r2 cutoff of 0.001. To designate significant results, we applied multiple testing correction for 36 outcomes used both as outcomes and as exposures (P = 6.9 × 10−4). We also used TwoSampleMR to run MVMR analyses of the All-PA–leisure phenotype as exposure, to understand the influence that BMI could have as a confounder, setting it as exposure (Supplementary Notes).

TWAS

TWAS was used to identify genes associated to traits. We ran TWAS using FUSION76 with the 1000 Genomes LD reference data (https://data.broadinstitute.org/alkesgroup/FUSION/LDREF.tar.bz2) and the GTEx v8 multitissue expression weights from 49 tissues (https://www.mancusolab.com). These tissues included adipose, adrenal gland, artery, brain, breast, skin, blood, colon, esophagus, heart, kidney, liver, lung, minor salivary gland, muscle, nerve, ovary, pancreas, pituitary, prostate, small intestine, spleen, stomach, testis, thyroid, uterus and vagina. We performed expression imputation for each autosome using these tissue weights; thus, we identified genes that were conditionally independent. Based on these 49 tissue weights and a total of 27,977 genes, we used a multiple testing correction of 3.65 × 108. If a gene was significant and expressed in at least two tissues, we chose the gene expression with the lowest P value.

Fine-mapping

Fine-mapping of causal gene sets was used to fine-map TWAS at genomic risk regions, identifying likely causal genes77. We used the All-PA–leisure + SSOE EUR meta-analysis and the Vig-PA–leisure as input GWAS, and the GTEx v7 weights from PrediXcan78 combined with Metabolic Syndrome in Men Study79, Netherlands Twins Registry80, Young Finns Study81 and CommonMind Consortium82 weights. LD scores were obtained from the 1000 Genomes Phase 3.

We also conducted fine-mapping analyses at the SNP level using Polygenic functionally informed fine-mapping83, using precomputed prior causal probabilities based on a meta-analysis of 15 UKB traits, allowing for the extraction of per-SNP heritabilities (SNPVAR). These were necessary together with the summary statistics and EUR genotypes from 1000 Genomes Project Phase 3 to perform functionally informed fine-mapping with the sum of single effects84,85.

Multitrait analysis of GWAS (MTAG)

This method was employed to facilitate joint analysis of summary statistics from GWAS of different but related traits, such as the MTAG86, to increase power in GWAS analyses. Here we ran two MTAG analyses, both with the EUR All-PA–leisure + SSOE meta-analysis as a single trait. First, we joined this trait with leisure screen time15. Because of their negative genetic correlation, we flipped the allele in the LSC statistics. Second, we joined the EUR All-PA–leisure + SSOE meta-analysis with a measure of PA-liking87.

Enrichment analysis

g:Profiler is a toolset that also includes finding biological categories enriched in a gene list88. We performed it on three gene lists resulting from the FUMA gene-based test—EUR All-PA–leisure+SSOE meta-analysis, cross-ancestry All-PA–leisure+SSOE meta-analysis and Vig-PA–leisure.

Mitochondrial genome association analyses

We used mitochondria genomes of EUR ancestry individuals from MVP to run mitochondria–SNV association analyses and gene-based association analyses with All-PA–leisure and Vig-PA–leisure phenotypes. For the mitochondria–SNV association analyses, we filtered missing call rates for variants >0.1, missing call rates for samples >0.1, and we excluded monomorphic variants. We thus analyzed 141 SNVs from 189,782 individuals for All-PA–leisure and 142 SNVs from 201,018 individuals for Vig-PA–leisure. For the gene-based association analyses, we initially assigned the SNVs to mitochondrial genes using the reference sequence NC_012920.1 (ref. 89); each SNV mapped to a single gene, and then we used the R software SKAT v2.2.4 to perform associations with All-PA–leisure and Vig-PA–leisure phenotypes, using age, sex and the first ten principal components as covariates90. We considered the nominally significant P value (≤0.05).

PheWAS

We used Vanderbilt University Medical Center’s (VUMC) EHR database to conduct a PheWAS as an empirical exploration of PA and its related phenotypes, investigating a broader range of traits compared to those used in the genetic correlation analyses. The VUMC EHR database, also known as the Synthetic Derivative, houses de-identified, longitudinal medical histories of over 3.1 million patients, including their demographic information (age, sex and ethnicity), and clinical information like medications, lab values, procedural and surgical codes and diagnostic codes27. Codes from both the International Classification of Diseases, ninth and tenth editions (ICD-9 and ICD-10), are used by healthcare professionals to code symptoms, procedures and diagnoses in the synthetic derivative. A subset of the patients from the Synthetic Derivative also has their biological data associated with their medical records. This subset is referred to as BioVU and was used for genomic and phenomic analyses91,92,93.

For our analysis, we performed a PheWAS across BioVU records using a polygenic score for PA derived from the EUR All-PA–leisure + SSOE meta-analysis. First, we used PRS-CS94 to calculate a polygenic score for PA for 66,917 BioVU records of EUR participants with genotyped data. Next, using the PA polygenic score as the continuous predictor, we ran a PheWAS95 to explore the phenomic landscape of PA. The first step in PheWAS involves mapping related ICD codes to their phecode using the R PheWAS package96. The complete list of phecodes, along with the corresponding ICD-9 and ICD-10 codes mapped to them, can be found on the PheWAS catalog (https://phewascatalog.org/). Finally, using the EHR-driven case-control status for each phecode, we estimated the association between the PA polygenic score and the phecodes. We restricted our analysis to 1,254 phecodes that meet the following three criteria: (1) phecodes need over 100 cases or control, (2) phecodes should not be specific to a single sex and (3) phecodes should not be included in the broader mental health phenotypes. We also controlled for the first ten ancestry principal components, median age of the record and reported sex.

Genomic structural equation modeling

Genomic structural equation modeling (genomic-SEM) was used to evaluate the overall genetic architecture of the PA traits included in the EUR GWAS analyses21. Summary statistics for the All-PA–leisure + SSOE meta-analysis, and for GWAS of Vig-PA–leisure, Vig-PA–home, Vig-PA–work and liking of PA were used to perform EFA and CFA. EFA model fit was evaluated by the amount of cumulative variance explained, the strength of SS loadings (≥1), and balance in the proportion of variance explained by each of the individual factors. Traits with factor loadings ≥0.20 in the EFA were allowed to load on the respective factors and evaluated for CFA model fit as determined by conventional fit indices21.

Survival analyses

We ran the Cox proportional-hazards model using the package survival 3.3-1 in R, to establish the impact of vigorous PA for the three contexts (leisure, home and work) on the risk of dying of any cause. Information was collected in the MVP dataset for age, censoring status, and vigorous PA. Age was calculated for censored individuals from the current year minus the year of birth; for dead participants, the year of death was provided. The vigorous PA phenotypes present in MVP were previously described in the Methods ‘MVP datasets’ as follows: we considered Vig-PA–leisure, Vig-PA–home and Vig-PA–work classification. We had complete information for 252,718 individuals (of whom 55,849 were deceased) for Vig-PA–leisure, 253,096 individuals (of whom 55,203 were deceased) for Vig-PA–home and 207,862 individuals (of whom 43,294 were deceased) for Vig-PA–work.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.