Use of Mendelian Randomization to Examine Causal Inference in Osteoporosis
- 1MRC Integrative Epidemiology Unit (IEU), Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, United Kingdom
- 2Musculoskeletal Research Unit, Translational Health Sciences, Bristol Medical School, University of Bristol, Bristol, United Kingdom
- 3The University of Queensland Diamantina Institute, The University of Queensland, Woolloongabba, QLD, Australia
Epidemiological studies have identified many risk factors for osteoporosis, however it is unclear whether these observational associations reflect true causal effects, or the effects of latent confounding or reverse causality. Mendelian randomization (MR) enables causal relationships to be evaluated, by examining the relationship between genetic susceptibility to the risk factor in question, and the disease outcome of interest. This has been facilitated by the development of two-sample MR analysis, where the exposure and outcome are measured in different studies, and by exploiting summary result statistics from large well-powered genome-wide association studies that are available for thousands of traits. Though MR has several inherent limitations, the field is rapidly evolving and at least 14 methodological extensions have been developed to overcome these. The present paper aims to discuss some of the limitations in the MR analytical framework, and how this method has been applied to the osteoporosis field, helping to reinforce conclusions about causality, and discovering potential new regulatory pathways, exemplified by our recent MR study of sclerostin.
Osteoporosis is a common disorder leading to skeletal fragility and increased fracture risk. This condition is strongly influenced by age and sex, as well as genetic factors. Establishing which risk factors play a causal role in osteoporosis is helpful in unraveling pathogenic mechanisms, and in identifying potential new preventative and treatment strategies. Epidemiology studies in the osteoporosis field have examined relationships between putative risk factors and fracture risk, the main clinical consequence of osteoporosis. Investigations have also studied risk factors for bone mineral density (BMD) as measured by DXA, which is a strong predictor of fracture risk (1). Traditional observational studies have reported that a range of potentially modifiable risk factors, including sex-steroid deficiency, low body mass index (BMI), physical inactivity, smoking, heavy alcohol consumption, and low calcium and vitamin D, are related to BMD and fractures. However, studies of this type suffer from confounding and reverse causality (2, 3). Randomized controlled trials (RCTs) are the gold standard for inferring causality, because they are unaffected by these issues if performed correctly. However, RCTs are expensive, resource-intensive, time consuming, and may have important ethical limitations.
MR is a statistical method for inferring causality which is analogous to an RCT, except that genotypes are used to randomize participants into different levels of the exposure/treatment. MR can be implemented as a form of instrumental variables analysis, where genetic variants, normally single nucleotide polymorphisms (SNPs), are used as proxies (“instruments”) for the exposure of interest (see Figure 1) (4, 5). According to Mendel's Laws of Inheritance, alleles segregate randomly when passed from parents to offspring. According to his (second) Law of Independent Assortment, which forms the foundation of MR, the inheritance of one pair of factors (genes) is independent of the inheritance of the other pair. Thus, offspring genotypes are unlikely to be associated with confounders in the population. In addition, since germ-line genotypes are determined at conception, they precede outcomes being investigated, and so observed associations cannot be explained by reverse causation. However, unlike RCTs which generally involve relatively short term interventions, genetic influences exert their effects from conception onwards, and so causal effects estimated from MR represent life-long exposures.
Figure 1. One-sample and two-sample Mendelian randomization study designs. (A) One-sample Mendelian randomization is based on a population where both exposure and outcome have been measured. (B) In two-sample Mendelian randomization, exposures and outcomes are measured in non-overlapping populations. SNP-exposure is derived in Sample 1, and SNP-outcome in Sample 2.
MR was initially developed in the form of one-sample MR, which relies on access to individual level data (Figure 1A). One limitation of this method is that most individual cohorts do not have many traits measured simultaneously. Two sample MR was subsequently developed to overcome this issue by using summary level data derived from independent cohorts that collectively have many exposures and outcomes measured (Figure 1B). GWAS data is now available on thousands of plausible osteoporosis risk factors which can be leveraged by two sample MR. The extensive opportunities to explore causal influences on bone phenotypes using the MR approach is summarized in two recent reviews (2, 5).
Though previous MR studies have contributed to our understanding of causal factors involved in the etiology of osteoporosis, as discussed below, MR has a number of inherent assumptions and limitations, for which a range of sensitivity analyses have been developed (6–8). MR analyses may also be subject to several sources of bias (9). For example, if individuals with a certain disease outcome are drawn from a population with distinct ancestry to disease-free controls, this may lead to differences in frequency of genetic variants between those with and without disease, and hence spurious associations with genotypes related to putative risk factors. Furthermore, dynastic effects need to be considered for many traits, including BMD, whereby effects of genetic variation in the offspring are partly mediated by shared parental genetic influences acting via early life environment (10). The present paper aims to discuss how MR has been applied to the osteoporosis field, including the approaches taken to address limitations in the MR analytical framework.
Consistent with epidemiology studies of osteoporosis in general, the majority of MR studies in osteoporosis have utilized DXA-measured BMD as the outcome, which is widely used clinically as the gold standard for diagnosing osteoporosis. As well as being predictive of the clinical consequence of osteoporosis, namely fractures (1), BMD has a major heritable component, making it a highly suitable outcome for MR analyses (11, 12). Initial studies utilized summary statistics from GWASs, such as those based on the GEnetic Factors for OSteoporosis Consortium (GEFOS, http://www.gefos.org/), which made GWAS findings for DXA-measured BMD publicly available for a range of sites including the lumbar spine (LS) and femoral neck (FN) (13). Whereas, GEFOS has the advantage of providing BMD GWAS data for multiple skeletal sites, a potential disadvantage is that the latter GWAS adjusted for weight. The justification for this is that areal BMD measured by DXA is influenced by body size, which is partly accounted for by adjusting for weight. However, this can have unintended consequences, such as the introduction of spurious genetic associations as a consequence of collider bias (14). In addition, by using BMD summary statistics corrected for weight, subsequent MR analysis may be biased as effects on BMD mediated by weight may not be accurately estimated. Moreover, as described below, use of GWAS outputs that were adjusted for weight and/or height may complicate interpretation when applying these data in MR studies examining relationships between BMI and BMD. The latter relationship is thought to be relatively complex, involving both a mass effect leading to greater loading acting via the mechanosensor, and shared endocrine pathways.
Recently published GWASs for estimated BMD (eBMD) derived from heel ultrasound in UK Biobank (11, 12) have the advantage that both unadjusted and adjusted summary statistics are available on request, enabling sensitivity analyses to be performed. Moreover, the large sample size provides a significant advance in terms of power, which is one of the major limitations of MR studies. Ultrasound derived BMD does not involve radiation, is quick and cheap, and is therefore well-suited to population studies involving hundreds of thousands of people. The limitation is that estimated BMD is not well-understood and we are not entirely sure how well it proxies BMD. That said, eBMD and DXA-BMD measures are reasonably highly correlated genetically (r = ~0.5), as are eBMD and fractures (r = ~0.5) (11), and ultrasound-derived measures have previously been reported to predict subsequent fractures with similar accuracy to DXA BMD (15–18).
BMD is an intermediary phenotype; low BMD is only of pathological significance as a result of its causal relationship with fracture. However, since many risk factors for osteoporosis act via BMD, their relationship with BMD is somewhat stronger than that with fracture, with the result that MR studies using fracture as the outcome tend to be underpowered. That said, in a large MR study of fractures based on discovery set of 37,857 fracture cases and 227,116 controls, with replication in 147,200 fracture cases and 150,085 controls, Trajanoska et al. found that higher BMD had an expected causal effect in reducing fractures (19). Moreover, Morris et al. identified 13 bone fracture loci in approximately 1.2 million individuals, all of which were associated with eBMD (12). It may also be possible to extend MR studies in osteoporosis to examine causal effects on other phenotypes relevant to osteoporosis. BMD is not the sole causal determinant of fracture, and GWAS signals have recently been identified for several geometric parameters derived from hip DXA, which are also thought to be related to fracture risk (20). GWAS efforts are also underway for osteocalcin and CTX, offering opportunities for MR studies to examine causal pathways for other outcomes contributing to fracture risk, such as bone turnover.
Osteoporosis Risk Factors
A range of risk factors for osteoporosis identified in epidemiological studies have been examined in MR studies intended to explore their causal effects, using BMD as the outcome, the majority of which have yielded no or weak evidence of causality (see Table 1). For example, studies using BMD as an outcome did not find support for a causal effect of vitamin D (21–23) or genetically determined calcium intake as reflected by lactase persistence genotype (24). Guo et al. found no evidence to suggest a causal effect of alcohol consumption on BMD whereas smoking status was found to be causally related to lower BMD (25); however, it should be noted that smoking (exposure) and eBMD (outcome) instruments were derived from the same sample population which could result in biased estimates (8). In a subsequent study, genetic predisposition to smoking initiation was associated with fracture risk, but not eBMD; genetic liability to alcohol dependence was also associated with fracture and lower eBMD, whereas no association was seen for genetically predicted alcohol intake (26). However, this study also included UK Biobank participants in exposure and outcome instruments which could lead to bias. Other studies exploring the effect of serum urate (27, 28), inflammatory markers (29) and thyroid stimulating hormone (30) found no evidence for association with FN or LS BMD. Rather than a risk factor, MR analysis suggests that lowering LDL-C levels and statin therapy improve BMD (31).
In terms of constitutive factors, a causal association was observed between later age at menarche and reduced FN and LS BMD in adults (32), and reduced LS BMD in adolescents (33). A study in children found a causal association between BMI/adiposity and BMD (34). A previous study in adults using summary data from GWASs in Europeans found no evidence of a causal effect of BMI (based on 77 SNPs) on FN or LS BMD, however since FN and LS BMD were corrected for weight prior to the GWAS, the variation in BMD attributable to BMI may not be adequately captured by MR analysis (35). In contrast, a one-sample MR in Koreans using 13 BMI-associated SNPs identified in a GWAS of east Asians was suggestive of a causal effect of BMI on BMD on weight bearing sites in men and pre-menopausal women (36). Observation studies have implicated several diseases in the development of osteoporosis, however MR has subsequently found no causal effect of type 2 diabetes (T2D) and coronary heart disease (CHD) on eBMD (37). Another study reported a weak association between increased T2D risk and increased FN BMD, whereas no association was seen with LS BMD (35). In terms of other constitutional factors, genetic predisposition to increased calcium levels was recently found to be unrelated either to eBMD or fracture risk in UK Biobank (39).
Several MR studies have examined osteoporosis risk factors with fracture as the outcome (Table 2). In the study based on UK Biobank from Trajanoska et al. (19), while confirming the expected protective effect of higher BMD on fractures, there was little evidence to suggest a causal effect of dietary factors (vitamin D levels and calcium intake), early menopause, late puberty and range of diseases (including type 1 and 2 diabetes, CHD and inflammatory bowel disease) on risk of fracture. These findings are consistent with results from the above MR studies based on BMD. However, the study did provide some evidence for causal effect of decreased grip strength on fracture risk. A study in 97,811 Danish individuals failed to provide evidence for a relationship between calcium intake and hip fracture (40). Interestingly, an MR study investigating the causal effect of height with 50 diseases reported that one SD increase in genetically determined height was associated with increased risk of hip fracture (41). In terms of the effect of serum hormones, a previous MR study in men reported lower levels of estradiol to be causally related to increased risk of fracture (including all self-reported fractures, major non-vertebral osteoporotic fractures and wrist fractures), whilst there was no evidence for causal association between serum testosterone and fracture risk (42). Furthermore, using a genetic risk score for CRP levels in a Rotterdam study, there was no evidence to support a causal effect of CRP on fracture risk (43).
Some MR studies have set out to test hypothesized causal effects of BMD on other outcomes. For example, in a study which used summary statistics from the first release of the UK Biobank data (N = 11,650), the authors reported some evidence for a causal effect of eBMD on T2D, CHD, HDL-c, and HOMA-IR, testing reciprocal associations for two traits (T2D and CHD), for which there was no evidence of a causal effect on BMD (37). The most recent study in 426,824 UK Biobank participants identified 518 loci associated with eBMD, explaining 20% of its variance (12), meaning that many powerful and robust instruments for MR analyses examining causal effects of eBMD will be available.
1. Vertical pleiotropy, when the genetic variant has an effect on two or more traits that both influence the outcome via the same biological pathway, is usually not problematic for MR analyses
2. In contrast, horizontal pleiotropy, when a genetic variant is associated with two traits which influence the outcome via independent biological pathways, violates one of the key MR assumptions
3. Several methods have been developed that relax the strict requirement that genetic instruments exhibit no horizontal pleiotropy yet still produce consistent causal effect estimates
4. Where genetic instruments are known to be pleiotropically associated with multiple correlated phenotypes, it may be possible to examine independent effects through exclusion of certain SNPs, or use of multivariable MR.
One of the main assumptions of MR is that genetic instruments are only associated with the interest via the exposure being tested. This is known as the “no pleiotropy” assumption or the “exclusion restriction criterion.” When performing an MR study, it is usually unclear whether such an assumption holds. Therefore, various sensitivity analyses are applied to detect the existence of pleiotropy, and to estimate the un-biased causal effect of the exposure on the outcome. Vertical pleiotropy (i.e., a genetic variant has an effect on two or more traits that both influence the outcome via the same biological pathway) is not generally an issue for MR analysis (Figure 2A) (8). However, this can be this can be problematic in situations where the exposure variable is mis-specified i.e., the genetic instrument is biologically related to an intermediate or outcome, but has been identified as being related to the exposure by virtue of the latter's correlation with the biologically related trait (4), termed correlated pleiotropy (44). For example, although a locus in FTO was initially identified in relation to type II diabetes, this was subsequently found to primarily influence BMI with secondary effects on type II diabetes (45), leading to difficulties in interpreting MR studies where FTO variation is used as instrumental variable for type 2 diabetes.
Figure 2. Vertical and horizontal pleiotropy. (A) Vertical pleiotropy, which does not violate the MR assumption; (B) Horizontal pleiotropy, which violates the MR assumption.
In contrast, horizontal pleiotropy (i.e., a genetic variant is associated with two traits which influence the outcome via independent biological pathways) violates the exclusion restriction criterion (Figure 2B). GWAS identify genetic instruments purely on statistical grounds. Even if instrumental variables used in MR studies intersect genes with plausible pathways to the exposure it's not possible to be sure whether they mediate the causal effect being evaluated. Therefore, potential horizontal pleiotropy as a result of unknown pathways needs to be excluded if MR studies are to reach robust conclusions about causality. One simple method to limit the impact of horizontal pleiotropy is leave-one-out as a sensitivity analysis to ensure that the causal effect is not mediated by an outlier effect of one specific locus (46).
Over the last few years, several methods have been developed that relax the strict requirement that genetic instruments exhibit no horizontal pleiotropy yet still produce consistent causal effect estimates (7). One such approach is MR-Egger regression (47), where given a set of genetic variants that proxy an exposure variable of interest, a regression is performed between estimates of the SNP-outcome association and SNP-exposure association (this can be performed in both one and two-sample MR analyses). Unfortunately, Egger regression is limited by very poor power. Weighted median and weighted mode approaches have since been developed to derive causal estimates based on the relationship between the strength of the association between the SNP and the outcome, and the strength of the association of the SNP with the exposure, which are more robust to violation of horizontal pleiotropy by a substantial proportion of instruments (48, 49). Several additional methods now exist which assume that only a certain proportion of the genetic instruments have a horizontal pleiotropic effect. These methods aim to reduce heterogeneity by removing SNPs that contribute to heterogeneity disproportionately, based on the standard errors of the Wald ratios. Such outlier removal strategies are applied in the MR-PRESSO (50), and generalized summary MR (GSMR) approaches (51). One of the issues in applying MR methods that are robust to pleiotropy is that in order to detect causal effects these require large sample sizes (49). Another issue concerns the number of SNPs used as IVs; significant numbers of SNPs are required to provide sufficient data points for meaningful analysis. However, an advantage is that these approaches often rely on different sets of assumptions, and if consistent, conclusions can be drawn regarding causality with reasonable confidence.
The recent review by Lawlor et al. provides a useful summary of the various methods and extensions of Mendelian Randomization (52). Some of these more advanced MR analysis methods have been applied in relatively recent studies examining causal inference in osteoporosis. For example, in their study examining causal relationships between blood lipids and eBMD, Cherny et al. found broadly similar inverse associations between LDL-C and eBMD as assessed by inverse-variance weighted (IVW), MR-Egger, weighted median and weighted mode estimates (53). Similarly, in our recent study, MR-Egger, IVW and weighted median estimates showed similar causal effects of BMD on sclerostin (54). That said, the statistical evidence against the null for the MR-Egger estimates was somewhat lower, in both these papers, reflecting the lower statistical power of this test (48, 49). However, these types of sensitivity analyses have been lacking in many of the MR analyses in osteoporosis, including those analyzing a range of metabolites for which genetic influences could well-exert pleiotropic effects via unknown pathways (38).
It may be hard to exclude pleiotropy where the genetic instruments comprise just one or two SNPs, however there are exceptions to this. For example, we recently performed an MR study to examine the causal relationship between sclerostin levels and eBMD, based on results of a sclerostin GWAS where we identified just two loci. However, we were able to establish a causal relationship between sclerostin and eBMD using co-localization analysis, which interrogates LD structure at a single locus, in this case the gene encoding B4GALNT3 (54).
Several analysis methods have been developed to explore causal pathways in those situations where genetic instruments are pleiotropically associated with multiple correlated phenotypes. For example, in studying the causal relationship between genetically determined BMD (reflected by eBMD) and OA, Funck-Brentano et al. observed a strong causal effect of BMI on knee and hip OA, suggesting that if any eBMD SNPs are shared with BMI, this may influence OA via pathways other than BMD, which will violate the 3rd MR assumption (horizontal pleiotropy). The authors established that genetically determined BMD also has a causal effect on OA after excluding pleiotropic pathways involving BMI, by removing SNPs from their eBMD polygenic risk score that were related to BMI (55). Similarly, Cousminer et al. excluded SNPs for height and BMI in their MR analysis of the causal role of pubertal age on BMD (33).
An alternative method of accounting for pleiotropy where genetic variants are pleiotropically associated with multiple correlated phenotypes, is to perform multivariable MR. The latter aims to address this limitation by using instruments associated with multiple exposures to jointly estimate the separate causal effect of individual risk factors on the outcome (34, 56–58). For example, Kemp et al. used a one-sample multivariable method to show that BMI SNPs acted via both lean and fat mass to increase BMD (34). In an MR analysis of relationships between plasma lipids and BMD, we observed a strong inverse association between LDL cholesterol and forearm BMD in multivariate MR analyses adjusting for HDL cholesterol and triglycerides, which was not evident in univariate analyses involving only LDL cholesterol (59). This indicates that complex relationships may exist between the causal effects of different lipids and BMD, which MR analyses need to account for, and may help to explain the conflicting results from different MR analyses examining relationships between lipid levels and eBMD in the UK Biobank (53, 59, 60).
Distinguishing Genetic Correlation From Causality
1. Traits which are correlated as a result of shared underlying biology are likely to have shared genetic influences, leading to a positive signal in MR studies
2. MR signals arising from genetic correlation between two traits are expected to be bidirectional; true causal effects generally produce a positive MR signal in one direction only (i.e., exposure to outcome as opposed to outcome to exposure)
3. In bidirectional MR, it may be helpful to use methods such as Steiger filtering to restrict SNPs to those which have strongest effects on the outcome as opposed to the exposure being tested
4. Though rarer, bidirectional causal effects may exist, exemplified by a positive causal effect of BMD on sclerostin levels, and a negative causal effect of sclerostin on BMD.
It's common for two related traits to share a proportion of their heritability, as quantified by genetic correlation, implying some form of shared underlying biology. Bidirectional MR can help distinguish causality from correlation by first testing the associations in one direction (i.e., “exposure” to “outcome”), and then performing these in the opposite direction (i.e., “outcome” to “exposure”), using SNPs found to be associated with each trait in different GWASs. In those instances where certain SNPs are common to GWASs for both the exposure and outcome, methods such as Steiger filtering are recommended to remove these SNPs to ensure they are used correctly as instruments for analyses in one direction only (61). Bidirectional MR assumes that the underlying causal association works in a single direction. Where there is evidence for “bidirectional causality,” this may simply reflect genetic correlation arising from a common genetic pathway affecting both the exposure and outcome. That said, bidirectional Mendelian randomization can identify causal effects that do work in both directions; for example, smoking reduces BMI and higher BMI increases smoking (62). In the case of bidirectional causality where evidence is stronger in one direction, although the main causal pathway may be in this direction, findings may reflect misspecification of the exposure variable as described above. Alternative strategies to MR, such as latent causal variable analysis, have been developed to distinguish correlation from causality (63).
Certain biomarkers and risk factors for osteoporosis may be unlikely to show strong genetic correlations with BMD, and to be influenced by common biological pathways, nevertheless it's still good practice to perform bidirectional MR. For example, in the case of factors such as smoking, which was found to be genetically related to lower heel BMD (25), in the absence of bidirectional MR, it's not possible to exclude reverse causality, which is not inconceivable given the casual effect of BMI (which is known to influence BMD) on smoking (62). In recent studies examining relationships between panels of blood metabolites and BMD, where the direction of the causal effect is unclear, whereas one study reported findings from bidirectional analysis (64), a further one did not (38). In addition, genetic correlation as a consequence of shared biological pathways could conceivably explain relationships between BMD and other disease phenotypes such as osteoarthritis (OA). For example, a previous study revealed significant genetic correlation between LS BMD and hip and/or knee OA, suggesting common genetic influences, exemplified by the SMAD3 locus found to affect both OA risk and BMD (65). Shared genetic influences on BMD and OA could also explain recent findings that genetic instruments for eBMD are associated OA (55); whereas the authors interpreted this as indicative of a causal effect of eBMD on OA, bidirectional MR is required to prove such a causal pathway exists, as opposed to common biological mechanisms contributing equally to both traits.
One of the challenges in performing bidirectional MR between two variables which are highly correlated genetically is that the two traits are likely to share one or more SNPs in common. This is particularly an issue when using results derived from large GWAS studies that generate many signals. For example, in our recent study of relationships between eBMD and lipids, a bidirectional effect for eBMD on LDL-C was investigated using 404 eBMD associated SNPs as genetic instruments (12). Steiger filtering was used to identify SNPs that had stronger effects on the outcome (LDL-C) compared to the exposure (BMD). This analysis suggested that 394 of 404 SNPs exerted their primary effect on BMD as opposed to LDL-C levels. IVW MR, weighted median MR and MR-Egger regression results showed some evidence that eBMD might influence LDL-C, and the association remained unchanged after Steiger filtering to remove those SNPs that primary affected LDL-C levels (59).
As well as selectively removing SNPs to assist interpretation of bidirectional analyses, this approach may also be helpful in examining the role of specific biological pathways involved in mediating causal effects. For example, in the above MR analysis of the effects of plasma lipids on eBMD, we were able to confirm that the inhibitory effect of LDL cholesterol on eBMD which we observed was not solely mediated by SNPs intersecting the HMGR locus which is the target for statin therapy, since similar results were obtained when SNPs at this locus were removed from the polygenic risk score. Similarly, SNPs can be stratified into relevant/specific biological pathways and their association with outcomes of interest tested. For example, although not a formal MR analysis, Warrington et al. used genetic risk scores constructed from SNPs belonging to specific biological pathways, and showed that genetic risk scores comprising variants that belonged to the RANK-RANKL-OPG pathway, the mesenchymal stem cell differentiation functional pathway and the WNT signaling function pathway were associated with bone measures at age 13, but only mesenchymal stem cell differentiation and the WNT pathway SNPs showed associations with rate of change in BMD between 9 and 17 years (66).
It's a reasonable assumption that correlated variables as a result of shared biology show equivalent “causal” effects on bidirectional MR, whereas for a true causal relationship, an effect is just observed in one direction. However, we recently observed a further pattern in our study exploring the relationship between circulating sclerostin levels on eBMD, namely bidirectional causal pathways in opposite directions (54). We found that higher levels of serum sclerostin were causally related to lower FN BMD, lower eBMD and higher fracture risk. In contrast, greater BMD was causally related to higher sclerostin levels, using BMD SNPs identified in the GEFOS BMD GWAS (54). This finding aligned with the observational relationship between BMD and sclerostin we reported in the same paper and may be a reflection of a previously unsuspected negative control feedback mechanism for BMD (see Figure 3). However, the exact mechanisms involved remain unclear and functional validation of such a pathway is still needed.
Figure 3. Proposed feedback pathway between BMD and sclerostin. Greater BMD is proposed to increase circulating levels of sclerostin, which then feeds back to inhibit bone formation, and hence limit further gains in BMD.
In contrast to conventional epidemiological studies where the exposure variable comprises the population variance of the trait of interest, in MR, genetic instruments only capture a small proportion of trait variance (not infrequently <1%). As a consequence, the strength of the relationship between an instrumental variable used for MR, and the outcome of interest, will only be of a small fraction of that seen for the measured exposure variable. Therefore, limited power is a common problem for MR analyses, and a frequent explanation of null findings, and needs to be an important consideration particularly when the findings fail to support other well-established lines of evidence. Limited power is even more problematic for some of the more recent extensions to MR, such as multivariable MR.
In any given MR study, the major factors governing the power are the sample size, the strength of the genetic instruments available, the strength of the underlying causal relationship being evaluated, and the type I error rate. Recent availability of very large datasets, such as the UK Biobank, have facilitated well powered MR studies, as have the increasing number of GWAS signals available for any given trait. Nonetheless, even where a large sample is available, MR may be uninformative where available instruments are lacking, or if there is a weak underlying relationship. These considerations are particularly important when null associations are obtained, where it is helpful to report power calculations to illustrate the strength of any underlying relationship which would have been detectable, and how this compares with that seen in observational studies (67).
A related issue is weak instrument bias. In one-sample MR, using weak instruments may bias the causal association toward the observational association between the exposure and outcome, whereas in two-sample MR, weak instruments may bias MR estimates toward the null (68). Therefore, it is important to avoid such bias by evaluating instrument strength at the outset of the study. For most human phenotypes, common genetic variants only explain a limited proportion of the variance of the phenotypes; combining small effects across these common variants into a score (known as the polygenic risk score) may increase instrument strength (69). Another relevant concept is the “No Measurement Error” (NOME) assumption of MR, which assumes the association between a given genetic instrument and the exposure is estimated without measurement error (70). This is particularly important when weak instruments are estimated from GWAS with small sample sizes. In an IVW setting, the mean F-statistic can be used to assess whether instruments violate the NOME assumption, a value below 10 implying a high likelihood of weak instrument bias (71). In an MR-Egger setting, the I2 statistic (between 0 and 1) can be used to quantify violation of the NOME assumption, a lower value indicating greater likelihood that this assumption has been violated (48).
In the majority of instances of null findings reported in recent MR studies of osteoporosis risk factors, genetic instruments have been identified based on genome-wide significant associations from large scale GWASs, and although instrument strength is not universally reported, weak instrument bias is less likely to be an issue under these circumstances (70). However, weak instrument bias may be an issue in those instances where instruments have been identified from relatively small GWAS studies. For example, in a two-sample MR study examining causal relationships between inflammation and BMD reporting null findings, three of the genetic instruments for IL-6 were derived from a population of 1,664 individuals, and had F-statistics ranging from 3 to 8, indicating high likelihood of weak instrument bias (29).
It may be possible to extend MR to identify novel risk factors for osteoporosis using a hypothesis-free approach. For example, centralized databases such as MR-Base (46) and UK Biobank (http://www.nealelab.is/uk-biobank) have harmonized GWAS summary results for more than 20,000 complex human traits. Such resources make it feasible to conduct a phenome-wide MR for osteoporosis, aimed at identifying novel causal effects on BMD from screening a comprehensive range of complex traits. In many cases, mega biobanks such as UK Biobank, provide the richest source of GWAS-linked exposure or outcome data. Consequently, the issue of overlapping samples for generating genetic instruments and providing outcome data in a two-sample MR framework, potentially providing biased estimates (8), is becoming increasingly problematic. With the burgeoning opportunities for performing MR analyses, there also comes the need to ensure these are performed and reported comprehensively, with thorough exploration of issues such as pleiotropy, reverse causality and power, to ensure appropriate conclusions are drawn. STROBE-MR guidelines, intended to improve the quality of reporting of MR studies, have recently been produced (72).
MR was initially developed to examine the causal role of environmental exposures on the outcome of interest. This method has since been applied to a wide range of research areas, including drug target validation and prioritization, and the interpretation of multi-dimensional omics data. Large-scale GWASs of omics data, such as metabolites, DNA methylation, gene expression and protein expression provide a timely opportunity to identify the causal relationship of thousands of molecular phenotypes with osteoporosis in a MR framework. Automated tools such as summary-data-based MR (SMR), Generalized Summary-data-based MR (GSMR) and the two-sample MR R package make it possible to conduct such large-scale analyses effectively (46, 51, 73).
For omics studies of osteoporosis, one of the issues that needs further consideration is tissue specificity. Most molecular phenotypes to date have been measured in whole blood, for which the sample size of expression QTLs and methylation QTLs studies exceeds 30,000 (74) (http://www.godmc.org.uk/) and protein QTLs studies exceed 6,000 (75, 76). In contrast, the QTLs measured in bone tissues are limited to several hundreds of individuals. Whether molecular phenotypes measured in blood can be used as a proxy for those measured in bone tissues remains unclear, particularly methylation which shows a high degree of tissue specificity, in line with emerging trends in tissue specific MR (77), implying an urgent need for osteoblasts, osteoclasts and osteocytes and other skeletal cell types to be sufficiently well- represented in omics resources.
MR is being increasingly applied to examine causal inference in osteoporosis, reflecting the increasing availability of large datasets such as the UK Biobank, and multiple GWASs for potential risk factors. To date, the most important findings have been around the lack of causal role of traditional risk factors such as vitamin D in determining variation within the normal range of BMD/fracture risk. High-dimensional omics studies, based on GWASs of metabolites, gene expression and DNA methylation, offer exciting opportunities for future discovery, with the emergence of the first MR studies of metabolites in osteoporosis. However, an important caveat is that MR studies can be complicated by a number of issues including horizontal pleiotropy, reverse causality, and lack of power. Several extended MR methods have been developed to explore these aspects, and while not always applied consistently, r STROBE-MR guidelines have recently been produced, intended to support the quality with which MR studies are reported.
JZ and MF helped to write the manuscript. JK, DE, and GS reviewed the manuscript and provided critical comments. JT conceived and helped to write the manuscript.
JZ receives salary and start-up funding from the University of Bristol (Vice-Chancellor's fellowship), and MF from the Wellcome Trust (ref 209233/Z).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
1. Cummings SR, Black DM, Nevitt MC, Browner W, Cauley J, Ensrud K, et al. Bone density at various sites for prediction of hip fractures. The Study of Osteoporotic Fractures Research Group. Lancet. (1993) 341:72–5. doi: 10.1016/0140-6736(93)92555-8
6. Evans DM, Davey Smith G. Mendelian randomization: new applications in the coming age of hypothesis-free causality. Annu Rev Genomics Hum Genet. (2015) 16:327–50. doi: 10.1146/annurev-genom-090314-050016
10. Brumpton B, Sanderson E, Hartwig FP, Harrison S, Vie GÅ, Cho Y, et al. Within-family studies for Mendelian randomization: avoiding dynastic, assortative mating, and population stratification biases. bioRxiv [Preprint]. (2019). doi: 10.1101/602516
11. Kemp JP, Morris JA, Medina-Gomez C, Forgetta V, Warrington NM, Youlten SE, et al. Identification of 153 new loci associated with heel bone mineral density and functional involvement of GPC6 in osteoporosis. Nat Genet. (2017) 49:1468–75. doi: 10.1038/ng.3949
13. Estrada K, Styrkarsdottir U, Evangelou E, Hsu Y-H, Duncan EL, Ntzani EE, et al. Genome-wide meta-analysis identifies 56 bone mineral density loci and reveals 14 loci associated with risk of fracture. Nat Genet. (2012) 44:491–501. doi: 10.1038/ng.2249
14. Munafò MR, Tilling K, Taylor AE, Evans DM, Davey Smith G. Collider scope: when selection bias can substantially influence observed associations. Int J Epidemiol. (2018) 47:226–35. doi: 10.1101/079707
15. Arden NK, Baker J, Hogg C, Baan K, Spector TD. The heritability of bone mineral density, ultrasound of the calcaneus and hip axis length: a study of postmenopausal twins. J Bone Miner Res. (1996) 11:530–4. doi: 10.1002/jbmr.5650110414
16. Howard GM, Nguyen TV, Harris M, Kelly PJ, Eisman JA. Genetic and environmental contributions to the association between quantitative ultrasound and bone mineral density measurements: a twin study. J Bone Miner Res. (1998) 13:1318–27. doi: 10.1359/jbmr.19184.108.40.2068
17. Hunter DJ, de Lange M, Andrew T, Snieder H, MacGregor AJ, Spector TD. Genetic variation in bone mineral density and calcaneal ultrasound: a study of the influence of menopause using female twins. Osteoporos Int. (2001) 12:406–11. doi: 10.1007/s001980170110
18. Lee M, Czerwinski SA, Choh AC, Demerath EW, Sun SS, Chumlea WC, et al. Unique and common genetic effects between bone mineral density and calcaneal quantitative ultrasound measures: the Fels Longitudinal Study. Osteoporos Int. (2006) 17:865–71. doi: 10.1007/s00198-006-0075-4
19. Trajanoska K, Morris JA, Oei L, Zheng H-F, Evans DM, Kiel DP, et al. Assessment of the genetic and clinical determinants of fracture risk: genome wide association and mendelian randomisation study. BMJ. (2018) 362:k3225. doi: 10.1136/bmj.k3225
20. Hsu Y-H, Estrada K, Evangelou E, Ackert-Bicknell C, Akesson K, Beck T, et al. Meta-analysis of genomewide association studies reveals genetic variants for hip bone geometry. J Bone Miner Res. (2019) 34:1284–96. doi: 10.1002/jbmr.3698
21. Li S-S, Gao L-H, Zhang X-Y, He J-W, Fu W-Z, Liu Y-J, et al. Genetically low vitamin d levels, bone mineral density, and bone metabolism markers: a Mendelian Randomisation Study. Sci Rep. (2016) 6:33202. doi: 10.1038/srep33202
22. Sun J-Y, Zhao M, Hou Y, Zhang C, Oh J, Sun Z, et al. Circulating serum vitamin D levels and total body bone mineral density: a Mendelian randomization study. J Cell Mol Med. (2019) 23:2268–71. doi: 10.1111/jcmm.14153
23. Larsson SC, Melhus H, Michaëlsson K. Circulating Serum 25-hydroxyvitamin D levels and bone mineral density: mendelian randomization study. J Bone Miner Res. (2018) 33:840–4. doi: 10.1002/jbmr.3389
24. Yang Q, Lin SL, Au Yeung SL, Kwok MK, Xu L, Leung GM, et al. Genetically predicted milk consumption and bone health, ischemic heart disease and type 2 diabetes: a Mendelian randomization study. Eur J Clin Nutr. (2017) 71:1008–12. doi: 10.1038/ejcn.2017.8
25. Guo R, Wu L, Fu Q. Is There Causal Relationship of Smoking and Alcohol Consumption with Bone Mineral Density? A Mendelian Randomization Study. Calcif Tissue Int. (2018) 103:546–53. doi: 10.1007/s00223-018-0452-y
26. Yuan S, Michaëlsson K, Wan Z, Larsson SC. Associations of smoking and alcohol and coffee intake with fracture and bone mineral density: a Mendelian Randomization Study. Calcif Tissue Int. (2019) 105:582–8. doi: 10.1007/s00223-019-00606-0
27. Xiong A, Yao Q, He J, Fu W, Yu J, Zhang Z. No causal effect of serum urate on bone-related outcomes among a population of postmenopausal women and elderly men of Chinese Han ethnicity–a Mendelian randomization study. Osteoporos Int. (2016) 27:1031–9. doi: 10.1007/s00198-015-3341-5
28. Dalbeth N, Topless R, Flynn T, Cadzow M, Bolland MJ, Merriman TR. Mendelian randomization analysis to examine for a causal effect of urate on bone mineral density. J Bone Miner Res. (2015) 30:985–91. doi: 10.1002/jbmr.2434
30. van Vliet NA, Noordam R, van Klinken JB, Westendorp RG, Bassett JD, Williams GR, et al. Thyroid stimulating hormone and bone mineral density: evidence from a two-sample Mendelian Randomization Study and a Candidate Gene Association Study. J Bone Miner Res. (2018) 33:1318–25. doi: 10.1002/jbmr.3426
32. Li GH-Y, Cheung C-L, Au PC-M, Tan KC-B, Wong IC-K, Sham P-C. Positive effects of low LDL-C and statins on bone mineral density: an integrated epidemiological observation analysis and Mendelian Randomization study. Int J Epidemiol. (2019). dyz145. doi: 10.1093/ije/dyz145
33. Cousminer DL, Mitchell JA, Chesi A, Roy SM, Kalkwarf HJ, Lappe JM, et al. Genetically determined later puberty impacts lowered bone mineral density in childhood and adulthood. J Bone Miner Res. (2018) 33:430–6. doi: 10.1002/jbmr.3320
34. Kemp JP, Sayers A, Smith GD, Tobias JH, Evans DM. Using Mendelian randomization to investigate a possible causal relationship between adiposity and increased bone mineral density at different skeletal sites in children. Int J Epidemiol. (2016) 45:1560–72. doi: 10.1093/ije/dyw079
35. Ahmad OS, Leong A, Miller JA, Morris JA, Forgetta V, Mujammami M, et al. A Mendelian Randomization Study of the Effect of Type-2 diabetes and glycemic traits on bone mineral density. J Bone Miner Res. (2017) 32:1072–81. doi: 10.1002/jbmr.3063
37. Gan W, Clarke RJ, Mahajan A, Kulohoma B, Kitajima H, Robertson NR, et al. Bone mineral density and risk of type 2 diabetes and coronary heart disease: a Mendelian randomization study. Wellcome Open Res. (2017) 2:68. doi: 10.12688/wellcomeopenres.12288.1
38. Liu L, Wen Y, Zhang L, Xu P, Liang X, Du Y, et al. Assessing the associations of blood metabolites with osteoporosis: a mendelian randomization Study. J Clin Endocrinol Metab. (2018) 103:1850–5. doi: 10.1210/jc.2017-01719
39. Cerani A, Zhou S, Forgetta V, Morris JA, Trajanoska K, Rivadeneira F, et al. Genetic predisposition to increased serum calcium, bone mineral density, and fracture risk in individuals with normal calcium levels: mendelian randomisation study. BMJ. (2019) 366:l4410. doi: 10.1136/bmj.l4410
40. Bergholdt HKM, Larsen MK, Varbo A, Nordestgaard BG, Ellervik C. Lactase persistence, milk intake, hip fracture and bone mineral density: a study of 97 811 Danish individuals and a meta-analysis. J Intern Med. (2018) 284:254–69. doi: 10.1111/joim.12753
41. Lai FY, Nath M, Hamby SE, Thompson JR, Nelson CP, Samani NJ. Adult height and risk of 50 diseases: a combined epidemiological and genetic analysis. BMC Med. (2018) 16:187. doi: 10.1186/s12916-018-1175-7
42. Nethander M, Vandenput L, Eriksson AL, Windahl S, Funck-Brentano T, Ohlsson C. Evidence of a Causal Effect of Estradiol on Fracture Risk in Men. J Clin Endocrinol Metab. (2019) 104:433–42. doi: 10.1210/jc.2018-00934
43. Oei L, Campos-Obando N, Dehghan A, Oei EHG, Stolk L, van Meurs JBJ, et al. Dissecting the relationship between high-sensitivity serum C-reactive protein and increased fracture risk: the Rotterdam Study. Osteoporos Int. (2014) 25:1247–54. doi: 10.1007/s00198-013-2578-0
44. Morrison J, Knoblauch N, Marcus J, Stephens M, He X. Mendelian randomization accounting for correlated and uncorrelated pleiotropic effects using genome-wide summary statistics. bioRxiv [Preprint]. (2019) doi: 10.1101/682237
45. Frayling TM, Timpson NJ, Weedon MN, Zeggini E, Freathy RM, Lindgren CM, et al. A common variant in the FTO gene is associated with body mass index and predisposes to childhood and adult obesity. Science. (2007) 316:889–94. doi: 10.1126/science.1141634
46. Hemani G, Zheng J, Elsworth B, Wade KH, Haberland V, Baird D, et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife. (2018) 7: e34408. doi: 10.7554/eLife.34408
47. Bowden J, Davey Smith G, Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol. (2015) 44:512–25. doi: 10.1093/ije/dyv080
48. Bowden J, Del Greco MF, Minelli C, Davey Smith G, Sheehan NA, Thompson JR. Assessing the suitability of summary data for two-sample Mendelian randomization analyses using MR-Egger regression: the role of the I 2 statistic. Int J Epidemiol. (2016) 45:1961–74. doi: 10.1093/ije/dyw220
50. Verbanck M, Chen C-Y, Neale B, Do R. Widespread pleiotropy confounds causal relationships between complex traits and diseases inferred from Mendelian randomization. Nat Genet. (2017) 50:693–8. doi: 10.1038/s41588-018-0099-7
51. Wu Y, Zeng J, Zhang F, Zhu Z, Qi T, Zheng Z, et al. Integrative analysis of omics summary data reveals putative mechanisms underlying complex traits. Nat Commun. (2018) 9:918. doi: 10.1038/s41467-018-03371-0
52. Lawlor DA, Wade K, Borges MC, Palmer T, Hartwig FP, Hemani G, et al. A Mendelian Randomization dictionary: useful definitions and descriptions for undertaking, understanding and interpreting Mendelian Randomization studies. Centre for open science. (2019) doi: 10.31219/osf.io/6yzs7
54. Zheng J, Maerz W, Gergei I, Kleber M, Drechsler C, Wanner C, et al. Mendelian Randomization analysis reveals a causal influence of circulating sclerostin levels on bone mineral density and fractures. J Bone Miner Res. (2019) 34:1824–36. doi: 10.1002/jbmr.3803
55. Funck-Brentano T, Nethander M, Movérare-Skrtic S, Richette P, Ohlsson C. Causal factors for knee, hip and hand osteoarthritis: a Mendelian randomization study in the UK Biobank. Arthritis Rheumatol. (2019) doi: 10.1002/art.40928
57. Burgess S, Dudbridge F, Thompson SG. Re: Multivariable Mendelian randomization: the use of pleiotropic genetic variants to estimate causal effects. Am J Epidemiol. (2015) 181:290–1. doi: 10.1093/aje/kwv017
58. Sanderson E, Davey Smith G, Windmeijer F, Bowden J. An examination of multivariable Mendelian randomization in the single-sample and two-sample summary data settings. Int J Epidemiol. (2019) 48:713–72. doi: 10.1093/ije/dyy262
59. Zheng J, Brion M-J, Kemp JP, Warrington NM, Borges M-C, Hemani G, et al. The effect of plasma lipids and lipid lowering interventions on bone mineral density: a Mendelian randomization study. bioRxiv [Preprint] (2018). doi: 10.1101/480426
60. Yang X-L, Cui Z-Z, Zhang H, Wei X-T, Feng G-J, Liu L, et al. Causal link between lipid profile and bone mineral density: a Mendelian randomization study. Bone. (2019) 127:37–43. doi: 10.1016/j.bone.2019.05.037
61. Hemani G, Tilling K, Davey Smith G. Orienting the causal relationship between imprecisely measured traits using GWAS summary data. PLoS Genet. (2017) 13:e1007081. doi: 10.1371/journal.pgen.1007081
62. Carreras-Torres R, Johansson M, Haycock PC, Relton CL, Davey Smith G, Brennan P, et al. Role of obesity in smoking behaviour: Mendelian randomisation study in UK Biobank. BMJ. (2018) 361:k1767. doi: 10.1136/bmj.k1767
64. Moayyeri A, Adams JE, Adler RA, Krieg M-A, Hans D, Compston J, et al. Quantitative ultrasound of the heel and fracture risk assessment: an updated meta-analysis. Osteoporos Int. (2012) 23:143–53. doi: 10.1007/s00198-011-1817-5
65. Hackinger S, Trajanoska K, Styrkarsdottir U, Zengini E, Steinberg J, Ritchie GRS, et al. Evaluation of shared genetic aetiology between osteoarthritis and bone mineral density identifies SMAD3 as a novel osteoarthritis risk locus. Hum Mol Genet. (2017) 26:3850–8. doi: 10.1093/hmg/ddx285
66. Warrington NM, Kemp JP, Tilling K, Tobias JH, Evans DM. Genetic variants in adult bone mineral density and fracture risk genes are associated with the rate of bone mineral density acquisition in adolescence. Hum Mol Genet. (2015) 24:4158–66. doi: 10.1093/hmg/ddv143
69. Evans DM, Brion MJA, Paternoster L, Kemp JP, McMahon G, Munafò M, et al. Mining the human phenome using allelic scores that index biological intermediates. PLoS Genet. (2013) 9:e1003919. doi: 10.1371/journal.pgen.1003919
70. Bowden J, Del Greco MF, Minelli C, Lawlor D, Sheehan N, Thompson J, et al. Improving the accuracy of two-sample summary data Mendelian randomization: moving beyond the NOME assumption. Int J Epidemiol. (2018) 48:728–42. doi: 10.1093/ije/dyy258
72. Smith GD, Davies NM, Dimou N, Egger M, Gallo V, Golub R, et al. STROBE-MR: Guidelines for strengthening the reporting of Mendelian randomization studies. Peer J Preprints (2019) 7:e27857v1. doi: 10.7287/peerj.preprints.27857
73. Zhu Z, Zhang F, Hu H, Bakshi A, Robinson MR, Powell JE, et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat Genet. (2016) 48:481–7. doi: 10.1038/ng.3538
74. Võsa U, Claringbould A, Westra H-J, Bonder MJ, Deelen P, Zeng B, et al. Unraveling the polygenic architecture of complex traits using blood eQTL metaanalysis. bioRxiv [Preprint]. (2018) doi: 10.1101/447367
75. Zheng J, Haberland V, Baird D, Walker V, Haycock P, Gutteridge A, et al. Phenome-wide Mendelian randomization mapping the influence of the plasma proteome on complex diseases. bioRxiv [Preprint]. (2019). doi: 10.1101/627398
Keywords: bone mineral density (BMD), fractures - bone, pleiotropy, sclerostin, GWAS - genome-wide association study
Citation: Zheng J, Frysz M, Kemp JP, Evans DM, Davey Smith G and Tobias JH (2019) Use of Mendelian Randomization to Examine Causal Inference in Osteoporosis. Front. Endocrinol. 10:807. doi: 10.3389/fendo.2019.00807
Received: 26 July 2019; Accepted: 04 November 2019;
Published: 21 November 2019.
Edited by:Wim Van Hul, University of Antwerp, Belgium
Reviewed by:Melissa Orlandin Premaor, Federal University of Minas Gerais, Brazil
Michaël R. Laurent, University Hospitals Leuven, Belgium
Katerina Trajanoska, Erasmus Medical Center, Netherlands
Copyright © 2019 Zheng, Frysz, Kemp, Evans, Davey Smith and Tobias. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jonathan H. Tobias, firstname.lastname@example.org
†These authors have contributed equally to this work