Detecting Weak Signals by Combining Small P-Values in Genetic Association Studies

Vsevolozhskaya, Olga A.; Hu, Fengjiao; Zaykin, Dmitri V.

doi:10.3389/fgene.2019.01051

ORIGINAL RESEARCH article

Front. Genet., 20 November 2019

Sec. Statistical Genetics and Methodology

Volume 10 - 2019 | https://doi.org/10.3389/fgene.2019.01051

Detecting Weak Signals by Combining Small P-Values in Genetic Association Studies

Olga A. Vsevolozhskaya¹

Fengjiao Hu²

Dmitri V. Zaykin^1*

¹Department of Biostatistics, College of Public Health, University of Kentucky, Lexington, KY, United States
²Biostatistics and Computational Biology, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, NC, United States

We approach the problem of combining top-ranking association statistics or P-values from a new perspective which leads to a remarkably simple and powerful method. Statistical methods, such as the rank truncated product (RTP), have been developed for combining top-ranking associations, and this general strategy proved to be useful in applications for detecting combined effects of multiple disease components. To increase power, these methods aggregate signals across top ranking single nucleotide polymorphisms (SNPs), while adjusting for their total number assessed in a study. Analytic expressions for combined top statistics or P-values tend to be unwieldy, which complicates interpretation and practical implementation and hinders further developments. Here, we propose the augmented rank truncation (ART) method that retains main characteristics of the RTP but is substantially simpler to implement. ART leads to an efficient form of the adaptive algorithm, an approach where the number of top ranking SNPs is varied to optimize power. We illustrate our methods by strengthening previously reported associations of μ-opioid receptor variants with sensitivity to pain.

Introduction

Complex diseases are influenced by multiple environmental and genetic risk factors. A specific factor, such as a single mutation, may convey a high risk, but population frequencies of high risk factors are usually low, and substantial contribution to disease incidence can be attributable to accumulation of multiple but weak determinants within individuals. Genetic determinants of complex diseases that had been identified by genetic association studies tend to carry modest effects; yet, power to detect such variants, as well as accuracy of identifying individuals at risk, can be improved by combining multiple weak predictors. The main challenge in detecting specific variants is low statistical power, but the overall accumulated effect of many individually weak signals can be much stronger. It is convenient to combine statistical summaries of associations—for example, P-values, and this approach can be nearly as efficient as analysis of raw data (Lin and Zeng, 2010). In observational research, methods for combining P-values are commonly associated with meta-analyses that pool results of multiple experiments studying the same hypothesis. The combined P-value then aggregates signals across all L studies, potentially providing a higher level of assurance that the studied risk factor is associated with disease. Furthermore, if samples in those studies are taken from populations that are similar with respect to the effect size magnitude, the combined meta-analytic P-value will well approximate the one that would have been obtained by pooling together all raw data and performing a single test (Zaykin, 2011).

P-values can also be combined when the L hypotheses are distinct, and when the interest is in detecting the overall signal. Such applications are common and include gene set and pathway analyses. Specifically, a typical strategy in computation of gene- and pathway-scores includes (1) mapping individual SNPs to genes, followed by combining their association P-values into gene-scores, and (2) grouping genes into pathways and combining gene-scores into pathway-scores.

Existing tools for combining P-values (P_i,i = 1, …, L) are often based on the sum of P_i‘s transformed by some function H. For example, Fisher (1932) test is based on the log-transformed P-values, H(P_i)=–21n(P_i), which are then added up to form a test statistics $T = \sum_{i = 1}^{L} H (P_{i}) ~ χ_{(2 L)}^{2}$ , where $χ_{(2 L)}^{2}$ has the chi-square distribution with 2 L degrees of freedom. When a portion of L distinct associations is expected to be spurious, it is advantageous to combine only some of the predictors using a truncated variation of combined P-value methods or emphasize the smallest P-values. For instance, Zaykin et al. (2002) proposed the truncated product method (TPM) as a variation of the Fisher test, which was trimmed by the indicator function, I(P_i≤ α), that is equal to zero if P_i> α, and one if P_i≤ α; 0 < α ≤ 1 is a truncation threshold. The combined P-value, P_TPM, is then given by the cumulative distribution function (CDF) of $W = \sum_{i = 1}^{L} \ln (P_{i}) I (P_{i} \leq α)$ . With the TPM approach, the threshold α is fixed while the number of P-values that form the sum W is random. Similarly inspired methods include the “tail strength measure” Taylor and Tibshirani (2005) and a method by Jiang et al. (2011).

A related popular method for combining top-ranking P-values is the rank truncated product (RTP) (Zaykin, 2000; Dudbridge and Koeleman, 2003; Zaykin et al., 2007a). In RTP, the number of P-values to be combined, k, is fixed, rather than the P-value threshold, as in TPM. The resulting combined P-value can be found from the cumulative distribution of the product:

P_{RPT} = Pr {\prod_{I n = 1}^{k} P_{(i)} \leq w} = 1 - Pr {\sum_{i = 1}^{k} \ln [P_{(i)}] > \ln [w]},

where P_(i) is the ith smallest P-value, i = 1,…,k. RTP leads to an appealing extension, where k can be chosen adaptively, to maximize statistical power (Hoh et al., 2001; Yu et al., 2009; Zhang et al., 2013). Adaptive rank truncated product (aRTP) variations optimize selection of the truncation point k among all (or a subset) of possible values 1 ≤ k ≤ L. Adaptive extensions for TPM are not as straightforward because the threshold α is a continuous variable, but one can resort to evaluating the distribution over a set of grid points (Sheng and Yang, 2013). In adaptive extensions of TPM and RTP, the final test statistic is the minimum P-value observed at various candidate truncation points.

The RTP null distribution is considerably more complicated than that of TPM. Complexity of the RTP distribution is due to dependency between ordered P-values. When k = L, this dependency is inconsequential because a statistic is formed as a sum of L terms, and its value does not change if the terms are re-ordered. In fact, when k = L, the RTP P-value is the same as the Fisher combined P-value, derived via a CDF of a sum of independent chi-square variables. However, if 1 < k < L, the k smallest P-values remain correlated and dependent even if these k values are randomly shuffled. The dependency is induced through P_(k+1) being a random variable: when P₍_k+1₎ happens to be relatively small, the k P-values have to squeeze into a relatively small interval from zero to that value. This induces positive dependency between random sets of k smallest P-values, similar to the clustering effect in random effects models. Although the linear correlation can be eliminated by scaling the largest P-value, P₍_k₎, the k values remain dependent, as illustrated in Figure 1 (see Supplementary Material S-1 for discussion).

FIGURE 1

Figure 1 Illustration for decorrelated yet dependent P-values; k = 2, L = 4. A plot of simulated and decorrelated values, U₍₁₎vs.U₍₂₎, reveals a hole in the middle, instead of the complete Malevich black square, indicating dependency.

Applications of combining independent P-values remain important in statistical research, and there is clear preference among practitioners for methods that are based on simple and transparent approaches, such as the Fisher or the inverse normal (Stouffer’s) tests (Fisher, 1932; Stouffer et al., 1949; Loughin, 2004; Whitlock, 2005; Zaykin, 2011; Won et al., 2009). Here, we derive a simple, easily implemented theoretical form of the RTP distribution for independent P-values which further leads to derivation of a new statistic. The new statistic, which we call the augmented RTP, or ART, is based on the product of the first smallest P-values, just like the RTP, but, unlike the RTP, the distribution of the new statistic is given by standard functions, and its computation avoids explicit integration. Despite simplicity, ART is at least as powerful as RTP, according to our simulation studies. Moreover, the ART leads to an adaptive statistic, where the number of the smallest P-values to combine can be determined analytically to maximize power. Next, we extend our methods by allowing dependence in the observed P-values. In genetic association studies, P-values are often correlated due to linkage disequilibrium (LD). The LD correlation is typically accounted for through permutational or other resampling approaches, where P-values are simulated under the null hypothesis while preserving LD between genetic variants. While such approaches are practical and easy to implement, it is also possible to de-correlate P-values before combining them and then use any of the approaches developed under the independence assumption. Surprisingly, we find that the decorrelation step often improves power. In particular, we find that, when association with disease is markedly different among variants within a gene, statistical power of standard methods (without the decorrelation step) approaches a plateau as a function of LD and does not improve as the number of SNPs increases. In contrast, power of our proposed decorrelation method increases steadily with the number of SNPs. Our analytical results as well as simulation experiments demonstrate this property for both ART (where k is chosen beforehand and fixed) and for the adaptive variations of RTP and ART (aRTP and ART-A). Finally, we illustrate usefulness of the proposed methods by strengthening an overall, gene-based association via combining previously reported P-values between pain sensitivity and individual SNPs within the μ-opioid receptor.

Material and Methods

Theoretical RTP Distribution and Augmented RTP, the ART

Even when P-values are independent, previously proposed theoretical forms of the RTP distribution are cumbersome and result in expressions that involve repeated integration (Zaykin, 2000; Dudbridge and Koeleman, 2003; Nagaraja, 2006; Zaykin et al., 2007b). For example, Nagaraja (2006) gives the cumulative distribution for the statistic $W_{k} = - \sum_{i = 1}^{k} \ln P_{(i)}$ and k< L, as:

\begin{array}{l} \begin{array}{l} Pr (W_{k} > w) = \sum_{j = 1}^{k} w_{j} exp {- \frac{c_{j} w}{c_{k + 1}}} \frac{1}{(L - K - 1)!} \int_{0}^{w} \exp {y d_{j}} y^{L - k - 1} d y \\ + \sum_{s = o}^{L - k - 1} \exp {- w} \frac{w^{s}}{s!}, where \\ c_{j} = L - j + 1, \\ d_{j} = \frac{k + 1 - j}{L - k}, \\ w_{j} = \frac{1}{L - j + 1} \frac{L!}{(L - k)!} \frac{{(- 1)}^{k} - j}{(j - 1)! (k - j)!} \end{array} & (1) \end{array}

Theoretical forms of the RTP distribution (e.g., Eq. 1) may retain order-specific terms. Here, we proceed to a simpler representation by noting that every random realization of k smallest P-values can be shuffled. This step does not change the value of the product, W_k (or its logarithm), which is our statistic of interest, but implies that we can treat the joint k-variate distribution as governed by the same pair-wise dependence for every pair of variables. Moreover, variables of that shuffled distribution are identical marginally. The dependency is induced completely through randomness of P₍_k+1₎, and conditionally on its value, the {W_k|p₍_k+1₎} distribution is given by standard independence results. Then, P_RTP is given by the marginal CDF of W_k. Based on this conceptual model, we derived the following representation of RTP where a single integral is evaluated in a bounded interval (0,1), which allows one to evaluate the RTP distribution as a simple average of standard functions. Specifically, we derive a simple expression for the RTP distribution as the expectation of a function of a uniform (0 to 1) random variable:

\begin{array}{l} \begin{array}{l} P_{RTP} (k) = \\ \Pr (W_{k} \leq w) = 1 - \int_{0}^{1} G_{k} {\ln ({\frac{[B_{k + 1}^{−1} (u)]}{w}}^{k})} d u_{} \\ = E {H (U | k, w)}, \end{array} & (2) \end{array}

where $B_{k + 1}^{−1} (\cdot)$ is inverse CDF of Beta(k+1,L−k) distribution, G_k(·) is CDF of Gamma(k, 1), and $H (u | k, w) = G_{k} (\ln (\frac{{[B_{k + 1}^{−1} (u)]}^{k}}{w})) .$ P_RTP(k) is the combined RTP P-value. Notably, given the value of the product of k P-values, W = w, we can simultaneously evaluate P_RTP(k+1):

\begin{array}{l} \begin{array}{l} P_{RTP} (k + 1) = \\ Pr (W_{k + 1} \leq w) = 1 - \int_{0}^{1} G_{k} {\ln (\frac{{[B_{k + 1}^{−1} (u)]}^{k + 1}}{w})} d u . \end{array} & (3) \end{array}

Details and the derivation are given in Supplementary Material S-2.

The conditional independence of k−1 smallest P-values, given a value of the beta-distributed k-th smallest P-value, leads to a simple statistic which (just as RTP) is a function of the product of the k-th smallest P-values. This statistic and its distribution are not an approximation to W_k and the RTP distribution. However, similarly to RTP, the new statistic is designed to capture information contained in the first k smallest P-values. Motivation for our augmented RTP method (ART) follows from the intermediate results of the derivation of the RTP distribution. Note that the distribution of W_k conditional on (k + 1) involves a product of k uniforms and a beta random variable. Moreover, the product and the beta variable are independent. On the one hand, we could have proceeded by dividing every P₍_1:k₎ value by the observed p₍_k+1₎ and employed the gamma CDF to $\sum_{i = 1}^{k} - \ln (P_{(i)} / p_{(k + 1)})$ to obtain the combined P-value. However, this approach ignores information contained in P₍_k+1₎ magnitude. On the other hand, we can exploit the independence of the gamma-distributed sum and P₍_k+1₎ by transforming P₍_k+1₎ to a gamma random variable and adding the result to the sum. Specifically, to construct the new statistic, we propose the following transformation that involves the product W_k−1 and the variable P_(k). These transformations yield three independent variables, which are next added together and give a gamma-distributed random variable,

\begin{array}{l} \begin{array}{l} A_{k} = \\ - \ln {W_{k - 1}} + (k - 1) \ln {P_{(k)}} + G_{λ}^{−1} {1 - B_{k} (P_{(k)})}, \end{array} & (4) \end{array}

where $G_{k}^{−1} (\cdot)$ is inverse CDF of Gamma (k,1),

λ = (k – 1) × E {– ln (P_(k))} = (k – 1) (Γ′ (L + 1)/Γ (L + 1) – Γ′ (k)/Γ (k)),

Γ′ is the first derivative of a gamma function; and B_k(x) is the CDF of Beta(k, L−k+1) distribution evaluated at x. The shape parameter λ is chosen so that the two last terms in Eq. 4 (that are both transformations of P_(k)) have the same expectation. The shape parameter here plays a role of a weight given to the last term. This is similar to the Lancaster method for combining P-values, where the gamma shape parameter (equivalently, the degrees of freedom of a chi-square distribution) has been used to give differential weights to P-values (Lancaster, 1965).

Given the observed value A_k=a_k, the combined P-value is

\begin{array}{l} P_{ART} = \Pr (A_{k} \leq a_{k}) = 1 - G_{k + λ - 1} (a_{k}) . & (5) \end{array}

Under the null hypothesis, as illustrated by Figure 2, combined P-values based on the proposed method (ART) are very similar to P_RTP, and approach P_RTP as k increases. However, under the alternative, we find (as described in “Results” section) that ART has either the same or higher power than RTP. Furthermore, the combined P-value, ART, can be easily computed in R using its standard functions. A short example and an R code are given in Supplementary Material S-3.

FIGURE 2

Figure 2 Combined P-values based on A_k versus RTP statistic. Multiple combined P-values were computed using the 2 proposed statistics based on either top 10 or top 15 P-values out of L = 20 tests.

Adaptive ART Method, ART-A

As we discussed earlier in Introduction, the number of k P-values to be combined by the RTP method (or ART) is fixed and needs to be pre-specified. The choice of k is somewhat arbitrary; so, a researcher may wish to evaluate ART at several values of k, consequently choosing k that corresponds to the smallest combined P-value. However, this additional step creates another layer of multiple comparisons, which needs to be accounted for. Yu et al. (2009) proposed an empirical procedure to evaluate adaptive RTP (aRTP) method based on the minimum P-value computed over various candidate truncation points. To avoid a cumbersome two-level permutation procedure, they built on the method suggested by Ge et al. (2003) to reduce computational time. While computationally efficient, the method requires to store a large B×L matrix, with every row containing L P-values generated under the null distribution over B simulated experiments. Zhang et al. (2013) derived analytic but mathematically complex aRTP distribution, which needs to be evaluated using high-dimensional integration. Here, we propose a new and easily implemented version of the theoretical distribution for ART, ART-A. The method exploits the fact that ordered P-values can be represented as functions of the same number of independent uniform random variables (Supplementary Material S-4). The two main ideas behind ART-A are first to approximate the Gamma distribution with a large shape parameter by the normal distribution, and second to use the fact that the joint distribution of the partial normal sums follows a multivariate normal distribution.

Correlated P-Values

We further extend the proposed methods to combine correlated P-values via decorrelation by orthogonal transformation approach, DOT. Let L correlated P-values, (p₁, p₂, …, p_L) originate from statistics that jointly follow a standard multivariate normal distribution, y ∼ MVN (μ = 0, Σ), under H₀. For two-sided P-values, the elements of y are squared. Elements of the vector of squared variables, $y_{j}^{2}$ , follow the one degree of freedom chi-square distribution with $C o r (y_{i}^{2}, y_{j}^{2}) = Σ_{i j}^{2}$ . Dependent variables can be transformed into independent variables by using eigendecomposition of Σ, such that Σ = QΛQ⁻¹, where Q is a square matrix, with ith column containing eigenvector q_i of Σ, and Λ is the diagonal matrix of eigenvalues λ₁, λ₂, …, λ_L. Next, define an orthogonal matrix H = QΛ^−1/2Q^T and y_e = H^T y. P-values are decorrelated as 1-Φ⁻¹(y_e). Then, the first k smallest decorrelated P-values can be used to calculate various combined statistics. The choice of this particular transformation is motivated by its “invariance to order” property. Briefly, in the equicorrelation case, including the special case of ρ = 0, i.e., independence, a permutation of y should yield the same (possibly permuted) values in the decorrelated vector, y_e. Extensive evaluation of the decorrelation approach is presented by us elsewhere (Vsevolozhskaya et al., 2019).

Results

Simulation Study Results

We used simulation experiments to evaluate the type I error rate and power of the proposed methods relative to the previously studied RTP (defined for a fixed k) and to the adaptive RTP [where k is varied and the distribution is evaluated by single-layer simulations as in Yu et al. (2009)]. Briefly, a single-layer simulation is a numerical optimization of the naive simulation setup, which is quite slow because each round of data generation has to be followed by a second set of simulations to determine the optimal truncation point. Yu et al. proposed a clever shortcut based on a rectangular matrix, in which each i-th row is a set of L P-values that corresponds to the i-th simulation. The shortcut avoids nested and slow simulations but has a downside of keeping quite large matrix in computer memory.

Performance of various methods was evaluated using k first-ordered P-values, with k = {10,100}, L = {100,200,500}, and B = 100,000 simulations for evaluation of both, type-I error and power. Details of the simulation design are given in Supplementary Material S-5.

When various combined P-value methods are being compared, it is meaningful to gauge their performance against methods designed for multiple testing adjustments. This is especially relevant with methods that employ truncation due to their emphasis on small P-values. Therefore, we included the Simes (1986) method in our power comparisons because it can be viewed as a combined P-value method. The Simes method tests the overall H₀ without a reference to individual P-values: the H₀ is rejected at α level if P₍_i₎ ≤ iα/L for at least one i. Equivalently, the overall (or the “combined”) Simes P-value can be obtained as min {kp₍_i₎/i}. The Simes test is a useful benchmark, because it is related to the combined P-value methods with truncation, as well as to multiple testing adjustment procedures. At the extreme, the RTP with k = 1 becomes equivalent to Šidák (1967) correction. Šidák correction is approximately the Bonferroni (1935) correction for small P-values and large L. The Simes P-value is at least as small as Bonferroni-corrected P-value. In addition, there is a connection of the Simes test to the Benjamini and Hochberg (1995) false discovery rate (FDR), i.e., the Simes test is algebraically the same procedure as the Benjamini and Hochberg FDR, although the interpretation is different: FDR method determines the largest i, such that P₍_i₎ ≤ iα/L, and rejects H₀ for all P₍_j₎, j ≤ i, to control the expectation of FDR.

Tables 1–3 present type I error rates for combinations of independent and decorrelated P-values, respectively. In the tables, rows labeled “ART-A” refer to our newly proposed adaptive ART method, while “aRTP” rows label the results of the conventional approach (Yu et al., 2009). For the adaptive methods, the sequence of truncation points varied from 1 to k or from 1 to L, if k = L. Both tables confirm that all methods maintain the correct Type I error rate at various α-levels.

TABLE 1

Table 1 Type I error at α = 0.05, 0.01, and 0.005, assuming that P-values are independent.

TABLE 2

Table 2 Type I error at α = 0.05, 0.01, and 0.005 for randomly correlated P-values.

TABLE 3

Table 3 Additional type I error results at α = 0.05, 0.01, and 0.005 for correlated P-values and different k and L combinations.

Tables 4–6 summarize a set of power simulations for independent P-values. Results presented in Table 4 were obtained under the assumption that all L statistics had the same underlying effect size (μ = 0.5). From this table, it is evident that our ART has the highest power, closely followed by RTP. In general, the ART P-values tend to be similar to the P-values obtained by the RTP, and we show their similarity graphically in Figure 2. The Simes method has the lowest power, which is expected due to homogeneity in effect sizes across L tests and absence of true nulls. For the results in Table 5, the effect size was allowed to randomly vary throughout the range from 0.05 to 0.45. In both of these tables, the ART method has the highest power, while the Simes method has the lowest power. The power of both adaptive methods is very similar to one another but lower than that of methods based on a fixed k (RTP and ART). Nonetheless, in practice, a good choice for k may not be immediately clear, so a small sacrifice in power may be preferable to an arbitrary and possibly poor choice of k. However, when L is large, it can be impractical or unreasonable to vary candidate truncation points all the way up to L. Therefore, the value of k in these tables is the upper bound for sets of candidate truncation points, and that explains the difference in powers. For example, for the combination k = 10, L = 200, the power of ART-A in Table 5 is 0.26, while for k = 100, L = 200, the power is 0.37. Adaptive methods are considerably slower than methods where the value of k is fixed. A promising solution, which is currently under development, is removal of candidate truncation points for which the corresponding partial products are highly correlated.

TABLE 4

Table 4 Power under the alternative hypothesis, assuming independence and the same effect size μ = 0.5 for all L tests.

TABLE 5

Table 5 Power under the alternative hypothesis, assuming independence and random effect size (μ between 0.05 and 0.45).

TABLE 6

Table 6 Power under independence, assuming constant effect size (μ = 1.4) for a fraction of L = 1,000 hypotheses and μ = 0 for the rest of the tests. Proportion of true effects is the proportion of SNPs with μ≠0. In other words, it is the proportion of alternative hypotheses among all hypotheses.

We note that power values in Tables 4–5 include k = L, but for illustrative purposes, we also give power values for some choices where k < L. We emphasize that, in our tables, these chosen maximum k values do not have any specific meaning. In general, the maximum value of k for the adaptive methods can be varied up to L, unless L is either very large, or if there is a priori assumption that the real signals are rare. As discussed in Zaykin et al. (2007a), the optimal value of k is usually lower than the actual number of real signals; however, the upper bound for k can be set to reflect a priori knowledge about potential maximum number of real signals.

Finally, Table 6 summarizes results for simulations when some of the L hypotheses were true nulls (μ = 0), while the remaining hypotheses were true signals (μ = 0.5). The results follow the same pattern as in the previous tables, with ART having the highest power.

Table 7 summarizes a set of power simulations for correlated P-values. The effect sizes were randomly varied between −0.45 and 1.3 in each simulation. The correlation matrices were generated as described in Supplementary Material S-5. This set of simulations assumes that the P-values were obtained from the same data set as the sample estimate of the correlation matrix. Under heterogeneous effect sizes (Table 7), the empirical versions of the tests (“RTP,” “ART-A”) show nearly identical (and low) power for various combinations of k and L values. However, decorrelation-based methods become quite powerful, and their power is increasing with k and L. The steady power increase is due to the decorrelation effect on the combined noncentrality that involves the sum $\sum_{i \neq j}^{L} {(μ_{i} - μ_{j})}^{2}$ , which increases with the increased heterogeneity of μ. More details on the performance of the decorrelation approach are given by us elsewhere (Vsevolozhskaya et al., 2019), but here we briefly note that this finding is practically relevant because substantial heterogeneity of associations is expected among genetic variants, leading to a substantial power boost, as we next illustrate via re-analysis of published associations of genetic variants within the μ-opioid gene with pain sensitivity.

TABLE 7

Table 7 Power for correlated P-values when the effect size is randomly varied between −0.45 and 1.3.

Real Data Analysis

In several popular variations of the gene-based approach (Neale and Sham, 2004), future association statistics or P-values are combined across variants within a gene (Yu et al., 2009; Liu et al., 2010; Peng et al., 2010; Li et al., 2011). Gene-based approaches have some advantages over methods based on individual SNPs or haplotypes. In particular, gene-based P-values may facilitate subsequent meta-analysis of separate studies and can be less susceptible to erroneous findings (Neale and Sham, 2004). To obtain a gene-based P-value, one would need to account for LD among variants. The matrix of LD correlation coefficients can be obtained conveniently without access to individual genotypes if frequencies of haplotypes for SNPs within the genetic region of interest are available. The LD for alleles i and j is defined by the difference between the di-locus haplotype frequency, P_ij, and the product of the frequencies of two alleles, D_ij ₌ P_ij- p_ip_j. The LD correlation for SNPs i and j is $r_{i j} = \frac{D_{i j}}{p_{i} (1 - p_{i}) p_{j} (1 - p_{j})}$ . Shabalina et al. (2008) and Kuo et al. (2014) reported SNP-based P-values (Table 8), as well as results of several haplotype-based tests for genetic association of variants within the μ-opioid receptor (MOR) with pain sensitivity. Kuo et al. (2014) also reported estimated frequencies for 11-SNP haplotypes within the MOR gene, from which the 11×11 LD correlation matrix can be computed. The P_ij frequencies are given by the sum of frequencies of those 11-SNP haplotypes that contain both of the minor alleles for SNPs i and j. Similarly, p_i allele frequency is the sum of haplotype frequencies that carry the minor allele of the SNP i. The LD correlations within the MOR region spanned by the 11 SNPs ranged from −0.82 to 0.99, with the average absolute value ≈0.55 and the median absolute value ≈0.66. Half of pairwise LD correlations were smaller than −0.23 or larger than 0.82. Our analysis (Table 9) showcases utility of the proposed methods. The columns show combined P-values, for k varying from 2 up to all 11 SNPs (k = 1 is equivalent to the Bonferroni correction, i.e., 0.007×11). Similar to what we found via simulation experiments, where correlation is controlled by reshuffling the phenotype values while keeping the original LD structure intact, RTP and aRTP (without the decorrelation step) do not benefit from inclusion of additional SNPs. P-values in the ART column are very similar to those in the RTP column, which reassures our theoretical expectations. In contrast to previously proposed methods that control correlation by resampling (i.e., RTP, aRTP, and ART), the results in columns marked by “decorr” are substantially lower. In these columns, we used the proposed transformation to independence, which gives much stronger combined P-values. In all “decorr” columns, k = 7 results in the smallest combined P-value, implying that the number of real effects (including proxy associations) is at least seven.

TABLE 8

Table 8 Individual SNP P-values as originally reported in Shabalina et al. (2008).

TABLE 9

Table 9 Combined P-values by different methods for μ-opioid data.

Discussion

Complex diseases are influenced by collective effects of environmental exposures and genetic determinants. There can be numerous weak but biologically meaningful risk factors. The challenge is to distinguish between real and spurious statistical signals in the presence of multiple comparisons and low detection power. When the number of potential real associations is expected to be small, compared to the total number of variants evaluated within a study, it is advantageous to focus on the top-ranking associations. The rank truncated product method (RTP) has been designed with this objective in mind. The RTP and related approaches had been shown to be valuable tools in analysis of genetic associations with disease. In this article, we derive a mathematically simple form of the RTP distribution that leads a to new method, ART and its adaptive version, ART-A, which searches through a number of candidate values of truncation points and finds an optimal number in terms of combined P-value. Two important questions are the meaning of “k” and its optimal value. Unfortunately, k cannot be interpreted as an estimate of the number of real signals, for the reason that the k value that yields the minimum P-value depends on both the numbers of real signals, as well as on their strength of association, both of which are unknown. This issue has been investigated in detail in Zaykin et al. (2007a). The ART is designed with the same objectives in mind as RTP and TPM: to facilitate detection of possibly weak signals among top-ranking predictors that could have been missed, unless combined into a single score. The ART is trivial to implement in terms of standard functions, provided by packages such as R, and its power characteristics are close to RTP or higher in all studied settings. Analytical forms of ART and ART-A are derived under independence. To accommodate LD, we propose a decorrelation step, by transformation of P-values to independence. Our decorrelation by orthogonal transformation approach (DOT) is analogous to the Mahalanobis transformation (Härdle and Simar, 2007). We found DOT to be surprisingly powerful in many settings, compared to the usual method of evaluating the distribution of product of correlated P-values under the null hypothesis. Theoretical properties and extensive numerical evaluation of DOT will be published elsewhere and currently these findings are available as a preprint (Vsevolozhskaya et al., 2019). Further, we illustrate an application of our methods with analyses of variants within the μ-opioid gene that have been shown to affect sensitivity to pain. We find strengthened evidence of overall association within the 11-SNP block. In this application, the LD correlation matrix was reconstructed from the haplotype frequencies, which might be slightly different from the correlation of (0,1,2) values between pairs of SNPs (Zaykin, 2004). Further studies are needed to investigate whether approaches such as this, or utilization of reference panel (external) data as a source of LD information, may lead to substantial bias.

Web Resources

The URL for software referenced in this article is available at: https://github.com/dmitri-zaykin/Total_Decor.

Data Availability Statement

The datasets generated for this study are available on request to the corresponding author.

Author Contributions

DZ and OV conceived the study, interpreted the results, and wrote the manuscript. FH performed extensive simulations. All authors read and approved the final manuscript.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgement

This research was supported in part by the Intramural Research Program of the NIH, National Institute of Environmental Health Sciences.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2019.01051/full#supplementary-material

References

Benjamini, Y., Hochberg, Y. (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. Royal Stat. Soc. B. 57, 289–300. doi: 10.1111/j.2517-6161.1995.tb02031.x

CrossRef Full Text | Google Scholar

Bonferroni, C. E. (1935). Il calcolo delle assicurazioni su gruppi di teste. Studi in Onore del Professore Salvatore Ortu Carboni. 13–60.

Google Scholar

Dudbridge, F., Koeleman, B. P. (2003). Rank truncated product of p-values, with application to genomewide association scans. Genet. Epidemiol. 25, 360–366. doi: 10.1002/gepi.10264

PubMed Abstract | CrossRef Full Text | Google Scholar

Fisher, S. (1932). Statistical methods for research workers. Oliver & Boyd (Edinburgh): Genesis Publishing Pvt Ltd.

Google Scholar

Ge, Y., Dudoit, S., Speed, T. P. (2003). Resampling-based multiple testing for microarray data analysis. Test 12 (1), 1–77. doi: 10.1007/BF02595811

CrossRef Full Text | Google Scholar

Härdle, W., Simar, L. (2007). Applied multivariate statistical analysis Vol. 22007. Berlin: Springer.

Google Scholar

Hoh, J., Wille, A., Ott, J. (2001). Trimming, weighting, and grouping snps in human case-control association studies. Genome Res. 11, 2115–2119. doi: 10.1101/gr.204001

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiang, B., Zhang, X., Zuo, Y., Kang, G. (2011). A powerful truncated tail strength method for testing multiple null hypotheses in one dataset. J. Theor. Biol. 277, 67–73. doi: 10.1016/j.jtbi.2011.01.029

PubMed Abstract | CrossRef Full Text | Google Scholar

Kuo, C.-L., Diatchenko, L., Zaykin, D. (2014). Discovering multilocus associations with complex pain phenotypes. Pain Genet.: Basic Transl. Sci., 99–114. doi: 10.1002/9781118398890.ch7

CrossRef Full Text | Google Scholar

Lancaster, H. (1965). The Helmert matrices. Am. Math. Mon. 72, 4–12. doi: 10.1080/00029890.1965.11970483

CrossRef Full Text | Google Scholar

Li, M.-X., Gui, H.-S., Kwan, J. S., Sham, P. C. (2011). GATES: A rapid and powerful gene-based association test using extended Simes procedure. Am. J. Hum. Genet. 88, 283–293. doi: 10.1016/j.ajhg.2011.01.019

PubMed Abstract | CrossRef Full Text | Google Scholar

Lin, D., Zeng, D. (2010). Meta-analysis of genome-wide association studies: no efficiency gain in using individual participant data. Genet. Epidemiol. 34, 60–66. doi: 10.1002/gepi.20435

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, J. Z., Mcrae, A. F., Nyholt, D. R., Medland, S. E., Wray, N. R., Brown, K. M., et al. (2010). A versatile gene-based test for genome-wide association studies. Am. J. Hum. Genet. 87, 139–145. doi: 10.1016/j.ajhg.2010.06.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Loughin, T. M. (2004). A systematic comparison of methods for combining p-values from independent tests. Comput. Stat. Data Anal. 47, 467–485. doi: 10.1016/j.csda.2003.11.020

CrossRef Full Text | Google Scholar

Nagaraja, H. N. (2006). Order statistics from independent exponential random variables and the sum of the top order statistics. Advances in Distribution Theory, Order Statistics, and Inference. Birkhausre Boston: Springer, 173–185. doi: 10.1007/0-8176-4487-3_11

CrossRef Full Text | Google Scholar

Neale, B. M., Sham, P. C. (2004). The future of association studies: gene-based analysis and replication. Am. J. Hum. Genet. 75, 353–362. doi: 10.1086/423901

PubMed Abstract | CrossRef Full Text | Google Scholar

Peng, G., Luo, L., Siu, H., Zhu, Y., Hu, P., Hong, S., et al. (2010). Gene and pathway-based second-wave analysis of genome-wide association studies. Eur. J. Human Genet. 18, 111. doi: 10.1038/ejhg.2009.115

CrossRef Full Text | Google Scholar

Shabalina, S. A., Zaykin, D. V., Gris, P., Ogurtsov, A. Y., Gauthier, J., Shibata, K., et al. (2008). Expansion of the human μ-opioid receptor gene architecture: novel functional variants. Human Mol. Genet. 18, 1037–1051. doi: 10.1093/hmg/ddn439

CrossRef Full Text | Google Scholar

Sheng, X., Yang, J. (2013). An adaptive truncated product method for combining dependent p-values. Econ. Lett. 119, 180–182. doi: 10.1016/j.econlet.2013.02.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Šidák, Z. (1967). Rectangular confidence regions for the means of multivariate normal distributions. J. Am. Statist. Assoc. 78, 626–633. doi: 10.1080/01621459.1967.10482935

CrossRef Full Text | Google Scholar

Simes, R. (1986). An improved Bonferroni procedure for multiple tests of significance. Biometrika 73, 751–754. doi: 10.1093/biomet/73.3.751

CrossRef Full Text | Google Scholar

Stouffer, S., DeVinney, L., Suchmen, E. (1949). “The American soldier,” in Adjustment during army life, vol. 1. (Princeton, US: Princeton University Press).

Google Scholar

Taylor, J., Tibshirani, R. (2005). A tail strength measure for assessing the overall univariate significance in a dataset. Biostatistics 7, 167–181. doi: 10.1093/biostatistics/kxj009

PubMed Abstract | CrossRef Full Text | Google Scholar

Vsevolozhskaya, O. A., Shi, M., Hu, F., Zaykin, D. V. (2019). DOT: gene-set analysis by combining decorrelated association statistics. doi: 10.1101/665133

CrossRef Full Text | Google Scholar

Whitlock, M. C. (2005). Combining probability from independent tests: the weighted Z-method is superior to Fisher's approach. J. Evol. Biol. 18, 1368–1373. doi: 10.1111/j.1420-9101.2005.00917.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Won, S., Morris, N., Lu, Q., Elston, R. (2009). Choosing an optimal method to combine P-values. Stat. Med. 28, 1537–1553. doi: 10.1002/sim.3569

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, K., Li, Q. Z., Bergen, A. W., Pfeiffer, R. M., Rosenberg, P. S., Caporaso, N., et al. (2009). Pathway analysis by adaptive combination of p-values. Genet. Epidemiol. 33, 700–709. doi: 10.1002/gepi.20422

PubMed Abstract | CrossRef Full Text | Google Scholar

Zaykin, D. V. (2000). Statistical analysis of genetic associations. North Carolina State University, Ph.D. thesis.

Google Scholar

Zaykin, D. V. (2004). Bounds and normalization of the composite linkage disequilibrium coefficient. Genet. Epidemiol. 27, 252–257. doi: 10.1002/gepi.20015

PubMed Abstract | CrossRef Full Text | Google Scholar

Zaykin, D. V. (2011). Optimally weighted Z-test is a powerful method for combining probabilities in meta-analysis. J. Evol. Biol. 24, 1836–1841. doi: 10.1111/j.1420-9101.2011.02297.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Zaykin, D. V., Zhivotovsky, L. A., Czika, W., Shao, S., Wolfinger, R. D. (2007a). Combining p-values in large-scale genomics experiments. Pharm. Stat. 6, 217–226. doi: 10.1002/pst.304

PubMed Abstract | CrossRef Full Text | Google Scholar

Zaykin, D. V., Zhivotovsky, L. A., Czika, W., Shao, S., Wolfinger, R. D. (2007b). Combining P-values in large-scale genomics experiments. Pharm. Stat. 6, 217–226. doi: 10.1002/pst.304

PubMed Abstract | CrossRef Full Text | Google Scholar

Zaykin, D. V., Zhivotovsky, L. A., Westfall, P. H., Weir, B. S. (2002). Truncated product method for combining p-values. Genet. Epidemiol. 22, 170–185. doi: 10.1002/gepi.0042

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, S., Chen, H.-S., Pfeiffer, R. M. (2013). A combined p-value test for multiple hypothesis testing. J. Stat. Plann. Inference 143, 764–770. doi: 10.1016/j.jspi.2012.10.004

CrossRef Full Text | Google Scholar

Keywords: combining evidence, rank truncated product, a rank truncated product RTP, augmented rank truncation, adaptive augmented rank truncation

Citation: Vsevolozhskaya OA, Hu F and Zaykin DV (2019) Detecting Weak Signals by Combining Small P-Values in Genetic Association Studies. Front. Genet. 10:1051. doi: 10.3389/fgene.2019.01051

Received: 01 July 2019; Accepted: 30 September 2019;
Published: 20 November 2019.

Edited by:

Guolian Kang, St. Jude Children's Research Hospital, United States

Reviewed by:

Frank Dudbridge, University of Leicester, United Kingdom
Qizhai Li, Academy of Mathematics and Systems Science (CAS), China

Copyright © 2019 Vsevolozhskaya, Hu and Zaykin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Dmitri V. Zaykin, ZG1pdHJpLnpheWtpbkBuaWguZ292

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.