ACDC: a general approach for detecting phenotype or exposure associated co-expression

Queen, Katelyn; Nguyen, My-Nhi; Gilliland, Frank D.; Chun, Sung; Raby, Benjamin A.; Millstein, Joshua

doi:10.3389/fmed.2023.1118824

ORIGINAL RESEARCH article

Front. Med., 19 May 2023

Sec. Pulmonary Medicine

Volume 10 - 2023 | https://doi.org/10.3389/fmed.2023.1118824

This article is part of the Research TopicCurrent Omics-Based Approaches as Tools for Improving the Understanding, Diagnosis and Management of Inflammatory Lung DiseaseView all 6 articles

ACDC: a general approach for detecting phenotype or exposure associated co-expression

Katelyn Queen¹

My-Nhi Nguyen¹

Frank D. Gilliland¹

Sung Chun²

Benjamin A. Raby^2,3,4

Joshua Millstein¹^*

¹Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, United States
²Division of Pulmonary Medicine, Boston Children's Hospital and Harvard Medical School, Boston, MA, United States
³Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, United States
⁴Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, United States

Background: Existing module-based differential co-expression methods identify differences in gene-gene relationships across phenotype or exposure structures by testing for consistent changes in transcription abundance. Current methods only allow for assessment of co-expression variation across a singular, binary or categorical exposure or phenotype, limiting the information that can be obtained from these analyses.

Methods: Here, we propose a novel approach for detection of differential co-expression that simultaneously accommodates multiple phenotypes or exposures with binary, ordinal, or continuous data types.

Results: We report an application to two cohorts of asthmatic patients with varying levels of asthma control to identify associations between gene co-expression and asthma control test scores. Results suggest that both expression levels and covariances of ADORA3, ALOX15, and IDO1 are associated with asthma control.

Conclusion: ACDC is a flexible extension to existing methodology that can detect differential co-expression across varying external variables.

1. Introduction

Differential expression analysis has long been used to test for differences in transcriptional dependencies across conditions, and may explain phenotypic variation in a population. However, differential expression methods study each gene independent of any other and therefore may not capture transcriptional differences due to changes in gene-gene relationships. Differential co-expression methods test for differences in gene covariances, and thus, such approaches may illuminate regulatory mechanisms not identified by differential expression analysis alone (1).

Module-based differential co-expression methods incorporate information about gene connectivity, and assume that the genes within a module are correlated in the general population. These approaches can have good statistical power due to a reduction in “noise” (2), or unrelated variation of individual genes by collapsing related genes into a single feature. Generally, these module-based methods can be distinguished from one another by, (i) whether modules are defined by the user or the method, (ii) if differential co-expression is detected within or between modules, and (iii) how many conditions are assessed. Methods may also detect differential co-expression for gene pairs across the phenotype of interest and then apply post-hoc clustering methods to identify co-expressed modules. One highly-cited method, CoXpress, determines differentially co-expressed modules given microarray data (3). By cutting the trees determined by average-linkage hierarchical clustering at a user-defined threshold, genes are split into modules. Then, pairwise correlation coefficients are used to created a distribution of co-expression for each module under two conditions. If these distributions are statistically significantly different from random in one condition and not the other, the module is considered differentially co-expressed.

While many methods exists for binary conditions and a few for greater than two, we are unaware of any module-based differential co-expression approaches designed to detect differences across continuous conditions or multiple types of conditions simultaneously. Here we describe a novel method, association of covariance for detecting differential co-expression (ACDC), to detect differential co-expression across multiple binary, ordinal, or continuous phenotypes or exposures. We report an application to gene expression measured in two independent cohorts of asthmatics to determine whether genes in inflammatory pathways are co-expressed across levels of asthma control.

2. Materials and methods

2.1. ACDC description

ACDC is designed to detect dependencies between gene-gene co-expression (or connectivity) and a set of external features that can be either exposures or responses. That is, ACDC is applied to test for evidence of association between measures of co-expression and measures of external features. Notably, the external features are not constrained to be categorical, the typical requirement (2), but could be continuous or ordinal.

The concept of covariance can be used to quantify the dependence between two random variables and thus to quantify gene-gene co-expression. It is possible for the covariance of a pair of genes to depend on external features. For example, suppose in a biological pathway, two genes tend to be co-regulated and thus co-expressed, resulting in positive covariance. A perturbation to the pathway could alter that relationship, resulting in a change in co-expression and thus a change in covariance. If candidate perturbagens and the expression of genes in the pathway are measured, ACDC may be applied to detect these types of effects simultaneously for the multiple genes and perturbagens. Using a similar rationale, ACDC could be applied to detect downstream results of pathway perturbations if the affected phenotypes are measured.

Suppose all individuals have measurements for all M gene expression features in the set, referred to here as a “module”, and all P external features. Assume the vector of P external features are distributed as multivariate normal,

\begin{array}{l} x = {(x_{1}, x_{2}, . . ., x_{P})}^{T} ~ N (μ_{x}, Σ_{x}) & (1) \end{array}

with x_p representing each external feature. Though we describe x as multivariate normal here, we can relax this assumption in practice and allow other distributions and variable types, as in a design matrix.

Suppose the M gene expression features are also distributed as multivariate normal with the covariance matrix depending on x,

\begin{array}{l} g = {(g_{1}, g_{2}, . . ., g_{M})}^{T} ~ N (μ_{g}, Σ_{g} | x) & (2) \end{array}

where g_j denotes the expression of gene j. The covariance matrix can be represented by,

The off-diagonal elements of Σ_g can be considered measures of co-expression, and (for given values of x) estimated in the conventional way,

\begin{array}{l} \begin{matrix} {\hat{σ}}_{j, k} = \frac{1}{N - 1} Σ_{s} (g_{s, j} - \bar{g_{j}}) (g_{s, k} - \bar{g_{k}}) . \end{matrix} & (4) \end{array}

Note that this is essentially an average over individuals. Letting s denote an individual, each contribution is,

\begin{array}{l} {\hat{σ}}_{j, k} = (g_{s, j} - \bar{g_{j}}) (g_{s, k} - \bar{g_{k}}) & (5) \end{array}

These individual components have approximately the same expectation as the scaled sum, therefore they can also be described as estimators for σ_{j, k}. We leverage this property to test for dependencies between the covariances and the external features.

We can denote the co-expression profile for a given module as,

\begin{array}{l} C = (σ_{1, 2}, ..., σ_{j, k}, ... σ_{(M - 1), M}); | C | = (\begin{matrix} M \\ 2 \end{matrix}) = G . & (6) \end{array}

We are interested in dependencies that may exist between the external features, x, and the gene-pair covariances, the off diagonals of Σ_g. If we have a single external feature or a single pair of genes, conventional general linear modeling (GLM) approaches could be used to relate x to C. For multiple gene pairs and external features, CCA can be applied, or sparse CCA for high dimensional settings. CCA finds min [G, P] linear combinations, a ∈ ℝ^P, b ∈ ℝ^G, of C and x, respectively, that maximize the correlation,

\begin{array}{l} (a_{1}^{'}, b_{1}^{'}) = argmax corr (a_{1}^{T} x, b_{1}^{T} C); ρ_{1} = corr (a_{1}^{T} x, b_{1}^{T} C), & (7) \end{array}

for example, for the first pair of canonical variables. Note that CCA can be applied even if G and/or P is equal to one (4). Wilks–Lambda can be used to conduct a joint hypothesis test of whether the correlation coefficients found by CCA are significantly different from zero,

\begin{array}{l} H_{0} : ρ_{i} = 0, for all 1 \leq i \leq min [G, P] \\ H_{A} : ρ_{i} \neq 0, for some 1 \leq i \leq min [G, P] . & (8) \end{array}

A rejected test implies dependent co-expression, i.e., that there are linear combinations of gene-gene covariances associated with linear combinations of external features.

False discovery rates (FDR) can be computed using the Benjamini–Hochberg (BH) (5) method when multiple modules are tested and parametric assumptions apply. If severe departures from the assumed distributions may be present, permutation-based approaches such as the Millstein and Volfson (MV) FDR (6) method can be used.

and is also available from the CRAN repository, https://cran.r-project.org/web/packages/modACDC/index.html

2.2. Datasets

2.2.1. Asthma BRIDGE

The Asthma Biorepository for Integrative Genomic Exploration (ABRIDGE) aimed to bring together data from over 2,700 participants in ongoing (at the time) asthma studies (7). Patients were recruited from six cohorts of the EVE Consortium, a group of 11 academic sites who did genome-wide association studies of asthma (8), and extensive phenotype and genomics data are publicly available.

The discovery dataset includes gene expression in whole blood from 245 patients with doctor-diagnosed asthma from ABRIDGE (Table 1), profiled using the Illumina HumanHT-12 v4 Expression array. Six-month asthma control test (ACT) scores were calculated from questionnaire responses about wheezing with and without exercise, patient waking due to wheezing, and the need for Albuterol in the last 6 months (range: [4,20]), where higher scores indicate suboptimal control (Figure 1A).

TABLE 1

Table 1. Patient demographics for ABRIDGE and CAMP cohorts.

FIGURE 1

Figure 1. (A) The distribution of 6-month ACT scores in ABRIDGE Whole Blood gene expression, with scores being calculated with information about wheezing with and without exercise, patient waking due to wheezing, and the need for rescue medications in the last 6 months. (B) The distribution of 7-day ACT scores in CAMP Whole Blood gene expression, with scores being calculated with information about the need for rescue and preventative medications, activity limits, and patient waking due to wheezing in the past 7 days.

The gene expression profile data were normalized via a log2-transformation and quantile-normalization. Duplicate probes were condensed using the largest median absolute deviation, leaving only probes with unique targets. The analysis includes 623 probes with targets annotated for inflammatory response in Gene Ontology.

2.2.2. CAMP

The Childhood Asthma Management Program (CAMP) was a randomized, placebo-controlled clinical trial started in the early 1990s for children with mild to moderate asthma. One thousand and forty-one children were enrolled between 1993 and 1995 at eight clinical centers, and extensive baseline data was collected and is publicly available (GEO accession number GSE22324) (9).

Results from the initial analysis were followed up in an independent dataset that included whole blood gene expression from 604 asthmatics, primarily young adults who were enrolled in CAMP as children (Table 1), profiled using the HumanRef8 v2 BeadChip array. Seven-day ACT scores were calculated using baseline questionnaire responses about rescue and preventative bronchodilator use, activity limits, and frequency of waking due to wheezing in the past 7 days (range: [0,28]), where higher scores indicate suboptimal control (Figure 1B). The same data processing normalization steps were taken as in Asthma BRIDGE.

2.3. ABRIDGE and CAMP data analysis

To identify modules of correlated genes, we applied the Partition data reduction method (10, 11), an agglomerative approach that requires the user to specify an acceptable proportion of information loss when collapsing all features to a single measure such as the mean. Selection of the information loss threshold was guided by the aim to maximize information explained in the ACT score while minimizing noise. Further explanation is provided in the Supplementary material (12). We used an information loss constraint of 0.35 which corresponds to a minimum of 65% information from the non-reduced data captured by each new feature, as assessed by the intraclass correlation coefficient (ICC). This reduction threshold resulted in roughly 50% reduction in features when compared to the full dataset (Supplementary Figure 1).

In analyses of individual genes within modules of interest, gene-ACT score relationships were modeled using ordinal logistic regression, adjusting for patient age, race, sex, data collection site, and the first three principal components (PCs), which may capture global dependencies due to cell-type composition and technical artifacts, of each gene expression data set. Associations were identified at the 0.05 FDR level.

To clarify the novel attributes of ACDC, a comparative analysis was conducted in the ABRIDGE cohort using CoXpress. The ACT score was dichotomized at the median value to indicate better vs. worse asthma control to conform to the coXpress requirement of a binary phenotype. The Pearson correlation coefficient was used as the similarity measure, and for module identification the dendrogram was cut at a height of 0.35 for consistency with the Partition approach.

3. Results

3.1. ACDC in ABRIDGE

ACDC was performed on 65 modules identified by Partition in the ABRIDGE dataset. The results for the top five modules based on BH FDR can be found in Table 2. Evidence suggestive of differential co-expression as determined by CCA Wilks–Lambda p ≤ 0.05 was found for two modules including genes NOD-like Receptor Family Pyrin Domain Containing 12 (NLRP12), Meteorin Like, Glial Cell Differentiation Regulator (METRNL), and Ghrelin And Obestatin Prepropeptide (GHRL) in module A (BH FDR = 0.0737), and Adenosine A3 Receptor (ADORA3), Arachidonate 15-Lipoxygenase (ALOX15), and Indoleamine 2,3-Dioxygenase 1 (IDO1) in module B (BH FDR = 0.1569). We also computed the non-parametric, permutation based FDR estimate Millstein–Volfson (MV) to account for departures from the normality assumption by the ACT variable, which is ordinal. However, the results of the MV FDR test are in approximate agreement with the BH FDR results, yielding two modules with evidence of differential co-expression [(Supplementary Figure 2), FDR = 0.0554, 95% CI: (0.0054, 0.5742)].

TABLE 2

Table 2. Results of CCA analysis between gene-gene covariances and ACT score components for ABRIDGE and CAMP cohorts.

To further explore the relationship between co-expression of genes in modules A and B and asthma control, Kruskal–Wallis tests were performed to determine whether covariance measures for all possible pairs of these genes differ across levels of the ACT score components. Eight of the total 24 tests resulted in p-values less than 0.05, with the top six coming from module B. The most significant test involved the co-expression of IDO1 and ADORA3 and the frequency of waking from wheezing in the past 6 months (p = 0.0021; Figure 2).

FIGURE 2

Figure 2. Violin plots for the most statistically significant gene-gene covariance measures (Equation 5) and 6-month ACT score components relationships for the ABRIDGE cohort, where each dot represents values for one patient. Kruskal–Wallis was used to test for global differences, and Wilcoxon signed-rank was used to test for pairwise differences. (A) IDO1 and ADORA3 covariance in 6-month frequency of waking from wheezing; (B) ALOX15 and ADORA3 covariance in 6-month Albuterol use; (C) ALOX15 and ADORA3 covariance in 6-month frequency of wheezing with exercising; (D) ALOX15 and ADORA3 covariance in 6-month frequency of waking from wheezing.

3.2. ACDC in CAMP

We performed ACDC using data from CAMP in an attempt to replicate results observed for the top five modules identified in ABRIDGE. We found evidence of differential co-expression for module B (p = 0.0315) but not module A (p = 0.6823; Table 2). Also, evidence of differential co-expression was observed for gene pairs in modules C and D, which were not significant in ABRIDGE, Interleukin 5 Receptor Subunit Alpha (IL5RA) and Peripheral Myelin Protein 22 (PMP22) in module C, and Interleukin 17 Receptor B (IL17RB) and Interleukin 6 (IL6) in module D. Note that these results have not been adjusted for multiple testing.

Kruskal–Wallis tests were also performed for the same gene-pair covariances tested in ABRIDGE. Of the 24 tests performed, there were three with p-values less than 0.05, all from module B. The most significant test compared the co-expression of IDO1 and ADORA3 across levels of rescue bronchodilator use in the past 7 days (p = 0.02) (Figure 3). Additionally, we performed Kruskal–Wallis tests for all gene-pair covariances and 7-day ACT components for the three modules with CCA Wilks–Lambda p-values below 0.05. Of the 20 tests performed, the same three pairs from module B showed evidence of differential co-expression, but no others had p-values less than 0.05.

FIGURE 3

Figure 3. Violin plots for the most statistically significant gene-gene covariance measures (Equation 5) and 7-day ACT score components relationships for the CAMP cohort, where each dot represents values for one patient. Kruskal–Wallis was used to test for global differences, and Wilcoxon signed-rank was used to test for pairwise differences. (A) IDO1 and ADORA3 covariance in 7-day frequency of rescue bronchodilator use; (B) ALOX15 and ADORA3 covariance in 7-day frequency of rescue bronchodilator use; (C) ALOX15 and IDO1 covariance in 7-day activity limit.

3.3. Differential expression in ABRIDGE

Following the differential co-expression analysis, we performed ordinal logistic regression for each of the 13 genes in the top five modules and found increased risk of suboptimal acute asthma control for all genes in modules B and C, after adjusting for covariates (Table 3). Higher expression of ADORA3, ALOX15, and IDO1 was associated with suboptimal 6-month ACT scores (Figure 4).

TABLE 3

Table 3. Results of ordinal logistic regression models of genes in top five modules from CCA on ACT scores for ABRIDGE and CAMP cohorts.

FIGURE 4

Figure 4. Violin plots for comparing unadjusted (A) ADORA3, (B) ALOX15, and (C) IDO1 expression across 6-month ACT score levels in the ABRIDGE cohort.

3.4. Differential expression in CAMP

Adjusted ordinal logistic regressions were performed for the same 13 genes as the ABRIDGE cohort (Section 3.3). In the CAMP cohort, the regressions also showed highly statistically significant associations for all genes in modules B and C, and non-significant associations for modules A and D (Table 3). Unlike the results from ABRIDGE, a significant protective effect was seen for NOD-like Receptor Family CARD Domain Containing 3 (NLRC3) [OR: 0.3926, 95% CI: (0.1864, 0.8268)]. Associations between these genes and 7-day ACT scores (Figure 5) also imply that increasing gene expression is associated with suboptimal acute asthma control.

FIGURE 5

Figure 5. Violin plots for comparing unadjusted (A) ADORA3, (B) ALOX15, and (C) IDO1 expression across 7-day ACT score levels in the CAMP cohort.

3.5. Methods comparison

The five most differentially co-expressed modules identified by the CoXpress analysis can be seen in Table 4. As a rule of thumb for identifying differentially co-expressed modules, the coXpress authors suggest pr_g₁ ≤ 0.05 and pr_g₂ ≥ 0.05, which implies correlations different than zero in one of the classes but not the other. None of the ABRIDGE modules met this threshold, and values of pr_g₁, pr_g₂ ≤ 0.05 for all of the five top modules indicate that the intra-module correlations are non-zero for patients with both better and worse asthma control. We note that genes ADORA3 and ALOX15 appear in module 1, the most differentially co-expressed module.

TABLE 4

Table 4. Results of coXpress analysis in ABRIDGE whole blood gene expression dataset.

4. Discussion

Here, we have described a novel approach to differential co-expression analysis that accommodates categorical, ordinal, or continuous exposures or outcomes. We suggest that co-expression features can be included in a linear modeling framework either as predictors or outcomes. To handle multivariate external features, we introduce ACDC, for either exploratory analyses or formal hypothesis testing. This strategy contrasts to most existing methods that test for differences in co-expression across a small number of classes. Another key difference is that identified modules can be small or large, which is not possible in many other methods. For example, DICER only accepts modules with at least fifteen genes (13). Although Partition was applied here to identify modules of correlated genes, other methods could be used, such as weighted gene co-expression network analysis (WGCNA) (14). Additionally, this framework can be applied to other types of molecular data, such as proteomics or metabolomics.

Application of the ACDC differential co-expression approach and ordinal logistic regression analyses identified three genes, ADORA3, ALOX15, and IDO1 whose covariances and expression levels were associated with 6-month and 7-day ACT scores in the ABRIDGE and CAMP cohorts, respectively.

Adenosine is a nucleoside which exhibits increased production during periods of lung inflammation. Mediation is controlled through adenosine receptors like ADORA3. Previously, studies have shown that while single nucleotide polymorphisms (SNPs) of ADORA3 loci are not associated with asthma (15, 16), ADORA3 expression is associated with immunoglobulin E levels in whole blood samples of asthmatic patients (17) and is differentially expressed when comparing patients with severe asthma to controls (18).

ALOX15 has both anti-inflammatory and inflammatory effects depending on its regulation and has been previously implicated in the development of inflammatory diseases, including asthma. A few studies have shown that ALOX15 can be found in airway mucosa of asthmatic patients (19, 20), and another study found evidence of differential expression of ALOX15 between controls and asthmatics (21). Additionally, one study found that haplotypic genetic variation at the locus for ALOX15 is associated with asthma (22).

The best understood function of IDO1 is it's role as an immunoregulator in cancer, inhibiting the body's ability to fight diseased cells, but its role in autoimmune responses is less clear. A mouse study showed that the entire indoleamine family promotes allergic airway inflammation (23), and a human study found evidence of differential expression of IDO1 between patients with severe eosinophilic asthma, a more severe subtype of asthma typically found in adults and categorized by high peripheral blood concentration of eosinophils, and healthy controls (24).

Though all three genes have been previously identified as differentially expressed in asthma, there are varying degrees of understanding as to the biological roles that they play. To our knowledge, there are no studies that identify any of these genes as differentially co-expressed in asthma. This additional information could help to fill knowledge gaps about how the genes regulate or co-regulate asthma control.

A limitation of this analysis is the difficulty differentiating cause and effect between gene expression, acute asthma control, and medication use. Does gene expression affect response to asthma exacerbations or is it determined primarily by asthma control medications? The directionality of the relationship is particularly muddled by the inclusion of medication use in the calculation of ACT scores, which is standard practice (25).

The number of covariance features grows much more quickly than the number of genes in a module (or other gene set). Thus, for large modules it may be useful to reduce the dimensionality of the co-expression features or apply a feature selection mechanism in a preliminary step. We are working to implement two dimension reduction approaches: first, sparse CCA using elastic net penalized regression and second, applying Partition to the co-expression matrix. Additionally, the ability to adjust for covariates in the CCA step would add to the utility of the approach.

In the comparison analysis using coXpress, an existing and highly-cited module-based differential co-expression method, genes ADORA3 and ALOX15 were identified among the most important, but no modules reached statistical significance. To achieve statistical significance, coXpress requires that correlations be undetectable in one condition and detectable in the other. Kruskal–Wallis tests of the co-expression matrices showed differences in co-expression across levels of ACT, indicating that while co-expression is present at all levels of ACT, it is nevertheless different across levels. This type of relationship cannot be captured by coXpress. Also, to use coXpress, the ACT score must be dichotomized, which results in information loss.

Further study is needed to understand the larger network that includes ADORA3, ALOX15, and IDO1. All three are part of the Nakajima Eosinophil pathway, a group of the top 30 eosinophil-specific genes (26). This pathway is not well-studied and while much has been published about the role of eosinophils in asthma, few studies have looked at the role this pathway plays in asthma exacerbations or symptomology. More study is needed to determine what drives the associations with covariances observed here. They could be related to differences in the expression of eosinophil genes between eosinophilic and non-eosinophilic asthmatics. Alternatively, within eosinophilic asthmatics, within non-eosinophilic asthmatics or for all subtypes, covariances may be associated with symptom control. That is, differences in expression of eosinophil genes within some of these groups may be associated with symptom control or the associations may be driven by differences between groups.

In summary, we propose a novel strategy for differential co-expression analysis that is a flexible extension to prior methodology. In applications to ABRIDGE and CAMP cohorts, we find evidence of both differential co-expression and differential expression across ACT scores for ADORA3, ALOX15, and IDO1, all genes which have been previously implicated in asthma. These genes may be involved in the underlying regulatory mechanisms behind acute asthma control, however, further study is needed.

Data availability statement

The CAMP dataset can be found in the Gene Expression Omnibus (GEO; https://www.ncbi.nlm.nih.gov/geo/) repository under accession number GSE22324. The ABRIDGE data is being submitted to GEO and will be available as soon as that process is complete.

Ethics statement

The studies involving human participants were reviewed and approved by Partners Human Research Committee. The patients/participants provided their written informed consent to participate in this study.

Author contributions

The differential co-expression approach was conceived by KQ and JM. The analysis was conducted by KQ. All authors contributed to interpretation of the results and writing of the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This research was supported by the National Institute of Environmental Health Sciences (T32ES013678 to KQ); the National Heart Lung Blood Institute (R01HL118455 to M-NN, FG, SC, BR, and JM); and the National Cancer Institute (P01CA196569 to JM).

Acknowledgments

The authors would like to thank the participants of the ABRIDGE and CAMP studies.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2023.1118824/full#supplementary-material

References

1. Chowdhury HA, Bhattacharyya DK, Kalita JK. (Differential) co-expression analysis of gene expression: a survey of best practices. IEEE/ACM Trans Comput Biol Bioinform. (2020) 17:1154–73. doi: 10.1109/tcbb.2019.2893170

PubMed Abstract | CrossRef Full Text

2. Bhuva DD, Cursons J, Smyth GK, Davis MJ. Differential co-expression-based detection of conditional relationships in transcriptional data: comparative analysis and application to breast cancer. Genome Biol. (2019) 20:236. doi: 10.1186/s13059-019-1851-8

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Watson M. CoXpress: differential co-expression in gene expression data. BMC Bioinformatics. (2006) 7:509. doi: 10.1186/1471-2105-7-509

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Widmann M. One-dimensional CCA and SVD, and their relationship to regression maps. J Clim. (2005) 18:2785–92. doi: 10.1175/JCLI3424.1

CrossRef Full Text | Google Scholar

5. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Methodol. (1995) 57:289–300. doi: 10.1111/j.2517-6161.1995.tb02031.x

CrossRef Full Text | Google Scholar

6. Millstein J, Volfson D. Computationally efficient permutation-based confidence interval estimation for tail-area FDR. Front Genet. (2013) 4:179. doi: 10.3389/fgene.2013.00179

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Croteau-Chonka DC, Qiu W, Martinez FD, Strunk RC, Lemanske RF, Liu AH, et al. Gene expression profiling in blood provides reproducible molecular insights into asthma control. Am J Respirat Crit Care Med. (2016) 195:179–88. doi: 10.1164/rccm.201601-0107OC

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Torgerson DG, Ampleford EJ, Chiu GY, Gauderman WJ, Gignoux CR, Graves PE, et al. Meta-analysis of genome-wide association studies of asthma in ethnically diverse North American populations. Nat Genet. (2011) 43:887–92. doi: 10.1038/ng.888

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Covar RA, Fuhlbrigge AL, Williams P, Kelly HW, Group tCAMPR. The Childhood Asthma Management Program (CAMP): contributions to the understanding of therapy and the natural history of childhood asthma. Curr Respirat Care Rep. (2012) 1:243–50. doi: 10.1007/s13665-012-0026-9

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Millstein J, Battaglin F, Barrett M, Cao S, Zhang W, Stintzing S, et al. Partition: a surjective mapping approach for dimensionality reduction. Bioinformatics. (2019) 36:676–81. doi: 10.1093/bioinformatics/btz661

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Barrett M, Millstein J. Partition: a fast and flexible framework for data reduction in R. J Open Source Softw. (2020) 5:1991. doi: 10.21105/joss.01991

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Zhang F, Chen W, Zhu Z, Zhang Q, Nabais MF, Qi T, et al. OSCA: a tool for omic-data-based complex trait analysis. Genome Biol. (2019) 20:107. doi: 10.1186/s13059-019-1718-z

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Amar D, Safer H, Shamir R. Dissection of regulatory networks that are altered in disease via differential co-expression. PLoS Comput Biol. (2013) 9:e1002955. doi: 10.1371/journal.pcbi.1002955

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. (2008) 9:559. doi: 10.1186/1471-2105-9-559

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Blumenthal MN, Miller MB, Reilly C, Oetting WS, Brott M, Willaert R, et al. Analysis of ADORA3 as a possible candidate gene for asthma. J Allergy Clin Immunol. (2005) 115:S219. doi: 10.1016/j.jaci.2004.12.883

CrossRef Full Text | Google Scholar

16. Kim SH, Kim YK, Park HW, Kim SH, Kim SH, Ye YM, et al. Adenosine deaminase and adenosine receptor polymorphisms in aspirin-intolerant asthma. Respirat Med. (2009) 103:356–63. doi: 10.1016/j.rmed.2008.10.008

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Saferali A, Yun JH, Lee S, Chase RP, Bowler RP, Castaldi PJ, et al. Transcriptomic signature of asthma-chronic obstructive pulmonary disease overlap in whole blood. Am J Respirat Cell Mol Biol. (2021) 64:268–71. doi: 10.1165/rcmb.2020-0382LE

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Sanchez-Ovando S, Simpson JL, Barker D, Baines KJ, Wark PAB. Transcriptomics of biopsies identifies novel genes and pathways linked to neutrophilic inflammation in severe asthma. Clin Exp Allergy. (2021) 51:1279–94. doi: 10.1111/cea.13986

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Bradding P, Redington AE, Djukanovic R, Conrad DJ, Holgate ST. 15-lipoxygenase immunoreactivity in normal and in asthmatic airways. Am J Respirat Crit Care Med. (1995) 151:1201–4. doi: 10.1164/ajrccm/151.4.1201

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Profita M, Sala A, Riccobono L, Paterno A, Mirabella A, Bonanno A, et al. 15-Lipoxygenase expression and 15(S)-hydroxyeicoisatetraenoic acid release and reincorporation in induced sputum of asthmatic subjects. J Allergy Clin Immunol. (2000) 105:711–6. doi: 10.1067/mai.2000.105122

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Laprise C, Sladek R, Ponton A, Bernier MC, Hudson TJ, Laviolette M. Functional classes of bronchial mucosa genes that are differentially expressed in asthma. BMC Genomics. (2004) 5:21. doi: 10.1186/1471-2164-5-21

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Permaul P, Raby B, Levy B, Israel E. “ALOX-15 haplotypes and association with asthma,” in American Thoracic Society 2009 International Conference (San Diego, CA).

Google Scholar

23. Xu H, Oriss TB, Fei M, Henry AC, Melgert BN, Chen L, et al. Indoleamine 2,3-dioxygenase in lung dendritic cells promotes Th2 responses and allergic inflammation. Proc Natl Acad Sci USA. (2008) 105:6690–5. doi: 10.1073/pnas.0708809105

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Sánchez-Ovando S, Baines K, Barker D, Wark P, Simpson JL. Endobronchial biopsy gene expression between different severe asthma inflammatory phenotypes. Eur Respirat J. (2019) 54(Suppl 63):PA5207. doi: 10.1183/13993003.congress-2019.PA5207

CrossRef Full Text | Google Scholar

25. Nathan RA, Sorkness CA, Kosinski M, Schatz M, Li JT, Marcus P, et al. Development of the asthma control test A survey for assessing asthma control. J Allergy Clin Immunol. (2004) 113:59–65. doi: 10.1016/j.jaci.2003.09.008

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Nakajima T, Matsumoto K, Suto H, Tanaka K, Ebisawa M, Tomita H, et al. Gene expression screening of human mast cells and eosinophils using high-density oligonucleotide probe arrays: abundant expression of major basic protein in mast cells. Blood. (2001) 98:1127–34. doi: 10.1182/blood.V98.4.1127

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: gene expression, differential co-expression, asthma, asthma control, inflammation

Citation: Queen K, Nguyen M-N, Gilliland FD, Chun S, Raby BA and Millstein J (2023) ACDC: a general approach for detecting phenotype or exposure associated co-expression. Front. Med. 10:1118824. doi: 10.3389/fmed.2023.1118824

Received: 08 December 2022; Accepted: 02 May 2023;
Published: 19 May 2023.

Edited by:

Paula Tejera, Harvard University, United States

Reviewed by:

Shu-Yi Liao, National Jewish Health, United States
Xianlong Wang, Fujian Medical University, China

Copyright © 2023 Queen, Nguyen, Gilliland, Chun, Raby and Millstein. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Joshua Millstein, am9zaHVhLm1pbGxzdGVpbkB1c2MuZWR1

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.