Original Research ARTICLE
Investigation of the Genome-Wide Genetic and Epigenetic Networks for Drug Discovery Based on Systems Biology Approaches in Colorectal Cancer
- 1Laboratory of Automatic Control, Signaling Processing and Systems Biology, Department of Electrical Engineering, National Tsing Hua University, Hsinchu, Taiwan
- 2Department of Pediatrics and Human Development, College of Human Medicine, Michigan State University, Grand Rapids, MI, United States
Colorectal cancer (CRC) is the third most commonly diagnosed type of cancer worldwide. The mechanisms leading to the progression of CRC are involved in both genetic and epigenetic regulations. In this study, we applied systems biology methods to identify potential biomarkers and conduct drug discovery in a computational approach. Using big database mining, we constructed a candidate protein-protein interaction network and a candidate gene regulatory network, combining them into a genome-wide genetic and epigenetic network (GWGEN). With the assistance of system identification and model selection approaches, we obtain real GWGENs for early-stage, mid-stage, and late-stage CRC. Subsequently, we extracted core GWGENs for each stage of CRC from their real GWGENs through a principal network projection method, and projected them to the Kyoto Encyclopedia of Genes and Genomes pathways for further analysis. Finally, we compared these core pathways resulting in different molecular mechanisms in each stage of CRC and identified carcinogenic biomarkers for the design of multiple-molecule drugs to prevent the progression of CRC. Based on the identified gene expression signatures, we suggested potential compounds combined with known CRC drugs to prevent the progression of CRC with querying Connectivity Map (CMap).
Colorectal cancer (CRC) is the third most commonly diagnosed type of cancer worldwide. Its incidence is increasing in young adults, especially in developing countries. As the fourth main cause of cancer-related deaths globally, CRC is a serious threat to human health (Deng et al., 2017). Recently, one study has mentioned that CRC was caused by epigenetic, genetic, and microenvironment factors (Khare and Verma, 2012). However, the molecular mechanisms of CRC are very complicated and remain unclear. Therefore, it is important to investigate the relationship between epigenetic and molecular mechanisms.
The progression mechanism and treatment methods for each stage of CRC has been discussed.
In the early stage of disease, CRC invades through the bowel wall, without involving the lymph nodes. The current treatment for early-stage CRC is surgical resection. Conducting surgical resection in this stage shows no evidence of subsequent influence on the lymph nodes of adjacent organs or distant sites (Freeman, 2013). In the mid stage, CRC involves the lymph nodes. At this stage, the main treatment method is surgical resection, followed by administration of chemotherapy. Using this strategy, a previous study reported that the overall survival of patients was improved and the CRC recurrence rate was decreased (Bos et al., 2015). In the late stage, CRC is widespread through metastases. At this stage, the main treatment methods focus on palliative chemotherapy and radiation therapy. Moreover, the survival rate in patients with late-stage CRC is usually minimal. It is necessary to regularly track the status of patients for the prevention of deterioration (Bouvier et al., 2015). Effective biomarkers, which are correlated with multiple drugs for patients in each stage of CRC, provide an alternative approach to treatment.
It is established that microRNAs (miRNAs), influenced by aberrant epigenetic regulation, also mediate the regulation of gene expression. miRNAs are a broad class of noncoding RNAs (length: ~21 nucleotides), which play crucial roles in posttranscriptional gene regulation (Lee and Ambros, 2001). Accumulating evidence indicates that dysregulations of miRNAs are crucial factors in human development and involved in human diseases, including CRC (Stefani and Slack, 2008). Several studies have investigated (Stefani and Slack, 2008; Chen et al., 2011) the effects of miRNAs on CRC. Nevertheless, further investigations regarding the molecular mechanisms involved in the development of CRC using genome-wide genetic and epigenetic networks (GWGENs), in which miRNAs participate, are warranted. In addition, long noncoding RNA (lncRNA) is a newly identified noncoding RNA molecules that are considered to be important regulators of tumor initiation and development (Xu et al., 2014). Although lncRNA has no biological functions on transcription, accumulating studies have uncovered the emerging role of lncRNA in multiple cellular processes, such as cell differentiation, proliferation, migration, invasion, apoptosis, and so on (Sun et al., 2015; Forrest et al., 2018; Xu et al., 2018; Di et al., 2019). Moreover, epigenetic changes modify the activation of certain genes, but not the genetic code sequence of DNA. Along with the accumulation of epigenetic changes in colon epithelial cells, these cells transform into adenocarcinomas (Lao and Grady, 2011). The most important epigenetic change is aberrant DNA methylation. All CRC cells have aberrantly methylated genes. Aberrant DNA methylation may interact with the change in the tumor microenvironment (TME). TME is the cellular environment in which the tumor exists, including the surrounding blood vessels, immune cells, fibroblasts, bone marrow–derived inflammatory cells, lymphocytes, signaling molecules, and the extracellular matrix (Berraondo et al., 2012). Among them, immune cells are the vital cells which can defeat nascent tumors in the TME. Although tumor cells cannot be completely eradicated by the immune cells, the latter may control tumor growth (Wang et al., 2017). Hence, proper activation of the immune response is beneficial to patients in the early stage of disease.
In this study, we applied a big database mining method to construct candidate GWGENs. The real GWGENs, including a protein-protein interaction network (PPIN) (Hsu et al., 2008; Lin et al., 2010), gene regulatory network (GRN), transcriptional regulations by transcription factors (TFs), lncRNAs (Chen et al., 2011), miRNA inhibitions, and DNA methylations were constructed through system modeling and identification of mRNAs and miRNAs expression using microarray data for each stage of CRC. Analysis of the real GWGENs is complicated; thus, we propose a principal network projection (PNP) method for the selection of the corresponding core GWGENs. Through denotation using the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, the core GWGENs at the three stages of CRC cells could be projected into core pathways of the corresponding CRC cells. By comparing the core pathways of neighboring stages, we identified essential biomarkers involved in the progression mechanisms of CRC cells. These biomarkers could be considered as drug targets for different stages of CRC. By querying Connectivity Map (CMap), multiple-molecule drugs were designed for the therapeutic treatment of CRC progression.
Materials and Methods
Overview of Constructing Candidate GWGEN and Real GWGENs for Identifying Essential Biomarkers in the Progression of CRC
To identify essential biomarkers and find potential multiple-molecule drug to prevent the progression of CRC, the flowchart is given in Figure 1. At first, we constructed the candidate GWGEN by mining big databases, including Database of Interacting Proteins (DIP), Biomolecular Interaction Network Database (BIND), Biological General Repository for Interaction Datasets (BioGRID), IntAct, MINT, Integrated Transcription Factor Platform (ITFP), and Circuits DB2. The genome-wide microarray raw data were downloaded from the Gene Expression Omnibus database (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi), including 290 primary colorectal tumor samples (GSE14333). For the convenience of analysis, we merged Dukes’ stages A and B into the early stage. Dukes’ stages C and D are considered as the mid stage and late stage, respectively (Jorissen et al., 2009). We separated the data in three stages: early stage (137 samples), mid stage (92 samples), and late stage (61 samples). Subsequently, we used the gene symbols of human gene information data from the National Center for Biotechnology Information (NCBI) website as standard human gene names. Secondly, with the help of above mentioned three stages of microarray data, we used system identification and model selection approaches to eliminate the false positives in the candidate GWGEN and obtain real GWGENs. Afterwards, since it was still too complicated to analyze the real GWGENs, the PNP method was applied to get core GWGENs shown as Figures 2–4. Moreover, based on the projection values, the core pathways shown as Figures 5 and 6 in respect of KEGG pathways could be extracted out from the core GWGENs. Eventually, we suggested two potential multiple-molecule drugs to reverse the identified abnormal signature to avoid the progression of CRC by querying CMap.
Figure 1 Flowchart of identifying essential biomarkers and finding potential multiple-molecule drug for three stages of colorectal cancer cells. Microarray data of early-stage to late-stage samples, miRNA data, miRNA database, TF-gene database, and BioGRID database were searched to construct candidate genome-wide genetic and epigenetic networks (GWGENs) consisting of a candidate gene regulatory network (GRN), candidate protein-protein interaction network (PPIN), and candidate miRNA regulation network. Subsequently, false positives of the candidate GWGENs were pruned to construct real GWGENs of early-stage, mid-stage, and late-stage colon cancer through system identification and system order detection methods. The core GWGENs of early-stage, mid-stage, and late-stage could be extracted from the corresponding real GWGENs using the principal network projection (PNP) method. We investigated different molecular mechanisms from early-stage to late-stage colon cancer according to their core pathways in the annotation of Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways.
Figure 2 Core genome-wide genetic and epigenetic network (GWGEN) of early-stage colorectal cancer (CRC) cells. The core GWGEN of early-stage colon cancer was extracted from the corresponding real GWGEN by applying the principal network projection (PNP) method. Among the core GWGEN of early-stage CRC, there are 340 receptors, 2,455 proteins, 101 lncRNAs, 17 miRNAs, and 87 transcription factors (TFs). The gray lines represent edges in protein-protein interaction network (PPIN); the purple lines represent edges in gene regulatory network (GRN); the red lines represent miRNA regulations.
Figure 3 Core genome-wide genetic and epigenetic network (GWGEN) of mid-stage colorectal cancer (CRC) cells. The core GWGEN of mid-stage colon cancer was extracted from the corresponding real GWGEN by applying the principal network projection (PNP) method. Among the core GWGEN of mid-stage CRC, there are 362 receptors, 2,408 proteins, 129 lncRNAs, 15 miRNAs, and 86 transcription factors (TFs). The gray lines represent edges in protein-protein interaction network (PPIN); the purple lines represent edges in gene regulatory network (GRN); the red lines represent miRNA regulations.
Figure 4 Core genome-wide genetic and epigenetic network (GWGEN) of late-stage colorectal cancer (CRC) cells. The core GWGEN of late-stage colon cancer was extracted from the corresponding real GWGEN by applying the principal network projection (PNP) method. Among the core GWGEN of late-stage CRC, there are 370 receptors, 2,464 proteins, 74 lncRNAs, 13 miRNAs, and 79 transcription factors (TFs). The gray lines represent edges in PPI; the purple lines represent edges in gene regulatory network (GRN); the red lines represent miRNA regulations.
Figure 5 Differential core pathways for investigating the progression mechanism from early-stage colon cancer cells to mid-stage colon cancer cells. This figure shows the core pathways in the annotation of Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways extracted from core genome-wide genetic and epigenetic networks (GWGENs) of early-stage colon cancer cells and mid-stage colon cancer cells. The blue background covers the core pathways in early-stage colon cancer cells; the orange background covers the core pathways in mid-stage colon cancer cells; the green background covers the regulation of MIR133B which is modified by DNA methylation to inhibit cell migration and proliferation in mid-stage colon cancer cells. The green rectangular represents downregulation. The yellow rectangular represents upregulation. The blue lines represent the pathways in early-stage colon cancer cells; the orange lines represent the pathways in mid-stage colon cancer cells; the gray dash lines represent translocation; the green lines with an arrow head stand for activation of function; the green lines with a bar head stand for repressing function; the blue and orange lines with an arrowhead and circle head denote activation and repression of function, respectively; the red lines with a circle mean repression of function; and the yellow stars refer to gene involvement in mutation.
Figure 6 Differential core pathways for investigating the progression mechanism between mid-stage colon cancer cells and late-stage colon cancer cells. This figure shows the core pathways in the annotation of Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways extracted from the core genome-wide genetic and epigenetic networks (GWGENs) of mid-stage colon cancer cells and late-stage colon cancer cells. The orange background covers the molecular mechanism in mid-stage colon cancer cells; the purple background covers the core pathways in late-stage colon cancer cells. The green rectangular represents downregulation. The yellow rectangular represents upregulation. The orange lines represent the pathways in mid-stage colon cancer cells; the purple lines represent the pathways in late-stage colon cancer cells; the grey dash lines represent translocation; the green lines with an arrow head stand for activation of function; the green lines with a bar head stand for repression of function; the orange and purple lines with an arrowhead and circle head denote activation and repression of function, respectively; the red lines with a circle mean repression of function; and the yellow stars refer to gene involvement in mutation.
Construction of the Systematic Models for the GWGENs
For the PPIN, the expression levels of the ith protein and its hth interaction protein in the nth sample, denoted by qi [n] and ph[n], respectively, could be
where aih is the interaction ability between the i-th protein and its hth interactive protein; Hi indicates the total number of proteins interacting with the ith protein; βi, PPIN denotes the basal level of the ith protein expression; εi,PPIN [n] represents the stochastic noise owing to the modeling residue and measurement noise in the ith protein; I is the number of proteins; and N is the number of samples.
For the GRN, the expression level of a gene is regulated by its regulatory TFs/proteins, lncRNAs and wth miRNAs could be described by the following gene regulatory equation:
for j=1, …, J, n=1, …, N.
where zj [n] is the expression level of the jth gene; pu [n], xv [n] and rw [n] denote the expressions of the uth TF/protein, the vth lncRNA and the wth miRNA, respectively; dth≥ 0 represents the posttranscriptional regulatory ability of the wth miRNA to inhibit the jth gene; bju denotes the transcription regulatory ability from the uth TF to the jth gene; Uj indicates the total number of TFs binding to the jth gene; cjv is the transcription regulatory ability from the vth lncRNA to the jth gene; Vj denotes the total number of lncRNAs binding to the jth gene; Wj is the total number of miRNAs inhibiting the jth gene; βj,GRN indicates the basal level of the jth gene expression; εj,GRN[n] represents the stochastic noise owing to the modeling residue and measurement noise in the jth gene; J is the total number of genes; and N denotes the number of samples (i.e., patients).
For the lncRNA regulatory network (LRN), the expression level of an lncRNA is regulated by its regulatory TFs/proteins, lncRNAs, and miRNAs is described by the following regulatory equation:
for k=1, …, K, n=1, …, N.
where fk[n] is the expression level of the kth lncRNA; pu[n], xv[n], and rw[n] are the expression levels of regulatory TFs/proteins, lncRNAs and miRNAs, respectively; rkw≥ 0 denotes the posttranscriptional regulatory ability of the wth miRNA to inhibit the kth lncRNA; eku denotes the transcription regulatory ability from the uth TF to the kth lncRNA; Uk indicates the total number of TFs binding to the kth lncRNA; tkv is the transcription regulatory ability from the vth lncRNA to the kth lncRNA; Vk denotes the total number of lncRNAs binding to the kth lncRNA; Wk is the total number of miRNAs inhibiting the kth lncRNA; βk,LRN indicates the basal level of the kth lncRNA expression; εk,LRN[n] represents the stochastic noise owing to the modeling residue and measurement noise in the kth lncRNA; and K is the total number of lncRNAs.
For the miRNA regulatory network (MRN), the expression level of a miRNA is regulated by its regulatory TFs/proteins, lncRNAs, and miRNAs could be described by the following regulatory equation:
for t = 1,…, T, n = 1,…, N,
where bt[n] is the expression level of the tth miRNA; pu[n], xv[n], and rw[n] are the expression levels of regulatory TFs/proteins, lncRNAs, and miRNAs, respectively; ntw≥ 0 denotes the posttranscriptional regulatory ability of the wth miRNA to inhibit the tth miRNA; ytu denotes the transcription regulatory ability from the uth TF to the tth miRNA; Ut indicates the total number of TFs binding to the tth miRNA; htv is the transcription regulatory ability from the vth lncRNA to the tth miRNA; Vt denotes the total number of lncRNAs binding to the tth miRNA; Wt is the total number of miRNAs inhibiting the tth miRNA; βt,MRN indicates the basal level of the tth miRNA expression; εt,MRN[n] represents the stochastic noise owing to the modeling residue and measurement noise in the tth miRNA; and T is the total number of miRNAs. Accordingly, we considered epigenetic modifications, such as acetylation, methylation, ubiquitination, and phosphorylation. These epigenetic modifications can contribute to change in the basal level indicated in equations (1–4) when compared in different stages.
System Identification of Candidate GWGEN via Genome-Wide Microarray Data
To identify the precise parameters of the PPIN in equation (1) through the system identification method, equation (1) can be represented by the linear regression form as shown below:
where indicates the regression vector obtained from the corresponding microarray expression data, and θi,PPIN represents the unknown parameter vector to be estimated for the ith protein in PPIN. The equation (5) of the ith protein can be augmented for N samples as the following form:
which could be simply described as:
Therefore, by applying the following least-squares estimation problem, we could estimate the interactive parameters in the vector :
By solving the above optimization problem (8), we can obtain the interactive parameters in the PPIN interactive equation (1).
The linear regression form of the gene regulatory equations in (2) of GEN could be described as shown below:
where indicates the regression vector obtained from the corresponding microarray expression data, and θj,GRN represents the unknown parameter vector to be estimated for the jth gene in GRN. The equation (9) of the jth gene could be augmented for N samples as the following form:
for j=1, …, J.
which could be simply described as:
Therefore, by solving the following constrained least-squares estimation problem, we could estimate the regulatory parameters in the vector :
Simultaneously, the miRNA repression parameter -djw is guaranteed to be nonpositive (i.e. −djw ≤ 0, for w=1, 2, …, Wj.).
Similarly, the regulatory equations of LRN in (3) could be described below:
for k=1, …, K,
where indicates the regression vector which can be obtained from corresponding microarray expression data, and θk,LRN represents the unknown parameter vector to be estimated for the kth lncRNA in LRN. The equation (13) of the kth lncRNA could be augmented for N samples of corresponding microarray expression data as the following form:
for k=1, …, K,
which could be simply described as:
Therefore, by applying the following constrained least-squares estimation problem, we could estimate the regulatory parameters in the vector :
By solving the above constrained optimization problem (16), we can obtain the parameters in the LRN regulatory equation (3). Simultaneously, the miRNA repression −qkw is guaranteed to be nonpositive, i.e., −qkw ≤ 0, for w=1, 2, …, Wk.
Finally, the linear regression form of the MRN regulatory equation (4) could be described as shown below:
where indicates the regression vector which can be obtained for N samples of corresponding microarray expression data, and θt,MRN represents the unknown parameter vector to be estimated for the tth miRNA in MRN.
The regulatory equation (17) of the tth miRNA could be augmented for N samples of corresponding microarray expression data as the following form:
which could be simply described as:
Therefore, by applying the following constrained least-squares estimation problem, we could estimate the regulatory parameters in the vector :
By solving the above constrained optimization problem (20), we can obtain the parameters in the MRN regulatory equation (4). Simultaneously, the miRNA repression −ztw is guaranteed to be nonpositive, i.e., −ztw ≤ 0, for w=1, 2, …, Wt.
System Order Detection Scheme for Pruning the False Positives of the Candidate Network to Obtain the Real GWGENs of Early-Stage to Late-Stage CRC
Big database mining is associated with many false positives in the constructed candidate GWGEN. Therefore, we pruned the false positives in the candidate GWGEN to obtain the real GWGENs for the three stages of CRC using a system order detection scheme.
For the PPIN model in (7), the Akaike information criterion (AIC) for detecting the number of interactions of the ith protein could be defined using the following equation:
where indicates the estimated interaction parameter vector of the ith protein by solving the equation, is the estimated residual error. A decrease in the system interaction number (order) Hi will result in an increase in the corresponding estimated residual error. In contrast, attempts to minimize the estimated residual error, will result in an increase in the corresponding system order. Hence, we need to trade-off the system order and estimated residual error to determine the minimum value of the AICi for the real system interaction order Hi, i.e., the real system order of protein i could minimize AICi(Hi) in (21). Therefore, the insignificant protein interactions out of system order should be pruned from the interactions of the candidate PPIN. Gradually (one protein each time), we could prune the false positives of all proteins in the candidate PPIN to obtain the real PPIN using the AIC system order detection method. Similarly, for pruning false positives of the candidate GRN subnetwork in (11), the candidate LRN subnetwork in (15), and the candidate MRN subnetwork in (19), AICs for system order detection of the jth gene, the kth lncRNA, and the tth miRNA could be defined using the following equations, respectively:
where ,, indicate the estimated parameter vector of the jth gene, the kth lncRNA, and the tth miRNA by solving the equations (11), (15), and (19), respectively; , , and are the estimated residual error, respectively. In (22), (23), and (24), we could select ,,,,,, and ,,, to minimize AICj(Uj,Vj,Wj),AICk(Uk,Vk,Wk), and AICt(Ut,Vt,Wt), respectively. Then, ,, and would be the real numbers of the regulatory TFs, lncRNAs, and miRNAs of the jth gene, respectively; ,, and would be the real numbers of the regulatory TFs, lncRNAs, and miRNAs of the kth lncRNA, respectively; , and would be the real numbers of the regulatory TFs, lncRNAs, and miRNAs of the tth miRNA, respectively. The insignificant regulatory TFs lncRNAs, and miRNAs out of ,, and in the ith gene, ,, and in the jth lncRNA, ,, and in the tth miRNA are considered false positives in the candidate GWGENs. By gradually pruning the false positives (i.e., one gene, one lncRNA, and one miRNA each time), we could obtain the real GWGENs at every stage of CRC. The GWGENs of early-stage, mid-stage, and late-stage CRC, constructed using the proposed systems biology method, are illustrated in Figures S1, S2, and S3, respectively.
Extract of the Core Network From Real GWGENs by Applying the PNP Method
It is difficult to directly investigate the genetic and epigenetic mechanisms of colon carcinogenesis using the real GWGENs owing to their complexity. Prior to using the PNP method for the extraction of the core GWGENs from the real GWGENs, we initially established a network matrix P, including all the estimated parameters of edges in the real GWGENs. The network matrix P of real GWGEN is shown below:
where could be obtained in by solving the equation (8) and pruning false positives through the AIC method in (21); ,, could be obtained in by solving the equation (12) and pruning false positives using the AIC method in (22); ,, and could be obtained in by solving the equation (16) and pruning false positives via the AIC method in (23); ,, and could be obtained in by solving the equation (20) and pruning false positives by the AIC method in (24); U, V, and W are the total number of TFs, lncRNAs, and miRNAs, respectively. The estimated interactive and regulative abilities of edges in real GWGEN are included in the network matrix P. If an edge does not exist in the GWGEN or has been pruned out through AIC, the corresponding parameter in network matrix P is padded with zero.
Subsequently, we extracted the core components of the GEN using the PNP method. The proposed PNP method is a principal network structure projection method based on principal singular value decomposition of the system matrix P in (25) as follows:
where unitary matrix ∈ ℝ(I+J+K+T)×(U+V+W); R∈ ℝ(U+V+W)×(U+V+W), and D=diag(d1,...,d(U+V+W)) is a diagonal matrix, which contains U+V+W singular values of P in a descending order, i.e., d1≥...≥ dU+V+W.
The eigen expression fraction Em is defined as the following energy normalization:
By selecting the minimum A, the A top singular vectors of matrix P contained 85% of the core network structure of the real GWGEN from the energy point of view. Next, the projection of P to the top A singular vectors of N is defined as follows:
for ω = 1,…,I + J +K + T and ℓ = 1,…,A,
where aω,: represents the ωth row vector of P; and r:ℓ represents the column vectors of N.
Next, we defined the 2-norm projection value of each node, including proteins, genes, lncRNAs, and miRNAs in GWGENs to the top A right-singular vectors
for ω = 1,…, (I + J + K + T), and ℓ = 1,…,A,
where DR(ω) is the 2-norm projection value of the ωth node of GWGEN on the 85% core network architecture. We selected important proteins from receptors to TFs and their associated miRNAs, lncRNAs, and genes to construct core GWGENs for the investigation of significant genetic and epigenetic mechanisms in early-stage, mid-stage, and late-stage CRC cells. Finally, the core GWGENs of early-stage to late-stage CRC are shown in Figures 5 and 6.
Overview of Constructing Genome-Wide Genetic and Epigenetic Network for Early-Stage, Mid-Stage and Late-Stage CRC
In this study, to identify core GWGENs for three stages of CRC, we did big database mining to construct candidate PPI and candidate GRN. The candidate GWGEN consists of candidate PPI and candidate GRN. We applied reversed engineering and model selection approaches with corresponding early-stage, mid-stage, and late-stage of CRC microarray data to obtain real GWGENs shown in Figures S1, S2, and S3, respectively. The total number of nodes including receptors, proteins, lncRNAs, TFs, and miRNAs and edges of their interactions in candidate GWGEN and real GWGENs for three stages of CRC are shown in Table 1. It is noted that the number of nodes and edges decline a lot compared to those in real GWGENs. This phenomenon showed that the false positives were removed by system identification and model selection approaches. Afterwards, we utilized the PNP method, which could help to extract core GWGENs based on significant projection value of the node. The higher the projection value is, the more contribution made by the node in the core GWGEN. For the early-stage core GWGEN of CRC cells as shown in Figure 2, we identified 340 receptors, 2,455 proteins, 101 lncRNAs, 17 miRNAs, and 87 TFs. For the middle-stage core GWGEN of CRC cells as shown in Figure 3, we identified 362 receptors, 2,408 proteins, 129 lncRNAs, 15 miRNAs, and 86 TFs. For the late-stage core GWGEN of CRC cells as shown in Figure 4, 370 receptors, 2,464 proteins, 74 lncRNAs, 13 miRNAs, and 79 TFs are identified. The identified nodes in three stages of CRC could be found in Supplementary Material. Moreover, in order to be convenient for investigating the genetic and epigenetic molecular mechanisms of CRC, we denoted core signaling pathways in respect of KEGG pathways. The differential core signaling pathways were identified and carcinogenic mechanisms were found for early stage to mid stage and mid stage to late stage of CRC, shown in Figures 5 and 6, respectively. Consequently, according to our analytic results, we identified two groups of essential biomarkers reflecting abnormal gene expression signatures for CRC progression. By querying CMap, we suggested two multiple-molecule drugs for early stage to mid stage and mid stage to late stage of CRC, respectively.
Table 1 Number of edges and nodes in the candidate genome-wide genetic and epigenetic network (GWGEN) and identified real GWGENs in each stage of colorectal cancer.
Core Pathways in Early-Stage to Mid-Stage CRC
The core differential pathways between early-stage and mid-stage CRC are shown in Figure 5. Our results demonstrated that the receptor signal transducing adaptor family member 2 (STAP2), which binds the ligand protein tyrosine kinase 6 (PTK6) (hypoxic microenvironment), interacts with tubulin beta 1 class VI (TUBB1) and transmits the signal through protein mitogen-activated protein kinase kinase kinase kinase 1 (MAP4K1) to TF tumor protein p53 (TP53). TP53 is modified by deubiquitination and downregulates the expression of target gene MUC2, which is also modified by DNA methylation. This epigenetic modification can be detected by the basal level. The ligand growth factor receptor bound protein 2 (GRB2) binds to the receptor SHC3 modified by phosphorylation and transmits to proteins kelch like family member 15 (KLHL15) and SIM1. Following the modification of KLHL15 by ubiquitination, the signal will be transmitted to TF tripartite motif containing 65 (TRIM65). In the meantime, TF TRIM65 upregulates CUGBP Elav-like family member 2 (CELF2) to promote tumor apoptosis and inhibit cell migration and proliferation (Ramalingam et al., 2012).
The receptor protein kinase D1 (PRKD1), which is modified by phosphorylation and mutation, binds Mg2+ and transmits the signal to SIM1, MAP4K1, and zinc finger protein 260 (ZNF260). ZNF260 transmits the signal to KRAS which is modified by mutation, and then transmits it to Ras homolog family member F (RHOF) modified by acetylation and methylation. Finally, the signal reaches the TF TCF3, which is modified by mutation and methylation. This results in the upregulation of the target gene NFKB inhibitor like 1 (NFKBIL1), promoting immune response and inflammation (Atzei et al., 2010). The ligand WNT inhibitory factor 1 (WIF1) (signal transduction) binds to the receptor WNT4 and transmits the signal to TF zinc finger E-box binding homeobox 1 (ZEB1). ZEB1 appears in early-stage and mid-stage CRC cells through tubulin alpha 1b (TUBA1B), which is modified by acetylation and methylation, as well as transmembrane protein 39A (TMEM39A). ZEB1 modified by mutation downregulates HECTD3, which is also modified by DNA methylation. The aforementioned pathway controls the cell cycle and tumor apoptosis in early-stage CRC cells. Moreover, these epigenetic modifications can be detected by the basal level. The receptor RAR-related orphan receptor B (RORB), which is duplicated in early-stage and mid-stage CRC cells, binds ligand all-trans retinoic acid (ATRA) (apoptosis signal). Subsequently, it transmits the signal to TF TCF3 through cancer susceptibility 2 (CASC2), histone H4 transcription factor (HINFP) (modified by ubiquitination), and SMG8 (modified by phosphorylation) in early-stage CRC cells. TF TCF3 activates target gene NFKBIL1 to promote inflammation and immune response.
In the mid-stage CRC cells, there are three core pathways transmitting the signals after the binding of three receptors with three ligands and involving several transduction proteins to reach three TFs. Notably, receptor RORB and TF ZEB1 also present in early-stage CRC cells. Firstly, the receptor RORB binds ligand ATRA (apoptosis signal) and then transmits the signal to EGF-containing fibulin extracellular matrix protein 1 (EFEMP1), (modified by phosphorylation and mutation) and potassium two pore domain channel subfamily K member 2 (KCNK2) to interact with TF MAF bZIP transcription factor G (MAFG) (modified by phosphorylation and mutation). Eventually, the target gene MINK1 is upregulated, potentially leading to the migration of cancer cells to mid-stage CRC (Hu et al., 2004). Secondly, ligand G protein subunit alpha L (GNAL) (chemical stimulation) binds to the receptor olfactory receptor family 2 subfamily H member 1 (OR2H1) and transmits the signal through STK17B modified by phosphorylation, GDNF modified by mutation, and coactivator associated arginine methyltransferase 1 (CARM1) modified by acetylation and methylation to TF ZEB1. TF ZEB1 modified by mutation upregulates target gene LRRN1, which promotes proliferation in mid-stage CRC cells (Hossain et al., 2008). Thirdly, the receptor glutamate ionotropic receptor delta type subunit 2 (GRID2) binds ligand cerebellin 1 precursor (CBLN1) (protein secretion) to transmit the signal to TF POU class 2 homeobox 1 (POU2F1) through SMC2. This upregulates the target gene hematopoietic prostaglandin D synthase (HPGDS) and inhibits the migration and proliferation of CRC cells (Tippin et al., 2012).
In Figure 5, lncRNA LOC400043 interacted with MIR133B downregulating its expression. The target genes SMC2, POU2F1, and GDNF are downregulated by MIR133B modified by DNA methylation. Owing to the epigenetic effect, the target gene SMC2 promotes tumor apoptosis and inhibits the cell cycle; the target gene POU2F1 inhibits cell proliferation; and the target gene GDNF inhibits cell migration and proliferation (Segil et al., 1991; Choudhary et al., 2009; Davalos et al., 2012; Evangelisti et al., 2012; Huang et al., 2014; Guo et al., 2015; Wang et al., 2016b).
Core Pathways in Mid-Stage to Late-Stage CRC Cells
The core differential pathways between mid-stage and late-stage CRC are shown in Figure 6. Firstly, in mid-stage CRC cells, the ligand ATRA (apoptosis signal) binds to the receptor RORB and triggers the signaling to EFEMP1, which is modified by phosphorylation and mutation. Subsequently, the signal is transmitted to the protein KCNK2 and finally reaches the TF MAFG, which is modified by mutation and acetylation. Subsequently, the target gene MINK1 is upregulated, potentially leading to the migration of cancer cells to mid-stage CRC (Hu et al., 2004). Secondly, the receptor OR2H1 receives the GNAL (Chemical stimulation) signal and transmits it through sequential proteins STK17B, GDNF, and CARM1 to TF ZEB1. In this pathway, STK17B is translocated by phosphorylation and downregulated by MIR302B; GDNF is modified by mutation; CARM1 is modified by phosphorylation and methylation; and TF ZEB1 is modified by mutation. Finally, the upregulated LRRN1 leads to cell proliferation (Hossain et al., 2008).
All pathways are initiated from the common receptor GRID2 between mid-stage and late-stage CRC cells. In the mid-stage CRC cells, the ligand CBLN1 (protein secretion) binds to receptor GRID2 to transmit the signal to TF POU2F1 through SMC2, which is modified by mutation. In turn, TF POU2F1 upregulates target gene HPGDS to inhibit cell migration and proliferation (Tippin et al., 2012). After regulated by lncRNA LOC400043, MIR133B modified SMC2 resulting in the inhibition of cell proliferation and migration (Choudhary et al., 2009; Davalos et al., 2012; Guo et al., 2015). In the late-stage CRC cells, receptor GRID2 binds with the ligand CBLN1 (protein secretion) and sequentially transmits the signal to TF WD repeat domain 4 (WDR4) through proteins cell cycle associated protein 1 (CAPRIN1), secreted protein acidic and cysteine rich (SPARC), protein phosphatase 2 regulatory subunit B’’beta (PPP2R3B), ectonucleotide pyrophosphatase/phosphodiesterase 4 (ENPP4), and single-pass membrane protein with aspartate rich tail 1 (SMDT1). Eventually, TF WDR4 upregulates target gene LIG3 for DNA repair (Chen et al., 1995).
Notably, the late-stage CRC cells involve two additional pathways. In one pathway, receptor glycoprotein V platelet (GP5) binds ligand von Willebrand factor (VWF) (extracellular matrix organization) to activate TF aryl hydrocarbon receptor (AHR) through a cascade of proteins stomatin (STOM), SLC16A, SH3, and cysteine rich domain (STAC), major facilitator superfamily domain containing 2B (MFSD2B), GDNF, arylsulfatase F (ARSF), and ANGEL2. The protein GDNF, which also appears in mid-stage CRC cells pathway is modified by mutation. Finally, TF AHR upregulates target gene acidic nuclear phosphoprotein 32 family member E (ANP32E) modified by DNA methylation to promote cell proliferation, migration and DNA repair (Obri et al., 2014). In the other pathway, ion K+ binds to receptor potassium voltage-gated channel subfamily Q member 2 (KCNQ2) and transmits the signaling to TF LIG1 through a cascade of proteins, namely peptidylprolyl isomerase F (PPIF), KCNK2, DNA polymerase theta (POLQ), zinc finger protein 37B (ZNF37BP), SIM1, and HUS1. The second pathway present in both mid-stage and late-stage CRC cells; POLQ is modified by mutation (Lin et al., 2000; Nilsson et al., 2001); target gene p53-induced death domain protein 1 (PIDD1) is downregulated by TF LIG1 which promotes tumor apoptosis.
Genetic and Epigenetic Progression Mechanisms of Early-Stage to Mid-Stage CRC via Cell Apoptosis and Migration
By applying the systems biology approach and PNP method, we constructed the core pathways to investigate the genetic and epigenetic carcinogenic mechanisms of CRC, as shown in Figure 5. In the left-pathway in the early-stage CRC cells, the ligand PTK6 binds to the receptor STAP2 phosphorylated in the hypoxic microenvironment (Semenza, 2016). While STAP2 is modified by phosphorylation, the signal is transmitted to cascade proteins TUBB1, armadillo repeat containing 8 (ARMC8), and MAP4K1 (Fujita et al., 2014; Cho et al., 2017). At this point, with signal coupling from SIM1, the pathway activates the MAPK signaling pathway which is related to the control of immune response. MAP4K1 interacts with TF TP53 modified by deubiquitination. Following the modification of TP53 by deubiquitination, it may upregulate MUC2, leading to the progression of CRC. The ubiquitinated TF TP53 inhibits apoptosis and downregulates target gene MUC2 (Malkin et al., 1990; Ghosh et al., 2004). Furthermore, the main cellular functions of MUC2 are to protect the colon from disease, including the activation of inflammation and immune response and the inhibition of migration due to DNA methylation. Unfortunately, TF TP53 is eventually modified by mutation and deubiquitination that will affect the expression level of MUC2 to cause tumorigenesis. However, MUC2 with epigenetic DNA methylation exhibits lower expression to prevent CRC cells from progressing to the mid stage (Moehle et al., 2006; Cobo et al., 2015).
The next pathway begins with the ligand GRB2 in human B lymphocytes, which phosphorylates the receptor SHC3 after binding. This process activates two pathways: (1) through SIM1 to crosstalk with the MAPK pathway via protein MAP4K1 (Magrassi et al., 2005; Tashiro et al., 2009; Azzalin et al., 2014), and (2) transmission of the signal to KLHL15, which is modified by ubiquitination and activation of TF TRIM65. This upregulates target gene CELF2, which activates apoptosis, to inhibit cell proliferation and migration. KLHL15 effected by epigenetic changes stabilizes the expression level to avoid progression to mid-stage CRC (Ferretti et al., 2016; Wang et al., 2016a). Therefore, we concluded that the second pathway could inhibit cell migration, proliferation, cell apoptosis to prevent the progression of early-stage CRC to mid-stage CRC.
The third pathway initiates with binding of Mg2+ to the receptor PRKD1, which is modified by phosphorylation and mutation. Subsequently, the signal is transmitted through proteins SIM1 and ZNF260 to KRAS. KRAS mediates ZNF260 to stabilize PRKD1 in this pathway. It has been reported that this interaction occurs through epigenetic silencing due to mutation (Serra et al., 2014). After transmission of the signal to TF TCF3 with DNA migration through RHOF, which is modified by methylation and acetylation (Gouw et al., 2005), it upregulates target gene NFKBIL1 to activate inflammation and immune response and inhibit the development of CRC (Atzei et al., 2010; Mcallister et al., 2014; Taniue et al., 2016).
The next pathway is initiated with receptor WNT4. Several studies have shown that this pathway could inhibit the proliferation and migration of tumors (Seth and Ruiz I Altaba, 2016; Tang et al., 2018). In the WNT-signaling pathway, ligand WIF1 (signal transduction) binds with the receptor WNT4 to transmit the signal to TF ZEB1 through cascade proteins TUBA1B and TMEM39A. When highly expressed in tumor cells, TUBA1B improves their proliferation. To solve this problem, the acetylation of TUBA1B (also affected by epigenetic methylation) may play an important role in balancing the expression and signal transfer to TF ZEB1 through TMEM39A (Lu et al., 2013). TF ZEB1 is duplicated in both early-stage and mid-stage CRC cells. Its major function is to regulate the apoptosis, migration, and proliferation of cancer cells, as well as the downregulation of target gene HECTD3. As HECTD3 exhibits low expression levels, it may lead to apoptosis of cancer cells and promotion of cell cycle (Shu et al., 2017). Hence, the marked reduction in the expression of HECTD3, caused by DNA methylation, may be beneficial to patients. Unfortunately, TF ZEB1 may undergo mutation during the carcinogenic process. Mutated ZEB1 may result in epithelial–mesenchymal transition (Loboda et al., 2011), progressing CRC to the mid-stage. The final pathway in early-stage CRC cells is initiated from receptor RORB. Some studies have shown that RORB acts as a tumor suppressor. A high expression level of RORB may exert an effect on TF TCF3 to activate a downstream pathway in early-stage CRC (Mühlbauer et al., 2013; Wen et al., 2017). In contrast, a low expression level of RORB may activate another downstream pathway in the mid-stage CRC.
After binding of ligand ATRA to receptor RORB (apoptosis signal), two pathways with epigenetic modifications are activated. In the first pathway, in early-stage CRC, the upregulated CASC2 transmits the signal to HINFP, which is modified by ubiquitination to inhibit tumor growth (Baldinu et al., 2007), and transmits the signal to TF TCF3 through SMG8. Subsequently, SMG8 is modified by phosphorylation to maintain its function in inhibiting cell apoptosis. Finally, the signal is transmitted to TF TCF3 to regulate target gene NFKBIL1, directing inflammation and immune response. We concluded that the cellular dysfunctions in early-stage CRC should include immune response, inflammation, and cell apoptosis. In the second pathway, in mid-stage CRC, receptor RORB transmits the signal to TF MAFG through EFEMP1 and KCNK2; epigenetic modifications were also found in this pathway. EFEMP1 is modified by phosphorylation, which activates the mitogen-activated protein kinase/extracellular signal-regulated kinase (MAPK/ERK) signaling pathway (Dou et al., 2016) to control the disordered cell proliferation in mid-stage CRC (Setia et al., 2014). Following activation of the MAPK/ERK pathway, the signal is transmitted through KCNK2 to TF MAFG, which plays a crucial role in this pathway by the modification of acetylation to promote cell migration (Fang et al., 2014; Vera et al., 2017). Subsequently, MINK1 is upregulated by MAFG. Moreover, previous studies (Hu et al., 2004; Nicke et al., 2005) have shown that MINK1 is an essential target gene in the MAPK/ERK pathway. Specifically, sustained high expression of MINK1 is associated with the occurrence of cell migration. Furthermore, there are some proteins and one gene affected by mutation in this pathway. For example, EFEMP1 is modified by mutation; this may inactivate the MAPK/ERK pathway to induce cell proliferation. The next protein is MAFG; following the mutation of TF MAFG, acetylation does not downregulate its expression level. The aforementioned process may inhibit cell migration. In general, MINK1, which is regulated by the TF MAFG, exhibits high expression in CRC. However, while MINK1 is mutated, the expression may be downregulated to inhibit cell migration.
The second pathway in the mid-stage CRC cells starts with receptor OR2H1. When OR2H1 binds ligand GNAL, it may stimulate the G-protein signal pathway to promote cell proliferation. The signal is transmitted through STK17B, which is modified by phosphorylation to control the cell cycle and apoptosis, to GDNF. GDNF is an essential protein related to cell migration in the progression of CRC. Numerous studies (Oppenheim et al., 1995; Evangelisti et al., 2012; Zhang et al., 2017; Fielder et al., 2018) have shown that GDNF would control cell migration. However, while GDNF is mutated in the signaling cascade, it may cause the pathway to be inactive. CARM1 (modified by methylation and phosphorylation) transmits the signal to TF ZEB1, which may be upregulated to cause cell proliferation (Guo et al., 2017; Zhang et al., 2018). TF ZEB1 plays an important role in this pathway to connect the early-stage and mid-stage of CRC. In the mid-stage, ZEB1 may upregulate LRRN1 to inhibit cell proliferation (Hossain et al., 2008).
The final pathway in the mid-stage CRC cells is activated by receptor GRID2 after binding the ligand CBLN1 (protein secretion). The signal is transmitted to TF POU2F1 through SMC2 to upregulate target gene HPGDS, inhibiting cell proliferation and migration. There are two studies (Tippin et al., 2012; Tippin et al., 2014) indicating that high expression of HPGDS might inhibit tumor proliferation in normal human cells. However, in CRC, HPGDS exhibits a five-fold lower expression (Tippin et al., 2012) to promote tumor migration. The aforementioned processes do not involve mutations. Mutations in this pathway (e.g., receptor GRID2) would lead to its inactivation. Moreover, mutation of the protein SMC2, may impair the natural expression and promote tumor growth.
We also investigated the role of miRNA regulation in the molecular carcinogenic mechanism of mid-stage CRC. As shown in Figure 5, some proteins (GDNF, SMC2, and POU2F1) may be translocated through miRNA regulation. One study has indicated that the overexpression of MIR133B in CRC cells induces apoptosis and cell cycle arrest at G1 phase (Lv et al., 2015). However, they have not mentioned that the molecular mechanism involved in the regulation of MIR133B. In this study, we hypothesized that the overexpression of MIR133B would lead to loss of its inhibitory function on other genes. However, accompanying by the regulation of lncRNA LOC400043 and DNA methylation, MIR133B might reverse the overexpression to its natural expression, leading to downregulation of GDNF, SMC2, and POU2F1 in this pathway. Furthermore, the high expression of GDNF may affect tumor migration. As GDNF is downregulated by MIR133B, it inhibits tumor migration and proliferation (Evangelisti et al., 2012; Huang et al., 2014). The expression of SMC2 modified by MIR133B could downregulate the control of tumor proliferation, migration, and apoptosis (Choudhary et al., 2009; Davalos et al., 2012; Je et al., 2014). POU2F1 may be overexpressed in CRC cells to cause proliferation. Affected by MIR133B, the expression of POU2F1 would be downregulated to inhibit cell proliferation. Finally, we found that the cellular dysfunctions in mid-stage CRC include migration and proliferation. The key point of progression from early-stage to mid-stage cancer cells is the mutation of TF ZEB1, which cross-talks in two pathways.
Genetic and Epigenetic Progression Mechanisms From Mid-Stage to Late-Stage CRC via Cell Migration and Proliferation
Late-stage CRC involves widespread metastasis (Jorissen et al., 2009). Only a few studies have investigated late-stage CRC. Figure 6 shows three core pathways in the late-stage CRC cells. The first pathway starts with receptor GRID2, which also appears in mid-stage CRC cells, and transmits the signal to TF WDR4 to upregulate target gene LIG3. LIG3 has been investigated in studies concerning DNA repair (Murray et al., 2011; Hu et al., 2018). DNA damage and repair are double-edged swords in cancer. DNA damage, coupled with error-prone repair, could drive cancer progression by promoting genomic or genetic instability (Doksani and De Lange, 2016; Hu et al., 2018). Based on our data analysis result, we infer that the DNA repair in response to DNA damage provides a possibility to prevent CRC cells from progressing. However, failures in DNA repair would lead to the deterioration of patient’s condition.
The core pathway contains protein GDNF, which is the critical factor in the progression of CRC. This pathway starts with receptor GP5, which binds ligand VWF (extracellular matrix organization), transmitting the signal to TF AHR through GDNF. AHR regulates cell proliferation (Xie and Raufman, 2015) by upregulating target gene ANP32E. The upregulated ANP32E may cause tumor migration and proliferation. Our results show that ANP32E is modified by DNA methylation to downregulate ANP32E, which leads to the promotion of tumor migration and proliferation, as well as DNA repair (Obri et al., 2014).
The last pathway in the late stage starts with receptor KCNQ2 binding with the K+ transmits signal to TF LIG1 through cascade protein KCNK2, which also appears in the mid-stage CRC cells. POLQ, which is prevalent in CRC, is regarded as a tumor promoter because its mutation could cause overexpression to promote cell migration (Higgins et al., 2010). When the signal is transmitted to TF LIG1, it may downregulate target gene PIDD1 to promote cell apoptosis (Lin et al., 2000; Nilsson et al., 2001). In other words, TF LIG1 is a tumor suppressor which regulates PIDD1 to promote cell apoptosis. Regulated by lncRNA LOC400043, MIR133B is able to downregulate SMC2 resulting in the inhibition of cell migration. Of note, mutation of the receptor GRID2 denying the binding of ligand CBLN1 is the key for the progression from mid-stage to late-stage CRC.
Genetic and Epigenetic Carcinogenic Mechanisms in Early-Stage to Late-Stage CRC Cells
After analyzing the pathway of each stage of CRC, we recognized some essential pathways and graphically illustrated them in Figure 7. Based on the investigation of the core genetic and epigenetic network, we identified some pathogenic biomarkers for the design of multiple drugs against CRC. In early-stage disease, PTK6 (hypoxic environment) binds with STAP2 to activate essential protein MAP4K1 and consequently the MAPK pathway (Fujita et al., 2014; Cho et al., 2017). WIF1 (Signal transduction) binds with WNT4 to activate the WNT signaling pathway (Seth and Ruiz I Altaba, 2016; Tang et al., 2018). MUC2 and HECTD3, modified by DNA methylation, could promote immune response, inflammation, cell cycle, and apoptosis. At this stage, TF TP53 and ZEB1 are two potential factors of tumor progression, which are modified by mutation to cause cell migration (Malkin et al., 1990; Ghosh et al., 2004; Loboda et al., 2011). The mutated TF TP53 and ZEB1 promote the progression of CRC toward the mid stage.
Figure 7 Overview of genetic and epigenetic progression mechanisms from early-stage colon cancer cells to late-stage colorectal cancer cells.The upper part shows the genetic and epigenetic mechanisms of early-stage colon cancer cells; the middle part shows the genetic and epigenetic mechanisms of mid-stage colon cancer cells; the lower part shows the genetic and epigenetic mechanisms of late-stage colon cancer cells. The black lines represent protein-protein interaction; the red lines represent upregulation or downregulation; the black dash lines represent upregulation or downregulation; black dot line represents crosstalk; the red dash lines represent genetic, epigenetic, and mutation effects; red rectangular represents cellular functions; black dash rectangles include pathway description; green dash rectangles represent microenvironment; the red cross indicates mutation which may inactivate the pathway; and the blue rectangles show progression mechanisms.
Mid-stage CRC involves three pathways: (1) ligand GNAL (chemical stimulation) binds with receptor OR2H1 to activate the G-protein signaling pathway; (2) ligand ATRA (apoptosis signal) binds with receptor RORB to activate the MAPK/ERK pathway (Setia et al., 2014); and (3) ligand CBLN1 (protein secretion) binds with receptor GRID2 to activate the CREB pathway (Hui et al., 2014). The most important carcinogenic mechanism and cellular dysfunctions at this stage of CRC are as follows: MIR133B is modified by DNA methylation to downregulate GDNF and SMC2, inhibiting tumor proliferation and migration. Target gene MINK1 and its TF MAFG are two tumor promoters (Hu et al., 2004; Nicke et al., 2005; Fang et al., 2014; Vera et al., 2017). Affected by mutations, their expression may be reduced to prevent tumor migration. In other words, they would abolish their tumor promoting activity. GDNF, EFEMP1, and GRID2 play important roles in carcinogenesis at this stage. Following the occurrence of mutations within the cascade signaling pathway, these pathways may be inactivated and accelerate tumor migration.
The late-stage CRC cells involve two core pathways: (1) ligand CBLN1 (protein secretion) binds with receptor GRID2; and (2) ligand VWF (extracellular matrix organization) binds with receptor GP5. In the first pathway, the target gene LIG3 controls DNA repair. In the second pathway, the target gene ANP32E is modified by DNA methylation to downregulate its expression and retard tumor migration (Li et al., 2017; Xiong et al., 2018).
Design of Multiple-Molecule Drugs for Preventing the Progression From Early-Stage to Mid-Stage and Mid-Stage to Late-Stage CRC by Querying Connectivity Map
According to the analyzed results of core signaling pathways for three stages of CRC, genetic, and epigenetic biomarkers are identified as drug targets for designing multiple-molecule drug to prevent progression from early-stage to mid-stage and mid-stage to late-stage CRC. Connectivity Map (CMap) build 02, a project developed by Broad Institute, contains 6100 instances with 1,309 drugs and 156 concentrations on five cell lines (Musa et al., 2017). By querying CMap build 02, we suggested three potential compounds with high negative connectivity scores and combined them with the known drugs of CRC for preventing progression from early stage to mid stage of CRC and mid stage to late stage of CRC, respectively. The correlation coefficients between the concentrations of drugs and the gene expression levels indicate the relationship between drugs and genes. If the correlation coefficient is >0, the gene is upregulated by treatment with the drug; if the correlation coefficient is <0, the gene is downregulated by treatment with the drug.
We selected one genetic biomarker and four epigenetic biomarkers in Table 2: the genetic biomarker is MINK1 and the epigenetic biomarkers are MUC2, HECTD3, GDNF, and SMC2. The epigenetic biomarkers were selected owing to their high expression level, as a consequence of alterations in epigenetic regulation resulting in cell migration and proliferation. Moreover, MINK1 exhibits a high expression level, resulting in cell migration in CRC. We selected two known drugs used in the treatment of CRC, (i.e., 5-fluorouracil and oxaliplatin), and three potential compounds (i.e., mesalazine, dexverapamil, and sulindac) by querying CMap to restore the normal expression of five target genes (Table 2). The results showed that the expression of five target genes were decreased through the treatment with the proposed potential compounds. Among them, it is noted that there are several evidences demonstrating the ability of mesalazine and its derivative to interfere with intracellular signals involved in CRC cell growth (Bus et al., 1999; Lyakhovich and Gasche, 2010; Stolfi et al., 2012). Meanwhile, ClinicalTroals.gov identifier NCT02077777, has shown mesalazine completed phase II clinical trial based on definitions developed by the U.S. Food and Drug Administration (FDA) for chemopreventive action of mesalazine on CRC. The dexverapamil has been regarded as chemosensitizer, which is a small molecule making tumor cells be sensitive to the chemotherapeutic agents (Weinlander et al., 1997; Hu et al., 2016). Moreover, there is one study showing sulindac could inhibit CRC cell growth and downregulate specificity protein transcription factors (Li et al., 2015). Based on these results, we suggest that combining three potential drugs with two known CRC drugs may alleviate the rate of deterioration from early-stage to mid-stage CRC.
Table 2 Drug targets and the corresponding multiple-molecule drug for the therapeutic treatment from early-stage to mid-stage colorectal cancer.
As shown in Table 3, we identified four genetic biomarkers and one epigenetic biomarker. The genetic biomarkers are LIG3, STK17B, MINK1, and LRRN1, and the epigenetic biomarker is SMC2. According to the above analysis, MINK1, STK17B, LRRN1, and SMC2 exhibit high expression level to promote cell migration in CRC; LIG3 has a high expression level to cause failure in DNA repair. We selected two known drugs of CRC (i.e., 5-fluorouracil and oxaliplatin), and proposed three potential compounds (i.e. valproic acid, estradiol, and gefitinib) by querying CMap to restore the normal expression of five targets. Notably, the valproic acid not only inhibits CRC cells growth through cell cycle modification but also has the ability to reverse aberrant DNA methylation partially (Strey et al., 2011; Brodie and Brandes, 2014; Bressy et al., 2017). Moreover, estradiol has been found that it reduced proliferation and apoptosis in CRC (Sasso et al., 2019). For gefitinib, there are a number of phase I and II studies investigating the effects caused by the combination with standard 5-fluorouracil (5-FU)-based regimes with response rates ranging from 25 to 59%, although these trials did not include chemotherapy-resistant individuals (Kuo et al., 2005; Cho et al., 2006; Hofheinz et al., 2006; Wolpin et al., 2006; Stebbing et al., 2008). Moreover, ClnicalTrials.gov identifier NCT00025350, has completed phase II trial for using gefitinib in patients with recurrent metastatic CRC. According to these results, we suggest that the combination of three potential compounds with two known drugs used against CRC as a multiple-molecule drug may retard the rate of deterioration from mid-stage to late-stage CRC.
Table 3 Drug targets and the corresponding multiple-molecule drug for the therapeutic treatment of mid-stage and late-stage colorectal cancer.
The Model Evaluations and Limitations of Systems Biology Approaches to Infer the Core Signaling Pathways of CRC
In drug discovery, biomarker identification is an important problem. Ligand binds to receptor, which trigger downstream signaling cascade, and results in the progression of tumor cells. In this study, in order to investigate the core signaling pathways of CRC for identifying essential biomarkers, we leveraged microarray CRC dataset to construct real GWGENs by the reversed engineering method. Afterwards, we applied Akaike’s information criterion (AIC), which could help us to prune the false positives of regulations and interactions in the GWGENs and conquer the overfitting and under fitting problems. To evaluate our proposed models, we found another independent dataset, Colorectal Adenocarcinoma (TCGA, PanCancer Atlas) (Hoadley et al., 2018), and calculated the AIC values for the common symbols. Subsequently, we executed random permutation for 1,000 times on our original dataset (GSE14333). Here, we would like to know what percentage of common symbols in the independent dataset could have significant p-value. In other words, taking one common symbol for example, if the AIC value of common symbol could be lower than all of the AIC values after random permutation in the original dataset, we would say that the common symbol has significant p-value (p-value < 0.001). According to our model evaluation results for each stage of CRC in the independent dataset, there are 0.5847, 0.5170, and 0.5867 percentage of common symbols with significant p-value, respectively. The corresponding model evaluation analysis code could be found in the Github link (https://github.com/lab619nthu/Validation.git). Moreover, we also found that the symbol with numerous edges was prone to have the higher p-value than the original dataset. This phenomenon implies that the experimental condition change would have severe effects on symbol owning lots of edges.
It is noted that not all of the pathway analysis of proteins (Tables 4–6) have been proved to be associated with CRC (for example, the HCM pathway shown in Supplementary Table 2). We conclude multiple reasons resulting in such finding. First, it is known that microarray dataset is very noisy. Considering the fact that most of the models are linear in our pipeline, for some context, the true signal may be buried in the accumulated noise due to the high dimensionality of the dataset. In future, we would like to enhance our pipeline and try to minimize such effect as much as possible. Secondly, cancer cells could utilize cellular programs which are different from normal cells to survive in the stressful microenvironment. For example, reactivation of cancer-testis antigen BAP31 has been identified to promote proliferation and invasion in cervical cancer (Dang et al., 2018). Therefore, we could not totally exclude the possibility that the HCM pathway promotes the progress of disease as cancer cells could utilize unusual cellular programs to survive. The ectopic expressions may have something to do with the rewiring of cancer cellular program. Currently, we only put focus on gene expression. In future, we would like to integrate multiple types of molecular data into our pipeline and have more exploration on this.
Table 4 Pathway analysis of proteins in the real genome-wide genetic and epigenetic network (GWGEN) of early-stage colon cancer.
Table 5 Pathway analysis of proteins in the real genome-wide genetic and epigenetic network (GWGEN) of mid-stage colon cancer.
Table 6 Pathway analysis of proteins in the real genome-wide genetic and epigenetic network (GWGEN) of late-stage colon cancer.
Data Availability Statement
The genome-wide microarray raw data were downloaded from the Gene Expression Omnibus database (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi), including 290 primary colorectal tumor samples (GSE14333).
Conceptualization: S-JY and S-WC. Methodology: S-JY, S-WC, and B-SC. Software: S-JY and S-WC. Validation: S-JY. Formal Analysis: S-JY and S-WC. Investigation: S-WC. Data Curation: S-WC. Writing—Original Draft Preparation: S-JY. Writing-Review and Editing: S-JY, S-WC, and B-SC. Visualization: S-JY and S-WC. Supervision: B-SC. Funding Acquisition: B-SC.
This research was funded by the Ministry of Science and Technology (grant number: MOST 107-2221-E-007-112-MY3).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2020.00117/full#supplementary-material
Atzei, P., Gargan, S., Curran, N., Moynagh, P. N. (2010). Cactin targets the MHC class III protein IkappaB-like (IkappaBL) and inhibits NF-kappaB and interferon-regulatory factor signaling pathways. J. Biol. Chem. 285, 36804–36817. doi: 10.1074/jbc.M110.139113
Azzalin, A., Moretti, E., Arbustini, E., Magrassi, L. (2014). Cell density modulates SHC3 expression and survival of human glioblastoma cells through Fak activation. J. Neurooncol. 120, 245–256. doi: 10.1007/s11060-014-1551-x
Bos, A. C., Van Erning, F. N., Van Gestel, Y. R., Creemers, G. J., Punt, C. J., Van Oijen, M. G., et al. (2015). Timing of adjuvant chemotherapy and its relation to survival among patients with stage III colon cancer. Eur. J. Cancer 51, 2553–2561. doi: 10.1016/j.ejca.2015.08.016
Bouvier, A. M., Launoy, G., Bouvier, V., Rollot, F., Manfredi, S., Faivre, J., et al. (2015). Incidence and patterns of late recurrences in colon cancer patients. Int. J. Cancer 137, 2133–2138. doi: 10.1002/ijc.29578
Bressy, C., Majhen, D., Raddi, N., Jdey, W., Cornilleau, G., Zig, L., et al. (2017). Combined therapy of colon carcinomas with an oncolytic adenovirus and valproic acid. Oncotarget 8, 97344–97360. doi: 10.18632/oncotarget.22107
Bus, P. J., Nagtegaal, I. D., Verspaget, H. W., Lamers, C. B., Geldof, H., Van Krieken, J. H., et al. (1999). Mesalazine-induced apoptosis of colorectal cancer: on the verge of a new chemopreventive era? Aliment. Pharmacol. Ther. 13, 1397–1402. doi: 10.1046/j.1365-2036.1999.00652.x
Chen, J., Tomkinson, A. E., Ramos, W., Mackey, Z. B., Danehower, S., Walter, C. A., et al. (1995). Mammalian DNA ligase III: molecular cloning, chromosomal localization, and expression in spermatocytes undergoing meiotic recombination. Mol. Cell Biol. 15, 5412–5422. doi: 10.1128/MCB.15.10.5412
Chen, C.-Y., Chen, S.-T., Fuh, C.-S., Juan, H.-F., Huang, H.-C. (2011). Coregulation of transcription factors and microRNAs in human transcriptional regulatory network. BMC Bioinf. 12, S41. doi: 10.1186/1471-2105-12-S1-S41
Cho, C. D., Fisher, G. A., Halsey, J., Sikic, B. I. (2006). Phase I study of gefitinib, oxaliplatin, 5-fluorouracil, and leucovorin (IFOX) in patients with advanced solid malignancies. Invest. New Drugs 24, 117–123. doi: 10.1007/s10637-006-2032-7
Cho, S. S. L., Han, J., James, S. J., Png, C. W., Weerasooriya, M., Alonso, S., et al. (2017). Dual-Specificity Phosphatase 12 Targets p38 MAP Kinase to Regulate Macrophage Response to Intracellular Bacterial Infection. Front. Immunol. 8, 1259. doi: 10.3389/fimmu.2017.01259
Choudhary, C., Kumar, C., Gnad, F., Nielsen, M. L., Rehman, M., Walther, T. C., et al. (2009). Lysine acetylation targets protein complexes and co-regulates major cellular functions. Science 325, 834–840. doi: 10.1126/science.1175371
Cobo, E. R., Kissoon-Singh, V., Moreau, F., Chadee, K. (2015). Colonic MUC2 mucin regulates the expression and antimicrobial activity of beta-defensin 2. Mucosal. Immunol. 8, 1360–1372. doi: 10.1038/mi.2015.27
Dang, E., Yang, S., Song, C., Jiang, D., Li, Z., Fan, W., et al. (2018). BAP31, a newly defined cancer/testis antigen, regulates proliferation, migration, and invasion to promote cervical cancer progression. Cell Death Dis. 9, 791. doi: 10.1038/s41419-018-0824-2
Davalos, V., Suarez-Lopez, L., Castano, J., Messent, A., Abasolo, I., Fernandez, Y., et al. (2012). Human SMC2 protein, a core subunit of human condensin complex, is a novel transcriptional target of the WNT signaling pathway and a new therapeutic target. J. Biol. Chem. 287, 43472–43481. doi: 10.1074/jbc.M112.428466
Deng, H., Wang, J. M., Li, M., Tang, R., Tang, K., Su, Y., et al. (2017). Long non-coding RNAs: New biomarkers for prognosis and diagnosis of colon cancer. Tumour Biol. 39, 1010428317706332. doi: 10.1177/1010428317706332
Di, W., Weinan, X., Xin, L., Zhiwei, Y., Xinyue, G., Jinxue, T., et al. (2019). Long noncoding RNA SNHG14 facilitates colorectal cancer metastasis through targeting EZH2-regulated EPHA7. Cell Death Dis. 10, 514. doi: 10.1038/s41419-019-1707-x
Doksani, Y., De Lange, T. (2016). Telomere-Internal Double-Strand Breaks Are Repaired by Homologous Recombination and PARP1/Lig3-Dependent End-Joining. Cell Rep. 17, 1646–1656. doi: 10.1016/j.celrep.2016.10.008
Dou, C. Y., Cao, C. J., Wang, Z., Zhang, R. H., Huang, L. L., Lian, J. Y., et al. (2016). EFEMP1 inhibits migration of hepatocellular carcinoma by regulating MMP2 and MMP9 via ERK1/2 activity. Oncol. Rep. 35, 3489–3495. doi: 10.3892/or.2016.4733
Evangelisti, C., Bianco, F., Pradella, L. M., Puliti, A., Goldoni, A., Sbrana, I., et al. (2012). Apolipoprotein B is a new target of the GDNF/RET and ET-3/EDNRB signalling pathways. Neurogastroenterol. Motil. 24, e497–e508. doi: 10.1111/j.1365-2982.2012.01998.x
Fang, M., Ou, J., Hutchinson, L., Green, M. R. (2014). The BRAF oncoprotein functions through the transcriptional repressor MAFG to mediate the CpG Island Methylator phenotype. Mol. Cell 55, 904–915. doi: 10.1016/j.molcel.2014.08.010
Ferretti, L. P., Himmels, S. F., Trenner, A., Walker, C., Von Aesch, C., Eggenschwiler, A., et al. (2016). Cullin3-KLHL15 ubiquitin ligase mediates CtIP protein turnover to fine-tune DNA-end resection. Nat. Commun. 7, 12628. doi: 10.1038/ncomms12628
Forrest, M. E., Saiakhova, A., Beard, L., Buchner, D. A., Scacheri, P. C., Laframboise, T., et al. (2018). Colon Cancer-Upregulated Long Non-Coding RNA lincDUSP Regulates Cell Cycle Genes and Potentiates Resistance to Apoptosis. Sci. Rep. 8, 7324–.
Fujita, N., Oritani, K., Ichii, M., Yokota, T., Saitoh, N., Okuzaki, D., et al. (2014). Signal-transducing adaptor protein-2 regulates macrophage migration into inflammatory sites during dextran sodium sulfate induced colitis. Eur. J. Immunol. 44, 1791–1801. doi: 10.1002/eji.201344239
Gouw, L. G., Reading, N. S., Jenson, S. D., Lim, M. S., Elenitoba-Johnson, K. S. (2005). Expression of the Rho-family GTPase gene RHOF in lymphocyte subsets and malignant lymphomas. Br. J. Haematol. 129, 531–533. doi: 10.1111/j.1365-2141.2005.05481.x
Guo, Y., Li, X., Lin, C., Zhang, Y., Hu, G., Zhou, J., et al. (2015). MicroRNA133b inhibits connective tissue growth factor in colorectal cancer and correlates with the clinical stage of the disease. Mol. Med. Rep. 11, 2805–2812. doi: 10.3892/mmr.2014.3075
Guo, C., Ma, J., Deng, G., Qu, Y., Yin, L., Li, Y., et al. (2017). ZEB1 Promotes oxaliplatin resistance through the induction of epithelial - mesenchymal transition in colon cancer cells. J. Cancer 8, 3555–3566. doi: 10.7150/jca.20952
Higgins, G. S., Harris, A. L., Prevo, R., Helleday, T., Mckenna, W. G., Buffa, F. M. (2010). Overexpression of POLQ confers a poor prognosis in early breast cancer patients. Oncotarget 1, 175–184. doi: 10.18632/oncotarget.124
Hoadley, K. A., Yau, C., Hinoue, T., Wolf, D. M., Lazar, A. J., Drill, E., et al. (2018). Cell-of-Origin Patterns Dominate the Molecular Classification of 10,000 Tumors from 33 Types of Cancer. Cell 173, 291–304.e296. doi: 10.1016/j.cell.2018.03.022
Hofheinz, R. D., Kubicka, S., Wollert, J., Arnold, D., Hochhaus, A. (2006). Gefitinib in combination with 5-fluorouracil (5-FU)/folinic acid and irinotecan in patients with 5-FU/oxaliplatin- refractory colorectal cancer: a phase I/II study of the Arbeitsgemeinschaft fur Internistische Onkologie (AIO). Onkologie 29, 563–567.
Hossain, M. S., Ozaki, T., Wang, H., Nakagawa, A., Takenobu, H., Ohira, M., et al. (2008). N-MYC promotes cell proliferation through a direct transactivation of neuronal leucine-rich repeat protein-1 (NLRR1) gene in neuroblastoma. Oncogene 27, 6075–6082. doi: 10.1038/onc.2008.200
Hu, Y., Leo, C., Yu, S., Huang, B. C., Wang, H., Shen, M., et al. (2004). Identification and functional characterization of a novel human misshapen/Nck interacting kinase-related kinase, hMINK beta. J. Biol. Chem. 279, 54387–54397. doi: 10.1074/jbc.M404497200
Hu, Y., Lin, J., Fang, H., Fang, J., Li, C., Chen, W., et al. (2018). Targeting the MALAT1/PARP1/LIG3 complex induces DNA damage and apoptosis in multiple myeloma. Leukemia. 32, 2250–2262. doi: 10.1038/s41375-018-0104-2
Huang, S. M., Chen, T. S., Chiu, C. M., Chang, L. K., Liao, K. F., Tan, H. M., et al. (2014). GDNF increases cell motility in human colon cancer through VEGF-VEGFR1 interaction. Endocr. Relat. Cancer 21, 73–84. doi: 10.1530/ERC-13-0351
Hui, K., Yang, Y., Shi, K., Luo, H., Duan, J., An, J., et al. (2014). The p38 MAPK-regulated PKD1/CREB/Bcl-2 pathway contributes to selenite-induced colorectal cancer cell apoptosis in vitro and in vivo. Cancer Lett. 354, 189–199. doi: 10.1016/j.canlet.2014.08.009
Je, E. M., Yoo, N. J., Lee, S. H. (2014). Mutational and expressional analysis of SMC2 gene in gastric and colorectal cancers with microsatellite instability. Apmis 122, 499–504. doi: 10.1111/apm.12193
Jorissen, R. N., Gibbs, P., Christie, M., Prakash, S., Lipton, L., Desai, J., et al. (2009). Metastasis-Associated Gene Expression Changes Predict Poor Outcomes in Patients with Dukes Stage B and C Colorectal Cancer. Clin. Cancer Res. 15, 7642–7651. doi: 10.1158/1078-0432.CCR-09-1431
Kuo, T., Cho, C. D., Halsey, J., Wakelee, H. A., Advani, R. H., Ford, J. M., et al. (2005). Phase II study of gefitinib, fluorouracil, leucovorin, and oxaliplatin therapy in previously treated patients with metastatic colorectal cancer. J. Clin. Oncol. 23, 5613–5619. doi: 10.1200/JCO.2005.08.359
Li, X., Pathi, S. S., Safe, S. (2015). Sulindac sulfide inhibits colon cancer cell growth and downregulates specificity protein transcription factors. BMC Cancer 15, 974. doi: 10.1186/s12885-015-1956-8
Li, P., Xu, T., Zhou, X., Liao, L., Pang, G., Luo, W., et al. (2017). Downregulation of miRNA-141 in breast cancer cells is associated with cell migration and invasion: involvement of ANP32E targeting. Cancer Medicine. 6, 662–672. doi: 10.1002/cam4.1024
Lin, C.-C., Hsiang, J.-T., Wu, C.-Y., Oyang, Y.-J., Juan, H.-F., Huang, H.-C. (2010). Dynamic functional modules in co-expressed protein interaction networks of dilated cardiomyopathy. BMC Syst. Biol. 4, 138. doi: 10.1186/1752-0509-4-138
Loboda, A., Nebozhyn, M. V., Watters, J. W., Buser, C. A., Shaw, P. M., Huang, P. S., et al. (2011). EMT is the dominant program in human colon cancer. BMC Med. Genomics 4, 9. doi: 10.1186/1755-8794-4-9
Lu, C., Zhang, J., He, S., Wan, C., Shan, A., Wang, Y., et al. (2013). Increased alpha-tubulin1b expression indicates poor prognosis and resistance to chemotherapy in hepatocellular carcinoma. Dig. Dis. Sci. 58, 2713–2720. doi: 10.1007/s10620-013-2692-z
Lv, L. V., Zhou, J., Lin, C., Hu, G., Yi, L. U., Du, J., et al. (2015). DNA methylation is involved in the aberrant expression of miR-133b in colorectal cancer cells. Oncol. Lett. 10, 907–912. doi: 10.3892/ol.2015.3336
Mühlbauer, E., Bazwinsky-Wutschke, I., Wolgast, S., Labucay, K., Peschke, E. (2013). Differential and day-time dependent expression of nuclear receptors RORα, RORβ, RORγ and RXRα in the rodent pancreas and islet. Mol. Cell. Endocrinol. 365, 129–138. doi: 10.1016/j.mce.2012.10.001
Magrassi, L., Conti, L., Lanterna, A., Zuccato, C., Marchionni, M., Cassini, P., et al. (2005). Shc3 affects human high-grade astrocytomas survival. Oncogene 24, 5198–5206. doi: 10.1038/sj.onc.1208708
Malkin, D., Li, F. P., Strong, L. C., Fraumeni, J. F., Jr., Nelson, C. E., Kim, D. H., et al. (1990). Germ line p53 mutations in a familial syndrome of breast cancer, sarcomas, and other neoplasms. Science 250, 1233–1238. doi: 10.1126/science.1978757
Moehle, C., Ackermann, N., Langmann, T., Aslanidis, C., Kel, A., Kel-Margoulis, O., et al. (2006). Aberrant intestinal expression and allelic variants of mucin genes associated with inflammatory bowel disease. J. Mol. Med. (Berl.) 84, 1055–1066. doi: 10.1007/s00109-006-0100-2
Murray, R. J., Tanteles, G. A., Mills, J., Perry, A., Peat, I., Osman, A., et al. (2011). Association between single nucleotide polymorphisms in the DNA repair gene LIG3 and acute adverse skin reactions following radiotherapy. Radiother. Oncol. 99, 231–234. doi: 10.1016/j.radonc.2011.05.007
Musa, A., Ghoraie, L. S., Zhang, S. D., Glazko, G., Yli-Harja, O., Dehmer, M., et al. (2017). A review of connectivity map and computational approaches in pharmacogenomics. Brief Bioinform. 18, 903. doi: 10.1093/bib/bbx023
Nicke, B., Bastien, J., Khanna, S. J., Warne, P. H., Cowling, V., Cook, S. J., et al. (2005). Involvement of MINK, a Ste20 family kinase, in Ras oncogene-induced growth arrest in human ovarian surface epithelial cells. Mol. Cell 20, 673–685. doi: 10.1016/j.molcel.2005.10.038
Nilsson, J., Vallbo, C., Guo, D., Golovleva, I., Hallberg, B., Henriksson, R., et al. (2001). Cloning, characterization, and expression of human LIG1. Biochem. Biophys. Res. Commun. 284, 1155–1161. doi: 10.1006/bbrc.2001.5092
Obri, A., Ouararhni, K., Papin, C., Diebold, M. L., Padmanabhan, K., Marek, M., et al. (2014). ANP32E is a histone chaperone that removes H2A.Z from chromatin. Nature 505, 648–653. doi: 10.1038/nature12922
Oppenheim, R. W., Houenou, L. J., Johnson, J. E., Lin, L. F., Li, L., Lo, A. C., et al. (1995). Developing motor neurons rescued from programmed and axotomy-induced cell death by GDNF. Nature 373, 344–346. doi: 10.1038/373344a0
Ramalingam, S., Ramamoorthy, P., Subramaniam, D., Anant, S. (2012). Reduced expression of rna binding protein celf2, a putative tumor suppressor gene in colon cancer. Immunogastroenterology 1, 27–33. doi: 10.7178/ig.1.1.7
Sasso, C. V., Santiano, F. E., Campo Verde Arbocco, F., Zyla, L. E., Semino, S. N., Guerrero-Gimenez, M. E., et al. (2019). Estradiol and progesterone regulate proliferation and apoptosis in colon cancer. Endocr. Connect 8, 217–229. doi: 10.1530/EC-18-0374
Serra, R. W., Fang, M., Park, S. M., Hutchinson, L., Green, M. R. (2014). A KRAS-directed transcriptional silencing pathway that mediates the CpG island methylator phenotype. Elife 3, e02313. doi: 10.7554/eLife.02313
Setia, S., Nehru, B., Sanyal, S. N. (2014). Upregulation of MAPK/Erk and PI3K/Akt pathways in ulcerative colitis-associated colon cancer. BioMed. Pharmacother. 68, 1023–1029. doi: 10.1016/j.biopha.2014.09.006
Shu, T., Li, Y., Wu, X., Li, B., Liu, Z. (2017). Down-regulation of HECTD3 by HER2 inhibition makes serous ovarian cancer cells sensitive to platinum treatment. Cancer Lett. 411, 65–73. doi: 10.1016/j.canlet.2017.09.048
Stebbing, J., Harrison, M., Glynne-Jones, R., Bridgewater, J., Propper, D. (2008). A phase II study to determine the ability of gefitinib to reverse fluoropyrimidine resistance in metastatic colorectal cancer (the INFORM study). Br. J. Cancer 98, 716–719. doi: 10.1038/sj.bjc.6604232
Strey, C. W., Schamell, L., Oppermann, E., Haferkamp, A., Bechstein, W. O., Blaheta, R. A. (2011). Valproate inhibits colon cancer growth through cell cycle modification in vivo and in vitro. Exp. Ther. Med. 2, 301–307. doi: 10.3892/etm.2011.202
Sun, X., Du, P., Yuan, W., Du, Z., Yu, M., Yu, X., et al. (2015). Long non-coding RNA HOTAIR regulates cyclin J via inhibition of microRNA-205 expression in bladder cancer. Cell Death Dis. 6, e1907. doi: 10.1038/cddis.2015.269
Tang, X. B., Li, H., Zhang, J., Wang, W. L., Yuan, Z. W., Bai, Y. Z. (2018). Expression pattern of Wif1 and beta-catenin during development of anorectum in fetal rats with anorectal malformations. PeerJ 6, e4445. doi: 10.7717/peerj.4445
Taniue, K., Kurimoto, A., Takeda, Y., Nagashima, T., Okada-Hatakeyama, M., Katou, Y., et al. (2016). ASBEL-TCF3 complex is required for the tumorigenicity of colorectal cancer cells. Proc. Natl. Acad. Sci. U. S. A. doi: 10.1073/pnas.1605938113
Tashiro, K., Tsunematsu, T., Okubo, H., Ohta, T., Sano, E., Yamauchi, E., et al. (2009). GAREM, a novel adaptor protein for growth factor receptor-bound protein 2, contributes to cellular transformation through the activation of extracellular signal-regulated kinase signaling. J. Biol. Chem. 284, 20206–20214. doi: 10.1074/jbc.M109.021139
Tippin, B. L., Levine, A. J., Materi, A. M., Song, W.-L., Keku, T. O., Goodman, J. E., et al. (2012). Hematopoietic prostaglandin D synthase (HPGDS): A high stability, Val187Ile isoenzyme common among African Americans and its relationship to risk for colorectal cancer. Prostaglandins Other Lipid Mediators 97, 22–28. doi: 10.1016/j.prostaglandins.2011.07.006
Tippin, B. L., Kwong, A. M., Inadomi, M. J., Lee, O. J., Park, J. M., Materi, A. M., et al. (2014). Intestinal tumor suppression in ApcMin/+ mice by prostaglandin D2 receptor PTGDR. Cancer Med. 3, 1041–1051. doi: 10.1002/cam4.251
Vera, O., Jimenez, J., Pernia, O., Rodriguez-Antolin, C., Rodriguez, C., Sanchez Cabo, F., et al. (2017). DNA methylation of miR-7 is a mechanism involved in platinum response through mafg overexpression in cancer cells. Theranostics 7, 4118–4134. doi: 10.7150/thno.20112
Wang, X. L., Shi, W. P., Shi, H. C., Lu, S. C., Wang, K., Sun, C., et al. (2016a). Knockdown of TRIM65 inhibits lung cancer cell proliferation, migration and invasion: A therapeutic target in human lung cancer. Oncotarget 7, 81527–81540. doi: 10.18632/oncotarget.13131
Wang, Y. P., Song, G. H., Chen, J., Xiao, C., Li, C., Zhong, L., et al. (2016b). Elevated OCT1 participates in colon tumorigenesis and independently predicts poor prognoses of colorectal cancer patients. Tumour Biol. 37, 3247–3255. doi: 10.1007/s13277-015-4080-0
Weinlander, G., Kornek, G., Raderer, M., Hejna, M., Tetzner, C., Scheithauer, W. (1997). Treatment of advanced colorectal cancer with doxorubicin combined with two potential multidrug-resistance-reversing agents: high-dose oral tamoxifen and dexverapamil. J. Cancer Res. Clin. Oncol. 123, 452–455. doi: 10.1007/BF01372550
Wen, Z., Pan, T., Yang, S., Liu, J., Tao, H., Zhao, Y., et al. (2017). Up-regulated NRIP2 in colorectal cancer initiating cells modulates the Wnt pathway by targeting RORbeta. Mol. Cancer 16, 20. doi: 10.1186/s12943-017-0590-2
Wolpin, B. M., Clark, J. W., Meyerhardt, J. A., Earle, C. C., Ryan, D. P., Enzinger, P. C., et al. (2006). Phase I study of gefitinib plus FOLFIRI in previously untreated patients with metastatic colorectal cancer. Clin. Colorectal Cancer 6, 208–213. doi: 10.3816/CCC.2006.n.037
Xu, M., Chen, X., Lin, K., Zeng, K., Liu, X., Pan, B., et al. (2018). The long noncoding RNA SNHG1 regulates colorectal cancer cell growth through interactions with EZH2 and miR-154-5p. Mol. Cancer 17, 141. doi: 10.1186/s12943-018-0894-x
Zhang, B. L., Dong, F. L., Guo, T. W., Gu, X. H., Huang, L. Y., Gao, D. S. (2017). MiRNAs mediate gdnf-induced proliferation and migration of glioma cells. Cell Physiol. Biochem. 44, 1923–1938. doi: 10.1159/000485883
Zhang, M., Miao, F., Huang, R., Liu, W., Zhao, Y., Jiao, T., et al. (2018). RHBDD1 promotes colorectal cancer metastasis through the Wnt signaling pathway and its downstream target ZEB1. J. Exp. Clin. Cancer Res. 37, 22. doi: 10.1186/s13046-018-0687-5
Keywords: colorectal cancer, systems biology, system identification, system modeling, system model selection, genome-wide genetic and epigenetic network, drug discovery
Citation: Yeh S-J, Chen S-W and Chen B-S (2020) Investigation of the Genome-Wide Genetic and Epigenetic Networks for Drug Discovery Based on Systems Biology Approaches in Colorectal Cancer. Front. Genet. 11:117. doi: 10.3389/fgene.2020.00117
Received: 25 November 2019; Accepted: 31 January 2020;
Published: 06 March 2020.
Edited by:Christiane Pienna Soares, Sao Paulo State University, Brazil
Reviewed by:Cibele Cardoso, University of São Paulo, Brazil
Claudia Cava, National Research Council, Italy
Copyright © 2020 Yeh, Chen and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Bor-Sen Chen, email@example.com