- 1Department of Medical Informatics and Computer, Zhengzhou Health College, Zhengzhou, Henan, China
- 2School of Basic Medical Sciences, Henan University, Kaifeng, Henan, China
Introduction: Colorectal cancer (CRC) remains a leading cause of cancer-related mortality worldwide. Although diagnostic and therapeutic strategies have advanced, the molecular mechanisms driving CRC pathogenesis are not fully understood, highlighting the need for novel biomarkers and therapeutic agents.
Methods: Integrated bioinformatics analyses of transcriptomic datasets from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) were performed to identify survival-associated differentially expressed genes (DEGs) in CRC (|log2FC| > 1.5, p < 0.05). These key DEGs were then used to query the L1000FWD, DGIdb, and CMap platforms to predict candidate small-molecule drugs. The top candidate was evaluated by molecular docking, and its anti-tumor effects were validated by MTT cell viability assays in CRC cell lines.
Results: We identified 15 survival-associated DEGs—MELK, NFE2L3, MCM2, MAD2L1, AUNIP, CXCL3, GLDN, GREM2, ALDH1A1, CILP, FABP4, AOC3, CNN1, ANGPTL1, and DES—as potential early diagnostic biomarkers for CRC. Drug-repositioning analyses convergently highlighted SB-225002 as a promising therapeutic agent. Molecular docking showed high binding affinity of SB-225002 to multiple key targets. MTT assays demonstrated that SB-225002 exerted dose-dependent inhibitory effects on the proliferation of CRC cell lines (SW-480, DLD-1, and MC38), with IC50 values of 2.307 μM, 0.9456 μM, and 3.449 μM, respectively.
Discussion: This study systematically delineates a novel panel of early-detection biomarkers for CRC and identifies SB-225002 as a repurposed candidate therapeutic agent. The integrative strategy combining multi-cohort transcriptomic analysis, drug-repositioning platforms, molecular docking, and experimental validation offers a feasible framework for discovering clinically actionable biomarkers and small-molecule therapies for CRC.
1 Introduction
According to the 2022 Global Cancer Statistic report, CRC is the second leading cause of cancer-related deaths, accounting for 9.6% of newly diagnosed cancer cases and 9.3% of cancer-related deaths worldwide each year (Bray et al., 2024). Epidemiological research has shown that the incidence and mortality of CRC are closely associated with a spectrum of risk factors such as genetic variations, ethnicity, family history, age, high-fat intake, red meat consumption, obesity, alcohol consumption, smoking, and lack of physical activity (Shams-White et al., 2019). Currently, surgery and chemotherapy are the primary clinical treatment strategies for CRC (Salibasic et al., 2019). While Surgical treatment is mainly applicable to patients diagnosed at an early stage, approximately 60%–70% of symptomatic CRC cases are diagnosed at a later stage (Shinagawa et al., 2018). Despite aggressive clinical treatment regimens, the prognosis of patients with advanced CRC remains poor due to metastasis (Kvietkauskas et al., 2021). Therefore, there is an urgent need to identify effective pharmacological agents and develop novel therapeutic strategies to improve CRC prognosis.
With the rapid advancement of genomics, new opportunities have emerged for CRC treatment. Genomic technologies enable an in-depth understanding of the molecular pathogenesis of CRC and the identification of biomarkers and therapeutic targets associated with CRC prognosis (Van Schaeybroeck et al., 2011; Jiang et al., 2022). Drug repurposing has emerged as a highly promising strategy in oncological research, offering significant advantages over traditional drug development, including shorter development cycles, reduced costs, and a more favorable safety profile (Xia et al., 2024). This approach is particularly relevant for CRC, where recent biomarker studies have identified key therapeutic targets. For instance, Squalene Epoxidase (SQLE) has been found to promote CRC progression by inhibiting cancer cell apoptosis and disrupting the intestinal flora balance. Consequently, the conventional antifungal SQLE inhibitor, terbinafine, is now considered a potential therapeutic agent for CRC (Li et al., 2022). Similarly, the overexpression of Mitogen-Activated Protein Kinase Kinase 3 (MKK3) is known to drive CRC development. AT9283, a drug previously investigated for hematological malignancies, has been identified as a potential MKK3 inhibitor for CRC treatment (Piastra et al., 2024). Furthermore, recent studies have indicated that the traditional antipsychotic drug, aripiprazole, also exhibits anti-tumor efficacy against CRC (Hu et al., 2024). However, although numerous studies have identified DEGs in CRC, a significant challenge remains in systematically integrating these gene signatures with multi-omics drug databases to efficiently discover drug repurposing candidates for CRC therapy. Our study aims to bridge this gap. To this end, we utilized an integrated bioinformatics approach combining several powerful databases. Complementary to each other, these platforms offer distinct yet synergistic functionalities: the L1000 Fireworks Display (L1000FWD) platform visualizes drug-induced gene expression signatures, facilitating drug mechanism elucidation and repurposing (Wang et al., 2018). The Drug-Gene Interaction Database (DGIdb) provides comprehensive information on druggable genes and their interactions (Griffith et al., 2013; Freshour et al., 2021). The Connectivity Map (CMap) compares disease-associated gene expression profiles with those induced by small molecules, generating connectivity scores to identify potential therapeutics (Zhao et al., 2023). The synergy of these tools offers a novel methodology for linking CRC-associated gene signatures to potential drugs.
This study aims to identify potential candidate drugs for CRC treatment through a comprehensive analysis of gene expression data. We obtained CRC gene datasets from the GEO and TCGA databases and identified DEGs significantly associated with patient survival. These prognostic DEGs were subsequently used as inputs for simultaneous screening in the L1000FWD, DGIdb, and CMap databases to pinpoint small-molecule drugs with inverse correlation patterns. From the resulting candidates, SB-225002 was selected for further investigation. Molecular docking studies confirmed its strong binding affinity to multiple target proteins, and in vitro experiments demonstrated its potent anti-proliferative effects on CRC cell lines.
2 Materials and methods
2.1 Data acquisition and processing
Gene expression FPKM (Fragments Per Kilobase of transcript per Million mapped reads) values and clinical characteristic data of CRC samples were obtained from The Cancer Genome Atlas (TCGA) database. TCGA database offers the advantages of a large sample size, complete clinical annotations, and standardized quality control. Additionally, three independent CRC gene expression datasets (GSE18105 (Matsuyama et al., 2010), GSE20842 (Gaedcke et al., 2010), and GSE87211 (Hu et al., 2018)) containing tumor and normal tissue samples were obtained from the Gene Expression Omnibus (GEO) database (Table 1). Microarray data were processed using the limma package for background correction and quantile normalization. Batch effects were then corrected using the ComBat algorithm from the sva package. Finally, genes with low expression across all samples were filtered out, and samples from the TCGA cohort with missing survival time or event status were excluded.
2.2 Clinical sample collection
Between December 2023 and April 2024, four fresh tumor tissue samples (along with paired normal tissue samples) were collected from CRC patients who underwent surgery at Huaihe Hospital of Henan University, China. All samples were stored at −80 °C for preservation. Prior to surgery, all patients were fully informed about the study and signed an informed consent form. This study was reviewed and approved by the Clinical Research Ethics Committee of Huaihe Hospital of Henan University. Ethical Approval Code: IEC-C-010-A07-V2.0.
2.3 Total RNA extraction
Approximately 0.5 cm × 0.5 cm of tissue was collected and homogenized in 200 μL of Trizol lysis buffer using a tissue homogenizer. An additional 800 μL of Trizol was added, and the mixture was incubated at room temperature for 5 min. Subsequently, 200 μL of chloroform was added, followed by thorough mixing by inversion for 3 min. The samples were then centrifuged at 12,000 × g for 15 min at 4 °C, and the aqueous phase was transferred to a new RNase-free EP tube. An equal volume of isopropanol was added, mixed, and incubated at room temperature for 10 min. The mixture was centrifuged again at 12,000 × g for 10 min at 4 °C, and the supernatant was discarded. The RNA pellet was washed with 1 mL of 75% ethanol, followed by centrifugation at 7,500 × g for 5 min at 4 °C. After discarding the supernatant, the RNA pellet was air-dried for 5–10 min, then resuspended in RNase-free water (Chomczynski and Sacchi, 2006).
2.4 RNA sequencing
Extracted total RNA was sent to a sequencing company for transcriptome sequencing (RNA-seq). RNA sequencing libraries were constructed, and sequencing was performed using the Illumina NovaSeq 6000 platform (PE150 mode) with a sequencing depth of 10 Gb reads per sample.
2.5 Differential gene expression analysis of CRC
The limma package in R was used to analyze DEGs between CRC tissues and normal tissues from public datasets (TCGA-CRC, GSE18105, GSE20842, and GSE87211) and our collected clinical samples. Genes with p < 0.05 and |log2 FC| > 1.5 were considered significantly differentially expressed.
2.6 Differential gene expression and functional enrichment analysis
Diffrentially Expressed Genes (DEGs)were identified using the “limma” R package, with a threshold of adjusted p < 0.05 and |log2 FC| > 1.5. The overlapping significantly up- and downregulated differentially expressed genes (DEGs) common to both the TCGA and GEO datasets were identified. To elucidate their biological functions, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were subsequently performed using the “clusterProfiler” R package. Visualization of the enrichment results was conducted using the “ggplot2” R package.
2.7 Screening of prognosis-associated genes
To investigate the impact of DEGs on CRC prognosis, univariate Cox regression analysis was performed using “survival” R packages. Genes with a significant association with CRC prognosis were identified using univariable Cox proportional hazards regression (p < 0.05), and Kaplan–Meier survival curves along with log-rank tests were used to compare survival differences between high- and low-expression groups.
2.8 Validation of gene expression
To enhance the reliability of the identified DEGs, validation was performed using our own sequencing data. Genes that |log2 FC| > 1.5 and p < 0.05 in differential expression analysis, as well as p < 0.05 in t-tests, were included in the subsequent drug prediction analysis.
The Human Protein Atlas (HPA) online database (www.proteinatlas.org) provides information on the tissue and cellular distribution of 26,000 human proteins, primarily using specific antibodies to analyze protein expression in cell lines, normal tissues, and tumor tissues (Gu et al., 2020). We utilized this database to verify the expression patterns of 15 key DEGs in normal colon tissue and CRC tissue.
2.9 Screening of small-molecule drugs
L1000FWD (https://maayanlab.cloud/l1000fwd/), DGIdb (https://www.dgidb.org/), and CMap (https://clue.io/) are essential tools for drug discovery research (Wen and Ablimit, 2024; Cannon et al., 2024; Gao et al., 2019). We analyzed CRC-associated DEGs that were significantly related to patient survival using these three databases. Small-molecule drugs that exhibited an inverse correlation with these DEGs were identified, leading to the determination of potential candidate drugs for CRC treatment.
2.10 Molecular docking
PubChem is the world’s largest public database for chemical information (Kim et al., 2025), while Universal Protein (UniProt) is one of the most comprehensive and widely used protein databases, containing over 220 million protein entries (Wang et al., 2021). The chemical data and structures of the selected drugs were retrieved from PubChem (https://pubchem.ncbi.nlm.nih.gov/), and the X-ray crystal structures of the target proteins were obtained from UniProt (https://www.uniprot.org/). Only human protein structures were used.
To evaluate the screened small-molecule drugs, molecular docking was performed to examine their binding reliability to target proteins based on binding energy calculations. Molecular docking studies were conducted using SYBYL-X software (Palafox et al., 2023). The Surflex-Dock module in SYBYL-X 2.1.1 was used for protein preprocessing, including removal of crystallographic water and ligands, addition of hydrogen atoms and charges, and force field optimization. The “automatic” mode was used to generate active binding sites on the target protein, followed by docking with small-molecule drug Mol2 files. The docking results were visualized using PyMOL 2.6, a widely used software for the visualization and analysis of biomolecules (Rosignoli and Paiardini, 2022).
2.11 MTT assay
The MTT assay is widely used to assess the cytotoxic effects of drugs on cells by evaluating cell viability (Ghasemi et al., 2023). CRC cells in the logarithmic growth phase were digested, centrifuged, and counted, then seeded in 96-well plates. For drug treatment, the selected drug (SB-225002) was prepared into a series of gradient concentrations (0.0075, 0.195, 0.39, 1.56, 3.125, 6.25, 12.5, 25 μM) using complete medium; 100 μL of the drug solution, with three replicate wells per concentration. After that, cells were incubated in a 37 °C, 5% CO2 cell culture incubator (Thermo Scientific, United States) for 48 h. Subsequently, 15 μL of MTT solution (5 mg/mL) was added to each well, followed by incubation at 37 °C for 4 h under light-protected conditions. The medium and MTT were discarded, and 100 μL of dimethyl sulfoxide (DMSO) was added. Optical density (OD) was measured at 490 nm using a microplate reader. The IC50 value was calculated using GraphPad software.
2.12 Statistical analysis
Statistical analyses were conducted using GraphPad Prism 8.0 and R software (version 4.2.2). All experimental data are presented as mean ± standard deviation (SD). Continuous variables with a normal distribution were analyzed using t-tests. For data not following a normal distribution, log2 transformation was applied prior to t-test analysis. Kaplan–Meier survival analysis was performed using the log-rank test. p < 0.05 was considered statistically significant.
3 Results
3.1 Identification of differentially expressed genes
In the TCGA-CRC dataset, a total of 1,233 DEGs were identified, including 389 upregulated genes and 844 downregulated genes. After normalization, differential expression analysis was performed on the CRC microarray datasets GSE18105, GSE20842, and GSE87211. Specifically, the GSE18105 dataset contained 944 DEGs, with 515 upregulated and 429 downregulated genes. The GSE20842 dataset had 637 DEGs, including 298 upregulated and 339 downregulated genes. The GSE87211 dataset identified 1,349 DEGs, with 591 upregulated and 758 downregulated genes. The DEGs from these four datasets are presented in Figure 1. A Venn diagram analysis of upregulated and downregulated genes across the datasets identified 180 common upregulated genes (Figure 2A) and 308 common downregulated genes (Figure 2B).
Figure 1. Differential Expression Analysis Based on GEO and TCGA Databases. (A) TCGA-CRC, (B) GSE18105, (C) GSE20842, (D) GSE87211. The red dots represent the upregulated genes based on an adjusted P < 0.05 and |log fold change| > 1.5; the blue dots represent the downregulated genes based on an adjusted P < 0.05 and |log fold change| > 1.5; the black spots represent genes with no significant difference in expression.
Figure 2. Intersection Analysis of Differentially Expressed Genes. (A) Venn Diagram of Upregulated Genes: Intersection of upregulated genes from the TCGA-CRC, GSE18105, GSE20842, and GSE87211 datasets. (B) Venn Diagram of Downregulated Genes: Intersection of downregulated genes from the same datasets.
3.2 KEGG and GO enrichment analysis of DEGs
GO and KEGG enrichment analyses were conducted for the common DEGs. The biological process (BP) enrichment analysis of upregulated genes revealed that these genes were significantly involved in organelle fusion, chromosome segregation, and nuclear division (Figure 3A). Molecular function (MF) enrichment analysis indicated strong associations with G-protein coupled receptor binding, cytokine activity, and chemokine receptor binding (Figure 3B). Cellular component (CC) enrichment analysis showed significant enrichment in chromosome regions, spindle apparatus, and centromeric regions (Figure 3C). KEGG pathway enrichment analysis suggested that these genes were likely involved in viral protein interactions with cytokines, the cell cycle, and the chemokine signaling pathway (Figure 3D). For the downregulated genes, BP enrichment analysis indicated associations with alcohol metabolism, stress response to metal ions, and copper detoxification (Figure 3E). MF enrichment analysis demonstrated significant relationships with phospholipase activity, lipase activity, and carboxylesterase hydrolase activity (Figure 3F). CC enrichment analysis showed enrichment in apical parts of the plasma membrane, brush border, and microvilli (Figure 3G). KEGG pathway analysis suggested that these genes were involved in mineral absorption, arachidonic acid metabolism, and nitrogen metabolism (Figure 3H). These results provide crucial insights into the biological roles of the identified DEGs.
Figure 3. Enrichment analysis of differentially expressed genes. (A–D) enrichment of upregulated genes: (A) Biological process; (B) molecular function; (C) cellular component; (D) KEGG pathway; (E–H) enrichment of downregulated genes; (E) biological process; (F) molecular function; (G) cellular component; (H) KEGG pathway.
3.3 Screening of therapeutic targets
To identify key genes with the most significant impact on CRC prognosis, a univariate Cox regression model was applied to the 180 commonly upregulated genes, revealing 13 genes (ACSL6, AUNIP, CXCL3, CXCL8, CXCL11, EPHX4, GPR143, LRRC8, MAD2L1, MCM2, MELK, MMP3, and NFE2L3) associated with overall survival (OS), all of which acted as protective factors (Figure 4A). Among the 308 commonly downregulated genes, 16 genes were significantly correlated with OS (Figure 4B), with 9 genes (ALDH1A1, ANGPTL1, AOC3, CLP, CNN1, DES, FABP4, GLDN, and GREM2) acting as risk factors, while seven genes (B3GNT6, CLCA1, DENND2A, F2RL1, FDCSP, ITLN1, and MS4A1) functioned as protective factors. Kaplan-Meier survival curves were generated for the 13 upregulated genes (Figure 5) and the 16 downregulated genes (Figure 6), demonstrating statistically significant differences (log-rank p < 0.05).
Figure 4. Univariate COX Regression Analysis of Gene Prognostic Relevance. (A) Prognostic significance of 13 upregulated genes; (B) Prognostic significance of 16 downregulated genes.
Figure 5. Kaplan-Meier survival curves for genes with upregulated expression. (A) ACSL6; (B) AUNIP; (C) CXCL3; (D) CXCL8; (E) CXCL11; (F) EPHX4; (G) GPR143; (H) LRRC8; (I) MAD2L1; (J) MCM2; (K) MELK; (L) MMP3; (M) NFE2L3.
Figure 6. Kaplan-Meier survival curves for genes with downregulated expression. (A) ALDH1A1; (B) ANGPTL1; (C) AOC3; (D) B3GNT6; (E) CLP; (F) CLCA1; (G) CNN1; (H) DENND2A; (I) DES; (J) F2RL1; (K) FABP4; (L) FDCSP; (M) GLDN; (N) GREM2; (O) ITLN1; (P) MS4A1.
Principal Component Analysis (PCA) was performed on clinical tumor samples to assess group differences and biological reproducibility within groups. As shown in (Figure 7A), samples within each group were highly similar, whereas CRC tissues and normal tissues exhibited significant inter-group differences, supporting further transcriptional profiling analyses. Differential expression analysis between CRC and normal tissues identified 2170 DEGs, with 1437 upregulated genes and 733 downregulated genes (Figure 7B). Intersecting these with 29 prognostic DEGs from four public datasets yielded 22 common genes (Figure 7C). To ensure robustness, we further validated these 22 candidates using a t-test on our clinical sequencing data (log2(FPKM+1)), which confirmed 15 significant DEGs (p < 0.05) for subsequent drug screening (Figure 7D).
Figure 7. Transcriptome Analysis and Differential Gene Expression Verification in CRC Tissues and Normal Tissues. (A) Two-dimensional Principal Component Analysis (PCA) of transcriptome sequencing samples; (B) Differentially Expressed Genes (DEGs) between CRC tissues and normal tissues in clinical tumor samples; (C,D) Parametric tests for the expression levels of common differential genes identified in both public datasets and this study’s samples, *P < 0.05, **P < 0.01, ***P < 0.001.
3.4 Validation of DEGs at the protein level
Protein expression levels of four key genes were analyzed using immunohistochemistry data from the Human Protein Atlas (HPA) database. Among them, CXCL3 expression was not detected. As shown in (Figure 8), compared with normal colonic tissues, MMP3 and NFE2L3 were upregulated in CRC tissues, while CLCA1 was downregulated, aligning with our previous predictions.
Figure 8. The protein expression levels of 15 key genes in normal intestinal tissue and CRC tissue were verified through the HPA online website. (A) MCM2; (B) GLDN; (C) GREM2; (D) AOC3. (MELK, NFE2L3, MAD2L1, AUNIP, CXCL3, ALDH1A1, CILP, FABP4, CNN1, ANGPTL1, DES were not found).
3.5 Online drug screening for candidate small molecules
Using the L1000FWD, DGIdb, and CMap databases, six upregulated genes and nine downregulated gene were used for a drug screening analysis, identifying six small-molecule drug candidates: MG-132, H-89, BMS-387032, JNJ-7706621, NVP-TAE684, and SB-225002 (Figure 9). MG-132, a proteasome inhibitor, showed anticancer effects but had severe side effects due to widespread inhibition of protein degradation (Fatima et al., 2020). H-89, a PKA inhibitor, was effective only in cancers driven by the PKA signaling pathway, limiting its broader application (Xu et al., 2015). BMS-387032 and JNJ-7706621, CDK kinase inhibitors, and NVP-TAE684, an ALK inhibitor, were too target-specific, restricting their clinical applicability (Ki et al., 2022). SB-225002, a chemokine receptor antagonist, modulates inflammatory cell migration, potentially inhibiting tumor-associated inflammation and disrupting pro-tumor inflammatory pathways, making it a promising candidate for cancer treatment (Sun and Yuan, 2022). Based on the screening results from the three platforms, and after a comprehensive evaluation of their mechanisms of action and potential side effects, SB-225002 was selected as the most promising candidate drug.
Figure 9. Intersection of potential therapeutic targets identified from the L1000FWD, DGIdb, and CMap databases.
3.6 Molecular docking validation
Detailed information about SB-225002 was obtained from PubChem (https://pubchem.ncbi.nlm.nih.gov/). SB-225002 (PubChem CID: 3854666) has a molecular weight of 352.14 g/mol and a molecular formula of C13H10BrN3O4. The 2D structure of the compound is shown in (Figure 10). The SDF file of SB-225002 was downloaded from PubChem and converted into a Mol2 file using Open Babel GUI for use as a ligand in molecular docking studies.
The 3D structures of the target proteins were retrieved and prepared for docking, including MELK (PDB: 4IXP), NFE2L3 (AlphaFoldDB: AF-Q9Y4A8-F1-v4), MCM2 (AlphaFoldDB: AF-P49736-F1-v4), MAD2L1 (PDB: 3GMH), AUNIP (AlphaFoldDB: AF-Q9H7T9-F1-v4), CXCL3 (AlphaFoldDB: AF-P19876-F1-v4), GLDN (AlphaFoldDB: AF-Q6ZMI3-F1-v4), GREM2 (AlphaFoldDB: AF-Q9H772-F1-v4), ALDH1A1 (AlphaFoldDB: AF-P00352-F1-v4), CILP (AlphaFoldDB: AF-O75339-F1-v4), FABP4 (PDB: 3Q6L), AOC3 (PDB: 1PU4), CNN1 (AlphaFoldDB: AF-P51911-F1-v4), ANGPTL1 (AlphaFoldDB: AF-O95841-F1-v4), and DES (AlphaFoldDB: AF-P17661-F1-v4). These proteins were used as receptors for docking analysis.
High-precision molecular docking was performed using SYBYL-X 2.1.1. The docking scores included Total-score, CSCORE, Crash, and D-score. In this study, Total-score and CSCORE were used as the primary indicators of binding affinity. In general, a Total-score close to 4.0 indicates a reasonable binding activity, with higher scores suggesting stronger binding affinity (Feng et al., 2022). Similarly, a CSCORE approaching 4.0 signifies better spatial complementarity and energy compatibility between the ligand and receptor (Xue et al., 2023). The detailed docking results are presented in Table 2.
SB-225002 formed five hydrogen bonds with MELK, specifically at ARG-65 (1.9 Å, 2.3 Å), ALA-322 (2.0 Å), GLU-142 (2.2 Å), and ARG-326 (2.7 Å) (Figure 11A). It also formed five hydrogen bonds with NFE2L3 at SER-396 (2.2 Å, 2.1 Å), GLN-624 (2.2 Å), THR-399 (2.4 Å), and GLU-400 (1.9 Å) (Figure 11B). For MCM2, SB-225002 established four hydrogen bonds at GLU-455 (1.9 Å, 2.3 Å, 2.4 Å) and HIS-287 (2.2 Å) (Figure 11C). With MAD2L1, it formed three hydrogen bonds at CYS-106 (1.9 Å, 2.1 Å) and LYS-108 (2.7 Å) (Figure 11D). For AUNIP, two hydrogen bonds were observed at PHE-272 (2.0 Å, 2.1 Å) (Figure 11E).
Figure 11. Molecular Docking verification of SB-225002 with the target protein. (A) MELK; (B) NFE2L3; (C) MCM2; (D) MAD2L1; (E) AUNIP; (F) CXCL3; (G) GLDN; (H) GREM2; (I) ALDH1A1; (J) CILP; (K) FABP4; (L) AOC3; (M) CNN1; (N) ANGPTL1; (O) DES. In this figure, green representatives the target protein; Brown indicates SB-225002; The protein residues that interact with SB-225002 are shown in blue. Yellow indicates hydrogen bonds.
SB-225002 also interacted with CXCL3, forming three hydrogen bonds at SER-36 (2.0 Å) and PRO-67 (2.0 Å, 2.0 Å) (Figure 11F). With GLDN, it established four hydrogen bonds at ASP-110 (2.0 Å, 2.0 Å, 2.7 Å, 1.9 Å) (Figure 11G). For GREM2, four hydrogen bonds were observed at ASP-71 (2.1 Å, 2.2 Å), CYS-157 (2.0 Å), and ARG-156 (2.1 Å) (Figure 11H). The docking of SB-225002 with ALDH1A1 resulted in three hydrogen bonds at ILE-304 (2.0 Å), CYS-303 (2.0 Å), and ASN-170 (2.5 Å) (Figure 11I).
Further analysis revealed six hydrogen bonds between SB-225002 and CILP at MET-311 (1.9 Å, 2.0 Å, 2.2 Å, 2.3 Å), ARG-536 (2.0 Å), and GLU-535 (2.2 Å) (Figure 11J). Additionally, one hydrogen bond was formed with CILP at ALA-76 (1.9 Å) (Figure 11K). The docking with AOC3 resulted in four hydrogen bonds at GLU-486 (2.1 Å, 2.5 Å), ARG-463 (1.9 Å), and LEU-438 (2.0 Å) (Figure 11L). CNN1 formed one hydrogen bond with SB-225002 at LYS-143 (2.0 Å) (Figure 11M). ANGPTL1 established three hydrogen bonds at GLU-197 (2.0 Å, 2.1 Å) and SER-190 (2.1 Å) (Figure 11N). Finally, DES formed two hydrogen bonds at LYS-339 (2.8 Å) and ASN-342 (1.9 Å) (Figure 11O).
3.7 In vitro experimental validation
The MTT assay was used to assess the effects of SB-225002 on CRC cell viability (SW-480 cells, DLD-1 cells and MC 38 cells). Results showed a dose-dependent inhibition of cell viability. The IC50 value of SB-225002 toward SW-480 cells was 2.307 μM. The IC50 value of SB-225002 toward DLD-1 cells was 0.9456 μM. The IC50 value of SB-225002 toward MC 38 cells was 3.449 μM, indicating that SB-225002 significantly inhibited the proliferation of CRC cells at lower concentrations (Figure 12).
4 Discussion
CRC is currently the third most common cancer worldwide, accounting for approximately 10% of all cancer cases. It is also the second leading cause of cancer-related mortality globally. The primary treatment strategy for CRC remains surgical resection, supplemented by chemotherapy and radiotherapy. However, the efficacy of chemotherapy has reached a plateau, and molecular targeted therapy has become the main approach to improving the prognosis of patients with advanced CRC. The molecular mechanisms underlying CRC initiation and progression remain to be fully elucidated, making it essential to identify novel biomarkers with clinical, pathological, and prognostic significance to guide CRC treatment (Lişcu et al., 2024). Targeted therapy, as an emerging approach, enhances the precision of chemotherapy by selectively targeting cancer cells while preserving normal tissues (Svec et al., 2024). Given the current limitations in predictive biomarkers for CRC, the early prediction and targeted treatment of CRC, as well as the identification of effective and specific biomarkers, remain critical tasks.
In this study, A total of 15 DEGs with p < 0.05, including MELK, NFE2L3, MCM2, MAD2L1, AUNIP, CXCL3, GLDN, GREM2, ALDH1A1, CILP, FABP4, AOC3, CNN1, ANGPTL1, and DES, were identified for subsequent drug screening research. Some of these genes (such as CXCL3 (Liao et al., 2025), MCM2 (Tian et al., 2024), NFE2L3 (Saliba et al., 2022), MAD2L1 (Gharebaghi et al., 2024), MELK (Tang et al., 2022), ANGPTL1 (Chang et al., 2022), GREM2 (Zhang and Zhu, 2022), GLDN (Su et al., 2022), VAP-1 (Li et al., 2014), FABP-4 (Zhou et al., 2014), ALDH1A1 (Wang X. et al., 2025), CNN1 (Wang L. et al., 2025), DES (Ma et al., 2009)) have previously been confirmed to be associated with CRC, which further validates our research results. However, the roles of AUNIP and CILP in this malignant tumor have not been fully studied. Detailed information can be found in Tables 3 and 4. In the future, our research will focus on these differentially expressed genes to elucidate their specific contributions to the tumorigenesis of CRC.
L1000FWD, DGIdb and CMap platforms were used to screen drugs, and molecular docking analysis was performed. Although some genes identified here (e.g., CXCL3, MCM2, MELK) are known in CRC (Liao et al., 2025; Tian et al., 2024; Tang et al., 2022), our study distinguishes itself through a rigorous integrative bioinformatics workflow focused on druggable target discovery. We prioritized not only differential expression but also prognostic significance and validation in a clinical cohort, yielding a small but robust set of high-confidence genes. Drug prediction analysis applied to this refined gene set has uncovered potential therapeutic agents, such as SB-225002, highlighting the pipeline’s utility for translational research.
Further in vitro experiments verified SB-225002s efficacy and safety, establishing it as an ideal therapeutic agent for CRC, but the direct evidence regarding the anti-inflammatory and anti-metastatic effects of SB-225002 in CRC remains limited. However, as a selective CXCR2 antagonist (White et al., 1998), SB-225002 has demonstrated promising therapeutic activity across multiple other cancer types through diverse mechanisms. For example, in breast cancer, SB-225002 was shown to suppress bone metastasis by inhibiting the CXCL5–CXCR2 signaling axis (Romero-Moreno et al., 2019). In nasopharyngeal carcinoma, it exerted anti-tumor effects by reducing phosphorylated AKT (p-AKT) levels (Lo et al., 2013). Similarly, in prostate cancer, SB-225002 inhibited AKT phosphorylation, thereby blocking the PI3K signaling pathway, while also suppressing the secretion of bone sialoprotein and osteopontin, ultimately attenuating metastatic progression (Xu et al., 2018). In lung cancer, it reduced neutrophil infiltration in the tumor microenvironment and enhanced CD8+ T lymphocyte activation, contributing to anti-tumor immunity (Cheng et al., 2021). Furthermore, in mantle cell lymphoma, SB-225002 overcame drug resistance by inhibiting the activation of Akt, STAT3, and p38 signaling pathways and reprogramming lymphoma-associated macrophages (Sun et al., 2025). Given these broad mechanisms observed in other malignancies, our future research will focus on elucidating the anti-inflammatory and anti-metastatic potential of SB-225002 specifically in the context of CRC.
5 Conclusion
Through the analysis of multiple TCGA datasets, ACSL6, AUNIP, CXCL3, CXCL8, CXCL11, EPHX4, GPR143, LRRC8, MAD2L1, MCM2, MELK, MMP3, and NFE2L3 were identified as potential therapeutic biomarkers for CRC. Internal and external database validation further confirmed the predictive significance of 15 key genes. Among them, SB-225002 emerged as a promising candidate for CRC treatment. Subsequent in vitro experiments validated the efficacy and safety of SB-225002, making it a potential therapeutic agent for CRC. However, further research is required to evaluate the clinical potential of SB-225002 as a targeted therapy for CRC.
Data availability statement
The datasets generated in this study have been deposited in the National Population Health Data Center Data Warehouse (PHDA, 2024) under the DOI https://doi.org/10.12213/11.Z0I2Q.202411.267.V1.0, and all other datasets used in the analyses are available from the corresponding public repositories as described in the manuscript.
Ethics statement
The studies involving humans were approved by Clinical Research Ethics Committee of Huaihe Hospital, Henan University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. Ethical approval was not required for the studies on animals in accordance with the local legislation and institutional requirements because only commercially available established cell lines were used.
Author contributions
QL: Writing – review and editing, Writing – original draft, Formal Analysis. YX: Writing – original draft, Validation, Methodology. YY: Formal Analysis, Methodology, Writing – original draft. ML: Writing – original draft. ZY: Writing – original draft, Methodology, Visualization. XC: Writing – review and editing. LZ: Supervision, Writing – review and editing, Project administration, Conceptualization. XJ: Conceptualization, Writing – review and editing.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This work has been financed by the Key R&D and Promotion Projects in Henan Province (Nos. 242102310407, 242102310096 and 242102310448), Key Scientific Research Projects of Colleges and Universities in Henan Province (No. 23A310011), Medical Education Research Project of Henan Provincial Health Commission (No. Wjlx2022208), Kaifeng science and technology development plan project (No. 2303010).
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Boord, J. B., Fazio, S., and Linton, M. F. (2002). Cytoplasmic fatty acid-binding proteins: emerging roles in metabolism and atherosclerosis. Curr. Opin. Lipidol. 13 (2), 141–147. doi:10.1097/00041433-200204000-00005
Bray, F., Laversanne, M., Sung, H., Ferlay, J., Siegel, R. L., Soerjomataram, I., et al. (2024). Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 74 (3), 229–263. doi:10.3322/caac.21834
Cannon, M., Stevenson, J., Stahl, K., Basu, R., Coffman, A., Kiwala, S., et al. (2024). DGIdb 5.0: rebuilding the drug-gene interaction database for precision medicine and drug discovery platforms. Nucleic Acids Res. 52 (D1), D1227–D1235. doi:10.1093/nar/gkad1040
Chang, T. Y., Lan, K. C., Chiu, C. Y., Sheu, M. L., and Liu, S. H. (2022). ANGPTL1 attenuates cancer migration, invasion, and stemness through regulating FOXO3a-mediated SOX2 expression in colorectal cancer. Clin. Sci. (Lond) 136 (9), 657–673. doi:10.1042/CS20220043
Cheng, Y., Mo, F., Li, Q., Han, X., Shi, H., Chen, S., et al. (2021). Targeting CXCR2 inhibits the progression of lung cancer and promotes therapeutic effect of cisplatin. Mol. Cancer 20 (1), 62. doi:10.1186/s12943-021-01355-1
Chomczynski, P., and Sacchi, N. (2006). The single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction: twenty-something years on. Nat. Protoc. 1 (2), 581–585. doi:10.1038/nprot.2006.83
Fatima, A., Irmak, D., Noormohammadi, A., Rinschen, M. M., Das, A., Leidecker, O., et al. (2020). The ubiquitin-conjugating enzyme UBE2K determines neurogenic potential through histone H3 in human embryonic stem cells. Commun. Biol. 3 (1), 262. doi:10.1038/s42003-020-0984-3
Feng, S., Wang, T., Fan, L., An, X., Ding, X., Wang, M., et al. (2022). Exploring the potential therapeutic effect of eucommia ulmoides-Dipsaci radix herbal pair on osteoporosis based on network pharmacology and molecular docking technology. RSC Adv. 12 (4), 2181–2195. doi:10.1039/d1ra05799e
Freshour, S. L., Kiwala, S., Cotto, K. C., Coffman, A. C., McMichael, J. F., Song, J. J., et al. (2021). Integration of the drug-gene interaction database (DGIdb 4.0) with open crowdsource efforts. Nucleic Acids Res. 49 (D1), D1144–D1151. doi:10.1093/nar/gkaa1084
Gaedcke, J., Grade, M., Jung, K., Camps, J., Jo, P., Emons, G., et al. (2010). Mutated KRAS results in overexpression of DUSP4, a MAP-kinase phosphatase, and SMYD3, a histone methyltransferase, in rectal carcinomas. Genes. Chromosom. Cancer 49 (11), 1024–1034. doi:10.1002/gcc.20811
Gao, Y., Kim, S., Lee, Y. I., and Lee, J. (2019). Cellular stress-modulating drugs can potentially be identified by in silico screening with connectivity map (CMap). Int. J. Mol. Sci. 20 (22), 5601. doi:10.3390/ijms20225601
Gharebaghi, A., Afshar, S., Tapak, L., Ranjbar, H., Saidijam, M., and Dinu, I. (2024). Utilizing an in-silico approach to pinpoint potential biomarkers for enhanced early detection of colorectal cancer. Cancer Inf. 23, 1611033573. doi:10.1177/11769351241307163
Ghasemi, M., Liang, S., Luu, Q. M., and Kempson, I. (2023). The MTT assay: a method for error minimization and interpretation in measuring cytotoxicity and estimating cell viability. Methods Mol. Biol. 2644, 15–33. doi:10.1007/978-1-0716-3052-5_2
Griffith, M., Griffith, O. L., Coffman, A. C., Weible, J. V., McMichael, J. F., Spies, N. C., et al. (2013). DGIdb: mining the druggable genome. Nat. Methods 10 (12), 1209–1210. doi:10.1038/nmeth.2689
Gu, L., Liu, Y., Jiang, C., Sun, L., and Zhou, H. (2020). Identification and clinical validation of metastasis-associated biomarkers based on large-scale samples in colon-adenocarcinoma. Pharmacol. Res. 160, 105087. doi:10.1016/j.phrs.2020.105087
Ha, Y. J., Park, S. H., Tak, K. H., Lee, J. L., Kim, C. W., Kim, J. H., et al. (2024). CILP2 is a potential biomarker for the prediction and therapeutic target of peritoneal metastases in colorectal cancer. Sci. Rep. 14 (1), 12487. doi:10.1038/s41598-024-63366-4
Hu, Y., Gaedcke, J., Emons, G., Beissbarth, T., Grade, M., Jo, P., et al. (2018). Colorectal cancer susceptibility loci as predictive markers of rectal cancer prognosis after surgery. Genes. Chromosom. Cancer 57 (3), 140–149. doi:10.1002/gcc.22512
Hu, H. F., Fu, J. Y., Han, L., Gao, G. B., Zhang, W. X., Yu, S. M., et al. (2024). The antipsychotic drug aripiprazole suppresses colorectal cancer by targeting LAMP2a to induce RNH1/miR-99a/mTOR-Mediated autophagy and apoptosis. Adv. Sci. (Weinh) 11 (48), e2409498. doi:10.1002/advs.202409498
Jenab, M., Riboli, E., Cleveland, R. J., Norat, T., Rinaldi, S., Nieters, A., et al. (2007). Serum C-peptide, IGFBP-1 and IGFBP-2 and risk of colon and rectal cancers in the European prospective investigation into cancer and nutrition. Int. J. Cancer 121 (2), 368–376. doi:10.1002/ijc.22697
Jiang, H., Zhang, X., Wu, Y., Zhang, B., Wei, J., Li, J., et al. (2022). Bioinformatics identification and validation of biomarkers and infiltrating immune cells in endometriosis. Front. Immunol. 13, 944683. doi:10.3389/fimmu.2022.944683
Kim, M. J., Pandit, S., and Jee, J. G. (2022). Discovery of kinase and carbonic anhydrase dual inhibitors by machine learning classification and experiments. Pharm. (Basel) 15 (2), 236. doi:10.3390/ph15020236
Kim, S., Chen, J., Cheng, T., Gindulyte, A., He, J., He, S., et al. (2025). PubChem 2025 update. Nucleic Acids Res. 53 (D1), D1516–D1525. doi:10.1093/nar/gkae1059
Kvietkauskas, M., Zitkute, V., Leber, B., Strupas, K., Stiegler, P., and Schemmer, P. (2021). Dietary melatonin and glycine decrease tumor growth through antiangiogenic activity in experimental colorectal liver metastasis. Nutrients 13 (6), 2035. doi:10.3390/nu13062035
Li, Y. I., Hung, J. S., Yu, T. Y., Liou, J. M., Wei, J. N., Kao, H. L., et al. (2014). Serum vascular adhesion protein-1 predicts all-cause mortality and cancer-related mortality in subjects with colorectal cancer. Clin. Chim. Acta 428, 51–56. doi:10.1016/j.cca.2013.10.024
Li, C., Wang, Y., Liu, D., Wong, C. C., Coker, O. O., Zhang, X., et al. (2022). Squalene epoxidase drives cancer cell proliferation and promotes gut dysbiosis to accelerate colorectal carcinogenesis. Gut 71 (11), 2253–2265. doi:10.1136/gutjnl-2021-325851
Liao, Q., Zhou, X., Wu, L., Yang, Y., Zhu, X., Liao, H., et al. (2025). Gut microbial metabolite 4-hydroxybenzeneacetic acid drives colorectal cancer progression via accumulation of immunosuppressive PMN-MDSCs. J. Clin. Invest. 135 (11), e181243. doi:10.1172/JCI181243
Lişcu, H. D., Verga, N., Atasiei, D. I., Badiu, D. C., Dumitru, A. V., Ultimescu, F., et al. (2024). Biomarkers in colorectal cancer: actual and future perspectives. Int. J. Mol. Sci. 25 (21), 11535. doi:10.3390/ijms252111535
Liu, S., Wen, H., Li, F., Xue, X., Sun, X., et al. (2024). Revealing the pathogenesis of gastric intestinal metaplasia based on the mucosoid air-liquid interface. J. Transl. Med. 22 (1), 468. doi:10.1186/s12967-024-05276-7
Lo, M. C., Yip, T. C., Ngan, K. C., Cheng, W. W., Law, C. K., Chan, P. S., et al. (2013). Role of MIF/CXCL8/CXCR2 signaling in the growth of nasopharyngeal carcinoma tumor spheres. Cancer Lett. 335 (1), 81–92. doi:10.1016/j.canlet.2013.01.052
Ma, Y., Peng, J., Liu, W., Zhang, P., Huang, L., Gao, B., et al. (2009). Proteomics identification of desmin as a potential oncofetal diagnostic and prognostic biomarker in colorectal cancer. Mol. Cell. Proteomics 8 (8), 1878–1890. doi:10.1074/mcp.M800541-MCP200
Matsuyama, T., Ishikawa, T., Mogushi, K., Yoshida, T., Iida, S., Uetake, H., et al. (2010). MUC12 mRNA expression is an independent marker of prognosis in stage II and stage III colorectal cancer. Int. J. Cancer 127 (10), 2292–2299. doi:10.1002/ijc.25256
Mori, M., Nakajima, M., Mikami, Y., Seki, S., Takigawa, M., Kubo, T., et al. (2006). Transcriptional regulation of the cartilage intermediate layer protein (CILP) gene. Biochem. Biophys. Res. Commun. 341 (1), 121–127. doi:10.1016/j.bbrc.2005.12.159
Murphy, N., Song, M., Papadimitriou, N., Carreras-Torres, R., Langenberg, C., Martin, R. M., et al. (2022). Associations between glycemic traits and colorectal cancer: a mendelian randomization analysis. J. Natl. Cancer Inst. 114 (5), 740–752. doi:10.1093/jnci/djac011
Palafox, M. A., Belskaya, N. P., and Kostova, I. P. (2023). Study of the molecular architectures of 2-(4-Chlorophenyl)-5-(pyrrolidin-1-yl)-2H-1,2,3-triazole-4-carboxylic acid using their vibrational spectra, quantum chemical calculations and molecular docking with MMP-2 receptor. Pharmaceutics 15 (12), 2686. doi:10.3390/pharmaceutics15122686
Piastra, V., Ganci, F., Sacconi, A., Pranteda, A., Allegretti, M., Bernardini, R., et al. (2024). Repurposed AT9283 triggers anti-tumoral effects by targeting MKK3 oncogenic functions in colorectal cancer. J. Exp. Clin. Cancer Res. 43 (1), 234. doi:10.1186/s13046-024-03150-4
Pisani, P. (2008). Hyper-insulinaemia and cancer, meta-analyses of epidemiological studies. Arch. Physiol. Biochem. 114 (1), 63–70. doi:10.1080/13813450801954451
Rinaldi, S., Rohrmann, S., Jenab, M., Biessy, C., Sieri, S., Palli, D., et al. (2008). Glycosylated hemoglobin and risk of colorectal cancer in men and women, the European prospective investigation into cancer and nutrition. Cancer Epidemiol. Biomarkers Prev. 17 (11), 3108–3115. doi:10.1158/1055-9965.EPI-08-0495
Romero-Moreno, R., Curtis, K. J., Coughlin, T. R., Miranda-Vergara, M. C., Dutta, S., Natarajan, A., et al. (2019). The CXCL5/CXCR2 axis is sufficient to promote breast cancer colonization during bone metastasis. Nat. Commun. 10 (1), 4404. doi:10.1038/s41467-019-12108-6
Rosignoli, S., and Paiardini, A. (2022). Boosting the full potential of PyMOL with structural biology plugins. Biomolecules 12 (12), 1764. doi:10.3390/biom12121764
Saliba, J., Coutaud, B., Makhani, K., Epstein Roth, N., Jackson, J., Park, J. Y., et al. (2022). Loss of NFE2L3 protects against inflammation-induced colorectal cancer through modulation of the tumor microenvironment. Oncogene 41 (11), 1563–1575. doi:10.1038/s41388-022-02192-2
Salibasic, M., Pusina, S., Bicakcic, E., Pasic, A., Gavric, I., Kulovic, E., et al. (2019). Colorectal cancer surgical treatment, our experience. Med. Arch. 73 (6), 412–414. doi:10.5455/medarh.2019.73.412-414
Shams-White, M. M., Brockton, N. T., Mitrou, P., Romaguera, D., Brown, S., Bender, A., et al. (2019). Operationalizing the 2018 world cancer research fund/American institute for cancer research (WCRF/AICR) cancer prevention recommendations: a standardized scoring system. Nutrients 11 (7), 1572. doi:10.3390/nu11071572
Shinagawa, T., Tanaka, T., Nozawa, H., Emoto, S., Murono, K., Kaneko, M., et al. (2018). Comparison of the guidelines for colorectal cancer in Japan, the USA and Europe. Ann. Gastroenterol. Surg. 2 (1), 6–12. doi:10.1002/ags3.12047
Su, Y., Tian, X., Gao, R., Guo, W., Chen, C., Chen, C., et al. (2022). Colon cancer diagnosis and staging classification based on machine learning and bioinformatics analysis. Comput. Biol. Med. 145, 105409. doi:10.1016/j.compbiomed.2022.105409
Sun, J., and Yuan, J. (2022). Chemokine (C-X-C motif) ligand 1/chemokine (C-X-C motif) receptor 2 autocrine loop contributes to cellular proliferation, migration and apoptosis in cervical cancer. Bioengineered 13 (3), 7579–7591. doi:10.1080/21655979.2022.2036896
Sun, X., Wang, C., Cao, J., Li, J., Ma, G., Wu, X., et al. (2025). Dialog between mantle cell lymphoma cells and lymphoma-associated macrophages underlies ibrutinib resistance. J. Adv. Res. 73, 631–644. doi:10.1016/j.jare.2024.08.023
Svec, J., Onhajzer, J., and Korinek, V. (2024). Origin, development and therapy of colorectal cancer from the perspective of a biologist and an oncologist. Crit. Rev. Oncol. Hematol. 204, 104544. doi:10.1016/j.critrevonc.2024.104544
Tang, B., Zhu, J., Liu, F., Ding, J., Wang, Y., Fang, S., et al. (2022). xCT contributes to colorectal cancer tumorigenesis through upregulation of the MELK oncogene and activation of the AKT/mTOR cascade. Cell. Death Dis. 13 (4), 373. doi:10.1038/s41419-022-04827-4
Tian, W., Zhao, J., Zhang, X., Li, P., Li, X., Hong, Y., et al. (2024). RUNX1 regulates MCM2/CDC20 to promote COAD progression modified by deubiquitination of USP31. Sci. Rep. 14 (1), 13906. doi:10.1038/s41598-024-64726-w
Van Schaeybroeck, S., Allen, W. L., Turkington, R. C., and Johnston, P. G. (2011). Implementing prognostic and predictive biomarkers in CRC clinical trials. Nat. Rev. Clin. Oncol. 8 (4), 222–232. doi:10.1038/nrclinonc.2011.15
Wang, Z., Lachmann, A., Keenan, A. B., and Ma'ayan, A. (2018). L1000FWD: fireworks visualization of drug-induced transcriptomic signatures. Bioinformatics 34 (12), 2150–2152. doi:10.1093/bioinformatics/bty060
Wang, Y., Wang, Q., Huang, H., Huang, W., Chen, Y., McGarvey, P. B., et al. (2021). A crowdsourcing open platform for literature curation in UniProt. PLoS Biol. 19 (12), e3001464. doi:10.1371/journal.pbio.3001464
Wang, X., Wen, X., Hu, X., Niu, W., Guo, G., Yang, F., et al. (2025a). ALDH1A1 promotes colorectal cancer metastasis through activating the notch signaling pathway. Med. Oncol. 42 (9), 403. doi:10.1007/s12032-025-02958-0
Wang, L., Wang, W., Wu, S., and Shi, Y. (2025b). Ultrasound-targeted microbubble destruction mediated upregulation of CNN1 induces ferroptosis in colorectal cancer cells by regulating p53-related SLC7A11 expression. Biochem. Biophys. Res. Commun. 777, 152207. doi:10.1016/j.bbrc.2025.152207
Wen, Z., and Ablimit, A. (2024). Comprehensive analysis of scRNA-Seq and bulk RNA-Seq reveals ubiquitin promotes pulmonary fibrosis in chronic pulmonary diseases. Sci. Rep. 14 (1), 21195. doi:10.1038/s41598-024-70659-1
White, J. R., Lee, J. M., Young, P. R., Hertzberg, R. P., Jurewicz, A. J., Chaikin, M. A., et al. (1998). Identification of a potent, selective non-peptide CXCR2 antagonist that inhibits interleukin-8-induced neutrophil migration. J. Biol. Chem. 273 (17), 10095–10098. doi:10.1074/jbc.273.17.10095
Xia, Y., Sun, M., Huang, H., and Jin, W. L. (2024). Drug repurposing for cancer therapy. Signal Transduct. Target Ther. 9 (1), 92. doi:10.1038/s41392-024-01808-1
Xu, J., Kurup, P., Foscue, E., and Lombroso, P. J. (2015). Striatal-enriched protein tyrosine phosphatase regulates the PTPα/Fyn signaling pathway. J. Neurochem. 134 (4), 629–641. doi:10.1111/jnc.13160
Xu, M., Jiang, H., Wang, H., Liu, J., Liu, B., and Guo, Z. (2018). SB225002 inhibits prostate cancer invasion and attenuates the expression of BSP, OPN and MMP-2. Oncol. Rep. 40 (2), 726–736. doi:10.3892/or.2018.6504
Xue, H., Xing, H. J., Wang, B., Fu, C., Zhang, Y. S., Qiao, X., et al. (2023). Cinchonine, a potential oral small-molecule glucagon-like Peptide-1 receptor agonist, lowers blood glucose and ameliorates non-alcoholic steatohepatitis. Drug Des. Devel. Ther. 17, 1417–1432. doi:10.2147/DDDT.S404055
Yang, Z., Liang, X., Fu, Y., Liu, Y., Zheng, L., Liu, F., et al. (2019). Identification of AUNIP as a candidate diagnostic and prognostic biomarker for oral squamous cell carcinoma. EBioMedicine 47, 44–57. doi:10.1016/j.ebiom.2019.08.013
You, X., Wang, M., Wang, X., Cheng, Y., Zhang, C., et al. (2024). Gliomedin drives gastric cancer cell proliferation and migration, correlating with a poor prognosis. Heliyon 10 (18), e38153. doi:10.1016/j.heliyon.2024.e38153
Zhang, Z., and Zhu, X. (2022). MiR-103a-3p contributes to the progression of colorectal cancer by regulating GREM2 expression. Yonsei Med. J. 63 (6), 520–529. doi:10.3349/ymj.2022.63.6.520
Zhang, Q., Wang, J., Liu, M., Zhu, Q., Li, Q., Xie, C., et al. (2020). Weighted correlation gene network analysis reveals a new stemness index-related survival model for prognostic prediction in hepatocellular carcinoma. Aging (Albany NY) 12 (13), 13502–13517. doi:10.18632/aging.103454
Zhao, Y., Chen, X., Chen, J., and Qi, X. (2023). Decoding connectivity Map-based drug repurposing for oncotherapy. Brief. Bioinform 24 (3), bbad142. doi:10.1093/bib/bbad142
Zhou, B., Shu, B., Yang, J., Liu, J., Xi, T., and Xing, Y. (2014). C-reactive protein, interleukin-6 and the risk of colorectal cancer: a meta-analysis. Cancer Causes Control 25 (10), 1397–1405. doi:10.1007/s10552-014-0445-8
Glossary
ALDH1A1 Aldehyde Dehydrogenase 1 Family Member A1
ANGPTL1 Angiopoietin-like protein 1
ANOVA analysis of variance
AOC3 amine oxidase copper containing 3
AUNIP Aurora kinase A and ninein-interacting protein
BMPs bone morphogenetic proteins
BP biological process
CC cellular component
CILP Cartilage Intermediate Layer Protein
CMap Connectivity Map
CNN1 Calponin 1
CRC colorectal cancer
CXCL3 C-X-C Motif Chemokine 3
DEGs differentially expressed genes
DES Desmin
DGIdb Drug-Gene Interaction Database
DMSO dimethyl sulfoxide
FABP-4 Fatty Acid-Binding Protein 4
FC fold change
FPKM Fragments Per Kilobase of transcript per Million mapped reads
GEO Gene Expression Omnibus
GLDN Gliomedin
GO Gene Ontology
GREM2 Gremlin-2
HPA Human Protein Atlas
KEGG Kyoto Encyclopedia of Genes and Genomes
L1000FWD L1000 Fireworks Display
MAD2L1 Mitotic arrest deficient-2 like-1
MCM2 Minichromosome maintenance complex component 2
MELK Maternal embryonic leucine zipper kinase
MF molecular function
MOA mechanisms of action
NFE2L3 Nuclear factor erythroid 2-related factor 3
OD optical density
OS overall survival
PCA Principal Component Analysis
PRDC Protein Related to DAN and Cerberus
SD standard deviation
TCGA The Cancer Genome Atlas
TGF-β transforming growth factor-beta
UniProt Universal Protein
VAP-1 Vascular Adhesion Protein-1
Keywords: colorectal cancer, differentially expressed genes, drug screening, molecular docking, biomarkers
Citation: Liao Q, Xue Y, Yu Y, Liu M, Yu Z, Cheng X, Zhang L and Ji X (2026) Exploratory research on therapeutic agents combined with early diagnostic biomarkers for colorectal cancer. Front. Pharmacol. 16:1723072. doi: 10.3389/fphar.2025.1723072
Received: 11 October 2025; Accepted: 02 December 2025;
Published: 08 January 2026.
Edited by:
Nazeer Hasan, Jamia Hamdard University, IndiaReviewed by:
Mohd Wasim, Jamia Hamdard University, IndiaKartik Chandra Guchhait, Debra Thana Sahid Kshudiram Smriti Mahavidyalaya, India
Arif Nadaf, Dr. Reddy’s Laboratories, India
Copyright © 2026 Liao, Xue, Yu, Liu, Yu, Cheng, Zhang and Ji. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Lei Zhang, emhsZWlAaGVudS5lZHUuY24=; Xinying Ji, MTAxOTAwOTZAdmlwLmhlbnUuZWR1LmNu
†These authors have contributed equally to this work and share first authorship
Qiming Liao1†