<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Bioinform.</journal-id>
<journal-title>Frontiers in Bioinformatics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Bioinform.</abbrev-journal-title>
<issn pub-type="epub">2673-7647</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">1131021</article-id>
<article-id pub-id-type="doi">10.3389/fbinf.2023.1131021</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Bioinformatics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Multimodal AI for prediction of distant metastasis in carcinoma patients</article-title>
<alt-title alt-title-type="left-running-head">Olatunji and Cui</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fbinf.2023.1131021">10.3389/fbinf.2023.1131021</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Olatunji</surname>
<given-names>Isaac</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/2151405/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Cui</surname>
<given-names>Feng</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1013516/overview"/>
</contrib>
</contrib-group>
<aff>
<institution>Thomas H. Gosnell School of Life Science</institution>, <institution>Rochester Institute of Technology</institution>, <addr-line>Rochester</addr-line>, <addr-line>NY</addr-line>, <country>United States</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/465970/overview">Li Xiao</ext-link>, University of Science and Technology of China, China</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/2161206/overview">Yi Han</ext-link>, University of Texas Southwestern Medical Center, United States</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1291605/overview">Bing Song</ext-link>, University of Texas Southwestern Medical Center, United States</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Feng Cui, <email>fxcsbi@rit.edu</email>
</corresp>
</author-notes>
<pub-date pub-type="epub">
<day>09</day>
<month>05</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>3</volume>
<elocation-id>1131021</elocation-id>
<history>
<date date-type="received">
<day>24</day>
<month>12</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>24</day>
<month>04</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2023 Olatunji and Cui.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Olatunji and Cui</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Metastasis of cancer is directly related to death in almost all cases, however a lot is yet to be understood about this process. Despite advancements in the available radiological investigation techniques, not all cases of Distant Metastasis (DM) are diagnosed at initial clinical presentation. Also, there are currently no standard biomarkers of metastasis. Early, accurate diagnosis of DM is however crucial for clinical decision making, and planning of appropriate management strategies. Previous works have achieved little success in attempts to predict DM from either clinical, genomic, radiology, or histopathology data. In this work we attempt a multimodal approach to predict the presence of DM in cancer patients by combining gene expression data, clinical data and histopathology images. We tested a novel combination of Random Forest (RF) algorithm with an optimization technique for gene selection, and investigated if gene expression pattern in the primary tissues of three cancer types (Bladder Carcinoma, Pancreatic Adenocarcinoma, and Head and Neck Squamous Carcinoma) with DM are similar or different. Gene expression biomarkers of DM identified by our proposed method outperformed Differentially Expressed Genes (DEGs) identified by the DESeq2 software package in the task of predicting presence or absence of DM. Genes involved in DM tend to be more cancer type specific rather than general across all cancers. Our results also indicate that multimodal data is more predictive of metastasis than either of the three unimodal data tested, and genomic data provides the highest contribution by a wide margin. The results re-emphasize the importance for availability of sufficient image data when a weakly supervised training technique is used. Code is made available at: <ext-link ext-link-type="uri" xlink:href="https://github.com/rit-cui-lab/Multimodal-AI-for-Prediction-of-Distant-Metastasis-in-Carcinoma-Patients">https://github.com/rit-cui-lab/Multimodal-AI-for-Prediction-of-Distant-Metastasis-in-Carcinoma-Patients</ext-link>.</p>
</abstract>
<kwd-group>
<kwd>metastasis</kwd>
<kwd>cancer</kwd>
<kwd>multimodal</kwd>
<kwd>gene expression</kwd>
<kwd>histopathology</kwd>
<kwd>deep learning</kwd>
<kwd>machine learning</kwd>
</kwd-group>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Computational BioImaging</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>Proper management planning for carcinoma patients requires accurate diagnosis, and prognosis prediction. A major prognostic factor, metastasis, however is not always diagnosed at initial patient presentation. Metastasis refers to the dissemination of cancer cells away from the initial site of origin to form colonies at distant organs. This single hallmark of malignancies is responsible for the highest proportion (approximately 90%) of cancer related mortalities (<xref ref-type="bibr" rid="B19">Fares et al., 2020</xref>). In HNSCC (Head and Neck Squamous Cell Carcinoma), about 10% of patients present with Distant Metastasis (DM) at diagnosis, while 20%&#x2013;25% are detected during the disease course (<xref ref-type="bibr" rid="B74">Pisani et al., 2020</xref>). Pancreatic cancer is usually diagnosed at a late stage, and DM is quite common at presentation, however, multidetector computed tomography (MDCT) which is currently the optimal preoperative investigation has poor sensitivity to liver and peritoneal metastasis, the most common metastatic sites (<xref ref-type="bibr" rid="B52">Liu et al., 2018</xref>). Cases of metastatic non-muscle-invasive bladder cancer have also been reported (<xref ref-type="bibr" rid="B101">Xu et al., 2022</xref>). Early diagnosis of metastatic cancer is critical if patients will benefit from systemic therapies, alongside other appropriate management plans. The mechanisms behind metastasis, however, are very complex, and a lot is still not understood. Currently, there are no standard biomarkers of metastasis (<xref ref-type="bibr" rid="B93">Wang et al., 2018</xref>), and identification of biomarkers that are associated with metastasis will be useful in guiding clinical decisions, and as basis for development of new therapies.</p>
<p>Past studies have investigated the presence or absence of generalizable links between genes involved in the metastasis of different cancer types, either on the basis of mutation, or expression level. However, metastasis has proven to be a complex molecular and biochemical process. While similar mutation rate, and expression pattern have been reported in selected genes across groups of metastatic cancer, there are multiple claims that there are in fact no specific cancer metastatic genes. In a study carried out by <xref ref-type="bibr" rid="B50">Liu et al. (2017)</xref>, mutation rates of <italic>TP53</italic> was significantly different between primary and metastatic samples in seven cancer types, while <italic>PTEN</italic> mutation level was different in five cancer types. Copy number variations also differ significantly in all 15 cancer types examined. <xref ref-type="bibr" rid="B68">Nguyen et al. (2022)</xref> also implicated <italic>TP53, PTEN, CDKN2A, and MYC</italic> as significantly mutated genes in the metastasis of various subsets of cancer types.</p>
<p>In specific cancer types, designated differentially expressed signature genes are the basis of some of the past attempts to stratify cancer patients based on the risk of DM (<xref ref-type="bibr" rid="B90">van de Vijver et al., 2002</xref>). A recent study (<xref ref-type="bibr" rid="B43">Kaur et al., 2022</xref>) carried out in triple negative breast cancer samples using DESeq2 software identified a total of 1738 differentially expressed genes between metastatic and non-metastatic primary samples, 3 of which are part of the 70 prognostic signature genes in (<xref ref-type="bibr" rid="B90">van de Vijver et al., 2002</xref>). An analogous study in renal cell carcinoma noted some of the identified differentially expressed genes as predictive of metastasis-free survival, and overall survival (<xref ref-type="bibr" rid="B29">Ho et al., 2017</xref>). In melanoma, measurement of Breslow&#x2019;s thickness of the primary tumor was correlated with the level of expression of specific genes, and the transition to metastatic tumor (<xref ref-type="bibr" rid="B77">Riker et al., 2008</xref>). While all of these reports suggest that genomic data may actually be predictive of metastasis, the inconsistent patterns of gene mutation and expression seen in different cancer samples makes it a challenge to precisely identify their importance in specific cases.</p>
<p>The wealth of morphological information contained in the tumor microenvironment is routinely exploited by pathologists for making definitive diagnosis of cancer and predicting patient prognosis. However, aside from being time consuming, this tedious process has also been associated with inter- and intra-observer variability that sometimes lead to unresolved diagnosis or worse, errors in diagnosis. With the use of Convolutional Neural Network (CNN), histopathology images have proven to be good predictors of malignancy status, important molecular biomarkers of various clinical and research relevance, as well as other cellular and extracellular processes (<xref ref-type="bibr" rid="B67">Mungenast et al., 2021</xref>). While many deep learning models have achieved relatively high metrics in detecting tumors within histopathology images of metastatic lymph node samples (<xref ref-type="bibr" rid="B14">Chuang et al., 2021</xref>; <xref ref-type="bibr" rid="B95">Wen et al., 2021</xref>; <xref ref-type="bibr" rid="B32">Huang et al., 2022</xref>), the difficult problem of predicting DM from primary samples remains a challenge, and most of the past attempts on this and similar tasks have struggled with relatively average model performance (<xref ref-type="bibr" rid="B112">Zhao, 2020</xref>; <xref ref-type="bibr" rid="B5">Brinker et al., 2021</xref>; <xref ref-type="bibr" rid="B46">Kiehl et al., 2021</xref>; <xref ref-type="bibr" rid="B79">Schiele et al., 2021</xref>).</p>
<p>Recent trends of use of multimodal data for prediction of diagnosis and outcomes in cancer patients have reported mostly improved metrics compared to use of singular modes of data (<xref ref-type="bibr" rid="B64">Mobadersany et al., 2018</xref>; <xref ref-type="bibr" rid="B12">Chen et al., 2022</xref>). In the case of metastasis prediction, past works have used singular or multimodal data combining clinical (<xref ref-type="bibr" rid="B1">Ali, 2020</xref>), genomic (<xref ref-type="bibr" rid="B107">Yuan et al., 2019</xref>), radiological (<xref ref-type="bibr" rid="B54">Liu et al., 2020</xref>), and histopathology (<xref ref-type="bibr" rid="B112">Zhao, 2020</xref>) data, however, to the best of our knowledge, at the time of this writing, this is one of the first works that combines transcriptomic data, clinical data, and histopathology images from primary tumor samples to predict DM.</p>
<p>In this work, we attempt to predict DM using gene expression data, clinical data, and histopathology images from primary tumors of three carcinoma types - Pancreatic Adenocarcinoma (PAAD), Bladder Carcinoma (BLCA), and Head and Neck Squamous cell Carcinoma (HNSC). The contributions of this research include:<list list-type="simple">
<list-item>
<p>- We identify genomic markers of DM in three different carcinoma types using a novel combination of the random forest algorithm with an optimized feature selection approach described in (<xref ref-type="bibr" rid="B65">Mori et al., 2021</xref>). Genes selected via this method performed better in prediction of DM than differentially expressed genes (DEGs) derived from DESeq2 analysis in our study, as well as in comparison to methods from other similar studies. These biomarkers could be further investigated for development of new diagnostics and therapies against DM.</p>
</list-item>
<list-item>
<p>- Using various machine learning techniques, we investigate and substantiate claims that genes involved in DM tend to be more cancer type specific rather than general across all cancer types, and that there are no specific cancer metastasis genes.</p>
</list-item>
<list-item>
<p>- We built separate models to predict DM from gene expression data, clinical data, or histopathology images, as well as multimodal combinations of the three data types. Models metrics show that multimodal data provides an edge for prediction of DM over genomic, clinical or histopathology data. However, the genomic data has the highest contribution by a wide margin.</p>
</list-item>
<list-item>
<p>- Unlike with genomic data in which features tend to be more cancer type specific, models built from histopathology image dataset of all three cancer types in total had better metrics than those built from a single cancer type image dataset, emphasizing the importance of sufficient data to build a robust model when a weakly supervised technique is used.</p>
</list-item>
</list>
</p>
</sec>
<sec sec-type="methods" id="s2">
<title>Methods</title>
<sec id="s2-1">
<title>Dataset</title>
<p>Barcode of patients with DM in the TCGA-HNSC, TCGA-BLCA, and TCGA-PAAD projects were retrieved from Genomic Data Commons (GDC) Data Portal, and gene expression data, clinical data, and histopathology images of these patients were downloaded. The few number of available cases of DM is a common challenge in most studies on metastasis. BLCA, and PAAD were selected based on the number of cases of DM in these cancer types that are available on The Cancer Genome Atlas (TCGA), and the number of cases with available corresponding genomic, image, and clinical data. A few cancer types were dropped during this selection process due to the absence of some variables in their records. For example, Skin Cell Carcinoma (SKCM) was eliminated due to missing &#x201c;number_of_lymph_nodes_positive_by_HE&#x201d; variable which is present in BLCA, PAAD, and HNSC datasets. HNSC was added based on the interest of one of the authors. See <xref ref-type="sec" rid="s9">Supplementary Material S1</xref> for file ID and barcode of samples. BLCA (N &#x3d; 80) records had 59 male, and 21 female patients. Average age in PAAD (N &#x3d; 58; Male &#x3d; 32; Female &#x3d; 26), and HNSC(N &#x3d; 24; Male &#x3d; 21; Female &#x3d; 3) records is 63, and 61 respectively. The race of patients in the BLCA cohort are (White &#x3d; 66; Black or African American &#x3d; 6; Asian &#x3d; 5), and (White &#x3d; 47; Black or African American &#x3d; 3; Asian &#x3d; 5) for PAAD, while HNSC had (White &#x3d; 19; Black or African American &#x3d; 4; Asian &#x3d; 1). The T, and N staging, and Number of lymph nodes positive by hematoxylin and eosin (HE) staining in each cancer type is shown in <xref ref-type="table" rid="T1">Table 1</xref>.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Information about metastatic datasets.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="center">Information</th>
<th align="center">Bladder Carcinoma</th>
<th align="center">Pancreatic adenocarcinoma</th>
<th align="center">Head and Neck Cancer</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Number of samples</td>
<td align="center">80</td>
<td align="center">58</td>
<td align="center">24</td>
</tr>
<tr>
<td rowspan="2" align="left">Patient age range</td>
<td align="center">Min: 47</td>
<td align="center">Min: 41</td>
<td align="center">Min: 49</td>
</tr>
<tr>
<td align="center">Max: 90</td>
<td align="center">Max: 81</td>
<td align="center">Max: 79</td>
</tr>
<tr>
<td rowspan="2" align="left">Gender</td>
<td align="center">Male: 59</td>
<td align="center">Male: 32</td>
<td align="center">Male: 21</td>
</tr>
<tr>
<td align="center">Female: 21</td>
<td align="center">Female: 26</td>
<td align="center">Female: 3</td>
</tr>
<tr>
<td rowspan="3" align="left">Race</td>
<td align="center">White: 66</td>
<td align="center">White: 47</td>
<td align="center">White: 19</td>
</tr>
<tr>
<td align="center">Black or African American: 6</td>
<td align="center">Black or African American: 3</td>
<td align="center">Black or African American: 4</td>
</tr>
<tr>
<td align="center">Asian: 5</td>
<td align="center">Asian: 5</td>
<td align="center">Asian: 1</td>
</tr>
<tr>
<td rowspan="4" align="left">T staging</td>
<td align="center">T1: 0</td>
<td align="center">T1: 2</td>
<td align="center">T1: 2</td>
</tr>
<tr>
<td align="center">T2: 26</td>
<td align="center">T2: 7</td>
<td align="center">T2: 4</td>
</tr>
<tr>
<td align="center">T3: 41</td>
<td align="center">T3: 48</td>
<td align="center">T3: 6</td>
</tr>
<tr>
<td align="center">T4: 13</td>
<td align="center">T4: 1</td>
<td align="center">T4: 12</td>
</tr>
<tr>
<td rowspan="4" align="left">N staging</td>
<td align="center">0: 42</td>
<td align="center">0: 17</td>
<td align="center">0: 4</td>
</tr>
<tr>
<td align="center">1: 16</td>
<td align="center">1: 41</td>
<td align="center">1: 2</td>
</tr>
<tr>
<td align="center">2: 19</td>
<td align="center">2: 0</td>
<td align="center">2: 18</td>
</tr>
<tr>
<td align="center">3: 3</td>
<td align="center">3: 0</td>
<td align="center">3: 0</td>
</tr>
<tr>
<td rowspan="5" align="left">Number of lymph nodes positive by HE</td>
<td align="center">0: 21</td>
<td align="center">0: 16</td>
<td align="center">0: 3</td>
</tr>
<tr>
<td align="center">1: 12</td>
<td align="center">1: 5</td>
<td align="center">1: 2</td>
</tr>
<tr>
<td align="center">2&#x2013;6: 13</td>
<td align="center">2&#x2013;6: 28</td>
<td align="center">2&#x2013;6: 11</td>
</tr>
<tr>
<td align="center">&#x3e;6: 8</td>
<td align="center">&#x3e;6: 8</td>
<td align="center">&#x3e;6: 3</td>
</tr>
<tr>
<td align="center">NA/Invalid: 26</td>
<td align="center">NA/Invalid: 1</td>
<td align="center">NA/Invalid: 5</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s2-2">
<title>Machine learning for identification of transcriptomic biomarkers and prediction of distant metastasis</title>
<p>We designed a study that utilized machine learning techniques to simultaneously identify biomarkers of DM in each of BLCA, PAAD, and HNSC, and investigate if genes involved in DM are similar across the three cancer types. Each cancer type dataset was pre-processed separately. Equal numbers of complementary gene expression data of samples without DM (BLCA &#x3d; 80; PAAD &#x3d; 58; HNSC &#x3d; 24) were downloaded for each of the cancer types, and we ensured that there were no overlapping samples between the groups with DM, and those without DM. After extracting Transcript Per Million (TPM)) normalized values of protein-coding genes from the individual records, all samples in a group were merged to create a table, and NaN values were replaced with zero. Other initial preprocessing steps include log to base ten transformation of data to adjust for the wide and non-linear values, removal of genes with a value of zero in greater than 80% of the samples, and selection of only top 10000 genes with highest variance across all samples.</p>
<p>To identify important genes that are involved in DM, we utilized the Random Forest (RF) algorithm with an optimization technique described in (<xref ref-type="bibr" rid="B65">Mori Y. et al., 2021</xref>). RF is a supervised ML algorithm that creates multiple decision trees from bootstrapped samples data and randomly selected features to output a result, that is, based on a voting system. It works well with high dimensional data, and is a commonly utilized technique in gene expression data analysis. The optimization technique we employed consists of running the RF algorithm over (N &#x3d; 1000) iterations to classify samples as DM positive or DM negative, and selecting the top (K &#x3d; 100) most important genes at each of the N instances. This is followed by ranking the important genes overall based on how many times each gene appears (i.e., frequency) in K over the N iterations. The algorithm was used to classify samples in each of the groups based on the presence or absence of DM (<xref ref-type="fig" rid="F1">Figure 1</xref>). We selected the top 50 overall highest ranked genes for BLCA, PAAD, and HNSC. Also, the three cancer types datasets were combined to derive the &#x201c;ALL3&#x201d; group, and the above steps were repeated to select the top 50 highest ranked genes in this group as well (<xref ref-type="fig" rid="F1">Figure 1</xref>).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Workflow for analysis of genomic data. On the right, gene expression datasets of HNSC, PAAD, and BLCA are separately passed through the (RF &#x2b; Optimization) gene selection algorithm, and a parallel DGE Analysis. Separate ML models are trained on highest ranked selected genes, and DEGs to predict DM samples. On the top left, the same processes carried out on ALL3 (combined dataset of HNSC, PAAD, and BLCA) dataset. On the lower left, genes selected from ALL3 dataset are used as features to train new models on each of the cancer types (i.e., other groups- HNSC, PAAD, BLCA) dataset. Cancer images source: National Cancer Institute.</p>
</caption>
<graphic xlink:href="fbinf-03-1131021-g001.tif"/>
</fig>
<p>To assess the strength of the selected biomarkers for prediction of DM, and to investigate if genes involved in DM are similar across the three cancer types, first, we used only the selected genes in each of the groups as features to train multiple ML models for the task of DM prediction. Support Vector Machine (SVM), K-Nearest Neighbor (KNN) and RF models were trained on datasets from each of the four (BLCA, PAAD, HNSC ALL3) groups (<xref ref-type="fig" rid="F1">Figure 1</xref>), and the metrics were evaluated. Next, we looked for overlaps between different combinations of the selected genes in the four groups. Furthermore, we trained new ML (SVM, RF, and KNN) models on BLCA, PAAD, and HNSC datasets, but with genes selected from the combined dataset of the three cancer types (i.e., ALL3 group) (<xref ref-type="fig" rid="F1">Figure 1</xref>). Lastly, we created a list of union of selected genes from each of the three cancer types (i.e., BLCA &#x2b; PAAD &#x2b; HNSC), and these were used as variables to predict presence or absence of DM in each of the three cancer dataset.</p>
</sec>
<sec id="s2-3">
<title>Differential Gene Expression with DESeq2</title>
<p>To further assess the strength of genes selected by the proposed method for predicting DM, under the same study design, we carried out an analysis to identify Differentially Expressed Genes (DEGs) between samples with DM and those without DM in each of the four groups. Instead of the TPM normalized values, raw counts of unstranded RNASeq were extracted from the TCGA RNASeq records. A sample information table was also generated. Differential Gene Expression (DGE) analysis was performed in R using the DESeq2 Bioconductor package. DESeq2 detects DEGs by normalizing raw count values of genes in the experimental groups, fitting negative binomial generalized linear models for each gene, and detecting significance by Wald test (<xref ref-type="bibr" rid="B55">Love et al., 2014</xref>). Threshold was set at p-adjusted value of 0.05, after initial set threshold of log fold-change of 1 and p-adjusted value of 0.05 yielded only five, one and seven DEGs in BLCA, PAAD and HNSC respectively. Thereafter, top DEGs were used as features to classify DM samples in each of the study groups (<xref ref-type="fig" rid="F1">Figure 1</xref>). DEGs from this analysis and metrics of models trained on them are compared to those of genes selected via the (RF &#x2b; Optimization) method.</p>
</sec>
<sec id="s2-4">
<title>Convolutional neural network for histopathology images analysis and prediction of distant metastasis</title>
<p>We downloaded diagnostic pathology Whole Slide Histopathology Images (WSIs) of patients with DM in the TCGA projects of BLCA, HNSC, and PAAD from GDC Data Portal. The number of samples in each group reduced slightly after collating a list of only patients with available histopathology images, and RNASeq data, and clinical data. Number of samples with DM in BLCA, PAAD, and HNSC groups are 50, 44, and 18 respectively. Again, we downloaded random complementary WSIs (BLCA &#x3d; 55; PAAD &#x3d; 51; HNSA &#x3d; 22) of patients without DM for each of the cancer types, and ensured no overlap between samples (<xref ref-type="sec" rid="s9">Supplementary Material S1</xref>). Total number of samples in each group was split in an 80:20 ratio for models training and testing.</p>
</sec>
<sec id="s2-5">
<title>WSIs preprocessing</title>
<p>One WSI was preprocessed per patient. Due to the typical large size of WSIs which hovers around 100000 pixels in both horizontal and vertical axis, we isolated representative regions within each WSI for analysis. First, a maximum of 500 random non-overlapping patches of size 512 &#x2a; 512 pixels were extracted from each image at 20 &#xd7; magnification. These were reduced to a maximum of 200 patches after checking for a tissue area of at least 80% in each patch. To ensure selection of tumor representative regions we imitated the concept of high cellularity described in (<xref ref-type="bibr" rid="B76">Riasatian, 2021</xref>) which assumes that high grade tumors contain more cellular areas than normal tissues. Hence, we used a pretrained U-Net nuclei segmentation model which was trained on breast cancer histopathology images to rank the patches based on cellular content. The top 60 highest ranked patches were selected for each patient. Further random manual inspection led to the exclusion of a few more patches before Macenko normalization. At the end of initial preprocessing steps, BLCA, PAAD, HNSC, and ALL3 groups had a total of 4936, 4348, 1856, and 11154 training patches respectively, of which 25% were for validation.</p>
</sec>
<sec id="s2-6">
<title>CNN training and multimodal fusion of genomic and imaging data</title>
<p>A DenseNet121 model, pretrained on Imagenet data, and KimiaNet (<xref ref-type="bibr" rid="B76">Riasatian, 2021</xref>) were chosen for this study. This allows us to evaluate the effect of keeping weights of lower layers of a model fine-tuned on domain histopathology images (i.e., KimiaNet) on performance. DenseNet is a CNN architecture that attempts to solve the problem of varnishing gradient associated with deep neural networks by cumulatively concatenating features output of a layer within the architecture to input of subsequent upper layers, (<xref ref-type="bibr" rid="B33">Huang et al., 2016</xref>; <xref ref-type="bibr" rid="B31">2017</xref>). In total, DenseNet121 contains 1 7 &#xd7; 7 convolution, 58 3 &#xd7; 3 convolution, 61 1 &#xd7; 1 convolution, 4 average pooling, and 1 fully connected layers. KimiaNet is a DenseNet121 architecture Imagenet-pretrained CNN model fine-tuned on about 250,000 histopathology images. We replaced the classification layers in the models with 1 GlobalAveragePooling2D, 3 Dropout (0.1, 0.2, and 0.05), and 3 Dense (512, 32, and 1) layers, to train a new classifier head on top of the base convolutional layers, using a weakly supervised technique. The last hidden layer of the new classifier heads has 32 nodes which is subsequently used as features extractor. The modified DenseNet121 was separately trained on data from each of the four study groups (HNSC, BLCA, PAAD, ALL3) for a binary classification task of DM prediction <bold>(</bold>
<xref ref-type="fig" rid="F2">Figure 2</xref>
<bold>)</bold>. Loss function was binary cross-entropy, and optimizer, Adam with a learning rate of 0.0001. Training epoch was set at 40.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Binary classifiers trained on the same patients&#x2019; records but with different data types for comparisons between unimodal, and multimodal data models. <bold>(A)</bold> Additional Dense layers trained on DenseNet121, and KimiaNet CNN for features extraction from histopathology images. Classifiers are trained on image features to predict presence or absence of DM. <bold>(B)</bold> Classifiers are trained on only clinical variables to predict presence or absence of DM. <bold>(C)</bold> Classifiers are trained on genes selected by the (RF &#x2b; Optimization) algorithm to predict presence or absence of DM <bold>(D)</bold> Image features are fused with selected genes, and clinical features to train multimodal models for prediction of DM.</p>
</caption>
<graphic xlink:href="fbinf-03-1131021-g002.tif"/>
</fig>
<p>For prediction on the test sets, patient level features were derived by passing all patches for each patient through the trained feature extractor models, and averaging all patch features from a slide. We trained traditional machine learning algorithm- SVM and a single layer MLP on the patient level features to classify samples as DM positive or DM negative (<xref ref-type="fig" rid="F2">Figure 2</xref>). Considering this approach we have taken, HNSC group was dropped here based on small sample size.</p>
<p>Multimodal concatenation of patient level features derived from the CNN models to log transformed values of TPM normalized RNASeq data of selected DM associated genes from above was carried out. For BLCA, and PAAD groups, the 50 genes selected using the (RF &#x2b; Optimization) technique were combined with 32 features derived from histopathology images of each patient to train classifiers, and predict presence or absence of DM (<xref ref-type="fig" rid="F2">Figure 2</xref>). 150 genes, consisting of 50 each selected from BLCA, PAAD, and HNSC were combined with 32 image features to predict DM in the ALL3 group (<xref ref-type="fig" rid="F2">Figure 2</xref>).</p>
<p>Likely due to the effect of a larger training dataset given the weakly supervised training technique that was employed, it was observed from early results that image only models trained on data from the ALL3 group generally performed better in their predictions on the test sets than other groups. Similarly, multimodal models of genomic and histopathology image were generally better than unimodal model of either data type in the ALL3 dataset, while metrics of multimodal models are generally lower than either genomic or image only models in other individual cancer dataset (i.e., BLCA, or PAAD). Therefore further experiments were carried out on the ALL3 dataset. We trained fully connected layers on KimiaNet, a DenseNet121 architecture model finetuned on about 250,000 histopathology images to extract features from the images in the ALL3 dataset. Metrics of multimodal models built from these features, and genomic data were compared to those from DenseNet121.</p>
<p>Four variables&#x2014;Age, T stage, N stage, and Number of lymph nodes positive by HE&#x2014;were selected from the clinical data, and concatenated with genomic, and histopathology features to predict presence or absence of DM in the ALL3 dataset (<xref ref-type="fig" rid="F2">Figure 2</xref>). As part of the clinical data preprocessing, invalid/NaN values in the age variable were replaced by the rounded mean age of the cancer type while invalid/NaN T staging, N staging and number of positive lymph node values were replaced with 0. Min-max normalization was applied to age, and number of positive lymph node variables.</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec id="s3-1">
<title>ML identified gene expression biomarkers of distant metastasis</title>
<p>In each cancer type, samples with DM were combined with the same size of samples without DM. Also, datasets from the three cancer types were combined to derive the &#x201c;ALL3&#x201d; group. Following pre-processing, the optimized RF algorithm, as described in the methods section was used for binary classification of the samples (i.e., DM vs. non-DM), and simultaneous gene selection (top 50) in each of the groups (<xref ref-type="sec" rid="s9">Supplementary Material S2</xref>). While RF is a popular ML based gene selection technique (<xref ref-type="bibr" rid="B17">D&#xed;az-Uriarte and Alvarez de Andr&#xe9;s, 2006</xref>; <xref ref-type="bibr" rid="B96">Wenric and Ruhollah, 2018</xref>; <xref ref-type="bibr" rid="B69">Nivedhitha et al., 2020</xref>), to the best of our knowledge, at the time of this writing, this is the first time RF is combined with this optimization method for the task of gene selection. The top 5 genes associated with metastasis in BLCA are <italic>FKBP6, ASIC5, MAPK8IP1, F11R</italic>, and <italic>PABPC5</italic>, while <italic>CTSV, BIRC5, SERPINA7, CST2, KLHL3</italic> are the highest ranked genes in PAAD. In HNSC, <italic>TM4SF1, C19orf18, EXPH5, FKBP2, PTPRZ1</italic> are the highest ranked genes, and ALL3 group had <italic>CHRNG, CPT1B, CGREF1, GPR31, SPTBN5</italic>. Past publications have confirmed the activities of some of these genes in the metastasis of various cancers, either as oncogene or as tumor suppressors (<xref ref-type="table" rid="T2">Table 2</xref>). There are others whose roles in cancer metastasis are yet to be explored. A functional annotation search for the selected 50 genes on <ext-link ext-link-type="uri" xlink:href="http://pantherdb.org">pantherdb.org</ext-link> revealed very similar classification patterns in all of the three cancer types. Approximately 50%&#x2013;60% of the genes had no known category, and the most common functional classification of those with known categories in all the cancers are protein binding, and catalytic activity (<xref ref-type="fig" rid="F3">Figure 3</xref>).</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>Top 10 genes selected by the ML (RF &#x2b; Optimization) technique in BLCA, PAAD, HNSC, and various types of malignancies in which their involvement in metastasis have been reported in past literatures.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Gene Ranking &#x23;</th>
<th align="left">Highest Ranked DM Genes in BLCA</th>
<th align="left">Cancer types with Publications of Gene metastatic activities</th>
<th align="left">Highest Ranked DM Genes in PAAD</th>
<th align="left">Cancer types with Publications of Gene metastatic activities</th>
<th align="left">Highest Ranked DM Genes in HNSC</th>
<th align="left">Cancer types with Publications of Gene metastatic activities</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="5" align="left">1</td>
<td rowspan="5" align="left">
<bold>FKBP6</bold>
</td>
<td rowspan="5" align="left"/>
<td rowspan="5" align="left">
<bold>CTSV</bold>
</td>
<td align="left">Breast Cancer <xref ref-type="bibr" rid="B81">Sereesongsaeng et al. (2020</xref>)</td>
<td rowspan="5" align="left">
<bold>TM4SF1</bold>
</td>
<td align="left">Ovarian Cancer <xref ref-type="bibr" rid="B22">Gaoet al. (2019</xref>)</td>
</tr>
<tr>
<td align="left">Colorectal Cancer <xref ref-type="bibr" rid="B92">Wang et al. (2020</xref>)</td>
<td align="left">Esophageal Cancer <xref ref-type="bibr" rid="B103">Xue et al. (2017</xref>)</td>
</tr>
<tr>
<td align="left">Lung Cancer <xref ref-type="bibr" rid="B94">Wang et al. (2021</xref>); <xref ref-type="bibr" rid="B105">Yang et al. (2022</xref>)</td>
<td align="left">Pancreatic Cancer <xref ref-type="bibr" rid="B7">Cao et al. (2016</xref>)</td>
</tr>
<tr>
<td align="left"/>
<td align="left">Liver Cancer <xref ref-type="bibr" rid="B33">Huang et al. (2016</xref>)</td>
</tr>
<tr>
<td align="left"/>
<td align="left">Colorectal Cancer <xref ref-type="bibr" rid="B72">Park et al. (2016</xref>); <xref ref-type="bibr" rid="B87">Tang et al. (2020</xref>)</td>
</tr>
<tr>
<td rowspan="3" align="left">2</td>
<td rowspan="3" align="left">
<bold>ASIC5</bold>
</td>
<td rowspan="3" align="left"/>
<td rowspan="3" align="left">
<bold>BIRC5</bold>
</td>
<td align="left">Colorectal Cancer <xref ref-type="bibr" rid="B45">Kreig et al. (2013</xref>)</td>
<td rowspan="3" align="left">
<bold>C19orf18</bold>
</td>
<td rowspan="3" align="left"/>
</tr>
<tr>
<td align="left">Prostate Cancer <xref ref-type="bibr" rid="B28">Hennigs et al. (2020</xref>)</td>
</tr>
<tr>
<td align="left">Breast Cancer <xref ref-type="bibr" rid="B16">Dai JB et al. (2020</xref>); <xref ref-type="bibr" rid="B70">Oparina et al. (2021</xref>)</td>
</tr>
<tr>
<td align="left">3</td>
<td align="left">
<bold>MAPK8IP1</bold>
</td>
<td align="left">Gastric Cancer <xref ref-type="bibr" rid="B56">Lu et al. (2017</xref>)</td>
<td align="left">
<bold>SERPINA7</bold>
</td>
<td align="left"/>
<td align="left">
<bold>EXPH5</bold>
</td>
<td align="left"/>
</tr>
<tr>
<td rowspan="4" align="left">4</td>
<td rowspan="4" align="left">
<bold>F11R</bold>
</td>
<td align="left">Breast Cancer <xref ref-type="bibr" rid="B3">Bednarek et al. (2020</xref>)</td>
<td rowspan="4" align="left">
<bold>CST2</bold>
</td>
<td align="left">Prostate Cancer <xref ref-type="bibr" rid="B82">Song et al. (2021</xref>)</td>
<td rowspan="4" align="left">
<bold>FKBP2</bold>
</td>
<td rowspan="4" align="left"/>
</tr>
<tr>
<td align="left">Prostate Cancer <xref ref-type="bibr" rid="B26">Guo et al. (2023</xref>)</td>
<td align="left">Triple-Negative Breast Cancer <xref ref-type="bibr" rid="B41">Johnstone et al. (2018</xref>)</td>
</tr>
<tr>
<td align="left">Pancreatic Cancer <xref ref-type="bibr" rid="B108">Zhang et al. (2022</xref>)</td>
<td align="left">Gastric Cancer <xref ref-type="bibr" rid="B111">Zhang et al. (2020</xref>)</td>
</tr>
<tr>
<td align="left">Multiple Cancers <xref ref-type="bibr" rid="B15">Czubak-Prowizor et al. (2022</xref>)</td>
<td align="left"/>
</tr>
<tr>
<td rowspan="2" align="left">5</td>
<td rowspan="2" align="left">
<bold>PABPC5</bold>
</td>
<td align="left">Non-Small Cell Lung Cancer <xref ref-type="bibr" rid="B98">Wu et al. (2021</xref>)</td>
<td rowspan="2" align="left">
<bold>KLHL3</bold>
</td>
<td rowspan="2" align="left">Breast Cancer <xref ref-type="bibr" rid="B62">Mamoor. (2021</xref>)</td>
<td rowspan="2" align="left">
<bold>PTPRZ1</bold>
</td>
<td align="left">Triple Negative Breast Cancer <xref ref-type="bibr" rid="B21">Fu et al. (2016</xref>)</td>
</tr>
<tr>
<td align="left">Glioma <xref ref-type="bibr" rid="B40">Jing et al. (2020</xref>)</td>
<td align="left">Lung Cancer <xref ref-type="bibr" rid="B11">Chai et al. (2022</xref>)</td>
</tr>
<tr>
<td rowspan="3" align="left">6</td>
<td rowspan="3" align="left">
<bold>SLC5A1</bold>
</td>
<td rowspan="3" align="left">Glioblastoma <xref ref-type="bibr" rid="B6">Brosch et al. (2022</xref>)</td>
<td rowspan="3" align="left">
<bold>TK1</bold>
</td>
<td align="left">Lung Cancer <xref ref-type="bibr" rid="B61">Malvin et al. (2019</xref>)</td>
<td rowspan="3" align="left">
<bold>MUC12</bold>
</td>
<td align="left">Colorectal Cancer <xref ref-type="bibr" rid="B63">Maksuyama et al. (2010</xref>)</td>
</tr>
<tr>
<td align="left">Breast Cancer (<xref ref-type="bibr" rid="B18">Fanelli et al. (2021</xref>); <xref ref-type="bibr" rid="B27">He et al. (2006</xref>), <xref ref-type="bibr" rid="B4">Bitter et al. (2022)</xref>
</td>
<td align="left">Renal Cell Carcinoma <xref ref-type="bibr" rid="B23">Gao et al. (2020</xref>)</td>
</tr>
<tr>
<td align="left">Multiple Cancers <xref ref-type="bibr" rid="B50">Liu et al. (2017</xref>)</td>
<td align="left"/>
</tr>
<tr>
<td rowspan="4" align="left">7</td>
<td rowspan="4" align="left">
<bold>CCDC33</bold>
</td>
<td rowspan="4" align="left"/>
<td rowspan="4" align="left">
<bold>E2F1</bold>
</td>
<td align="left">Prostate Cancer <xref ref-type="bibr" rid="B49">Liang et al. (2016</xref>)</td>
<td rowspan="4" align="left">
<bold>RPS10</bold>
</td>
<td rowspan="4" align="left"/>
</tr>
<tr>
<td align="left">Melanoma <xref ref-type="bibr" rid="B2">Alla et al. (2010</xref>)</td>
</tr>
<tr>
<td align="left">Breast Cancer <xref ref-type="bibr" rid="B30">Hollern DP et al. (2019</xref>)</td>
</tr>
<tr>
<td align="left">Multiple Cancers <xref ref-type="bibr" rid="B24">Goody et al. (2019</xref>)</td>
</tr>
<tr>
<td rowspan="7" align="left">8</td>
<td rowspan="7" align="left">
<bold>ONECUT1</bold>
</td>
<td rowspan="7" align="left">Hepatocellular Carcinoma <xref ref-type="bibr" rid="B53">Liu et al. (2022</xref>)</td>
<td rowspan="7" align="left">
<bold>GGH</bold>
</td>
<td rowspan="7" align="left">Gastric Cancer <xref ref-type="bibr" rid="B113">Terashima et al. (2017)</xref>, <xref ref-type="bibr" rid="B60">Maezawa et al. (2020</xref>)</td>
<td rowspan="7" align="left">
<bold>RAB3D</bold>
</td>
<td align="left">Osteosarcoma (<xref ref-type="bibr" rid="B37">Jiashi et al. (2018</xref>); <xref ref-type="bibr" rid="B8">Cao et al. (2019</xref>)</td>
</tr>
<tr>
<td align="left">Colorectal Cancer <xref ref-type="bibr" rid="B57">Luo et al. (2016</xref>)</td>
</tr>
<tr>
<td align="left">Melanoma <xref ref-type="bibr" rid="B104">Yang et al. (2015</xref>)</td>
</tr>
<tr>
<td align="left">Breast Cancer <xref ref-type="bibr" rid="B104">Yang et al. (2015</xref>)</td>
</tr>
<tr>
<td align="left">Glioma <xref ref-type="bibr" rid="B38">Jin et al. (2021</xref>), <xref ref-type="bibr" rid="B88">Tao et al. (2020)</xref>
</td>
</tr>
<tr>
<td align="left">Non-small Cell Lung Cancer (<xref ref-type="bibr" rid="B58">Ma et al. (2022</xref>)</td>
</tr>
<tr>
<td align="left">Hepatocellular Cancer <xref ref-type="bibr" rid="B47">Li et al. (2020</xref>)</td>
</tr>
<tr>
<td rowspan="3" align="left">9</td>
<td rowspan="3" align="left">
<bold>CLC</bold>
</td>
<td align="left">Gastric Cancer <xref ref-type="bibr" rid="B25">Gu et al. (2018</xref>); <xref ref-type="bibr" rid="B73">Peng a et al. (2018</xref>)</td>
<td rowspan="3" align="left">
<bold>MAGED4</bold>
</td>
<td align="left">Hepatocellular Carcinoma <xref ref-type="bibr" rid="B42">Kanda et al. (2017</xref>)</td>
<td rowspan="3" align="left">
<bold>SH2D4A</bold>
</td>
<td rowspan="3" align="left"/>
</tr>
<tr>
<td align="left">Colorectal Cancer <xref ref-type="bibr" rid="B66">Mu et al. (2020</xref>)</td>
<td align="left">Non Small Cell Lung Cancer <xref ref-type="bibr" rid="B59">Ma et al. (2012</xref>)</td>
</tr>
<tr>
<td align="left">Multiple cancers <xref ref-type="bibr" rid="B99">Xu et al. (2014</xref>)</td>
<td align="left"/>
</tr>
<tr>
<td rowspan="3" align="left">10</td>
<td rowspan="3" align="left">
<bold>ZNF467</bold>
</td>
<td rowspan="3" align="left">Prostate Cancer <xref ref-type="bibr" rid="B108">Zhang et al. (2022</xref>)</td>
<td rowspan="3" align="left">
<bold>CST6</bold>
</td>
<td align="left">Breast Cancer <xref ref-type="bibr" rid="B48">Li et al. (2021</xref>); <xref ref-type="bibr" rid="B39">Jin L et al. (2012</xref>), <xref ref-type="bibr" rid="B78">Rivenbark et al. (2007)</xref>
</td>
<td rowspan="3" align="left">
<bold>RGS16</bold>
</td>
<td align="left">Chondrosarcoma <xref ref-type="bibr" rid="B85">Sun et al. (2015</xref>)</td>
</tr>
<tr>
<td align="left">Melanoma <xref ref-type="bibr" rid="B77">Riker AI et al. (2008</xref>)</td>
<td align="left">Glioma <xref ref-type="bibr" rid="B91">Wang et al. (2022</xref>)</td>
</tr>
<tr>
<td align="left">Multiple Cancers <xref ref-type="bibr" rid="B100">Xu et al. (2021</xref>)</td>
<td align="left">Pancreatic Cancer <xref ref-type="bibr" rid="B44">Kim et al. (2010</xref>); <xref ref-type="bibr" rid="B10">Carper MB et al. (2014</xref>)</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Functional classification of selected genes in <bold>(A)</bold> BLCA, <bold>(B)</bold> PAAD, and <bold>(C)</bold> HNSC.</p>
</caption>
<graphic xlink:href="fbinf-03-1131021-g003.tif"/>
</fig>
<p>Dataset in each group was randomly split into train and test set in a 75:25 ratio, and SVM with linear kernel, KNN, and RF models were trained on the highest ranked 30, 25, 20, 15, 10, 5, 3, 2, and 1 gene from list of selected top 50 genes. Outcomes of single instance predictions, and five-fold cross-validation on the test sets were recorded (<xref ref-type="sec" rid="s9">Supplementary Material S3</xref>).</p>
<p>To test the strength of genes selected based on the (RF &#x2b; Optimization) method as predictors of DM, a DGE analysis was carried out with the DESeq2 software package to identify DEGs between the DM and non-DM samples. With a threshold adjusted <italic>p</italic>-value of 0.05, the number of DEGs in BLCA, PAAD, HNSC, and ALL3 are 229, 2142, 1100, and 658 respectively. Of these 5, 39, 19, and 13 overlap with the 50 genes selected using the ML method in each of the respective groups. Presence or absence of DM was predicted in the four groups using DEGs with the lowest 30, 25, 20, 15, 10, 5, 3, 2, and 1 adjusted <italic>p</italic>-value as variables. In almost all cases, genes selected using the ML (RF &#x2b; optimization) techniques had higher evaluation metrics (Accuracy, F1-score, AUROC) than those selected via DESeq2 DGE analysis in the task of DM prediction <bold>(</bold>
<xref ref-type="fig" rid="F4">Figure 4</xref>
<bold>)</bold>. Highest mean AUROC score achieved over a five-fold cross validation was 0.87, 0.92, 0.97, 0.79 in BLCA, PAAD, HNSC, and ALL3 groups respectively, compared to 0.65, 0.80, 0.92, 0.62 with DESeq2 DGE analysis, when models were trained on top 15 selected genes (<xref ref-type="fig" rid="F4">Figure 4</xref>; <xref ref-type="sec" rid="s9">Supplementary Material S3</xref>).</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>(<bold>A, E, I, M</bold>) Five-fold cross validation ROC curves, and mean ROC curve when 15 highest ranked genes selected by (RF &#x2b; Optimization) method from each of the groups are used to predict presence or absence of DM. These are higher than other predictions within the same study group. (<bold>B, F, J, N</bold>) Five-fold cross validation ROC curves, and mean ROC curve when 15 highest ranked DEGs (p adjusted value &#x3d; 0.05) are used to predict presence or absence of DM. <bold>(C, G, K)</bold> Five-fold cross validation ROC curves, and mean ROC curve when 15 highest ranked genes selected by (RF &#x2b; Optimization) method from ALL3 group are used to predict presence or absence of DM in other (BLCA, PAAD, HNSC) study groups. (<bold>D, H, L</bold>) Five-fold cross validation ROC curves, and mean ROC curve when a union of the 15 highest ranked genes selected by (RF &#x2b; Optimization) method from each of BLCA, PAAD, and HNSC groups (total &#x3d; 45) are used to predict presence or absence of DM in each cancer type.</p>
</caption>
<graphic xlink:href="fbinf-03-1131021-g004.tif"/>
</fig>
</sec>
<sec id="s3-2">
<title>Expression profile of genes associated with DM in primary tumors differ across cancer types</title>
<p>To investigate if primary tumors of different cancer types share similar gene expression profiles in metastasis, first, we looked for overlap between various combinations of the selected 50 genes in each group. There was no overlap between the different combinations of BLCA, PAAD or HNC. However, in the ALL3 group, 7 genes (<italic>CGREF1, SPTBN5, TAS1R3, FAM241B, EL5, CSPG5</italic>, and <italic>MAPK8IP1</italic>), and 6 genes (<italic>CFAP45, CST2, BIRC5, MAGED4, CST6</italic>, and <italic>FAIM2</italic>) overlap with those selected in BLCA, and PAAD respectively (<xref ref-type="fig" rid="F5">Figure 5</xref>; <xref ref-type="table" rid="T3">Table 3</xref>). Also, it was observed that there was at least one member of the <bold>
<italic>Z</italic>
</bold>
<italic>NF</italic> and <italic>FKBP</italic> gene family present within the list of selected genes in BLCA, and HNSC. <xref ref-type="bibr" rid="B35">Jen and Wang (2016)</xref> extensively reviewed multiple studies on the role of ZNF (Zinc Finger) gene family proteins in cancer progression, and metastasis, acting both as oncogenes, and tumor suppressors. Various studies (<xref ref-type="bibr" rid="B20">Fong et al., 2003</xref>; <xref ref-type="bibr" rid="B86">Sun et al., 2021</xref>) have also implicated members of the FKBP family in cancer metastasis.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>There was no overlap between genes selected using the (RF &#x2b; Optimization) method in either of the three cancer types- BLCA, PAAD, and HNSC. Six genes selected in PAAD, and seven genes in BLCA were also present in the list of genes selected from the ALL3 group.</p>
</caption>
<graphic xlink:href="fbinf-03-1131021-g005.tif"/>
</fig>
<table-wrap id="T3" position="float">
<label>TABLE 3</label>
<caption>
<p>Number of overlapping genes between combinations of the different groups of genes selected by (RF &#x2b; Optimization) method, and DGE analysis by DESeq2.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="center">
<break/>Compared Groups of Selected Genes</th>
<th align="center">(RF &#x2b; Optimization) method<break/>No of Overlaps Between Groups</th>
<th align="center">DGE analysis (DESeq2)<break/>No of Overlaps Between Groups</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="center">BLCA &#x7c; PAAD</td>
<td align="center">0</td>
<td align="center">32</td>
</tr>
<tr>
<td align="center">BLCA &#x7c; HNSC</td>
<td align="center">0</td>
<td align="center">16</td>
</tr>
<tr>
<td align="center">PAAD &#x7c; HNSC</td>
<td align="center">0</td>
<td align="center">160</td>
</tr>
<tr>
<td align="center">BLCA &#x7c; ALL3</td>
<td align="center">7</td>
<td align="center">66</td>
</tr>
<tr>
<td align="center">PAAD &#x7c; ALL3</td>
<td align="center">6</td>
<td align="center">305</td>
</tr>
<tr>
<td align="center">HNSC &#x7c; ALL3</td>
<td align="center">0</td>
<td align="center">105</td>
</tr>
<tr>
<td align="center">BLCA &#x7c; PAAD &#x7c; HNSC</td>
<td align="center">0</td>
<td align="center">4 (<bold>PHF21B, CALY, FGF19, and KRT81</bold>)</td>
</tr>
<tr>
<td align="center">BLCA &#x7c; PAAD &#x7c; HNSC &#x7c; ALL3</td>
<td align="center">0</td>
<td align="center">0</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Furthermore, we trained new ML (SVM, RF, and KNN) models on BLCA, PAAD, and HNSC datasets, but with genes selected from the combined dataset of the three cancer types (<italic>i.e.</italic>, ALL3 group) (<xref ref-type="fig" rid="F1">Figure 1</xref>). These models performed lesser than in cases of predictions of DM using genes selected in individual cancer types. Highest mean AUROC score of 0.76, 0.82 and 0.64 was achieved in BLCA, PAAD, and HNSC datasets respectively against 0.87, 0.92, and 0.97 of models trained on selected genes derived from individual cancer types (<xref ref-type="fig" rid="F4">Figure 4</xref>; <xref ref-type="sec" rid="s9">Supplementary Material S3</xref>). To further confirm this specificity of genes in each cancer type, a union list of selected top 15 genes in the three cancer types - BLCA, PAAD, and HNSC was created. Results show a drop in mean AUROC scores from 0.87, 0.92, and 0.97 in BLCA, PAAD, and HNSC datasets respectively when 15 genes generated from each cancer type were used as variables to 0.83, 0.88, and 0.77 when the total of 45 genes in the list of unified genes were used as variables to predict DM (<xref ref-type="fig" rid="F4">Figure 4</xref>; <xref ref-type="sec" rid="s9">Supplementary Material S3</xref>).</p>
</sec>
<sec id="s3-3">
<title>Histopathology images and clinical and genomic data for multimodal prediction of DM</title>
<p>DenseNet121 was trained on histopathology images from BLCA, PAAD, and ALL3 groups, after the HNSC group was dropped due to small sample size. One WSI was preprocessed per patient as described in the methods section. For each group, classification models&#x2014;SVM, and MLP were trained on patient level features extracted by the trained CNN models, and prediction was carried out on the test set features (<xref ref-type="fig" rid="F2">Figure 2</xref>). The image features were also combined with genomic features from the same patient to build multimodal (Image &#x2b; Genomic) models.</p>
<p>Early results showed that metrics of multimodal (Image &#x2b; Genomic) models built from the ALL3 dataset generally improved on image, or genomic unimodal models by 1&#x2013;3 percent margin, with genomic data contributing more to the multimodal metrics by a large margin. Following a five-fold Monte Carlo cross-validation, a multimodal (Image &#x2b; Genomic) SVM classifier produced a mean AUROC score of 0.77, while the corresponding image and genomic unimodal models produced a score of 0.58, and 0.74 respectively. The mean AUROC scores from MLP classifiers are (Image &#x2b; Genomic &#x3d; 0.81; Image &#x3d; 0.59; Genomic &#x3d; 0.80). In either BLCA or PAAD dataset, results of multimodal (Image &#x2b; Genomic) models mostly improved on those of the image only models, however they were generally below that of genomic only models. The highest mean AUROC derived from a BLCA dataset multimodal (Image &#x2b; Genomic) model is 0.62, and the corresponding values from image only, and genomic only model are 0.57, and 0.80 respectively. Similar pattern was seen in the PAAD dataset [Image &#x2b; Genomic &#x3d; 0.85; Image &#x3d; 0.51; Genomic &#x3d; 0.90) (<xref ref-type="fig" rid="F6">Figure 6</xref>; <xref ref-type="sec" rid="s9">Supplementary Material S4</xref>)]. These outcomes emphasize the importance of a large dataset for building a more robust image model when a weakly supervised training technique is employed.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>
<bold>(A)</bold> SVM classifier clearly shows superiority of the multimodal (Clinical &#x2b; Genomic &#x2b; Image) model in the ALL3 dataset. <bold>(B)</bold> The performance of the (Clinical &#x2b; Genomic &#x2b; Image) model appears to be on par with the bimodal models when a MLP classifier is used. Comparing prediction metrics of the best performing bimodal (Image &#x2b; Genomic) model in each group to those of unimodal models. Results from the bimodal model from the <bold>(C)</bold> ALL3 dataset are clearly better than those from unimodal <bold>(D)</bold> BLCA dataset, and <bold>(E)</bold> PAAD dataset.</p>
</caption>
<graphic xlink:href="fbinf-03-1131021-g006.tif"/>
</fig>
<p>Based on these early results, we carried out further experiments with the ALL3 dataset. To evaluate the effect of using a CNN model with lower layers pre-finetuned on histopathology domain images as features extractor, fully connected layers were trained on KimiaNet with the ALL3 dataset histopathology images, and features derived from the model were combined with genomic data to predict DM in the test set. There was generally no additional advantage observed in the results of KimiaNet over those from DenseNet121. Highest mean AUROC from the MLP multimodal model (KimiaNet Image &#x2b; genomic data) is 0.77. Unimodal image and genomic models from the same dataset produced AUROC scores of 0.55, and 0.80 respectively. The same pattern was seen with the SVM classifier (Image &#x2b; Genomic &#x3d; 0.73; Image &#x3d; 0.56; Genomic &#x3d; 0.74) (<xref ref-type="sec" rid="s9">Supplementary Material S4</xref>).</p>
<p>Lastly, four clinical variables&#x2014;Age, T stage, N stage, and Number of lymph nodes positive by HE were concatenated with genomic, and histopathology features to train SVM, and MLP classifiers. This led to an improvement of the mean AUROC score of the Image &#x2b; Genomic SVM model from 0.77 to a score of 0.79. Mean accuracy, and F1 scores also improved from 0.67 and 0.67 to 0.71 and 0.71 respectively. MLP classifier however produced a mean AUROC score of 0.80 against the score of 0.81 achieved with Genomic &#x2b; Histopathology data (<xref ref-type="fig" rid="F6">Figure 6</xref>). Combining clinical with genomic data improved the metrics of the clinical only models, however there was no marked improvement from metrics of the genomic only models (<xref ref-type="fig" rid="F6">Figure 6</xref>; <xref ref-type="table" rid="T4">Tables 4</xref>, <xref ref-type="table" rid="T5">5</xref>).</p>
<table-wrap id="T4" position="float">
<label>TABLE 4</label>
<caption>
<p>Accuracy, F1, and AUROC scores derived from combinations of clinical, genomic and image data in the ALL3 dataset with a SVM classifier.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Data Type(s)</th>
<th align="left">Mean accuracy</th>
<th align="left">Accuracy Standard Deviation</th>
<th align="left">Mean F1 Score</th>
<th align="left">F1 Score Standard Deviation</th>
<th align="left">Mean AUROC</th>
<th align="left">AUROC Standard Deviation</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Clinical</td>
<td align="left">0.59</td>
<td align="left">0.061</td>
<td align="left">0.52</td>
<td align="left">0.070</td>
<td align="left">0.64</td>
<td align="left">0.085</td>
</tr>
<tr>
<td align="left">Image</td>
<td align="left">0.56</td>
<td align="left">0.073</td>
<td align="left">0.55</td>
<td align="left">0.106</td>
<td align="left">0.58</td>
<td align="left">0.075</td>
</tr>
<tr>
<td align="left">Genomic</td>
<td align="left">0.68</td>
<td align="left">0.071</td>
<td align="left">0.69</td>
<td align="left">0.080</td>
<td align="left">0.74</td>
<td align="left">0.080</td>
</tr>
<tr>
<td align="left">Genomic &#x2b; Image</td>
<td align="left">0.67</td>
<td align="left">0.078</td>
<td align="left">0.67</td>
<td align="left">0.079</td>
<td align="left">0.77</td>
<td align="left">0.092</td>
</tr>
<tr>
<td align="left">Clinical &#x2b; Genomic</td>
<td align="left">0.68</td>
<td align="left">0.096</td>
<td align="left">0.69</td>
<td align="left">0.121</td>
<td align="left">0.75</td>
<td align="left">0.085</td>
</tr>
<tr>
<td align="left">Genomic &#x2b; Image &#x2b; Clinical</td>
<td align="left">
<bold>0.71</bold>
</td>
<td align="left">0.063</td>
<td align="left">
<bold>0.71</bold>
</td>
<td align="left">0.055</td>
<td align="left">
<bold>0.79</bold>
</td>
<td align="left">0.087</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="T5" position="float">
<label>TABLE 5</label>
<caption>
<p>Accuracy, F1, and AUROC scores derived from combinations of clinical, genomic and image data in the ALL3 dataset with a MLP classifier.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Data Type(s)</th>
<th align="left">Mean accuracy</th>
<th align="left">Accuracy Standard Deviation</th>
<th align="left">Mean F1 Score</th>
<th align="left">F1 Score Standard Deviation</th>
<th align="left">Mean AUROC</th>
<th align="left">AUROC Standard Deviation</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Clinical</td>
<td align="left">0.51</td>
<td align="left">0.039</td>
<td align="left">0.26</td>
<td align="left">0.077</td>
<td align="left">0.59</td>
<td align="left">0.044</td>
</tr>
<tr>
<td align="left">Image</td>
<td align="left">0.55</td>
<td align="left">0.084</td>
<td align="left">0.51</td>
<td align="left">0.111</td>
<td align="left">0.59</td>
<td align="left">0.092</td>
</tr>
<tr>
<td align="left">Genomic</td>
<td align="left">0.72</td>
<td align="left">0.058</td>
<td align="left">0.72</td>
<td align="left">0.060</td>
<td align="left">0.80</td>
<td align="left">0.079</td>
</tr>
<tr>
<td align="left">Genomic &#x2b; Image</td>
<td align="left">
<bold>0.72</bold>
</td>
<td align="left">0.070</td>
<td align="left">0.73</td>
<td align="left">0.078</td>
<td align="left">
<bold>0.81</bold>
</td>
<td align="left">0.074</td>
</tr>
<tr>
<td align="left">Clinical &#x2b; Genomic</td>
<td align="left">0.72</td>
<td align="left">0.065</td>
<td align="left">0.72</td>
<td align="left">0.080</td>
<td align="left">0.79</td>
<td align="left">0.065</td>
</tr>
<tr>
<td align="left">Genomic &#x2b; Image &#x2b; Clinical</td>
<td align="left">0.72</td>
<td align="left">0.082</td>
<td align="left">
<bold>0.73</bold>
</td>
<td align="left">0.090</td>
<td align="left">0.80</td>
<td align="left">0.090</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>Given that there was barely any overlap between genes selected from the different cancer types considered and that prediction of metastasis in each of the cancer types with a union of genes selected from the three cancer types or the ALL3 group produced inferior results, we have been able to substantiate the claim that metastatic genes are more cancer type specific, rather than general, across carcinomas. As only three carcinoma types are considered in this study, studies including a larger number of carcinoma types will be needed to further solidify our findings. As shown on (<xref ref-type="table" rid="T3">Table 3</xref>), some of the selected genes have however been identified to be involved in the metastasis of multiple cancers. These further confirm the theory that even though metastatic genes tend to be specific to each carcinoma type, this specificity is based more on group of genes rather than individual genes, and that there is no single metastatic gene as have been reported in (<xref ref-type="bibr" rid="B68">Nguyen et al., 2022</xref>). The AUROC scores, and other metrics reported from our study, and other similar studies call for future works and development of diagnostics and therapeutics against DM to perhaps be more focused on the carcinoma type, rather than a general one size fits all approach. The <italic>ZNF</italic> and <italic>FBKP</italic> family of genes which is present in the list of selected genes in the BLCA, and HNSC datasets have been severally associated with cancer progression, and metastasis in multiple cancers (<xref ref-type="bibr" rid="B20">Fong et al., 2003</xref>; <xref ref-type="bibr" rid="B35">Jen and Wang, 2016</xref>; <xref ref-type="bibr" rid="B86">Sun et al., 2021</xref>), and should be investigated more for their roles in cancer metastasis.</p>
<p>Furthermore, with our novel combination of the RF algorithm and the described optimization technique, we have identified separate gene expression biomarkers of DM in the individual cancer types. Metrics of models built with these genes (AUC: BLCA &#x3d; 0.87; PAAD &#x3d; 0.92; HNSC &#x3d; 0.97) outperform those built from genes selected by DESeq2 DGE analysis (AUC: BLCA &#x3d; 0.65; PAAD &#x3d; 0.82; HNSC &#x3d; 0.92). While it is important to note that the datasets in our study are focused specifically on DM, similar studies on related tasks have been reported. Using just RF algorithm for gene selection in a breast cancer study (<xref ref-type="bibr" rid="B106">Yao et al., 2022</xref>), achieved an AUC of 0.52 in a metastasis and recurrence prediction task. (<xref ref-type="bibr" rid="B97">Wu et al., 2017</xref>), and (<xref ref-type="bibr" rid="B75">Qiao et al., 2020</xref>) achieved AUC scores of 0.71 and 0.84 respectively when genes derived from DESeq2 DGE analysis, and Boruta algorithm in the latter were combined with clinical data for prediction of lymph node metastasis in HNSC. In predicting metastasis status in pancreatic cancer samples (<xref ref-type="bibr" rid="B102">Xue et al., 2021</xref>), achieved a highest AUC score of 0.72 using DEG identified by the edgeR package. A 51 gene signature produced an AUC score of 0.82 in prediction of lymph node metastasis in bladder cancer as reported by (<xref ref-type="bibr" rid="B80">Seiler et al., 2016</xref>).</p>
<p>The reported AUC scores from our study were derived from the highest ranked 15 genes out of the initially selected 50. Perhaps a protein-protein interaction analysis including the initial 50 genes could lead to selection of fewer genes with higher relevance than the ranking we have employed here, which may in turn lead to improved prediction metrics. The biomarkers discovered in each cancer type using the strict ML approach we have employed should be further investigated for their potential as diagnostic indicators, and as basis for development of new therapies against cancer metastasis.</p>
<p>Our results indicate that multimodal data (Genomic &#x2b; Clinical &#x2b; Histopathology) provides a predictive advantage over either of the unimodal data types in the task of DM detection, however the contribution of only gene expression data is highest by a wide margin. Similar studies have acknowledged the superiority of multimodal data for prediction of medical diagnosis and various outcomes in cancer patients (<xref ref-type="bibr" rid="B64">Mobadersany et al., 2018</xref>; <xref ref-type="bibr" rid="B46">Keihl et al., 2021</xref>; <xref ref-type="bibr" rid="B12">Chen et al., 2022</xref>), however some studies have shown that this is not always the case (<xref ref-type="bibr" rid="B5">Brinker et al., 2021</xref>; <xref ref-type="bibr" rid="B89">Vale-Silva and Rohr, 2021</xref>). Past multimodal studies on prediction of metastasis from histopathology images have mostly combined clinical data with histopathology images. This combination used in prediction of nodal metastasis led to an improved AUC score of 74% in a study by (<xref ref-type="bibr" rid="B46">Keihl et al., 2021</xref>). In contrast, there was a 0.2% decrease from the only image AUC score of 61.8% reported by (<xref ref-type="bibr" rid="B5">Brinker et al., 2021</xref>) when clinical, and cellular data were included. Results from our study also suggest the importance of a larger dataset of images when a weakly supervised technique is employed as noted with the superior metrics of the multimodal models including histopathology images in the ALL3 dataset compared to BLCA, and PAAD dataset. A recent study by (<xref ref-type="bibr" rid="B106">Yao et al., 2022</xref>) reported a higher accuracy in model built from only genomic data compared to that built on combined genomic, and histopathology data for prediction of metastasis and recurrence in breast cancer. However, an AUC score of 0.75 was achieved when gene expression, histopathology images and clinical data were combined.</p>
<p>The highest mean AUC score of 0.79 derived from the SVM classifier in our study was from multimodal combination of gene expression, clinical, histopathology data, while the score of 0.81 derived from the MLP classifier was obtained from a genomic &#x2b; histopathology image model. Overall, our results indicate that combining clinical, genomic, and histopathology image data increases the prediction metrics for DM, however genomic data alone is a strong contributor.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s5">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="sec" rid="s9">Supplementary Material</xref>, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s6">
<title>Author contributions</title>
<p>IO: Conceptualization, Methodology, Software, Data Analysis, Writing; FC: Conceptualization, Data Analysis, Writing. All authors listed have made a substantial, direct, and intellectual contribution to the work and approved it for publication. All authors contributed to the article and approved the submitted version.</p>
</sec>
<ack>
<p>We acknowledge the resources and assistance provided by RIT (Rochester Institute of Technology) Research Computing which has made this work possible.</p>
</ack>
<sec sec-type="COI-statement" id="s7">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s8">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s9">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fbinf.2023.1131021/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fbinf.2023.1131021/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="Table2.XLSX" id="SM1" mimetype="application/XLSX" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table3.XLSX" id="SM2" mimetype="application/XLSX" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table5.DOCX" id="SM3" mimetype="application/DOCX" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table4.XLSX" id="SM4" mimetype="application/XLSX" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table1.XLSX" id="SM5" mimetype="application/XLSX" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ali</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Mubarik</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Zahid</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Sattar</surname>
<given-names>A. K.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Clinicopathologic features predictive of distant metastasis in patients diagnosed with invasive breast cancer</article-title>. <source>JCO Glob. Oncol.</source> <volume>6</volume>, <fpage>1346</fpage>&#x2013;<lpage>1351</lpage>. <pub-id pub-id-type="doi">10.1200/GO.20.00257</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Alla</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Engelmann</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>An</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Pahnke</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Schmidt</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Kunz</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). <article-title>E2F1 in melanoma progression and metastasis</article-title>. <source>JNCI J. Natl. Cancer Inst.</source> <volume>102</volume> (<issue>2</issue>), <fpage>127</fpage>&#x2013;<lpage>133</lpage>. <pub-id pub-id-type="doi">10.1093/jnci/djp458</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bednarek</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Selmi</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Wojkowska</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Karolczak</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Popielarski</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Stasiak</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Functional inhibition of F11 receptor (F11R/junctional adhesion molecule-A/JAM-A) activity by a F11R-derived peptide in breast cancer and its microenvironment</article-title>. <source>Breast Cancer Res. Treat.</source> <volume>179</volume> (<issue>2</issue>), <fpage>325</fpage>&#x2013;<lpage>335</lpage>. <pub-id pub-id-type="doi">10.1007/s10549-019-05471-x</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Bitter</surname>
<given-names>E. E.</given-names>
</name>
<name>
<surname>Morris</surname>
<given-names>R. M.</given-names>
</name>
<name>
<surname>Mortimer</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Barlow</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Schekall</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Townsend</surname>
<given-names>M. H.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). &#x201c;<article-title>The potential effects of thymidine kinase 1 on breast cancer invasion</article-title>,&#x201d; in <source>Proceedings of the American association for cancer research annual meeting 2022</source> (<publisher-loc>Philadelphia (PA)</publisher-loc>: <publisher-name>AACR; Cancer Res</publisher-name>).</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brinker</surname>
<given-names>T. J.</given-names>
</name>
<name>
<surname>Kiehl</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Schmitt</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Jutzi</surname>
<given-names>T. B.</given-names>
</name>
<name>
<surname>Krieghoff-Henning</surname>
<given-names>E. I.</given-names>
</name>
<name>
<surname>Krahl</surname>
<given-names>D.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Deep learning approach to predict sentinel lymph node status directly from routine histology of primary melanoma tumours</article-title>. <source>Eur. J. Cancer</source> <volume>154</volume>, <fpage>227</fpage>&#x2013;<lpage>234</lpage>. <pub-id pub-id-type="doi">10.1016/j.ejca.2021.05.026</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brosch</surname>
<given-names>P. K.</given-names>
</name>
<name>
<surname>Korsa</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Taban</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Glucose and inositol transporters, SLC5A1 and SLC5A3, in glioblastoma cell migration</article-title>. <source>Cancers</source> <volume>14</volume> (<issue>23</issue>), <fpage>5794</fpage>. <pub-id pub-id-type="doi">10.3390/cancers14235794</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cao</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Ramachandran</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Arumugam</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Deng</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Z.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>TM4SF1 regulates pancreatic cancer migration and invasion <italic>in vitro</italic> and <italic>in vivo</italic>
</article-title>. <source>Cell. Physiol. Biochem.</source> <volume>39</volume>, <fpage>740</fpage>&#x2013;<lpage>750</lpage>. <pub-id pub-id-type="doi">10.1159/000445664</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cao</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Fang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>&#x3c;p&#x26;gt;The lncRNA HOXA11-AS regulates Rab3D expression by sponging miR-125a-5p promoting metastasis of osteosarcoma&#x26;lt;/p&#x26;gt;</article-title>. <source>Cancer Manag. Res.</source> <volume>11</volume>, <fpage>4505</fpage>&#x2013;<lpage>4518</lpage>. <pub-id pub-id-type="doi">10.2147/CMAR.S196025</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Carper</surname>
<given-names>M. .B.</given-names>
</name>
<name>
<surname>Denvir</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Boskovic</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Primerano</surname>
<given-names>D. A.</given-names>
</name>
<name>
<surname>Claudio</surname>
<given-names>P. P.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>RGS16, a novel p53 and pRb cross-talk candidate inhibits migration and invasion of pancreatic cancer cells</article-title>. <source>Genes. Cancer</source> <volume>5</volume> (<issue>11-12</issue>), <fpage>420</fpage>&#x2013;<lpage>435</lpage>. <pub-id pub-id-type="doi">10.18632/genesandcancer.43</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chai</surname>
<given-names>R. C.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Pang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Recurrent PTPRZ1-MET fusion and a high occurrence rate of MET exon 14 skipping in brain metastases</article-title>. <source>Cancer Sci.</source> <volume>113</volume>, <fpage>796</fpage>&#x2013;<lpage>801</lpage>. <pub-id pub-id-type="doi">10.1111/cas.15211</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>R. J.</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>M. Y.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Williamson</surname>
<given-names>D. F. K.</given-names>
</name>
<name>
<surname>Rodig</surname>
<given-names>S. J.</given-names>
</name>
<name>
<surname>Lindeman</surname>
<given-names>N. I.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Pathomic fusion: An integrated framework for fusing histopathology and genomic features for cancer diagnosis and prognosis</article-title>. <source>IEEE Trans. Med. Imaging</source> <volume>41</volume> (<issue>4</issue>), <fpage>757</fpage>&#x2013;<lpage>770</lpage>. <pub-id pub-id-type="doi">10.1109/tmi.2020.3021387</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chuang</surname>
<given-names>W. Y.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>C. C.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>W. H.</given-names>
</name>
<name>
<surname>Yeh</surname>
<given-names>C. J.</given-names>
</name>
<name>
<surname>Chang</surname>
<given-names>S. H.</given-names>
</name>
<name>
<surname>Ueng</surname>
<given-names>S. H.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Identification of nodal micrometastasis in colorectal cancer using deep learning on annotation-free whole-slide images</article-title>. <source>Mod. Pathol.</source> <volume>34</volume> (<issue>10</issue>), <fpage>1901</fpage>&#x2013;<lpage>1911</lpage>. <pub-id pub-id-type="doi">10.1038/s41379-021-00838-2</pub-id>
</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Czubak-Prowizor</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Babinska</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Swiatkowska</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>The F11 receptor (F11r)/junctional adhesion molecule-A (JAM-A) (F11R/JAM-A) in cancer progression</article-title>. <source>Mol. Cell. Biochem.</source> <volume>477</volume>, <fpage>79</fpage>&#x2013;<lpage>98</lpage>. <pub-id pub-id-type="doi">10.1007/s11010-021-04259-2</pub-id>
</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dai</surname>
<given-names>J. B.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>W. J.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>H. Y.</given-names>
</name>
<name>
<surname>Dai</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>L.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Identification of prognostic significance of BIRC5 in breast cancer using integrative bioinformatics analysis</article-title>. <source>Biosci. Rep.</source> <volume>40</volume> (<issue>2</issue>), <fpage>BSR20193678</fpage>. <pub-id pub-id-type="doi">10.1042/BSR20193678</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>D&#xed;az-Uriarte</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Alvarez de Andr&#xe9;s</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Gene selection and classification of microarray data using random forest</article-title>. <source>BMC Bioinforma.</source> <volume>7</volume>, <fpage>3</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-7-3</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fanelli</surname>
<given-names>G. N.</given-names>
</name>
<name>
<surname>Scarpitta</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Cinacchi</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Fuochi</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Szumera-Cie&#x107;kiewicz</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>De Ieso</surname>
<given-names>K.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Immunohistochemistry for thymidine kinase-1 (TK1): A potential tool for the prognostic stratification of breast cancer patients</article-title>. <source>J. Clin. Med.</source> <volume>10</volume>, <fpage>5416</fpage>. <pub-id pub-id-type="doi">10.3390/jcm10225416</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fares</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Fares</surname>
<given-names>M. Y.</given-names>
</name>
<name>
<surname>Khachfe</surname>
<given-names>H. H.</given-names>
</name>
<name>
<surname>Salhab</surname>
<given-names>H. A.</given-names>
</name>
<name>
<surname>Fares</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Molecular principles of metastasis: A hallmark of cancer revisited</article-title>. <source>Sig Transduct. Target Ther.</source> <volume>5</volume>, <fpage>28</fpage>. <pub-id pub-id-type="doi">10.1038/s41392-020-0134-x</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fong</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Mounkes</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Maibaum</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Alonzo</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Desprez</surname>
<given-names>P. Y.</given-names>
</name>
<etal/>
</person-group> (<year>2003</year>). <article-title>Functional identification of distinct sets of antitumor activities mediated by the FKBP gene family</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>100</volume> (<issue>24</issue>), <fpage>14253</fpage>&#x2013;<lpage>14258</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.2332307100</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fu</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Xiao</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Zou</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Pei</surname>
<given-names>L.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Expression of receptor protein tyrosine phosphatase &#x3b6; is a risk factor for triple negative breast cancer relapse</article-title>. <source>Biomed. Rep.</source> <volume>4</volume>, <fpage>167</fpage>&#x2013;<lpage>172</lpage>. <pub-id pub-id-type="doi">10.3892/br.2016.570</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gao</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Yao</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Feng</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>Z.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>TM4SF1 is a potential target for anti-invasion and metastasis in ovarian cancer</article-title>. <source>BMC Cancer</source> <volume>19</volume>, <fpage>237</fpage>. <pub-id pub-id-type="doi">10.1186/s12885-019-5417-7</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gao</surname>
<given-names>S. L.</given-names>
</name>
<name>
<surname>Yin</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>L. F.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>J. S.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>X. Y.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>The oncogenic role of MUC12 in RCC progression depends on c-Jun/TGF-&#x3b2; signalling</article-title>. <source>J. Cell. Mol. Med.</source> <volume>24</volume> (<issue>15</issue>), <fpage>8789</fpage>&#x2013;<lpage>8802</lpage>. <pub-id pub-id-type="doi">10.1111/jcmm.15515</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Goody</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Gupta</surname>
<given-names>S. K.</given-names>
</name>
<name>
<surname>Engelmann</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Spitschak</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Marquardt</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Mikkat</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Drug repositioning inferred from E2F1-coregulator interactions studies for the prevention and treatment of metastatic cancers</article-title>. <source>Theranostics</source> <volume>9</volume> (<issue>5</issue>), <fpage>1490</fpage>&#x2013;<lpage>1509</lpage>. <pub-id pub-id-type="doi">10.7150/thno.29546</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gu</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Overexpression of CLC-3 is regulated by XRCC5 and is a poor prognostic biomarker for gastric cancer</article-title>. <source>J. Hematol. Oncol.</source> <volume>11</volume>, <fpage>115</fpage>. <pub-id pub-id-type="doi">10.1186/s13045-018-0660-y</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guo</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Gu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Pei</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Hao</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2023</year>). <article-title>LINC01146/F11R facilitates growth and metastasis of prostate cancer under the regulation of TGF-&#x3b2;</article-title>. <source>J. Steroid Biochem. Mol. Biol.</source> <volume>225</volume>, <fpage>106193</fpage>. <pub-id pub-id-type="doi">10.1016/j.jsbmb.2022.106193</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>He</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Fornander</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Johansson</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Johansson</surname>
<given-names>U.</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>G. Z.</given-names>
</name>
<name>
<surname>Rutqvist</surname>
<given-names>L. E.</given-names>
</name>
<etal/>
</person-group> (<year>2006</year>). <article-title>Thymidine kinase 1 in serum predicts increased risk of distant or loco-regional recurrence following surgery in patients with early breast cancer</article-title>. <source>Anticancer Res.</source> <volume>26</volume> (<issue>6C</issue>), <fpage>4753</fpage>&#x2013;<lpage>4759</lpage>.</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hennigs</surname>
<given-names>J. K.</given-names>
</name>
<name>
<surname>Minner</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Tennstedt</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>L&#xf6;ser</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Huland</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Klose</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Subcellular compartmentalization of survivin is associated with biological aggressiveness and prognosis in prostate cancer</article-title>. <source>Sci. Rep.</source> <volume>10</volume>, <fpage>3250</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-020-60064-9</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ho</surname>
<given-names>T. H.</given-names>
</name>
<name>
<surname>Serie</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Parasramka</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Cheville</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Bot</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Tan</surname>
<given-names>W.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Differential gene expression profiling of matched primary renal cell carcinoma and metastases reveals upregulation of extracellular matrix genes</article-title>. <source>Ann. Oncol.</source> <volume>28</volume>, <fpage>604</fpage>&#x2013;<lpage>610</lpage>. <pub-id pub-id-type="doi">10.1093/annonc/mdw652</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hollern</surname>
<given-names>D. P.</given-names>
</name>
<name>
<surname>Swiatnicki</surname>
<given-names>M. R.</given-names>
</name>
<name>
<surname>Rennhack</surname>
<given-names>J. P.</given-names>
</name>
<name>
<surname>Misek</surname>
<given-names>S. A.</given-names>
</name>
<name>
<surname>Matson</surname>
<given-names>B. C.</given-names>
</name>
<name>
<surname>McAuliff</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>E2F1 drives breast cancer metastasis by regulating the target gene FGF13 and altering cell migration</article-title>. <source>Sci. Rep.</source> <volume>9</volume> (<issue>1</issue>), <fpage>10718</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-019-47218-0</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Huang</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Weinberger</surname>
<given-names>K. Q.</given-names>
</name>
</person-group> (<year>2017</year>). <source>IEEE conference on computer vision and pattern recognition</source>. <publisher-loc>USA</publisher-loc>: <publisher-name>CVPR</publisher-name>. <pub-id pub-id-type="doi">10.48550/arXiv.1608.06993</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huang</surname>
<given-names>S. C.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>C. C.</given-names>
</name>
<name>
<surname>Lan</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Hsieh</surname>
<given-names>T. Y.</given-names>
</name>
<name>
<surname>Chuang</surname>
<given-names>H. C.</given-names>
</name>
<name>
<surname>Chien</surname>
<given-names>M. Y.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Deep neural network trained on gigapixel images improves lymph node metastasis detection in clinical settings</article-title>. <source>Nat. Commun.</source> <volume>13</volume>, <fpage>3347</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-022-30746-1</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huang</surname>
<given-names>Y.-K.</given-names>
</name>
<name>
<surname>Fan</surname>
<given-names>X.-G.</given-names>
</name>
<name>
<surname>Qiu</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>TM4SF1 promotes proliferation, invasion, and metastasis in human liver cancer cells</article-title>. <source>Int. J. Mol. Sci.</source> <volume>17</volume>, <fpage>661</fpage>. <pub-id pub-id-type="doi">10.3390/ijms17050661</pub-id>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jen</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y. C.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Zinc finger proteins in cancer progression</article-title>. <source>J. Biomed. Sci.</source> <volume>23</volume> (<issue>1</issue>), <fpage>53</fpage>. <pub-id pub-id-type="doi">10.1186/s12929-016-0269-9</pub-id>
</citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jiashi</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Chuang</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Zhenjun</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Guangbin</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Bin</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Ming</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>MicroRNA-506-3p inhibits osteosarcoma cell proliferation and metastasis by suppressing RAB3D expression</article-title>. <source>Aging (Albany NY)</source> <volume>10</volume>, <fpage>1294</fpage>&#x2013;<lpage>1305</lpage>. <pub-id pub-id-type="doi">10.18632/aging.101468</pub-id>
</citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jin</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Miao</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Nie</surname>
<given-names>Y. W.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>Y. Y.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Investigating resistin like beta (RETNLB) as a tumor promoter for oral squamous cell carcinoma</article-title>. <source>Head. Face Med.</source> <volume>17</volume> (<issue>1</issue>), <fpage>20</fpage>. <pub-id pub-id-type="doi">10.1186/s13005-021-00272-4</pub-id>
</citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jin</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Yao</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Fu</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Yao</surname>
<given-names>X.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>Differential secretome analysis reveals CST6 as a suppressor of breast cancer bone metastasis</article-title>. <source>Cell. Res.</source> <volume>22</volume>, <fpage>1356</fpage>&#x2013;<lpage>1373</lpage>. <pub-id pub-id-type="doi">10.1038/cr.2012.90</pub-id>
</citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jing</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Ruan</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>The PABPC5/HCG15/znf331 feedback loop regulates vasculogenic mimicry of glioma via STAU1-mediated mRNA decay</article-title>. <source>Mol. Ther. Oncolytics</source> <volume>17</volume>, <fpage>216</fpage>&#x2013;<lpage>231</lpage>. <pub-id pub-id-type="doi">10.1016/j.omto.2020.03.017</pub-id>
</citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Johnstone</surname>
<given-names>C. N.</given-names>
</name>
<name>
<surname>Pattison</surname>
<given-names>A. D.</given-names>
</name>
<name>
<surname>Gorringe</surname>
<given-names>K. L.</given-names>
</name>
<name>
<surname>Harrison</surname>
<given-names>P. F.</given-names>
</name>
<name>
<surname>Powell</surname>
<given-names>D. R.</given-names>
</name>
<name>
<surname>Lock</surname>
<given-names>P.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Functional and genomic characterisation of a xenograft model system for the study of metastasis in triple-negative breast cancer</article-title>. <source>Dis. Model. Mech.</source> <volume>11</volume> (<issue>5</issue>), <fpage>dmm032250</fpage>. <pub-id pub-id-type="doi">10.1242/dmm.032250</pub-id>
</citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kanda</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Murotani</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Sugimoto</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Miwa</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Umeda</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Suenaga</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>An integrated multigene expression panel to predict long-term survival after curative hepatectomy in patients with hepatocellular carcinoma</article-title>. <source>Oncotarget</source> <volume>8</volume>, <fpage>71070</fpage>&#x2013;<lpage>71079</lpage>. <pub-id pub-id-type="doi">10.18632/oncotarget.20369</pub-id>
</citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kaur</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Chandrashekar</surname>
<given-names>D. S.</given-names>
</name>
<name>
<surname>Varga</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Sobottka</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Janssen</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Kowalski</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Distinct gene expression profiles of matched primary and metastatic triple-negative breast cancers</article-title>. <source>Cancers (Basel)</source> <volume>14</volume> (<issue>10</issue>), <fpage>2447</fpage>. <pub-id pub-id-type="doi">10.3390/cancers14102447</pub-id>
</citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kim</surname>
<given-names>J. H.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>J. Y.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>K. T.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>K. H.</given-names>
</name>
<name>
<surname>Jang</surname>
<given-names>K. T.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>RGS16 and FosB underexpressed in pancreatic cancer with lymph node metastasis promote tumor progression</article-title>. <source>Tumour Biol. J. Int. Soc. Oncodevelopmental Biol. Med.</source> <volume>31</volume> (<issue>5</issue>), <fpage>541</fpage>&#x2013;<lpage>548</lpage>. <pub-id pub-id-type="doi">10.1007/s13277-010-0067-z</pub-id>
</citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Krieg</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Werner</surname>
<given-names>T. A.</given-names>
</name>
<name>
<surname>Verde</surname>
<given-names>P. E.</given-names>
</name>
<name>
<surname>Stoecklein</surname>
<given-names>N. H.</given-names>
</name>
<name>
<surname>Knoefel</surname>
<given-names>W. T.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Prognostic and clinicopathological significance of survivin in colorectal cancer: A meta-analysis</article-title>. <source>PLOS ONE</source> <volume>8</volume> (<issue>6</issue>), <fpage>e65338</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0065338</pub-id>
</citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kiehl</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Kuntz</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>H&#xf6;hn</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Jutzi</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Krieghoff-Henning</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Kather</surname>
<given-names>J. N.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Deep learning can predict lymph node status directly from histology in colorectal cancer</article-title>. <source>Eur. J. Cancer</source> <volume>157</volume>, <fpage>464</fpage>&#x2013;<lpage>473</lpage>. <pub-id pub-id-type="doi">10.1016/j.ejca.2021.08.039</pub-id>
</citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Bai</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Filopodia associated promotes hepatocellular carcinoma metastasis by altering the metabolic status of cancer cells through RAB3D</article-title>. <source>Hepatology</source> <volume>73</volume>, <fpage>2361</fpage>&#x2013;<lpage>2379</lpage>. <pub-id pub-id-type="doi">10.1002/hep.31641</pub-id>
</citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Liang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Lian</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Xiao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>He</surname>
<given-names>Y.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>CST6 protein and peptides inhibit breast cancer bone metastasis by suppressing CTSB activity and osteoclastogenesis</article-title>. <source>Theranostics</source> <volume>11</volume> (<issue>20</issue>), <fpage>9821</fpage>&#x2013;<lpage>9832</lpage>. <pub-id pub-id-type="doi">10.7150/thno.62187</pub-id>
</citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Mo</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>He</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Xie</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>JiangZhong</surname>
<given-names>F. W.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>E2F1 promotes tumor cell invasion and migration through regulating CD147 in prostate cancer</article-title>. <source>Int. J. Oncol.</source> <volume>48</volume>, <fpage>1650</fpage>&#x2013;<lpage>1658</lpage>. <pub-id pub-id-type="doi">10.3892/ijo.2016.3364</pub-id>
</citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Zhan</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Genomics alterations of metastatic and primary tissues across 15 cancer types</article-title>. <source>Sci. Rep.</source> <volume>7</volume> (<issue>1</issue>), <fpage>13262</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-017-13650-3</pub-id>
</citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Fu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>K.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Predictors of distant metastasis on exploration in patients with potentially resectable pancreatic cancer</article-title>. <source>BMC Gastroenterol.</source> <volume>18</volume>, <fpage>168</fpage>. <pub-id pub-id-type="doi">10.1186/s12876-018-0891-y</pub-id>
</citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Higashitsuji</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Itoh</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Yamaguchi</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Umemura</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Itoh</surname>
<given-names>Y.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Onecut1 partially contributes to the liver progenitor cell transition and acquisition of metastatic potential in hepatocellular carcinoma</article-title>. <comment>bioRxiv</comment>. <pub-id pub-id-type="doi">10.1101/2022.09.20.508738</pub-id>
</citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Meng</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>K.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Predicting distant metastasis and chemotherapy benefit in locally advanced rectal cancer</article-title>. <source>Nat. Commun.</source> <volume>11</volume>, <fpage>4308</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-020-18162-9</pub-id>
</citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Love</surname>
<given-names>M. I.</given-names>
</name>
<name>
<surname>Huber</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Anders</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2</article-title>. <source>Genome Biol.</source> <volume>15</volume>, <fpage>550</fpage>. <pub-id pub-id-type="doi">10.1186/s13059-014-0550-8</pub-id>
</citation>
</ref>
<ref id="B56">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Wei</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Mo</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>L.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Direct targeting of MAPK8IP1 by miR-10a-5p is a major mechanism for gastric cancer metastasis</article-title>. <source>Oncol. Lett.</source> <volume>13</volume> (<issue>3</issue>), <fpage>1131</fpage>&#x2013;<lpage>1136</lpage>. <pub-id pub-id-type="doi">10.3892/ol.2016.5544</pub-id>
</citation>
</ref>
<ref id="B57">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Luo</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Ye</surname>
<given-names>G. Y.</given-names>
</name>
<name>
<surname>Qin</surname>
<given-names>S. L.</given-names>
</name>
<name>
<surname>Mu</surname>
<given-names>Y. F.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Qi</surname>
<given-names>Y.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>High expression of Rab3D predicts poor prognosis and associates with tumor progression in colorectal cancer</article-title>. <source>Int. J. Biochem. Cell. Biol.</source> <volume>75</volume>, <fpage>53</fpage>&#x2013;<lpage>62</lpage>. <pub-id pub-id-type="doi">10.1016/j.biocel.2016.03.017</pub-id>
</citation>
</ref>
<ref id="B58">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ma</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>CircRNA PRH1-PRR4 stimulates RAB3D to regulate the malignant progression of NSCLC by sponging miR-877-5p</article-title>. <source>Thorac. Cancer</source> <volume>13</volume>, <fpage>690</fpage>&#x2013;<lpage>701</lpage>. <pub-id pub-id-type="doi">10.1111/1759-7714.14264</pub-id>
</citation>
</ref>
<ref id="B59">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ma</surname>
<given-names>Q. Y.</given-names>
</name>
<name>
<surname>Pang</surname>
<given-names>L. W.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Z. M.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>Y. J.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>The significance of MAGED4 expression in non-small cell lung cancer as analyzed by real-time fluorescence quantitative PCR</article-title>. <source>Oncol. Lett.</source> <volume>4</volume> (<issue>4</issue>), <fpage>733</fpage>&#x2013;<lpage>738</lpage>. <pub-id pub-id-type="doi">10.3892/ol.2012.786</pub-id>
</citation>
</ref>
<ref id="B60">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Maezawa</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Sakamaki</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Oue</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Kimura</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Hashimoto</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Hara</surname>
<given-names>K.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>High gamma-glutamyl hydrolase and low folylpolyglutamate synthetase expression as prognostic biomarkers in patients with locally advanced gastric cancer who were administrated postoperative adjuvant chemotherapy with S-1</article-title>. <source>J. Cancer Res. Clin. Oncol.</source> <volume>146</volume>, <fpage>75</fpage>&#x2013;<lpage>86</lpage>. <pub-id pub-id-type="doi">10.1007/s00432-019-03087-8</pub-id>
</citation>
</ref>
<ref id="B61">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Malvi</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Janostiak</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Nagarajan</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Cai</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Wajapeyee</surname>
<given-names>N.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Loss of thymidine kinase 1 inhibits lung cancer growth and metastatic attributes by reducing GDF15 expression</article-title>. <source>PLOS Genet.</source> <volume>15</volume> (<issue>10</issue>), <fpage>e1008439</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1008439</pub-id>
</citation>
</ref>
<ref id="B62">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mamoor</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>KLHL3 is a differentially expressed gene in human metastatic breast cancer, in the brain and in the lymph nodes</article-title>. <comment>Preprint</comment>. <pub-id pub-id-type="doi">10.31219/osf.io/nghx4</pub-id>
</citation>
</ref>
<ref id="B63">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Matsuyama</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Ishikawa</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Mogushi</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Yoshida</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Iida</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Uetake</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). <article-title>
<italic>MUC12</italic> mRNA expression is an independent marker of prognosis in stage II and stage III colorectal cancer</article-title>. <source>Int. J. Cancer</source> <volume>127</volume>, <fpage>2292</fpage>&#x2013;<lpage>2299</lpage>. <pub-id pub-id-type="doi">10.1002/ijc.25256</pub-id>
</citation>
</ref>
<ref id="B64">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mobadersany</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Yousefi</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Amgad</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Gutman</surname>
<given-names>D. A.</given-names>
</name>
<name>
<surname>Barnholtz-Sloan</surname>
<given-names>J. S.</given-names>
</name>
<name>
<surname>Vel&#xe1;zquez Vega</surname>
<given-names>J. .E.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Predicting cancer outcomes from histology and genomics using convolutional networks</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>115</volume> (<issue>13</issue>), <fpage>E2970</fpage>&#x2013;<lpage>E2979</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1717139115</pub-id>
</citation>
</ref>
<ref id="B65">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mori</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Yokota</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Hoshino</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Iwatate</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Wakamatsu</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Uno</surname>
<given-names>T.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Deep learning-based gene selection in comprehensive gene analysis in pancreatic cancer</article-title>. <source>Sci. Rep.</source> <volume>11</volume>, <fpage>16521</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-021-95969-6</pub-id>
</citation>
</ref>
<ref id="B66">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Mu</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Suppression of CLC-3 reduces the proliferation, invasion and migration of colorectal cancer through Wnt/&#x3b2;-catenin signaling pathway</article-title>. <source>Biochem. Biophysical Res. Commun.</source> <volume>533</volume> (<issue>4</issue>), <fpage>1240</fpage>&#x2013;<lpage>1246</lpage>. <pub-id pub-id-type="doi">10.1016/j.bbrc.2020.09.125</pub-id>
</citation>
</ref>
<ref id="B67">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mungenast</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Fernando</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Nica</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Boghiu</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Lungu</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Batra</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Next-generation digital histopathology of the tumor microenvironment</article-title>. <source>Genes. (Basel)</source> <volume>12</volume> (<issue>4</issue>), <fpage>538</fpage>. <pub-id pub-id-type="doi">10.3390/genes12040538</pub-id>
</citation>
</ref>
<ref id="B68">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nguyen</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Fong</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Luthra</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Smith</surname>
<given-names>S. A.</given-names>
</name>
<name>
<surname>DiNatale</surname>
<given-names>R. G.</given-names>
</name>
<name>
<surname>Nandakumar</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Genomic characterization of metastatic patterns from prospective clinical sequencing of 25,000 patients</article-title>. <source>Cell.</source> <volume>185</volume> (<issue>3</issue>), <fpage>563</fpage>&#x2013;<lpage>575</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2022.01.003</pub-id>
</citation>
</ref>
<ref id="B69">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nivedhitha</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Vincent</surname>
<given-names>D. P. M.</given-names>
</name>
<name>
<surname>S Kathiravan</surname>
<given-names>C. C. Y.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Machine learning based computational gene selection models: A survey, performance evaluation, open issues, and future research directions</article-title>. <source>Front. Genet.</source> <volume>11</volume>, <fpage>603808</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2020.603808</pub-id>
</citation>
</ref>
<ref id="B70">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Oparina</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Erlandsson</surname>
<given-names>M. C.</given-names>
</name>
<name>
<surname>F&#xe4;ldt Beding</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Parris</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Helou</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Karlsson</surname>
<given-names>P.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Prognostic significance of BIRC5/survivin in breast cancer: Results from three independent cohorts</article-title>. <source>Cancers</source> <volume>13</volume>, <fpage>2209</fpage>. <pub-id pub-id-type="doi">10.3390/cancers13092209</pub-id>
</citation>
</ref>
<ref id="B72">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Park</surname>
<given-names>Y. R.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>S. T.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>S. L.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y. C.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>M. R.</given-names>
</name>
<name>
<surname>Shin</surname>
<given-names>J. H.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>MicroRNA-9 suppresses cell migration and invasion through downregulation of TM4SF1 in colorectal cancer</article-title>. <source>Int. J. Oncol.</source> <volume>48</volume>, <fpage>2135</fpage>&#x2013;<lpage>2413</lpage>. <pub-id pub-id-type="doi">10.3892/ijo.2016.3430</pub-id>
</citation>
</ref>
<ref id="B73">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peng</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Yuan</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>He</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Overexpression of chloride channel-3 predicts unfavorable prognosis and promotes cellular invasion in gastric cancer</article-title>. <source>Cancer Manag. Res.</source> <volume>10</volume>, <fpage>1163</fpage>&#x2013;<lpage>1175</lpage>. <pub-id pub-id-type="doi">10.2147/CMAR.S159790</pub-id>
</citation>
</ref>
<ref id="B74">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pisani</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Airoldi</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Allais</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Aluffi Valletti</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Battista</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Benazzo</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Metastatic disease in head &#x26; neck oncology</article-title>. <source>Acta otorhinolaryngol. Ital. organo uff. della Soc. ital. otorinolaringol. chir. cerv. facc.</source> <volume>40</volume> (<issue>1</issue>), <fpage>S1</fpage>&#x2013;<lpage>S86</lpage>. <pub-id pub-id-type="doi">10.14639/0392-100X-suppl.1-40-2020</pub-id>
</citation>
</ref>
<ref id="B75">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Qiao</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Meng</surname>
<given-names>F.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>A novel RNA-seq-based model for preoperative prediction of lymph node metastasis in oral squamous cell carcinoma</article-title>. <source>BioMed Res. Int.</source> <volume>2020</volume>, <fpage>1</fpage>&#x2013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1155/2020/4252580</pub-id>
</citation>
</ref>
<ref id="B76">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Riasatian</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Babaie</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Maleki</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Kalra</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Fine-Tuning and training of densenet for histopathology image representation using TCGA diagnostic slides</article-title>. <source>Med. Image Anal.</source> <volume>70</volume>, <fpage>102032</fpage>. <pub-id pub-id-type="doi">10.1016/j.media.2021.102032</pub-id>
</citation>
</ref>
<ref id="B77">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Riker</surname>
<given-names>A. I.</given-names>
</name>
<name>
<surname>Enkemann</surname>
<given-names>S. A.</given-names>
</name>
<name>
<surname>Fodstad</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Ren</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Morris</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2008</year>). <article-title>The gene expression profiles of primary and metastatic melanoma yields a transition point of tumor progression and metastasis</article-title>. <source>BMC Med. Genomics</source> <volume>1</volume>, <fpage>13</fpage>. <pub-id pub-id-type="doi">10.1186/1755-8794-1-13</pub-id>
</citation>
</ref>
<ref id="B78">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rivenbark</surname>
<given-names>A. G.</given-names>
</name>
<name>
<surname>Livasy</surname>
<given-names>C. A.</given-names>
</name>
<name>
<surname>Boyd</surname>
<given-names>C. E.</given-names>
</name>
<name>
<surname>Keppler</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Coleman</surname>
<given-names>W. B.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Methylation-dependent silencing of CST6 in primary human breast tumors and metastatic lesions</article-title>. <source>Exp. Mol. Pathology</source> <volume>83</volume> (<issue>2</issue>), <fpage>188</fpage>&#x2013;<lpage>197</lpage>. <pub-id pub-id-type="doi">10.1016/j.yexmp.2007.03.008</pub-id>
</citation>
</ref>
<ref id="B79">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schiele</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Arndt</surname>
<given-names>T. T.</given-names>
</name>
<name>
<surname>Martin</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Miller</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Bauer</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Banner</surname>
<given-names>B. M.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Deep learning prediction of metastasis in locally advanced colon cancer using binary histologic tumor images</article-title>. <source>Cancers (Basel)</source> <volume>13</volume> (<issue>9</issue>), <fpage>2074</fpage>. <pub-id pub-id-type="doi">10.3390/cancers13092074</pub-id>
</citation>
</ref>
<ref id="B80">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Seiler</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Lam</surname>
<given-names>L. L.</given-names>
</name>
<name>
<surname>Erho</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Takhar</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Mitra</surname>
<given-names>A. P.</given-names>
</name>
<name>
<surname>Buerki</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Prediction of lymph node metastasis in patients with bladder cancer using whole transcriptome gene expression signatures</article-title>. <source>J. Urol.</source> <volume>196</volume> (<issue>4</issue>), <fpage>1036</fpage>&#x2013;<lpage>1041</lpage>. <pub-id pub-id-type="doi">10.1016/j.juro.2016.04.061</pub-id>
</citation>
</ref>
<ref id="B81">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sereesongsaeng</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>McDowell</surname>
<given-names>S. H.</given-names>
</name>
<name>
<surname>Burrows</surname>
<given-names>J. F.</given-names>
</name>
<name>
<surname>Scott</surname>
<given-names>C. J.</given-names>
</name>
<name>
<surname>Burden</surname>
<given-names>R. E.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Cathepsin V suppresses GATA3 protein expression in luminal A breast cancer</article-title>. <source>Breast Cancer Res.</source> <volume>22</volume> (<issue>1</issue>), <fpage>139</fpage>. <pub-id pub-id-type="doi">10.1186/s13058-020-01376-6</pub-id>
</citation>
</ref>
<ref id="B82">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Song</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Pan</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Yi</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Wei</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Identification of novel key genes associated with the metastasis of prostate cancer based on bioinformatics prediction and validation</article-title>. <source>Cancer Cell. Int.</source> <volume>21</volume>, <fpage>559</fpage>. <pub-id pub-id-type="doi">10.1186/s12935-021-02258-3</pub-id>
</citation>
</ref>
<ref id="B85">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sun</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Charbonneau</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Wei</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Terek</surname>
<given-names>R. M.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>miR-181a targets RGS16 to promote chondrosarcoma growth, angiogenesis, and metastasis</article-title>. <source>Mol. Cancer Res.</source> <volume>13</volume> (<issue>9</issue>), <fpage>1347</fpage>&#x2013;<lpage>1357</lpage>. <pub-id pub-id-type="doi">10.1158/1541-7786.MCR-14-0697</pub-id>
</citation>
</ref>
<ref id="B86">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sun</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Qin</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Fang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Fan</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Multi-omics analysis of the expression and prognosis for FKBP gene family in renal cancer</article-title>. <source>Front. Oncol.</source> <volume>11</volume>, <fpage>697534</fpage>. <pub-id pub-id-type="doi">10.3389/fonc.2021.697534</pub-id>
</citation>
</ref>
<ref id="B87">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tang</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Di</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Yuan</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Z.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>TM4SF1 promotes EMT and cancer stemness via the Wnt/&#x3b2;-catenin/SOX2 pathway in colorectal cancer</article-title>. <source>J. Exp. Clin. Cancer Res.</source> <volume>39</volume>, <fpage>232</fpage>. <pub-id pub-id-type="doi">10.1186/s13046-020-01690-z</pub-id>
</citation>
</ref>
<ref id="B88">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tao</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Mingfa</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Yan</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Yuanzhi</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Zhennan</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Haoqi</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Lcn2-derived circular RNA (hsa_circ_0088732) inhibits cell apoptosis and promotes EMT in glioma via the miR-661/rab3d Axis</article-title>. <source>Front. Oncol.</source> <volume>10</volume>, <fpage>170</fpage>. <pub-id pub-id-type="doi">10.3389/fonc.2020.00170</pub-id>
</citation>
</ref>
<ref id="B113">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Terashima</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ichikawa</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Ochiai</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Kitada</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Kurahashi</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Sakuramoto</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>TOP2A, GGH, and PECAM1 are associated with hematogenous, lymph node, and peritoneal recurrence in stage II/III gastric cancer patients enrolled in the ACTS-GC study</article-title>. <source>Oncotarget.</source> <volume>8</volume> (<issue>34</issue>), <fpage>57574</fpage>&#x2013;<lpage>57582</lpage>. <pub-id pub-id-type="doi">10.18632/oncotarget.15895</pub-id>
</citation>
</ref>
<ref id="B89">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vale-Silva</surname>
<given-names>L. A.</given-names>
</name>
<name>
<surname>Rohr</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Long-term cancer survival prediction using multimodal deep learning</article-title>. <source>Sci. Rep.</source> <volume>11</volume>, <fpage>13505</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-021-92799-4</pub-id>
</citation>
</ref>
<ref id="B90">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>van de Vijver</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>He</surname>
<given-names>Y. D.</given-names>
</name>
<name>
<surname>van&#x2019;t Veer</surname>
<given-names>L. J.</given-names>
</name>
<name>
<surname>Dai</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Hart</surname>
<given-names>A. A.</given-names>
</name>
<name>
<surname>Voskuil</surname>
<given-names>D. W.</given-names>
</name>
<etal/>
</person-group> (<year>2002</year>). <article-title>A gene-expression signature as a predictor of survival in breast cancer</article-title>. <source>N. Engl. J. Med.</source> <volume>347</volume>, <fpage>1999</fpage>&#x2013;<lpage>2009</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMoa021967</pub-id>
</citation>
</ref>
<ref id="B91">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Xue</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Qi</surname>
<given-names>Y.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>RGS16 regulated by let-7c-5p promotes glioma progression by activating PI3K-AKT pathway</article-title>. <source>Front. Med.</source> <volume>17</volume>, <fpage>143</fpage>&#x2013;<lpage>155</lpage>. <pub-id pub-id-type="doi">10.1007/s11684-022-0929-y</pub-id>
</citation>
</ref>
<ref id="B92">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>C. H.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>L. K.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>C. C.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Shyu</surname>
<given-names>R. Y.</given-names>
</name>
<name>
<surname>Tsai</surname>
<given-names>F. M.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Cathepsin V mediates the tazarotene-induced gene 1-induced reduction in invasion in colorectal cancer cells</article-title>. <source>Cell. Biochem. biophysics</source> <volume>78</volume>, <fpage>483</fpage>&#x2013;<lpage>494</lpage>. <pub-id pub-id-type="doi">10.1007/s12013-020-00940-3</pub-id>
</citation>
</ref>
<ref id="B93">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Qin</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Xue</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Circulating microRNAs as potential cancer biomarkers: The advantage and disadvantage</article-title>. <source>Clin. Epigenet</source> <volume>10</volume>, <fpage>59</fpage>. <pub-id pub-id-type="doi">10.1186/s13148-018-0492-1</pub-id>
</citation>
</ref>
<ref id="B94">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Guan</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>Z.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Predicting gastric cancer outcome from resected lymph node histopathology images using deep learning</article-title>. <source>Nat. Commun.</source> <volume>12</volume>, <fpage>1637</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-021-21674-7</pub-id>
</citation>
</ref>
<ref id="B95">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Wen</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Yifeng</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Xiaodan</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Lifei</surname>
<given-names>Z.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Secreted cathepsin V promoted metastasis of lung cancer by modulating adhesion molecules</article-title>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://ssrn.com/abstract=3974544">https://ssrn.com/abstract&#x3d;3974544</ext-link>
</comment>.</citation>
</ref>
<ref id="B96">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wenric</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Ruhollah</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Using supervised learning methods for gene selection in RNA-seq case-control studies</article-title>. <source>Front. Genet.</source> <volume>9</volume>, <fpage>297</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2018.00297</pub-id>
</citation>
</ref>
<ref id="B97">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Law</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Whipple</surname>
<given-names>M. E.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>A bayesian network model of head and neck squamous cell carcinoma incorporating gene expression profiles</article-title>. <source>Stud. Health Technol. Inf.</source> <volume>245</volume>, <fpage>634</fpage>&#x2013;<lpage>638</lpage>.</citation>
</ref>
<ref id="B98">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>H Ni</surname>
<given-names>D. Y.</given-names>
</name>
<name>
<surname>Niu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>F.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Driver and novel genes correlated with metastasis of non-small cell lung cancer: A comprehensive analysis</article-title>. <source>Pathology - Res. Pract.</source> <volume>224</volume>, <fpage>153551</fpage>. <pub-id pub-id-type="doi">10.1016/j.prp.2021.153551</pub-id>
</citation>
</ref>
<ref id="B99">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Jin</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Min</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Lulu</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Hui</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Guixian</surname>
<given-names>L.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Chloride channel-3 promotes tumor metastasis by regulating membrane ruffling and is associated with poor survival</article-title>. <source>Oncotarget</source> <volume>6</volume>, <fpage>2434</fpage>&#x2013;<lpage>2450</lpage>. <pub-id pub-id-type="doi">10.18632/oncotarget.2966</pub-id>
</citation>
</ref>
<ref id="B100">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Ding</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Cao</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Qiu</surname>
<given-names>D.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>A pan-cancer analysis of cystatin E/M reveals its dual functional effects and positive regulation of epithelial cell in human tumors</article-title>. <source>Front. Genet.</source> <volume>12</volume>, <fpage>733211</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2021.733211</pub-id>
</citation>
</ref>
<ref id="B101">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Gu</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Xia</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>He</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>F.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Distant metastasis without regional progression in non-muscle invasive bladder cancer: Case report and pooled analysis of literature</article-title>. <source>World J. Surg. Oncol.</source> <volume>20</volume> (<issue>1</issue>), <fpage>226</fpage>. <pub-id pub-id-type="doi">10.1186/s12957-022-02664-5</pub-id>
</citation>
</ref>
<ref id="B102">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xue</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Qian</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Gu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>Z.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Identification of key mRNAs as prediction models for early metastasis of pancreatic cancer based on LASSO</article-title>. <source>Front. Bioeng. Biotechnol.</source> <volume>9</volume>, <fpage>701039</fpage>. <pub-id pub-id-type="doi">10.3389/fbioe.2021.701039</pub-id>
</citation>
</ref>
<ref id="B103">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xue</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Deng</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Mao</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>L.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>TM4SF1 promotes the self-renewal of esophageal cancer stem-like cells and is regulated by miR-141</article-title>. <source>Oncotarget</source> <volume>8</volume>, <fpage>19274</fpage>&#x2013;<lpage>19284</lpage>. <pub-id pub-id-type="doi">10.18632/oncotarget.13866</pub-id>
</citation>
</ref>
<ref id="B104">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wei</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Xin&#x27;an</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Yan</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Yongzhang</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>High expression of small GTPase Rab3D promotes cancer progression and metastasis</article-title>. <source>Oncotarget</source> <volume>6</volume>, <fpage>11125</fpage>&#x2013;<lpage>11138</lpage>. <pub-id pub-id-type="doi">10.18632/oncotarget.3575</pub-id>
</citation>
</ref>
<ref id="B105">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Zeng</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Deng</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Qiu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Yao</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Liao</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Glycosylated cathepsin V serves as a prognostic marker in lung cancer</article-title>. <source>Front. Oncol.</source> <volume>12</volume>, <fpage>876245</fpage>. <pub-id pub-id-type="doi">10.3389/fonc.2022.876245</pub-id>
</citation>
</ref>
<ref id="B106">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Lv</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Tong</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Liang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Xi</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Ji</surname>
<given-names>B.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Icsda: A multi-modal deep learning model to predict breast cancer recurrence and metastasis risk by integrating pathological, clinical and gene expression data</article-title>. <source>Brief. Bioinform</source> <volume>23</volume> (<issue>6</issue>), <fpage>bbac448</fpage>. <pub-id pub-id-type="doi">10.1093/bib/bbac448</pub-id>
</citation>
</ref>
<ref id="B107">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yuan</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Zou</surname>
<given-names>Q.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Prediction of tumor metastasis from sequencing data in the era of genome sequencing</article-title>. <source>Briefings Funct. Genomics</source> <volume>18</volume> (<issue>6</issue>), <fpage>412</fpage>&#x2013;<lpage>418</lpage>. <pub-id pub-id-type="doi">10.1093/bfgp/elz010</pub-id>
</citation>
</ref>
<ref id="B108">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Yao</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Pu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>He</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Effect of F11R gene knockdown on malignant biological behaviors of pancreatic cancer cells</article-title>. <source>J. Oncol.</source> <volume>2022</volume>, <fpage>1</fpage>&#x2013;<lpage>8</lpage>. <pub-id pub-id-type="doi">10.1155/2022/3379027</pub-id>
</citation>
</ref>
<ref id="B111">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>W. P.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Tan</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Xing</surname>
<given-names>C. G.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Cystatin 2 leads to a worse prognosis in patients with gastric cancer</article-title>. <source>J. Biol. Regul. Homeost. Agents</source> <volume>34</volume> (<issue>6</issue>), <fpage>2059</fpage>&#x2013;<lpage>2067</lpage>. <pub-id pub-id-type="doi">10.23812/20-293-A</pub-id>
</citation>
</ref>
<ref id="B112">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2020</year>). &#x201c;<article-title>Predicting lymph node metastasis using histopathological images based on multiple instance learning with deep graph convolution</article-title>,&#x201d; in <conf-name>2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)</conf-name>, <conf-loc>Seattle, WA, USA</conf-loc>, <conf-date>13-19 June 2020</conf-date> (<publisher-name>IEEE</publisher-name>), <fpage>4836</fpage>&#x2013;<lpage>4845</lpage>. <pub-id pub-id-type="doi">10.1109/CVPR42600.2020.00489</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>