<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" dtd-version="2.3" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fgene.2021.702259</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Cascade Deep Forest With Heterogeneous Similarity Measures for Drug&#x2013;Target Interaction Prediction</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Zheng</surname> <given-names>Ying</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Wu</surname> <given-names>Zheng</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/1321646/overview"/>
</contrib>
</contrib-group>
<aff><institution>School of Computer &#x0026; Communication Engineering, Changsha University of Science &#x0026; Technology</institution>, <addr-line>Changsha</addr-line>, <country>China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Wei Lan, Guangxi University, China</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Yijie Ding, Suzhou University of Science and Technology, China; Shichao Liu, Huazhong Agricultural University, China</p></fn>
<corresp id="c001">&#x002A;Correspondence: Ying Zheng, <email>zhengying@csust.edu.cn</email></corresp>
<fn fn-type="other" id="fn004"><p>This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>24</day>
<month>08</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>12</volume>
<elocation-id>702259</elocation-id>
<history>
<date date-type="received">
<day>29</day>
<month>04</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>24</day>
<month>05</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2021 Zheng and Wu.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Zheng and Wu</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Drug repositioning is a method of systematically identifying potential molecular targets that known drugs may act on. Compared with traditional methods, drug repositioning has been extensively studied due to the development of multi-omics technology and system biology methods. Because of its biological network properties, it is possible to apply machine learning related algorithms for prediction. Based on various heterogeneous network model, this paper proposes a method named THNCDF for predicting drug&#x2013;target interactions. Various heterogeneous networks are integrated to build a tripartite network, and similarity calculation methods are used to obtain similarity matrix. Then, the cascade deep forest method is used to make prediction. Results indicate that THNCDF outperforms the previously reported methods based on the 10-fold cross-validation on the benchmark data sets proposed by Y. Yamanishi. The area under Precision Recall curve (AUPR) value on the Enzyme, GPCR, Ion Channel, and Nuclear Receptor data sets is 0.988, 0.980, 0.938, and 0.906 separately. The experimental results well illustrate the feasibility of this method.</p>
</abstract>
<kwd-group>
<kwd>drug repositioning</kwd>
<kwd>drug discovery</kwd>
<kwd>drug&#x2013;target interaction</kwd>
<kwd>heterogeneous similarity measures</kwd>
<kwd>cascade deep forest</kwd>
</kwd-group>
<counts>
<fig-count count="3"/>
<table-count count="3"/>
<equation-count count="5"/>
<ref-count count="37"/>
<page-count count="9"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1">
<title>Introduction</title>
<p>In the past few decades, investment in drug research and development has grown rapidly, but most drugs have failed in the first phase of clinical trials. Moreover, it normally costs billions of dollars and consumes 10 years for any drug to be put on the market completely (<xref ref-type="bibr" rid="B23">Roessler et al., 2021</xref>). At present, drug repositioning has a wide prospect and provides evidence for further drug discovery, whose purpose is to determine potential therapeutic targets for existing drugs, thereby saving time and minimizing risks of conventional drug development (<xref ref-type="bibr" rid="B24">Stein et al., 2021</xref>).</p>
<p>The key of drug repositioning hinges on identifying drug&#x2013;target interaction (DTI), which exerts a vital role in drug research and development (<xref ref-type="bibr" rid="B2">Badkas et al., 2020</xref>). Currently, traditional experimental approaches are either time consuming or high costly. Despite that potential drug indications can be directly detected by target or cell screening of thousands of drugs in synthetic databases, there are still hurdles to massively relocate drugs owing to the needs of collecting existing drugs, specialized equipment, and screening tests (<xref ref-type="bibr" rid="B26">Turanli et al., 2018</xref>).</p>
<p>In general, the traditional methods for calculating drug target interactions mainly consist of ligand method and structure method (<xref ref-type="bibr" rid="B11">Huang et al., 2020</xref>; <xref ref-type="bibr" rid="B29">Yang et al., 2020</xref>; <xref ref-type="bibr" rid="B32">Zhang et al., 2020</xref>). The ligand-based methods predict potential DTI <italic>via</italic> contrasting candidate ligands with known ligands capable of binding to them, but it does not perform well in the absence of ligand information for potential targets (<xref ref-type="bibr" rid="B13">Ju&#x00E1;rez-Saldivar et al., 2020</xref>). The structure-based method mainly uses the docking simulation technology to predict the potential DTI on the basis of known three-dimensional structure. In the same way, this method that relies on simulated docking&#x2019;s reliability often consumes a plenty of time and requires all drugs and targets to provide accurate and reliable three-dimensional structure (<xref ref-type="bibr" rid="B27">Vivarelli et al., 2020</xref>).</p>
<p>Along with sustainable innovative developments of biological data, and high-speed improvements of machine learning technology in recent years, a variety of methods for computational drug repositioning have been put forward correspondingly and achieved some achievements in practical applications (<xref ref-type="bibr" rid="B15">Lan et al., 2016</xref>, <xref ref-type="bibr" rid="B14">2020</xref>; <xref ref-type="bibr" rid="B5">Chen et al., 2021</xref>; <xref ref-type="bibr" rid="B16">Li et al., 2019</xref>; <xref ref-type="bibr" rid="B17">Liu et al., 2019</xref>; <xref ref-type="bibr" rid="B31">Zeng et al., 2019</xref>; <xref ref-type="bibr" rid="B9">Fahimian et al., 2020</xref>; <xref ref-type="bibr" rid="B21">Rauschenbach et al., 2020</xref>; <xref ref-type="bibr" rid="B34">Zhou et al., 2020</xref>; <xref ref-type="bibr" rid="B12">Jarada et al., 2021</xref>; <xref ref-type="bibr" rid="B20">Meng et al., 2021</xref>). Machine learning is a beneficial complement to ligand-based and structure-based methods. It has been widely developed and applied as an effective method for pinpointing drug&#x2013;targets as well as predicting drug-diseases. Machine learning is able to systematically integrate biological databases, with the purpose of predicting potential DTI and drug&#x2013;disease interactions.</p>
<p>The method of similarity constrained probabilistic matrix factorization (SCPMF) is used for drug repositioning through recognizing novel drug&#x2013;virus coactions (<xref ref-type="bibr" rid="B20">Meng et al., 2021</xref>). Moreover, SCPMF innovatively reconstructs the drug&#x2013;virus interaction matrix, by dexterously projecting the drug&#x2013;virus interaction matrix into two potential feature matrices for viruses and drugs. A new framework named Similarity Network Fusion and Neural Networks (SNF-NN) on the basis of deep learning was proposed and elaborated, which predicts new drug&#x2013;disease interactions though using similarity selection relevant to drugs and diseases, similarity network fusion, and a novel neural network model with superior tuning (<xref ref-type="bibr" rid="B12">Jarada et al., 2021</xref>). By comparison of the performance of SNF-NN with that of nine benchmark machine learning methods, the robustness of SNF-NN is calculated. The values of AUC and AUPR are 0.867 and 0.876, respectively. Besides, a previous study has shown that a method based on network called RepCOOL is utilized for drug repositioning (<xref ref-type="bibr" rid="B9">Fahimian et al., 2020</xref>). The eventual model of drug repositioning is constructed on account of a random forest classifier. RepCOOL recommends four novel drugs for the treatment of breast cancer at stage II, namely, paclitaxel, doxorubicin, tamoxifen, and trastuzumab. In addition, a network embedding based method for predicting drug&#x2013;disease interactions (NEDD) is raised (<xref ref-type="bibr" rid="B34">Zhou et al., 2020</xref>). Initially, through constructing a heterogeneous network and utilizing meta-paths of various lengths, NEDD accurately obtains the indirect associations between drugs and diseases or their strong proximity, thereby acquiring representation vectors of drugs and diseases with low dimensions. NEDD estimates novel relationships between diseases and drugs by utilizing a random forest classifier. A recent study has reported that a network-based method about deep learning for drug repositioning (deepDR) recognizes advanced characteristics of drugs from heterogeneous networks through a multi-mode autoencoder. Then, through a variational autoencoder, the obtained low-dimensional representation of the drug as well as clinically reported drug&#x2013;disease pairs are uniformly encoded and decoded to infer candidates for approved drugs that were actually without initial approval (<xref ref-type="bibr" rid="B31">Zeng et al., 2019</xref>).</p>
<p>The main contributions of this paper are summarized as follows:</p>
<p>We study various calculation methods based on the tripartite heterogeneous network, and finally adopt the Gaussian kernel between each layer, and the Tanimoto&#x2019;s coefficient is used in the drug layer to calculate the chemical structure similarity matrix. Besides, the similarity matrix is fitted by all matrices;</p>
<p>We improve and adjust the parameters according to the gcForest (<xref ref-type="bibr" rid="B35">Zhou and Feng, 2019</xref>) method. We use 10-fold cross-validation to check the final prediction (termed THNCDF, Tripartite Heterogeneous Network Cascade Deep Forest).</p>
<p>We compare the results of THNCDF with four types of methods (<xref ref-type="bibr" rid="B4">Cao et al., 2014</xref>; <xref ref-type="bibr" rid="B10">Hao et al., 2016</xref>; <xref ref-type="bibr" rid="B22">Rayhan et al., 2017</xref>; <xref ref-type="bibr" rid="B25">Thafar et al., 2020</xref>). The experimental results show that the THNCDF method has good performance, and the area under Precision Recall curve (AUPR) values on the four benchmark data sets reach 0.988, 0.980, 0.938, and 0.906.</p>
<p>The rest of this paper is organized as follows. In Section 2, we introduce the data sets used for similarity measurement, and then we present the general framework and cascade deep forest methods with details in Section 3. In Section 4, the performance of our proposed THNCDF method is evaluated through extensive experiments. At the end, some discussions are provided in Section 5.</p>
</sec>
<sec id="S2">
<title>Related Work</title>
<sec id="S2.SS1">
<title>Data Sets</title>
<p>In our experiments, we use the data sets listed in <xref ref-type="table" rid="T1">Table 1</xref> to build a tripartite heterogeneous network model. <xref ref-type="table" rid="T1">Table 1</xref> shows the exactly biologic data sets we used during the experiments (<xref ref-type="bibr" rid="B28">Yamanishi et al., 2008</xref>; <xref ref-type="bibr" rid="B33">Zheng and Wu, 2021</xref>). Especially, the main resource of the data set for the disease layer is from DisGeNET. This paper also uses a data set called DisGeNET approved, which contains FDA-approved drugs and their corresponding protein targets in the DisGeNET.</p>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>Sources and verification of databases.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Resource</td>
<td valign="top" align="left">Description</td>
<td valign="top" align="left">Url</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">DrugBank</td>
<td valign="top" align="left">Free accessible drug database</td>
<td valign="top" align="left"><ext-link ext-link-type="uri" xlink:href="http://www.drugbank.ca/">www.drugbank.ca/</ext-link></td>
</tr>
<tr>
<td valign="top" align="left">DisGeNET</td>
<td valign="top" align="left">Free accessible human disease database</td>
<td valign="top" align="left"><ext-link ext-link-type="uri" xlink:href="http://www.disgenet.org/">www.disgenet.org/</ext-link></td>
</tr>
<tr>
<td valign="top" align="left">ChEMBL</td>
<td valign="top" align="left">Free accessible drug and target database</td>
<td valign="top" align="left"><ext-link ext-link-type="uri" xlink:href="http://www.ebi.ac.uk/chembl/">www.ebi.ac.uk/chembl/</ext-link></td>
</tr>
<tr>
<td valign="top" align="left">Kegg</td>
<td valign="top" align="left">Free accessible database for molecular-level information</td>
<td valign="top" align="left"><ext-link ext-link-type="uri" xlink:href="http://www.kegg.jp/">www.kegg.jp/</ext-link></td>
</tr>
<tr>
<td valign="top" align="left">Uniprot</td>
<td valign="top" align="left">Free accessible protein sequence and annotation database</td>
<td valign="top" align="left"><ext-link ext-link-type="uri" xlink:href="http://www.uniprot.org">www.uniprot.org</ext-link></td>
</tr>
<tr>
<td valign="top" align="left">OMIM</td>
<td valign="top" align="left">Free accessible compendium for Mendelian disorder</td>
<td valign="top" align="left"><ext-link ext-link-type="uri" xlink:href="http://www.omim.org/">www.omim.org/</ext-link></td>
</tr>
</tbody>
</table></table-wrap>
<p>We will evaluate the performance of THNCDF on benchmark data sets. The benchmark data sets used in many DTI predictions were originally proposed by Y. Yamanishi, which have been considered as the golden data sets for comparing various DTI prediction methods. The benchmark data sets are listed in <xref ref-type="table" rid="T2">Table 2</xref>, which are downloaded from <ext-link ext-link-type="uri" xlink:href="http://web.kuicr.kyotou.ac.jp/supp/yoshi/drugtarget/">http://web.kuicr.kyotou.ac.jp/supp/yoshi/drugtarget/</ext-link>. The data sets include four subsets grouped by target classification: Enzyme, ion channel, GPCR (G protein-coupled receptor), and nuclear receptor. The largest subset, Enzyme, includes 445 drugs and 664 targets with 2,926 known DTI between them. Another NR, the smallest subset includes only 54 drugs and 26 targets with 90 known interactions. The other two subsets, IC and GPCR, consist of 210 and 223 drugs, 204 and 95 targets, and 1,476 and 635 known interactions, respectively.</p>
<table-wrap position="float" id="T2">
<label>TABLE 2</label>
<caption><p>Benchmark data sets.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Data sets</td>
<td valign="top" align="center">Drugs</td>
<td valign="top" align="center">Targets</td>
<td valign="top" align="center">n<sub><italic>d</italic></sub>/n<sub><italic>t</italic></sub></td>
<td valign="top" align="center">Interactions</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Enzyme</td>
<td valign="top" align="center">445</td>
<td valign="top" align="center">664</td>
<td valign="top" align="center">0.667</td>
<td valign="top" align="center">2,926</td>
</tr>
<tr>
<td valign="top" align="left">Ion channel</td>
<td valign="top" align="center">210</td>
<td valign="top" align="center">204</td>
<td valign="top" align="center">1.03</td>
<td valign="top" align="center">1,476</td>
</tr>
<tr>
<td valign="top" align="left">GPCR</td>
<td valign="top" align="center">223</td>
<td valign="top" align="center">95</td>
<td valign="top" align="center">2.35</td>
<td valign="top" align="center">635</td>
</tr>
<tr>
<td valign="top" align="left">Nuclear receptor</td>
<td valign="top" align="center">54</td>
<td valign="top" align="center">26</td>
<td valign="top" align="center">2.08</td>
<td valign="top" align="center">90</td>
</tr>
</tbody>
</table></table-wrap>
</sec>
<sec id="S2.SS2">
<title>Tripartite Heterogeneous Network</title>
<p>Based on the related ideas of pharmacology, the therapeutic effect of a single drug is relatively limited for diseases that are complex multiple pathological (<xref ref-type="bibr" rid="B30">Zamami et al., 2017</xref>; <xref ref-type="bibr" rid="B36">Zhu et al., 2020</xref>). Recently, the development of high-throughput biotechnology has produced a large amount of data. However, one of the main difficulties is how to collect and analyze the required biomedical data because they are heterogeneous and the data generated from different experiments include different types of information, such as nucleotide sequences and protein&#x2013;protein interactions (<xref ref-type="bibr" rid="B19">Luo et al., 2020</xref>).</p>
<p>In this paper, we integrate the composition of many different heterogeneous networks and construct our novel tripartite heterogeneous network model according to different types of data. <xref ref-type="fig" rid="F1">Figure 1A</xref> is the part of visualization of the Enzyme in benchmark data sets, in which the red nodes are drugs and the green nodes are targets. <xref ref-type="fig" rid="F1">Figure 1B</xref> is the bipartite graph model of a part of <xref ref-type="fig" rid="F1">Figure 1A</xref>; the red nodes are drugs, and the green nodes are targets in the same.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>An example of bipartite graph for drug&#x2013;target interactions. <bold>(A)</bold> Is the part of visualization of the Enzyme in benchmark data sets, in which the red nodes are drugs and the green nodes are targets. <bold>(B)</bold> Is the bipartite graph model of a part of <xref ref-type="fig" rid="F1">Figure 1A</xref>.</p></caption>
<graphic xlink:href="fgene-12-702259-g001.tif"/>
</fig>
<p>We construct a tripartite network that includes three layers: drugs, targets, and diseases. Correspondingly, two types of interactions, drug&#x2013;target interactions and target&#x2013;disease interactions, are interpreted as edges to connect nodes in these layers. We mainly focus on constructing the similarity matrix and feature information of the tripartite heterogeneous network.</p>
</sec>
</sec>
<sec id="S3">
<title>Materials and Methods</title>
<p>In this study, we propose THNCDF, a new computational approach for molecular target identification from known drug&#x2013;target centered DTI prediction. It utilizes low-dimensional but informative matrix representations of features for both drugs and targets through a cascade deep forest classifier in prediction of DTI (<xref ref-type="bibr" rid="B33">Zheng and Wu, 2021</xref>).</p>
<p>As shown in <xref ref-type="fig" rid="F2">Figure 2</xref>, THNCDF mainly includes three steps: (1) Data integration and complete heterogeneous network is obtained, which contains diverse cheminformatics and bioinformatics profiles; (2) Similarity matrix calculation and parameter setting; (3) Application of cascade deep forest classifier and verification of the results.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>The overview of the proposed work THNCDF.</p></caption>
<graphic xlink:href="fgene-12-702259-g002.tif"/>
</fig>
<sec id="S3.SS1">
<title>Similarity of Medicinal Chemical Structures</title>
<p>To ensure that the features in the network model are distinguishable, the similarity of the medicinal chemical structure is a relatively objective feature (<xref ref-type="bibr" rid="B33">Zheng and Wu, 2021</xref>). In particular, the chemical structure of various drugs under the same standard can be obtained through the simplified molecular-input line-entry system (SIMILES), and then converted into 166-bits string of a certain length fingerprint. Thus, each fingerprint represents a unique drug. Through the calculation of Tanimoto&#x2019;s coefficient, the similarity matrix of medicinal chemical structure among all drugs is obtained. The formula for calculating Tanimoto&#x2019;s coefficient is shown in Equation (1).</p>
<disp-formula id="S3.E1">
<label>(1)</label>
<mml:math id="M1">
<mml:mrow>
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>I</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mpadded width="+3.3pt">
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>h</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>e</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>m</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mpadded>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mpadded width="+5pt">
<mml:mfrac>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>&#x00D7;</mml:mo>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>y</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>y</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
</mml:mrow>
<mml:mo>-</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>&#x00D7;</mml:mo>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>y</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mfrac>
</mml:mpadded>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <italic>f</italic><sub>(dx)</sub> is the binary chemical fingerprint of drug <italic>x</italic>. According to Equation (1), a matrix of chemical structure similarity is constructed.</p>
</sec>
<sec id="S3.SS2">
<title>Gaussian Kernel Similarity</title>
<p>The Gaussian kernel is defined as the unimodal of the Euclidean distance between any two points in the network (<xref ref-type="bibr" rid="B33">Zheng and Wu, 2021</xref>). In THNCDF method, the Gaussian kernel is mainly used to calculate the feature of the connection between two layers, like the edge between the drug layer and the target layer or between the target layer and the disease layer. Also, for drug&#x2013;drug interactions and target&#x2013;target interactions, the Gaussian Kernel can calculate the edges in the same layer. Therefore, the calculation formula is commonly used to construct various types of matrices, such as the drug&#x2013;drug interactions similarity matrix, target&#x2013;target interactions similarity matrix, and target&#x2013;disease interactions similarity matrix. The calculation formula is as follows:</p>
<disp-formula id="S3.E2">
<label>(2)</label>
<mml:math id="M2">
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>K</mml:mi>
<mml:mrow>
<mml:mrow>
<mml:mi>G</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>I</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mi>d</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo rspace="5.8pt" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>p</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>&#x03B3;</mml:mi>
<mml:mi>d</mml:mi>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo fence="true">||</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>-</mml:mo>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mrow>
<mml:mo fence="true">||</mml:mo>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula id="S3.E3">
<label>(3)</label>
<mml:math id="M3">
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:msub>
<mml:mi>&#x03B3;</mml:mi>
<mml:mi>d</mml:mi>
</mml:msub>
</mml:mpadded>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x03B3;</mml:mi>
<mml:mi>d</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msubsup>
<mml:mo>/</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>m</mml:mi>
</mml:mfrac>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:munderover>
<mml:mo largeop="true" movablelimits="false" symmetric="true">&#x2211;</mml:mo>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi>i</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>m</mml:mi>
</mml:munderover>
<mml:msup>
<mml:mrow>
<mml:mo fence="true">||</mml:mo>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo fence="true">||</mml:mo>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <italic>D<sub>i</sub></italic> is defined as the <italic>i-th</italic> drug in the drug set, <italic>T<sub>i</sub></italic> represents the <italic>i-th</italic> target in the target set, while <italic>ts</italic><sub><italic>i</italic></sub> represents the <italic>i-th</italic> target in the target&#x2013;disease interactions set. <italic>m</italic> is the size of drug set, while <italic>n</italic> and <italic>k</italic> represent the size of target set and the size of target&#x2013;disease interactions set, respectively. The adjacency matrix <italic>Y</italic> &#x2208; <italic>mn</italic> represents the known drug&#x2013;target interactions. If the drug and the target have an existing interaction, the value is 1; otherwise, the value is 0. <italic>yd</italic><sub><italic>i</italic></sub>{<italic>y</italic><sub><italic>i</italic>1</sub>,<italic>y</italic><sub><italic>i</italic>2</sub>,&#x2026;,<italic>y</italic><sub><italic>in</italic></sub>} is defined as the correlation vector between the drug <italic>d<sub>i</sub></italic> and all targets; meanwhile, <italic>yts</italic><sub><italic>i</italic></sub>{<italic>y</italic><sub><italic>i</italic>1</sub>,<italic>y</italic><sub><italic>i</italic>2</sub>,&#x2026;,<italic>y</italic><sub><italic>in</italic></sub>} is defined as the correlation vector between the target <italic>ts</italic><sub><italic>i</italic></sub> and all diseases. &#x03B3;<sub><italic>d</italic></sub>, &#x03B3;<sub><italic>t</italic></sub>, and &#x03B3;<sub><italic>ts</italic></sub> are adjustment parameters that control the width of the kernel, where<inline-formula><mml:math id="INEQ9"><mml:msubsup><mml:mi mathvariant="normal">&#x03B3;</mml:mi><mml:mi>d</mml:mi><mml:mo>&#x2032;</mml:mo></mml:msubsup></mml:math></inline-formula>, <inline-formula><mml:math id="INEQ10"><mml:msubsup><mml:mi mathvariant="normal">&#x03B3;</mml:mi><mml:mi>t</mml:mi><mml:mo>&#x2032;</mml:mo></mml:msubsup></mml:math></inline-formula>, and <inline-formula><mml:math id="INEQ11"><mml:msubsup><mml:mi mathvariant="normal">&#x03B3;</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>&#x2062;</mml:mo><mml:mi>s</mml:mi></mml:mrow><mml:mo>&#x2032;</mml:mo></mml:msubsup></mml:math></inline-formula> are set to 1 by using Gaussian kernels.</p>
</sec>
<sec id="S3.SS3">
<title>Similarity Matrix Fusion</title>
<p>According to the above multiple similarity matrices, we construct a kernel containing the spatial information of drugs and targets (<xref ref-type="bibr" rid="B6">Ding et al., 2018</xref>, <xref ref-type="bibr" rid="B8">2020a</xref>,<xref ref-type="bibr" rid="B7">b</xref>; <xref ref-type="bibr" rid="B33">Zheng and Wu, 2021</xref>). Since the similarity matrix is not a positive definite matrix, predictions are ultimately required. We linearly fit the similarity matrix of drug chemical structure, the drug Gaussian kernel, the target Gaussian kernel, and the disease Gaussian kernel. We also set the weighted factors in the following equations empirically.</p>
<disp-formula id="S3.E4">
<label>(4)</label>
<mml:math id="M4">
<mml:mrow>
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>I</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>u</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>g</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mi>y</mml:mi>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>-</mml:mo>
<mml:mi>&#x03B1;</mml:mi>
</mml:mrow>
<mml:mo rspace="0.8pt" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo rspace="0.8pt">&#x00D7;</mml:mo>
<mml:msub>
<mml:mi>K</mml:mi>
<mml:mrow>
<mml:mrow>
<mml:mi>G</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>I</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mi>d</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mi>y</mml:mi>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mi>&#x03B1;</mml:mi>
<mml:mo>&#x00D7;</mml:mo>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>I</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>h</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>e</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>m</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mi>y</mml:mi>
</mml:msub>
<mml:mo rspace="7.5pt" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula id="S3.E5">
<label>(5)</label>
<mml:math id="M5">
<mml:mrow>
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>I</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>a</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>y</mml:mi>
</mml:msub>
<mml:mo rspace="5.8pt" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>-</mml:mo>
<mml:mi>&#x03B1;</mml:mi>
</mml:mrow>
<mml:mo rspace="5.8pt" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo rspace="5.8pt">&#x00D7;</mml:mo>
<mml:msub>
<mml:mi>K</mml:mi>
<mml:mrow>
<mml:mrow>
<mml:mi>G</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>I</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>y</mml:mi>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi>&#x03B1;</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">&#x00D7;</mml:mo>
<mml:msub>
<mml:mi>K</mml:mi>
<mml:mrow>
<mml:mrow>
<mml:mi>G</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>I</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mi>y</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo rspace="7.5pt" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>The result of similarity matrices is used as the original input of the next step. In the latter experiments, in order to balance the constructed similarity matrix, the ratio of 0.5:0.5 is used with parameter setting.</p>
</sec>
<sec id="S3.SS4">
<title>Cascade Deep Forest</title>
<p>Random forest, developed by Bermain and Culter (<xref ref-type="bibr" rid="B3">Breiman, 2001</xref>), is widely used due to its excellent stability and resistance to overfitting. Nowadays, random forest has been successfully applied to the analysis of multiple biological and pharmacological contexts, such as Diabetic Retinopathy screening procedure (<xref ref-type="bibr" rid="B1">Alabdulwahhab et al., 2021</xref>) and detection of copy number variations for uncovering genetic factors (<xref ref-type="bibr" rid="B37">Zhuang et al., 2020</xref>). But in novel review by Zhou et al., deep learning based on non-differentiable modules exhibits the possibility of constructing deep models without using backpropagation. They have proposed the gcForest approach, which has generated three characteristics: layer-by-layer processing, in-model feature transformation, and sufficient model complexity. It provides an alternative methods to deep neural networks (DNNs) to learn hyper-level representations at a low computational cost. gcForest is a novel decision tree ensemble, with a cascade structure. It has much fewer hyper-parameters than DNNs, which the training process does not rely on backpropagation. In fact, the most important value of gcForest approach is it may open a door for non-NN style deep learning, or deep models based on non-differentiable modules. An extended depiction and the study of the theory on random forest or gcForest can be referred to the Web site of Bremain or the paper of Zhou et al.</p>
<p>Based on the advantage of random forest and characteristics of gcForest, we construct the THNCDF method, which includes the similarity matrices described above and utilizes improved gcForest approach for prediction. First, the fusion similarity matrix is the origin input for cascade structure of deep forest. Each level of cascade receives the feature information processed by its previous level and outputs its processing result to the next level. All level is an ensemble of decision tree forest. For example, each forest will count the percentages of different classes of training examples at the leaf node, and then average all trees in the same forest to obtain an estimate of the class distribution.</p>
<p>Secondly, we use three random forests: (a) two completely random tree forests, (b) two gradient boosting tree forests, and (c) two extra randomized tree forests. Each forest contains 1,000 trees, and there are 6,000 trees in total. Each node selects a feature randomly as the judgment condition and generates leaf nodes according to the condition. Stop until each leaf node contains only instances of the same class.</p>
<p>To compare with other results, we use 10-fold cross validation (<xref ref-type="bibr" rid="B18">Liu et al., 2016</xref>). It means that class vectors produced by each forest are generated by 10-fold cross validation to reduce the risk of overfitting. Finally, if there is no significant performance gain, the training process will terminate. The number of cascade levels is automatically determined.</p>
</sec>
</sec>
<sec id="S4">
<title>Experimental Results and Analysis</title>
<sec id="S4.SS1">
<title>Baseline Methods</title>
<p>In order to evaluate the performance of our method, we mainly introduce DTI prediction results compared with baseline methods on the benchmark data sets that are proposed by Y. Yamanishi. The following are the state-of-the-art methods made in comparison with the same standard criteria:</p>
<p>RLS-KF (<xref ref-type="bibr" rid="B10">Hao et al., 2016</xref>): A regularized least squares combining with nonlinear kernel fusion method is developed.</p>
<p>RF (<xref ref-type="bibr" rid="B4">Cao et al., 2014</xref>): A computational method integrated the information from network, chemical, and biological properties. This method is developed based on the random forest combining with integrated features.</p>
<p>DTiGEMS (<xref ref-type="bibr" rid="B25">Thafar et al., 2020</xref>): A computational method using graph embedding, graph mining, and similarity properties techniques. DTiGEMS firstly applies a similarity selection procedure and a similarity fusion algorithm. Then, it integrates multiple drug&#x2013;drug similarities and target&#x2013;target similarities into the final heterogeneous graph structure after.</p>
<p>iDTI-ESBoost (<xref ref-type="bibr" rid="B22">Rayhan et al., 2017</xref>): A prediction model uses evolutionary and structural features. The method uses a new data balancing and boosting technique to make prediction.</p>
</sec>
<sec id="S4.SS2">
<title>Evaluation Criteria</title>
<p>Two quality measures are commonly used to evaluate the performance of these methods: AUC and AUPR. Specifically, we calculate the receiver operating characteristic curve (ROC) of true positive as a function of false positive, and use the area under the ROC curve (AUC) value as a quality measure. In addition, we also calculate the precision&#x2013;recall curve (P&#x2013;R), which is the chart of true positive rate between all positive predictions of each given recall rate. The area under the P&#x2013;R curve (AUPR) provides a quantitative assessment. These two kinds of quality measures have become the standard criteria for evaluating methods.</p>
</sec>
<sec id="S4.SS3">
<title>Prediction Ability</title>
<p>To provide a fair comparison of DTI prediction performances, we apply these methods on the same benchmark data sets. We also use 10-fold cross-validation random setting, the same evaluation criteria, and optimal parameters of each method.</p>
<p>From the results reported in <xref ref-type="table" rid="T3">Table 3</xref> and <xref ref-type="fig" rid="F3">Figure 3</xref>, THNCDF algorithm still maintains a high performance, especially for the AUPR values. For example, in the enzyme data set (<xref ref-type="fig" rid="F3">Figure 3A</xref>), the ion channel data set (<xref ref-type="fig" rid="F3">Figure 3B</xref>), and the GPCR data set (<xref ref-type="fig" rid="F3">Figure 3C</xref>), THNCDF outperforms all other methods by achieving the best performance for AUPR values. On the other hand, for the AUC values, THNCDF still maintains the high performance. It is well known that the training of DNN usually requires a large amount of training data; hence, its implementation on tasks with small-scale data is not suitable. This is the inherently unavoidable characteristic of the method we use. Thus, it is reflected in the correlation between the size of the benchmark data sets and the AUC values obtained.</p>
<table-wrap position="float" id="T3">
<label>TABLE 3</label>
<caption><p>The results of the baseline methods and the THNCDF method.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Data sets</td>
<td valign="top" align="center">Methods</td>
<td valign="top" align="center">RLS-KF</td>
<td valign="top" align="center">RF</td>
<td valign="top" align="center">DTiGEMS</td>
<td valign="top" align="center">iDTI-ESBoost</td>
<td valign="top" align="center">THNCDF</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Enzyme</td>
<td valign="top" align="center">AUC</td>
<td valign="top" align="center">0.990&#x002A;</td>
<td valign="top" align="center">0.978</td>
<td valign="top" align="center">0.990</td>
<td valign="top" align="center">0.960</td>
<td valign="top" align="center">0.987</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">AUPR</td>
<td valign="top" align="center">0.915</td>
<td valign="top" align="center">0.935</td>
<td valign="top" align="center">0.970</td>
<td valign="top" align="center">0.680</td>
<td valign="top" align="center">0.988&#x002A;</td>
</tr>
<tr>
<td valign="top" align="left">Ion channel</td>
<td valign="top" align="center">AUC</td>
<td valign="top" align="center">0.987</td>
<td valign="top" align="center">0.924</td>
<td valign="top" align="center">0.990&#x002A;</td>
<td valign="top" align="center">0.905</td>
<td valign="top" align="center">0.982</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">AUPR</td>
<td valign="top" align="center">0.901</td>
<td valign="top" align="center">0.948</td>
<td valign="top" align="center">0.960</td>
<td valign="top" align="center">0.480</td>
<td valign="top" align="center">0.980&#x002A;</td>
</tr>
<tr>
<td valign="top" align="left">GPCR</td>
<td valign="top" align="center">AUC</td>
<td valign="top" align="center">0.981</td>
<td valign="top" align="center">0.951</td>
<td valign="top" align="center">0.990&#x002A;</td>
<td valign="top" align="center">0.932</td>
<td valign="top" align="center">0.937</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">AUPR</td>
<td valign="top" align="center">0.806</td>
<td valign="top" align="center">0.896</td>
<td valign="top" align="center">0.860</td>
<td valign="top" align="center">0.480</td>
<td valign="top" align="center">0.938&#x002A;</td>
</tr>
<tr>
<td valign="top" align="left">Nuclear receptor</td>
<td valign="top" align="center">AUC</td>
<td valign="top" align="center">0.987</td>
<td valign="top" align="center">0.987</td>
<td valign="top" align="center">0.990&#x002A;</td>
<td valign="top" align="center">0.928</td>
<td valign="top" align="center">0.963</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">AUPR</td>
<td valign="top" align="center">0.911&#x002A;</td>
<td valign="top" align="center">0.847</td>
<td valign="top" align="center">0.880</td>
<td valign="top" align="center">0.790</td>
<td valign="top" align="center">0.906</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<attrib><italic>For each methods, <sup>&#x2217;</sup> indicates the highest value.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Comparison results for THNCDF and other methods in terms of AUC and AUPR values on the benchmark data sets. The best AUC values are indicated in red, and the best AUPR values are in purple.</p></caption>
<graphic xlink:href="fgene-12-702259-g003.tif"/>
</fig>
<p>In addition, the number of positive samples and negative samples in each data set is highly imbalanced. The fact that few positive samples make THNCDF cannot exert its advantages, which is based on a large amount of training data. For benchmark data set, the feature dimension used in this method is low, and cascade deep forest has great advantages in the representation learning of ultra-high-dimensional data.</p>
<p>As shown in <xref ref-type="fig" rid="F3">Figure 3</xref>, it is found that the prediction accuracy is approximately equal to each other. It also shows that the THNCDF preserves the best performance on all data sets so that it can be migrated to other predictions. The experiment procedure shows that THNCDF is not very sensitive to parameter settings. Therefore, it does not need large-scale parameter adjustment, especially the selection of the optimal combination of base classifiers. Comparing with DNN, THNCDF is more stable and easier.</p>
<p>It is worth mentioning that for the two commonly used evaluation metrics, more and more authors think that AUPR provides more informative assessment than AUC for highly imbalanced data sets. They argue in favor of AUPR values as a key standard of evaluating the performance for skewed data sets, especially the data sets with more negative samples than positive samples. In fact, all of the four subsets in the benchmark data sets possess the imbalanced characteristic, which means that the number of known drug&#x2013;target interactions is far less than the number of pairs with no interaction evidence. So a more sensitive AUPR metric is generally preferred for assessing the prediction results for those imbalanced datasets. From this perspective, the result clearly shows that THNCDF outperforms the prediction in terms of AUPR as well.</p>
</sec>
</sec>
<sec id="S5">
<title>Discussion</title>
<p>In this paper, we present a new multi-kernel computational approach combined with an improved cascade deep forest, which leads to good predictive performance on the task of predicting DTI. The values of AUPR on four benchmark data sets are improved to 0.988, 0.980, 0.938, and 0.906, respectively. Theoretically, THNCDF can process various high dimensional features by utilizing heterogeneous networks. However, we still have some problems to be solved in the future. First, even though studies have discussed multiple similarity calculation methods, they have not escaped the research scope on the network interactions. We are more looking forward to the introduction of new biochemical similarity calculation methods or data sets. Secondly, we suggest applying different embedding techniques, integrating more similarity measures from more sources, and generating more graph-based features. It can also be found that various data sets, such as chemical structure, side effect, therapeutic effect, gene expression, drug binding site, and semantic data, have been utilized in former studies. However, the disadvantages of these biomedical data sets are also obvious, which include high data noise, incompleteness, and inaccuracy. Thirdly, some potential extensions of our work include applying THNCDF to different networks formulated as an interaction prediction problem. Popular examples of interaction prediction in the bioinformatics field include but are not limited to drug&#x2013;drug interactions prediction, drug&#x2013;disease interactions prediction, and gene&#x2013;disease association prediction.</p>
</sec>
<sec id="S6">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.</p>
</sec>
<sec id="S7">
<title>Author Contributions</title>
<p>YZ and ZW: conceptualization and validation. YZ: methodology, writing&#x2014;review and editing, and supervision. ZW: software and writing&#x2014;original draft preparation. Both authors have read and agreed to the published version of the manuscript.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="S10">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="financial-disclosure">
<p><bold>Funding.</bold> This research was funded by the National Natural Science Foundation of China, grant number 61402054.</p>
</fn>
</fn-group>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alabdulwahhab</surname> <given-names>K.</given-names></name> <name><surname>Sami</surname> <given-names>W.</given-names></name> <name><surname>Mehmood</surname> <given-names>T.</given-names></name> <name><surname>Meo</surname> <given-names>S.</given-names></name> <name><surname>Alasbali</surname> <given-names>T.</given-names></name> <name><surname>Alwadani</surname> <given-names>F.</given-names></name></person-group> (<year>2021</year>). <article-title>Automated detection of diabetic retinopathy using machine learning classifiers.</article-title> <source><italic>Riv. Eur. Sci. Med. Farmacol</italic></source> <volume>25</volume> <fpage>583</fpage>&#x2013;<lpage>590</lpage>. <pub-id pub-id-type="doi">10.26355/eurrev_202101_24615</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Badkas</surname> <given-names>A.</given-names></name> <name><surname>De Landtsheer</surname> <given-names>S.</given-names></name> <name><surname>Sauter</surname> <given-names>T.</given-names></name></person-group> (<year>2020</year>). <article-title>Topological network measures for drug repositioning.</article-title> <source><italic>Brief. Bioinform.</italic></source> <fpage>1</fpage>&#x2013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbaa357</pub-id> <pub-id pub-id-type="pmid">33348366</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Breiman</surname> <given-names>L.</given-names></name></person-group> (<year>2001</year>). <article-title>Random forests.</article-title> <source><italic>Mach. Learn.</italic></source> <volume>45</volume> <fpage>5</fpage>&#x2013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.1023/A:1010933404324</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cao</surname> <given-names>D. S.</given-names></name> <name><surname>Zhang</surname> <given-names>L. X.</given-names></name> <name><surname>Tan</surname> <given-names>G. S.</given-names></name> <name><surname>Xiang</surname> <given-names>Z.</given-names></name> <name><surname>Zeng</surname> <given-names>W. B.</given-names></name> <name><surname>Xu</surname> <given-names>Q. S.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Computational prediction of drugtarget interactions using chemical, biological, and network features.</article-title> <source><italic>Mol. Inform.</italic></source> <volume>33</volume> <fpage>669</fpage>&#x2013;<lpage>681</lpage>. <pub-id pub-id-type="doi">10.1002/minf.201400009</pub-id> <pub-id pub-id-type="pmid">27485302</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>Q.</given-names></name> <name><surname>Lai</surname> <given-names>D.</given-names></name> <name><surname>Lan</surname> <given-names>W.</given-names></name> <name><surname>Wu</surname> <given-names>X.</given-names></name> <name><surname>Chen</surname> <given-names>B.</given-names></name> <name><surname>Chen</surname> <given-names>Y. P.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>ILDMSF: inferring associations between long non-coding RNA and disease based on multi-similarity fusion.</article-title> <source><italic>IEEE/ACM Trans. Comput. Biol. Bioinform.</italic></source> <volume>18</volume> <fpage>1106</fpage>&#x2013;<lpage>1112</lpage>. <pub-id pub-id-type="doi">10.1109/TCBB.2019.2936476</pub-id> <pub-id pub-id-type="pmid">31443046</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ding</surname> <given-names>Y.</given-names></name> <name><surname>Tang</surname> <given-names>J.</given-names></name> <name><surname>Guo</surname> <given-names>F.</given-names></name></person-group> (<year>2018</year>). <article-title>Identification of drug-side effect association via multiple information integration with centered kernel alignment.</article-title> <source><italic>Neurocomputing</italic></source> <volume>325</volume> <fpage>211</fpage>&#x2013;<lpage>224</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2018.10.028</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ding</surname> <given-names>Y.</given-names></name> <name><surname>Tang</surname> <given-names>J.</given-names></name> <name><surname>Guo</surname> <given-names>F.</given-names></name></person-group> (<year>2020b</year>). <article-title>Identification of drug&#x2013;target interactions via fuzzy bipartite local model.</article-title> <source><italic>Neural Comput. Applic.</italic></source> <volume>32</volume> <fpage>10303</fpage>&#x2013;<lpage>10319</lpage>. <pub-id pub-id-type="doi">10.1007/s00521-019-04569-z</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ding</surname> <given-names>Y.</given-names></name> <name><surname>Tang</surname> <given-names>J.</given-names></name> <name><surname>Guo</surname> <given-names>F.</given-names></name></person-group> (<year>2020a</year>). <article-title>Identification of drug&#x2013;target interactions via dual laplacian regularized least squares with multiple kernel fusion.</article-title> <source><italic>Knowl. Based Syst.</italic></source> <volume>204</volume>:<issue>106254</issue>. <pub-id pub-id-type="doi">10.1016/j.knosys.2020.106254</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fahimian</surname> <given-names>G.</given-names></name> <name><surname>Zahiri</surname> <given-names>J.</given-names></name> <name><surname>Arab</surname> <given-names>S. S.</given-names></name> <name><surname>Sajedi</surname> <given-names>R.</given-names></name></person-group> (<year>2020</year>). <article-title>RepCOOL: computational drug repositioning via integrating heterogeneous biological networks.</article-title> <source><italic>J. Transl. Med.</italic></source> <volume>18</volume>:<issue>375</issue>. <pub-id pub-id-type="doi">10.1186/s12967-020-02541-3</pub-id> <pub-id pub-id-type="pmid">33008415</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hao</surname> <given-names>M.</given-names></name> <name><surname>Wang</surname> <given-names>Y. L.</given-names></name> <name><surname>Bryant</surname> <given-names>S.</given-names></name></person-group> (<year>2016</year>). <article-title>Improved prediction of drug-target interactions using regularized least squares integrating with kernel fusion technique.</article-title> <source><italic>Anal. Chim. Acta</italic></source> <volume>909</volume> <fpage>41</fpage>&#x2013;<lpage>50</lpage>. <pub-id pub-id-type="doi">10.1016/j.aca.2016.01.014</pub-id> <pub-id pub-id-type="pmid">26851083</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>L.</given-names></name> <name><surname>Luo</surname> <given-names>H. M.</given-names></name> <name><surname>Li</surname> <given-names>S. N.</given-names></name> <name><surname>Wu</surname> <given-names>F. X.</given-names></name> <name><surname>Wang</surname> <given-names>J. X.</given-names></name></person-group> (<year>2020</year>). <article-title>Drug&#x2013;drug similarity measure and its applications.</article-title> <source><italic>Brief. Bioinform.</italic></source> <fpage>1</fpage>&#x2013;<lpage>20</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbaa265</pub-id> <pub-id pub-id-type="pmid">33152756</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jarada</surname> <given-names>T.</given-names></name> <name><surname>Rokne</surname> <given-names>J.</given-names></name> <name><surname>Alhajj</surname> <given-names>R.</given-names></name></person-group> (<year>2021</year>). <article-title>SNF-NN: computational method to predict drug-disease interactions using similarity network fusion and neural networks.</article-title> <source><italic>BMC Bioinformatics</italic></source> <volume>22</volume>:<issue>28</issue>. <pub-id pub-id-type="doi">10.1186/s12859-020-03950-3</pub-id> <pub-id pub-id-type="pmid">33482713</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ju&#x00E1;rez-Saldivar</surname> <given-names>A.</given-names></name> <name><surname>Schroeder</surname> <given-names>M.</given-names></name> <name><surname>Salentin</surname> <given-names>S.</given-names></name> <name><surname>Haupt</surname> <given-names>V.</given-names></name> <name><surname>Saavedra</surname> <given-names>E.</given-names></name> <name><surname>V&#x00E1;zquez</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Computational drug repositioning for chagas disease using protein-ligand interaction profiling.</article-title> <source><italic>Int. J. Mol. Sci.</italic></source> <volume>21</volume>:<issue>4270</issue>. <pub-id pub-id-type="doi">10.3390/ijms21124270</pub-id> <pub-id pub-id-type="pmid">32560043</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lan</surname> <given-names>W.</given-names></name> <name><surname>Lai</surname> <given-names>D.</given-names></name> <name><surname>Chen</surname> <given-names>Q.</given-names></name> <name><surname>Wu</surname> <given-names>X.</given-names></name> <name><surname>Chen</surname> <given-names>B.</given-names></name> <name><surname>Liu</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>LDICDL: LncRNA-disease association identification based on Collaborative Deep Learning.</article-title> <source><italic>IEEE/ACM Trans. Comput. Biol. Bioinform.</italic></source> <fpage>1</fpage>&#x2013;<lpage>1</lpage>. <pub-id pub-id-type="doi">10.1109/TCBB.2020.3034910</pub-id> <pub-id pub-id-type="pmid">33125333</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lan</surname> <given-names>W.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Li</surname> <given-names>M.</given-names></name> <name><surname>Liu</surname> <given-names>J.</given-names></name> <name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Wu</surname> <given-names>F.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Predicting drug-target interaction using positive-unlabeled learning.</article-title> <source><italic>Neurocomputing</italic></source> <volume>206</volume> <fpage>50</fpage>&#x2013;<lpage>57</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2016.03.080</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>W. J.</given-names></name> <name><surname>Xu</surname> <given-names>H. Y.</given-names></name> <name><surname>Li</surname> <given-names>H. X.</given-names></name> <name><surname>Yang</surname> <given-names>Y. J.</given-names></name> <name><surname>Sharma</surname> <given-names>P.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Complexity and algorithms for superposed data uploading problem in networks with smart devices.</article-title> <source><italic>IEEE Intern.Things J.</italic></source> <volume>7</volume> <fpage>5882</fpage>&#x2013;<lpage>5891</lpage>. <pub-id pub-id-type="doi">10.1109/JIOT.2019.2949352</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>J.</given-names></name> <name><surname>Wang</surname> <given-names>W. T.</given-names></name> <name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Sun</surname> <given-names>G. Z.</given-names></name> <name><surname>Yang</surname> <given-names>A.</given-names></name></person-group> (<year>2019</year>). <article-title>Classification and research of skin lesions based on machine learning.</article-title> <source><italic>Comput. Mater. Cont.</italic></source> <volume>61</volume> <fpage>1187</fpage>&#x2013;<lpage>1200</lpage>. <pub-id pub-id-type="doi">10.32604/cmc.2020.05883</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>Y.</given-names></name> <name><surname>Wu</surname> <given-names>M.</given-names></name> <name><surname>Miao</surname> <given-names>C.</given-names></name> <name><surname>Zhao</surname> <given-names>P.</given-names></name> <name><surname>Li</surname> <given-names>X.</given-names></name></person-group> (<year>2016</year>). <article-title>Neighborhood regularized logistic matrix factorization for drug-target interaction prediction.</article-title> <source><italic>PLoS Comput. Biol.</italic></source> <volume>12</volume>:<issue>e1004760</issue>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1004760</pub-id> <pub-id pub-id-type="pmid">26872142</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luo</surname> <given-names>H. M.</given-names></name> <name><surname>Li</surname> <given-names>M.</given-names></name> <name><surname>Yang</surname> <given-names>M. Y.</given-names></name> <name><surname>Wu</surname> <given-names>F. X.</given-names></name> <name><surname>Li</surname> <given-names>Y. H.</given-names></name> <name><surname>Wang</surname> <given-names>J. X.</given-names></name></person-group> (<year>2020</year>). <article-title>Biomedical data and computational models for drug repositioning: a comprehensive review.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>22</volume> <fpage>1604</fpage>&#x2013;<lpage>1619</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbz176</pub-id> <pub-id pub-id-type="pmid">32043521</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meng</surname> <given-names>Y. J.</given-names></name> <name><surname>Jin</surname> <given-names>M.</given-names></name> <name><surname>Tang</surname> <given-names>X. F.</given-names></name> <name><surname>Xu</surname> <given-names>J. L.</given-names></name></person-group> (<year>2021</year>). <article-title>Drug repositioning based on similarity constrained probabilistic matrix factorization: COVID-19 as a case study.</article-title> <source><italic>Appl. Soft Comput.</italic></source> <volume>103</volume>:<issue>107135</issue>. <pub-id pub-id-type="doi">10.1016/j.asoc.2021.107135</pub-id> <pub-id pub-id-type="pmid">33519322</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rauschenbach</surname> <given-names>L.</given-names></name> <name><surname>Wieland</surname> <given-names>A.</given-names></name> <name><surname>Reinartz</surname> <given-names>R.</given-names></name> <name><surname>Kebir</surname> <given-names>S.</given-names></name> <name><surname>Till</surname> <given-names>A.</given-names></name> <name><surname>Darkwah Oppong</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Drug repositioning of antiretroviral ritonavir for combinatorial therapy in glioblastoma.</article-title> <source><italic>Eur. J. Cancer</italic></source> <volume>140</volume> <fpage>130</fpage>&#x2013;<lpage>139</lpage>. <pub-id pub-id-type="doi">10.1016/j.ejca.2020.09.017</pub-id> <pub-id pub-id-type="pmid">33091717</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rayhan</surname> <given-names>F.</given-names></name> <name><surname>Ahmed</surname> <given-names>S.</given-names></name> <name><surname>Shatabda</surname> <given-names>S.</given-names></name> <name><surname>Farid</surname> <given-names>D.</given-names></name> <name><surname>Mousavian</surname> <given-names>Z.</given-names></name> <name><surname>Dehzangi</surname> <given-names>I.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>IDTI-ESBoost: identification of drug target interaction using evolutionary and structural features with boosting.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>7</volume>:<issue>17731</issue>. <pub-id pub-id-type="doi">10.1038/s41598-017-18025-2</pub-id> <pub-id pub-id-type="pmid">29255285</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roessler</surname> <given-names>H.</given-names></name> <name><surname>Knoers</surname> <given-names>N.</given-names></name> <name><surname>van haelst</surname> <given-names>M.</given-names></name> <name><surname>Haaften</surname> <given-names>G.</given-names></name></person-group> (<year>2021</year>). <article-title>Drug repurposing for rare diseases.</article-title> <source><italic>Trends Pharmacol. Sci.</italic></source> <volume>75</volume> <fpage>157</fpage>&#x2013;<lpage>160</lpage>. <pub-id pub-id-type="doi">10.1016/j.tips.2021.01.003</pub-id> <pub-id pub-id-type="pmid">33563480</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stein</surname> <given-names>M.</given-names></name> <name><surname>Levey</surname> <given-names>D.</given-names></name> <name><surname>Cheng</surname> <given-names>Z. S.</given-names></name> <name><surname>Wendt</surname> <given-names>F.</given-names></name> <name><surname>Harrington</surname> <given-names>K.</given-names></name> <name><surname>Pathak</surname> <given-names>G.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>Genome-wide association analyses of post-traumatic stress disorder and its symptom subdomains in the Million Veteran Program.</article-title> <source><italic>Nat. Genet.</italic></source> <volume>53</volume> <fpage>174</fpage>&#x2013;<lpage>184</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-020-00767-x</pub-id> <pub-id pub-id-type="pmid">33510476</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thafar</surname> <given-names>M.</given-names></name> <name><surname>Olayan</surname> <given-names>R.</given-names></name> <name><surname>Ashoor</surname> <given-names>H.</given-names></name> <name><surname>Albaradei</surname> <given-names>S.</given-names></name> <name><surname>Bajic</surname> <given-names>V.</given-names></name> <name><surname>Gao</surname> <given-names>X.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>DTiGEMS+: drug-target interaction prediction using graph embedding, graph mining, and similarity-based techniques.</article-title> <source><italic>J. Cheminform.</italic></source> <volume>12</volume>:<issue>44</issue>. <pub-id pub-id-type="doi">10.1186/s13321-020-00447-2</pub-id> <pub-id pub-id-type="pmid">33431036</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Turanli</surname> <given-names>B.</given-names></name> <name><surname>Grotli</surname> <given-names>M.</given-names></name> <name><surname>Boren</surname> <given-names>J.</given-names></name> <name><surname>Nielsen</surname> <given-names>J.</given-names></name> <name><surname>Uhlen</surname> <given-names>M.</given-names></name> <name><surname>Arga</surname> <given-names>K.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Drug repositioning for effective prostate cancer treatment.</article-title> <source><italic>Front. Physiol.</italic></source> <volume>9</volume>:<issue>500</issue>. <pub-id pub-id-type="doi">10.3389/fphys.2018.00500</pub-id> <pub-id pub-id-type="pmid">29867548</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vivarelli</surname> <given-names>S.</given-names></name> <name><surname>Candido</surname> <given-names>S.</given-names></name> <name><surname>Caruso</surname> <given-names>G.</given-names></name> <name><surname>Falzone</surname> <given-names>L.</given-names></name> <name><surname>Libra</surname> <given-names>M.</given-names></name></person-group> (<year>2020</year>). <article-title>Patient-derived tumor organoids for drug repositioning in cancer care: a promising approach in the era of tailored treatment.</article-title> <source><italic>Cancers.</italic></source> <volume>12</volume>:<issue>3636</issue>. <pub-id pub-id-type="doi">10.3390/cancers12123636</pub-id> <pub-id pub-id-type="pmid">33291603</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yamanishi</surname> <given-names>Y.</given-names></name> <name><surname>Araki</surname> <given-names>M.</given-names></name> <name><surname>Gutteridge</surname> <given-names>A.</given-names></name> <name><surname>Honda</surname> <given-names>W.</given-names></name> <name><surname>Kanehisa</surname> <given-names>M.</given-names></name></person-group> (<year>2008</year>). <article-title>Prediction of drug&#x2013;target interaction networks from the integration of chemical and genomic spaces.</article-title> <source><italic>Bioinformatics (Oxf. Engl.)</italic></source> <volume>24</volume> <fpage>i232</fpage>&#x2013;<lpage>i240</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btn162</pub-id> <pub-id pub-id-type="pmid">18586719</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>J. B.</given-names></name> <name><surname>Zhang</surname> <given-names>D. N.</given-names></name> <name><surname>Liu</surname> <given-names>L.</given-names></name> <name><surname>Li</surname> <given-names>G. Q.</given-names></name> <name><surname>Cai</surname> <given-names>Y. Y.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Computational drug repositioning based on the relationships between substructure&#x2013;indication.</article-title> <source><italic>Brief. Bioinform.</italic></source> <fpage>1</fpage>&#x2013;<lpage>11</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbaa348</pub-id> <pub-id pub-id-type="pmid">33313675</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zamami</surname> <given-names>Y.</given-names></name> <name><surname>Imanishi</surname> <given-names>M.</given-names></name> <name><surname>Takechi</surname> <given-names>K.</given-names></name> <name><surname>Ishizawa</surname> <given-names>K.</given-names></name></person-group> (<year>2017</year>). <article-title>Pharmacological approach for drug repositioning against cardiorenal diseases.</article-title> <source><italic>J. Med. Invest.</italic></source> <volume>64</volume> <fpage>197</fpage>&#x2013;<lpage>201</lpage>. <pub-id pub-id-type="doi">10.2152/jmi.64.197</pub-id> <pub-id pub-id-type="pmid">28954981</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zeng</surname> <given-names>X. X.</given-names></name> <name><surname>Zhu</surname> <given-names>S. Y.</given-names></name> <name><surname>Liu</surname> <given-names>X. R.</given-names></name> <name><surname>Zhou</surname> <given-names>Y. D.</given-names></name> <name><surname>Nussinov</surname> <given-names>R.</given-names></name> <name><surname>Cheng</surname> <given-names>F. X.</given-names></name></person-group> (<year>2019</year>). <article-title>DeepDR: A network-based deep learning approach to in silico drug repositioning.</article-title> <source><italic>Bioinformatics (Oxf. Engl.)</italic></source> <volume>35</volume> <fpage>5191</fpage>&#x2013;<lpage>5198</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btz418</pub-id> <pub-id pub-id-type="pmid">31116390</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>J. Y.</given-names></name> <name><surname>Zhong</surname> <given-names>S. Q.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Yang</surname> <given-names>Y. Q.</given-names></name> <name><surname>Wei</surname> <given-names>B. Y.</given-names></name><etal/></person-group> (<year>2020</year>). &#x201C;<article-title>A review on blockchain-based systems and applications</article-title>,&#x201D; in <source><italic>Internet of Vehicles. Technologies and Services Toward Smart Cities. IOV 2019. Lecture Notes in Computer Science</italic></source>, <volume>Vol. 11894</volume> <role>eds</role> <person-group person-group-type="editor"><name><surname>Hsu</surname> <given-names>C. H.</given-names></name> <name><surname>Kallel</surname> <given-names>S.</given-names></name> <name><surname>Lan</surname> <given-names>K. C.</given-names></name> <name><surname>Zheng</surname> <given-names>Z.</given-names></name></person-group> (<publisher-loc>Cham</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>237</fpage>&#x2013;<lpage>249</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-030-38651-1_20</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zheng</surname> <given-names>Y.</given-names></name> <name><surname>Wu</surname> <given-names>Z.</given-names></name></person-group> (<year>2021</year>). <article-title>A machine learning-based biological drug-target interaction prediction method for a tripartite heterogeneous network.</article-title> <source><italic>ACS Omega</italic></source> <volume>6</volume> <fpage>3037</fpage>&#x2013;<lpage>3045</lpage>. <pub-id pub-id-type="doi">10.1021/acsomega.0c05377</pub-id> <pub-id pub-id-type="pmid">33553921</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>R. Y.</given-names></name> <name><surname>Lu</surname> <given-names>Z. L.</given-names></name> <name><surname>Luo</surname> <given-names>H. M.</given-names></name> <name><surname>Xiang</surname> <given-names>J.</given-names></name> <name><surname>Zeng</surname> <given-names>M.</given-names></name> <name><surname>Li</surname> <given-names>M.</given-names></name></person-group> (<year>2020</year>). <article-title>NEDD: a network embedding based method for predicting drug-disease associations.</article-title> <source><italic>BMC Bioinformatics</italic></source> <volume>21</volume>:<issue>387</issue>. <pub-id pub-id-type="doi">10.1186/s12859-020-03682-4</pub-id> <pub-id pub-id-type="pmid">32938396</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>Z. H.</given-names></name> <name><surname>Feng</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>Deep forest.</article-title> <source><italic>Natl. Sci. Rev.</italic></source> <volume>6</volume> <fpage>74</fpage>&#x2013;<lpage>86</lpage>. <pub-id pub-id-type="doi">10.1093/nsr/nwy108</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhu</surname> <given-names>D. J.</given-names></name> <name><surname>Sun</surname> <given-names>Y. D.</given-names></name> <name><surname>Li</surname> <given-names>X. F.</given-names></name> <name><surname>Du</surname> <given-names>H. W.</given-names></name> <name><surname>Qu</surname> <given-names>R. N.</given-names></name> <name><surname>Yu</surname> <given-names>P. P.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>MINE: a method of multi-interaction heterogeneous information network embedding.</article-title> <source><italic>Comput. Mater. Cont.</italic></source> <volume>63</volume> <fpage>1343</fpage>&#x2013;<lpage>1356</lpage>. <pub-id pub-id-type="doi">10.32604/cmc.2020.010008</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhuang</surname> <given-names>X. H.</given-names></name> <name><surname>Ye</surname> <given-names>R.</given-names></name> <name><surname>So</surname> <given-names>M. T.</given-names></name> <name><surname>Lam</surname> <given-names>W. Y.</given-names></name> <name><surname>Karim</surname> <given-names>A.</given-names></name> <name><surname>Yu</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>A random forest-based framework for genotyping and accuracy assessment of copy number variations.</article-title> <source><italic>NAR Genomics Bioinform.</italic></source> <volume>2</volume>:<issue>lqaa071</issue>. <pub-id pub-id-type="doi">10.1093/nargab/lqaa071</pub-id> <pub-id pub-id-type="pmid">33575619</pub-id></citation></ref>
</ref-list>
</back>
</article>