<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fgene.2021.727744</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Predicting miRNA-Disease Associations Based on Heterogeneous Graph Attention Networks</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Ji</surname> <given-names>Cunmei</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1378209/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Wang</surname> <given-names>Yutian</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Ni</surname> <given-names>Jiancheng</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Zheng</surname> <given-names>Chunhou</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/838718/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Su</surname> <given-names>Yansen</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="corresp" rid="c002"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1125397/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>School of Cyber Science and Engineering, Qufu Normal University</institution>, <addr-line>Qufu</addr-line>, <country>China</country></aff>
<aff id="aff2"><sup>2</sup><institution>School of Artificial Intelligence, Anhui University</institution>, <addr-line>Hefei</addr-line>, <country>China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Hongmin Cai, South China University of Technology, China</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Xing Chen, China University of Mining and Technology, China; Qi Zhao, University of Science and Technology Liaoning, China; Wei Lan, Guangxi University, China</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Cunmei Ji <email>cunmeiji&#x00040;126.com</email></corresp>
<corresp id="c002">Yansen Su <email>suyansen&#x00040;ahu.edu.cn</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics</p></fn></author-notes>
<pub-date pub-type="epub">
<day>25</day>
<month>08</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>12</volume>
<elocation-id>727744</elocation-id>
<history>
<date date-type="received">
<day>19</day>
<month>06</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>02</day>
<month>08</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2021 Ji, Wang, Ni, Zheng and Su.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Ji, Wang, Ni, Zheng and Su</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license></permissions>
<abstract><p>In recent years, more and more evidence has shown that microRNAs (miRNAs) play an important role in the regulation of post-transcriptional gene expression, and are closely related to human diseases. Many studies have also revealed that miRNAs can be served as promising biomarkers for the potential diagnosis and treatment of human diseases. The interactions between miRNA and human disease have rarely been demonstrated, and the underlying mechanism of miRNA is not clear. Therefore, computational approaches has attracted the attention of researchers, which can not only save time and money, but also improve the efficiency and accuracy of biological experiments. In this work, we proposed a Heterogeneous Graph Attention Networks (GAT) based method for miRNA-disease associations prediction, named HGATMDA. We constructed a heterogeneous graph for miRNAs and diseases, introduced weighted DeepWalk and GAT methods to extract features of miRNAs and diseases from the graph. Moreover, a fully-connected neural networks is used to predict correlation scores between miRNA-disease pairs. Experimental results under five-fold cross validation (five-fold CV) showed that HGATMDA achieved better prediction performance than other state-of-the-art methods. In addition, we performed three case studies on breast neoplasms, lung neoplasms and kidney neoplasms. The results showed that for the three diseases mentioned above, 50 out of top 50 candidates were confirmed by the validation datasets. Therefore, HGATMDA is suitable as an effective tool to identity potential diseases-related miRNAs.</p></abstract>
<kwd-group>
<kwd>disease</kwd>
<kwd>miRNA</kwd>
<kwd>graph attention networks</kwd>
<kwd>miRNA-disease association</kwd>
<kwd>DeepWalk</kwd>
</kwd-group>
<counts>
<fig-count count="4"/>
<table-count count="6"/>
<equation-count count="16"/>
<ref-count count="60"/>
<page-count count="12"/>
<word-count count="8463"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>1. Introduction</title>
<p>MicoRNAs (miRNAs) are a class of endogenous non-coding RNAs with a length of about 21&#x02013;25 nucleotides, which play an important role in the regulation of post-transcriptional gene expression in organisms (Ambros, <xref ref-type="bibr" rid="B1">2001</xref>, <xref ref-type="bibr" rid="B3">2004</xref>; Bartel, <xref ref-type="bibr" rid="B4">2004</xref>, <xref ref-type="bibr" rid="B6">2018</xref>). Over the past decades, researchers have identified hundreds of miRNAs in humans and shown that many of them interact with most human mRNAs (Friedman et al., <xref ref-type="bibr" rid="B20">2009</xref>). Recent studies have discovered that miRNAs down-regulate gene expression by degrading or silencing targeting mRNAs, thereby affecting many cellular processes, such as growth, development, differentiation, and death (Ambros, <xref ref-type="bibr" rid="B2">2003</xref>; He and Hannon, <xref ref-type="bibr" rid="B23">2004</xref>; Bartel, <xref ref-type="bibr" rid="B5">2009</xref>). Furthermore, many researches have found that human miRNAs are involved in many human diseases (Croce, <xref ref-type="bibr" rid="B18">2008</xref>; Li et al., <xref ref-type="bibr" rid="B34">2014</xref>; Chou et al., <xref ref-type="bibr" rid="B17">2016</xref>; Huang et al., <xref ref-type="bibr" rid="B25">2019b</xref>). Therefore, miRNAs are promising biomarkers for diagnosis and treatment of human diseases.</p>
<p>Based on previous biological experiments, verifying the association between miRNA and disease is time-consuming and expensive. Computational methods can efficiently select the promising disease-associated miRNAs for further experimental verification. Many computational methods have been proposed for predicting the miRNA-disease associations, which can be roughly divided into two categories: one is based on similarity networks, the other is based on machine learning. Jiang et al. (<xref ref-type="bibr" rid="B27">2010</xref>) have firstly introduced similarity network to compute the score between disease and miRNA. They constructed functional miRNA network by computing the overlap of the target genes, and applied the hypergeometric distribution to calculate the association score between disease and miRNA. Since then, various network based approaches have been proposed (Chen et al., <xref ref-type="bibr" rid="B14">2016</xref>, <xref ref-type="bibr" rid="B12">2018c</xref>; Pan et al., <xref ref-type="bibr" rid="B41">2019</xref>; Yu et al., <xref ref-type="bibr" rid="B57">2019</xref>). You et al. (<xref ref-type="bibr" rid="B56">2017</xref>) have constructed a heterogeneous network, then proposed a novel path based method named PBMDA for inferring the disease-related miRNAs. However, this method only uses sub-graph information for prediction, which can be improved by considering the global information in the heterogeneous graph. Chen et al. (<xref ref-type="bibr" rid="B9">2012</xref>) have firstly developed random walk with restart for predicting miRNA-disease associations (RWRMDA). However, the requirement for at least one known related miRNA in inference may limit the application of RWRMDA. Subsequently, many methods based on random walk were proposed to improve the prediction performance.</p>
<p>Machine learning methods have also been introduced in this field. Matrix completion has been widely used in recommendation systems (Koren et al., <xref ref-type="bibr" rid="B31">2009</xref>). Inspired by these, Li et al. (<xref ref-type="bibr" rid="B33">2017</xref>) developed a matrix completion based model to predict the disease-related miRNAs (MCMDA). However, it will be failed to predict new disease (or miRNA) that has no connections with the known miRNAs (or diseases). Chen et al. (<xref ref-type="bibr" rid="B11">2018b</xref>) proposed a effective method of inductive matrix completion for predicting, which can be applied for new diseases (or miRNAs), named IMCMDA, it achieved better prediction performance than MCMDA. Almost at the same time, Xiao et al. (<xref ref-type="bibr" rid="B51">2018</xref>) proposed a novel method called GRNMF, which combined graph Laplacian regularization with non-negative matrix factorization for miRNA-disease associations prediction in the integrated heterogeneous networks. Chen et al. (<xref ref-type="bibr" rid="B15">2018e</xref>) introduced low rank matrix decomposition to reduce noises in the datasets, then inferred the associations between miRNAs and diseases in the integrated heterogeneous graph, including disease semantic network, miRNA functional network, the relative GIP kernel networks, and miRNA-disease associations. Chen et al. (<xref ref-type="bibr" rid="B13">2018d</xref>) used bipartite recommendation algorithm to generate the association score between disease and miRNA. In addition, there are many other studies based on matrix completion and matrix factorization to predict potential connections between miRNAs and diseases (Shen et al., <xref ref-type="bibr" rid="B47">2017</xref>; Zhong et al., <xref ref-type="bibr" rid="B59">2018</xref>; Yan et al., <xref ref-type="bibr" rid="B54">2019</xref>; Yu et al., <xref ref-type="bibr" rid="B57">2019</xref>). Furthermore, lots of supervised learning algorithms have been introduced in this field. Chen et al. (<xref ref-type="bibr" rid="B16">2019</xref>) used decision tree to infer the miRNA-disease associations. SVM-based methods have been used for predicting the potential relations between miRNAs and diseases (Xu et al., <xref ref-type="bibr" rid="B52">2011</xref>; Chen et al., <xref ref-type="bibr" rid="B8">2018a</xref>). Chen et al. (<xref ref-type="bibr" rid="B7">2015</xref>) introduced restricted Boltzmann machine for predicting associations and different types between miRNAs and diseases. Chen et al. (<xref ref-type="bibr" rid="B10">2021</xref>) integrated matrix completions with neighborhood constraint for prediction.</p>
<p>Recent decades, deep learning based methods have been gradually used in this field. Fu and Peng (<xref ref-type="bibr" rid="B21">2017</xref>) proposed a deep ensemble method and adopted stacked autoencoder for obtaining high-level features from integrated similarities, then a three layers neural networks (NN) was used for prediction. Peng et al. (<xref ref-type="bibr" rid="B43">2019</xref>) developed a CNN based model named MDA-CNN for prediction. They constructed disease-gene-miRNA networks, and obtained features of diseases and miRNAs by measuring Pearson correlation coefficient with genes. Then auto-encoder was used for feature selection, followed by a convolutional neural networks (CNN) model for further feature extraction between miRNA-disease pairs. Finally, two fully-connected layers was introduced for classification. Xuan et al. (<xref ref-type="bibr" rid="B53">2018</xref>) integrated two CNN models to predict correlation score of the miRNA-disease pair. In order to reduce the impact of negative sample missing on the prediction performance, Zhang et al. (<xref ref-type="bibr" rid="B58">2019</xref>) constructed two spliced matrices with the known miRNA-disease associations, disease similarity and miRNA similarity, then used variational autoencoder to calculate the unknown values of miRNA-disease pairs. Ji et al. (<xref ref-type="bibr" rid="B26">2021</xref>) used two regression models to learn dense vectors from integrated disease and miRNA similarities, applied the reconstruction probability of an autoencoder model for inferring.</p>
<p>Graph neural networks (GNNs) are a class of models that can effectively extract information from its neighborhood in the graph, it has achieved great success in social networks, knowledge graph, biology and so on (Zhou et al., <xref ref-type="bibr" rid="B60">2020</xref>). DimiG considers the tissue-specific expressions as the features of miRNAs and protein coding genes, and then applies graph convolutional network (GCN) in the protein-coding genes and miRNAs networks (Pan and Shen, <xref ref-type="bibr" rid="B40">2019</xref>). Li et al. (<xref ref-type="bibr" rid="B32">2020a</xref>) combined GCN and IMC algorithm to identify disease-related miRNAs. However, the input features of diseases and miRNAs were initialized randomly, which reduced the ability of GCN. Long et al. (<xref ref-type="bibr" rid="B36">2021</xref>) developed a novel computational model to predict microbe-disease association (GATMDA). It firstly constructed the input features by integrating similarities of diseases and microbes, and a bipartite network of known microbe-disease associations. Then, it used graph attention networks (GAT) for further learning representations of nodes from the graph. Finally, a decoder with inductive matrix completion (IMC) was selected for prediction. Long et al. (<xref ref-type="bibr" rid="B37">2020</xref>) also proposed GNN based model to predict human microbe-drug connections. Unlike GATMDA, they adopted RWR method for initial features learning, and then used GNN with random field. Additionally, negative samples were needed for regression training. GAEMDA integrated similarities of diseases and miRNAs as features of nodes, applied a GCN model for further feature extraction, and then used a bilinear decoder for identification (Li et al., <xref ref-type="bibr" rid="B35">2020b</xref>).</p>
<p>In this paper, we proposed a novel computational model (HGATMDA) that combines graph embedding and graph attention networks to infer disease-related miRNAs. Specifically, we first constructed a miRNA-disease heterogeneous graph. Then, we adopted weighted DeepWalk to learn dense representations of miRNAs and diseases from miRNA-miRNA sub-graph and disease-disease sub-graph, respectively. In addition, we utilized graph attention networks (GAT) to further learn the graph structure information from the miRNA-disease sub-graph. Furthermore, we presented a fully-connected neural networks for inferring the potential associations between miRNAs and diseases. Finally, we evaluated the performance of HGATMDA under five-fold cross validation (CV). The results showed that HGATMDA achieved the best area under ROC curve (AUC) of 94.54 &#x000B1; 0.34%, the area under P-R curve (AUPR) of 94.05 &#x000B1; 0.18%, Accuracy of 87.02%, Precision of 94.07%, Recall of 90.04%, F1-score of 87.39%. We conducted case studies on three common human diseases such as breast neoplasms, lung neoplasms, and kidney neoplsms, to further evaluate the performance of HGATMDA. For the three diseases, 50 out of top 50 candidates were confirmed by the validation datasets. Experimental results showed that HGATMDA performed better performance than other state-of-the-art methods, and can be used as an efficient and accurate tool to identity the underlying associations between miRNAs and diseases.</p></sec>
<sec sec-type="materials and methods" id="s2">
<title>2. Materials and Methods</title>
<sec>
<title>2.1. Overview</title>
<p>In this study, we proposed a new computational method, HGATMDA, which combines graph embedding and graph attention network (GAT) methods to identify miRNA-disease associations. As shows in <xref ref-type="fig" rid="F1">Figure 1</xref>, HGATMDA consists of four parts. Firstly, HGATMDA builds a heterogeneous graph including disease-disease sub-graph, miRNA-miRNA sub-graph and miRNA-disease sub-graph. Secondly, HGATMDA adopts a novel weighted DeepWalk to obtain dense representations of diseases and miRNAs from the disease-disease and miRNA-miRNA sub-graphs. Thirdly, it applies GAT to learn node features from the miRNA-disease sub-graph. Lastly, HGATMDA introduces a fully-connected (FC) neural network as an effective classifier for inferring the potential disease-related miRNAs.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>The workflow of HGATMDA for miRNA-disease association prediction.</p></caption>
<graphic xlink:href="fgene-12-727744-g0001.tif"/>
</fig></sec>
<sec>
<title>2.2. Human miRNA Disease Database</title>
<p>HMDD v2.0 (Wang et al., <xref ref-type="bibr" rid="B50">2010</xref>) and HMDD v3.2 (Huang et al., <xref ref-type="bibr" rid="B24">2019a</xref>) databases, the experimental verified human miRNA-disease associations, are used in this paper. We directly downloaded data from <ext-link ext-link-type="uri" xlink:href="http://www.cuilab.cn/hmdd">http://www.cuilab.cn/hmdd</ext-link>, and constructed a miRNA-disease graph with known associations between miRNAs and diseases. We denote the association matrix as <inline-formula><mml:math id="M1"><mml:msup><mml:mrow><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>&#x000D7;</mml:mo><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msup></mml:math></inline-formula>, and <italic>N</italic><sub><italic>d</italic></sub> is the number of diseases, <italic>N</italic><sub><italic>m</italic></sub> is the number of miRNAs. The value of <italic>A</italic><sub><italic>ij</italic></sub> &#x02208; {0, 1} indicates whether there is a known connections existed between miRNA <italic>m</italic><sub><italic>i</italic></sub> and disease <italic>d</italic><sub><italic>j</italic></sub>. Here, <italic>N</italic><sub><italic>d</italic></sub> &#x0003D; 383 and <italic>N</italic><sub><italic>m</italic></sub> &#x0003D; 495, and 5,430 known interactions between miRNAs and diseases in the HMDD v2.0. We found that the names in HMDD v3.2, including diseases and miRNAs, did not exactly match those in HMDD V2.0. For consistency, we mapped the items in HMDD v3.2 to HMDD v2.0 as the previous work (Wang et al., <xref ref-type="bibr" rid="B50">2010</xref>).</p></sec>
<sec>
<title>2.3. Disease Semantic Similarity</title>
<p>The Medical Subject Headings (MeSH) dataset contains hierarchical relationships between diseases. Tree numbers in the Mesh headings denotes parent-child associations between nodes in the networks. By using the disease name as nodes and tree numbers as edges, we can build a directed acyclic graph (DAG) with the headings for each disease (<ext-link ext-link-type="uri" xlink:href="https://www.nlm.nih.gov/mesh/meshhome.html">https://www.nlm.nih.gov/mesh/meshhome.html</ext-link>). We define <italic>DAG</italic><sub><italic>d</italic></sub> &#x0003D; (<italic>T</italic><sub><italic>d</italic></sub>, <italic>E</italic><sub><italic>d</italic></sub>) as disease <italic>d</italic>, and <italic>T</italic><sub><italic>d</italic></sub> and <italic>E</italic><sub><italic>d</italic></sub> are the nodes and edges in the DAG. Then, we can compute the contribution of disease <italic>d</italic><sub><italic>i</italic></sub> &#x02208; <italic>T</italic><sub><italic>d</italic></sub> to disease <italic>d</italic> as follows:</p>
<disp-formula id="E1"><label>(1)</label><mml:math id="M2"><mml:mrow><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mtable columnalign='left'><mml:mtr columnalign='left'><mml:mtd columnalign='left'><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi>d</mml:mi></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>d</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mtd><mml:mtd columnalign='left'><mml:mrow><mml:mtext>if&#x000A0;</mml:mtext><mml:mi>d</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>d</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mtd></mml:mtr><mml:mtr columnalign='left'><mml:mtd columnalign='left'><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi>d</mml:mi></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>d</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo><mml:mo>=</mml:mo><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:mi>x</mml:mi><mml:mo>&#x0007B;</mml:mo><mml:mo>&#x00394;</mml:mo><mml:msub><mml:mi>D</mml:mi><mml:mi>d</mml:mi></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:msubsup><mml:mi>d</mml:mi><mml:mi>i</mml:mi><mml:mo>&#x02032;</mml:mo></mml:msubsup><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x0007C;</mml:mo><mml:msubsup><mml:mi>d</mml:mi><mml:mi>i</mml:mi><mml:mo>&#x02032;</mml:mo></mml:msubsup><mml:mo>&#x02208;</mml:mo><mml:mtext>children&#x000A0;of&#x000A0;</mml:mtext><mml:msub><mml:mi>d</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>&#x0007D;</mml:mo></mml:mrow></mml:mtd><mml:mtd columnalign='left'><mml:mrow><mml:mtext>if&#x000A0;</mml:mtext><mml:mi>d</mml:mi><mml:mo>&#x02260;</mml:mo><mml:msub><mml:mi>d</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:math></disp-formula>
<p>where &#x00394; is a weight decay parameter and is set as 0.5 in this paper. We define the semantic value of disease <italic>d</italic> by the following formula:</p>
<disp-formula id="E2"><label>(2)</label><mml:math id="M3"><mml:mtable class="eqnarray" columnalign="left left left"><mml:mtr><mml:mtd><mml:mi>D</mml:mi><mml:mi>V</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:munder class="msub"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup><mml:mo>&#x02208;</mml:mo><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munder></mml:mstyle><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Based on the assumption, the more overlap between two DAGs, the more similar the two diseases are. The similarity between disease <italic>d</italic><sub><italic>i</italic></sub> and <italic>d</italic><sub><italic>j</italic></sub> can be calculated as follows:</p>
<disp-formula id="E3"><label>(3)</label><mml:math id="M4"><mml:mtable class="eqnarray" columnalign="left left left"><mml:mtr><mml:mtd><mml:mi>S</mml:mi><mml:mi>S</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mstyle displaystyle="true"><mml:msub><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup><mml:mo>&#x02208;</mml:mo><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mo>&#x02229;</mml:mo><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:msub></mml:mstyle><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>D</mml:mi><mml:mi>V</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:mi>D</mml:mi><mml:mi>V</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
</sec>
<sec>
<title>2.4. miRNA Functional Similarity</title>
<p>The basic assumption is that miRNAs with similar functions tend to be connected with similar diseases, and vice versa (Lu et al., <xref ref-type="bibr" rid="B38">2008</xref>; Wang et al., <xref ref-type="bibr" rid="B50">2010</xref>). Wang et al. have proposed miRNA MISIM functional similarity, that is, to calculate the similarity score by the related disease DAG between two miRNAs. Thanks to this excellent work, we can directly download the data from website (<ext-link ext-link-type="uri" xlink:href="http://www.cuilab.cn/files/images/cuilab/misim.zip">http://www.cuilab.cn/files/images/cuilab/misim.zip</ext-link>). We then constructed the miRNA functional network, which is denoted as <italic>FS</italic> with the shape of <italic>N</italic><sub><italic>m</italic></sub> &#x000D7; <italic>N</italic><sub><italic>m</italic></sub>, where <italic>N</italic><sub><italic>m</italic></sub> is the number of miRNAs. The element <italic>FS</italic>(<italic>m</italic><sub><italic>i</italic></sub>, <italic>m</italic><sub><italic>j</italic></sub>) represents the similarity score between miRNA <italic>m</italic><sub><italic>i</italic></sub> and miRNA <italic>m</italic><sub><italic>j</italic></sub>.</p></sec>
<sec>
<title>2.5. Gaussian Interaction Profile Kernel Similarity for Disease and miRNA</title>
<p>In order to overcome the sparsity in disease semantic similarity <italic>SS</italic> and miRNA functional similarity <italic>FS</italic>, we introduced the Gaussian interaction profile (GIP) kernel similarity. Firstly, we used column <italic>A</italic><sub>&#x000B7;<italic>i</italic></sub> to represent a disease <italic>d</italic><sub><italic>i</italic></sub> and row <italic>A</italic><sub><italic>j</italic>&#x000B7;</sub> to represent miRNA <italic>m</italic><sub><italic>j</italic></sub>. Then, GIP similarities between two diseases or two miRNAs are defined as follows:</p>
<disp-formula id="E4"><label>(4)</label><mml:math id="M5"><mml:mtable class="eqnarray" columnalign="left left left"><mml:mtr><mml:mtd><mml:mi>K</mml:mi><mml:mi>D</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mo class="qopname">exp</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x000B7;</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x000B7;</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:msup><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E5"><label>(5)</label><mml:math id="M6"><mml:mtable class="eqnarray" columnalign="left left left"><mml:mtr><mml:mtd><mml:mi>K</mml:mi><mml:mi>M</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mo class="qopname">exp</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>&#x000B7;</mml:mo></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>&#x000B7;</mml:mo></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:msup><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>here, &#x003B3;<sub><italic>d</italic></sub> and &#x003B3;<sub><italic>d</italic></sub> are parameters and can be calculated by following forums:</p>
<disp-formula id="E6"><label>(6)</label><mml:math id="M7"><mml:mtable class="eqnarray" columnalign="left left left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msubsup><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mstyle displaystyle="true"><mml:msubsup><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msubsup></mml:mstyle><mml:mo stretchy="false">|</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x000B7;</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:msup><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E7"><label>(7)</label><mml:math id="M8"><mml:mtable class="eqnarray" columnalign="left left left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msubsup><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mstyle displaystyle="true"><mml:msubsup><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msubsup></mml:mstyle><mml:mo stretchy="false">|</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>&#x000B7;</mml:mo></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:msup><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Based on the previous work (van Laarhoven et al., <xref ref-type="bibr" rid="B49">2011</xref>), we set the bandwidth parameters <inline-formula><mml:math id="M9"><mml:msubsup><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> and <inline-formula><mml:math id="M10"><mml:msubsup><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> to 1 in this paper.</p></sec>
<sec>
<title>2.6. Integrated Similarities for Disease and miRNA</title>
<p>We observed that there was no overlap between DAGs for many diseases, resulting in zero for many elements in disease semantic similarity <italic>SS</italic> and miRNA functional similarity <italic>FS</italic>. While all entries in the GIP similarities <italic>KD</italic> and <italic>KM</italic> are non-zero. Therefore, we integrated GIP similarities with <italic>SS</italic> and <italic>FS</italic> as follows:</p>
<disp-formula id="E8"><label>(8)</label><mml:math id="M11"><mml:mtable class="eqnarray" columnalign="left left left"><mml:mtr><mml:mtd><mml:mi>S</mml:mi><mml:mi>D</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>S</mml:mi><mml:mi>S</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:mi>K</mml:mi><mml:mi>D</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E9"><label>(9)</label><mml:math id="M12"><mml:mtable class="eqnarray" columnalign="left left left"><mml:mtr><mml:mtd><mml:mi>S</mml:mi><mml:mi>M</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>F</mml:mi><mml:mi>S</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:mi>K</mml:mi><mml:mi>M</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mfrac><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
</sec>
<sec>
<title>2.7. Weighted DeepWalk for Node Representation</title>
<p>DeepWalk (Perozzi et al., <xref ref-type="bibr" rid="B44">2014</xref>) is an algorithm that can learn the representations of vertices in graphs, inspired by the well-known unsupervised feature learning framework word2vec (Mikolov et al., <xref ref-type="bibr" rid="B39">2013</xref>). Given a graph <italic>G</italic> &#x0003D; (<italic>V, E</italic>), <italic>V</italic> denotes all vertices in the graph, and <italic>E</italic> represents the edge or transaction matrix in the graph. In order to generate the corpus in graphs, a vertex <italic>v</italic><sub><italic>i</italic></sub> &#x02208; <italic>V</italic> is uniformly selected as the root, and then a random sampling is used from the neighbors as the next hop. After generating samples, DeepWalk applies SkipGram model to learn the representation of each vertex <italic>v</italic><sub><italic>i</italic></sub>, denotes as <inline-formula><mml:math id="M13"><mml:mi>&#x003A6;</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>v</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>. Usually, the dimension size <italic>d</italic> is used as a hyper-parameter.</p>
<p>We applied a variant DeepWalk, called weighted DeepWalk, to learn disease and miRNA representations. The element in <italic>SD</italic> and <italic>SM</italic> can be used as the weight of an edge between two vertices of the disease-disease sub-graph or the miRNA-miRNA sub-graph. Here we used the element in <italic>SD</italic> and <italic>SM</italic> to denote the probability of the current vertex walking to other vertices in the sub-graph. Given a vertex <italic>v</italic><sub><italic>i</italic></sub>, we define a walk sequence <italic>W</italic><sub><italic>v</italic><sub><italic>i</italic></sub></sub> &#x0003D; {<italic>v</italic><sub><italic>i</italic></sub>, &#x022EF;&#x000A0;} to represent the vertices passed by a weighted random walk starting at the vertex <italic>v</italic><sub><italic>i</italic></sub>. We denote the transition possibility of next hop as follows:</p>
<disp-formula id="E10"><label>(10)</label><mml:math id="M14"><mml:mtable class="eqnarray" columnalign="left left left"><mml:mtr><mml:mtd><mml:mi>p</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>v</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi>v</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>v</italic><sub><italic>i</italic></sub>, <italic>v</italic><sub><italic>j</italic></sub> &#x02208; <italic>V</italic> are the vertices in the sub-graph, and <italic>v</italic><sub><italic>j</italic></sub> &#x02208; <italic>Ner</italic><sub><italic>v</italic><sub><italic>i</italic></sub></sub> is in the neighbors of vertex <italic>v</italic><sub><italic>i</italic></sub>. <italic>p</italic><sub><italic>ij</italic></sub> &#x02208; <italic>SD</italic> or <italic>p</italic><sub><italic>ij</italic></sub> &#x02208; <italic>SM</italic> is the weight of an edge from <italic>v</italic><sub><italic>i</italic></sub> to <italic>v</italic><sub><italic>j</italic></sub>.</p>
<p>After this processing, we obtained two samples {<italic>W</italic><sub><italic>d</italic><sub>1</sub></sub>, <italic>W</italic><sub><italic>d</italic><sub>2</sub></sub>, &#x022EF;&#x000A0;} and {<italic>W</italic><sub><italic>m</italic><sub>1</sub></sub>, <italic>W</italic><sub><italic>m</italic><sub>2</sub></sub>, &#x022EF;&#x000A0;} as the corpus, Each sample is a sentence, and each vertex is a word. For learning the representations of diseases and miRNAs, we used DeepWalk to construct two SkipGram models for learning node representations. There are hundreds of diseases and miRNAs in the respective corpus. Therefore, hierarchical softmax wass used for accelerating the training process. We set the dimension size as <italic>F</italic> in both disease and miRNA representations, and the learned representations of diseases and miRNAs can be denoted as <inline-formula><mml:math id="M15"><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>d</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>d</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>d</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub></mml:mrow><mml:mo>}</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>d</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>, <inline-formula><mml:math id="M16"><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>m</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>m</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>m</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub></mml:mrow><mml:mo>}</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mstyle mathvariant="bold-italic"><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mstyle><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>. <italic>F</italic> is the dimension size.</p></sec>
<sec>
<title>2.8. Graph Attention Networks for Node Feature Aggregation</title>
<p>In this section, we further constructed a GAT model in the miRNA-disease sub-graph with node features learned from the previous section, to refine dense vectors of miRNAs and diseases. In consequence, these node features contain global heterogeneous graph structure information. We denote the miRNA-disease sub-graph as <italic>G</italic><sub><italic>m</italic>&#x02212;<italic>d</italic></sub> &#x0003D; (<italic>V, E</italic>), and the number of nodes is <italic>N</italic><sub><italic>d</italic></sub> &#x0002B; <italic>N</italic><sub><italic>m</italic></sub>, node <italic>v</italic><sub><italic>i</italic></sub> &#x02208; <italic>V</italic> &#x0003D; {<italic>v</italic><sub>1</sub>, &#x02026;, <italic>v</italic><sub><italic>N</italic><sub><italic>d</italic></sub></sub>, <italic>v</italic><sub><italic>N</italic><sub><italic>d</italic></sub>&#x0002B;1</sub>, &#x02026;, <italic>v</italic><sub><italic>N</italic><sub><italic>d</italic></sub> &#x0002B; <italic>N</italic><sub><italic>m</italic></sub></sub>}, edges (<italic>v</italic><sub><italic>i</italic></sub>, <italic>v</italic><sub><italic>j</italic></sub>) &#x02208; <italic>E</italic> &#x0003D; {<italic>A</italic><sub><italic>ij</italic></sub> &#x0003D; 1 &#x02208; <italic>A</italic>}. The initial node features in the miRNA-disease sub-graph are defined as <italic>X</italic> &#x0003D; [<italic>X</italic><sub><italic>d</italic></sub>, <italic>X</italic><sub><italic>m</italic></sub>], and <inline-formula><mml:math id="M17"><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> is the feature of the <italic>i</italic>-th node in the graph. GAT uses multi-head attention mechanism to compute the contributions of neighbors of vertex <italic>v</italic><sub><italic>i</italic></sub> to itself. We denote the input of <italic>l</italic>-layer of our GAT model as <inline-formula><mml:math id="M18"><mml:msup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>h</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>h</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>h</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:mrow><mml:mo>}</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>h</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msup></mml:mrow></mml:msup></mml:math></inline-formula>, where <italic>N</italic> &#x0003D; <italic>N</italic><sub><italic>d</italic></sub>&#x0002B;<italic>N</italic><sub><italic>m</italic></sub> is the number of nodes, and <italic>F</italic><sup>(<italic>l</italic>)</sup> denotes the input dimension size of each node. Here, <italic>H</italic><sup>(0)</sup> &#x0003D; <italic>X</italic> is the input features of the GAT model. We define the output of <italic>l</italic>-layer as <inline-formula><mml:math id="M19"><mml:msup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>h</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>h</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>h</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:mrow><mml:mo>}</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>h</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msup></mml:mrow></mml:msup></mml:math></inline-formula>. For each node, we first compute the important score from the neighbor node <italic>j</italic> to node <italic>i</italic></p>
<disp-formula id="E11"><label>(11)</label><mml:math id="M20"><mml:mtable class="eqnarray" columnalign="left left left"><mml:mtr><mml:mtd><mml:msubsup><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mi>a</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>W</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>h</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>W</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>h</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <bold>W</bold><sup>(<italic>l</italic>)</sup> is a parameter matrix <italic>W</italic><sup>(<italic>l</italic>)</sup> &#x02208; &#x0211D;<sup><italic>F</italic>(<italic>l</italic>&#x0002B;1)</sup> &#x000D7; <italic>F</italic><sup>(<italic>l</italic>)</sup>, and <italic>a</italic> is a one layer feed-forward neural network. Then, we normalize the neighborhood important scores of node <italic>i</italic> by the softmax function as follows:</p>
<disp-formula id="E12"><label>(12)</label><mml:math id="M21"><mml:mtable class="eqnarray" columnalign="left left left"><mml:mtr><mml:mtd><mml:msubsup><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>softmax</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mo class="qopname">exp</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mstyle displaystyle="true"><mml:msub><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">N</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo class="qopname">exp</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msub></mml:mstyle></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <inline-formula><mml:math id="M22"><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">N</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the neighborhood node set of node <italic>i</italic>, including node <italic>i</italic> itself. Finally, we can use these scores to calculate the new features of node <italic>i</italic> by aggregating information from its neighbors:</p>
<disp-formula id="E13"><label>(13)</label><mml:math id="M23"><mml:mtable class="eqnarray" columnalign="left left left"><mml:mtr><mml:mtd><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>h</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mi>&#x003C3;</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mstyle displaystyle="true"><mml:munder class="msub"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">N</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munder></mml:mstyle><mml:msubsup><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>W</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>h</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where &#x003C3; denotes a non-linear activation function, such as LeakyReLU. The power of GAT is benefit from the multi-head attention mechanism, we apply <italic>K</italic> independent attention of node <italic>i</italic> on its neighborhood, and the output of node features is as follows:</p>
<disp-formula id="E14"><label>(14)</label><mml:math id="M24"><mml:mtable class="eqnarray" columnalign="left left left"><mml:mtr><mml:mtd><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>h</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mi>&#x003C3;</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:munderover><mml:mrow><mml:mo>&#x0039B;</mml:mo></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>K</mml:mi></mml:mrow></mml:munderover><mml:mstyle displaystyle="true"><mml:munder class="msub"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">N</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munder></mml:mstyle><mml:msubsup><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>W</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>h</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where &#x0039B; denotes concatenation or averaging. In our paper, all GAT layers are used concatenation, except averaging for last layer. A GAT carries out the first-order neighborhood information aggregation, and the graph convolution on multi-layers realizes multi-order neighborhood aggregation. As the number of training iterations increases, the node representations can obtain the structure information of the miRNA-disease sub-graph. Together with the node features obtained from miRNA-miRNA and disease-disease sub-graphs, the final node features of <inline-formula><mml:math id="M25"><mml:msub><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>m</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> and <inline-formula><mml:math id="M26"><mml:msub><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>d</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> contain rich structure information of the global heterogeneous graph.</p></sec>
<sec>
<title>2.9. Potential miRNA-Disease Associations Prediction</title>
<p>At last, we designed a scoring function that can calculate the correlation score between a pair of miRNA and disease. We first integrated node features with raw features in <italic>SD</italic> and <italic>SM</italic>. In order to transform the raw features in <italic>SD</italic> and <italic>SM</italic> to the same dimensions as the node features, we introduced projection parameters <inline-formula><mml:math id="M27"><mml:msub><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mo>&#x000D7;</mml:mo><mml:mi>F</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> and <inline-formula><mml:math id="M28"><mml:msub><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>&#x000D7;</mml:mo><mml:mi>F</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>. Then, we can define the correlation score as follows:</p>
<disp-formula id="E15"><label>(15)</label><mml:math id="M29"><mml:mtable class="eqnarray" columnalign="left left left"><mml:mtr><mml:mtd><mml:mi>f</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mi>F</mml:mi><mml:mi>C</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mrow><mml:mo stretchy="false">[</mml:mo><mml:mrow><mml:mi>g</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>d</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>S</mml:mi><mml:mi>D</mml:mi><mml:mo>&#x000D7;</mml:mo><mml:msub><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>&#x000B7;</mml:mo></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mi>g</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>m</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>S</mml:mi><mml:mi>M</mml:mi><mml:mo>&#x000D7;</mml:mo><mml:msub><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>&#x000B7;</mml:mo></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mo stretchy="false">]</mml:mo></mml:mrow></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>FC</italic> is a fully-connected neural networks, and the details will be discussed in the section 3.3. <italic>g</italic>(&#x000B7;) represents an accumulation function, such as <italic>concat</italic>(&#x000B7;), i.e., a concatenation of node features and raw features, or <italic>sum</italic>(&#x000B7;), i.e., summation of node features and raw features. Our model is trained by minimizing the cross-entropy loss and L2 regularization.</p>
<disp-formula id="E16"><label>(16)</label><mml:math id="M30"><mml:mtable class="eqnarray" columnalign="left left left"><mml:mtr><mml:mtd><mml:mtable style="text-align:axis;" equalrows="false" columnlines="none none" equalcolumns="false" class="array"><mml:mtr><mml:mtd columnalign="left"><mml:mrow><mml:mi mathvariant="-tex-caligraphic">L</mml:mi></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi>Y</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x00398;</mml:mo></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:munder class="msub"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x02208;</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">G</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mo>&#x0002B;</mml:mo></mml:mrow></mml:msup><mml:mo>&#x0222A;</mml:mo><mml:msup><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">G</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mo>-</mml:mo></mml:mrow></mml:msup></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:mrow></mml:munder></mml:mstyle><mml:mi>y</mml:mi><mml:mo class="qopname">log</mml:mo><mml:mi>f</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd columnalign="left"><mml:mtext>&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;</mml:mtext><mml:mo>&#x0002B;</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>1</mml:mn><mml:mo>-</mml:mo><mml:mi>y</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo class="qopname">log</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>1</mml:mn><mml:mo>-</mml:mo><mml:mi>f</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:mo>&#x003BB;</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:mo>&#x00398;</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:msup><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mtd></mml:mtr></mml:mtable></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where &#x00398; denotes parameters of our GAT model, <inline-formula><mml:math id="M31"><mml:msup><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">G</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mo>&#x0002B;</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula> is the set of known associations between miRNA and disease, and <inline-formula><mml:math id="M32"><mml:msup><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">G</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mo>-</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula> is the same number set of unknown associations between miRNA and disease from negative sampling, which we will discuss the impact in the later section. <italic>X</italic> denotes the vector form of the miRNA-disease pairs in <inline-formula><mml:math id="M33"><mml:msup><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">G</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mo>&#x0002B;</mml:mo></mml:mrow></mml:msup><mml:mo>&#x0222A;</mml:mo><mml:msup><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">G</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mo>-</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula> and <italic>y</italic> &#x02208; <italic>Y</italic> represents the corresponding label.</p></sec></sec>
<sec sec-type="results" id="s3">
<title>3. Results</title>
<sec>
<title>3.1. Datasets and Experimental Details</title>
<p>We used benchmark datasets of known miRNA-disease associations, including HMDD, dbDEMC v2.0 (Yang et al., <xref ref-type="bibr" rid="B55">2017</xref>), and miR2Disease (Jiang et al., <xref ref-type="bibr" rid="B28">2008</xref>). Mesh dataset is also used for computing disease semantic similarity. To evaluate the prediction performance of our method, we compared the experimental results with other stat-of-the-art methods. Moreover, we also performed case studies on three common human diseases such as breast neoplasms, lung neoplasms and kidney neoplasms for further evaluation. HMDD v2.0 is used for training, and dbDEMC v2.0, HMDD v3.2, and miR2Disease are used for validation. We carried out experimental analysis using five-fold cross validation. Specially, all known associations in HMDD v2.0 are taken as positive samples. We first selected equal number of negative samples from a uniform distribution of unknown associations between miRNAs and diseases in HMDD v2.0 dataset. Then, we randomly shuffled all the samples and divided into five equal parts, four of which are in turn used for training and the rest for validation. In addition, we applied several metrics for evaluation, such as Accuracy, Precision, Recall, F1-score. Moreover, we plotted the receiver operating characteristic curve (ROC) and precision-recall curve (PR). Furthermore, the area under the ROC curve (AUC), and the area under the P-R curve (AUPR) are used for analysis the prediction performance quantitatively.</p>
<p>The code of our proposed method is implemented based on the machine learning library PyTorch v1.6.0 (Paszke et al., <xref ref-type="bibr" rid="B42">2019</xref>). We trained the weighted DeepWalk models using GenSim library (Rehurek and Sojka, <xref ref-type="bibr" rid="B45">2010</xref>). For GAT, we used PyTorch Geometric deep learning library (Fey and Lenssen, <xref ref-type="bibr" rid="B19">2019</xref>). Our experiments are run on the Ubuntu 16.04 operating system, with two Intel Xeon CPUs (2.30 GHz, 16 cores), and two Tesla V100 GPUs. The Adam optimizer is used for training (Kingma and Ba, <xref ref-type="bibr" rid="B29">2017</xref>), with a learning rate of 0.001 and the weight decay is 0.00005.</p></sec>
<sec>
<title>3.2. Predictive Performance Analysis</title>
<p>In our experiments, we performed ablation study to analyze the effect of architectures and hyper-parameters, then selected the appropriate parameters. Details of the choices will be discussed in the section 3.3. We chose the model with window size of 5, walk length of 20 in weighted Deepwalk, 2 layers in GAT, and 3-layer FC as predictor. The results of our best model under five-fold CV based on HMDD v2.0 dataset are shown in <xref ref-type="table" rid="T1">Table 1</xref> and <xref ref-type="fig" rid="F2">Figure 2</xref>. HGATMDA achieves the average prediction Accuracy of 87.02%, Precision of 94.07%, Recall of 90.04%, and F1-score of 87.39%. The threshold used in theses metrics is 0.5. The mean values of AUC and AUPR are 94.54 &#x000B1; 0.34 and 94.05 &#x000B1; 0.18%, respectively. In summary, the results showed that HGATMDA can significantly promote the performance of predicting miRNA-disease associations.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Results of five-fold CV based on HMDD v2.0.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Test fold</bold></th>
<th valign="top" align="center" colspan="6" style="border-bottom: thin solid #000000;"><bold>Metrics</bold></th>
</tr>
<tr>
<th/>
<th valign="top" align="center"><bold>Accuracy (%)</bold></th>
<th valign="top" align="center"><bold>Precision (%)</bold></th>
<th valign="top" align="center"><bold>Recall (%)</bold></th>
<th valign="top" align="center"><bold>F1-score (%)</bold></th>
<th valign="top" align="center"><bold>AUC (%)</bold></th>
<th valign="top" align="center"><bold>AUPR (%)</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">87.15</td>
<td valign="top" align="center">94.08</td>
<td valign="top" align="center">89.87</td>
<td valign="top" align="center">87.49</td>
<td valign="top" align="center">94.57</td>
<td valign="top" align="center">94.07</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">87.57</td>
<td valign="top" align="center">94.08</td>
<td valign="top" align="center">89.41</td>
<td valign="top" align="center">87.79</td>
<td valign="top" align="center">94.50</td>
<td valign="top" align="center">94.07</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="center">86.69</td>
<td valign="top" align="center">93.75</td>
<td valign="top" align="center">86.65</td>
<td valign="top" align="center">86.69</td>
<td valign="top" align="center">94.09</td>
<td valign="top" align="center">93.74</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="center">87.29</td>
<td valign="top" align="center">94.21</td>
<td valign="top" align="center">95.30</td>
<td valign="top" align="center">88.24</td>
<td valign="top" align="center">95.15</td>
<td valign="top" align="center">94.20</td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="center">86.37</td>
<td valign="top" align="center">94.25</td>
<td valign="top" align="center">88.95</td>
<td valign="top" align="center">86.71</td>
<td valign="top" align="center">94.45</td>
<td valign="top" align="center">94.24</td>
</tr> <tr style="border-top: thin solid #000000;">
<td valign="top" align="left">Average</td>
<td valign="top" align="center">87.02</td>
<td valign="top" align="center">94.07</td>
<td valign="top" align="center">90.04</td>
<td valign="top" align="center">87.39</td>
<td valign="top" align="center">94.55</td>
<td valign="top" align="center">94.07</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p><bold>(A)</bold> ROC curves and <bold>(B)</bold> P-R curves of HGATMDA under five-fold CV based on HMDD v2.0.</p></caption>
<graphic xlink:href="fgene-12-727744-g0002.tif"/>
</fig></sec>
<sec>
<title>3.3. Ablation Experiments</title>
<p>In order to further demonstrate the prediction performance of our model, we carried out extensive experiments under different architectures and hyper-parameters, analyzed how the design of sub-model and the choice of hyper-parameters have different effect on the final performance of the proposed model. Five-fold CV on HMDD v2.0 dataset was used to evaluate the model sensitivity of architectures and hyper-parameters in the experiments. The following discussion followed the structure in <xref ref-type="table" rid="T2">Table 2</xref> and <xref ref-type="fig" rid="F3">Figures 3</xref>, <xref ref-type="fig" rid="F4">4</xref>. Dot product Predictor denotes the vector dot product of miRNA <italic>m</italic><sub><italic>i</italic></sub> and disease <italic>d</italic><sub><italic>j</italic></sub> as the predictor. FC Predictor represents a fully-connected neural networks, as shown in the Equation (15). GAT (untrained) uses randomly weights of neighbors without training, and Raw features denotes integrated similarities of miRNA and disease.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Ablation experiments of different architectures.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Test model</bold></th>
<th valign="top" align="center"><bold>AUC (%)</bold></th>
<th valign="top" align="center"><bold>AUPR (%)</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Weighted DeepWalk &#x0002B; Dot product Predictor</td>
<td valign="top" align="center">73.48</td>
<td valign="top" align="center">77.97</td>
</tr>
<tr>
<td valign="top" align="left">Weighted DeepWalk &#x0002B; GAT (untrained) &#x0002B; Dot product Predictor</td>
<td valign="top" align="center">89.78</td>
<td valign="top" align="center">88.59</td>
</tr>
<tr>
<td valign="top" align="left">Weighted DeepWalk &#x0002B; GAT &#x0002B; Dot product Predictor</td>
<td valign="top" align="center">92.17</td>
<td valign="top" align="center">91.52</td>
</tr> <tr style="border-top: thin solid #000000;">
<td valign="top" align="left">Raw features &#x0002B; GAT &#x0002B; Dot product Predictor</td>
<td valign="top" align="center">72.68</td>
<td valign="top" align="center">69.27</td>
</tr>
<tr>
<td valign="top" align="left">Weight DeepWalk &#x0002B; Raw features &#x0002B; GAT &#x0002B; Dot product Predictor</td>
<td valign="top" align="center">93.52</td>
<td valign="top" align="center">93.09</td>
</tr>
<tr>
<td valign="top" align="left">Weight DeepWalk &#x0002B; Raw features &#x0002B; GAT &#x0002B; FC Predictor</td>
<td valign="top" align="center"><bold>94.03</bold></td>
<td valign="top" align="center"><bold>93.51</bold></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>The variants of the model are listed in the first column. We test each model variant for 10 times under five-fold CV on HMDD v2.0 dataset and report averages. Bold values indicates highlight our results</italic>.</p>
</table-wrap-foot>
</table-wrap>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>Parameters analysis of DeepWalk and GAT on HMDD v2.0 dataset. <bold>(A)</bold> Window size Weight DeepWalk. <bold>(B)</bold> Feature size Weight DeepWalk. <bold>(C)</bold> &#x00023; heads of GAT. <bold>(D)</bold> &#x00023; layers of GAT.</p></caption>
<graphic xlink:href="fgene-12-727744-g0003.tif"/>
</fig>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>Parameters analysis of FC layers, negative samples and regularization on HMDD v2.0 dataset. <bold>(A)</bold> &#x00023; FC layers. <bold>(B)</bold> Negative sampling ratio. <bold>(C)</bold> Penalty factor &#x003BB;.</p></caption>
<graphic xlink:href="fgene-12-727744-g0004.tif"/>
</fig>
<sec>
<title>3.3.1. Feature Embedding</title>
<p>Recall that our model adopted weighted DeepWalk to learn representations of miRNAs and diseases from miRNA-miRNA sub-graph and disease-disease sub-graph. We experimented with different feature size of {32,64,128,256} and window size {3,5,7,9} for comparison. Large training windows tend to learn more information from more nodes in the walk, while small training windows do the opposite. HGATMDA obtains the best performance at window size of 5, and the performance slightly decreases with the increase or decrease of the window size (<xref ref-type="fig" rid="F3">Figure 3A</xref>).</p>
<p>Feature size not only affects the representations of miRNAs and diseases in the SkipGram model, but also affects the final prediction performance together with other parameters. We evaluated the effect of feature size using five-fold CV, and mainly focused on values of AUC and AUPR. As shown in <xref ref-type="fig" rid="F3">Figure 3B</xref>, HGATMDA performs best performance when feature size is set to 64 dimensions.</p></sec>
<sec>
<title>3.3.2. Graph Attention Networks</title>
<p>Representations of miRNAs and diseases are further extract by GAT from miRNA-disease heterogeneous sub-graph. The power of GAT is a multi-head mechanism, we conducted experiments with different number of attention heads. We chose a 2-layer of GAT model, and each layer used same number of heads for comparison. The results are shown in <xref ref-type="fig" rid="F3">Figure 3C</xref>. We found that more heads slightly increased the AUC and AUPR values.</p>
<p>The number of layer is another factor that affects the performance of HGATMDA. We investigated the depth of GAT model used in HGATMDA. Similar to the phenomenon observed in the classification experiments (Kipf and Welling, <xref ref-type="bibr" rid="B30">2017</xref>), we found that the performance of HGATMDA decreased when the number of layer was set to greater than 2. Therefore, we chose 2-layer of GAT with 16 heads as our default.</p>
<p>In the <xref ref-type="fig" rid="F3">Figure 3D</xref>, we can see that using only an untrained GAT architecture greatly improves the prediction performance, achieves 89.78% of AUC and 88.59% of AUPR, more than 10% higher than without GAT.</p></sec>
<sec>
<title>3.3.3. FC Layers</title>
<p>The predictor used in HGATMDA is a fully-connected neural networks. Given a pair miRNA and disease, the concatenation of representations of the miRNA and disease is fed into the FC layers, to calculate the correlation score between them. We investigated the influence of number of layers in the predictor. The results are shown in <xref ref-type="fig" rid="F4">Figure 4A</xref>, we can see that HGATMDA achieves the better performance when the number of layers is greater than two. Too deep neural network can not improve the predictive performance of HGATMDA. Considering the computation cost, we chose a three-layer neural networks as the predictor in HGATMDA.</p></sec>
<sec>
<title>3.3.4. Negative Samples</title>
<p>There is lack of prior experimental evidence of non-associations between miRNAs and diseases in the HMDD dataset, we implemented a sampling strategy to randomly generate negative samples from unknown associations in the dataset of HMDD v2.0. As sampling analysis used in the previous study (Long et al., <xref ref-type="bibr" rid="B37">2020</xref>), we experimentally used different sampling ratios to evaluate the effect. Recall that the number of known associations is 5,430, then the number of negative samples is {1.0,2.0,3.0,4.0,5.0,6.0} times the number of positive samples. As shown in <xref ref-type="fig" rid="F4">Figure 4B</xref>, AUC value could not be improved by sampling more negative samples, and the AUPR value is decreased rapidly. More negative samples may result in an imbalance between positive and negative samples, and P-R curve is more sensitive to the difference of category distribution. Therefore, we set the sampling ratio to 1.0 in HGATMDA.</p></sec>
<sec>
<title>3.3.5. Regularization</title>
<p>Recall that in Equation (16), we introduced the L2 regularization with a penalty factor &#x003BB; to improve the generalization ability of the model. We conducted experiments with different values of &#x003BB; to test the effect on prediction performance of the model under five-fold CV. The area under ROC curve and the area under P-R curve are selected as metrics for evaluation. The values we used in this experiment are {0.5, 0.05, 0.005, 0.0005, 0.00005, 0.000005, 0.0000005}. As shown in <xref ref-type="fig" rid="F4">Figure 4C</xref>, We can see that &#x003BB; is set greater then 0.005, the AUC value declines significantly. Especially, AUC value and AUPR value are both drop below 90% if &#x003BB; is set as 0.5. When &#x003BB; is set to 0.0005 or 0.00005, our model achieves better performance. Our model obtains best AUPR at 0.0005, so we finally choose it as our default value.</p></sec></sec>
<sec>
<title>3.4. Comparison With Other Methods</title>
<p>In this section, we compared the prediction performance of our model with other state-of-the-art methods, including PBMDA (You et al., <xref ref-type="bibr" rid="B56">2017</xref>), GRNMF (Xiao et al., <xref ref-type="bibr" rid="B51">2018</xref>), MDHGI (Chen et al., <xref ref-type="bibr" rid="B15">2018e</xref>), BNPMDA (Chen et al., <xref ref-type="bibr" rid="B13">2018d</xref>), MCLPMDA (Yu et al., <xref ref-type="bibr" rid="B57">2019</xref>), NIMCGCN (Li et al., <xref ref-type="bibr" rid="B32">2020a</xref>), and GAEMDA (Li et al., <xref ref-type="bibr" rid="B35">2020b</xref>). We noted that different evaluation metrics and datasets are used in these methods. For fair comparison, AUC values under five-fold CV are selected based on HMDD v2.0 in all these studies for comparison. It is worth noticing that the AUCs reported in these papers are the best values. Therefore, we ran the five-fold CV for 100 times and picked up the best and average AUCs for comparison. The results are shown in <xref ref-type="table" rid="T3">Table 3</xref>. We can see that our model achieves the best AUC performance among these methods, with the best AUC and average AUC of 94.52 and 93.88%, respectively. In particular, NIMCGCN and GAEMDA are GCN-based methods, and AUCs of our best model are 0.96 and 1.61% higher than these two methods, respectively. This further shows that HGATMDA obtains better performance than other methods.</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>The AUCs comparison with other state-of-the-art methods under five-fold CV based on HMDD v2.0.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Models</bold></th>
<th valign="top" align="center"><bold>AUC (%)</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">PBMDA (You et al., <xref ref-type="bibr" rid="B56">2017</xref>)</td>
<td valign="top" align="center">91.72</td>
</tr>
<tr>
<td valign="top" align="left">GRNMF (Xiao et al., <xref ref-type="bibr" rid="B51">2018</xref>)</td>
<td valign="top" align="center">86.90</td>
</tr>
<tr>
<td valign="top" align="left">MDHGI (Chen et al., <xref ref-type="bibr" rid="B15">2018e</xref>)</td>
<td valign="top" align="center">87.94</td>
</tr>
<tr>
<td valign="top" align="left">BNPMDA (Chen et al., <xref ref-type="bibr" rid="B13">2018d</xref>)</td>
<td valign="top" align="center">89.80</td>
</tr>
<tr>
<td valign="top" align="left">MCLPMDA (Yu et al., <xref ref-type="bibr" rid="B57">2019</xref>)</td>
<td valign="top" align="center">93.25</td>
</tr>
<tr>
<td valign="top" align="left">NIMCGCN (Li et al., <xref ref-type="bibr" rid="B32">2020a</xref>)</td>
<td valign="top" align="center">92.91</td>
</tr>
<tr>
<td valign="top" align="left">GAEMDA (Li et al., <xref ref-type="bibr" rid="B35">2020b</xref>)</td>
<td valign="top" align="center">93.56</td>
</tr> <tr style="border-top: thin solid #000000;">
<td valign="top" align="left">Our-best</td>
<td valign="top" align="center"><bold>94.52</bold></td>
</tr>
<tr>
<td valign="top" align="left">Our-average</td>
<td valign="top" align="center"><bold>93.88</bold></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Bold values indicates highlight our results</italic>.</p>
</table-wrap-foot>
</table-wrap></sec>
<sec>
<title>3.5. Case Study</title>
<p>We performed case studies on three common human neoplasms, including <italic>Breast Neoplasms, Lung Neoplasms, Kidney Neoplasms</italic>, to further evaluate the prediction performance. In these experiments, HMDD v2.0 dataset is used to train our model, and the validation datasets are HMDD v3.2 (Huang et al., <xref ref-type="bibr" rid="B24">2019a</xref>), dbDEMC v2.0 (Yang et al., <xref ref-type="bibr" rid="B55">2017</xref>), and miR2Disease (Jiang et al., <xref ref-type="bibr" rid="B28">2008</xref>), which are used to verify the candidates. For each specific disease, all known associations in HMDD v2.0 are taken as positive samples, while negative samples are chosen from all unknown miRNA-disease associations except the particular disease related. In the prediction phase, all candidate miRNAs are finally ranked by their correlation scores, which are the last layer of our model with sigmoid activation function, to present how much a miRNA associated with the specific disease. In addition, Top 50 candidates are listed and checked whether they are verified in the validation datasets.</p>
<p>We conducted the first case study for <italic>Breast Neoplasms</italic>. It is reported that Breast cancer is the leading cancer among women worldwide, and can arise for a wide number of reasons. It usually happens when cells in breast tissue grow and divide out of control. As shown in <xref ref-type="table" rid="T4">Table 4</xref>, 50 out of the top 50 candidates are confirmed in HMDD v3.2, dbDEMC v2.0, or miR2Disease. The second case study was implemented for <italic>Lung Neoplasms</italic>. This is a leading cause of cancer deaths among men and women both in United States and the world (Siegel et al., <xref ref-type="bibr" rid="B48">2020</xref>). We applied our model to predict the most relevant miRNAs with <italic>Lung Neoplasms</italic>. The results are shown in the <xref ref-type="table" rid="T5">Table 5</xref>, 50 of top 50 selected miRNAs have been verified in the validation datasets. The last case study we selected was <italic>Kidney Neoplasms</italic>. It is another common disease, and the incidence still continues to increase in the United States (Siegel et al., <xref ref-type="bibr" rid="B48">2020</xref>). We note that there are only 7 known associations related with <italic>Kidney Neoplasms</italic> in HMDD v2.0 dataset. We listed top 50 related miRNAs in <xref ref-type="table" rid="T6">Table 6</xref>, and we can see that 50 out of top 50 candidates are confirmed either in HMDD v3.2, dbDEMC v2.0 or miR2Disease. In particular, 6 of top 10 related miRNAs are verified in at least two datasets. Therefore, our model can serve as a powerful and effective tool to infer the potential related miRNAs for specific diseases.</p>
<table-wrap position="float" id="T4">
<label>Table 4</label>
<caption><p>Top 50 predicted miRNAs related to Breast Neoplasms based on HMDD v2.0.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Rank</bold></th>
<th valign="top" align="left"><bold>miRNA</bold></th>
<th valign="top" align="left"><bold>Evidence</bold></th>
<th valign="top" align="center"><bold>Rank</bold></th>
<th valign="top" align="left"><bold>miRNA</bold></th>
<th valign="top" align="left"><bold>Evidence</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="left">hsa-mir-17</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">26</td>
<td valign="top" align="left">hsa-let-7c</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="left">hsa-mir-92a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">27</td>
<td valign="top" align="left">hsa-mir-125b</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="left">hsa-mir-1</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">28</td>
<td valign="top" align="left">hsa-mir-30a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="left">hsa-mir-34a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">29</td>
<td valign="top" align="left">hsa-mir-143</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="left">hsa-mir-29a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">30</td>
<td valign="top" align="left">hsa-mir-122</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">6</td>
<td valign="top" align="left">hsa-mir-29c</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">31</td>
<td valign="top" align="left">hsa-mir-181a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">7</td>
<td valign="top" align="left">hsa-mir-15a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">32</td>
<td valign="top" align="left">hsa-mir-196a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">8</td>
<td valign="top" align="left">hsa-mir-18a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">33</td>
<td valign="top" align="left">hsa-mir-214</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">9</td>
<td valign="top" align="left">hsa-mir-19b</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">34</td>
<td valign="top" align="left">hsa-mir-7</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">10</td>
<td valign="top" align="left">hsa-mir-155</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">35</td>
<td valign="top" align="left">hsa-mir-29b</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">11</td>
<td valign="top" align="left">hsa-mir-145</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">36</td>
<td valign="top" align="left">hsa-mir-222</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">12</td>
<td valign="top" align="left">hsa-mir-16</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">37</td>
<td valign="top" align="left">hsa-mir-200c</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">13</td>
<td valign="top" align="left">hsa-let-7a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">38</td>
<td valign="top" align="left">hsa-mir-9</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">14</td>
<td valign="top" align="left">hsa-mir-133b</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">39</td>
<td valign="top" align="left">hsa-mir-100</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">15</td>
<td valign="top" align="left">hsa-mir-20a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">40</td>
<td valign="top" align="left">hsa-mir-210</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">16</td>
<td valign="top" align="left">hsa-mir-133a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">41</td>
<td valign="top" align="left">hsa-mir-200b</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">17</td>
<td valign="top" align="left">hsa-mir-221</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">42</td>
<td valign="top" align="left">hsa-mir-223</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">18</td>
<td valign="top" align="left">hsa-mir-146a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">43</td>
<td valign="top" align="left">hsa-mir-195</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">19</td>
<td valign="top" align="left">hsa-mir-126</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">44</td>
<td valign="top" align="left">hsa-let-7b</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">20</td>
<td valign="top" align="left">hsa-mir-142</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">45</td>
<td valign="top" align="left">hsa-mir-34c</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">21</td>
<td valign="top" align="left">hsa-mir-21</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">46</td>
<td valign="top" align="left">hsa-let-7f</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">22</td>
<td valign="top" align="left">hsa-mir-19a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">47</td>
<td valign="top" align="left">hsa-mir-200a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">23</td>
<td valign="top" align="left">hsa-mir-146b</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">48</td>
<td valign="top" align="left">hsa-let-7e</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">24</td>
<td valign="top" align="left">hsa-mir-199a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">49</td>
<td valign="top" align="left">hsa-let-7d</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">25</td>
<td valign="top" align="left">hsa-mir-150</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">50</td>
<td valign="top" align="left">hsa-mir-34b</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>I, II, II denote HMDD v3.2, dbDEMC v2.0, and miR2Disease</italic>.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T5">
<label>Table 5</label>
<caption><p>Top 50 predicted miRNAs related to Lung Neoplasms based on HMDD v2.0.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Rank</bold></th>
<th valign="top" align="left"><bold>miRNA</bold></th>
<th valign="top" align="left"><bold>Evidence</bold></th>
<th valign="top" align="center"><bold>Rank</bold></th>
<th valign="top" align="left"><bold>miRNA</bold></th>
<th valign="top" align="left"><bold>Evidence</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="left">hsa-mir-20a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">26</td>
<td valign="top" align="left">hsa-mir-9</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="left">hsa-mir-125b</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">27</td>
<td valign="top" align="left">hsa-mir-203</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="left">hsa-mir-146b</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">28</td>
<td valign="top" align="left">hsa-mir-29c</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="left">hsa-mir-17</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">29</td>
<td valign="top" align="left">hsa-mir-155</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="left">hsa-mir-181a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">30</td>
<td valign="top" align="left">hsa-mir-34c</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">6</td>
<td valign="top" align="left">hsa-mir-145</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">31</td>
<td valign="top" align="left">hsa-mir-142</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">7</td>
<td valign="top" align="left">hsa-mir-34a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">32</td>
<td valign="top" align="left">hsa-mir-143</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">8</td>
<td valign="top" align="left">hsa-mir-29b</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">33</td>
<td valign="top" align="left">hsa-mir-30a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">9</td>
<td valign="top" align="left">hsa-mir-146a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">34</td>
<td valign="top" align="left">hsa-mir-221</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">10</td>
<td valign="top" align="left">hsa-mir-126</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">35</td>
<td valign="top" align="left">hsa-mir-133b</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">11</td>
<td valign="top" align="left">hsa-let-7a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">36</td>
<td valign="top" align="left">hsa-mir-31</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">12</td>
<td valign="top" align="left">hsa-let-7d</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">37</td>
<td valign="top" align="left">hsa-mir-182</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">13</td>
<td valign="top" align="left">hsa-mir-92a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">38</td>
<td valign="top" align="left">hsa-mir-34b</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">14</td>
<td valign="top" align="left">hsa-let-7i</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">39</td>
<td valign="top" align="left">hsa-let-7c</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">15</td>
<td valign="top" align="left">hsa-mir-19a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">40</td>
<td valign="top" align="left">hsa-mir-122</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">16</td>
<td valign="top" align="left">hsa-mir-19b</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">41</td>
<td valign="top" align="left">hsa-mir-222</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">17</td>
<td valign="top" align="left">hsa-let-7b</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">42</td>
<td valign="top" align="left">hsa-mir-200b</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">18</td>
<td valign="top" align="left">hsa-let-7g</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">43</td>
<td valign="top" align="left">hsa-mir-18a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">19</td>
<td valign="top" align="left">hsa-mir-200c</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">44</td>
<td valign="top" align="left">hsa-mir-199a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">20</td>
<td valign="top" align="left">hsa-mir-21</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">45</td>
<td valign="top" align="left">hsa-mir-214</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">21</td>
<td valign="top" align="left">hsa-mir-210</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">46</td>
<td valign="top" align="left">hsa-mir-106b</td>
<td valign="top" align="left"><italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">22</td>
<td valign="top" align="left">hsa-let-7f</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">47</td>
<td valign="top" align="left">hsa-mir-100</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">23</td>
<td valign="top" align="left">hsa-mir-200a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">48</td>
<td valign="top" align="left">hsa-mir-148a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">24</td>
<td valign="top" align="left">hsa-let-7e</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">49</td>
<td valign="top" align="left">hsa-mir-195</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">25</td>
<td valign="top" align="left">hsa-mir-16</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">50</td>
<td valign="top" align="left">hsa-mir-101</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>I, II, II denote HMDD v3.2, dbDEMC v2.0, and miR2Disease</italic>.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T6">
<label>Table 6</label>
<caption><p>Top 50 predicted miRNAs related to Kidney Neoplasms based on HMDD v2.0.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Rank</bold></th>
<th valign="top" align="left"><bold>miRNA</bold></th>
<th valign="top" align="left"><bold>Evidence</bold></th>
<th valign="top" align="center"><bold>Rank</bold></th>
<th valign="top" align="left"><bold>miRNA</bold></th>
<th valign="top" align="left"><bold>Evidence</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="left">hsa-mir-21</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">26</td>
<td valign="top" align="left">hsa-mir-31</td>
<td valign="top" align="left"><italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="left">hsa-mir-146a</td>
<td valign="top" align="left"><italic>II</italic></td>
<td valign="top" align="center">27</td>
<td valign="top" align="left">hsa-mir-150</td>
<td valign="top" align="left"><italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="left">hsa-mir-155</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">28</td>
<td valign="top" align="left">hsa-let-7a</td>
<td valign="top" align="left"><italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="left">hsa-mir-34a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
<td valign="top" align="center">29</td>
<td valign="top" align="left">hsa-mir-143</td>
<td valign="top" align="left"><italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="left">hsa-mir-125b</td>
<td valign="top" align="left"><italic>II</italic></td>
<td valign="top" align="center">30</td>
<td valign="top" align="left">hsa-mir-142</td>
<td valign="top" align="left"><italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">6</td>
<td valign="top" align="left">hsa-mir-221</td>
<td valign="top" align="left"><italic>II</italic></td>
<td valign="top" align="center">31</td>
<td valign="top" align="left">hsa-mir-200b</td>
<td valign="top" align="left"><italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">7</td>
<td valign="top" align="left">hsa-mir-29a</td>
<td valign="top" align="left"><italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">32</td>
<td valign="top" align="left">hsa-mir-181a</td>
<td valign="top" align="left"><italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">8</td>
<td valign="top" align="left">hsa-mir-20a</td>
<td valign="top" align="left"><italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">33</td>
<td valign="top" align="left">hsa-mir-23a</td>
<td valign="top" align="left"><italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">9</td>
<td valign="top" align="left">hsa-mir-15a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">34</td>
<td valign="top" align="left">hsa-mir-15b</td>
<td valign="top" align="left"><italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">10</td>
<td valign="top" align="left">hsa-mir-16</td>
<td valign="top" align="left"><italic>II</italic></td>
<td valign="top" align="center">35</td>
<td valign="top" align="left">hsa-mir-106b</td>
<td valign="top" align="left"><italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">11</td>
<td valign="top" align="left">hsa-mir-17</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">36</td>
<td valign="top" align="left">hsa-mir-196a</td>
<td valign="top" align="left"><italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">12</td>
<td valign="top" align="left">hsa-mir-29b</td>
<td valign="top" align="left"><italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">37</td>
<td valign="top" align="left">hsa-mir-133b</td>
<td valign="top" align="left"><italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">13</td>
<td valign="top" align="left">hsa-mir-92a</td>
<td valign="top" align="left"><italic>II</italic></td>
<td valign="top" align="center">38</td>
<td valign="top" align="left">hsa-mir-210</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">14</td>
<td valign="top" align="left">hsa-mir-29c</td>
<td valign="top" align="left"><italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">39</td>
<td valign="top" align="left">hsa-mir-181b</td>
<td valign="top" align="left"><italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">15</td>
<td valign="top" align="left">hsa-mir-145</td>
<td valign="top" align="left"><italic>II</italic></td>
<td valign="top" align="center">40</td>
<td valign="top" align="left">hsa-mir-146b</td>
<td valign="top" align="left"><italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">16</td>
<td valign="top" align="left">hsa-mir-18a</td>
<td valign="top" align="left"><italic>II</italic></td>
<td valign="top" align="center">41</td>
<td valign="top" align="left">hsa-mir-30a</td>
<td valign="top" align="left"><italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">17</td>
<td valign="top" align="left">hsa-mir-223</td>
<td valign="top" align="left"><italic>II</italic></td>
<td valign="top" align="center">42</td>
<td valign="top" align="left">hsa-mir-24</td>
<td valign="top" align="left"><italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">18</td>
<td valign="top" align="left">hsa-mir-122</td>
<td valign="top" align="left"><italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">43</td>
<td valign="top" align="left">hsa-mir-200c</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">19</td>
<td valign="top" align="left">hsa-mir-19b</td>
<td valign="top" align="left"><italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">44</td>
<td valign="top" align="left">hsa-mir-9</td>
<td valign="top" align="left"><italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">20</td>
<td valign="top" align="left">hsa-mir-126</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">45</td>
<td valign="top" align="left">hsa-mir-200a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">21</td>
<td valign="top" align="left">hsa-mir-199a</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
<td valign="top" align="center">46</td>
<td valign="top" align="left">hsa-mir-182</td>
<td valign="top" align="left"><italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">22</td>
<td valign="top" align="left">hsa-mir-1</td>
<td valign="top" align="left"><italic>II</italic></td>
<td valign="top" align="center">47</td>
<td valign="top" align="left">hsa-mir-214</td>
<td valign="top" align="left"><italic>I</italic>; <italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">23</td>
<td valign="top" align="left">hsa-mir-133a</td>
<td valign="top" align="left"><italic>II</italic></td>
<td valign="top" align="center">48</td>
<td valign="top" align="left">hsa-mir-148a</td>
<td valign="top" align="left"><italic>II</italic>; <italic>III</italic></td>
</tr>
<tr>
<td valign="top" align="left">24</td>
<td valign="top" align="left">hsa-mir-19a</td>
<td valign="top" align="left"><italic>II</italic></td>
<td valign="top" align="center">49</td>
<td valign="top" align="left">hsa-mir-195</td>
<td valign="top" align="left"><italic>II</italic></td>
</tr>
<tr>
<td valign="top" align="left">25</td>
<td valign="top" align="left">hsa-mir-222</td>
<td valign="top" align="left"><italic>II</italic></td>
<td valign="top" align="center">50</td>
<td valign="top" align="left">hsa-mir-7</td>
<td valign="top" align="left"><italic>II</italic></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>I, II, II denote HMDD v3.2, dbDEMC v2.0, and miR2Disease</italic>.</p>
</table-wrap-foot>
</table-wrap></sec></sec>
<sec sec-type="discussion" id="s4">
<title>4. Discussion</title>
<p>Studies have shown that the occurrence and development of many human diseases are related to the abnormal expression of miRNAs. Traditional biological verification of the interactions between miRNAs and diseases are time consuming and expensive. Therefore, computational methods of predicting the disease-related miRNAs can accelerate the identification process, and help us understand the potential mechanism of the interactions between miRNAs and diseases.</p>
<p>In this paper, we presented a computational method for miRNA-disease association prediction based on Heterogeneous Graph Attention Networks, which is superior to other state-of-the-art methods. Specially, we first constructed a heterogeneous graph containing miRNAs and diseases using disease semantic similarity, miRNA functional similarity, the GIP kernel similarities, and known associations between miRNAs and diseases. Then, we proposed a novel method based on weighted DeepWalk that can learn dense feature representation of miRNAs and diseases from the miRNA-miRNA and disease-disease sub-graphs. Furthermore, a GAT based model was implemented for further feature exaction from the miRNA-disease heterogeneous sub-graph, followed by a 3-layer fully-connected neural networks as the predictor. We conducted experiments on five-fold CV, case studies and ablation study. The results demonstrated that our proposed method HGATMDA can serve as an efficient and reliable tool for predicting potential relations between miRNAs and diseases, as well as therapeutics and clinical research.</p>
<p>Our model has achieved high predictive performance mainly for the following reasons. First, GAT was applied to extract node features in the heterogeneous miRNA-disease graph, which is very effective and expressive by leveraging multi-head attention mechanism. Second, initial node features extraction were obtained by weighted DeepWalk, combined with similarity integration, to further improve miRNA and disease characterization. Third, FC-based predictor is suitable and reliable for inferring the potential disease-related miRNAs.</p>
<p>However, there is still room for further improvement. Our constructed heterogeneous graph maybe inaccurate due to the computing equations used in disease and miRNA similarities. In the future, we can directly use the Mesh headings and relationships (<ext-link ext-link-type="uri" xlink:href="https://www.nlm.nih.gov/mesh/meshhome.html">https://www.nlm.nih.gov/mesh/meshhome.html</ext-link>) to build disease-disease sub-graph, and extract node features such as Guo et al. (<xref ref-type="bibr" rid="B22">2021</xref>). Furthermore, as our future work, we can introduce more biological information, such as miRNA sequence, miRNA-target, protein-protein, and protein-target interactions, to enrich nodes representations, and construct complex heterogeneous graph (Schlichtkrull et al., <xref ref-type="bibr" rid="B46">2018</xref>) with multiple node and edge types, to enhance the prediction performance of our model.</p></sec>
<sec sec-type="data-availability-statement" id="s5">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.</p></sec>
<sec id="s6">
<title>Author Contributions</title>
<p>CJ and YS conceived the entire project and wrote the paper. CJ and CZ developed the prediction method. CJ and JN collected data and designed the experiments. CJ and YW analyzed the results. All authors read and approved the final manuscript.</p></sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p></sec>
<sec sec-type="disclaimer" id="s7">
<title>Publisher&#x00027;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p></sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ambros</surname> <given-names>V.</given-names></name></person-group> (<year>2001</year>). <article-title>microRNAs: tiny regulators with great potential</article-title>. <source>Cell</source> <volume>107</volume>, <fpage>823</fpage>&#x02013;<lpage>826</lpage>. <pub-id pub-id-type="doi">10.1016/S0092-8674(01)00616-X</pub-id><pub-id pub-id-type="pmid">11779458</pub-id></citation></ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ambros</surname> <given-names>V.</given-names></name></person-group> (<year>2003</year>). <article-title>MicroRNA pathways in flies and worms: growth, death, fat, stress, and timing</article-title>. <source>Cell</source> <volume>113</volume>, <fpage>673</fpage>&#x02013;<lpage>676</lpage>. <pub-id pub-id-type="doi">10.1016/S0092-8674(03)00428-8</pub-id><pub-id pub-id-type="pmid">12809598</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ambros</surname> <given-names>V.</given-names></name></person-group> (<year>2004</year>). <article-title>The functions of animal microRNAs</article-title>. <source>Nature</source> <volume>431</volume>, <fpage>350</fpage>&#x02013;<lpage>355</lpage>. <pub-id pub-id-type="doi">10.1038/nature02871</pub-id><pub-id pub-id-type="pmid">15372042</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bartel</surname> <given-names>D. P.</given-names></name></person-group> (<year>2004</year>). <article-title>MicroRNAs: genomics, biogenesis, mechanism, and function</article-title>. <source>Cell</source> <volume>116</volume>, <fpage>281</fpage>&#x02013;<lpage>297</lpage>. <pub-id pub-id-type="doi">10.1016/S0092-8674(04)00045-5</pub-id><pub-id pub-id-type="pmid">14744438</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bartel</surname> <given-names>D. P.</given-names></name></person-group> (<year>2009</year>). <article-title>MicroRNAs: target recognition and regulatory functions</article-title>. <source>Cell</source> <volume>136</volume>, <fpage>215</fpage>&#x02013;<lpage>233</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2009.01.002</pub-id><pub-id pub-id-type="pmid">22468167</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bartel</surname> <given-names>D. P.</given-names></name></person-group> (<year>2018</year>). <article-title>Metazoan microRNAs</article-title>. <source>Cell</source> <volume>173</volume>, <fpage>20</fpage>&#x02013;<lpage>51</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2018.03.006</pub-id><pub-id pub-id-type="pmid">29570994</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Clarence Yan</surname> <given-names>C.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Deng</surname> <given-names>L.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>RBMMMDA: predicting multiple types of disease-microRNA associations</article-title>. <source>Sci. Rep</source>. <volume>5</volume>:<fpage>13877</fpage>. <pub-id pub-id-type="doi">10.1038/srep13877</pub-id><pub-id pub-id-type="pmid">26347258</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Gong</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>D. H.</given-names></name> <name><surname>You</surname> <given-names>Z. H.</given-names></name> <name><surname>Li</surname> <given-names>Z. W.</given-names></name></person-group> (<year>2018a</year>). <article-title>DRMDA: deep representations-based miRNA-disease association prediction</article-title>. <source>J. Cell. Mol. Med</source>. <volume>22</volume>, <fpage>472</fpage>&#x02013;<lpage>485</lpage>. <pub-id pub-id-type="doi">10.1111/jcmm.13336</pub-id><pub-id pub-id-type="pmid">28857494</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Liu</surname> <given-names>M. X.</given-names></name> <name><surname>Yan</surname> <given-names>G. Y.</given-names></name></person-group> (<year>2012</year>). <article-title>RWRMDA: Predicting novel human microRNA-disease associations</article-title>. <source>Mol. Biosyst</source>. <volume>8</volume>, <fpage>2792</fpage>&#x02013;<lpage>2798</lpage>. <pub-id pub-id-type="doi">10.1039/c2mb25180a</pub-id><pub-id pub-id-type="pmid">22875290</pub-id></citation></ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Sun</surname> <given-names>L.-G.</given-names></name> <name><surname>Zhao</surname> <given-names>Y.</given-names></name></person-group> (<year>2021</year>). <article-title>NCMCMDA: miRNA-disease association prediction through neighborhood constraint matrix completion</article-title>. <source>Brief. Bioinform</source>. <volume>22</volume>, <fpage>485</fpage>&#x02013;<lpage>496</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbz159</pub-id><pub-id pub-id-type="pmid">31927572</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Qu</surname> <given-names>J.</given-names></name> <name><surname>Guan</surname> <given-names>N. N.</given-names></name> <name><surname>Li</surname> <given-names>J. Q.</given-names></name></person-group> (<year>2018b</year>). <article-title>Predicting miRNA-disease association based on inductive matrix completion</article-title>. <source>Bioinformatics</source> <volume>34</volume>, <fpage>4256</fpage>&#x02013;<lpage>4265</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bty503</pub-id><pub-id pub-id-type="pmid">29939227</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Wang</surname> <given-names>L. Y.</given-names></name> <name><surname>Huang</surname> <given-names>L.</given-names></name></person-group> (<year>2018c</year>). <article-title>NDAMDA: Network distance analysis for miRNA-disease association prediction</article-title>. <source>J. Cell. Mol. Med</source>. <volume>22</volume>, <fpage>2884</fpage>&#x02013;<lpage>2895</lpage>. <pub-id pub-id-type="doi">10.1111/jcmm.13583</pub-id><pub-id pub-id-type="pmid">29532987</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Xie</surname> <given-names>D.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Zhao</surname> <given-names>Q.</given-names></name> <name><surname>You</surname> <given-names>Z. H.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name></person-group> (<year>2018d</year>). <article-title>BNPMDA: bipartite network projection for miRNA-disease association prediction</article-title>. <source>Bioinformatics</source> <volume>34</volume>, <fpage>3178</fpage>&#x02013;<lpage>3186</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bty333</pub-id><pub-id pub-id-type="pmid">29701758</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Yan</surname> <given-names>C. C.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>You</surname> <given-names>Z. H.</given-names></name> <name><surname>Huang</surname> <given-names>Y. A.</given-names></name> <name><surname>Yan</surname> <given-names>G. Y.</given-names></name></person-group> (<year>2016</year>). <article-title>HGIMDA: Heterogeneous graph inference for miRNA-disease association prediction</article-title>. <source>Oncotarget</source> <volume>7</volume>, <fpage>65257</fpage>&#x02013;<lpage>65269</lpage>. <pub-id pub-id-type="doi">10.18632/oncotarget.11251</pub-id><pub-id pub-id-type="pmid">27533456</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Yin</surname> <given-names>J.</given-names></name> <name><surname>Qu</surname> <given-names>J.</given-names></name> <name><surname>Huang</surname> <given-names>L.</given-names></name></person-group> (<year>2018e</year>). <article-title>MDHGI: matrix decomposition and heterogeneous graph inference for miRNA-disease association prediction</article-title>. <source>PLoS Comput. Biol</source>. <volume>14</volume>:<fpage>e1006418</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1006418</pub-id><pub-id pub-id-type="pmid">30142158</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Zhu</surname> <given-names>C. C.</given-names></name> <name><surname>Yin</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>Ensemble of decision tree reveals potential miRNA-disease associations</article-title>. <source>PLoS Comput. Biol</source>. <volume>15</volume>:<fpage>e1007209</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1007209</pub-id><pub-id pub-id-type="pmid">31329575</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chou</surname> <given-names>C. H.</given-names></name> <name><surname>Chang</surname> <given-names>N. W.</given-names></name> <name><surname>Shrestha</surname> <given-names>S.</given-names></name> <name><surname>Hsu</surname> <given-names>S. D.</given-names></name> <name><surname>Lin</surname> <given-names>Y. L.</given-names></name> <name><surname>Lee</surname> <given-names>W. H.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>miRTarBase 2016: updates to the experimentally validated miRNA-target interactions database</article-title>. <source>Nucleic Acids Res</source>. <volume>44</volume>, <fpage>D239</fpage>&#x02013;<lpage>D247</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkv1258</pub-id><pub-id pub-id-type="pmid">26590260</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Croce</surname> <given-names>C. M.</given-names></name></person-group> (<year>2008</year>). <article-title>Oncogenes and cancer</article-title>. <source>N. Engl. J. Med</source>. <volume>358</volume>, <fpage>502</fpage>&#x02013;<lpage>511</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMra072367</pub-id><pub-id pub-id-type="pmid">18234754</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fey</surname> <given-names>M.</given-names></name> <name><surname>Lenssen</surname> <given-names>J. E.</given-names></name></person-group> (<year>2019</year>). <article-title>Fast graph representation learning with pytorch geometric</article-title>. <source>arXiv [Preprint]. arXiv:1903.02428</source>.</citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friedman</surname> <given-names>R. C.</given-names></name> <name><surname>Farh</surname> <given-names>K.-H.</given-names></name> <name><surname>Burge</surname> <given-names>C. B.</given-names></name> <name><surname>Bartel</surname> <given-names>D. P.</given-names></name></person-group> (<year>2009</year>). <article-title>Most mammalian mRNAs are conserved targets of microRNAs</article-title>. <source>Genome Research</source> <volume>19</volume>, <fpage>92</fpage>&#x02013;<lpage>105</lpage>. <pub-id pub-id-type="doi">10.1101/gr.082701.108</pub-id><pub-id pub-id-type="pmid">18955434</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fu</surname> <given-names>L.</given-names></name> <name><surname>Peng</surname> <given-names>Q.</given-names></name></person-group> (<year>2017</year>). <article-title>A deep ensemble model to predict miRNA-disease association</article-title>. <source>Sci. Rep</source>. <volume>7</volume>, <fpage>1</fpage>&#x02013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1038/s41598-017-15235-6</pub-id><pub-id pub-id-type="pmid">29101378</pub-id></citation></ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guo</surname> <given-names>Z. H.</given-names></name> <name><surname>You</surname> <given-names>Z. H.</given-names></name> <name><surname>Huang</surname> <given-names>D. S.</given-names></name> <name><surname>Yi</surname> <given-names>H. C.</given-names></name> <name><surname>Zheng</surname> <given-names>K.</given-names></name> <name><surname>Chen</surname> <given-names>Z. H.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>MeSHHeading2vec: a new method for representing MeSH headings as vectors based on graph embedding algorithm</article-title>. <source>Brief. Bioinform</source>. <volume>22</volume>, <fpage>2085</fpage>&#x02013;<lpage>2095</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbaa037</pub-id><pub-id pub-id-type="pmid">32232320</pub-id></citation></ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>He</surname> <given-names>L.</given-names></name> <name><surname>Hannon</surname> <given-names>G. J.</given-names></name></person-group> (<year>2004</year>). <article-title>MicroRNAs: small RNAs with a big role in gene regulation</article-title>. <source>Nat. Rev. Genet</source>. <volume>5</volume>, <fpage>522</fpage>&#x02013;<lpage>531</lpage>. <pub-id pub-id-type="doi">10.1038/nrg1379</pub-id><pub-id pub-id-type="pmid">15211354</pub-id></citation></ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>Z.</given-names></name> <name><surname>Liu</surname> <given-names>L.</given-names></name> <name><surname>Gao</surname> <given-names>Y.</given-names></name> <name><surname>Shi</surname> <given-names>J.</given-names></name> <name><surname>Cui</surname> <given-names>Q.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2019a</year>). <article-title>Benchmark of computational methods for predicting microRNA-disease associations</article-title>. <source>Genome Biol</source>. <volume>20</volume>, <fpage>1</fpage>&#x02013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1186/s13059-019-1811-3</pub-id><pub-id pub-id-type="pmid">31594544</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>Z.</given-names></name> <name><surname>Shi</surname> <given-names>J.</given-names></name> <name><surname>Gao</surname> <given-names>Y.</given-names></name> <name><surname>Cui</surname> <given-names>C.</given-names></name> <name><surname>Zhang</surname> <given-names>S.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2019b</year>). <article-title>HMDD v3.0: a database for experimentally supported human microRNA-disease associations</article-title>. <source>Nucleic Acids Res</source>. <volume>47</volume>, <fpage>D1013</fpage>&#x02013;<lpage>D1017</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gky1010</pub-id><pub-id pub-id-type="pmid">30364956</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ji</surname> <given-names>C.</given-names></name> <name><surname>Gao</surname> <given-names>Z.</given-names></name> <name><surname>Ma</surname> <given-names>X.</given-names></name> <name><surname>Wu</surname> <given-names>Q.</given-names></name> <name><surname>Ni</surname> <given-names>J.</given-names></name> <name><surname>Zheng</surname> <given-names>C.</given-names></name></person-group> (<year>2021</year>). <article-title>AEMDA: inferring miRNA-disease associations based on deep autoencoder</article-title>. <source>Bioinformatics</source> <volume>37</volume>, <fpage>66</fpage>&#x02013;<lpage>72</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btaa670</pub-id><pub-id pub-id-type="pmid">32726399</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname> <given-names>Q.</given-names></name> <name><surname>Hao</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>G.</given-names></name> <name><surname>Juan</surname> <given-names>L.</given-names></name> <name><surname>Zhang</surname> <given-names>T.</given-names></name> <name><surname>Teng</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Prioritization of disease microRNAs through a human phenome-microRNAome network</article-title>. <source>BMC Syst. Biol</source>. <volume>4</volume>(<supplement>Suppl. 1</supplement>):<fpage>S2</fpage>. <pub-id pub-id-type="doi">10.1186/1752-0509-4-S1-S2</pub-id><pub-id pub-id-type="pmid">20522252</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname> <given-names>Q.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Hao</surname> <given-names>Y.</given-names></name> <name><surname>Juan</surname> <given-names>L.</given-names></name> <name><surname>Teng</surname> <given-names>M.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>miR2Disease: a manually curated database for microRNA deregulation in human disease</article-title>. <source>Nucleic Acids Res</source>. <volume>37</volume>(<supplement>Suppl_1</supplement>), <fpage>D98</fpage>&#x02013;<lpage>D104</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkn714</pub-id><pub-id pub-id-type="pmid">18927107</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kingma</surname> <given-names>D. P.</given-names></name> <name><surname>Ba</surname> <given-names>J.</given-names></name></person-group> (<year>2017</year>). <article-title>Adam: a method for stochastic optimization</article-title>. <source>arXiv [Preprint]. arXiv:1412.6980</source>.</citation></ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kipf</surname> <given-names>T. N.</given-names></name> <name><surname>Welling</surname> <given-names>M.</given-names></name></person-group> (<year>2017</year>). <article-title>Semi-supervised classification with graph convolutional networks</article-title>. <source>arXiv [Preprint]. arXiv:1609.02907</source>.<pub-id pub-id-type="pmid">29890408</pub-id></citation></ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Koren</surname> <given-names>Y.</given-names></name> <name><surname>Bell</surname> <given-names>R.</given-names></name> <name><surname>Volinsky</surname> <given-names>C.</given-names></name></person-group> (<year>2009</year>). <article-title>Matrix factorization techniques for recommender systems</article-title>. <source>Computer</source> <volume>42</volume>, <fpage>30</fpage>&#x02013;<lpage>37</lpage>. <pub-id pub-id-type="doi">10.1109/MC.2009.263</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>S.</given-names></name> <name><surname>Liu</surname> <given-names>T.</given-names></name> <name><surname>Ning</surname> <given-names>C.</given-names></name> <name><surname>Zhang</surname> <given-names>Z.</given-names></name> <name><surname>Zhou</surname> <given-names>W.</given-names></name></person-group> (<year>2020a</year>). <article-title>Neural inductive matrix completion with graph convolutional networks for miRNA-disease association prediction</article-title>. <source>Bioinformatics</source> <volume>36</volume>, <fpage>2538</fpage>&#x02013;<lpage>2546</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btz965</pub-id><pub-id pub-id-type="pmid">31904845</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>J. Q.</given-names></name> <name><surname>Rong</surname> <given-names>Z. H.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Yan</surname> <given-names>G. Y.</given-names></name> <name><surname>You</surname> <given-names>Z. H.</given-names></name></person-group> (<year>2017</year>). <article-title>MCMDA: Matrix completion for MiRNA-disease association prediction</article-title>. <source>Oncotarget</source> <volume>8</volume>, <fpage>21187</fpage>&#x02013;<lpage>21199</lpage>. <pub-id pub-id-type="doi">10.18632/oncotarget.15061</pub-id><pub-id pub-id-type="pmid">28177900</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Qiu</surname> <given-names>C.</given-names></name> <name><surname>Tu</surname> <given-names>J.</given-names></name> <name><surname>Geng</surname> <given-names>B.</given-names></name> <name><surname>Yang</surname> <given-names>J.</given-names></name> <name><surname>Jiang</surname> <given-names>T.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>HMDD v2.0: a database for experimentally supported human microRNA and disease associations</article-title>. <source>Nucleic Acids Res</source>. <volume>42</volume>, <fpage>D1070</fpage>&#x02013;<lpage>D1074</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkt1023</pub-id><pub-id pub-id-type="pmid">24194601</pub-id></citation></ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Nie</surname> <given-names>R.</given-names></name> <name><surname>You</surname> <given-names>Z.-H.</given-names></name> <name><surname>Bao</surname> <given-names>W.</given-names></name></person-group> (<year>2020b</year>). <article-title>A graph auto-encoder model for miRNA-disease associations prediction</article-title>. <source>Brief. Bioinform</source>. <volume>22</volume>, <fpage>1</fpage>&#x02013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbaa240</pub-id><pub-id pub-id-type="pmid">34293850</pub-id></citation></ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Long</surname> <given-names>Y.</given-names></name> <name><surname>Luo</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Xia</surname> <given-names>Y.</given-names></name></person-group> (<year>2021</year>). <article-title>Predicting human microbe-disease associations via graph attention networks with inductive matrix completion</article-title>. <source>Brief. Bioinform</source>. <volume>22</volume>, <fpage>1</fpage>&#x02013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbaa146</pub-id><pub-id pub-id-type="pmid">32725163</pub-id></citation></ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Long</surname> <given-names>Y.</given-names></name> <name><surname>Wu</surname> <given-names>M.</given-names></name> <name><surname>Kwoh</surname> <given-names>C. K.</given-names></name> <name><surname>Luo</surname> <given-names>J.</given-names></name> <name><surname>Li</surname> <given-names>X.</given-names></name></person-group> (<year>2020</year>). <article-title>Predicting human microbe-drug associations via graph convolutional network with conditional random field</article-title>. <source>Bioinformatics</source> <volume>36</volume>, <fpage>4918</fpage>&#x02013;<lpage>4927</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btaa598</pub-id><pub-id pub-id-type="pmid">32597948</pub-id></citation></ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lu</surname> <given-names>M.</given-names></name> <name><surname>Zhang</surname> <given-names>Q.</given-names></name> <name><surname>Deng</surname> <given-names>M.</given-names></name> <name><surname>Miao</surname> <given-names>J.</given-names></name> <name><surname>Guo</surname> <given-names>Y.</given-names></name> <name><surname>Gao</surname> <given-names>W.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>An analysis of human microRNA and disease associations</article-title>. <source>PLoS ONE</source> <volume>3</volume>:<fpage>e3420</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0003420</pub-id><pub-id pub-id-type="pmid">18923704</pub-id></citation></ref>
<ref id="B39">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Mikolov</surname> <given-names>T.</given-names></name> <name><surname>Sutskever</surname> <given-names>I.</given-names></name> <name><surname>Chen</surname> <given-names>K.</given-names></name> <name><surname>Corrado</surname> <given-names>G.</given-names></name> <name><surname>Dean</surname> <given-names>J.</given-names></name></person-group> (<year>2013</year>). <article-title>Distributed representations ofwords and phrases and their compositionality,</article-title> in <source>Advances in Neural Information Processing Systems, NIPS&#x00027;13</source> (<publisher-loc>Red Hook, NY</publisher-loc>: <publisher-name>Curran Associates Inc.</publisher-name>), <fpage>1</fpage>&#x02013;<lpage>9</lpage>.</citation></ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pan</surname> <given-names>X.</given-names></name> <name><surname>Shen</surname> <given-names>H. B.</given-names></name></person-group> (<year>2019</year>). <article-title>Inferring disease-associated microRNAs using semi-supervised multi-label graph convolutional networks</article-title>. <source>iScience</source> <volume>20</volume>, <fpage>265</fpage>&#x02013;<lpage>277</lpage>. <pub-id pub-id-type="doi">10.1016/j.isci.2019.09.013</pub-id><pub-id pub-id-type="pmid">31605942</pub-id></citation></ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pan</surname> <given-names>Z.</given-names></name> <name><surname>Zhang</surname> <given-names>H.</given-names></name> <name><surname>Liang</surname> <given-names>C.</given-names></name> <name><surname>Li</surname> <given-names>G.</given-names></name> <name><surname>Xiao</surname> <given-names>Q.</given-names></name> <name><surname>Ding</surname> <given-names>P.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Self-weighted multi-kernel multi-label learning for potential miRNA-disease association prediction</article-title>. <source>Mol. Therapy Nucleic Acids</source> <volume>17</volume>, <fpage>414</fpage>&#x02013;<lpage>423</lpage>. <pub-id pub-id-type="doi">10.1016/j.omtn.2019.06.014</pub-id><pub-id pub-id-type="pmid">31319245</pub-id></citation></ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Paszke</surname> <given-names>A.</given-names></name> <name><surname>Gross</surname> <given-names>S.</given-names></name> <name><surname>Massa</surname> <given-names>F.</given-names></name> <name><surname>Lerer</surname> <given-names>A.</given-names></name> <name><surname>Bradbury</surname> <given-names>J.</given-names></name> <name><surname>Chanan</surname> <given-names>G.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Pytorch: an imperative style, high-performance deep learning library</article-title>. <source>arXiv [Preprint]. arXiv:1912.01703</source>.</citation></ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peng</surname> <given-names>J.</given-names></name> <name><surname>Hui</surname> <given-names>W.</given-names></name> <name><surname>Li</surname> <given-names>Q.</given-names></name> <name><surname>Chen</surname> <given-names>B.</given-names></name> <name><surname>Hao</surname> <given-names>J.</given-names></name> <name><surname>Jiang</surname> <given-names>Q.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>A learning-based framework for miRNA-disease association identification using neural networks</article-title>. <source>Bioinformatics</source> <volume>35</volume>, <fpage>4364</fpage>&#x02013;<lpage>4371</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btz254</pub-id><pub-id pub-id-type="pmid">30977780</pub-id></citation></ref>
<ref id="B44">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Perozzi</surname> <given-names>B.</given-names></name> <name><surname>Al-Rfou</surname> <given-names>R.</given-names></name> <name><surname>Skiena</surname> <given-names>S.</given-names></name></person-group> (<year>2014</year>). <article-title>Deepwalk: online learning of social representations,</article-title> in <source>Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</source> (<publisher-loc>New York, NY</publisher-loc>). <pub-id pub-id-type="doi">10.1145/2623330.2623732</pub-id></citation></ref>
<ref id="B45">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Rehurek</surname> <given-names>R.</given-names></name> <name><surname>Sojka</surname> <given-names>P.</given-names></name></person-group> (<year>2010</year>). <article-title>Software framework for topic modelling with large corpora,</article-title> in <source>Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks</source> (<publisher-loc>Valletta</publisher-loc>), <fpage>45</fpage>&#x02013;<lpage>50</lpage>.</citation></ref>
<ref id="B46">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Schlichtkrull</surname> <given-names>M.</given-names></name> <name><surname>Kipf</surname> <given-names>T. N.</given-names></name> <name><surname>Bloem</surname> <given-names>P.</given-names></name> <name><surname>van den Berg</surname> <given-names>R.</given-names></name> <name><surname>Titov</surname> <given-names>I.</given-names></name> <name><surname>Welling</surname> <given-names>M.</given-names></name></person-group> (<year>2018</year>). <article-title>Modeling relational data with graph convolutional networks,</article-title> in <source>Lecture Notes in Computer Science</source>, eds <person-group person-group-type="editor"><name><surname>Gangemi</surname> <given-names>A.</given-names></name> <name><surname>Navigli</surname> <given-names>R.</given-names></name> <name><surname>Vidal</surname> <given-names>M.-E.</given-names></name> <name><surname>Hitzler</surname> <given-names>P.</given-names></name> <name><surname>Troncy</surname> <given-names>R.</given-names></name> <name><surname>Hollink</surname> <given-names>L.</given-names></name> <name><surname>Tordai</surname> <given-names>A.</given-names></name> <name><surname>Alam</surname> <given-names>M.</given-names></name></person-group> (<publisher-loc>Heraklion</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>593</fpage>&#x02013;<lpage>607</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-319-93417-4_38</pub-id></citation></ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shen</surname> <given-names>Z.</given-names></name> <name><surname>Zhang</surname> <given-names>Y. H.</given-names></name> <name><surname>Han</surname> <given-names>K.</given-names></name> <name><surname>Nandi</surname> <given-names>A. K.</given-names></name> <name><surname>Honig</surname> <given-names>B.</given-names></name> <name><surname>Huang</surname> <given-names>D. S.</given-names></name></person-group> (<year>2017</year>). <article-title>MiRNA-disease association prediction with collaborative matrix factorization</article-title>. <source>Complexity</source> <volume>2017</volume>, <fpage>1</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1155/2017/2498957</pub-id></citation></ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Siegel</surname> <given-names>R. L.</given-names></name> <name><surname>Miller</surname> <given-names>K. D.</given-names></name> <name><surname>Jemal</surname> <given-names>A.</given-names></name></person-group> (<year>2020</year>). <article-title>Cancer statistics, 2020</article-title>. <source>CA Cancer J. Clin</source>. <volume>70</volume>, <fpage>7</fpage>&#x02013;<lpage>30</lpage>. <pub-id pub-id-type="doi">10.3322/caac.21590</pub-id><pub-id pub-id-type="pmid">31912902</pub-id></citation></ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>van Laarhoven</surname> <given-names>T.</given-names></name> <name><surname>Nabuurs</surname> <given-names>S. B.</given-names></name> <name><surname>Marchiori</surname> <given-names>E.</given-names></name></person-group> (<year>2011</year>). <article-title>Gaussian interaction profile kernels for predicting drug-target interaction</article-title>. <source>Bioinformatics</source> <volume>27</volume>, <fpage>3036</fpage>&#x02013;<lpage>3043</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btr500</pub-id><pub-id pub-id-type="pmid">21893517</pub-id></citation></ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>D.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Lu</surname> <given-names>M.</given-names></name> <name><surname>Song</surname> <given-names>F.</given-names></name> <name><surname>Cui</surname> <given-names>Q.</given-names></name></person-group> (<year>2010</year>). <article-title>Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases</article-title>. <source>Bioinformatics</source> <volume>26</volume>, <fpage>1644</fpage>&#x02013;<lpage>1650</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btq241</pub-id><pub-id pub-id-type="pmid">20439255</pub-id></citation></ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xiao</surname> <given-names>Q.</given-names></name> <name><surname>Luo</surname> <given-names>J.</given-names></name> <name><surname>Liang</surname> <given-names>C.</given-names></name> <name><surname>Cai</surname> <given-names>J.</given-names></name> <name><surname>Ding</surname> <given-names>P.</given-names></name></person-group> (<year>2018</year>). <article-title>A graph regularized non-negative matrix factorization method for identifying microRNA-disease associations</article-title>. <source>Bioinformatics</source> <volume>34</volume>, <fpage>239</fpage>&#x02013;<lpage>248</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btx545</pub-id><pub-id pub-id-type="pmid">28968779</pub-id></citation></ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>J.</given-names></name> <name><surname>Li</surname> <given-names>C. X.</given-names></name> <name><surname>Lv</surname> <given-names>J. Y.</given-names></name> <name><surname>Li</surname> <given-names>Y. S.</given-names></name> <name><surname>Xiao</surname> <given-names>Y.</given-names></name> <name><surname>Shao</surname> <given-names>T. T.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Prioritizing candidate disease miRNAs by topological features in the miRNA target-dysregulated network: case study of prostate cancer</article-title>. <source>Mol. Cancer Therapeut</source>. <volume>10</volume>, <fpage>1857</fpage>&#x02013;<lpage>1866</lpage>. <pub-id pub-id-type="doi">10.1158/1535-7163.MCT-11-0055</pub-id><pub-id pub-id-type="pmid">21768329</pub-id></citation></ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xuan</surname> <given-names>P.</given-names></name> <name><surname>Dong</surname> <given-names>Y.</given-names></name> <name><surname>Guo</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>T.</given-names></name> <name><surname>Liu</surname> <given-names>Y.</given-names></name></person-group> (<year>2018</year>). <article-title>Dual convolutional neural network based method for predicting disease-related miRNAs</article-title>. <source>Int. J. Mol. Sci</source>. <volume>19</volume>:<fpage>3732</fpage>. <pub-id pub-id-type="doi">10.3390/ijms19123732</pub-id><pub-id pub-id-type="pmid">30477152</pub-id></citation></ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yan</surname> <given-names>C.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Ni</surname> <given-names>P.</given-names></name> <name><surname>Lan</surname> <given-names>W.</given-names></name> <name><surname>Wu</surname> <given-names>F. X.</given-names></name> <name><surname>Pan</surname> <given-names>Y.</given-names></name></person-group> (<year>2019</year>). <article-title>DNRLMF-MDA: predicting microRNA-disease associations based on similarities of microRNAs and diseases</article-title>. <source>IEEE/ACM Trans. Comput. Biol. Bioinform</source>. <volume>16</volume>, <fpage>233</fpage>&#x02013;<lpage>243</lpage>. <pub-id pub-id-type="doi">10.1109/TCBB.2017.2776101</pub-id><pub-id pub-id-type="pmid">29990253</pub-id></citation></ref>
<ref id="B55">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>Z.</given-names></name> <name><surname>Wu</surname> <given-names>L.</given-names></name> <name><surname>Wang</surname> <given-names>A.</given-names></name> <name><surname>Tang</surname> <given-names>W.</given-names></name> <name><surname>Zhao</surname> <given-names>Y.</given-names></name> <name><surname>Zhao</surname> <given-names>H.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>DbDEMC 2.0: updated database of differentially expressed miRNAs in human cancers</article-title>. <source>Nucleic Acids Res</source>. <volume>45</volume>, <fpage>D812</fpage>&#x02013;<lpage>D818</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw1079</pub-id><pub-id pub-id-type="pmid">27899556</pub-id></citation></ref>
<ref id="B56">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>You</surname> <given-names>Z. H.</given-names></name> <name><surname>Huang</surname> <given-names>Z. A.</given-names></name> <name><surname>Zhu</surname> <given-names>Z.</given-names></name> <name><surname>Yan</surname> <given-names>G. Y.</given-names></name> <name><surname>Li</surname> <given-names>Z. W.</given-names></name> <name><surname>Wen</surname> <given-names>Z.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>PBMDA: a novel and effective path-based computational model for miRNA-disease association prediction</article-title>. <source>PLoS Comput. Biol</source>. <volume>13</volume>:<fpage>e1005455</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1005455</pub-id><pub-id pub-id-type="pmid">28339468</pub-id></citation></ref>
<ref id="B57">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname> <given-names>S. P.</given-names></name> <name><surname>Liang</surname> <given-names>C.</given-names></name> <name><surname>Xiao</surname> <given-names>Q.</given-names></name> <name><surname>Li</surname> <given-names>G. H.</given-names></name> <name><surname>Ding</surname> <given-names>P. J.</given-names></name> <name><surname>Luo</surname> <given-names>J. W.</given-names></name></person-group> (<year>2019</year>). <article-title>MCLPMDA: A novel method for miRNA-disease association prediction based on matrix completion and label propagation</article-title>. <source>J. Cell. Mol. Med</source>. <volume>23</volume>, <fpage>1427</fpage>&#x02013;<lpage>1438</lpage>. <pub-id pub-id-type="doi">10.1111/jcmm.14048</pub-id><pub-id pub-id-type="pmid">30499204</pub-id></citation></ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Yin</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>Prediction of potential mirna-disease associations through a novel unsupervised deep learning framework with variational autoencoder</article-title>. <source>Cells</source> <volume>8</volume>:<fpage>1040</fpage>. <pub-id pub-id-type="doi">10.3390/cells8091040</pub-id><pub-id pub-id-type="pmid">31489920</pub-id></citation></ref>
<ref id="B59">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhong</surname> <given-names>Y.</given-names></name> <name><surname>Xuan</surname> <given-names>P.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name> <name><surname>Zhang</surname> <given-names>T.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Liu</surname> <given-names>Y.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>A non-negative matrix factorization based method for predicting disease-associated miRNAs in miRNA-disease bilayer network</article-title>. <source>Bioinformatics</source> <volume>34</volume>, <fpage>267</fpage>&#x02013;<lpage>277</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btx546</pub-id><pub-id pub-id-type="pmid">28968753</pub-id></citation></ref>
<ref id="B60">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>J.</given-names></name> <name><surname>Cui</surname> <given-names>G.</given-names></name> <name><surname>Hu</surname> <given-names>S.</given-names></name> <name><surname>Zhang</surname> <given-names>Z.</given-names></name> <name><surname>Yang</surname> <given-names>C.</given-names></name> <name><surname>Liu</surname> <given-names>Z.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>Graph neural networks: a review of methods and applications</article-title>. <source>AI Open</source> <volume>1</volume>, <fpage>57</fpage>&#x02013;<lpage>81</lpage>. <pub-id pub-id-type="doi">10.1016/j.aiopen.2021.01.001</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn fn-type="financial-disclosure"><p><bold>Funding.</bold> This work was supported by the National Natural Science Foundation of China (grant numbers U19A2064, 61873001, and 61872220), and the Natural Science Foundation of Shandong Province (grant number ZR2020KC022).</p>
</fn>
</fn-group>
</back>
</article>