<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2018.02560</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Human Microbe-Disease Association Prediction With Graph Regularized Non-Negative Matrix Factorization</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>He</surname> <given-names>Bin-Sheng</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/598832/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Peng</surname> <given-names>Li-Hong</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="corresp" rid="c002"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/601035/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Li</surname> <given-names>Zejun</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<xref ref-type="corresp" rid="c003"><sup>&#x0002A;</sup></xref>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>The First Affiliated Hospital, Changsha Medical University</institution>, <addr-line>Changsha</addr-line>, <country>China</country></aff>
<aff id="aff2"><sup>2</sup><institution>School of Information Engineering, Changsha Medical University</institution>, <addr-line>Changsha</addr-line>, <country>China</country></aff>
<aff id="aff3"><sup>3</sup><institution>College of Information Science and Engineering, Hunan University</institution>, <addr-line>Changsha</addr-line>, <country>China</country></aff>
<aff id="aff4"><sup>4</sup><institution>School of Computer and Information Science, Hunan Institute of Technology</institution>, <addr-line>Hengyang</addr-line>, <country>China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Hongsheng Liu, Liaoning University, China</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Edwin Wang, University of Calgary, Canada; Jiawei Luo, Hunan University, China</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Bin-Sheng He <email>hbscsmu&#x00040;163.com</email></corresp>
<corresp id="c002">Li-Hong Peng <email>plhhnu&#x00040;163.com</email></corresp>
<corresp id="c003">Zejun Li <email>lzjfox&#x00040;163.com</email></corresp>
<fn fn-type="other" id="fn003"><p>This article was submitted to Systems Microbiology, a section of the journal Frontiers in Microbiology</p></fn></author-notes>
<pub-date pub-type="epub">
<day>01</day>
<month>11</month>
<year>2018</year>
</pub-date>
<pub-date pub-type="collection">
<year>2018</year>
</pub-date>
<volume>9</volume>
<elocation-id>2560</elocation-id>
<history>
<date date-type="received">
<day>16</day>
<month>08</month>
<year>2018</year>
</date>
<date date-type="accepted">
<day>08</day>
<month>10</month>
<year>2018</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2018 He, Peng and Li.</copyright-statement>
<copyright-year>2018</copyright-year>
<copyright-holder>He, Peng and Li</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>A microbe is a microscopic organism which may exists in its single-celled form or in a colony of cells. In recent years, accumulating researchers have been engaged in the field of uncovering microbe-disease associations since microbes are found to be closely related to the prevention, diagnosis, and treatment of many complex human diseases. As an effective supplement to the traditional experiment, more and more computational models based on various algorithms have been proposed for microbe-disease association prediction to improve efficiency and cost savings. In this work, we developed a novel predictive model of Graph Regularized Non-negative Matrix Factorization for Human Microbe-Disease Association prediction (GRNMFHMDA). Initially, microbe similarity and disease similarity were constructed on the basis of the symptom-based disease similarity and Gaussian interaction profile kernel similarity for microbes and diseases. Subsequently, it is worth noting that we utilized a preprocessing step in which unknown microbe-disease pairs were assigned associated likelihood scores to avoid the possible negative impact on the prediction performance. Finally, we implemented a graph regularized non-negative matrix factorization framework to identify potential associations for all diseases simultaneously. To assess the performance of our model, cross validations including global leave-one-out cross validation (LOOCV) and local LOOCV were implemented. The AUCs of 0.8715 (global LOOCV) and 0.7898 (local LOOCV) proved the reliable performance of our computational model. In addition, we carried out two types of case studies on three different human diseases to further analyze the prediction performance of GRNMFHMDA, in which most of the top 10 predicted disease-related microbes were verified by database HMDAD or experimental literatures.</p>
</abstract>
<kwd-group>
<kwd>microbe</kwd>
<kwd>disease</kwd>
<kwd>association prediction</kwd>
<kwd>graph regularization</kwd>
<kwd>matrix factorization</kwd>
</kwd-group>
<counts>
<fig-count count="3"/>
<table-count count="3"/>
<equation-count count="27"/>
<ref-count count="52"/>
<page-count count="11"/>
<word-count count="7324"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>Antonie Van Leeuwenhoek, the father of microbiology, was the first to discover, observe, describe, study, and conduct scientific experiments with microbes, using simple single-lensed microscopes of his own design in 1673 (Leeuwenhoek, <xref ref-type="bibr" rid="B28">1683-1775</xref>). From then on, with the development of biological theory and technology, a great mass of microbes has been discovered. It has been suggested that the amount of organisms living below the Earth&#x00027;s surface is comparable with the amount of life on or above the surface (Gold, <xref ref-type="bibr" rid="B21">1992</xref>). As we know, microbes are very closely related to humans in many fields, such as food production (Smid and Lacroix, <xref ref-type="bibr" rid="B38">2013</xref>), water treatment (Tabatabaei et al., <xref ref-type="bibr" rid="B40">2010</xref>), energy (Tanaka, <xref ref-type="bibr" rid="B41">1999</xref>), and human health (Thiele et al., <xref ref-type="bibr" rid="B42">2013</xref>). Especially, many studies have demonstrated that one of the most important effects of microbes on humans is the associations between microbes and complex human diseases. For example, Boleij et al. (<xref ref-type="bibr" rid="B4">2015</xref>) proved that the <italic>Bacteroides fragilis</italic> toxin gene is associated with colorectal neoplasia, especially in late-stage colorectal cancer (CRC). Moreover, Galiana et al. (<xref ref-type="bibr" rid="B18">2014</xref>) found that <italic>Actinomyces</italic> can be as an indicator in the evolution of chronic obstructive pulmonary disease (COPD) patients because their study confirmed a strong association between the presence or absence of <italic>Actinomyces</italic> and the severity of the clinical condition. Another example is that periodontal pathogens <italic>Porphyromonas gingivalis</italic> and <italic>Fusobacterium nucleatum</italic> stimulate tumorigenesis of oral squamous cell carcinoma (OSCC) via direct interaction with oral epithelial cells through Toll-like receptors which is beneficial to the development of corresponding prevention and treatment schemes (Binder Gallimidi et al., <xref ref-type="bibr" rid="B3">2015</xref>). Thus, due to the fact that detecting potential microbiological markers could help to provide a better understanding of the pathogenesis of diseases and the role played by the microbiota in its severity, it is of great significance to explore the potential associations between microbes and diseases. However, since traditional experimental methods always suffer from the time constraints and capital limitations, proposing novel computational models is able to be an effective complement for uncovering potential microbe-disease associations. Recently, many feasible and effective prediction models have been developed by researchers.</p>
<p>In the last few years, some prediction models were proposed based on network analysis. Ma et al. (<xref ref-type="bibr" rid="B30">2017</xref>) developed an analysis method based on the microbe-based human disease network (Human Microbe Disease Network, HMDN) to infer the associations between microbes and disease genes, symptoms, chemical fragments, and drugs. In the method, they first utilized a large-scale text mining-based method to build the microbe-disease association network, on which the cosine similarity was calculated for each disease pair to construct the HMDN. Taking microbe-disease gene association prediction as an example, the potential related disease genes of a microbe in the HMDN can be finally obtained by finding the highly overlapped genes among the microbe-related diseases in the gene-based human disease network (Human Gene Disease Network, HGDN). Besides, in a similar way, this analysis method can also be used between HMDN and symptom-based human disease network (Human Symptoms Disease Network, HSDN), chemical fragment-based human disease network (Human Chemical Fragments Disease Network, HCDN), and drug-based human disease network (Human Drug Disease Network, HDDN) to infer the associations between microbes and disease symptoms, chemical fragments, and drugs, respectively. However, the prediction performance of this analysis method is limited by the small microbe-based disease network. Thereafter, Chen et al. (<xref ref-type="bibr" rid="B8">2017a</xref>) was the first to propose a computational model of KATZ measure for Human Microbe-Disease Association prediction (KATZHMDA) on a large scale. Firstly, they integrated the known microbe-disease associations network and Gaussian interaction profile kernel similarity networks of microbes and diseases into a heterogeneous graph. Through summarizing all walks with different weighted lengths (i.e., the walk with shorter length was assigned larger coefficient) for each microbe-disease pair, they finally calculated the association probability between each microbe and disease. Moreover, KATZHMDA is applicable for new diseases/microbes without known associations if there are additional available similarity information between the new disease/microbe and other diseases/microbes in the known microbe-disease association network. One limitation of KATZHMDA is that the optimal value of the number of walks is still hard to select. Later, Huang Z. A. et al. (<xref ref-type="bibr" rid="B25">2017</xref>) proposed a model of Path-Based Human Microbe-Disease Association Prediction (PBHMDA) by integrating known microbe-disease association network and Gaussian interaction profile kernel similarity network for microbes and diseases into a heterogeneous interlinked network in which a threshold was set to remove the edges that represent weak correlations. In the heterogeneous interlinked network, the weights of all paths between a microbe-disease pair were finally aggregated to represent the association probability between the microbe and the disease, while the weight of each path was calculated by multiplying the weights of all edges in the path without overlap and then penalizing the path with a decay coefficient. The limitation existing in PBHMDA is that it will cause bias to microbes or diseases with more known associations. Moreover, PBHMDA cannot work well for new microbes and new diseases.</p>
<p>In addition, some proposed models were not based on network analysis. Since the negative microbe-disease samples (i.e., microbe-disease pairs that are confirmed to have no associations) are unavailable, Wang et al. (<xref ref-type="bibr" rid="B45">2017</xref>) presented a semi-supervised learning-based computational model of Laplacian Regularized Least Squares for Human Microbe-Disease Association prediction (LRLSHMDA) by optimizing the Laplacian regularized least squares classifiers in microbe space and disease space. Finally, they used a simple weighted average operation on the above two optimal classifiers to obtain the final probability matrix that indicates the potential association probabilities between microbes and diseases. However, LRLSHMDA is still faced with the problem of being unable to be implemented to new diseases without known associated microbes. Similarly, with no need for negative samples, Huang Y. A. et al. (<xref ref-type="bibr" rid="B24">2017</xref>) developed the method of a Neighbor- and Graph-based combined Recommendation model for Human Microbe-Disease Association prediction (NGRHMDA) by combining two recommendation models that are neighbor-based collaborative filtering model and topological information-based model. In the neighbor-based collaborative filtering model, considering that different microbe-disease pairs may share the same microbes or diseases, they computed two association possibility matrices respectively from the microbe perspective and disease perspective and then averaged them to obtain a prediction matrix. While in the topological information-based model, they introduced a two-step diffusion approach on the microbe-disease bipartite graph to obtain another prediction matrix. Ultimately, the above two prediction matrices were simply averaged to get the final association possibilities for all microbe-disease pairs. What is worth noting is that NGRHMDA shares the same aforementioned disadvantage with LRLSHMDA.</p>
<p>In summary, all of the above models have their own limitations in predicting microbe-disease associations. Due to the lack of measurements for microbe/disease similarity, some models are only based on the Gaussian interaction profile kernel similarity of microbes and diseases that leads to unavoidable bias to those well-investigated diseases and microbes. Besides, some models cannot predict for new microbes/diseases and optimal parameters in some models are not easy to select. In this work, considering some of the above limitations, we developed a novel computational model of Graph Regularized Non-negative Matrix Factorization for Human Microbe-Disease Association prediction (GRNMFHMDA). First of all, the information of Gaussian interaction profile kernel similarity of microbes and diseases, symptom-based disease similarity and known microbe-disease associations in HMDAD (Ma et al., <xref ref-type="bibr" rid="B30">2017</xref>) were combined as the input to start the whole prediction process. Here, after data preparation, the prediction process consists of two main steps, the preprocessing step and the step of GRNMF. In the preprocessing step, the weighted <italic>K</italic> nearest neighbor profiles for microbes and diseases were calculated to reconstruct the original adjacency matrix obtained based on the known microbe-disease associations so that we could avoid the possible negative impact on the final prediction performance from unknown microbe-disease pairs. While in the step of GRNMF, Tikhonov (<italic>L</italic><sub>2</sub>) and graph Laplacian regularization were introduced into the standard NMF framework to obtain a smoother solution from matrix factorization and take full advantage of the geometric structure of our data, respectively. In addition, global leave-one-out cross validation (LOOCV), local LOOCV and two types of case studies were carried out to evaluate the prediction performance of our model. As a result, GRNMFHMDA obtained AUCs of 0.8715 (global LOOCV) and 0.7898 (local LOOCV). More than that, 9 (Asthma), 9 (Obesity), and 8 (Type 1 diabetes) out of the top 10 predicted disease-related microbes were confirmed by HMDAD or experimental literatures. Thus, it is obvious that our model would perform well in microbe-disease association prediction according to the aforementioned results.</p>
</sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and methods</title>
<sec>
<title>Method overview</title>
<p>Here, to predict potential associations between microbes and diseases, the model of GRNMFHMDA (See Figure <xref ref-type="fig" rid="F1">1</xref>) can be decomposed into three steps: (1) data preparation, in which adjacency matrix, microbe similarity, and disease similarity were established; (2) the preprocessing step, in which unknown microbe-disease pairs were assigned with associated likelihood scores based on the calculation of weighted <italic>K</italic> nearest neighbor profiles for microbes and diseases; (3) GRNMF, in which Tikhonov (<italic>L</italic><sub>2</sub>) and Graph Laplacian regularization were introduced into the standard NMF framework to obtain the final score matrix.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Flowchart of GRNMFHMDA model to predict the potential microbe-disease associations.</p></caption>
<graphic xlink:href="fmicb-09-02560-g0001.tif"/>
</fig>
</sec>
<sec>
<title>Human microbe-disease associations</title>
<p>From the Human Microbe-Disease Association Database (HMDAD, <ext-link ext-link-type="uri" xlink:href="http://www.cuilab.cn/hmdad">http://www.cuilab.cn/hmdad</ext-link>) (Ma et al., <xref ref-type="bibr" rid="B30">2017</xref>), we can download 483 known microbe-disease associations between 292 microbes and 39 human diseases. However, since some microbe-disease associations we downloaded are the same, there were only 450 known associations after removing the duplicate parts according to different evidences. In order to represent the associations information in a more convenient and efficient way, we defined an adjacency matrix <italic>Y</italic> &#x02208; <italic>R</italic><sup><italic>m</italic>&#x0002A;<italic>n</italic></sup>, where <italic>m</italic> and <italic>n</italic> denoted the number of microbes and diseases, respectively. Moreover, the element <italic>Y</italic>(<italic>m</italic><sub><italic>i</italic></sub>, <italic>d</italic><sub><italic>j</italic></sub>) was set to 1 if microbe <italic>m</italic><sub><italic>i</italic></sub> and disease <italic>d</italic><sub><italic>j</italic></sub> had known association, otherwise 0.</p>
</sec>
<sec>
<title>Gaussian interaction profile kernel similarity for microbes</title>
<p>There is a hypothesis that similar microbes (i.e., microbes exhibiting a similar pattern of interaction and non-interaction with the diseases of a microbe-disease association network) are inclined to be associated with the same disease, on which many previous studies had relied to construct the Gaussian interaction profile kernel similarity for microbes (Chen et al., <xref ref-type="bibr" rid="B8">2017a</xref>; Huang Z. A. et al., <xref ref-type="bibr" rid="B25">2017</xref>). In this article, based on the same assumption, we first represent the interaction profile for each microbe with a binary vector involving the association information between the microbe and each disease in the known microbe-disease association network. On the basis of the definition of adjacency matrix <italic>Y</italic>, the <italic>i</italic>th row vector (<italic>Y</italic>(<italic>m</italic><sub><italic>i</italic></sub>) &#x0003D; (<italic>Y</italic><sub><italic>i</italic>1</sub>, <italic>Y</italic><sub><italic>i</italic>2</sub>, &#x02026;, <italic>Y</italic><sub><italic>in</italic></sub>)) can be used to denote the interaction profile of microbe <italic>m</italic><sub><italic>i</italic></sub>. Thus, according to the method of van Laarhoven et al. (<xref ref-type="bibr" rid="B43">2011</xref>), the Gaussian interaction profile kernel similarity between microbe <italic>m</italic><sub><italic>i</italic></sub> and <italic>m</italic><sub><italic>j</italic></sub> can be defined as follows:</p>
<disp-formula id="E1"><label>(1)</label><mml:math id="M1"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msup><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mo class="qopname">exp</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:mi>Y</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>-</mml:mo><mml:mi>Y</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>|</mml:mo><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where</p>
<disp-formula id="E2"><label>(2)</label><mml:math id="M2"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi><mml:mi>&#x02032;</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>/</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:mfrac><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:msup><mml:mrow><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:mi>Y</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>|</mml:mo><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Here, &#x003B3;<sub><italic>m</italic></sub> is the adjustment coefficient that can be obtained by normalizing another bandwidth parameter &#x003B3;&#x02032;<sub><italic>m</italic></sub>.</p>
</sec>
<sec>
<title>Gaussian interaction profile kernel similarity for diseases</title>
<p>The construction of the Gaussian interaction profile kernel similarity for diseases is based on the assumption that similar diseases (i.e., diseases exhibiting a similar pattern of interaction and non-interaction with the microbes of a microbe-disease association network) are more likely to be associated with similar microbes. Here, the interaction profile for each disease is also represented by a binary vector containing the association information between the disease and each microbe in the known microbe-disease association network. Based on the same method of van Laarhoven et al. (<xref ref-type="bibr" rid="B43">2011</xref>), the <italic>j</italic>th column vector (<italic>Y</italic>(<italic>d</italic><sub><italic>j</italic></sub>) &#x0003D; (<italic>Y</italic><sub>1<italic>j</italic></sub>, <italic>Y</italic><sub>2<italic>j</italic></sub>, &#x02026;, <italic>Y</italic><sub><italic>mj</italic></sub>)) denotes the interaction profile of disease <italic>d</italic><sub><italic>j</italic></sub> and the Gaussian interaction profile kernel similarity between disease <italic>d</italic><sub><italic>i</italic></sub> and <italic>d</italic><sub><italic>j</italic></sub> can be defined as follows:</p>
<disp-formula id="E3"><label>(3)</label><mml:math id="M4"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msup><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:msup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mo class="qopname">exp</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:mi>Y</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>-</mml:mo><mml:mi>Y</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>|</mml:mo><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where</p>
<disp-formula id="E4"><label>(4)</label><mml:math id="M5"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi><mml:mi>&#x02032;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>/</mml:mo><mml:mrow><mml:mo stretchy="true">(</mml:mo><mml:mrow><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:mfrac><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:msup><mml:mrow><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:mi>Y</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>|</mml:mo><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="true">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Similarly, &#x003B3;<sub><italic>d</italic></sub> is the adjustment coefficient that can be calculated by normalizing another bandwidth parameter &#x003B3;&#x02032;<sub><italic>d</italic></sub>.</p>
</sec>
<sec>
<title>Integrated symptom-based disease similarity</title>
<p>As we have mentioned above, Gaussian interaction profile kernel similarity is used in our model to measure the similarity of microbes and diseases. However, since the Gaussian interaction profile kernel similarity is an association information-based measurement, it is essential to combine more types of microbe or disease similarities based on other available biological information. Indeed, according to different biological data, many researchers have developed their own method to measure the similarity of microbes or diseases. For instance, Zhou et al. (<xref ref-type="bibr" rid="B52">2014</xref>) proposed a model of symptom-based human disease network (HSDN) to measure the disease similarity based on co-occurrence of disease/symptom terms recorded in different literatures. In this work, we implemented HSDN to calculate symptom-based disease similarity (<italic>SDM</italic>) and then constructed a new disease similarity matrix (<italic>S</italic><sup><italic>d</italic></sup>) by integrating <italic>SDM</italic> with S<sup><italic>d</italic>&#x02032;</sup> in an average way according to the study of Chen et al. (<xref ref-type="bibr" rid="B8">2017a</xref>):</p>
<disp-formula id="E5"><label>(5)</label><mml:math id="M7"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msup><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msup><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:msup><mml:mo>&#x0002B;</mml:mo><mml:mi>S</mml:mi><mml:mi>D</mml:mi><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
</sec>
<sec>
<title>Weighted <italic>K</italic> nearest neighbor profiles for microbes and diseases</title>
<p>Due to the fact that values in interaction profiles of microbes or diseases without known associations are all zeros, the prediction performance may be affected to some extent. Considering that, to deal with the above mentioned problem, we came up with a preprocessing step to establish new interaction profiles both for microbes and diseases. For each microbe <italic>m</italic><sub><italic>q</italic></sub>, we first find out its <italic>K</italic> nearest known microbes, each of which must has at least one known association. Next, the similarity information between <italic>m</italic><sub><italic>q</italic></sub> and its <italic>K</italic> nearest known microbes together with the information of their corresponding <italic>K</italic> interaction profiles are combined to calculate the new interaction profile as follows:</p>
<disp-formula id="E6"><label>(6)</label><mml:math id="M8"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>q</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>Q</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:msubsup><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>K</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where</p>
<disp-formula id="E7"><label>(7)</label><mml:math id="M9"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:msup><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mo>&#x0002A;</mml:mo></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>q</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E8"><label>(8)</label><mml:math id="M10"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>Q</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mn>1</mml:mn><mml:mo>&#x02264;</mml:mo><mml:mi>i</mml:mi><mml:mo>&#x02264;</mml:mo><mml:mi>K</mml:mi></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>q</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Here, <italic>m</italic><sub>1</sub> to <italic>m</italic><sub><italic>K</italic></sub> denote the <italic>K</italic> nearest known microbes of <italic>m</italic><sub><italic>q</italic></sub> which were sorted in descending order based on the similarity values between them. The function of the weight coefficient <italic>w</italic><sub><italic>i</italic></sub> is that the corresponding similarity value is assigned higher weight if <italic>m</italic><sub><italic>i</italic></sub> is more similar to <italic>m</italic><sub><italic>q</italic></sub>. Besides, &#x003B1; is a decay term whose value is in the range of [0,1] and <italic>Q</italic><sub><italic>m</italic></sub> is the normalization term.</p>
<p>In a similar way, the new interaction profile for each disease <italic>d</italic><sub><italic>p</italic></sub> can be defined as follows:</p>
<disp-formula id="E9"><label>(9)</label><mml:math id="M11"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>Q</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:msubsup><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>K</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E10"><label>(10)</label><mml:math id="M12"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:msup><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mo>&#x0002A;</mml:mo></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E11"><label>(11)</label><mml:math id="M13"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>Q</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mn>1</mml:mn><mml:mo>&#x02264;</mml:mo><mml:mi>j</mml:mi><mml:mo>&#x02264;</mml:mo><mml:mi>K</mml:mi></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>After calculating the new interaction profiles from microbe perspective and disease perspective, we combine <italic>Y</italic><sub><italic>m</italic></sub> and <italic>Y</italic><sub><italic>d</italic></sub> as follows:</p>
<disp-formula id="E12"><label>(12)</label><mml:math id="M14"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>/</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>a</italic><sub>1</sub> and <italic>a</italic><sub>2</sub> are two weight coefficient and both of them are set to 1 for simplicity.</p>
<p>Finally, to replace the element <italic>Y</italic>(<italic>m</italic><sub><italic>i</italic></sub>, <italic>d</italic><sub><italic>j</italic></sub>) &#x0003D; 0 with an associated likelihood score, we use the following equation to update the original adjacency matrix <italic>Y</italic>.</p>
<disp-formula id="E13"><label>(13)</label><mml:math id="M15"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mi>Y</mml:mi><mml:mo>=</mml:mo><mml:mo class="qopname">max</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>Y</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
</sec>
<sec>
<title>GRNMF</title>
<p>As a common method, the purpose of the standard NMF is to find two non-negative matrices whose product is an optimal approximation to the original matrix (Sotiras et al., <xref ref-type="bibr" rid="B39">2015</xref>; Xu et al., <xref ref-type="bibr" rid="B48">2015</xref>). Therefore, the adjacency matrix <italic>Y</italic> &#x02208; <italic>R</italic><sup><italic>m</italic>&#x0002A;<italic>n</italic></sup> can be decomposed into two parts after implementing NMF, namely, <italic>W</italic> &#x02208; <italic>R</italic><sup><italic>m</italic>&#x0002A;<italic>k</italic></sup> and <italic>H</italic> &#x02208; <italic>R</italic><sup><italic>n</italic>&#x0002A;<italic>k</italic></sup> (<italic>Y</italic> &#x02248; <italic>WH</italic><sup>T</sup>). Accordingly, we can further get the following standard optimization problem:</p>
<disp-formula id="E14"><label>(14)</label><mml:math id="M16"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mstyle displaystyle="true"><mml:munder><mml:mrow><mml:mo class="qopname">min</mml:mo></mml:mrow><mml:mrow><mml:mi>W</mml:mi><mml:mo>,</mml:mo><mml:mi>H</mml:mi></mml:mrow></mml:munder></mml:mstyle><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:mi>Y</mml:mi><mml:mo>-</mml:mo><mml:mi>W</mml:mi><mml:msup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup><mml:mo>|</mml:mo><mml:msubsup><mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mo class="qopname">F</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>&#x0002B;</mml:mo><mml:mi>L</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>W</mml:mi><mml:mo>,</mml:mo><mml:mi>H</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>L</italic>(<italic>W, H</italic>) is a regularization term to prevent overfitting.</p>
<p>Here, motivated by the study of Xiao et al. (<xref ref-type="bibr" rid="B47">2017</xref>) and the standard NMF framework, we introduced other two terms, the Tikhonov (<italic>L</italic><sub>2</sub>) (Guan et al., <xref ref-type="bibr" rid="B22">2011</xref>) and graph Laplacian regularization (Cai et al., <xref ref-type="bibr" rid="B6">2011</xref>), to predict microbe-disease associations. The utilizing of Tikhonov regularization aims to obtain a smooth solution (<italic>W</italic> and <italic>H</italic>), while the purpose of introducing graph regularization is to ensure a part-based representation through taking full advantage of the data geometric structure. Thus, we can construct the optimization problem of GRNMF as follows:</p>
<disp-formula id="E15"><label>(15)</label><mml:math id="M17"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mstyle displaystyle="true"><mml:munder><mml:mrow><mml:mo class="qopname">min</mml:mo></mml:mrow><mml:mrow><mml:mi>W</mml:mi><mml:mo>,</mml:mo><mml:mi>H</mml:mi></mml:mrow></mml:munder></mml:mstyle><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:mi>Y</mml:mi><mml:mo>-</mml:mo><mml:mi>W</mml:mi><mml:msup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup><mml:mo>|</mml:mo><mml:msubsup><mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mo class="qopname">F</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mtext class="textrm" mathvariant="normal">&#x0002B;</mml:mtext><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mtext class="textrm" mathvariant="normal">(</mml:mtext><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:mi>W</mml:mi><mml:mo>|</mml:mo><mml:msubsup><mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mo class="qopname">F</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mtext class="textrm" mathvariant="normal">&#x0002B;</mml:mtext><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:mi>H</mml:mi><mml:mo>|</mml:mo><mml:msubsup><mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mo class="qopname">F</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mtext class="textrm" mathvariant="normal">)</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext class="textrm" mathvariant="normal">&#x0002B;</mml:mtext><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>p</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:msup><mml:mrow><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:msub><mml:mrow><mml:msup><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x0002A;</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>p</mml:mi></mml:mrow></mml:msub><mml:mtext class="textrm" mathvariant="normal">&#x0002B;</mml:mtext><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>q</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:msup><mml:mrow><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:msub><mml:mrow><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:mi>q</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:msub><mml:mrow><mml:msup><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x0002A;</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>q</mml:mi></mml:mrow></mml:msub><mml:mtext>&#x000A0;</mml:mtext><mml:mi>s</mml:mi><mml:mo>.</mml:mo><mml:mi>t</mml:mi><mml:mo>.</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mi>W</mml:mi><mml:mo>&#x02265;</mml:mo><mml:mtext class="textrm" mathvariant="normal">0,</mml:mtext><mml:mi>H</mml:mi><mml:mo>&#x02265;</mml:mo><mml:mtext class="textrm" mathvariant="normal">0</mml:mtext></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Here, &#x003BB;<sub><italic>l</italic></sub>, &#x003BB;<sub><italic>m</italic></sub> and &#x003BB;<sub><italic>d</italic></sub> are the corresponding regularization coefficients. Besides,<italic>w</italic><sub><italic>i</italic></sub> and <italic>h</italic><sub><italic>j</italic></sub> are defined as <italic>i</italic>th rows of <italic>W</italic> and <italic>j</italic>th rows of <italic>H</italic>, respectively. In order to avoid negative affects to the prediction performance of our model, we introduced sparse weight matrices of <italic>S</italic><sup><italic>d</italic></sup><sup>&#x0002A;</sup> and <italic>S</italic><sup><italic>m</italic></sup><sup>&#x0002A;</sup>that are constructed on the basis of the geometrical information of disease and microbe data spaces (<italic>S</italic><sup><italic>d</italic></sup> and <italic>S</italic><sup><italic>m</italic></sup>), respectively. Then, Equation (14) can be transformed into:</p>
<disp-formula id="E17"><label>(16)</label><mml:math id="M19"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mstyle displaystyle="true"><mml:munder><mml:mrow><mml:mo class="qopname">min</mml:mo></mml:mrow><mml:mrow><mml:mi>W</mml:mi><mml:mo>,</mml:mo><mml:mi>H</mml:mi></mml:mrow></mml:munder></mml:mstyle><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:mi>Y</mml:mi><mml:mo>-</mml:mo><mml:mi>W</mml:mi><mml:msup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup><mml:mo>|</mml:mo><mml:msubsup><mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mo class="qopname">F</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mtext class="textrm" mathvariant="normal">&#x0002B;</mml:mtext><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mtext class="textrm" mathvariant="normal">(</mml:mtext><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:mi>W</mml:mi><mml:mo>|</mml:mo><mml:msubsup><mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mo class="qopname">F</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mtext class="textrm" mathvariant="normal">&#x0002B;</mml:mtext><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:mi>H</mml:mi><mml:mo>|</mml:mo><mml:msubsup><mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mo class="qopname">F</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mtext class="textrm" mathvariant="normal">)</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext class="textrm" mathvariant="normal">&#x0002B;</mml:mtext><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mtext class="textrm" mathvariant="normal">(</mml:mtext><mml:msup><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mi>W</mml:mi><mml:mtext class="textrm" mathvariant="normal">)&#x0002B;</mml:mtext><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mtext>T</mml:mtext></mml:mrow></mml:msup><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mi>H</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mtext>&#x000A0;</mml:mtext><mml:mi>s</mml:mi><mml:mo>.</mml:mo><mml:mi>t</mml:mi><mml:mo>.</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mi>W</mml:mi><mml:mo>&#x02265;</mml:mo><mml:mtext class="textrm" mathvariant="normal">0,</mml:mtext><mml:mi>H</mml:mi><mml:mo>&#x02265;</mml:mo><mml:mtext class="textrm" mathvariant="normal">0</mml:mtext></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>Tr</italic>(&#x02022;) represents the trace of a matrix. Here, <italic>L</italic><sub><italic>m</italic></sub> and <italic>L</italic><sub><italic>d</italic></sub> are the corresponding graph Laplacian matrices for <italic>S</italic><sup><italic>m</italic></sup><sup>&#x0002A;</sup> and <italic>S</italic><sup><italic>d</italic></sup><sup>&#x0002A;</sup> that can be calculated as follows:</p>
<disp-formula id="E19"><label>(17)</label><mml:math id="M21"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mi>L</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>D</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>&#x02212;</mml:mo><mml:msup><mml:mi>S</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mo>&#x0002A;</mml:mo></mml:mrow></mml:msup></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E190"><label>(18)</label><mml:math id="M210"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mi>L</mml:mi><mml:mi>d</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>D</mml:mi><mml:mi>d</mml:mi></mml:msub><mml:mo>&#x02212;</mml:mo><mml:msup><mml:mi>S</mml:mi><mml:mrow><mml:mi>d</mml:mi><mml:mo>&#x0002A;</mml:mo></mml:mrow></mml:msup></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>D</italic><sub><italic>m</italic></sub> and <italic>D</italic><sub><italic>d</italic></sub> are the diagonal matrices whose entries are row (or column) sums of <italic>S</italic><sup><italic>m</italic></sup><sup>&#x0002A;</sup> and <italic>S</italic><sup><italic>d</italic></sup><sup>&#x0002A;</sup>, respectively.</p>
<p>Based on the information of the nearest neighbor graph on a scatter of data points, researchers came up with a conclusion that local geometric structure is able to be effectively modeled (Cai et al., <xref ref-type="bibr" rid="B6">2011</xref>; Li et al., <xref ref-type="bibr" rid="B29">2017</xref>). Since microbes or diseases appearing in the same cluster are more likely to behave similarly, according to the above conclusion, we construct the graph matrices <italic>S</italic><sup><italic>m</italic></sup><sup>&#x0002A;</sup> and <italic>S</italic><sup><italic>d</italic></sup><sup>&#x0002A;</sup> in terms of microbe space and disease space respectively on the basis of the <italic>p</italic> nearest neighbors and corresponding clustering information. Here, we use the ClusterONE method (Nepusz et al., <xref ref-type="bibr" rid="B32">2012</xref>) to construct the graph <italic>S</italic><sup><italic>m</italic></sup><sup>&#x0002A;</sup>from microbe space, in which the weight matrix <italic>X</italic><sup><italic>m</italic></sup> is generated based on the microbe similarity matrix <italic>S</italic><sup><italic>m</italic></sup> as follows:</p>
<disp-formula id="E20"><label>(19)</label><mml:math id="M22"><mml:mtable class="eqnarray" columnalign="left"><mml:mrow><mml:msubsup><mml:mi>X</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mi>m</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mtable columnalign='left'><mml:mtr columnalign='left'><mml:mtd columnalign='left'><mml:mrow><mml:mn>1</mml:mn><mml:mtext>&#x000A0;&#x000A0;&#x000A0;</mml:mtext><mml:mi>i</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>N</mml:mi><mml:mtext>(</mml:mtext><mml:msub><mml:mi>m</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mtext>)&#x000A0;&#x0200B;&#x0200B;</mml:mtext><mml:mo>&#x00026;</mml:mo><mml:mtext>&#x0200B;&#x0200B;&#x000A0;</mml:mtext><mml:mi>j</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>N</mml:mi><mml:mtext>(</mml:mtext><mml:msub><mml:mi>m</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mtext>),</mml:mtext><mml:msub><mml:mi>m</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>&#x02208;</mml:mo><mml:mi>C</mml:mi></mml:mrow></mml:mtd></mml:mtr><mml:mtr columnalign='left'><mml:mtd columnalign='left'><mml:mrow><mml:mn>0</mml:mn><mml:mtext>&#x000A0;&#x000A0;</mml:mtext><mml:mi>i</mml:mi><mml:mo>&#x02209;</mml:mo><mml:mi>N</mml:mi><mml:mtext>(</mml:mtext><mml:msub><mml:mi>m</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mtext>)&#x000A0;&#x0200B;&#x0200B;</mml:mtext><mml:mo>&#x00026;</mml:mo><mml:mtext>&#x0200B;&#x0200B;&#x000A0;</mml:mtext><mml:mi>j</mml:mi><mml:mo>&#x02209;</mml:mo><mml:mi>N</mml:mi><mml:mtext>(</mml:mtext><mml:msub><mml:mi>m</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mtext>),</mml:mtext><mml:msub><mml:mi>m</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>&#x02209;</mml:mo><mml:mi>C</mml:mi></mml:mrow></mml:mtd></mml:mtr><mml:mtr columnalign='left'><mml:mtd columnalign='left'><mml:mrow><mml:mn>0.5</mml:mn><mml:mtext>&#x000A0;</mml:mtext><mml:mi>o</mml:mi><mml:mi>t</mml:mi><mml:mi>h</mml:mi><mml:mi>e</mml:mi><mml:mi>r</mml:mi><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mi>s</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:mtable></mml:math></disp-formula>
<p>where <italic>N</italic>(<italic>m</italic><sub><italic>i</italic></sub>) and <italic>N</italic>(<italic>m</italic><sub><italic>j</italic></sub>) are the sets of <italic>p</italic> nearest neighbors of <italic>m</italic><sub><italic>i</italic></sub> and <italic>m</italic><sub><italic>j</italic></sub>, respectively. <italic>C</italic> denotes to any one of the clusters obtained by ClusterONE method and we define the graph matrix <italic>S</italic><sup><italic>m</italic></sup><sup>&#x0002A;</sup> for microbes as follows:</p>
<disp-formula id="E21"><label>(20)</label><mml:math id="M23"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mo>&#x02200;</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mtext>&#x000A0;</mml:mtext><mml:msubsup><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x0002A;</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msubsup><mml:msubsup><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msubsup></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>In a similar way as the computation of <italic>S</italic><sup><italic>m</italic>&#x0002A;</sup>, we calculate the graph matrix <italic>S</italic><sup><italic>d</italic></sup>&#x0002A; according to the disease similarity matrix <italic>S</italic><sup><italic>d</italic></sup>.</p>
<p>Here, we defined &#x003A6; &#x0003D; [ &#x003C6;<sub><italic>ik</italic></sub> ] and &#x003A8; &#x0003D; [ &#x003C8;<sub><italic>jk</italic></sub> ] as the Lagrange multipliers for the constrains <italic>w</italic><sub><italic>ik</italic></sub> &#x02265; 0 and <italic>h</italic><sub><italic>jk</italic></sub> &#x02265; 0, respectively. In this work, we first convert the optimization problem in Equation (15) to an unconstraint problem, then minimize this problem by utilizing the corresponding Lagrange function <italic>L</italic><sub><italic>f</italic></sub> as follows:</p>
<disp-formula id="E22"><label>(21)</label><mml:math id="M24"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>Y</mml:mi><mml:msup><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>-</mml:mo><mml:mn>2</mml:mn><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>Y</mml:mi><mml:mi>H</mml:mi><mml:msup><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>W</mml:mi><mml:msup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup><mml:mi>H</mml:mi><mml:msup><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>W</mml:mi><mml:msup><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mo>&#x0002B;</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>H</mml:mi><mml:msup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mi>W</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mi>H</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mo>&#x0002B;</mml:mo></mml:mtd><mml:mtd><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>&#x003A6;</mml:mi><mml:msup><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mtext>&#x003A8;</mml:mtext><mml:msup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>To solve the above problem, we first calculate the partial derivatives with respect to <italic>W</italic> and <italic>H</italic> as follows:</p>
<disp-formula id="E24"><label>(22)</label><mml:math id="M26"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mfrac><mml:mrow><mml:mi>&#x02202;</mml:mi><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>&#x02202;</mml:mi><mml:mi>W</mml:mi></mml:mrow></mml:mfrac><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn>2</mml:mn><mml:mi>Y</mml:mi><mml:mi>H</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>2</mml:mn><mml:mi>W</mml:mi><mml:msup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup><mml:mi>H</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>2</mml:mn><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mi>W</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>2</mml:mn><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mi>W</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mi>&#x003A6;</mml:mi></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E25"><label>(23)</label><mml:math id="M27"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mfrac><mml:mrow><mml:mi>&#x02202;</mml:mi><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>&#x02202;</mml:mi><mml:mi>H</mml:mi></mml:mrow></mml:mfrac><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn>2</mml:mn><mml:msup><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup><mml:mi>W</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>2</mml:mn><mml:mi>H</mml:mi><mml:msup><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup><mml:mi>W</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>2</mml:mn><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mi>H</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>2</mml:mn><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mi>H</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mtext>&#x003A8;</mml:mtext></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>After using the Karush-Kuhn-Tucker (KKT) conditions of &#x003C6;<sub><italic>ik</italic></sub><italic>w</italic><sub><italic>ik</italic></sub> &#x0003D; 0 and &#x003C8;<sub><italic>jk</italic></sub><italic>h</italic><sub><italic>jk</italic></sub> &#x0003D; 0 (Facchinei et al., <xref ref-type="bibr" rid="B17">2014</xref>), we can obtain the equations for <italic>w</italic><sub><italic>ik</italic></sub> and <italic>h</italic><sub><italic>jk</italic></sub> as follows:</p>
<disp-formula id="E26"><label>(24)</label><mml:math id="M28"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mo>-</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>Y</mml:mi><mml:mi>H</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>W</mml:mi><mml:msup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup><mml:mi>H</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mi>W</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mo>&#x0002B;</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mrow><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msup><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x0002A;</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E28"><label>(25)</label><mml:math id="M30"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mo>-</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup><mml:mi>W</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>H</mml:mi><mml:msup><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup><mml:mi>W</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mi>H</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mo>&#x0002B;</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mrow><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msup><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x0002A;</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Finally, on the basis of the above two equations, we can get the updating rules for <italic>w</italic><sub><italic>ik</italic></sub> and <italic>h</italic><sub><italic>jk</italic></sub> as follows:</p>
<disp-formula id="E30"><label>(26)</label><mml:math id="M32"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02190;</mml:mo><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mfrac><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>Y</mml:mi><mml:mi>H</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x0002A;</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msup><mml:mi>W</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>W</mml:mi><mml:msup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup><mml:mi>H</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mi>W</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mi>W</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E31"><label>(27)</label><mml:math id="M33"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02190;</mml:mo><mml:msub><mml:mrow><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mfrac><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup><mml:mi>W</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x0002A;</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msup><mml:mi>H</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>H</mml:mi><mml:msup><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mo class="qopname">T</mml:mo></mml:mrow></mml:msup><mml:mi>W</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mi>H</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BB;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mi>H</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Based on the above two updating formulas, we can obtain the final two non-negative matrices <italic>W</italic> and <italic>H</italic> until convergence. Subsequently, we calculate the score matrix <italic>Y</italic><sup>&#x0002A;</sup> for microbe-disease pairs by utilizing <italic>Y</italic><sup>&#x0002A;</sup> &#x0003D; <italic>WH</italic><sup>T</sup>, in which the higher score of a microbe-disease pair indicates that the microbe is more likely to be associated with the corresponding disease. In addition, for better understanding, we provided the pseudocode of the whole GRNMF algorithm (See Figure <xref ref-type="fig" rid="F2">2</xref>).</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>The pseudocode of the whole GRNMF algorithm.</p></caption>
<graphic xlink:href="fmicb-09-02560-g0002.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>Performance evaluation</title>
<p>Cross validation, a widely used assessment method, was introduced to evaluate the prediction performance of GRNMFHMDA. In this study, we utilized two types of cross validations, namely, global LOOCV and local LOOCV. For the global LOOCV, each of the known microbe-disease associations was in turn considered to be the test sample while the remaining known associations were treated as the training samples. Besides, all of the unknown microbe-disease pairs were regarded as the candidate samples which would be used in the ranking process. After implementing GRNMFHMDA, we ranked each test sample with all candidate samples according to their predicted scores. As for local LOOCV, the difference is that the test sample was only ranked with the candidate samples involving the investigated disease.</p>
<p>In each cross validation process, we would consider that the test sample was successfully predicted if the ranking of the test sample was higher than the given threshold. Further, based on the ranks of all test samples, we drew a receiver operating characteristic (ROC) curve through calculating the ratio between true positive rate (TPR, sensitivity) and false positive rate (FPR, 1-specificity) under different thresholds both for global LOOCV and local LOOCV. Sensitivity meant the ratio between the number of test samples ranking higher than the given threshold and the number of positive samples (known microbe-disease associations), while 1-specificity denoted the percentage of the number of negative microbe-disease pairs whose ranks were lower than the given threshold. Moreover, area under the ROC curve (AUC) was calculated to make quantitative evaluation for our model&#x00027;s prediction performance. The model would be considered to be able to perfectly predict all associations if the value of AUC equaled to 1, while the model was only supposed to be able to make random prediction if the value of AUC equaled to 0.5. As a result, GRNMFHMDA obtained AUCs of 0.8715 and 0.7898 in global LOOCV and local LOOCV, respectively. Furthermore, the prediction performance of our model outperformed the KATZHMDA both in global LOOCV (0.8644) and local LOOCV (0.6998), which proved the superior accuracy and reliability of our model in predicting microbe-disease associations (See Figure <xref ref-type="fig" rid="F3">3</xref>).</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>The comparison of prediction performance between GRNMFHMDA and the classical model of KATZHMDA both in global LOOCV and local LOOCV. As a result, GRNMFHMDA achieved AUCs of 0.8715 and 0.7898 in the global and local LOOCV, which exceed the first computational model of KATZHMDA in the field of microbe-disease association prediction.</p></caption>
<graphic xlink:href="fmicb-09-02560-g0003.tif"/>
</fig>
</sec>
<sec>
<title>Case study</title>
<p>Here, we put forward two types of case studies on three different common human diseases with the purpose of further assessing the prediction performance of GRNMFHMDA. On the basis of the known microbe-disease associations in HMDAD, we implemented GRNMFHMDA to predict disease-related microbes and then validated the top 10 predicted microbes by HMDAD or recent literatures.</p>
<p>Asthma, a common long-term inflammatory disease of the airways of the lungs, often starts during childhood and its average number of deaths and death rates (per 100,000 people) respectively reached to 38 and 0.1 in 2016 in the World Health Organization (WHO) European region among 10&#x02013;14 years old children (Kyu et al., <xref ref-type="bibr" rid="B27">2018</xref>). Here, under the GRNMFHMDA framework, asthma was treated as an investigated disease to explore its potential associated microbes. As a result, 9 out of the top 10 microbes in the prediction list were confirmed to be associated with asthma by experimental literatures (See Table <xref ref-type="table" rid="T1">1</xref>). For example, <italic>Lactobacillus casei rhamnosus</italic> Lcr35, a species of <italic>Lactobacillus</italic> (1st in the prediction list), was found to be able to attenuate airway inflammation and hyperreactivity in a mouse model of asthma through oral treatment before sensitization (Yu et al., <xref ref-type="bibr" rid="B50">2010</xref>). Besides, Ding et al. (<xref ref-type="bibr" rid="B16">2018</xref>) discovered that exosomes derived by <italic>Pseudomonas</italic> (2nd in the prediction list) <italic>aeruginosa</italic> could induce protection against allergic sensitization in asthma mice. Another example is that there is a distinct alteration of the sputum microbiota with a greater prominence of <italic>Firmicutes</italic> (4th in the prediction list) in severe asthma (Zhang et al., <xref ref-type="bibr" rid="B51">2016</xref>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Prediction list of the top 10 potential asthma-related microbes based on the known associations in HMDAD database and the corresponding validation evidences (experimental literatures in PubMed) for these associations.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Rank</bold></th>
<th valign="top" align="left"><bold>Microbe</bold></th>
<th valign="top" align="left"><bold>Evidence</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="left">Lactobacillus</td>
<td valign="top" align="left">PMID: 20592920</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="left">Pseudomonas</td>
<td valign="top" align="left">PMID: 29795208</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="left">Burkholderia</td>
<td valign="top" align="left">unconfirmed</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="left">Firmicutes</td>
<td valign="top" align="left">PMID: 27078029</td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="left">Actinobacteria</td>
<td valign="top" align="left">PMID: 23265859</td>
</tr>
<tr>
<td valign="top" align="left">6</td>
<td valign="top" align="left">Clostridium coccoides</td>
<td valign="top" align="left">PMID: 21477358</td>
</tr>
<tr>
<td valign="top" align="left">7</td>
<td valign="top" align="left">Streptococcus</td>
<td valign="top" align="left">PMID: 17928596</td>
</tr>
<tr>
<td valign="top" align="left">8</td>
<td valign="top" align="left">Clostridia</td>
<td valign="top" align="left">PMID: 22047069</td>
</tr>
<tr>
<td valign="top" align="left">9</td>
<td valign="top" align="left">Lachnospiraceae</td>
<td valign="top" align="left">PMID: 26512904</td>
</tr>
<tr>
<td valign="top" align="left">10</td>
<td valign="top" align="left">Fusobacterium</td>
<td valign="top" align="left">PMID: 24024497</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Obesity, a medical condition in which accumulated excess body fat reaches a certain level that may have a negative effect on health, is a leading preventable cause of death worldwide (Reinier and Chugh, <xref ref-type="bibr" rid="B35">2015</xref>). In recent years, plenty of studies have shown certain associations between obesity and microbes that helps a lot to the prevention and treatment of obesity. For instance, many researchers have demonstrated that <italic>methanogens</italic> play a specific role in weight gain and the development of obesity in human subjects (Armougom et al., <xref ref-type="bibr" rid="B1">2009</xref>; Krajmalnik-Brown et al., <xref ref-type="bibr" rid="B26">2012</xref>). Not only that, many studies have now been conducted into the potential of probiotics to ameliorate obesity and diabetes (Delzenne et al., <xref ref-type="bibr" rid="B15">2011</xref>; Peterson et al., <xref ref-type="bibr" rid="B34">2015</xref>). Therefore, taking obesity as another investigated disease in the first type of case study, we found that 9 out of the top 10 predicted obesity-related microbes were confirmed by experimental literatures (See Table <xref ref-type="table" rid="T2">2</xref>). For the phylum <italic>Proteobacteria</italic> (1st in the prediction list) which belongs to gram-negative bacteria, the existing study already discovered that it was abundant in the obese group compared with lean group (Park et al., <xref ref-type="bibr" rid="B33">2015</xref>). Besides, as a species of <italic>Clostridia</italic> (2nd in the prediction list), the presence of <italic>Clostridium ramosum</italic> in simplified human intestinal (SIHUMI) enhanced diet-induced obesity according to the experiment data of Woting et al. (<xref ref-type="bibr" rid="B46">2014</xref>). Moreover, <italic>Bacillus</italic>, a genus of <italic>Clostridia</italic> (3rd in the prediction list), was found to have outgrown dramatically in the obesity group by Gao et al. (<xref ref-type="bibr" rid="B19">2018</xref>).</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Prediction list of the top 10 potential obesity-related microbes based on the known associations in HMDAD database and the corresponding validation evidences (experimental literatures in PubMed) for these associations.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Rank</bold></th>
<th valign="top" align="left"><bold>Microbe</bold></th>
<th valign="top" align="left"><bold>Evidence</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="left">Proteobacteria</td>
<td valign="top" align="left">PMID: 25407880</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="left">Clostridia</td>
<td valign="top" align="left">PMID: 25271283</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="left">Bacilli</td>
<td valign="top" align="left">PMID: 29280312</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="left">Faecalibacterium prausnitzii</td>
<td valign="top" align="left">PMID: 19849869</td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="left">Clostridium</td>
<td valign="top" align="left">PMID: 23645850</td>
</tr>
<tr>
<td valign="top" align="left">6</td>
<td valign="top" align="left">Betaproteobacteria</td>
<td valign="top" align="left">PMID: 29312822</td>
</tr>
<tr>
<td valign="top" align="left">7</td>
<td valign="top" align="left">Clostridium coccoides</td>
<td valign="top" align="left">PMID: 23147032</td>
</tr>
<tr>
<td valign="top" align="left">8</td>
<td valign="top" align="left">Lactobacillus</td>
<td valign="top" align="left">PMID: 23056479</td>
</tr>
<tr>
<td valign="top" align="left">9</td>
<td valign="top" align="left">Fusobacterium nucleatum</td>
<td valign="top" align="left">unconfirmed</td>
</tr>
<tr>
<td valign="top" align="left">10</td>
<td valign="top" align="left">Prevotella</td>
<td valign="top" align="left">PMID: 21695273</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>More than that, in order to facilitate future researchers to study the disease-related microbes that they are interested in, based on the known associations in HMDAD, we provided the whole prediction list including all pairs between 292 microbes and 39 diseases as well as their predicted association scores (See Supplementary Table <xref ref-type="supplementary-material" rid="SM1">1</xref>).</p>
<p>In addition, to prove the predictive applicability of our model on new diseases without known associated microbes, we carried out another case study on a disease via removing all its known associations in HMDAD. In this way, the prediction process of seeking the investigated disease-related microbes can only depend on the information of other known microbe-disease associations (training samples) and the relevant similarity measures. What needs to be emphasized is that only candidate samples (all microbe-disease pairs including the investigated disease) were ranked and then verified in HMDAD. Hence, there was no overlap between training samples and prediction list. In other words, the verification of predicted associations was independent of HMDAD. Type 1 diabetes, a form of diabetes mellitus, is believed to involve a combination of genetic and environmental factors such as dietary agents (Serena et al., <xref ref-type="bibr" rid="B37">2015</xref>), viral infections (Rewers and Ludvigsson, <xref ref-type="bibr" rid="B36">2016</xref>) and gut microbiota (Bibb&#x000F2; et al., <xref ref-type="bibr" rid="B2">2017</xref>). Especially in gut microbiota, the previous study confirmed that the genus <italic>Bacteroides</italic> is the largest representative of type 1 diabetes-associated dysbiosis that can be modulated by diet (Mejj&#x000ED;a-Le&#x000F3;n and Barca, <xref ref-type="bibr" rid="B31">2015</xref>). Thus, considering the significance of studying type 1 diabetes-related microbes, we took type 1 diabetes as the investigated disease to predict its potential associated microbes under the framework of the second type of case study. After implementing GRNMFHMDA, we obtained the ranks of type 1 diabetes&#x00027; candidate microbes in terms of their association scores (See Table <xref ref-type="table" rid="T3">3</xref>). As a result, 8 out of the top 10 predictions were confirmed by HMDAD or recent literatures. For example, Giongo et al. (<xref ref-type="bibr" rid="B20">2011</xref>) demonstrated that the <italic>Clostridia</italic> (1st in the prediction list) sequences increased in control samples (samples of general population) as the abundance of <italic>Clostridia</italic> decreased overtime in the case samples (samples of patients with type 1 diabetes). Moreover, at the phylum level and at <italic>p</italic>-values &#x0003C;0.001, <italic>Proteobacteria</italic> (2nd in the prediction list) was found to be higher in case samples than that in control samples (Brown et al., <xref ref-type="bibr" rid="B5">2011</xref>). Another example is that <italic>Lactobacillus strains</italic> (a species of <italic>Lactobacillus</italic> ranking 4th in the prediction list) was found to be able to induce specific changes in the immune system of non-obese diabetic (NOD) mice that can increase or decrease diabetes (Brown et al., <xref ref-type="bibr" rid="B5">2011</xref>).</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>Prediction list of the top 10 potential type 1 diabetes-related microbes via removing all the known type 1 diabetes-microbe associations in HMDAD database.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Rank</bold></th>
<th valign="top" align="left"><bold>Microbe</bold></th>
<th valign="top" align="left"><bold>Evidence</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="left">Clostridia</td>
<td valign="top" align="left">confirmed by HMDAD</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="left">Proteobacteria</td>
<td valign="top" align="left">confirmed by HMDAD</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="left">Clostridium coccoides</td>
<td valign="top" align="left">unconfirmed</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="left">Lactobacillus</td>
<td valign="top" align="left">confirmed by HMDAD</td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="left">Bacteroidetes</td>
<td valign="top" align="left">confirmed by HMDAD</td>
</tr>
<tr>
<td valign="top" align="left">6</td>
<td valign="top" align="left">Firmicutes</td>
<td valign="top" align="left">confirmed by HMDAD</td>
</tr>
<tr>
<td valign="top" align="left">7</td>
<td valign="top" align="left">Faecalibacterium prausnitzii</td>
<td valign="top" align="left">PMID: 23934614</td>
</tr>
<tr>
<td valign="top" align="left">8</td>
<td valign="top" align="left">Clostridium</td>
<td valign="top" align="left">PMID: 23433344</td>
</tr>
<tr>
<td valign="top" align="left">9</td>
<td valign="top" align="left">Betaproteobacteria</td>
<td valign="top" align="left">unconfirmed</td>
</tr>
<tr>
<td valign="top" align="left">10</td>
<td valign="top" align="left">Bacilli</td>
<td valign="top" align="left">PMID: 24930037</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>The validation evidences denote to whether the predicted associations were confirmed by the HMDAD database or experimental literatures in PubMed</italic>.</p>
</table-wrap-foot>
</table-wrap>
<p>According to the results presented, GRNMFHMDA consistently achieved an excellent predictive performance in the two types of case studies. With the continuous experimental research on microbe-disease associations, we expect that more and more microbes in the prediction lists generated by our model would be verified in the future.</p>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>In this article, we proposed a novel prediction model of GRNMFHMDA based on the known microbe-disease associations in HMDAD, Gaussian interaction profile kernel similarity of microbes and diseases and symptom-based disease similarity. To eliminate the possible problem caused by unknown microbe-disease pairs that may affect our final prediction performance, we first implemented a preprocessing step to establish new interaction profiles both for microbes and diseases. Then, after introducing Tikhonov (<italic>L</italic><sub>2</sub>) and graph Laplacian regularization under the standard NMF framework, we finally obtained reliable and satisfactory prediction performance both in LOOCV and case studies. Therefore, we can conclude that our prediction model is able to play critical role in revealing the associations between microbes and diseases, thus improving the prevention, diagnosis and treatment of many complex human diseases in the future.</p>
<p>Here, the reason why GRNMFHMDA performed well in microbe-disease association prediction lies in the following facts. Firstly, in the study of Wang et al. (<xref ref-type="bibr" rid="B44">2015</xref>), to model cancer hallmark traits and networks, nodes and links in the network were weighted, and certain scoring functions were developed to represent gene regulatory logics/strengths on networks. Inspired by that, based on the data extracted from the acknowledged databases, we implemented proper and effective measurements to quantify microbe-disease association network, microbe similarity network and disease similarity network, which guaranteed the reliable prediction performance of our model. Secondly, before implementing GRNMF, we constructed new interaction profiles both for microbes and diseases to further assign those unknown microbe-disease pairs with associated likelihood score, which also improved our model&#x00027;s performance in some degree. Thirdly, different from the standard NMF, we introduced Tikhonov (<italic>L</italic><sub>2</sub>) and graph Laplacian regularization that ensured the final two non-negative matrices smoothness and guaranteed a part-based representation via fully exploiting the data geometric structure, respectively.</p>
<p>Nevertheless, here are also some limitations restricting the accuracy of our model that need to be overcome in future studies. Initially, the types of similarities for microbes and diseases are not enough yet and we believe that our model would be significantly improved with more biological data and similarity measurements being taken into consideration. Successful advance in association prediction research in various fields of computational biology would also accelerate the development of effective models for microbe-disease association prediction (Chen and Yan, <xref ref-type="bibr" rid="B13">2013</xref>; Chen et al., <xref ref-type="bibr" rid="B9">2016</xref>, <xref ref-type="bibr" rid="B12">2017b</xref>, <xref ref-type="bibr" rid="B10">2018a</xref>,<xref ref-type="bibr" rid="B11">b</xref>,<xref ref-type="bibr" rid="B14">c</xref>; Chen and Huang, <xref ref-type="bibr" rid="B7">2017</xref>; You et al., <xref ref-type="bibr" rid="B49">2017</xref>). Secondly, as shown in the research of Hao et al. (<xref ref-type="bibr" rid="B23">2018</xref>), three representative genome-scale cellular networks, genome-scale metabolic network (GMN), transcriptional regulatory network (TRN), and signal transduction network (STN), were found to be able to become a necessary tool in the systematic analysis of microbes through network integration. Therefore, whether there are similar molecular networks between two microbes is well worth studying in constructing our prediction model. Thirdly, the selection of the optimal parameters is still worth studying. Finally, GRNMFHMDA would inevitably cause bias to diseases that have more known associated microbes and vice versa. Hence, we would come up with optimization strategies to deal with those limitations in our next work.</p>
</sec>
<sec id="s5">
<title>Author contributions</title>
<p>B-SH conceived the project, developed the prediction method, analyzed the result, and revised the paper. L-HP designed the experiments, implemented the experiments, analyzed the result, and wrote the paper. ZL analyzed the result and revised the paper. All authors read and approved the final manuscript.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</sec>
</body>
<back>
<sec sec-type="supplementary-material" id="s6">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmicb.2018.02560/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmicb.2018.02560/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.xlsx" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Armougom</surname> <given-names>F.</given-names></name> <name><surname>Henry</surname> <given-names>M.</given-names></name> <name><surname>Vialettes</surname> <given-names>B.</given-names></name> <name><surname>Raccah</surname> <given-names>D.</given-names></name> <name><surname>Raoult</surname> <given-names>D.</given-names></name></person-group> (<year>2009</year>). <article-title>Monitoring bacterial community of human gut microbiota reveals an increase in Lactobacillus in obese patients and Methanogens in anorexic patients</article-title>. <source>PLoS ONE</source> <volume>4</volume>:<fpage>e7125</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0007125</pub-id><pub-id pub-id-type="pmid">19774074</pub-id></citation></ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bibb&#x000F2;</surname> <given-names>S.</given-names></name> <name><surname>Dore</surname> <given-names>M. P.</given-names></name> <name><surname>Pes</surname> <given-names>G. M.</given-names></name> <name><surname>Delitala</surname> <given-names>G.</given-names></name> <name><surname>Delitala</surname> <given-names>A. P.</given-names></name></person-group> (<year>2017</year>). <article-title>Is there a role for gut microbiota in type 1 diabetes pathogenesis?</article-title> <source>Ann. Med.</source> <volume>49</volume>, <fpage>11</fpage>&#x02013;<lpage>22</lpage>. <pub-id pub-id-type="doi">10.1080/07853890.2016.1222449</pub-id><pub-id pub-id-type="pmid">27499366</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Binder Gallimidi</surname> <given-names>A.</given-names></name> <name><surname>Fischman</surname> <given-names>S.</given-names></name> <name><surname>Revach</surname> <given-names>B.</given-names></name> <name><surname>Bulvik</surname> <given-names>R.</given-names></name> <name><surname>Maliutina</surname> <given-names>A.</given-names></name> <name><surname>Rubinstein</surname> <given-names>A. M.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Periodontal pathogens <italic>Porphyromonas gingivalis</italic> and <italic>Fusobacterium nucleatum</italic> promote tumor progression in an oral-specific chemical carcinogenesis model</article-title>. <source>Oncotarget</source> <volume>6</volume>, <fpage>22613</fpage>&#x02013;<lpage>22623</lpage>. <pub-id pub-id-type="doi">10.18632/oncotarget.4209</pub-id><pub-id pub-id-type="pmid">26158901</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boleij</surname> <given-names>A.</given-names></name> <name><surname>Hechenbleikner</surname> <given-names>E. M.</given-names></name> <name><surname>Goodwin</surname> <given-names>A. C.</given-names></name> <name><surname>Badani</surname> <given-names>R.</given-names></name> <name><surname>Stein</surname> <given-names>E. M.</given-names></name> <name><surname>Lazarev</surname> <given-names>M. G.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>The <italic>Bacteroides fragilis</italic> toxin gene is prevalent in the colon mucosa of colorectal cancer patients</article-title>. <source>Clin. Infect. Dis.</source> <volume>60</volume>, <fpage>208</fpage>&#x02013;<lpage>215</lpage>. <pub-id pub-id-type="doi">10.1093/cid/ciu787</pub-id><pub-id pub-id-type="pmid">25305284</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>C. T.</given-names></name> <name><surname>Davis-Richardson</surname> <given-names>A. G.</given-names></name> <name><surname>Giongo</surname> <given-names>A.</given-names></name> <name><surname>Gano</surname> <given-names>K. A.</given-names></name> <name><surname>Crabb</surname> <given-names>D. B.</given-names></name> <name><surname>Mukherjee</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Gut microbiome metagenomics analysis suggests a functional model for the development of autoimmunity for type 1 diabetes</article-title>. <source>PLoS ONE</source> <volume>6</volume>:<fpage>e25792</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0025792</pub-id><pub-id pub-id-type="pmid">22043294</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cai</surname> <given-names>D.</given-names></name> <name><surname>He</surname> <given-names>X.</given-names></name> <name><surname>Han</surname> <given-names>J.</given-names></name> <name><surname>Huang</surname> <given-names>T. S.</given-names></name></person-group> (<year>2011</year>). <article-title>graph regularized nonnegative matrix factorization for data representation</article-title>. <source>IEEE Trans. Pattern Anal. Mach. Intell.</source> <volume>33</volume>, <fpage>1548</fpage>&#x02013;<lpage>1560</lpage>. <pub-id pub-id-type="doi">10.1109/TPAMI.2010.231</pub-id><pub-id pub-id-type="pmid">21173440</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Huang</surname> <given-names>L.</given-names></name></person-group> (<year>2017</year>). <article-title>LRSSLMDA: Laplacian Regularized Sparse Subspace Learning for MiRNA-Disease Association prediction</article-title>. <source>PLoS Comput. Biol.</source> <volume>13</volume>:<fpage>e1005912</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1005912</pub-id><pub-id pub-id-type="pmid">29253885</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Huang</surname> <given-names>Y. A.</given-names></name> <name><surname>You</surname> <given-names>Z. H.</given-names></name> <name><surname>Yan</surname> <given-names>G. Y.</given-names></name> <name><surname>Wang</surname> <given-names>X. S.</given-names></name></person-group> (<year>2017a</year>). <article-title>A novel approach based on KATZ measure to predict associations of human microbiota with non-infectious diseases</article-title>. <source>Bioinformatics</source> <volume>33</volume>, <fpage>733</fpage>&#x02013;<lpage>739</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btw715</pub-id><pub-id pub-id-type="pmid">28025197</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Ren</surname> <given-names>B.</given-names></name> <name><surname>Chen</surname> <given-names>M.</given-names></name> <name><surname>Wang</surname> <given-names>Q.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Yan</surname> <given-names>G.</given-names></name></person-group> (<year>2016</year>). <article-title>NLLSS: Predicting Synergistic Drug Combinations Based on Semi-supervised Learning</article-title>. <source>PLoS Comput. Biol.</source> <volume>12</volume>:<fpage>e1004975</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1004975</pub-id><pub-id pub-id-type="pmid">27415801</pub-id></citation></ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Qu</surname> <given-names>J.</given-names></name> <name><surname>Guan</surname> <given-names>N. N.</given-names></name> <name><surname>Li</surname> <given-names>J. Q.</given-names></name></person-group> (<year>2018a</year>). <article-title>Predicting miRNA-disease association based on inductive matrix completion</article-title>. <source>Bioinformatics</source> <pub-id pub-id-type="doi">10.1093/bioinformatics/bty503</pub-id><pub-id pub-id-type="pmid">29939227</pub-id>. [Epub ahead of print].</citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Xie</surname> <given-names>D.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Zhao</surname> <given-names>Q.</given-names></name> <name><surname>You</surname> <given-names>Z. H.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name></person-group> (<year>2018b</year>). <article-title>BNPMDA: Bipartite Network Projection for MiRNA-Disease Association prediction</article-title>. <source>Bioinformatics</source> <volume>34</volume>, <fpage>3178</fpage>&#x02013;<lpage>3186</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bty333</pub-id><pub-id pub-id-type="pmid">29701758</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Xie</surname> <given-names>D.</given-names></name> <name><surname>Zhao</surname> <given-names>Q.</given-names></name> <name><surname>You</surname> <given-names>Z. H.</given-names></name></person-group> (<year>2017b</year>). <article-title>MicroRNAs and complex diseases: from experimental results to computational models</article-title>. <source>Brief. Bioinformatics</source>. <pub-id pub-id-type="doi">10.1093/bib/bbx130</pub-id><pub-id pub-id-type="pmid">29045685</pub-id>. [Epub ahead of print].</citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Yan</surname> <given-names>G. Y.</given-names></name></person-group> (<year>2013</year>). <article-title>Novel human lncRNA-disease association inference based on lncRNA expression profiles</article-title>. <source>Bioinformatics</source> <volume>29</volume>, <fpage>2617</fpage>&#x02013;<lpage>2624</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btt426</pub-id><pub-id pub-id-type="pmid">24002109</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Yin</surname> <given-names>J.</given-names></name> <name><surname>Qu</surname> <given-names>J.</given-names></name> <name><surname>Huang</surname> <given-names>L.</given-names></name></person-group> (<year>2018c</year>). <article-title>MDHGI: Matrix Decomposition and Heterogeneous Graph Inference for miRNA-disease association prediction</article-title>. <source>PLoS Comput. Biol.</source> <volume>14</volume>:<fpage>e1006418</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1006418</pub-id><pub-id pub-id-type="pmid">30142158</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Delzenne</surname> <given-names>N. M.</given-names></name> <name><surname>Neyrinck</surname> <given-names>A. M.</given-names></name> <name><surname>B&#x000E4;ckhed</surname> <given-names>F.</given-names></name> <name><surname>Cani</surname> <given-names>P. D.</given-names></name></person-group> (<year>2011</year>). <article-title>Targeting gut microbiota in obesity: effects of prebiotics and probiotics</article-title>. <source>Nat. Rev. Endocrinol.</source> <volume>7</volume>, <fpage>639</fpage>&#x02013;<lpage>646</lpage>. <pub-id pub-id-type="doi">10.1038/nrendo.2011.126</pub-id><pub-id pub-id-type="pmid">21826100</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ding</surname> <given-names>F. X.</given-names></name> <name><surname>Liu</surname> <given-names>B.</given-names></name> <name><surname>Zou</surname> <given-names>W. J.</given-names></name> <name><surname>Li</surname> <given-names>Q. B.</given-names></name> <name><surname>Tian</surname> <given-names>D. Y.</given-names></name> <name><surname>Fu</surname> <given-names>Z.</given-names></name></person-group> (<year>2018</year>). <article-title><italic>Pseudomonas aeruginosa</italic>-derived exosomes ameliorates allergic reactions via inducing the Treg response in asthma</article-title>. <source>Pediatr. Res.</source> <volume>84</volume>, <fpage>125</fpage>&#x02013;<lpage>133</lpage>. <pub-id pub-id-type="doi">10.1038/s41390-018-0020-1</pub-id><pub-id pub-id-type="pmid">29795208</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Facchinei</surname> <given-names>F.</given-names></name> <name><surname>Kanzow</surname> <given-names>C.</given-names></name> <name><surname>Sagratella</surname> <given-names>S.</given-names></name></person-group> (<year>2014</year>). <article-title>Solving quasi-variational inequalities via their KKT conditions</article-title>. <source>Math. Program.</source> <volume>144</volume>, <fpage>369</fpage>&#x02013;<lpage>412</lpage>. <pub-id pub-id-type="doi">10.1007/s10107-013-0637-0</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Galiana</surname> <given-names>A.</given-names></name> <name><surname>Aguirre</surname> <given-names>E.</given-names></name> <name><surname>Rodriguez</surname> <given-names>J. C.</given-names></name> <name><surname>Mira</surname> <given-names>A.</given-names></name> <name><surname>Sa&#x000F1;tibanez</surname> <given-names>M.</given-names></name> <name><surname>Candela</surname> <given-names>I.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Sputum microbiota in moderate versus severe patients with COPD</article-title>. <source>Eur. Respir. J.</source> <volume>43</volume>, <fpage>1787</fpage>&#x02013;<lpage>1790</lpage>. <pub-id pub-id-type="doi">10.1183/09031936.00191513</pub-id><pub-id pub-id-type="pmid">24311775</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gao</surname> <given-names>R.</given-names></name> <name><surname>Zhu</surname> <given-names>C.</given-names></name> <name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Yin</surname> <given-names>M.</given-names></name> <name><surname>Pan</surname> <given-names>C.</given-names></name> <name><surname>Huang</surname> <given-names>L.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Dysbiosis signatures of gut microbiota along the sequence from healthy, young patients to those with overweight and obesity</article-title>. <source>Obesity</source> <volume>26</volume>, <fpage>351</fpage>&#x02013;<lpage>361</lpage>. <pub-id pub-id-type="doi">10.1002/oby.22088</pub-id><pub-id pub-id-type="pmid">29280312</pub-id></citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Giongo</surname> <given-names>A.</given-names></name> <name><surname>Gano</surname> <given-names>K. A.</given-names></name> <name><surname>Crabb</surname> <given-names>D. B.</given-names></name> <name><surname>Mukherjee</surname> <given-names>N.</given-names></name> <name><surname>Novelo</surname> <given-names>L. L.</given-names></name> <name><surname>Casella</surname> <given-names>G.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Toward defining the autoimmune microbiome for type 1 diabetes</article-title>. <source>ISME J.</source> <volume>5</volume>, <fpage>82</fpage>&#x02013;<lpage>91</lpage>. <pub-id pub-id-type="doi">10.1038/ismej.2010.92</pub-id><pub-id pub-id-type="pmid">20613793</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gold</surname> <given-names>T.</given-names></name></person-group> (<year>1992</year>). <article-title>The deep, hot biosphere</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>89</volume>, <fpage>6045</fpage>&#x02013;<lpage>6049</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.89.13.6045</pub-id><pub-id pub-id-type="pmid">1631089</pub-id></citation></ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guan</surname> <given-names>N.</given-names></name> <name><surname>Tao</surname> <given-names>D.</given-names></name> <name><surname>Luo</surname> <given-names>Z.</given-names></name> <name><surname>Yuan</surname> <given-names>B.</given-names></name></person-group> (<year>2011</year>). <article-title>Manifold regularized discriminative nonnegative matrix factorization with fast gradient descent</article-title>. <source>IEEE Trans. Image Process.</source> <volume>20</volume>, <fpage>2030</fpage>&#x02013;<lpage>2048</lpage>. <pub-id pub-id-type="doi">10.1109/TIP.2011.2105496</pub-id><pub-id pub-id-type="pmid">21233051</pub-id></citation></ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hao</surname> <given-names>T.</given-names></name> <name><surname>Wu</surname> <given-names>D.</given-names></name> <name><surname>Zhao</surname> <given-names>L.</given-names></name> <name><surname>Wang</surname> <given-names>Q.</given-names></name> <name><surname>Wang</surname> <given-names>E.</given-names></name> <name><surname>Sun</surname> <given-names>J.</given-names></name></person-group> (<year>2018</year>). <article-title>The genome-scale integrated networks in microorganisms</article-title>. <source>Front. Microbiol.</source> <volume>9</volume>:<fpage>296</fpage>. <pub-id pub-id-type="doi">10.3389/fmicb.2018.00296</pub-id><pub-id pub-id-type="pmid">29527198</pub-id></citation></ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>Y. A.</given-names></name> <name><surname>You</surname> <given-names>Z. H.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Huang</surname> <given-names>Z. A.</given-names></name> <name><surname>Zhang</surname> <given-names>S.</given-names></name> <name><surname>Yan</surname> <given-names>G. Y.</given-names></name></person-group> (<year>2017</year>). <article-title>Prediction of microbe-disease association from the integration of neighbor and graph with collaborative recommendation model</article-title>. <source>J. Transl. Med.</source> <volume>15</volume>, <fpage>209</fpage>. <pub-id pub-id-type="doi">10.1186/s12967-017-1304-7</pub-id><pub-id pub-id-type="pmid">29037244</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>Z. A.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Zhu</surname> <given-names>Z.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Yan</surname> <given-names>G. Y.</given-names></name> <name><surname>You</surname> <given-names>Z. H.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>PBHMDA: Path-Based Human Microbe-Disease Association Prediction</article-title>. <source>Front. Microbiol.</source> <volume>8</volume>:<fpage>233</fpage>. <pub-id pub-id-type="doi">10.3389/fmicb.2017.00233</pub-id><pub-id pub-id-type="pmid">28275370</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krajmalnik-Brown</surname> <given-names>R.</given-names></name> <name><surname>Ilhan</surname> <given-names>Z. E.</given-names></name> <name><surname>Kang</surname> <given-names>D. W.</given-names></name> <name><surname>Dibaise</surname> <given-names>J. K.</given-names></name></person-group> (<year>2012</year>). <article-title>Effects of gut microbes on nutrient absorption and energy regulation</article-title>. <source>Nutr. Clin. Pract.</source> <volume>27</volume>, <fpage>201</fpage>&#x02013;<lpage>214</lpage>. <pub-id pub-id-type="doi">10.1177/0884533611436116</pub-id><pub-id pub-id-type="pmid">22367888</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kyu</surname> <given-names>H. H.</given-names></name> <name><surname>Stein</surname> <given-names>C. E.</given-names></name> <name><surname>Boschi Pinto</surname> <given-names>C.</given-names></name> <name><surname>Rakovac</surname> <given-names>I.</given-names></name> <name><surname>Weber</surname> <given-names>M. W.</given-names></name> <name><surname>Dannemann Purnat</surname> <given-names>T.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Causes of death among children aged 5-14 years in the WHO European Region: a systematic analysis for the Global Burden of Disease Study 2016</article-title>. <source>Lancet Child Adolesc. Health</source> <volume>2</volume>, <fpage>321</fpage>&#x02013;<lpage>337</lpage>. <pub-id pub-id-type="doi">10.1016/S2352-4642(18)30095-6</pub-id><pub-id pub-id-type="pmid">29732397</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Leeuwenhoek</surname> <given-names>A. V.</given-names></name></person-group> (<year>1683-1775</year>). <article-title>Part of a Letter from Mr Antony van Leeuwenhoek, F. R. S. concerning Green Weeds Growing in Water, and Some Animalcula Found about Them</article-title>. <source>Philos. Trans.</source> <volume>23</volume>, <fpage>1304</fpage>&#x02013;<lpage>1311</lpage>.</citation></ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>X.</given-names></name> <name><surname>Cui</surname> <given-names>G.</given-names></name> <name><surname>Dong</surname> <given-names>Y.</given-names></name></person-group> (<year>2017</year>). <article-title>Graph regularized non-negative low-rank matrix factorization for image clustering</article-title>. <source>IEEE Trans. Cybern.</source> <volume>47</volume>, <fpage>3840</fpage>&#x02013;<lpage>3853</lpage>. <pub-id pub-id-type="doi">10.1109/TCYB.2016.2585355</pub-id><pub-id pub-id-type="pmid">27448379</pub-id></citation></ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ma</surname> <given-names>W.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Zeng</surname> <given-names>P.</given-names></name> <name><surname>Huang</surname> <given-names>C.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Geng</surname> <given-names>B.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>An analysis of human microbe-disease associations</article-title>. <source>Brief. Bioinformatics</source> <volume>18</volume>, <fpage>85</fpage>&#x02013;<lpage>97</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbw005</pub-id><pub-id pub-id-type="pmid">26883326</pub-id></citation></ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mejj&#x000ED;a-Le&#x000F3;n</surname> <given-names>M. E.</given-names></name> <name><surname>Barca</surname> <given-names>A. M.</given-names></name></person-group> (<year>2015</year>). <article-title>Diet, microbiota and immune system in type 1 diabetes development and evolution</article-title>. <source>Nutrients</source> <volume>7</volume>, <fpage>9171</fpage>&#x02013;<lpage>9184</lpage>. <pub-id pub-id-type="doi">10.3390/nu7115461</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nepusz</surname> <given-names>T.</given-names></name> <name><surname>Yu</surname> <given-names>H.</given-names></name> <name><surname>Paccanaro</surname> <given-names>A.</given-names></name></person-group> (<year>2012</year>). <article-title>Detecting overlapping protein complexes in protein-protein interaction networks</article-title>. <source>Nat. Methods</source> <volume>9</volume>, <fpage>471</fpage>&#x02013;<lpage>472</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.1938</pub-id><pub-id pub-id-type="pmid">22426491</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Park</surname> <given-names>H. J.</given-names></name> <name><surname>Lee</surname> <given-names>S. E.</given-names></name> <name><surname>Kim</surname> <given-names>H. B.</given-names></name> <name><surname>Isaacson</surname> <given-names>R. E.</given-names></name> <name><surname>Seo</surname> <given-names>K. W.</given-names></name> <name><surname>Song</surname> <given-names>K. H.</given-names></name></person-group> (<year>2015</year>). <article-title>Association of obesity with serum leptin, adiponectin, and serotonin and gut microflora in beagle dogs</article-title>. <source>J. Vet. Intern. Med.</source> <volume>29</volume>, <fpage>43</fpage>&#x02013;<lpage>50</lpage>. <pub-id pub-id-type="doi">10.1111/jvim.12455</pub-id><pub-id pub-id-type="pmid">25407880</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peterson</surname> <given-names>C. T.</given-names></name> <name><surname>Sharma</surname> <given-names>V.</given-names></name> <name><surname>Elm&#x000E9;n</surname> <given-names>L.</given-names></name> <name><surname>Peterson</surname> <given-names>S. N.</given-names></name></person-group> (<year>2015</year>). <article-title>Immune homeostasis, dysbiosis and therapeutic modulation of the gut microbiota</article-title>. <source>Clin. Exp. Immunol.</source> <volume>179</volume>, <fpage>363</fpage>&#x02013;<lpage>377</lpage>. <pub-id pub-id-type="doi">10.1111/cei.12474</pub-id><pub-id pub-id-type="pmid">25345825</pub-id></citation></ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Reinier</surname> <given-names>K.</given-names></name> <name><surname>Chugh</surname> <given-names>S. S.</given-names></name></person-group> (<year>2015</year>). <article-title>Obesity and sudden death: visceral response?</article-title> <source>Heart</source> <volume>101</volume>, <fpage>165</fpage>&#x02013;<lpage>166</lpage>. <pub-id pub-id-type="doi">10.1136/heartjnl-2014-306921</pub-id><pub-id pub-id-type="pmid">25502517</pub-id></citation></ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rewers</surname> <given-names>M.</given-names></name> <name><surname>Ludvigsson</surname> <given-names>J.</given-names></name></person-group> (<year>2016</year>). <article-title>Environmental risk factors for type 1 diabetes</article-title>. <source>Lancet</source> <volume>387</volume>, <fpage>2340</fpage>&#x02013;<lpage>2348</lpage>. <pub-id pub-id-type="doi">10.1016/S0140-6736(16)30507-4</pub-id><pub-id pub-id-type="pmid">27302273</pub-id></citation></ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Serena</surname> <given-names>G.</given-names></name> <name><surname>Camhi</surname> <given-names>S.</given-names></name> <name><surname>Sturgeon</surname> <given-names>C.</given-names></name> <name><surname>Yan</surname> <given-names>S.</given-names></name> <name><surname>Fasano</surname> <given-names>A.</given-names></name></person-group> (<year>2015</year>). <article-title>The role of gluten in celiac disease and type 1 diabetes</article-title>. <source>Nutrients</source> <volume>7</volume>, <fpage>7143</fpage>&#x02013;<lpage>7162</lpage>. <pub-id pub-id-type="doi">10.3390/nu7095329</pub-id><pub-id pub-id-type="pmid">26343710</pub-id></citation></ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smid</surname> <given-names>E. J.</given-names></name> <name><surname>Lacroix</surname> <given-names>C.</given-names></name></person-group> (<year>2013</year>). <article-title>Microbe-microbe interactions in mixed culture food fermentations</article-title>. <source>Curr. Opin. Biotechnol.</source> <volume>24</volume>, <fpage>148</fpage>&#x02013;<lpage>154</lpage>. <pub-id pub-id-type="doi">10.1016/j.copbio.2012.11.007</pub-id><pub-id pub-id-type="pmid">23228389</pub-id></citation></ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sotiras</surname> <given-names>A.</given-names></name> <name><surname>Resnick</surname> <given-names>S. M.</given-names></name> <name><surname>Davatzikos</surname> <given-names>C.</given-names></name></person-group> (<year>2015</year>). <article-title>Finding imaging patterns of structural covariance via Non-Negative Matrix Factorization</article-title>. <source>Neuroimage</source> <volume>108</volume>, <fpage>1</fpage>&#x02013;<lpage>16</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuroimage.2014.11.045</pub-id><pub-id pub-id-type="pmid">25497684</pub-id></citation></ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tabatabaei</surname> <given-names>M.</given-names></name> <name><surname>Rahim</surname> <given-names>R. A.</given-names></name> <name><surname>Abdullah</surname> <given-names>N.</given-names></name> <name><surname>Wright</surname> <given-names>A. D. G.</given-names></name> <name><surname>Shirai</surname> <given-names>Y.</given-names></name> <name><surname>Sakai</surname> <given-names>K.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Importance of the methanogenic archaea populations in anaerobic wastewater treatments</article-title>. <source>Process Biochem.</source> <volume>45</volume>, <fpage>1214</fpage>&#x02013;<lpage>1225</lpage>. <pub-id pub-id-type="doi">10.1016/j.procbio.2010.05.017</pub-id></citation></ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tanaka</surname> <given-names>K.</given-names></name></person-group> (<year>1999</year>). <article-title>From the fryer to the fuel tank: the complete guide to using vegetable oil as an alternative fuel</article-title>. <source>Green Teacher</source> <volume>15</volume>, <fpage>46</fpage>.</citation></ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thiele</surname> <given-names>I.</given-names></name> <name><surname>Heinken</surname> <given-names>A.</given-names></name> <name><surname>Fleming</surname> <given-names>R. M.</given-names></name></person-group> (<year>2013</year>). <article-title>A systems biology approach to studying the role of microbes in human health</article-title>. <source>Curr. Opin. Biotechnol.</source> <volume>24</volume>, <fpage>4</fpage>&#x02013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.1016/j.copbio.2012.10.001</pub-id><pub-id pub-id-type="pmid">23102866</pub-id></citation></ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>van Laarhoven</surname> <given-names>T.</given-names></name> <name><surname>Nabuurs</surname> <given-names>S. B.</given-names></name> <name><surname>Marchiori</surname> <given-names>E.</given-names></name></person-group> (<year>2011</year>). <article-title>Gaussian interaction profile kernels for predicting drug-target interaction</article-title>. <source>Bioinformatics</source> <volume>27</volume>, <fpage>3036</fpage>&#x02013;<lpage>3043</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btr500</pub-id><pub-id pub-id-type="pmid">21893517</pub-id></citation></ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>E.</given-names></name> <name><surname>Zaman</surname> <given-names>N.</given-names></name> <name><surname>Mcgee</surname> <given-names>S.</given-names></name> <name><surname>Milanese</surname> <given-names>J. S.</given-names></name> <name><surname>Masoudi-Nejad</surname> <given-names>A.</given-names></name> <name><surname>O&#x00027;Connor-Mccourt</surname> <given-names>M.</given-names></name></person-group> (<year>2015</year>). <article-title>Predictive genomics: a cancer hallmark network framework for predicting tumor clinical phenotypes using genome sequencing data</article-title>. <source>Semin. Cancer Biol.</source> <volume>30</volume>, <fpage>4</fpage>&#x02013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.1016/j.semcancer.2014.04.002</pub-id><pub-id pub-id-type="pmid">24747696</pub-id></citation></ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>F.</given-names></name> <name><surname>Huang</surname> <given-names>Z. A.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Zhu</surname> <given-names>Z.</given-names></name> <name><surname>Wen</surname> <given-names>Z.</given-names></name> <name><surname>Zhao</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>LRLSHMDA: Laplacian regularized least squares for human microbe-disease association prediction</article-title>. <source>Sci. Rep.</source> <volume>7</volume>:<fpage>7601</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-017-08127-2</pub-id><pub-id pub-id-type="pmid">28790448</pub-id></citation></ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Woting</surname> <given-names>A.</given-names></name> <name><surname>Pfeiffer</surname> <given-names>N.</given-names></name> <name><surname>Loh</surname> <given-names>G.</given-names></name> <name><surname>Klaus</surname> <given-names>S.</given-names></name> <name><surname>Blaut</surname> <given-names>M.</given-names></name></person-group> (<year>2014</year>). <article-title>Clostridium ramosum promotes high-fat diet-induced obesity in gnotobiotic mouse models</article-title>. <source>MBio</source> <volume>5</volume>, <fpage>e01530</fpage>&#x02013;<lpage>e01514</lpage>. <pub-id pub-id-type="doi">10.1128/mBio.01530-14</pub-id><pub-id pub-id-type="pmid">25271283</pub-id></citation></ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xiao</surname> <given-names>Q.</given-names></name> <name><surname>Luo</surname> <given-names>J.</given-names></name> <name><surname>Liang</surname> <given-names>C.</given-names></name> <name><surname>Cai</surname> <given-names>J.</given-names></name> <name><surname>Ding</surname> <given-names>P.</given-names></name></person-group> (<year>2017</year>). <article-title>A graph regularized non-negative matrix factorization method for identifying microRNA-disease associations</article-title>. <source>Bioinformatics</source> <volume>34</volume>, <fpage>239</fpage>&#x02013;<lpage>248</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btx545</pub-id></citation></ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>J.</given-names></name> <name><surname>Xiang</surname> <given-names>L.</given-names></name> <name><surname>Wang</surname> <given-names>G.</given-names></name> <name><surname>Ganesan</surname> <given-names>S.</given-names></name> <name><surname>Feldman</surname> <given-names>M.</given-names></name> <name><surname>Shih</surname> <given-names>N. N.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Sparse Non-negative Matrix Factorization (SNMF) based color unmixing for breast histopathological image analysis</article-title>. <source>Comput. Med. Imaging Graph</source> <volume>46</volume>(<issue>Pt 1</issue>), <fpage>20</fpage>&#x02013;<lpage>29</lpage>. <pub-id pub-id-type="doi">10.1016/j.compmedimag.2015.04.002</pub-id><pub-id pub-id-type="pmid">25958195</pub-id></citation></ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>You</surname> <given-names>Z. H.</given-names></name> <name><surname>Huang</surname> <given-names>Z. A.</given-names></name> <name><surname>Zhu</surname> <given-names>Z.</given-names></name> <name><surname>Yan</surname> <given-names>G. Y.</given-names></name> <name><surname>Li</surname> <given-names>Z. W.</given-names></name> <name><surname>Wen</surname> <given-names>Z.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>PBMDA: a novel and effective path-based computational model for miRNA-disease association prediction</article-title>. <source>PLoS Comput. Biol.</source> <volume>13</volume>:<fpage>e1005455</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1005455</pub-id><pub-id pub-id-type="pmid">28339468</pub-id></citation></ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname> <given-names>J.</given-names></name> <name><surname>Jang</surname> <given-names>S. O.</given-names></name> <name><surname>Kim</surname> <given-names>B. J.</given-names></name> <name><surname>Song</surname> <given-names>Y. H.</given-names></name> <name><surname>Kwon</surname> <given-names>J. W.</given-names></name> <name><surname>Kang</surname> <given-names>M. J.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>The effects of <italic>Lactobacillus rhamnosus</italic> on the prevention of asthma in a murine model</article-title>. <source>Allergy Asthma Immunol. Res.</source> <volume>2</volume>, <fpage>199</fpage>&#x02013;<lpage>205</lpage>. <pub-id pub-id-type="doi">10.4168/aair.2010.2.3.199</pub-id><pub-id pub-id-type="pmid">20592920</pub-id></citation></ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Q.</given-names></name> <name><surname>Cox</surname> <given-names>M.</given-names></name> <name><surname>Liang</surname> <given-names>Z.</given-names></name> <name><surname>Brinkmann</surname> <given-names>F.</given-names></name> <name><surname>Cardenas</surname> <given-names>P. A.</given-names></name> <name><surname>Duff</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Airway microbiota in severe asthma and relationship to asthma severity and phenotypes</article-title>. <source>PLoS ONE</source> <volume>11</volume>:<fpage>e0152724</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0152724</pub-id><pub-id pub-id-type="pmid">27078029</pub-id></citation></ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>X.</given-names></name> <name><surname>Menche</surname> <given-names>J.</given-names></name> <name><surname>Barab&#x000E1;si</surname> <given-names>A. L.</given-names></name> <name><surname>Sharma</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>Human symptoms-disease network</article-title>. <source>Nat. Commun.</source> <volume>5</volume>, <fpage>4212</fpage>. <pub-id pub-id-type="doi">10.1038/ncomms5212</pub-id><pub-id pub-id-type="pmid">24967666</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn fn-type="financial-disclosure"><p><bold>Funding.</bold> B-SH was supported by Key Program of Hunan Provincial Education Department (Grant No. 15A026), General Program of Hunan Provincial Philosophy and Social Science Planning Fund office (Grant No. 15YBA035). L-HP was supported by Natural Science Foundation of Hunan province under Grant No. 2018JJ3570. ZL was supported by National Nature Science Foundation of China (Grant No. 61672223).</p>
</fn>
</fn-group>
</back>
</article>