<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2018.02440</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Human Microbe-Disease Association Prediction Based on Adaptive Boosting</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Peng</surname> <given-names>Li-Hong</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x2020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/601482/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Yin</surname> <given-names>Jun</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x2020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/578422/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Zhou</surname> <given-names>Liqian</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Liu</surname> <given-names>Ming-Xi</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Zhao</surname> <given-names>Yan</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/592576/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>School of Computer Science, Hunan University of Technology</institution>, <addr-line>Zhuzhou</addr-line>, <country>China</country></aff>
<aff id="aff2"><sup>2</sup><institution>School of Information and Control Engineering, China University of Mining and Technology</institution>, <addr-line>Xuzhou</addr-line>, <country>China</country></aff>
<aff id="aff3"><sup>3</sup><institution>Institutes of Science and Development, Chinese Academy of Sciences</institution>, <addr-line>Beijing</addr-line>, <country>China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Hongsheng Liu, Liaoning University, China</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Yi Xiong, Shanghai Jiao Tong University, China; Qinghua Cui, Peking University, China</p></fn>
<corresp id="c001">&#x002A;Correspondence: Ming-Xi Liu liumingxi@casipm.ac.cn Yan Zhao, <email>ts17060090a3@cumt.edu.cn</email></corresp>
<fn fn-type="other" id="fn002"><p><sup>&#x2020;</sup>Joint first authors</p></fn>
<fn fn-type="other" id="fn003"><p>This article was submitted to Systems Microbiology, a section of the journal Frontiers in Microbiology</p></fn></author-notes>
<pub-date pub-type="epub">
<day>09</day>
<month>10</month>
<year>2018</year>
</pub-date>
<pub-date pub-type="collection">
<year>2018</year>
</pub-date>
<volume>9</volume>
<elocation-id>2440</elocation-id>
<history>
<date date-type="received">
<day>19</day>
<month>08</month>
<year>2018</year>
</date>
<date date-type="accepted">
<day>24</day>
<month>09</month>
<year>2018</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2018 Peng, Yin, Zhou, Liu and Zhao.</copyright-statement>
<copyright-year>2018</copyright-year>
<copyright-holder>Peng, Yin, Zhou, Liu and Zhao</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>There are countless microbes in the human body, and they play various roles in the physiological process. There is growing evidence that microbes are closely associated with human diseases. Researching disease-related microbes helps us understand the mechanisms of diseases and provides new strategies for diseases diagnosis and treatment. Many computational models have been proposed to predict disease-related microbes, in this paper, we developed a model of Adaptive Boosting for Human Microbe-Disease Association prediction (ABHMDA) to reveal the associations between diseases and microbes by calculating the relation probability of disease-microbe pair using a strong classifier. Our model could be applied to new diseases without any known related microbes. In order to assess the prediction power of the model, global and local leave-one-out cross validation (LOOCV) were implemented. As shown in the results, the global and local LOOCV values reached 0.8869 and 0.7910, respectively. What&#x2019;s more, 10, 10, and 8 out of the top 10 microbes predicted to be most likely to be associated with Asthma, Colorectal carcinoma and Type 1 diabetes were all verified by relevant literatures or database HMDAD, respectively. The above results verify the superior predictive performance of ABHMDA.</p>
</abstract>
<kwd-group>
<kwd>microbe</kwd>
<kwd>disease</kwd>
<kwd>association prediction</kwd>
<kwd>adaptive boosting</kwd>
<kwd>decision tree</kwd>
</kwd-group>
<counts>
<fig-count count="2"/>
<table-count count="3"/>
<equation-count count="0"/>
<ref-count count="51"/>
<page-count count="9"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec><title>Introduction</title>
<p>Microbes are ubiquitous in our lives. After deeper research, microbes could be simply divided into the following types: bacteria, fungi, viruses, archaea, protozoa, and so on (<xref ref-type="bibr" rid="B41">Sommer and Backhed, 2013</xref>). As we all know, there are a number of microbes living in the human tissues, such as gut (<xref ref-type="bibr" rid="B22">Grenham et al., 2011</xref>), skin (<xref ref-type="bibr" rid="B20">Fredricks, 2001</xref>) and lung (<xref ref-type="bibr" rid="B17">Cole, 1989</xref>). Cells are the basic unit of our body&#x2019;s structure and function, and our body contains more than 40 trillion cells, but studies have shown that the number of microorganisms in humans is 10% more than the number of cells, which shows that the microbial community is relatively large in the human body (<xref ref-type="bibr" rid="B39">Sender et al., 2016</xref>). There are studies showing that microorganisms are involved in many biological processes in the human body, such as metabolic function, immune function, and so on (<xref ref-type="bibr" rid="B21">Gill et al., 2006</xref>). For example, in the intestinal tract of the adult, most of the intestinal microbes living in the gastrointestinal tract are able to not only synthesize necessary amino acids and vitamins, but also are conducive to the digestion and absorption of indigestible food (<xref ref-type="bibr" rid="B26">Huang Z.A. et al., 2017</xref>). So it is not surprising that there are links between microbes and human diseases (<xref ref-type="bibr" rid="B18">Consortium, 2012</xref>). Some researchers had found a close relationship between human type 2 diabetes and changes in the composition of the intestinal microbiota (<xref ref-type="bibr" rid="B30">Larsen et al., 2010</xref>). Gut microbes could induce colorectal cancer by generating butyrate that promoted the hyperproliferation of MSH2(-/-) colon epithelial cells (<xref ref-type="bibr" rid="B4">Belcheva et al., 2014</xref>). There was also evidence that toxins produced by microbes such as Streptococcus and Staphylococcus aureus had been shown to be a new class of allergens that could induce or even aggravate inflammatory skin diseases (<xref ref-type="bibr" rid="B40">Skov and Baadsgaard, 2000</xref>). Therefore, revealing disease-related microbes not only helps to further understand the pathogenesis of the disease but also provides new strategies for the diagnosis and treatment of the disease. Although some proven disease-microbe associations have been documented in the database HMDAD (<xref ref-type="bibr" rid="B33">Ma et al., 2017</xref>)<sup><xref ref-type="fn" rid="fn01">1</xref></sup>, such as Allergic asthma-Helicobacter pylori, Allergic sensitization-Clostridium difficile, and Asthma-Bacteroidetes, these are far from enough. Unfortunately, using biological experiments to reveal the relationship between disease and microbes is cumbersome and costly. Therefore, it is imperative to predict the potential disease-related microbes by constructing computational models.</p>
<p>According to the assumption that functionally similar microbes tend to be associated with similar diseases, by integrating two separate recommendation algorithms based on neighbor information and network topology, respectively, <xref ref-type="bibr" rid="B25">Huang Y.A. et al. (2017)</xref> developed a neighbor and graph based combined recommendation model for human microbe-disease association prediction (NGRHMDA) to predict potential disease-related microbes. As a combination of two independent recommendation models, the prediction accuracy of NGRHMDA was significantly improved compared to a single recommendation model. Unlike previous methods, NGRHMDA was an unsupervised learning method that did not require negative samples. Of course, there were some restrictions on NGRHMDA. Firstly, NGRHMDA could not be applied to predict microbes associated with new diseases without any known related microbes. Secondly, the optimal values of some parameters in the model were still not solved. <xref ref-type="bibr" rid="B26">Huang Z.A. et al. (2017)</xref> proposed a method of Path-Based Human Microbe-Disease Association prediction (PBHMDA) by integrating confirmed disease-microbe relations and the Gaussian interaction profile kernel similarity for diseases and microbes into a heterogeneous network. This model traversed all possible pathways between microbes and diseases through a novel depth-first search algorithm to predict the most likely disease-associated microbes. Both global and local leave-one-out cross validation (LOOCV) AUC values of PBHMDA were greater than 0.9, which showed that the prediction accuracy of PBHMDA was quite impressive. Regrettably, this model still had some shortcomings. Firstly, both the disease&#x2013;disease similarities and microbe&#x2013;microbe similarities were obtained from the Gaussian kernel for interaction profiles of microbes and diseases that were calculated based on the known disease-microbe associations, which might be biased for diseases with more known related microbes. Secondly, PBHMDA was also not suitable for new diseases. What&#x2019;s more, based on the known human microbe-disease association network obtained from the HMDAD database, <xref ref-type="bibr" rid="B46">Wang et al. (2017)</xref> proposed a novel computational model of Laplacian Regularized Least Squares for Human Microbe-Disease Association (LRLSHMDA) to reveal potential disease-related microbes (<xref ref-type="bibr" rid="B46">Wang et al., 2017</xref>). LRLSHMDA applied a semi-supervised learning framework due to the lack of pairs of disease-microbes that had proven to be unrelated. In this model, the microbe similarity network and the disease similarity network were constructed based on the Gaussian interaction profile kernel similarity calculated by known microbe-disease association, and then by constructing and optimizing the cost functions in microbe space and disease space to integrated the optimal classifier functions to calculate the relation probabilities of microbe-disease pairs. Although the reliable prediction performance of LRLSHMDA had been verified, the model still had some shortcomings that needed further improvement. Firstly, the number of proven-microbe associations was too small, and sparse known association network might affect the prediction performance of the model. Secondly, LRLSHMDA could not be suitable for new microbes without any known related diseases.</p>
<p>In addition, <xref ref-type="bibr" rid="B33">Ma et al. (2017)</xref> built a microbe-disease association network based on published literature, and constructed a disease&#x2013;disease network (Human Microbe Disease Network, HMDN) based on disease-associated microbes where the weight of the link between diseases was the similarity of microbes associated with the corresponding disease, and then by integrating data of disease genes, symptoms, chemical fragments, and drugs to investigate the overlaps between microbes and genes. <xref ref-type="bibr" rid="B8">Chen et al. (2017a)</xref> built a microbe-human disease association network and proposed a novel computational model of KATZ measure for Human Microbe-Disease Association prediction (KATZHMDA) based on this hypothesis that functionally similar microbes tend to have similar interactions and non-interactive patterns with non-infectious diseases and vice versa. By merging known disease-microbe association networks, disease similarity networks and microbe similarity networks into a heterogeneous network, KATZHMDA integrated walks with different lengths in the network to calculate the relation probability between microbe and disease. As a global computation method, KATZHMDA was capable of simultaneously revealing microbes associated with all diseases in a large-scale network. However, KATZHMDA still had many problems need to be solved in the future. For example, the problem of the optimal value of the parameter k had not been solved yet, and the prediction accuracy of KATZHMDA needed to be improved.</p>
<p>The above methods had various shortcomings. For instance, some models were not suitable for new diseases, and the optimal values of the parameters in some models were not well solved. For the sake of revealing the association between microbe-diseases better, in this paper, we proposed a model of Adaptive Boosting for Human Microbe-Disease Association prediction (ABHMDA) to uncover the associations between diseases and microbes by calculating the relation probability of disease-microbe pair using a strong classifier. Compared with the above methods, our model had the advantage of predicting microbes associated with new diseases. Since the number of negative samples was much larger than that of positive samples, we introduced k-means clusters to sample negative samples to balance the samples for training. What&#x2019;s more, the strong classifier was composed of multiple weak classifiers according to the corresponding weights, and the higher the prediction accuracy of weak classifier, the greater the weight of it. We applied global and local LOOCV to evaluate the prediction performance of ABHMDA. As the results shown, the global and local LOOCV values reached 0.8869 and 0.7910, respectively, which indicated that the model&#x2019;s prediction power was reliable. Besides, we used ABHMDA to conduct case studies on three diseases. 10, 10, and 8 out of the top 10 microbes predicted to be most likely to be associated with Asthma, Colorectal carcinoma and Type 1 diabetes were all verified by relevant literatures or database HMDAD, respectively.</p>
</sec>
<sec id="s1" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec><title>Human Microbe-Disease Associations</title>
<p>We could obtain 450 known associations between 292 microbes and 38 diseases from Human Microbe-Disease Association Database (HMDAD) (<xref ref-type="bibr" rid="B33">Ma et al., 2017</xref>). For the reason that there were several grades of microbe classification, and when using 16s RNA sequences to study microbes, only the information in the level of genus would be acquired, we revealed the microbes which were likely to be related with human diseases in genus level. Besides, we defined the adjacency matrix A, if there was known association between disease <italic>d</italic>(<italic>i</italic>) and microbes <italic>m</italic>(<italic>j</italic>), the value of the element <italic>A</italic>(<italic>d</italic>(<italic>i</italic>), <italic>m</italic>(<italic>j</italic>)) matrix A was 1. We applied the variable nd, nm to denote the number of diseases and microbes studied, respectively.</p>
</sec>
<sec><title>Gaussian Interaction Profile Kernel Similarity</title>
<p>Inspired by this article (<xref ref-type="bibr" rid="B29">Laarhoven et al., 2011</xref>), Considering the assumption that if two similar diseases were associated with two microbes, respectively, the two microbes were likely to be similar, and there were similar interaction and non-interaction pattern between diseases and microbes, Gaussian interaction profile kernel similarity for disease KD was constructed to indicated the similarities between diseases based on the known associations of disease-microbe pairs. Firstly, binary vector <italic>IP</italic>(<italic>d</italic>(<italic>i</italic>)) was defined to represented the interaction profiles of diseases <italic>d</italic>(<italic>i</italic>) by observing whether there was a known association between disease <italic>d</italic>(<italic>i</italic>) and each microbe (i.e., the ith row of the adjacency matrix A). Then, the Gaussian interaction profile kernel similarity between disease <italic>d</italic>(<italic>i</italic>) and <italic>d</italic>(<italic>i</italic>) could be calculated as follow:</p>
<disp-formula id="M1"><label>(1)</label><mml:math id="E1"><mml:mrow><mml:mi>K</mml:mi><mml:mi>D</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>d</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>i</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mtext>,&#x00A0;</mml:mtext><mml:mi>d</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>j</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo stretchy='false'>)</mml:mo><mml:mo>=</mml:mo><mml:mi>exp</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>&#x03B3;</mml:mi><mml:mtext>d</mml:mtext></mml:msub><mml:mo>&#x007C;</mml:mo><mml:mo>&#x007C;</mml:mo><mml:mi>I</mml:mi><mml:mi>P</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>d</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>i</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mi>I</mml:mi><mml:mi>P</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>d</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>i</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x007C;</mml:mo><mml:msup><mml:mo>&#x007C;</mml:mo><mml:mn>2</mml:mn></mml:msup><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:math></disp-formula>
<p>Here, parameter &#x03B3;<sub>d</sub> was introduced to regulated the kernel bandwidth and got by normalizing another parameter &#x03B3;&#x2032;<sub>d</sub> by the average number of related microbes of all the diseases. &#x03B3;<sub>d</sub> was calculated as follow:</p>
<disp-formula id="M2"><label>(2)</label><mml:math id="E2"><mml:mrow><mml:msub><mml:mi>&#x03B3;</mml:mi><mml:mtext>d</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msubsup><mml:mi>&#x03B3;</mml:mi><mml:mtext>d</mml:mtext><mml:mo>&#x2032;</mml:mo></mml:msubsup></mml:mrow><mml:mrow><mml:mfrac><mml:mrow><mml:msub><mml:mo>&#x2211;</mml:mo><mml:mn>1</mml:mn></mml:msub><mml:mi>n</mml:mi><mml:mi>d</mml:mi><mml:mo>&#x007C;</mml:mo><mml:mo>&#x007C;</mml:mo><mml:mi>I</mml:mi><mml:mi>P</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>d</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>i</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x007C;</mml:mo><mml:msup><mml:mo>&#x007C;</mml:mo><mml:mn>2</mml:mn></mml:msup></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>d</mml:mi></mml:mrow></mml:mfrac></mml:mrow></mml:mfrac></mml:mrow></mml:math></disp-formula>
<p>where the value of &#x03B3;&#x2032;<sub>d</sub> was 1.</p>
<p>The definition of Gaussian interaction profile kernel similarity for microbe KM was similar to KD</p>
</sec>
<sec><title>Integrating Symptom-Based Disease Similarity</title>
<p>From the above we could see that Gaussian interaction profile kernel similarity was only based on the adjacency matrix A. If we wanted to effectively and scientifically predict potential disease-associated microbes, it was necessary to introduce other datasets in combination with the Gaussian interaction profile kernel similarity. Based on the disease and corresponding symptom recorded in PubMed bibliography. <xref ref-type="bibr" rid="B51">Zhou et al. (2014)</xref> calculated similarity between diseases and constructed the symptom-based human disease network (HSDN). Here, we integrated the Gaussian interaction profile kernel similarity for disease KD and the symptom-based disease similarity SDM to obtained the Integrating symptom-based disease similarity SD, and the calculation of SD was defined as follow:</p>
<disp-formula id="M3"><label>(3)</label><mml:math id="E3"><mml:mrow><mml:mi>S</mml:mi><mml:mi>D</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>K</mml:mi><mml:mi>D</mml:mi><mml:mo>+</mml:mo><mml:mi>S</mml:mi><mml:mi>D</mml:mi><mml:mi>M</mml:mi></mml:mrow><mml:mn>2</mml:mn></mml:mfrac></mml:mrow></mml:math></disp-formula>
</sec>
<sec><title>ABHMDA</title>
<p>Motivated by this paper (<xref ref-type="bibr" rid="B37">Rayhan et al., 2017</xref>), we constructed a novel calculation model of ABHMDA to predict disease-related microbes and the flow chart of the algorithm was shown in <bold>Figure <xref ref-type="fig" rid="F1">1</xref></bold>. The core idea of ABHMDA was to train different classifiers (weak classifiers) for the same training samples, and then grouped these weak classifiers with different ratios to form a stronger classifier to score and sort samples. Here, we chose the decision tree as our weak classifier. The specific steps were mainly divided into three steps: integrating the data, training the model, and scoring the samples. In the first step, we integrated the Gaussian interaction profile kernel similarity for microbe KM and the Integrating symptom-based disease similarity SD. In the second step, we firstly referred to the sample with confirmed association as a positive sample, otherwise it was an unknown sample. On account of the unknown sample accounting for about 97% in all the samples, that was to say, there were far more unknown samples than positive ones, and it was unreasonable to directly train such unbalanced datasets. Here, we introduced a novel method to balance the datasets. In this method, we applied the k-mean clustering to divide the unknown sample into k parts, and then randomly extract some samples from each part as negative samples, while positive samples kept unchanged. There were researchers studying the effect to random extraction when k took different values, and the results shown that the optimal value of parameter k was 23. In order to make the dataset used for training more balanced, the number of the unknown samples randomly selected ought to be approximately equal to the positive sample. In the end, the negative and positive samples together formed the training samples. Each training sample was weighted with an initial weight of <inline-formula><mml:math id="E9"><mml:mrow><mml:mfrac><mml:mn>1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mrow></mml:math></inline-formula>, where <italic>n</italic> was the total number of training samples. The main purpose of the training process was to calculate the proportion of each weak classifier in the final strong classifier and update the weight of each training sample according to whether it was classified correctly by the last classifier and the overall classification accuracy of the last classifier. After updating, the new training sample set with modified weight values was sent to the next weak classifier for training. Here, we built lists D<sub>I</sub>,<italic>h</italic>(<italic>i</italic>) and <italic>Y</italic> , all of which had n elements. The value of each element in D<sub>i</sub> was the weight of the corresponding sample when the ith weak classifier trained the sample. The value of i was 0, 1, 2,,,,29. In other words, D<sub>0</sub> was a list with all elements being <inline-formula><mml:math id="E10"><mml:mrow><mml:mfrac><mml:mn>1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mrow></mml:math></inline-formula>. The value of the element in label lists <italic>h</italic>(<italic>i</italic>) and Y was only 0 or 1, and the difference between them was that the value of <italic>h</italic>(<italic>i</italic>)<sub>j</sub> depended on the prediction of the ith weak classifier, while the value of <italic>Y</italic><sub>j</sub> depended on whether the corresponding sample was a positive sample, if the corresponding sample was a positive sample, the value of <italic>Y</italic><sub>j</sub> was equal to 1, otherwise 0. The error function &#x2208;<sub>i</sub> was calculated as follow:</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>The flowchart of ABHMDA includes three steps: preparing the data; training the model; and scoring and ranking the disease-microbe pairs.</p></caption>
<graphic xlink:href="fmicb-09-02440-g001.tif"/>
</fig>
<disp-formula id="M4"><label>(4)</label><mml:math id="E4"><mml:mrow><mml:msub><mml:mo>&#x2208;</mml:mo><mml:mtext>i</mml:mtext></mml:msub><mml:mstyle displaystyle='true'><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mtext>j</mml:mtext><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mtext>n</mml:mtext></mml:munderover><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mtext>i</mml:mtext></mml:msub><mml:msub><mml:mn>1</mml:mn><mml:mrow><mml:mi>h</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mtext>i</mml:mtext><mml:mo stretchy='false'>)</mml:mo><mml:mi>j</mml:mi><mml:mo>&#x2260;</mml:mo><mml:msub><mml:mi>Y</mml:mi><mml:mtext>j</mml:mtext></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:mstyle></mml:mrow></mml:math></disp-formula>
<p>It could be seen from the formula that the error function &#x2208;<sub>i</sub> was equal to the sum of the weights of the samples, whose label predicted by the weak classifier <italic>h</italic>(<italic>i</italic>)<sub>j</sub> was different from the known label <italic>Y</italic><sub>j</sub>. That was to say &#x2208;<sub>i</sub> was equal to the sum of the weights of all the samples that were predicted wrong. Then the proportion of the ith weak classifier in the strong classifier could be defined as follow:</p>
<disp-formula id="M5"><label>(5)</label><mml:math id="E5"><mml:mrow><mml:msub><mml:mi>&#x03B1;</mml:mi><mml:mtext>i</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>log</mml:mi><mml:mfrac><mml:mrow><mml:mn>1</mml:mn><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mo>&#x2208;</mml:mo><mml:mtext>i</mml:mtext></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mo>&#x2208;</mml:mo><mml:mtext>i</mml:mtext></mml:msub></mml:mrow></mml:mfrac></mml:mrow><mml:mn>2</mml:mn></mml:mfrac></mml:mrow></mml:math></disp-formula>
<p>It could be seen from equation (5) that the smaller the error function was, the larger the proportion of the weak classifier in the strong classifier would be. And the variate Z<sub>i</sub> could be calculated as follow:</p>
<disp-formula id="M6"><label>(6)</label><mml:math id="E6"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn>2</mml:mn><mml:msup><mml:mrow><mml:mo stretchy='false'>[</mml:mo><mml:msub><mml:mo>&#x2208;</mml:mo><mml:mtext>i</mml:mtext></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mo>&#x2208;</mml:mo><mml:mtext>i</mml:mtext></mml:msub><mml:mo stretchy='false'>)</mml:mo><mml:mo stretchy='false'>]</mml:mo></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:math></disp-formula>
<p>The weight of the sample could be updated according to the following formula:</p>
<disp-formula id="M7"><label>(7)</label><mml:math id="E7"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mtext>i+1</mml:mtext></mml:mrow></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:mi>j</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mtext>i</mml:mtext></mml:msub></mml:mrow></mml:mfrac><mml:msub><mml:mi>D</mml:mi><mml:mtext>i</mml:mtext></mml:msub><mml:mtext>&#x00A0;(</mml:mtext><mml:mi>j</mml:mi><mml:mtext>)</mml:mtext><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>&#x03B1;</mml:mi><mml:mtext>i</mml:mtext></mml:msub><mml:msub><mml:mi>Y</mml:mi><mml:mtext>j</mml:mtext></mml:msub><mml:mi>h</mml:mi><mml:msub><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:mtext>i)</mml:mtext></mml:mrow><mml:mtext>j</mml:mtext></mml:msub></mml:mrow></mml:msup></mml:mrow></mml:math></disp-formula>
<p>Here <italic>j</italic> = 0, 1, 2...<italic>n</italic> - 1. After the weights of samples being updated, the samples with the new weights were sent to the next weak classifier to start the next training until all the weak classifiers completed the training (Theoretically, the more weak classifiers, the higher the prediction accuracy of strong classifier. But when the weak classifier reached a certain number, the prediction accuracy tended to be stable. And then as the number of weak classifiers increased, accuracy was not significantly improved, but the prediction process took longer. We compared the prediction results with 20, 30, and 40 weak classifiers, the accuracy of using 30 and 40 weak classifiers was basically the same, which was better than 20 weak classifiers. However, the prediction time of 40 weak classifiers was longer than using 30 classifiers. Comprehensive consideration of prediction time and accuracy, here, we chose to use 30 weak classifiers to form the final strong classifier.), then the training process was end. The next step was to score the sample, and the score of the jth sample was defined as follows:</p>
<disp-formula id="M8"><label>(8)</label><mml:math id="E8"><mml:mrow><mml:mi>s</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>j</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle='true'><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mtext>i=0</mml:mtext></mml:mrow><mml:mrow><mml:mn>29</mml:mn></mml:mrow></mml:munderover><mml:mrow><mml:msub><mml:mi>&#x03B1;</mml:mi><mml:mtext>i</mml:mtext></mml:msub><mml:mi>H</mml:mi><mml:msub><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:mi>i</mml:mi><mml:mo stretchy='false'>)</mml:mo></mml:mrow><mml:mtext>j</mml:mtext></mml:msub></mml:mrow></mml:mstyle></mml:mrow></mml:math></disp-formula>
<p>Here, <italic>HH</italic>(<italic>i</italic>)<sub>j</sub> was the score scored by the ith weak classifier for the jth sample. That was to say, the score of the sample was equal to the sum of the product of the sample&#x2019;s goal scored by weak classifier and the corresponding weight (The corresponding data and code had been submitted to the website<sup><xref ref-type="fn" rid="fn02">2</xref></sup>).</p>
</sec>
</sec>
<sec><title>Results</title>
<sec><title>Performance Evaluation</title>
<p>In order to verify the prediction performance of ABHMDA, we implemented global and local LOOCV for our model based on the database HMDAD (<xref ref-type="bibr" rid="B33">Ma et al., 2017</xref>) which recorded 450 known associations between 39 diseases and 292 miRNAs. Specifically, each of the 450 samples (positive samples) with known association was left out in turn as a test sample while the remaining 449 were used for model training, while all of the samples without known associations were considered as candidate samples (unknown samples). In global LOOCV, we sorted the test sample with all candidate samples based on the score marked by calculation model, while the test sample was ranked with the candidate samples that contained the same disease as the test sample in local LOOCV. We evaluated the prediction performance of models based on the AUC value of the LOOCV. To be specific, only the test sample ranked above a certain threshold, could it be considered as a correct prediction, and then we set the true positive rate (TPR, sensitivity) as the horizontal axis and the false positive rate (FPR, 1-specificity) as the vertical axis. Therefore, we could plot the Receiver operating characteristics (ROC) curve, which was composed of points corresponding to different thresholds, then we could obtain the Area under the ROC curve (AUC). A model with an AUC value equal to 0.5 was equivalent to a random prediction. When the AUC took the maximum value of 1, the model had excellent prediction performance. In other words, when the value of AUC was greater than 0.5 and less than 1, the larger the value was, the better the prediction performance of the model would be.</p>
<p>As shown in <bold>Figure <xref ref-type="fig" rid="F2">2</xref></bold>, the global LOOCV value of ABHMDA was 0.8869, which was significantly larger than that of KATZHMDA (0.8644) and LRLSHMDA (0.8843). What was more, the local LOOCV value of our model reached 0.7910, which was also obviously better than KATZHMDA (0.6998) and LRLSHMDA (0.7508). These results confirmed the superior prediction performance of ABHMDA</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>Comparison of prediction performance of ABHMDA with two other computational models (KATZHMDA, LRLSHMDA) in terms of ROC curves and AUCs values based on global and local LOOCV. As shown in the results, the global and local LOOCV values of ABHMDA were 0.8869 and 0.7910, respectively, which were significantly larger than that of KATZHMDA (0.8644, 0.6998) and LRLSHMDA (0.8843, 0.7508).</p></caption>
<graphic xlink:href="fmicb-09-02440-g002.tif"/>
</fig>
</sec>
<sec><title>Case Study</title>
<p>In order to further assess the prediction ability of ABHMDA, we implemented two case studies on some important diseases of human. In the first kind, there were 10938 unknown samples about 39 diseases and 292 miRNAs in HMDAD. We sorted and ranked all unknown samples corresponding to the same disease and verified whether the association between the top 10 microbes and the disease studied was verified by the relevant literature. In the second kind, we converted all 1 in the adjacency matrix A to 0 and sorted all the samples (positive and unknown samples) corresponding to the same disease and then verified the association between disease and the 10 microbes most likely associated with it predicted by the model in the database HMDAD. In other words, the purpose of the second case study was to verify our model&#x2019;s power to predict microbes associated with new diseases without any known related microbes. Here, we implemented the first case study on asthma, Colorectal carcinoma, and the second case on Type 1 diabetes.</p>
<p>As an inflammatory disease on the airway, it was very difficult to completely cure asthma under current medical conditions (<xref ref-type="bibr" rid="B36">Preston et al., 2007</xref>). According to statistics, there were about 300 million asthma patients worldwide, and in recent years its morbidity and mortality had also increased rapidly, especially in developing countries (<xref ref-type="bibr" rid="B38">Sagar et al., 2014</xref>). Therefore, a deeper study of asthma was imperative, and studies had shown that there was a close relationship between the microbes in the respiratory tract and the development and progression of asthma (<xref ref-type="bibr" rid="B34">Marri et al., 2013</xref>). For example, studies had shown that Firmicutes was reduced in asthmatic patients compared with normal humans (<xref ref-type="bibr" rid="B47">Wu et al., 2018</xref>). In contrast, Proteobacteria accounted for a larger proportion of microorganisms in asthma patients than normal people (<xref ref-type="bibr" rid="B34">Marri et al., 2013</xref>). What&#x2019;s more, there was evidence that when the hypopharyngeal area of Neonates was infected with Streptococcus pneumoniae, the risk of developing asthma was increased compared to uninfected (<xref ref-type="bibr" rid="B5">Bisgaard et al., 2007</xref>). We implemented the first case study of asthma and the 10 microbes predicted to be most relevant to asthma were all verified by literatures. For instance, the experimental results showed that the abundance of Lachnospiraceae (First in the prediction list) in asthma patients was 1.9 times that of normal people (<xref ref-type="bibr" rid="B28">Jung et al., 2016</xref>). The researchers found that the relative abundance of Veillonella (Second in prediction list) in infants at risk of asthma was significantly lower than in normal people, and inoculation of sterile mice with Veillonella could improve its airway inflammation, which provided new ideas for the treatment of asthma (<xref ref-type="bibr" rid="B2">Arrieta et al., 2015</xref>). Moreover, there was evidence that if there was Clostridium coccoides (Third in prediction list) in a 3 week old baby&#x2019;s stool, he was at risk of developing asthma, so Clostridium coccoides may become an early diagnostic target for asthma (<xref ref-type="bibr" rid="B45">Vael et al., 2011</xref>; See <bold>Table <xref ref-type="table" rid="T1">1</xref></bold>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>The 10 microbes predicted to be most likely to be associated with the Asthma.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">microbe</th>
<th valign="top" align="left">Evidece</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Lachnospiraceae</td>
<td valign="top" align="left">PMID: 27433177</td>
</tr>
<tr>
<td valign="top" align="left">Veillonella</td>
<td valign="top" align="left">PMID: 26424567</td></tr>
<tr>
<td valign="top" align="left">Clostridium coccoides</td>
<td valign="top" align="left">PMID: 21477358</td>
</tr>
<tr>
<td valign="top" align="left">Firmicutes</td>
<td valign="top" align="left">PMID: 23265859</td></tr>
<tr>
<td valign="top" align="left">Streptococcus</td>
<td valign="top" align="left">PMID: 17928596</td>
</tr>
<tr>
<td valign="top" align="left">Actinobacteria</td>
<td valign="top" align="left">PMID: 23265859</td></tr>
<tr>
<td valign="top" align="left">Lactobacillus</td>
<td valign="top" align="left">PMID: 20592920</td>
</tr>
<tr>
<td valign="top" align="left">Bacteroides uniformis</td>
<td valign="top" align="left">PMID: 27433177</td></tr>
<tr>
<td valign="top" align="left">Enterococcus</td>
<td valign="top" align="left">PMID: 22641478</td>
</tr>
<tr>
<td valign="top" align="left">Escherichia coli</td>
<td valign="top" align="left">PMID: 26277095</td></tr>
</tbody></table>
<table-wrap-foot>
<attrib><italic>The first column records the top 10 microbes most likely to be related Asthma, and the second column records the databases and experimental literatures in PubMed, which verify the associations between the corresponding microbe and Asthma.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<p>To facilitate further research and validation, we provided a ranking of the relevant probabilities for all pairs of disease-microbe pairs without confirmed association (See <bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM1">S1</xref></bold>).</p>
<p>Colorectal carcinoma (CRC) was a common gastrointestinal malignant tumor in China (<xref ref-type="bibr" rid="B48">Xue et al., 2014</xref>). As one of the top cancers with the highest morbidity and mortality worldwide, it was estimated that there were approximately one million new cases of CRC and 500000 deaths per year (<xref ref-type="bibr" rid="B43">Sun et al., 2013</xref>). What was more serious was that its incidence would continue to increase in the next few decades, and the survival rate in 5 years was less than 60% (<xref ref-type="bibr" rid="B44">Sun et al., 2011</xref>). Therefore, it was necessary to study the pathogenesis of CRC to explored new treatment methods, and studies had shown that microbes played an important role in the development and progression of cancer that were closely related to inflammation like CRC (<xref ref-type="bibr" rid="B32">Liang et al., 2014</xref>). For example, there were studies showing that the number of Lactobacillus hamster increased significantly during the formation of CRC (<xref ref-type="bibr" rid="B32">Liang et al., 2014</xref>). The researchers compared CRC cases with the normal control group and found that the relative abundance of phylum Bacteroidetes in the case group reached 16.2%, which was much higher than 9.9% of the normal group (<xref ref-type="bibr" rid="B1">Ahn et al., 2013</xref>). We applied ABHMDA to implement the first case study on CRC, and the 10 predicted microorganisms most likely to be associated with CRC were all verified by related literature in PubMed. There was evidence that the relative abundance of Veillonella (First in the prediction list) in CRC cancer tissues was 2.87% and only 0.68% in the intestinal lumen (<xref ref-type="bibr" rid="B6">Chen et al., 2012</xref>). Pyogenic liver abscess was identified as an early manifestation of adult CRC, and an 11-year follow-up study showed that pyogenic liver abscess patients with Klebsiella (Second in the prediction list) pneumoniae had a higher probability of having CRC than those without (<xref ref-type="bibr" rid="B24">Huang et al., 2012</xref>). What was more, there were studies showing that Enterobacteriaceae (Third in the prediction list) was very rich in CRC patients (<xref ref-type="bibr" rid="B3">Arthur et al., 2014</xref>). From the above results, it could be seen that the predicted performance of ABHMDA was very reliable (See <bold>Table <xref ref-type="table" rid="T2">2</xref></bold>).</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>The 10 microbes predicted to be most likely to be associated with the Colorectal carcinoma.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">microbe</th>
<th valign="top" align="left">Evidece</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Veillonella</td>
<td valign="top" align="left">PMID: 22761885</td>
</tr>
<tr>
<td valign="top" align="left">Klebsiella</td>
<td valign="top" align="left">PMID: 22776247</td></tr>
<tr>
<td valign="top" align="left">Enterobacteriaceae</td>
<td valign="top" align="left">PMID: 25182170</td>
</tr>
<tr>
<td valign="top" align="left">Proteobacteria</td>
<td valign="top" align="left">PMID: 24603888</td></tr>
<tr>
<td valign="top" align="left">Lachnospiraceae</td>
<td valign="top" align="left">PMID: 21850056</td>
</tr>
<tr>
<td valign="top" align="left">Clostridium coccoides</td>
<td valign="top" align="left">PMID: 19807912</td></tr>
<tr>
<td valign="top" align="left">Streptococcus</td>
<td valign="top" align="left">PMID: 21247505</td>
</tr>
<tr>
<td valign="top" align="left">Actinobacteria</td>
<td valign="top" align="left">PMID: 24316595</td></tr>
<tr>
<td valign="top" align="left">Lactobacillus</td>
<td valign="top" align="left">PMID: 15828052</td>
</tr>
<tr>
<td valign="top" align="left">Bacteroides uniformis</td>
<td valign="top" align="left">PMID: 24828543</td></tr>
</tbody></table>
<table-wrap-foot>
<attrib><italic>The first column records the top 10 microbes most likely to be related Colorectal carcinoma, and the second column records the databases and experimental literatures in PubMed, which verify the associations between the corresponding microbe and Colorectal carcinoma.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<p>Type 1 diabetes was an autoimmune disease which resulted from the immune-mediated destruction of insulin-producing pancreatic &#x03B2; cells (<xref ref-type="bibr" rid="B31">Li et al., 2014</xref>). The incidence of Type 1 diabetes was increasing globally, but the proportion of patients suffering from genetic factors was decreasing, which suggested that the virus, nutrition, and overweight were very likely to have become the main cause of Type 1 diabetes (<xref ref-type="bibr" rid="B27">Islam et al., 2014</xref>). Studies had shown that the abnormality in the gut microbiota was closely related to the development of Type 1 diabetes (<xref ref-type="bibr" rid="B19">De Goffau et al., 2014</xref>). The number of Firmicutes and Actinomycetes were significantly reduced in children with Type 1 diabetes compared with normal people (<xref ref-type="bibr" rid="B35">Murri et al., 2013</xref>). We conducted the second case study on Type 1 diabetes to test the prediction power of ABHMDA to predict the potential microbe-related of new diseases, and the results showed that 7 of the top 10 potential disease-related microbes predicted were validated by the database HMDAD. The associations between Type 1 diabetes and microbe Veillonella (First in the prediction list) with Bacteroidaceae (Second in the prediction list) were confirmed by HANDAD. Some researchers had found that patients with Type 1 diabetes had increased colonization of Enterobacteriaceae (Third in the prediction list) in addition to Escherichia coli compared with normal people (<xref ref-type="bibr" rid="B42">Soyucen et al., 2014</xref>). The above results indicated that ABHMDA&#x2019;s ability to predict microbes associated with new diseases was also reliable (See <bold>Table <xref ref-type="table" rid="T3">3</xref></bold>).</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>The 10 microbes predicted to be most likely to be associated with the Type 1 diabetes.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">microbe</th>
<th valign="top" align="left">Evidece</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Veillonella</td>
<td valign="top" align="left">confirmed</td>
</tr>
<tr>
<td valign="top" align="left">Bacteroidaceae</td>
<td valign="top" align="left">confirmed</td></tr>
<tr>
<td valign="top" align="left">Enterobacteriaceae</td>
<td valign="top" align="left">PMID: 24475780</td>
</tr>
<tr>
<td valign="top" align="left">Coxiellaceae</td>
<td valign="top" align="left">unconfirmed</td></tr>
<tr>
<td valign="top" align="left">Prevotella</td>
<td valign="top" align="left">confirmed</td>
</tr>
<tr>
<td valign="top" align="left">Bacteroidetes</td>
<td valign="top" align="left">confirmed</td></tr>
<tr>
<td valign="top" align="left">Prevotella copri</td>
<td valign="top" align="left">unconfirmed</td>
</tr>
<tr>
<td valign="top" align="left">Lachnospiraceae</td>
<td valign="top" align="left">confirmed</td>
</tr>
<tr>
<td valign="top" align="left">Lactobacillus</td>
<td valign="top" align="left">confirmed</td></tr>
<tr>
<td valign="top" align="left">Clostridia</td>
<td valign="top" align="left">confirmed</td></tr>
</tbody></table>
<table-wrap-foot>
<attrib><italic>The first column records the top 10 microbes most likely to be related Type 1 diabetes, and the second column records the databases and experimental literatures in PubMed, which verify the associations between the corresponding microbe and Type 1 diabetes.</italic></attrib>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec><title>Discussion</title>
<p>As a kind of tiny creature that are invisible to the human eyes, the microbes are small in size and simple in structure, but they are closely related to human beings. There are thousands of microbes in the human body. They build complex functional institutions and play an extremely important role in many biological processes, although they can benefit people, they can also bring a lot of trouble to human beings, such as diseases. More and more research shows that many human diseases are closely related to microorganisms, especially gastrointestinal diseases. Revealing the relation between disease and microbes contributes to further understand the pathogenesis of the disease and the development of new drugs (<xref ref-type="bibr" rid="B12">Chen et al., 2016b</xref>, <xref ref-type="bibr" rid="B8">2017a</xref>). However, due to limited technology, the cost of using experimental methods to reveal disease-related microbes is greater. Therefore, it is imperative to construct model for the prediction of potentially relevant microbes. In this paper, we proposed a novel model ABHMDA to reveal the association between disease and microbes. The global and local LOOCV value of ABHMDA was 0.8869 and 0.7910, respectively, which was significantly larger than that of KATZHMDA (0.8644, 0.6998) and LRLSHMDA (0.8843, 0.7508). This result confirmed the strong prediction power of ABHMDA.</p>
<p>Several factors that led to ABHMDA prediction performance were summarized as follows. Firstly, the datasets used by our model were relatively reliable. Secondly, we extracted the potential similarities for diseases and microbes through Gaussian interaction profile kernel similarity. Thirdly, we combined multiple weak classifiers into one strong classifier according to different weights to score the samples. The high-precision weak classifiers accounted for a high proportion and vice versa, which conduced to improve the accuracy of the strong classifier. Of course, ABHMDA also had some defects that needed to be resolved in future work. Firstly, although the prediction performance of ABHMDA had improved compared to previous methods, prediction capabilities were expected to improve further if more reliable similarities were considered. Many groups have developed several effective computational models for the association prediction (<xref ref-type="bibr" rid="B16">Chen and Yan, 2013</xref>; <xref ref-type="bibr" rid="B9">Chen et al., 2016a</xref>; <xref ref-type="bibr" rid="B7">Chen and Huang, 2017</xref>; <xref ref-type="bibr" rid="B49">You et al., 2017</xref>; <xref ref-type="bibr" rid="B13">Chen et al., 2018a</xref>,<xref ref-type="bibr" rid="B14">b</xref>,<xref ref-type="bibr" rid="B15">c</xref>). We would introduce these reliable techniques to this new research area. Secondly, ABHMDA might cause bias to microbes with more associated diseases. Finally, the model did not consider the microbe&#x2013;microbe similarity based on sequence similarity, which was also where we needed to improve in our future work (<xref ref-type="bibr" rid="B10">Chen et al., 2017b</xref>,<xref ref-type="bibr" rid="B11">c</xref>; <xref ref-type="bibr" rid="B23">Hu et al., 2018</xref>; <xref ref-type="bibr" rid="B50">Zhao et al., 2018</xref>).</p>
</sec>
<sec><title>Author Contributions</title>
<p>L-HP and JY implemented the experiments, analyzed the result, and wrote the paper. LZ and M-XL analyzed the result and revised the paper. YZ conceived the project, developed the prediction method, designed the experiments, analyzed the result, and revised the paper. All authors read and approved the final manuscript.</p>
</sec>
<sec><title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="financial-disclosure">
<p><bold>Funding.</bold> JY and YZ was supported by National Natural Science Foundation of China under grant no. 61772531. L-HP was supported by 61803151 and Natural Science Foundation of Hunan province under grant no. 2018JJ3570. M-XL was supported by the China Postdoctoral Science Foundation.</p>
</fn>
</fn-group>
<sec sec-type="supplementary material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmicb.2018.02440/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmicb.2018.02440/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.DOCX" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_2.XLSX" id="SM2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ahn</surname> <given-names>J.</given-names></name> <name><surname>Sinha</surname> <given-names>R.</given-names></name> <name><surname>Pei</surname> <given-names>Z.</given-names></name> <name><surname>Dominianni</surname> <given-names>C.</given-names></name> <name><surname>Wu</surname> <given-names>J.</given-names></name> <name><surname>Shi</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Human gut microbiome and risk for colorectal cancer.</article-title> <source><italic>J. Natl. Cancer Inst.</italic></source> <volume>105</volume> <fpage>1907</fpage>&#x2013;<lpage>1911</lpage>. <pub-id pub-id-type="doi">10.1093/jnci/djt300</pub-id> <pub-id pub-id-type="pmid">24316595</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arrieta</surname> <given-names>M. C.</given-names></name> <name><surname>Stiemsma</surname> <given-names>L. T.</given-names></name> <name><surname>Dimitriu</surname> <given-names>P. A.</given-names></name> <name><surname>Thorson</surname> <given-names>L.</given-names></name> <name><surname>Russell</surname> <given-names>S.</given-names></name> <name><surname>Yurist-Doutsch</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Early infancy microbial and metabolic alterations affect risk of childhood asthma.</article-title> <source><italic>Sci. Transl. Med.</italic></source> <volume>7</volume>:<issue>307ra152</issue>. <pub-id pub-id-type="doi">10.1126/scitranslmed.aab2271</pub-id> <pub-id pub-id-type="pmid">26424567</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arthur</surname> <given-names>J. C.</given-names></name> <name><surname>Gharaibeh</surname> <given-names>R. Z.</given-names></name> <name><surname>Muhlbauer</surname> <given-names>M.</given-names></name> <name><surname>Perez-Chanona</surname> <given-names>E.</given-names></name> <name><surname>Uronis</surname> <given-names>J. M.</given-names></name> <name><surname>Mccafferty</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Microbial genomic analysis reveals the essential role of inflammation in bacteria-induced colorectal cancer.</article-title> <source><italic>Nat. Commun.</italic></source> <volume>5</volume>:<issue>4724</issue>. <pub-id pub-id-type="doi">10.1038/ncomms5724</pub-id> <pub-id pub-id-type="pmid">25182170</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Belcheva</surname> <given-names>A.</given-names></name> <name><surname>Irrazabal</surname> <given-names>T.</given-names></name> <name><surname>Robertson</surname> <given-names>S. J.</given-names></name> <name><surname>Streutker</surname> <given-names>C.</given-names></name> <name><surname>Maughan</surname> <given-names>H.</given-names></name> <name><surname>Rubino</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Gut microbial metabolism drives transformation of MSH2-deficient colon epithelial cells.</article-title> <source><italic>Cell</italic></source> <volume>158</volume> <fpage>288</fpage>&#x2013;<lpage>299</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2014.04.051</pub-id> <pub-id pub-id-type="pmid">25036629</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bisgaard</surname> <given-names>H.</given-names></name> <name><surname>Hermansen</surname> <given-names>M. N.</given-names></name> <name><surname>Buchvald</surname> <given-names>F.</given-names></name> <name><surname>Loland</surname> <given-names>L.</given-names></name> <name><surname>Halkjaer</surname> <given-names>L. B.</given-names></name> <name><surname>Bonnelykke</surname> <given-names>K.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>Childhood asthma after bacterial colonization of the airway in neonates.</article-title> <source><italic>N. Engl. J. Med.</italic></source> <volume>357</volume> <fpage>1487</fpage>&#x2013;<lpage>1495</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMoa052632</pub-id> <pub-id pub-id-type="pmid">17928596</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>W.</given-names></name> <name><surname>Liu</surname> <given-names>F.</given-names></name> <name><surname>Ling</surname> <given-names>Z.</given-names></name> <name><surname>Tong</surname> <given-names>X.</given-names></name> <name><surname>Xiang</surname> <given-names>C.</given-names></name></person-group> (<year>2012</year>). <article-title>Human intestinal lumen and mucosa-associated microbiota in patients with colorectal cancer.</article-title> <source><italic>PLoS One</italic></source> <volume>7</volume>:<issue>e39743</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0039743</pub-id> <pub-id pub-id-type="pmid">22761885</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Huang</surname> <given-names>L.</given-names></name></person-group> (<year>2017</year>). <article-title>LRSSLMDA: laplacian regularized sparse subspace learning for MiRNA-disease association prediction.</article-title> <source><italic>PLoS Comput. Biol.</italic></source> <volume>13</volume>:<issue>e1005912</issue>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1005912</pub-id> <pub-id pub-id-type="pmid">29253885</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Huang</surname> <given-names>Y. A.</given-names></name> <name><surname>You</surname> <given-names>Z. H.</given-names></name> <name><surname>Yan</surname> <given-names>G. Y.</given-names></name> <name><surname>Wang</surname> <given-names>X. S.</given-names></name></person-group> (<year>2017a</year>). <article-title>A novel approach based on KATZ measure to predict associations of human microbiota with non-infectious diseases.</article-title> <source><italic>Bioinformatics</italic></source> <volume>33</volume> <fpage>733</fpage>&#x2013;<lpage>739</lpage>.</citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Ren</surname> <given-names>B.</given-names></name> <name><surname>Chen</surname> <given-names>M.</given-names></name> <name><surname>Wang</surname> <given-names>Q.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Yan</surname> <given-names>G.</given-names></name></person-group> (<year>2016a</year>). <article-title>NLLSS: predicting synergistic drug combinations based on semi-supervised learning.</article-title> <source><italic>PLoS Comput. Biol.</italic></source> <volume>12</volume>:<issue>e1004975</issue>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1004975</pub-id> <pub-id pub-id-type="pmid">27415801</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Xie</surname> <given-names>D.</given-names></name> <name><surname>Zhao</surname> <given-names>Q.</given-names></name> <name><surname>You</surname> <given-names>Z. H.</given-names></name></person-group> (<year>2017b</year>). <article-title>MicroRNAs and complex diseases: from experimental results to computational models.</article-title> <source><italic>Brief. Bioinform.</italic></source> <pub-id pub-id-type="doi">10.1093/bib/bbx130</pub-id> [Epub ahead of print]. <pub-id pub-id-type="pmid">29045685</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Yan</surname> <given-names>C. C.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>You</surname> <given-names>Z. H.</given-names></name></person-group> (<year>2017c</year>). <article-title>Long non-coding RNAs and complex diseases: from experimental results to computational models.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>18</volume> <fpage>558</fpage>&#x2013;<lpage>576</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbw060</pub-id> <pub-id pub-id-type="pmid">27345524</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Yan</surname> <given-names>C. C.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Dai</surname> <given-names>F.</given-names></name> <name><surname>Yin</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2016b</year>). <article-title>Drug-target interaction prediction: databases, web servers and computational models.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>17</volume> <fpage>696</fpage>&#x2013;<lpage>712</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbv066</pub-id> <pub-id pub-id-type="pmid">26283676</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Qu</surname> <given-names>J.</given-names></name> <name><surname>Guan</surname> <given-names>N. N.</given-names></name> <name><surname>Li</surname> <given-names>J. Q.</given-names></name></person-group> (<year>2018a</year>). <article-title>Predicting miRNA-disease association based on inductive matrix completion.</article-title> <source><italic>Bioinformatics</italic></source> <pub-id pub-id-type="doi">10.1093/bioinformatics/bty503</pub-id> [Epub ahead of print]. <pub-id pub-id-type="pmid">29939227</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Xie</surname> <given-names>D.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Zhao</surname> <given-names>Q.</given-names></name> <name><surname>You</surname> <given-names>Z. H.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name></person-group> (<year>2018b</year>). <article-title>BNPMDA: bipartite network projection for mirna-disease association prediction.</article-title> <source><italic>Bioinformatics</italic></source> <volume>34</volume> <fpage>3178</fpage>&#x2013;<lpage>3186</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bty333</pub-id> <pub-id pub-id-type="pmid">29701758</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Yin</surname> <given-names>J.</given-names></name> <name><surname>Qu</surname> <given-names>J.</given-names></name> <name><surname>Huang</surname> <given-names>L.</given-names></name></person-group> (<year>2018c</year>). <article-title>MDHGI: matrix decomposition and heterogeneous graph inference for miRNA-disease association prediction.</article-title> <source><italic>PLoS Comput. Biol.</italic></source> <volume>14</volume>:<issue>e1006418</issue>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1006418</pub-id> <pub-id pub-id-type="pmid">30142158</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Yan</surname> <given-names>G. Y.</given-names></name></person-group> (<year>2013</year>). <article-title>Novel human lncRNA-disease association inference based on lncRNA expression profiles.</article-title> <source><italic>Bioinformatics</italic></source> <volume>29</volume> <fpage>2617</fpage>&#x2013;<lpage>2624</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btt426</pub-id> <pub-id pub-id-type="pmid">24002109</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cole</surname> <given-names>P.</given-names></name></person-group> (<year>1989</year>). <article-title>Host-microbe relationships in chronic respiratory infection.</article-title> <source><italic>Respiration</italic></source> <volume>55</volume><issue>(Suppl. 1)</issue>, <fpage>5</fpage>&#x2013;<lpage>8</lpage>. <pub-id pub-id-type="doi">10.1159/000195745</pub-id> <pub-id pub-id-type="pmid">2682870</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Consortium</surname> <given-names>H. M. P.</given-names></name></person-group> (<year>2012</year>). <article-title>A framework for human microbiome research.</article-title> <source><italic>Nature</italic></source> <volume>486</volume> <fpage>215</fpage>&#x2013;<lpage>221</lpage>. <pub-id pub-id-type="doi">10.1038/nature11209</pub-id> <pub-id pub-id-type="pmid">22699610</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>De Goffau</surname> <given-names>M. C.</given-names></name> <name><surname>Fuentes</surname> <given-names>S.</given-names></name> <name><surname>Van Den Bogert</surname> <given-names>B.</given-names></name> <name><surname>Honkanen</surname> <given-names>H.</given-names></name> <name><surname>De Vos</surname> <given-names>W. M.</given-names></name> <name><surname>Welling</surname> <given-names>G. W.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Aberrant gut microbiota composition at the onset of type 1 diabetes in young children.</article-title> <source><italic>Diabetologia</italic></source> <volume>57</volume> <fpage>1569</fpage>&#x2013;<lpage>1577</lpage>. <pub-id pub-id-type="doi">10.1007/s00125-014-3274-0</pub-id> <pub-id pub-id-type="pmid">24930037</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fredricks</surname> <given-names>D. N.</given-names></name></person-group> (<year>2001</year>). <article-title>Microbial ecology of human skin in health and disease.</article-title> <source><italic>J. Investig. Dermatol. Symp. Proc.</italic></source> <volume>6</volume> <fpage>167</fpage>&#x2013;<lpage>169</lpage>. <pub-id pub-id-type="doi">10.1046/j.0022-202x.2001.00039.x</pub-id> <pub-id pub-id-type="pmid">11924822</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gill</surname> <given-names>S. R.</given-names></name> <name><surname>Pop</surname> <given-names>M.</given-names></name> <name><surname>Deboy</surname> <given-names>R. T.</given-names></name> <name><surname>Eckburg</surname> <given-names>P. B.</given-names></name> <name><surname>Turnbaugh</surname> <given-names>P. J.</given-names></name> <name><surname>Samuel</surname> <given-names>B. S.</given-names></name><etal/></person-group> (<year>2006</year>). <article-title>Metagenomic analysis of the human distal gut microbiome.</article-title> <source><italic>Science</italic></source> <volume>312</volume> <fpage>1355</fpage>&#x2013;<lpage>1359</lpage>. <pub-id pub-id-type="doi">10.1126/science.1124234</pub-id> <pub-id pub-id-type="pmid">16741115</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Grenham</surname> <given-names>S.</given-names></name> <name><surname>Clarke</surname> <given-names>G.</given-names></name> <name><surname>Cryan</surname> <given-names>J. F.</given-names></name> <name><surname>Dinan</surname> <given-names>T. G.</given-names></name></person-group> (<year>2011</year>). <article-title>Brain-gut-microbe communication in health and disease.</article-title> <source><italic>Front. Physiol.</italic></source> <volume>2</volume>:<issue>94</issue>. <pub-id pub-id-type="doi">10.3389/fphys.2011.00094</pub-id> <pub-id pub-id-type="pmid">22162969</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hu</surname> <given-names>H.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Ai</surname> <given-names>H.</given-names></name> <name><surname>Zhang</surname> <given-names>H.</given-names></name> <name><surname>Fan</surname> <given-names>Y.</given-names></name> <name><surname>Zhao</surname> <given-names>Q.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>HLPI-ensemble: prediction of human lncRNA-protein interactions based on ensemble strategy.</article-title> <source><italic>RNA Biol.</italic></source> <volume>15</volume> <fpage>797</fpage>&#x2013;<lpage>806</lpage>. <pub-id pub-id-type="doi">10.1080/15476286.2018.1457935</pub-id> <pub-id pub-id-type="pmid">29583068</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>W. K.</given-names></name> <name><surname>Chang</surname> <given-names>J. W.</given-names></name> <name><surname>See</surname> <given-names>L. C.</given-names></name> <name><surname>Tu</surname> <given-names>H. T.</given-names></name> <name><surname>Chen</surname> <given-names>J. S.</given-names></name> <name><surname>Liaw</surname> <given-names>C. C.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Higher rate of colorectal cancer among patients with pyogenic liver abscess with <italic>Klebsiella pneumoniae</italic> than those without: an 11-year follow-up study.</article-title> <source><italic>Colorectal Dis.</italic></source> <volume>14</volume> <fpage>e794</fpage>&#x2013;<lpage>e801</lpage>. <pub-id pub-id-type="doi">10.1111/j.1463-1318.2012.03174.x</pub-id> <pub-id pub-id-type="pmid">22776247</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>Y. A.</given-names></name> <name><surname>You</surname> <given-names>Z. H.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Huang</surname> <given-names>Z. A.</given-names></name> <name><surname>Zhang</surname> <given-names>S.</given-names></name> <name><surname>Yan</surname> <given-names>G. Y.</given-names></name></person-group> (<year>2017</year>). <article-title>Prediction of microbe-disease association from the integration of neighbor and graph with collaborative recommendation model.</article-title> <source><italic>J. Transl. Med.</italic></source> <volume>15</volume>:<issue>209</issue>. <pub-id pub-id-type="doi">10.1186/s12967-017-1304-7</pub-id> <pub-id pub-id-type="pmid">29037244</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>Z. A.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Zhu</surname> <given-names>Z.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Yan</surname> <given-names>G. Y.</given-names></name> <name><surname>You</surname> <given-names>Z. H.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>PBHMDA: path-based human microbe-disease association prediction.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>8</volume>:<issue>233</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2017.00233</pub-id> <pub-id pub-id-type="pmid">28275370</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Islam</surname> <given-names>S. T.</given-names></name> <name><surname>Srinivasan</surname> <given-names>S.</given-names></name> <name><surname>Craig</surname> <given-names>M. E.</given-names></name></person-group> (<year>2014</year>). <article-title>Environmental determinants of type 1 diabetes: a role for overweight and insulin resistance.</article-title> <source><italic>J. Paediatr. Child Health</italic></source> <volume>50</volume> <fpage>874</fpage>&#x2013;<lpage>879</lpage>. <pub-id pub-id-type="doi">10.1111/jpc.12616</pub-id> <pub-id pub-id-type="pmid">24893825</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jung</surname> <given-names>J. W.</given-names></name> <name><surname>Choi</surname> <given-names>J. C.</given-names></name> <name><surname>Shin</surname> <given-names>J. W.</given-names></name> <name><surname>Kim</surname> <given-names>J. Y.</given-names></name> <name><surname>Park</surname> <given-names>I. W.</given-names></name> <name><surname>Choi</surname> <given-names>B. W.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Lung microbiome analysis in steroid-nasmall yi, ukrainianve asthma patients by using whole sputum.</article-title> <source><italic>Tuberc. Respir. Dis.</italic></source> <volume>79</volume> <fpage>165</fpage>&#x2013;<lpage>178</lpage>. <pub-id pub-id-type="doi">10.4046/trd.2016.79.3.165</pub-id> <pub-id pub-id-type="pmid">27433177</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Laarhoven</surname> <given-names>T. V.</given-names></name> <name><surname>Nabuurs</surname> <given-names>S. B.</given-names></name> <name><surname>Marchiori</surname> <given-names>E.</given-names></name></person-group> (<year>2011</year>). <article-title>Gaussian interaction profile kernels for predicting drug&#x2013;target interaction.</article-title> <source><italic>Bioinformatics</italic></source> <volume>27</volume><fpage>3036</fpage>&#x2013;<lpage>3043</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btr500</pub-id> <pub-id pub-id-type="pmid">21893517</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Larsen</surname> <given-names>N.</given-names></name> <name><surname>Vogensen</surname> <given-names>F. K.</given-names></name> <name><surname>Van Den Berg</surname> <given-names>F. W.</given-names></name> <name><surname>Nielsen</surname> <given-names>D. S.</given-names></name> <name><surname>Andreasen</surname> <given-names>A. S.</given-names></name> <name><surname>Pedersen</surname> <given-names>B. K.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>Gut microbiota in human adults with type 2 diabetes differs from non-diabetic adults.</article-title> <source><italic>PLoS One</italic></source> <volume>5</volume>:<issue>e9085</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0009085</pub-id> <pub-id pub-id-type="pmid">20140211</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>M.</given-names></name> <name><surname>Song</surname> <given-names>L. J.</given-names></name> <name><surname>Qin</surname> <given-names>X. Y.</given-names></name></person-group> (<year>2014</year>). <article-title>Advances in the cellular immunological pathogenesis of type 1 diabetes.</article-title> <source><italic>J. Cell Mol. Med.</italic></source> <volume>18</volume> <fpage>749</fpage>&#x2013;<lpage>758</lpage>. <pub-id pub-id-type="doi">10.1111/jcmm.12270</pub-id> <pub-id pub-id-type="pmid">24629100</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liang</surname> <given-names>X.</given-names></name> <name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Tian</surname> <given-names>G.</given-names></name> <name><surname>Li</surname> <given-names>S.</given-names></name></person-group> (<year>2014</year>). <article-title>Dynamic microbe and molecule networks in a mouse model of colitis-associated colorectal cancer.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>4</volume>:<issue>4985</issue>. <pub-id pub-id-type="doi">10.1038/srep04985</pub-id> <pub-id pub-id-type="pmid">24828543</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ma</surname> <given-names>W.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Zeng</surname> <given-names>P.</given-names></name> <name><surname>Huang</surname> <given-names>C.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Geng</surname> <given-names>B.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>An analysis of human microbe-disease associations.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>18</volume> <fpage>85</fpage>&#x2013;<lpage>97</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbw005</pub-id> <pub-id pub-id-type="pmid">26883326</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marri</surname> <given-names>P. R.</given-names></name> <name><surname>Stern</surname> <given-names>D. A.</given-names></name> <name><surname>Wright</surname> <given-names>A. L.</given-names></name> <name><surname>Billheimer</surname> <given-names>D.</given-names></name> <name><surname>Martinez</surname> <given-names>F. D.</given-names></name></person-group> (<year>2013</year>). <article-title>Asthma-associated differences in microbial composition of induced sputum.</article-title> <source><italic>J. Allergy Clin. Immunol.</italic></source> <volume>131</volume> <issue>346</issue>.<fpage>e3</fpage>&#x2013;<lpage>352</lpage>.e3. <pub-id pub-id-type="doi">10.1016/j.jaci.2012.11.013</pub-id> <pub-id pub-id-type="pmid">23265859</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Murri</surname> <given-names>M.</given-names></name> <name><surname>Leiva</surname> <given-names>I.</given-names></name> <name><surname>Gomez-Zumaquero</surname> <given-names>J. M.</given-names></name> <name><surname>Tinahones</surname> <given-names>F. J.</given-names></name> <name><surname>Cardona</surname> <given-names>F.</given-names></name> <name><surname>Soriguer</surname> <given-names>F.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Gut microbiota in children with type 1 diabetes differs from that in healthy children: a case-control study.</article-title> <source><italic>BMC Med.</italic></source> <volume>11</volume>:<issue>46</issue>. <pub-id pub-id-type="doi">10.1186/1741-7015-11-46</pub-id> <pub-id pub-id-type="pmid">23433344</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Preston</surname> <given-names>J. A.</given-names></name> <name><surname>Essilfie</surname> <given-names>A. T.</given-names></name> <name><surname>Horvat</surname> <given-names>J. C.</given-names></name> <name><surname>Wade</surname> <given-names>M. A.</given-names></name> <name><surname>Beagley</surname> <given-names>K. W.</given-names></name> <name><surname>Gibson</surname> <given-names>P. G.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>Inhibition of allergic airways disease by immunomodulatory therapy with whole killed Streptococcus pneumoniae.</article-title> <source><italic>Vaccine</italic></source> <volume>25</volume> <fpage>8154</fpage>&#x2013;<lpage>8162</lpage>. <pub-id pub-id-type="doi">10.1016/j.vaccine.2007.09.034</pub-id> <pub-id pub-id-type="pmid">17950502</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rayhan</surname> <given-names>F.</given-names></name> <name><surname>Ahmed</surname> <given-names>S.</given-names></name> <name><surname>Shatabda</surname> <given-names>S.</given-names></name> <name><surname>Farid</surname> <given-names>D. M.</given-names></name> <name><surname>Mousavian</surname> <given-names>Z.</given-names></name> <name><surname>Dehzangi</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>iDTI-ESBoost: identification of drug target interaction using evolutionary and structural features with boosting.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>7</volume>:<issue>17731</issue>. <pub-id pub-id-type="doi">10.1038/s41598-017-18025-2</pub-id> <pub-id pub-id-type="pmid">29255285</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sagar</surname> <given-names>S.</given-names></name> <name><surname>Morgan</surname> <given-names>M. E.</given-names></name> <name><surname>Chen</surname> <given-names>S.</given-names></name> <name><surname>Vos</surname> <given-names>A. P.</given-names></name> <name><surname>Garssen</surname> <given-names>J.</given-names></name> <name><surname>Van Bergenhenegouwen</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title><italic>Bifidobacterium breve</italic> and <italic>Lactobacillus rhamnosus</italic> treatment is as effective as budesonide at reducing inflammation in a murine model for chronic asthma.</article-title> <source><italic>Respir. Res.</italic></source> <volume>15</volume>:<issue>46</issue>. <pub-id pub-id-type="doi">10.1186/1465-9921-15-46</pub-id> <pub-id pub-id-type="pmid">24735374</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sender</surname> <given-names>R.</given-names></name> <name><surname>Fuchs</surname> <given-names>S.</given-names></name> <name><surname>Milo</surname> <given-names>R.</given-names></name></person-group> (<year>2016</year>). <article-title>Revised estimates for the number of human and bacteria cells in the body.</article-title> <source><italic>PLoS Biol.</italic></source> <volume>14</volume>:<issue>e1002533</issue>. <pub-id pub-id-type="doi">10.1371/journal.pbio.1002533</pub-id> <pub-id pub-id-type="pmid">27541692</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Skov</surname> <given-names>L.</given-names></name> <name><surname>Baadsgaard</surname> <given-names>O.</given-names></name></person-group> (<year>2000</year>). <article-title>Bacterial superantigens and inflammatory skin diseases.</article-title> <source><italic>Clin. Exp. Dermatol.</italic></source> <volume>25</volume> <fpage>57</fpage>&#x2013;<lpage>61</lpage>. <pub-id pub-id-type="doi">10.1046/j.1365-2230.2000.00575.x</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sommer</surname> <given-names>F.</given-names></name> <name><surname>Backhed</surname> <given-names>F.</given-names></name></person-group> (<year>2013</year>). <article-title>The gut microbiota&#x2013;masters of host development and physiology.</article-title> <source><italic>Nat. Rev. Microbiol.</italic></source> <volume>11</volume> <fpage>227</fpage>&#x2013;<lpage>238</lpage>. <pub-id pub-id-type="doi">10.1038/nrmicro2974</pub-id> <pub-id pub-id-type="pmid">23435359</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Soyucen</surname> <given-names>E.</given-names></name> <name><surname>Gulcan</surname> <given-names>A.</given-names></name> <name><surname>Aktuglu-Zeybek</surname> <given-names>A. C.</given-names></name> <name><surname>Onal</surname> <given-names>H.</given-names></name> <name><surname>Kiykim</surname> <given-names>E.</given-names></name> <name><surname>Aydin</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>Differences in the gut microbiota of healthy children and those with type 1 diabetes.</article-title> <source><italic>Pediatr. Int.</italic></source> <volume>56</volume> <fpage>336</fpage>&#x2013;<lpage>343</lpage>. <pub-id pub-id-type="doi">10.1111/ped.12243</pub-id> <pub-id pub-id-type="pmid">24475780</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname> <given-names>K.</given-names></name> <name><surname>Deng</surname> <given-names>H. J.</given-names></name> <name><surname>Lei</surname> <given-names>S. T.</given-names></name> <name><surname>Dong</surname> <given-names>J. Q.</given-names></name> <name><surname>Li</surname> <given-names>G. X.</given-names></name></person-group> (<year>2013</year>). <article-title>miRNA-338-3p suppresses cell growth of human colorectal carcinoma by targeting smoothened.</article-title> <source><italic>World J. Gastroenterol.</italic></source> <volume>19</volume> <fpage>2197</fpage>&#x2013;<lpage>2207</lpage>. <pub-id pub-id-type="doi">10.3748/wjg.v19.i14.2197</pub-id> <pub-id pub-id-type="pmid">23599646</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname> <given-names>K.</given-names></name> <name><surname>Wang</surname> <given-names>W.</given-names></name> <name><surname>Zeng</surname> <given-names>J. J.</given-names></name> <name><surname>Wu</surname> <given-names>C. T.</given-names></name> <name><surname>Lei</surname> <given-names>S. T.</given-names></name> <name><surname>Li</surname> <given-names>G. X.</given-names></name></person-group> (<year>2011</year>). <article-title>MicroRNA-221 inhibits CDKN1C/p57 expression in human colorectal carcinoma.</article-title> <source><italic>Acta Pharmacol. Sin.</italic></source> <volume>32</volume> <fpage>375</fpage>&#x2013;<lpage>384</lpage>. <pub-id pub-id-type="doi">10.1038/aps.2010.206</pub-id> <pub-id pub-id-type="pmid">21278784</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vael</surname> <given-names>C.</given-names></name> <name><surname>Vanheirstraeten</surname> <given-names>L.</given-names></name> <name><surname>Desager</surname> <given-names>K. N.</given-names></name> <name><surname>Goossens</surname> <given-names>H.</given-names></name></person-group> (<year>2011</year>). <article-title>Denaturing gradient gel electrophoresis of neonatal intestinal microbiota in relation to the development of asthma.</article-title> <source><italic>BMC Microbiol.</italic></source> <volume>11</volume>:<issue>68</issue>. <pub-id pub-id-type="doi">10.1186/1471-2180-11-68</pub-id> <pub-id pub-id-type="pmid">21477358</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>F.</given-names></name> <name><surname>Huang</surname> <given-names>Z. A.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Zhu</surname> <given-names>Z.</given-names></name> <name><surname>Wen</surname> <given-names>Z.</given-names></name> <name><surname>Zhao</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>LRLSHMDA: laplacian regularized least squares for human microbe-disease association prediction.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>7</volume>:<issue>7601</issue>. <pub-id pub-id-type="doi">10.1038/s41598-017-08127-2</pub-id> <pub-id pub-id-type="pmid">28790448</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>C.</given-names></name> <name><surname>Gao</surname> <given-names>R.</given-names></name> <name><surname>Zhang</surname> <given-names>D.</given-names></name> <name><surname>Han</surname> <given-names>S.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name></person-group> (<year>2018</year>). <article-title>PRWHMDA: human microbe-disease association prediction by random walk on the heterogeneous network with PSO.</article-title> <source><italic>Int. J. Biol. Sci.</italic></source> <volume>14</volume> <fpage>849</fpage>&#x2013;<lpage>857</lpage>. <pub-id pub-id-type="doi">10.7150/ijbs.24539</pub-id> <pub-id pub-id-type="pmid">29989079</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xue</surname> <given-names>Q.</given-names></name> <name><surname>Sun</surname> <given-names>K.</given-names></name> <name><surname>Deng</surname> <given-names>H. J.</given-names></name> <name><surname>Lei</surname> <given-names>S. T.</given-names></name> <name><surname>Dong</surname> <given-names>J. Q.</given-names></name> <name><surname>Li</surname> <given-names>G. X.</given-names></name></person-group> (<year>2014</year>). <article-title>MicroRNA-338-3p inhibits colorectal carcinoma cell invasion and migration by targeting smoothened.</article-title> <source><italic>Jpn. J. Clin. Oncol.</italic></source> <volume>44</volume> <fpage>13</fpage>&#x2013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.1093/jjco/hyt181</pub-id> <pub-id pub-id-type="pmid">24277750</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>You</surname> <given-names>Z. H.</given-names></name> <name><surname>Huang</surname> <given-names>Z. A.</given-names></name> <name><surname>Zhu</surname> <given-names>Z.</given-names></name> <name><surname>Yan</surname> <given-names>G. Y.</given-names></name> <name><surname>Li</surname> <given-names>Z. W.</given-names></name> <name><surname>Wen</surname> <given-names>Z.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>PBMDA: a novel and effective path-based computational model for miRNA-disease association prediction.</article-title> <source><italic>PLoS Comput. Biol.</italic></source> <volume>13</volume>:<issue>e1005455</issue>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1005455</pub-id> <pub-id pub-id-type="pmid">28339468</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname> <given-names>Q.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Hu</surname> <given-names>H.</given-names></name> <name><surname>Ren</surname> <given-names>G.</given-names></name> <name><surname>Zhang</surname> <given-names>W.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name></person-group> (<year>2018</year>). <article-title>IRWNRLPI: integrating random walk and neighborhood regularized logistic matrix factorization for lncRNA-protein interaction prediction.</article-title> <source><italic>Front. Genet.</italic></source> <volume>9</volume>:<issue>239</issue>. <pub-id pub-id-type="doi">10.3389/fgene.2018.00239</pub-id> <pub-id pub-id-type="pmid">30023002</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>X. Z.</given-names></name> <name><surname>Menche</surname> <given-names>J.</given-names></name> <name><surname>Barab&#x00E1;si</surname> <given-names>A.</given-names></name> <name><surname>Sharma</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>Human symptoms&#x2013;disease network.</article-title> <source><italic>Nat. Commun.</italic></source> <volume>5</volume>:<issue>4212</issue>. <pub-id pub-id-type="doi">10.1038/ncomms5212</pub-id> <pub-id pub-id-type="pmid">24967666</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn01"><label>1</label><p><ext-link ext-link-type="uri" xlink:href="http://www.cuilab.cn/hmdad">http://www.cuilab.cn/hmdad</ext-link></p></fn>
<fn id="fn02"><label>2</label><p><ext-link ext-link-type="uri" xlink:href="https://github.com/githubcode007/ABHMDA">https://github.com/githubcode007/ABHMDA</ext-link></p></fn>
</fn-group>
</back>
</article>