<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="review-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Pharmacol.</journal-id>
<journal-title>Frontiers in Pharmacology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Pharmacol.</abbrev-journal-title>
<issn pub-type="epub">1663-9812</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fphar.2018.00074</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Pharmacology</subject>
<subj-group>
<subject>Mini Review</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Transfer and Multi-task Learning in QSAR Modeling: Advances and Challenges</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Sim&#x00F5;es</surname> <given-names>Rodolfo S.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Maltarollo</surname> <given-names>Vinicius G.</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Oliveira</surname> <given-names>Patricia R.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Honorio</surname> <given-names>Kathia M.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/415877/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>School of Arts, Sciences and Humanities, University of S&#x00E3;o Paulo</institution>, <addr-line>S&#x00E3;o Paulo</addr-line>, <country>Brazil</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Pharmaceutical Products, Faculty of Pharmacy, Federal University of Minas Gerais</institution>, <addr-line>Belo Horizonte</addr-line>, <country>Brazil</country></aff>
<aff id="aff3"><sup>3</sup><institution>Center for Natural and Human Sciences, Federal University of ABC</institution>, <addr-line>Santo Andr&#x00E9;</addr-line>, <country>Brazil</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: <italic>Leonardo G. Ferreira, University of S&#x00E3;o Paulo, Brazil</italic></p></fn>
<fn fn-type="edited-by"><p>Reviewed by: <italic>Simone Brogi, University of Siena, Italy; Andrew James Greenshaw, University of Alberta, Canada</italic></p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x002A;Correspondence: <italic>Kathia M. Honorio, <email>kmhonorio@usp.br</email></italic></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Experimental Pharmacology and Drug Discovery, a section of the journal Frontiers in Pharmacology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>06</day>
<month>02</month>
<year>2018</year>
</pub-date>
<pub-date pub-type="collection">
<year>2018</year>
</pub-date>
<volume>9</volume>
<elocation-id>74</elocation-id>
<history>
<date date-type="received">
<day>22</day>
<month>09</month>
<year>2017</year>
</date>
<date date-type="accepted">
<day>22</day>
<month>01</month>
<year>2018</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2018 Sim&#x00F5;es, Maltarollo, Oliveira and Honorio.</copyright-statement>
<copyright-year>2018</copyright-year>
<copyright-holder>Sim&#x00F5;es, Maltarollo, Oliveira and Honorio</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Medicinal chemistry projects involve some steps aiming to develop a new drug, such as the analysis of biological targets related to a given disease, the discovery and the development of drug candidates for these targets, performing parallel biological tests to validate the drug effectiveness and side effects. Approaches as quantitative study of activity-structure relationships (QSAR) involve the construction of predictive models that relate a set of descriptors of a chemical compound series and its biological activities with respect to one or more targets in the human body. Datasets used to perform QSAR analyses are generally characterized by a small number of samples and this makes them more complex to build accurate predictive models. In this context, transfer and multi-task learning techniques are very suitable since they take information from other QSAR models to the same biological target, reducing efforts and costs for generating new chemical compounds. Therefore, this review will present the main features of transfer and multi-task learning studies, as well as some applications and its potentiality in drug design projects.</p>
</abstract>
<kwd-group>
<kwd>drug design</kwd>
<kwd>medicinal chemistry</kwd>
<kwd>QSAR</kwd>
<kwd>machine learning</kwd>
<kwd>transfer learning</kwd>
<kwd>multi-task learning</kwd>
</kwd-group>
<contract-sponsor id="cn001">Funda&#x00E7;&#x00E3;o de Amparo &#x00E0; Pesquisa do Estado de S&#x00E3;o Paulo<named-content content-type="fundref-id">10.13039/501100001807</named-content></contract-sponsor>
<contract-sponsor id="cn002">Conselho Nacional de Desenvolvimento Cient&#x00ED;fico e Tecnol&#x00F3;gico<named-content content-type="fundref-id">10.13039/501100003593</named-content></contract-sponsor>
<contract-sponsor id="cn003">Coordena&#x00E7;&#x00E3;o de Aperfei&#x00E7;oamento de Pessoal de N&#x00ED;vel Superior<named-content content-type="fundref-id">10.13039/501100002322</named-content></contract-sponsor>
<contract-sponsor id="cn004">International Business Machines Corporation<named-content content-type="fundref-id">10.13039/100004316</named-content></contract-sponsor>
<counts>
<fig-count count="3"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="67"/>
<page-count count="7"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec><title>Introduction</title>
<p>The drug design process, since the discovery/identification of bioactive compounds until the approval of its clinical use by a regulatory agency, is very complex and demands time and financial support (<xref ref-type="bibr" rid="B57">Tufts Center for the Study of Drug Development [CSDD], 2014</xref>). There are several well-known bottlenecks in this process, such as finding out a suitable and validated molecular target, designing and/or discovering of a lead compound, pharmacokinetic and toxicity optimization, besides commercial reasons, efficacy and clinical safety (<xref ref-type="bibr" rid="B28">Khanna, 2012</xref>; <xref ref-type="bibr" rid="B37">Medina-Franco et al., 2013</xref>). In this scenario, the use of computational techniques in drug discovery is rapidly increasing.</p>
<p>Computer-aided drug design (CADD) techniques are broadly employed in order to reduce costs and time involved in drug design. Among the important CADD techniques, molecular docking, similarity search and QSAR studies could be highlighted. Molecular docking and virtual screening are considered structure-based drug design (SBDD) strategies since it requires 3D structure of a molecular target and consists of predicting a binding mode of molecules and its binding energy (<xref ref-type="bibr" rid="B60">Walters et al., 1998</xref>; <xref ref-type="bibr" rid="B53">Shoichet, 2004</xref>; <xref ref-type="bibr" rid="B3">Andricopulo et al., 2008</xref>). As the docking simulations consider both structures (ligands and targets), its calculations are more computationally expensive. Considering these aspects, similarity searches and pharmacophore modeling are alternatives to faster calculations (<xref ref-type="bibr" rid="B7">Brogi et al., 2009</xref>; <xref ref-type="bibr" rid="B56">Tresadern et al., 2009</xref>) and are defined as ligand-based drug design (LBDD) strategies since they do not require the biological target structure (<xref ref-type="bibr" rid="B58">Turki et al., 2017</xref>). <bold>Figure <xref ref-type="fig" rid="F1">1</xref></bold> illustrates the main steps in a drug design process, including the use of computational tools.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Main steps involved in drug design, highlighting the use of computational approaches.</p></caption>
<graphic xlink:href="fphar-09-00074-g001.tif"/>
</fig>
<p>Another LBDD strategy is known as quantitative structure-activity relationships (QSAR) and it has been widely employed in drug design, mainly aiming to predict the biological activity of a compound set against a specific target to optimize the binding affinity (<xref ref-type="bibr" rid="B19">Du et al., 2008</xref>; <xref ref-type="bibr" rid="B25">Gertrudes et al., 2012</xref>). QSAR models provide accurate predictions of measured endpoints instead of an independent ranking of biological activity. These quantitative approaches have also been used in other tasks, such as optimization of pharmacokinetics and toxicity profile (<xref ref-type="bibr" rid="B34">Maltarollo et al., 2015</xref>; <xref ref-type="bibr" rid="B20">Egeghy et al., 2016</xref>; <xref ref-type="bibr" rid="B11">Chemi et al., 2017</xref>) and virtual screening (<xref ref-type="bibr" rid="B8">Brogi et al., 2013</xref>; <xref ref-type="bibr" rid="B38">Melo-Filho et al., 2016</xref>; <xref ref-type="bibr" rid="B41">Neves et al., 2016</xref>; <xref ref-type="bibr" rid="B63">Zaccagnini et al., 2017</xref>).</p>
<p>Several important QSAR studies can be found in literature, which include the description of successful computational methods and algorithms (<xref ref-type="bibr" rid="B54">Sliwoski et al., 2014</xref>; <xref ref-type="bibr" rid="B46">Raies and Bajic, 2016</xref>), validation techniques (<xref ref-type="bibr" rid="B27">Gramatica and Sangion, 2016</xref>), applications (<xref ref-type="bibr" rid="B13">Cherkasov et al., 2014</xref>; <xref ref-type="bibr" rid="B21">Fang and Xiao, 2016</xref>) as well-challenges and how those have been addressed (<xref ref-type="bibr" rid="B14">Cronin and Schultz, 2003</xref>; <xref ref-type="bibr" rid="B4">Arthur, 2008</xref>; <xref ref-type="bibr" rid="B18">Dearden et al., 2009</xref>; <xref ref-type="bibr" rid="B52">Scior et al., 2009</xref>; <xref ref-type="bibr" rid="B61">Wang et al., 2015</xref>; <xref ref-type="bibr" rid="B45">Ponzoni et al., 2017</xref>).</p>
<p>In many recent studies, machine learning (ML) methods have been largely applied to QSAR analyses. This growth has been mainly motivated by the increasing availability of data in public repositories, the use of numerous and diverse chemical descriptors and the proposal of accurate predictive algorithms, such as support vector machines (SVMs) and artificial neural networks (<xref ref-type="bibr" rid="B25">Gertrudes et al., 2012</xref>; <xref ref-type="bibr" rid="B35">Maltarollo et al., 2013</xref>; <xref ref-type="bibr" rid="B40">Mitchell, 2014</xref>; <xref ref-type="bibr" rid="B31">Lima et al., 2016</xref>). A common application of ML techniques in CADD refers to forecast new compound class labels (e.g., &#x201C;active&#x201D; <italic>versus</italic> &#x201C;inactive&#x201D;) using models previously derived from available training sets (<xref ref-type="bibr" rid="B29">Lavecchia, 2015</xref>). In such specific situation, ML techniques are said to perform a classification learning task. In addition, other sort of learning tasks can also be considered in CADD, such as clustering and ranking (<xref ref-type="bibr" rid="B1">Agarwal et al., 2010</xref>).</p>
<p>Despite of the widespread use of ML methods in QSAR modeling, the success of such approaches critically depends on the availability of a great amount of data, which remains challenging in drug discovery. This problem is strongly related to issues involving the quality of public data sources, including imprecise representation of chemical structures and inaccurate activity information (<xref ref-type="bibr" rid="B67">Zhao et al., 2017</xref>). Furthermore, the nature of different experimental protocols can usually lead to data belonging to different probability distributions, which makes the use of traditional ML techniques impracticable.</p>
<p>The data sets available in public repositories are usually obtained from single structure-activity relationship (SAR) campaigns. This explains the several particular and linear sets of compounds that are commonly used to generate only specialized QSAR models. In most of cases, biological activities of two datasets are measured under different experimental conditions, making the link among chemical spaces difficult to be analyzed (<xref ref-type="bibr" rid="B48">Richter and Ecker, 2015</xref>). Furthermore, a large chemical space has activity cliffs naturally: regions in a structure/activity surface where there is a discontinuous SAR (<xref ref-type="bibr" rid="B15">Cruz-Monteagudo et al., 2014</xref>).</p>
<p>In 2014, a review on QSAR (<xref ref-type="bibr" rid="B13">Cherkasov et al., 2014</xref>) stated that the transferability of QSAR models is one of the challenges in QSAR modeling, since the traditional approaches have been typically designed for each target property individually. Aiming to take advantage of diverse but related available experimental data, transfer and multi-task learning techniques have been recently developed. The novelty behind these approaches is related to their ability to exploit knowledge from other related tasks to improve the learning performance, especially when a small data set is available for training.</p>
</sec>
<sec><title>Transfer and Multi-Task Learning</title>
<p>For QSAR purposes, the data space under analysis is characterized by biological and chemical properties. In such scenario, changes in the distribution of data force the model to be rebuilt, implying to collect new training data. However, in many real-world applications, it is expensive or impossible to recollect data required to reconstruct these models. In such situations, transfer learning (or knowledge transfer) among related domains would be desirable (<xref ref-type="bibr" rid="B42">Pan and Yang, 2010</xref>).</p>
<p>Transfer learning can be defined as the ability of a system to recognize and apply the knowledge learned in previous (source) tasks for the solution of new (target) problems. The development of such approach was motivated by the fact that one can apply the knowledge acquired previously to solve new problems more quickly and with better solutions. The goal here relies on extracting the knowledge obtained by a model from one or more source tasks and to apply it to a target task. However, one of the premises for using transfer learning technique is that the source and the destination domains must be related. In this sense, <xref ref-type="bibr" rid="B55">Tan et al. (2015)</xref> suggest that such relationship can be expressed by instances (<xref ref-type="bibr" rid="B5">Bickel et al., 2009</xref>) or characteristics (<xref ref-type="bibr" rid="B51">Satpal and Sarawagi, 2007</xref>). If no direct relationship is found, the forced transfer will not work, resulting in no improvement or even degenerating the performance in the target domain (<xref ref-type="bibr" rid="B22">Fitzgerald and Thomaz, 2015</xref>). Multi-task learning is closely related to knowledge transfer, but they have also a clear distinction. In multi-task approaches, a number of tasks are learned simultaneously, without involving designated source and target tasks. <bold>Figure <xref ref-type="fig" rid="F2">2</xref></bold> illustrates the overall schemes for transfer and multi-task learning.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>General framework used to plan a study using <bold>(A)</bold> transfer learning techniques and <bold>(B)</bold> multi-task learning.</p></caption>
<graphic xlink:href="fphar-09-00074-g002.tif"/>
</fig>
<p>The methods used for transfer learning can be summarized into four categories, depending on which aspect of knowledge will be transferred, i.e., &#x201C;what to transfer&#x201D; (<xref ref-type="bibr" rid="B42">Pan and Yang, 2010</xref>). The first category refers to instance-based transfer learning, which assumes that some data from the source set can be selected for training in the target set by re-weighting. Importance sampling and instance reweighting are the two most commonly techniques used (<xref ref-type="bibr" rid="B16">Dai et al., 2007</xref>). The second category refers to transfer learning methods by feature representation, which focuses on encoding the structural information carried by molecules into a numerical representation that can be effectively exploited by learning processes in other related problems. In this case, the intuitive idea consists in learning a suitable representation of characteristics for use in the target set, i.e., the transfer learning is coded in the representation of the new characteristics (<xref ref-type="bibr" rid="B47">Raina et al., 2007</xref>). The third category refers to the transfer learning techniques by parameters (<xref ref-type="bibr" rid="B30">Lawrence and Platt, 2004</xref>), in which it is assumed that the source and the target tasks share some parameters or prior distributions of the hyper-parameters of the respective QSAR models. In this case, knowledge can be transferred between the tasks by discovering these shared parameters or priors. The last category consists of methods that deal with the problem of relational knowledge transfer, which refers to transfer learning in related domains (<xref ref-type="bibr" rid="B39">Mihalkova et al., 2007</xref>). In this condition, the knowledge can be transferred by mapping the data from the source set to the destination one. The statistical methods of relational learning are the most applied in this case (<xref ref-type="bibr" rid="B39">Mihalkova et al., 2007</xref>; <xref ref-type="bibr" rid="B17">Davis and Domingos, 2009</xref>). A scheme illustrating how the transfer learning approaches can be applied to obtain predictive models is presented in <bold>Figure <xref ref-type="fig" rid="F3">3</xref></bold>.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Schemes used for applying transfer learning approaches.</p></caption>
<graphic xlink:href="fphar-09-00074-g003.tif"/>
</fig>
<p>To apply transfer learning techniques, it is assumed that two sets of related data are available and the knowledge will be transferred from the dataset with the largest volume to the set with the least amount of available data. However, this assumption in the chemical datasets is not always sufficient, requiring the opinion of an expert to define the source datasets. To overcome this limitation, <xref ref-type="bibr" rid="B26">Girschick et al. (2012)</xref> proposed an approach to select a source dataset in a repository containing target-related sets by following a data-driven methodology. The main idea behind such proposal is based on calculating a measure for the activity overlap between the target set and each related set available in PubChem database. As result, a ranking of all related sets according to their similarity to the target set is obtained. In order to find the similarity values, Tanimoto coefficient is calculated using the categorization of the chemical compounds (active/inactive) in each dataset. Therefore, the objective is to select the set that has the distribution of instances (compounds) closest to the distribution (number of instances categorized as active and inactive) of the target set.</p>
<p>One can find out many situations where transfer learning adds benefits, for example, molecules could be classified as active or inactive according to a biological data for a defined endpoint (e.g., IC<sub>50</sub> values). For this classification task, it is initially necessary to collect several experimentally tested samples and, next, to train a classifier for the collected data with their respective labels. Since the probability distribution of the comments on other endpoints can be very different, a new classifier has to be trained to each dataset in order to maintain a satisfactory performance. To reduce this effort, it would be desirable to use the knowledge from a classification model that is already trained on some related endpoints to improve the classification performance of other tasks with small samples or datasets (<xref ref-type="bibr" rid="B58">Turki et al., 2017</xref>). <bold>Table <xref ref-type="table" rid="T1">1</xref></bold> illustrates examples of transfer learning in drug design.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Examples of potential applications of transfer learning methods in drug design.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">Transfer learning approach</th>
<th valign="top" align="left">Concept</th>
<th valign="top" align="left">Application in drug design</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Instance-based</td>
<td valign="top" align="left">Uses the same ML technique for modeling but apply some changes to the parameters of the target model.</td>
<td valign="top" align="left">Source and target datasets have the same endpoint (e.g., same molecular target) but the training data can be colleted at different experimental conditions.</td>
</tr>
<tr>
<td valign="top" align="left">Feature representation</td>
<td valign="top" align="left">Based on some mathematical transformations of data.</td>
<td valign="top" align="left">Source and target datasets have different but related endpoints, e.g., same classes of molecular target (kinases, nuclear receptors, proteases, etc.).</td>
</tr>
<tr>
<td valign="top" align="left">Parameters</td>
<td valign="top" align="left">It is assumed that both datasets share some properties.</td>
<td valign="top" align="left">Source and target datasets have the same or related endpoints.</td>
</tr>
<tr>
<td valign="top" align="left">Relational knowledge transfer</td>
<td valign="top" align="left">Based on technique for mapping the data in the target domain.</td>
<td valign="top" align="left">The endpoints of the source and target datasets are different but the domains (the independent variable in QSAR models) are related; e.g., cellular permeability and log P.</td>
</tr>
<tr>
<td valign="top" align="left"></td></tr>
</tbody>
</table>
</table-wrap>
<p>In general, transfer learning approaches have shown to be promising for combining the knowledge previously obtained in related tasks into a single predictive model, whether for classification, regression, or grouping (<xref ref-type="bibr" rid="B42">Pan and Yang, 2010</xref>). In particular, researches in medicinal chemistry with focus in drug discovery have been benefited with the use of transfer learning, as can be seen in previous studies (<xref ref-type="bibr" rid="B26">Girschick et al., 2012</xref>; <xref ref-type="bibr" rid="B49">Rosenbaum et al., 2013</xref>; <xref ref-type="bibr" rid="B50">Saha et al., 2016</xref>). Next, applications of transfer and multi-task learning in medicinal chemistry studies will be presented.</p>
</sec>
<sec><title>Some Applications of Transfer and Multi-Task Learning</title>
<p>Many machine learning methods are based on the assumption that similar drugs may share the same side effects, but measuring the similarity of these drugs is still a challenge. However, the use of data from various sources (similar drugs) provides important information for the analysis of side effects and should be integrated for obtaining a highly accurate prediction. <xref ref-type="bibr" rid="B66">Zhang et al. (2016)</xref> discussed the problem of predicting side effects caused by drugs through linear neighbor approaches and the integration of data from various sources. The authors argued that auxiliary data can bring additional and diverse information (such as drug substructures, drug targets, drug transporters, drug enzymes, drug pathways) that should be integrated to the side-effect prediction, aiming at improving its performance. Analyses on multi-label classification showed that the proposed transfer learning approaches achieved better performance than state-of-art-methods (<xref ref-type="bibr" rid="B43">Pauwels et al., 2011</xref>; <xref ref-type="bibr" rid="B32">Liu et al., 2012</xref>; <xref ref-type="bibr" rid="B12">Cheng et al., 2013</xref>) applied to benchmark datasets.</p>
<p>The task of relating chemical structure to biological activity in QSAR studies is usually based on the notion of chemical similarity to predict the molecular behavior of close compounds. So, techniques that provide similarity measures among chemical compounds are increasingly important (<xref ref-type="bibr" rid="B23">Floris et al., 2014</xref>). Lately, relevant solutions have been proposed, which comprise distance learning (<xref ref-type="bibr" rid="B6">Biehl et al., 2014</xref>) and inductive transfer (<xref ref-type="bibr" rid="B24">Garcke and Vanck, 2014</xref>) methods. Distance learning aims at learning an appropriate distance measure to reflect the underlying relationship between instances in the training set, while inductive transfer refers to the process of transferring knowledge learned from one task into another related task. <xref ref-type="bibr" rid="B26">Girschick et al. (2012)</xref> presented an adapted transfer approach, which combines distance learning and inductive transfer by learning the distances on a related task and then transferring them to the target learning task. Additionally, the authors developed a method for selecting a related task that can be used as source task for transfer learning. This technique consists in applying an activity overlap similarity measure to two datasets to find out a suitable source task. This approach was evaluated on five distinct datasets found in PubChem BioAssay (<xref ref-type="bibr" rid="B62">Wang et al., 2009</xref>) repository. The results showed that both proposals worked well for large and small amounts of training data.</p>
<p>The multi-task learning approach (<xref ref-type="bibr" rid="B10">Caruana, 1998</xref>) is considered to be closely related to transfer learning, since it attempts to learn multiple tasks simultaneously even when they are different. <xref ref-type="bibr" rid="B49">Rosenbaum et al. (2013)</xref> introduced two multi-task methods and evaluated the performance of such approaches by inferring multi-target QSAR models on a subset of human kinome. The authors assumed that the taxonomical relationship of the kinase targets should correspond to the relatedness of the QSAR problems on these targets. The multi-task techniques were compared to SVMs models independently trained for each target and an SVM model that assumed all targets to be identical. The results demonstrated that the multi-target learning can over perform baseline (pure SVM) methods if knowledge can be transferred from a target with a lot of data to a similar target with little domain knowledge.</p>
<p><xref ref-type="bibr" rid="B59">Varnek et al. (2009)</xref> applied different inductive transfer and multi-task learning approaches to model tissue-air partition coefficients. The authors found that these techniques improved the prediction accuracy of the obtained models when compared to single task learning. Finally, this study indicated that inductive transfer learning is very suitable when single modeling is unable to generate reliable QSAR models using diverse data sets and with small amount of samples.</p>
<p><xref ref-type="bibr" rid="B9">Brown et al. (2014)</xref> presented some challenges involved with chemogenomic data, since high-throughput assays give us a large number of information from multi-ligand and multi-target data (<xref ref-type="bibr" rid="B44">Pereira and Williams, 2007</xref>). So, the authors assert that computational techniques, in particular inductive transfer and explicit learning, can help to construct more robust models when compared to target-specific (classical) QSAR ones.</p>
<p>The study of <xref ref-type="bibr" rid="B64">Zhang et al. (2013)</xref> discussed the use of single- and multi-task learning to construct QSAR models for predicting the binding affinity of a compound database by estrogen receptors (ERs), which are involved with endocrine disruption by chemicals and the construction of predictive models can contribute to design safer substances. The authors concluded that multi-task learning provided better results for a small dataset (ER&#x03B2; ligands) than single learning, indicating that this approach can be considered as a good tool to understand the action mechanism of endocrine disruption and to predict the ER activity of unknown compounds as endocrine disrupting chemicals.</p>
<p>Another interesting application of multi-task techniques was performed by <xref ref-type="bibr" rid="B33">Liu et al. (2011)</xref>, which used multi-task learning to construct multi-target QSAR models employing three human immunodeficiency virus (HIV) inhibitor datasets together with other six subsets containing two hepatitis C virus (HCV) inhibitors. The main conclusions of this study included the fact that the integration of all databases (HIV and HCV) improved the rate of the discovery of lead HIV-HCV inhibitors, helping the design of new co-inhibitors for these important infections. Other achievement is related to the successful use (considering efficiency in convergence speed and learning accuracy) of a multi-task learning technique to construct multi-target QSAR models.</p>
</sec>
<sec><title>Discussion</title>
<p>The main issue of transfer and multi-task learning approaches is to employ the knowledge generated (e.g., features, subset of variables, weights of equations) from available ML models and other datasets in the construction of models for related endpoints. In this sense, it is possible to use different datasets with the same biological activities but measured at different experimental conditions. Other important consequence of applying transfer and multi-task learning is the decrease on computational costs related to the faster convergence obtained by using the knowledge derived from a model previously built from a related endpoint.</p>
<p>From a literature review taking into to account the transfer learning applications on medicinal chemistry, one can note that there is still a great potentiality to be explored in this sense. Other emerging approaches as deep learning methods (<xref ref-type="bibr" rid="B65">Zhang et al., 2017</xref>), which basically use complex neural networks architectures, also have promising applications in the era of big data.</p>
<p>Among the main challenges on applying transfer and multi-task learning methods is that they require an artificial intelligence expert to code them since there are no chemical and/or pharmaceutical packages with a graphical user interface. Depend on the source data and on the learning method, transfer and multi-task learning could be also considered as &#x201C;black-boxes,&#x201D; making the interpretability of QSAR models difficult. And, finally, the transfer of knowledge could be inappropriately employed if the assumption of &#x201C;equivalent&#x201D; endpoints is not valid.</p>
</sec>
<sec><title>Conclusion</title>
<p>Nowadays one can observe increasing number of applications of transfer and multi-task learning in medicinal studies. There are also current challenges in the QSAR field that comprise the integration of different datasets (even from different experiments) aiming the same or similar endpoints (<xref ref-type="bibr" rid="B36">Maltarollo et al., 2017</xref>) and the development of universal QSAR models using very large datasets (<xref ref-type="bibr" rid="B2">Alves et al., 2017</xref>). Therefore, good examples of dataset that could be benefited from transfer and multi-task learning are: (i) compounds with same endpoint measured under different experimental conditions; (ii) antimicrobial activities against genetically similar microorganisms; (iii) compounds with the same mechanism of action in homologous targets and high degree of similarity in the binding pocket; (iv) non-specific endpoints as toxicity against a cell line or permeability rates determined by different models. In this complex scenario, transfer and multi-task learning techniques can be considered powerful tools for drug design.</p>
</sec>
<sec><title>Author Contributions</title>
<p>RS, KH, VM, and PO designed this article. All authors wrote and revised the manuscript. Also, all authors read and approved the final version of the manuscript.</p>
</sec>
<sec><title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The handling Editor declared a shared affiliation, though no other collaboration, with the authors.</p>
</sec>
</body>
<back>
<ack>
<p>The authors would like to thank FAPESP, CNPq, CAPES, and IBM for funding.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Agarwal</surname> <given-names>S.</given-names></name> <name><surname>Dugar</surname> <given-names>D.</given-names></name> <name><surname>Sengupta</surname> <given-names>S.</given-names></name></person-group> (<year>2010</year>). <article-title>Ranking chemical structures for drug discovery: a new machine learning approach.</article-title> <source><italic>J. Chem. Inf. Model.</italic></source> <volume>50</volume> <fpage>716</fpage>&#x2013;<lpage>731</lpage>. <pub-id pub-id-type="doi">10.1021/ci9003865</pub-id> <pub-id pub-id-type="pmid">20387860</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alves</surname> <given-names>V. M.</given-names></name> <name><surname>Muratov</surname> <given-names>E. N.</given-names></name> <name><surname>Zakharov</surname> <given-names>A.</given-names></name> <name><surname>Muratov</surname> <given-names>N. N.</given-names></name> <name><surname>Andrade</surname> <given-names>C. H.</given-names></name> <name><surname>Tropsha</surname> <given-names>A.</given-names></name></person-group> (<year>2017</year>). <article-title>Chemical toxicity prediction for major classes of industrial chemicals: Is it possible to develop universal models covering cosmetics, drugs, and pesticides?</article-title> <source><italic>Food Chem. Toxicol.</italic></source> <pub-id pub-id-type="doi">10.1016/j.fct.2017.04.008</pub-id> <comment>[Epub ahead of print]</comment>. <pub-id pub-id-type="pmid">28412406</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Andricopulo</surname> <given-names>A. D.</given-names></name> <name><surname>Guido</surname> <given-names>R. V. C.</given-names></name> <name><surname>Oliva</surname> <given-names>G.</given-names></name></person-group> (<year>2008</year>). <article-title>Virtual screening and its integration with modern drug design technologies.</article-title> <source><italic>Curr. Med. Chem.</italic></source> <volume>15</volume> <fpage>37</fpage>&#x2013;<lpage>46</lpage>. <pub-id pub-id-type="doi">10.2174/092986708783330683</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arthur</surname> <given-names>M. D.</given-names></name></person-group> (<year>2008</year>). <article-title>QSAR: dead or alive?</article-title> <source><italic>J. Comput. Aided Mol. Des.</italic></source> <volume>22</volume> <fpage>81</fpage>&#x2013;<lpage>89</lpage>. <pub-id pub-id-type="doi">10.1007/s10822-007-9162-7</pub-id> <pub-id pub-id-type="pmid">18189157</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bickel</surname> <given-names>S.</given-names></name> <name><surname>Sawade</surname> <given-names>C.</given-names></name> <name><surname>Scheffer</surname> <given-names>T.</given-names></name></person-group> (<year>2009</year>). <article-title>Transfer learning by distribution matching for targeted advertising.</article-title> <source><italic>Adv. Neural Inf. Proces. Syst.</italic></source> <volume>21</volume> <fpage>145</fpage>&#x2013;<lpage>152</lpage>.</citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Biehl</surname> <given-names>M.</given-names></name> <name><surname>Hammer</surname> <given-names>B.</given-names></name> <name><surname>Villmann</surname> <given-names>T.</given-names></name></person-group> (<year>2014</year>). <article-title>&#x201C;Distance measures for prototype based classification,&#x201D; in</article-title> <source><italic>Brain-Inspired Computing. BrainComp 2013. Lecture Notes in Computer Science</italic></source> <role>eds</role> <person-group person-group-type="editor"><name><surname>Grandinetti</surname> <given-names>L.</given-names></name> <name><surname>Lippert</surname> <given-names>T.</given-names></name> <name><surname>Petkov</surname> <given-names>N.</given-names></name></person-group> (<publisher-loc>Cham</publisher-loc>: <publisher-name>Springer</publisher-name>) <pub-id pub-id-type="doi">10.1007/978-3-319-12084-3_9</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brogi</surname> <given-names>S.</given-names></name> <name><surname>Kladi</surname> <given-names>M.</given-names></name> <name><surname>Vagias</surname> <given-names>C.</given-names></name> <name><surname>Papazafiri</surname> <given-names>P.</given-names></name> <name><surname>Roussis</surname> <given-names>V.</given-names></name> <name><surname>Tafi</surname> <given-names>A.</given-names></name></person-group> (<year>2009</year>). <article-title>Pharmacophore modeling for qualitative prediction of antiestrogenic activity.</article-title> <source><italic>J. Chem. Inf. Model.</italic></source> <volume>49</volume> <fpage>2489</fpage>&#x2013;<lpage>2497</lpage>. <pub-id pub-id-type="doi">10.1021/ci900254b</pub-id> <pub-id pub-id-type="pmid">19877675</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brogi</surname> <given-names>S.</given-names></name> <name><surname>Papazafiri</surname> <given-names>P.</given-names></name> <name><surname>Roussis</surname> <given-names>V.</given-names></name> <name><surname>Tafi</surname> <given-names>A.</given-names></name></person-group> (<year>2013</year>). <article-title>3D-QSAR using pharmacophore-based alignment and virtual screening for discovery of novel MCF-7 cell line inhibitors.</article-title> <source><italic>Eur. J. Med. Chem.</italic></source> <volume>67</volume> <fpage>344</fpage>&#x2013;<lpage>351</lpage>. <pub-id pub-id-type="doi">10.1016/j.ejmech.2013.06.048</pub-id> <pub-id pub-id-type="pmid">23880359</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>J. B.</given-names></name> <name><surname>Okuno</surname> <given-names>Y.</given-names></name> <name><surname>Marcou</surname> <given-names>G.</given-names></name> <name><surname>Varnek</surname> <given-names>A.</given-names></name> <name><surname>Horvath</surname> <given-names>D.</given-names></name></person-group> (<year>2014</year>). <article-title>Computational chemogenomics: Is it more than inductive transfer?</article-title> <source><italic>J. Comput. Aided Mol. Des.</italic></source> <volume>28</volume> <fpage>597</fpage>&#x2013;<lpage>618</lpage>. <pub-id pub-id-type="doi">10.1007/s10822-014-9743-1</pub-id> <pub-id pub-id-type="pmid">24771144</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Caruana</surname> <given-names>R.</given-names></name></person-group> (<year>1998</year>). <article-title>&#x201C;Multitask learning,&#x201D; in</article-title> <source><italic>Learning to Learn</italic></source> <role>eds</role> <person-group person-group-type="editor"><name><surname>Thrun</surname> <given-names>S.</given-names></name> <name><surname>Pratt</surname> <given-names>L.</given-names></name></person-group> (<publisher-loc>Boston, MA</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>95</fpage>&#x2013;<lpage>133</lpage>. <pub-id pub-id-type="doi">10.1007/978-1-4615-5529-2_5</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chemi</surname> <given-names>G.</given-names></name> <name><surname>Gemma</surname> <given-names>S.</given-names></name> <name><surname>Campiani</surname> <given-names>G.</given-names></name> <name><surname>Brogi</surname> <given-names>S.</given-names></name> <name><surname>Butini</surname> <given-names>S.</given-names></name> <name><surname>Brindisi</surname> <given-names>M.</given-names></name></person-group> (<year>2017</year>). <article-title>Computational tool for fast <italic>in silico</italic> evaluation of <italic>hERG</italic> K<sup>+</sup> channel affinity.</article-title> <source><italic>Front. Chem.</italic></source> <volume>5</volume>:<issue>7</issue>. <pub-id pub-id-type="doi">10.3389/fchem.2017.00007</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cheng</surname> <given-names>F.</given-names></name> <name><surname>Li</surname> <given-names>W.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name> <name><surname>Zhou</surname> <given-names>Y.</given-names></name> <name><surname>Wu</surname> <given-names>Z.</given-names></name> <name><surname>Shen</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Adverse drug events: database construction and in silico prediction.</article-title> <source><italic>J. Chem. Inf. Model.</italic></source> <volume>53</volume> <fpage>744</fpage>&#x2013;<lpage>752</lpage>. <pub-id pub-id-type="doi">10.1021/ci4000079</pub-id> <pub-id pub-id-type="pmid">23521697</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cherkasov</surname> <given-names>A.</given-names></name> <name><surname>Muratov</surname> <given-names>E. N.</given-names></name> <name><surname>Fourches</surname> <given-names>D.</given-names></name> <name><surname>Varnek</surname> <given-names>A.</given-names></name> <name><surname>Baskin</surname> <given-names>I. I.</given-names></name> <name><surname>Cronin</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>QSAR modeling: where have you been? Where are you going to?</article-title> <source><italic>J. Med. Chem.</italic></source> <volume>57</volume> <fpage>4977</fpage>&#x2013;<lpage>5010</lpage>. <pub-id pub-id-type="doi">10.1021/jm4004285</pub-id> <pub-id pub-id-type="pmid">24351051</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cronin</surname> <given-names>M. T. D.</given-names></name> <name><surname>Schultz</surname> <given-names>T. W.</given-names></name></person-group> (<year>2003</year>). <article-title>Pitfalls in QSAR.</article-title> <source><italic>J. Mol. Struct. Theochem</italic></source> <volume>622</volume> <fpage>39</fpage>&#x2013;<lpage>51</lpage>. <pub-id pub-id-type="doi">10.1016/S0166-1280(02)00616-4</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cruz-Monteagudo</surname> <given-names>M.</given-names></name> <name><surname>Medina-Franco</surname> <given-names>J. L.</given-names></name> <name><surname>Perez-Castillo</surname> <given-names>Y.</given-names></name> <name><surname>Nicolotti</surname> <given-names>O.</given-names></name> <name><surname>Cordeiro</surname> <given-names>M. N. D.</given-names></name> <name><surname>Borges</surname> <given-names>F.</given-names></name></person-group> (<year>2014</year>). <article-title>Activity cliffs in drug discovery: Dr Jekyll or Mr Hyde?</article-title> <source><italic>Drug Discov. Today</italic></source> <volume>19</volume> <fpage>1069</fpage>&#x2013;<lpage>1080</lpage>. <pub-id pub-id-type="doi">10.1016/j.drudis.2014.02.003</pub-id> <pub-id pub-id-type="pmid">24560935</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dai</surname> <given-names>W.</given-names></name> <name><surname>Yang</surname> <given-names>Q.</given-names></name> <name><surname>Yu</surname> <given-names>Y.</given-names></name></person-group> (<year>2007</year>). <article-title>&#x201C;Boosting for transfer learning,&#x201D; in</article-title> <source><italic>Proceedings of the 24th international conference on Machine learning</italic></source> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>ACM</publisher-name>) <fpage>193</fpage>&#x2013;<lpage>200</lpage>. <pub-id pub-id-type="doi">10.1145/1273496.1273521</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Davis</surname> <given-names>J.</given-names></name> <name><surname>Domingos</surname> <given-names>P.</given-names></name></person-group> (<year>2009</year>). &#x201C;<article-title>Deep transfer via second-order markov logic,&#x201D; in</article-title> <source><italic>Proceedings of the 26th Annual International Conference on Machine Learning</italic></source> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>ACM</publisher-name>) <fpage>217</fpage>&#x2013;<lpage>224</lpage>. <pub-id pub-id-type="doi">10.1145/1553374.1553402</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dearden</surname> <given-names>J. C.</given-names></name> <name><surname>Cronin</surname> <given-names>M. T. D.</given-names></name> <name><surname>Kaiser</surname> <given-names>K. L. E.</given-names></name></person-group> (<year>2009</year>). <article-title>How not to develop a quantitative structure-activity or structure-property relationship (QSAR/QSPR).</article-title> <source><italic>SAR QSAR Environ. Res.</italic></source> <volume>20</volume> <fpage>241</fpage>&#x2013;<lpage>266</lpage>. <pub-id pub-id-type="doi">10.1080/10629360902949567</pub-id> <pub-id pub-id-type="pmid">19544191</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Du</surname> <given-names>Q.-S.</given-names></name> <name><surname>Huang</surname> <given-names>R.-B.</given-names></name> <name><surname>Chou</surname> <given-names>K.-C.</given-names></name></person-group> (<year>2008</year>). <article-title>Recent advances in QSAR and their applications in predicting the activities of chemical molecules, peptides and proteins for drug design.</article-title> <source><italic>Curr. Protein Pept. Sci.</italic></source> <volume>9</volume> <fpage>248</fpage>&#x2013;<lpage>259</lpage>. <pub-id pub-id-type="doi">10.2174/138920308784534005</pub-id> <pub-id pub-id-type="pmid">18537680</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Egeghy</surname> <given-names>P. P.</given-names></name> <name><surname>Sheldon</surname> <given-names>L. S.</given-names></name> <name><surname>Isaacs</surname> <given-names>K. K.</given-names></name> <name><surname>&#x00D6;zkaynak</surname> <given-names>H.</given-names></name> <name><surname>Goldsmith</surname> <given-names>M. R.</given-names></name> <name><surname>Wambaugh</surname> <given-names>J. F.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Computational exposure science: an emerging discipline to support 21st-century risk assessment.</article-title> <source><italic>Environ. Health Perspect.</italic></source> <volume>124</volume> <fpage>697</fpage>&#x2013;<lpage>702</lpage>. <pub-id pub-id-type="doi">10.1289/ehp.1509748</pub-id> <pub-id pub-id-type="pmid">26545029</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fang</surname> <given-names>C.</given-names></name> <name><surname>Xiao</surname> <given-names>Z.</given-names></name></person-group> (<year>2016</year>). <article-title>Receptor-based 3D-QSAR in drug design: methods and applications in kinase studies.</article-title> <source><italic>Curr. Top. Med. Chem.</italic></source> <volume>16</volume> <fpage>1463</fpage>&#x2013;<lpage>1477</lpage>. <pub-id pub-id-type="doi">10.2174/1568026615666150915120943</pub-id> <pub-id pub-id-type="pmid">26369822</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fitzgerald</surname> <given-names>T.</given-names></name> <name><surname>Thomaz</surname> <given-names>A. L.</given-names></name></person-group> (<year>2015</year>). <article-title>&#x201C;Skill demonstration transfer for learning from demonstration,&#x201D; in</article-title> <source><italic>Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction Extended Abstracts</italic></source> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>ACM</publisher-name>) <fpage>187</fpage>&#x2013;<lpage>188</lpage>. <pub-id pub-id-type="doi">10.1145/2701973.2702728</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Floris</surname> <given-names>M.</given-names></name> <name><surname>Manganaro</surname> <given-names>A.</given-names></name> <name><surname>Nicolotti</surname> <given-names>O.</given-names></name> <name><surname>Medda</surname> <given-names>R.</given-names></name> <name><surname>Mangiatordi</surname> <given-names>G. F.</given-names></name> <name><surname>Benfenati</surname> <given-names>E.</given-names></name></person-group> (<year>2014</year>). <article-title>A generalizable definition of chemical similarity for read-across.</article-title> <source><italic>J. Cheminform.</italic></source> <volume>6</volume>:<issue>39</issue>. <pub-id pub-id-type="doi">10.1186/s13321-014-0039-1</pub-id> <pub-id pub-id-type="pmid">25383097</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Garcke</surname> <given-names>J.</given-names></name> <name><surname>Vanck</surname> <given-names>T.</given-names></name></person-group> (<year>2014</year>). <article-title>Importance weighted inductive transfer learning for regression.</article-title> <source><italic>Lect. Notes Comput. Sci.</italic></source> <volume>8274</volume> <fpage>466</fpage>&#x2013;<lpage>481</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-662-44848-9_30</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gertrudes</surname> <given-names>J. C.</given-names></name> <name><surname>Maltarollo</surname> <given-names>V. G.</given-names></name> <name><surname>Silva</surname> <given-names>R. A.</given-names></name> <name><surname>Oliveira</surname> <given-names>P. R.</given-names></name> <name><surname>Honorio</surname> <given-names>K. M.</given-names></name> <name><surname>Da Silva</surname> <given-names>A. B. F.</given-names></name></person-group> (<year>2012</year>). <article-title>Machine learning techniques and drug design.</article-title> <source><italic>Curr. Med. Chem.</italic></source> <volume>19</volume> <fpage>4289</fpage>&#x2013;<lpage>4297</lpage>. <pub-id pub-id-type="doi">10.2174/092986712802884259</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Girschick</surname> <given-names>T.</given-names></name> <name><surname>Ruckert</surname> <given-names>U.</given-names></name> <name><surname>Kramer</surname> <given-names>S.</given-names></name></person-group> (<year>2012</year>). <article-title>Adapted transfer of distance measures for quantitative structure-activity relationships and data-driven selection of source datasets.</article-title> <source><italic>Comput. J.</italic></source> <volume>56</volume> <fpage>274</fpage>&#x2013;<lpage>288</lpage>. <pub-id pub-id-type="doi">10.1093/comjnl/bxs092</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gramatica</surname> <given-names>P.</given-names></name> <name><surname>Sangion</surname> <given-names>A.</given-names></name></person-group> (<year>2016</year>). <article-title>A historical excursus on the statistical validation parameters for QSAR models: a clarification concerning metrics and terminology.</article-title> <source><italic>J. Chem. Inf. Model.</italic></source> <volume>56</volume> <fpage>1127</fpage>&#x2013;<lpage>1131</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jcim.6b00088</pub-id> <pub-id pub-id-type="pmid">27218604</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Khanna</surname> <given-names>I.</given-names></name></person-group> (<year>2012</year>). <article-title>Drug discovery in pharmaceutical industry: productivity challenges and trends.</article-title> <source><italic>Drug Discov. Today</italic></source> <volume>17</volume> <fpage>1088</fpage>&#x2013;<lpage>1102</lpage>. <pub-id pub-id-type="doi">10.1016/j.drudis.2012.05.007</pub-id> <pub-id pub-id-type="pmid">22627006</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lavecchia</surname> <given-names>A.</given-names></name></person-group> (<year>2015</year>). <article-title>Machine-learning approaches in drug discovery: methods and applications.</article-title> <source><italic>Drug Discov. Today</italic></source> <volume>20</volume> <fpage>318</fpage>&#x2013;<lpage>331</lpage>. <pub-id pub-id-type="doi">10.1016/j.drudis.2014.10.012</pub-id> <pub-id pub-id-type="pmid">25448759</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lawrence</surname> <given-names>N. D.</given-names></name> <name><surname>Platt</surname> <given-names>J. C.</given-names></name></person-group> (<year>2004</year>). <article-title>&#x201C;Learning to learn with the informative vector machine,&#x201D; in</article-title> <source><italic>Proceedings of the Twenty-rst International Conference on Machine Learning</italic></source> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>ACM</publisher-name>) <volume>65.</volume> <pub-id pub-id-type="doi">10.1145/1015330.1015382</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lima</surname> <given-names>A. N.</given-names></name> <name><surname>Philot</surname> <given-names>E. A.</given-names></name> <name><surname>Trossini</surname> <given-names>G. H. G.</given-names></name> <name><surname>Scott</surname> <given-names>L. P. B.</given-names></name> <name><surname>Maltarollo</surname> <given-names>V. G.</given-names></name> <name><surname>Honorio</surname> <given-names>K. M.</given-names></name></person-group> (<year>2016</year>). <article-title>Use of machine learning approaches for novel drug discovery.</article-title> <source><italic>Exp. Opin. Drug Discov.</italic></source> <volume>11</volume> <fpage>225</fpage>&#x2013;<lpage>239</lpage>. <pub-id pub-id-type="doi">10.1517/17460441.2016.1146250</pub-id> <pub-id pub-id-type="pmid">26814169</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>M.</given-names></name> <name><surname>Wu</surname> <given-names>Y.</given-names></name> <name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Sun</surname> <given-names>J.</given-names></name> <name><surname>Zhao</surname> <given-names>Z.</given-names></name> <name><surname>Chen</surname> <given-names>X. W.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Large-scale prediction of adverse drug reactions using chemical, biological, and phenotypic properties of drugs.</article-title> <source><italic>J. Am. Med. Inform. Assoc.</italic></source> <volume>19</volume> <fpage>e28</fpage>&#x2013;<lpage>e35</lpage>. <pub-id pub-id-type="doi">10.1136/amiajnl-2011-000699</pub-id> <pub-id pub-id-type="pmid">22718037</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>Q.</given-names></name> <name><surname>Zhou</surname> <given-names>H.</given-names></name> <name><surname>Liu</surname> <given-names>L.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Zhu</surname> <given-names>R.</given-names></name> <name><surname>Cao</surname> <given-names>Z.</given-names></name></person-group> (<year>2011</year>). <article-title>Multi-target QSAR modelling in the analysis and design of HIV-HCV co-inhibitors: an in-silico study.</article-title> <source><italic>BMC Bioinformatics</italic></source> <volume>12</volume>:<issue>294</issue>. <pub-id pub-id-type="doi">10.1186/1471-2105-12-294</pub-id> <pub-id pub-id-type="pmid">21774796</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maltarollo</surname> <given-names>V. G.</given-names></name> <name><surname>Gertrudes</surname> <given-names>J. C.</given-names></name> <name><surname>Oliveira</surname> <given-names>P. R.</given-names></name> <name><surname>Honorio</surname> <given-names>K. M.</given-names></name></person-group> (<year>2015</year>). <article-title>Applying machine learning techniques for ADME-Tox prediction: a review.</article-title> <source><italic>Expert Opin. Drug Metab. Toxicol.</italic></source> <volume>11</volume> <fpage>259</fpage>&#x2013;<lpage>271</lpage>. <pub-id pub-id-type="doi">10.1517/17425255.2015.980814</pub-id> <pub-id pub-id-type="pmid">25440524</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maltarollo</surname> <given-names>V. G.</given-names></name> <name><surname>Honorio</surname> <given-names>K. M.</given-names></name> <name><surname>da Silva</surname> <given-names>A. B. F.</given-names></name></person-group> (<year>2013</year>). &#x201C;<article-title>Applications of artificial neural networks in chemical problems,&#x201D; in</article-title> <source><italic>Artificial Neural Networks-Architectures and Applications</italic></source> <role>ed.</role> <person-group person-group-type="editor"><name><surname>Suzuki</surname> <given-names>K.</given-names></name></person-group> (<publisher-loc>Rijeka</publisher-loc>: <publisher-name>InTech</publisher-name>) <fpage>203</fpage>&#x2013;<lpage>223</lpage>. <pub-id pub-id-type="doi">10.5772/51275</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maltarollo</surname> <given-names>V. G.</given-names></name> <name><surname>Kronenberger</surname> <given-names>T.</given-names></name> <name><surname>Wrenger</surname> <given-names>C.</given-names></name> <name><surname>Honorio</surname> <given-names>K. M.</given-names></name></person-group> (<year>2017</year>). <article-title>Current trends in quantitative structure&#x2013;activity relationship validation and applications on drug discovery.</article-title> <source><italic>Future Sci. OA</italic></source> <volume>3</volume> <fpage>FSO214</fpage>&#x2013;<lpage>FSO216</lpage>. <pub-id pub-id-type="doi">10.4155/fsoa-2017-0052</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Medina-Franco</surname> <given-names>J. L.</given-names></name> <name><surname>Giulianotti</surname> <given-names>M. A.</given-names></name> <name><surname>Welmaker</surname> <given-names>G. S.</given-names></name> <name><surname>Houghten</surname> <given-names>R. A.</given-names></name></person-group> (<year>2013</year>). <article-title>Shifting from the single to the multitarget paradigm in drug discovery.</article-title> <source><italic>Drug Discov. Today</italic></source> <volume>18</volume> <fpage>495</fpage>&#x2013;<lpage>501</lpage>. <pub-id pub-id-type="doi">10.1016/j.drudis.2013.01.008</pub-id> <pub-id pub-id-type="pmid">23340113</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Melo-Filho</surname> <given-names>C. C.</given-names></name> <name><surname>Dantas</surname> <given-names>R. F.</given-names></name> <name><surname>Braga</surname> <given-names>R. C.</given-names></name> <name><surname>Neves</surname> <given-names>B. J.</given-names></name> <name><surname>Senger</surname> <given-names>M. R.</given-names></name> <name><surname>Valente</surname> <given-names>W. C.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>QSAR-driven discovery of novel chemical scaffolds active against <italic>Schistosoma mansoni</italic>.</article-title> <source><italic>J. Chem. Inf. Model.</italic></source> <volume>56</volume> <fpage>1357</fpage>&#x2013;<lpage>1372</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jcim.6b00055</pub-id> <pub-id pub-id-type="pmid">27253773</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mihalkova</surname> <given-names>L.</given-names></name> <name><surname>Huynh</surname> <given-names>T.</given-names></name> <name><surname>Mooney</surname> <given-names>R. J.</given-names></name></person-group> (<year>2007</year>). &#x201C;<article-title>Mapping and revising markov logic networks for transfer learning,&#x201D; in <italic>Proceedings of the.</italic> 22nd Association for the advancement of Artificial Intelligence (AAAI) Conference. Artificial Intelligence</article-title> (<publisher-loc>Vancouver, BC</publisher-loc>: <publisher-name>AAAI</publisher-name>) <fpage>608</fpage>&#x2013;<lpage>614</lpage>.</citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mitchell</surname> <given-names>J. B. O.</given-names></name></person-group> (<year>2014</year>). <article-title>Machine learning methods in chemoinformatics.</article-title> <source><italic>WIREs Comput. Mol. Sci.</italic></source> <volume>4</volume> <fpage>468</fpage>&#x2013;<lpage>481</lpage>. <pub-id pub-id-type="doi">10.1002/wcms.1183</pub-id> <pub-id pub-id-type="pmid">25285160</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Neves</surname> <given-names>B. J.</given-names></name> <name><surname>Dantas</surname> <given-names>R. F.</given-names></name> <name><surname>Senger</surname> <given-names>M. R.</given-names></name> <name><surname>Melo-Filho</surname> <given-names>C. C.</given-names></name> <name><surname>Valente</surname> <given-names>W. C.</given-names></name> <name><surname>De Almeida</surname> <given-names>A. C.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Discovery of new anti-schistosomal hits by integration of QSAR-Based virtual screening and high content screening.</article-title> <source><italic>J. Med. Chem.</italic></source> <volume>59</volume> <fpage>7075</fpage>&#x2013;<lpage>7088</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jmedchem.5b02038</pub-id> <pub-id pub-id-type="pmid">27396732</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pan</surname> <given-names>S. J.</given-names></name> <name><surname>Yang</surname> <given-names>Q.</given-names></name></person-group> (<year>2010</year>). <article-title>A survey on transfer learning.</article-title> <source><italic>IEEE Trans. Knowl. Data Eng.</italic></source> <volume>22</volume> <fpage>1345</fpage>&#x2013;<lpage>1359</lpage>. <pub-id pub-id-type="doi">10.1109/TKDE.2009.191</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pauwels</surname> <given-names>E.</given-names></name> <name><surname>Stoven</surname> <given-names>V.</given-names></name> <name><surname>Yamanishi</surname> <given-names>Y.</given-names></name></person-group> (<year>2011</year>). <article-title>Predicting drug side-effect profiles: a chemical fragment-based approach.</article-title> <source><italic>BMC Bioinformatics</italic></source> <volume>12</volume>:<issue>169</issue>. <pub-id pub-id-type="doi">10.1186/1471-2105-12-169</pub-id> <pub-id pub-id-type="pmid">21586169</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pereira</surname> <given-names>D. A.</given-names></name> <name><surname>Williams</surname> <given-names>J. A.</given-names></name></person-group> (<year>2007</year>). <article-title>Origin and evolution of high throughput screening.</article-title> <source><italic>Br. J. Pharmacol.</italic></source> <volume>152</volume> <fpage>53</fpage>&#x2013;<lpage>61</lpage>. <pub-id pub-id-type="doi">10.1038/sj.bjp.0707373</pub-id> <pub-id pub-id-type="pmid">17603542</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ponzoni</surname> <given-names>I.</given-names></name> <name><surname>Sebasti&#x00E1;n-P&#x00E9;rez</surname> <given-names>V.</given-names></name> <name><surname>Requena-Triguero</surname> <given-names>C.</given-names></name> <name><surname>Roca</surname> <given-names>C.</given-names></name> <name><surname>Mart&#x00ED;nez</surname> <given-names>M. J.</given-names></name> <name><surname>Cravero</surname> <given-names>F.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>Hybridizing feature selection and feature learning approaches in QSAR modeling for drug discovery.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>7</volume>:<issue>2403</issue>. <pub-id pub-id-type="doi">10.1038/s41598-017-02114-3</pub-id> <pub-id pub-id-type="pmid">28546583</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Raies</surname> <given-names>A. B.</given-names></name> <name><surname>Bajic</surname> <given-names>V. B.</given-names></name></person-group> (<year>2016</year>). <article-title><italic>In silico</italic> toxicology: computational methods for the prediction of chemical toxicity.</article-title> <source><italic>Wiley Interdiscip. Rev. Comput. Mol. Sci.</italic></source> <volume>6</volume> <fpage>147</fpage>&#x2013;<lpage>172</lpage>. <pub-id pub-id-type="doi">10.1002/wcms.1240</pub-id> <pub-id pub-id-type="pmid">27066112</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Raina</surname> <given-names>R.</given-names></name> <name><surname>Battle</surname> <given-names>A.</given-names></name> <name><surname>Lee</surname> <given-names>H.</given-names></name> <name><surname>Packer</surname> <given-names>B.</given-names></name> <name><surname>Ng</surname> <given-names>A. Y.</given-names></name></person-group> (<year>2007</year>). <article-title>&#x201C;Self-taught learning: transfer learning from unlabeled data,&#x201D; in</article-title> <source><italic>Proceedings of the 24th International Conference on Machine learning, ICML 2007</italic></source> (<publisher-loc>Corvalis, OR</publisher-loc>: <publisher-name>ACM</publisher-name>) <fpage>759</fpage>&#x2013;<lpage>766</lpage>. <pub-id pub-id-type="doi">10.1145/1273496.1273592</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Richter</surname> <given-names>L.</given-names></name> <name><surname>Ecker</surname> <given-names>G. F.</given-names></name></person-group> (<year>2015</year>). <article-title>Medicinal chemistry in the era of big data.</article-title> <source><italic>Drug Discov. Today</italic></source> <volume>14</volume> <fpage>37</fpage>&#x2013;<lpage>41</lpage>. <pub-id pub-id-type="doi">10.1016/j.ddtec.2015.06.001</pub-id> <pub-id pub-id-type="pmid">26194586</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rosenbaum</surname> <given-names>L.</given-names></name> <name><surname>D&#x00F6;rr</surname> <given-names>A.</given-names></name> <name><surname>Bauer</surname> <given-names>M. R.</given-names></name> <name><surname>Boeckler</surname> <given-names>F. M.</given-names></name> <name><surname>Zell</surname> <given-names>A.</given-names></name></person-group> (<year>2013</year>). <article-title>Inferring multi-target qsar models with taxonomy-based multi-task learning.</article-title> <source><italic>J. Cheminform.</italic></source> <volume>5</volume>:<issue>33</issue>. <pub-id pub-id-type="doi">10.1186/1758-2946-5-33</pub-id> <pub-id pub-id-type="pmid">23842210</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saha</surname> <given-names>B.</given-names></name> <name><surname>Gupta</surname> <given-names>S.</given-names></name> <name><surname>Phung</surname> <given-names>D.</given-names></name> <name><surname>Venkatesh</surname> <given-names>S.</given-names></name></person-group> (<year>2016</year>). &#x201C;<article-title>Transfer learning for rare cancer problems via discriminative sparse gaussian graphical model</article-title>,&#x201D; in <source><italic>Proceedings of the 23rd International Conference on Pattern Recognition (ICPR)</italic></source> (<publisher-loc>Cancun</publisher-loc>: <publisher-name>IEEE - Institute of Electrical and Electronics Engineers, Inc</publisher-name>) <fpage>537</fpage>&#x2013;<lpage>542</lpage>. <pub-id pub-id-type="doi">10.1109/ICPR.2016.7899689</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Satpal</surname> <given-names>S.</given-names></name> <name><surname>Sarawagi</surname> <given-names>S.</given-names></name></person-group> (<year>2007</year>). <article-title>&#x201C;Domain adaptation of conditional probability models via feature subsetting,&#x201D; in</article-title> <source><italic>Knowledge Discovery in Databases: PKDD</italic></source> <volume>2007</volume> <role>eds</role> <person-group person-group-type="editor"><name><surname>Kok</surname> <given-names>J. N.</given-names></name> <name><surname>Koronacki</surname> <given-names>J.</given-names></name> <name><surname>Lopez de Mantaras</surname> <given-names>R.</given-names></name> <name><surname>Matwin</surname> <given-names>S.</given-names></name> <name><surname>Mladenic</surname> <given-names>D.</given-names></name> <name><surname>Skowron</surname> <given-names>A.</given-names></name></person-group> (<publisher-loc>Berlin</publisher-loc>: <publisher-name>Springer</publisher-name>) <fpage>224</fpage>&#x2013;<lpage>235</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-540-74976-9_23</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Scior</surname> <given-names>T.</given-names></name> <name><surname>Medina-Franco</surname> <given-names>J. L.</given-names></name> <name><surname>Do</surname> <given-names>Q. T.</given-names></name> <name><surname>Mart&#x00ED;nez-Mayorga</surname> <given-names>K.</given-names></name> <name><surname>Yunes Rojas</surname> <given-names>J. A.</given-names></name> <name><surname>Bernard</surname> <given-names>P.</given-names></name></person-group> (<year>2009</year>). <article-title>How to recognize and workaround pitfalls in QSAR studies: a critical review.</article-title> <source><italic>Curr. Med. Chem.</italic></source> <volume>16</volume> <fpage>4297</fpage>&#x2013;<lpage>4313</lpage>. <pub-id pub-id-type="doi">10.2174/092986709789578213</pub-id> <pub-id pub-id-type="pmid">19754417</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shoichet</surname> <given-names>B. K.</given-names></name></person-group> (<year>2004</year>). <article-title>Virtual screening of chemical libraries.</article-title> <source><italic>Nature</italic></source> <volume>432</volume> <fpage>862</fpage>&#x2013;<lpage>865</lpage>. <pub-id pub-id-type="doi">10.1038/nature03197</pub-id> <pub-id pub-id-type="pmid">15602552</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sliwoski</surname> <given-names>G.</given-names></name> <name><surname>Kothiwale</surname> <given-names>S. K.</given-names></name> <name><surname>Meiler</surname> <given-names>J.</given-names></name> <name><surname>Lowe</surname> <given-names>E. W.</given-names></name> <name><surname>Barker</surname> <given-names>E. L.</given-names></name></person-group> (<year>2014</year>). <article-title>Computational methods in drug discovery.</article-title> <source><italic>Pharmacol. Rev.</italic></source> <volume>66</volume> <fpage>334</fpage>&#x2013;<lpage>395</lpage>. <pub-id pub-id-type="doi">10.1124/pr.112.007336</pub-id> <pub-id pub-id-type="pmid">24381236</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tan</surname> <given-names>B.</given-names></name> <name><surname>Song</surname> <given-names>Y.</given-names></name> <name><surname>Zhong</surname> <given-names>E.</given-names></name> <name><surname>Yang</surname> <given-names>Q.</given-names></name></person-group> (<year>2015</year>). <article-title>&#x201C;Transitive transfer learning,&#x201D; in</article-title> <source><italic>Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</italic></source> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>ACM</publisher-name>) <fpage>1155</fpage>&#x2013;<lpage>1164</lpage>. <pub-id pub-id-type="doi">10.1145/2783258.2783295</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tresadern</surname> <given-names>G.</given-names></name> <name><surname>Bemporad</surname> <given-names>D.</given-names></name> <name><surname>Howe</surname> <given-names>T.</given-names></name></person-group> (<year>2009</year>). <article-title>A comparison of ligand based virtual screening methods and application to corticotropin releasing factor 1 receptor.</article-title> <source><italic>J. Mol. Graph. Model.</italic></source> <volume>27</volume> <fpage>860</fpage>&#x2013;<lpage>870</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmgm.2009.01.003</pub-id> <pub-id pub-id-type="pmid">19230731</pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><collab>Tufts Center for the Study of Drug Development [CSDD]</collab> (<year>2014</year>). <source><italic>Cost to Develop and Win Marketing Approval for a New Drug Is $2.6 Billion</italic></source>. Available at: <ext-link ext-link-type="uri" xlink:href="http://csdd.tufts.edu/news/complete_story/pr_tufts_csdd_2014_cost_study">http://csdd.tufts.edu/news/complete_story/pr_tufts_csdd_2014_cost_study</ext-link></citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Turki</surname> <given-names>T.</given-names></name> <name><surname>Wei</surname> <given-names>Z.</given-names></name> <name><surname>Wang</surname> <given-names>J. T.</given-names></name></person-group> (<year>2017</year>). <article-title>Transfer learning approaches to improve drug sensitivity prediction in multiple Myeloma patients.</article-title> <source><italic>IEEE Access</italic></source> <volume>5</volume> <fpage>7381</fpage>&#x2013;<lpage>7393</lpage>. <pub-id pub-id-type="doi">10.1109/ACCESS.2017.2696523</pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Varnek</surname> <given-names>A.</given-names></name> <name><surname>Gaudin</surname> <given-names>C.</given-names></name> <name><surname>Marcou</surname> <given-names>G.</given-names></name> <name><surname>Baskin</surname> <given-names>I.</given-names></name> <name><surname>Pandey</surname> <given-names>A. K.</given-names></name> <name><surname>Tetko</surname> <given-names>I. V.</given-names></name></person-group> (<year>2009</year>). <article-title>Inductive transfer of knowledge: application of multi-task learning and feature net approaches to model tissue-air partition coefficients.</article-title> <source><italic>J. Chem. Inf. Model.</italic></source> <volume>49</volume> <fpage>133</fpage>&#x2013;<lpage>144</lpage>. <pub-id pub-id-type="doi">10.1021/ci8002914</pub-id> <pub-id pub-id-type="pmid">19125628</pub-id></citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Walters</surname> <given-names>W. P.</given-names></name> <name><surname>Stahl</surname> <given-names>M. T.</given-names></name> <name><surname>Murcko</surname> <given-names>M. A.</given-names></name></person-group> (<year>1998</year>). <article-title>Virtual screening&#x2014;an overview.</article-title> <source><italic>Drug Discov. Today</italic></source> <volume>3</volume> <fpage>160</fpage>&#x2013;<lpage>178</lpage>. <pub-id pub-id-type="doi">10.1016/S1359-6446(97)01163-X</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>T.</given-names></name> <name><surname>Wu</surname> <given-names>M. B.</given-names></name> <name><surname>Lin</surname> <given-names>J. P.</given-names></name> <name><surname>Yang</surname> <given-names>L. R.</given-names></name></person-group> (<year>2015</year>). <article-title>Quantitative structure-activity relationship: promising advances in drug discovery platforms.</article-title> <source><italic>Expert Opin. Drug Discov.</italic></source> <volume>11</volume> <fpage>1</fpage>&#x2013;<lpage>18</lpage>. <pub-id pub-id-type="doi">10.1517/17460441.2015.1083006</pub-id> <pub-id pub-id-type="pmid">26358617</pub-id></citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Bolton</surname> <given-names>E.</given-names></name> <name><surname>Dracheva</surname> <given-names>S.</given-names></name> <name><surname>Karapetyan</surname> <given-names>K.</given-names></name> <name><surname>Shoemaker</surname> <given-names>B. A.</given-names></name> <name><surname>Suzek</surname> <given-names>T. O.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>An overview of the PubChem BioAssay resource.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>38</volume> <fpage>D255</fpage>&#x2013;<lpage>D266</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkp965</pub-id> <pub-id pub-id-type="pmid">19933261</pub-id></citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zaccagnini</surname> <given-names>L.</given-names></name> <name><surname>Brogi</surname> <given-names>S.</given-names></name> <name><surname>Brindisi</surname> <given-names>M.</given-names></name> <name><surname>Gemma</surname> <given-names>S.</given-names></name> <name><surname>Chemi</surname> <given-names>G.</given-names></name> <name><surname>Legname</surname> <given-names>G.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>Identification of novel fluorescent probes preventing PrP Sc replication in prion diseases.</article-title> <source><italic>Eur. J. Med. Chem.</italic></source> <volume>127</volume> <fpage>859</fpage>&#x2013;<lpage>873</lpage>. <pub-id pub-id-type="doi">10.1016/j.ejmech.2016.10.064</pub-id> <pub-id pub-id-type="pmid">27842893</pub-id></citation></ref>
<ref id="B64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Sedykh</surname> <given-names>A.</given-names></name> <name><surname>Tripathi</surname> <given-names>A.</given-names></name> <name><surname>Zhu</surname> <given-names>H.</given-names></name> <name><surname>Afantitis</surname> <given-names>A.</given-names></name> <name><surname>Mouchlis</surname> <given-names>V. D.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Identification of putative estrogen receptor-mediated endocrine disrupting chemicals using QSAR- and structure-based virtual screening approaches.</article-title> <source><italic>Toxicol. Appl. Pharmacol.</italic></source> <volume>272</volume> <fpage>67</fpage>&#x2013;<lpage>76</lpage>. <pub-id pub-id-type="doi">10.1016/j.taap.2013.04.032</pub-id> <pub-id pub-id-type="pmid">23707773</pub-id></citation></ref>
<ref id="B65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Tan</surname> <given-names>J.</given-names></name> <name><surname>Han</surname> <given-names>D.</given-names></name> <name><surname>Zhu</surname> <given-names>H.</given-names></name></person-group> (<year>2017</year>). <article-title>From machine learning to deep learning: progress in machine intelligence for rational drug discovery.</article-title> <source><italic>Drug Discov. Today</italic></source> <volume>22</volume> <fpage>1680</fpage>&#x2013;<lpage>1685</lpage>. <pub-id pub-id-type="doi">10.1016/j.drudis.2017.08.010</pub-id> <pub-id pub-id-type="pmid">28881183</pub-id></citation></ref>
<ref id="B66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>W.</given-names></name> <name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Tu</surname> <given-names>S.</given-names></name> <name><surname>Liu</surname> <given-names>F.</given-names></name> <name><surname>Qu</surname> <given-names>Q.</given-names></name></person-group> (<year>2016</year>). &#x201C;<article-title>Drug side effect prediction through linear neighborhoods and multiple data source integration,&#x201D; in</article-title> <source><italic>Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine</italic></source> (<publisher-loc>Shenzhen</publisher-loc>: <publisher-name>IEEE</publisher-name>) <fpage>427</fpage>&#x2013;<lpage>434</lpage>. <pub-id pub-id-type="doi">10.1109/BIBM.2016.7822555</pub-id></citation></ref>
<ref id="B67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname> <given-names>L.</given-names></name> <name><surname>Wang</surname> <given-names>W.</given-names></name> <name><surname>Sedykh</surname> <given-names>A.</given-names></name> <name><surname>Zhu</surname> <given-names>H.</given-names></name></person-group> (<year>2017</year>). <article-title>Experimental errors in QSAR modeling sets: what we can do and what we cannot do.</article-title> <source><italic>ACS Omega</italic></source> <volume>2</volume> <fpage>2805</fpage>&#x2013;<lpage>2812</lpage>. <pub-id pub-id-type="doi">10.1021/acsomega.7b00274</pub-id> <pub-id pub-id-type="pmid">28691113</pub-id></citation></ref>
</ref-list>
</back>
</article>