<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fgene.2020.01016</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Pathway-Based Drug Response Prediction Using Similarity Identification in Gene Expression</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Madani Tonekaboni</surname> <given-names>Seyed Ali</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/807445/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Beri</surname> <given-names>Gangesh</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/971304/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Haibe-Kains</surname> <given-names>Benjamin</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<xref ref-type="aff" rid="aff5"><sup>5</sup></xref>
<xref ref-type="corresp" rid="c002"><sup>&#x002A;</sup></xref>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Princess Margaret Cancer Centre</institution>, <addr-line>Toronto, ON</addr-line>, <country>Canada</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Medical Biophysics, University of Toronto</institution>, <addr-line>Toronto, ON</addr-line>, <country>Canada</country></aff>
<aff id="aff3"><sup>3</sup><institution>Department of Computer Science, University of Toronto</institution>, <addr-line>Toronto, ON</addr-line>, <country>Canada</country></aff>
<aff id="aff4"><sup>4</sup><institution>Ontario Institute of Cancer Research</institution>, <addr-line>Toronto, ON</addr-line>, <country>Canada</country></aff>
<aff id="aff5"><sup>5</sup><institution>Vector Institute</institution>, <addr-line>Toronto, ON</addr-line>, <country>Canada</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Harinder Singh, J. Craig Venter Institute (La Jolla), United States</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Yang Dai, University of Illinois at Chicago, United States; Sherry Bhalla, Mount Sinai Hospital, United States</p></fn>
<corresp id="c001">&#x002A;Correspondence: Seyed Ali Madani Tonekaboni, <email>ali.madanitonekaboni@mail.utoronto.ca</email></corresp>
<corresp id="c002">Benjamin Haibe-Kains, <email>Benjamin.Haibe-Kains@uhnresearch.ca</email></corresp>
<fn fn-type="other" id="fn004"><p>This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>09</day>
<month>09</month>
<year>2020</year>
</pub-date>
<pub-date pub-type="collection">
<year>2020</year>
</pub-date>
<volume>11</volume>
<elocation-id>1016</elocation-id>
<history>
<date date-type="received">
<day>07</day>
<month>10</month>
<year>2019</year>
</date>
<date date-type="accepted">
<day>10</day>
<month>08</month>
<year>2020</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2020 Madani Tonekaboni, Beri and Haibe-Kains.</copyright-statement>
<copyright-year>2020</copyright-year>
<copyright-holder>Madani Tonekaboni, Beri and Haibe-Kains</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Lapatinib and trastuzumab (Herceptin) are targeted therapies designed for patients with HER2+ breast tumors. Although these therapies improved survival rates of patients with this tumor type, not all the patients harboring HER2 amplification respond to these drugs. The NeoALTTO clinical trial was designed to test whether a higher response rate can be achieved by combining lapatinib and trastuzumab. Although the combination therapy showed almost double the response rate compared to the monotherapies, 40% of the patients did not respond to the treatment. In this study, we sought to identify biomarkers of HER2+ breast cancer patients&#x2019; response to drugs relying on gene expression profiles of tumors. We show that univariate gene expression-based biomarkers are significant but weak predictors of drug response. We further show that pathway activities, estimated from gene expression patterns quantified using the recent transcriptional similarity coefficient (TSC) between the tumor samples, yield high predictive value for therapy response (concordance index &#x003E;0.8, <italic>p</italic> &#x003C; 0.05). Moreover, machine learning models, built using multiple algorithms including logistic regression, naive Bayes, random forest, k-nearest neighbor, and support vector machine, for predicting drug response in the NeoALTTO clinical trial, resulted in lower performance compared to our pathway-based approach. Our results indicate that transcriptional similarity of biological pathways can be used to predict lapatinib and trastuzumab response in HER2+ breast cancer.</p>
</abstract>
<kwd-group>
<kwd>breast cancer</kwd>
<kwd>human epidermal growth factor receptor 2</kwd>
<kwd>lapatinib</kwd>
<kwd>trastuzumab</kwd>
<kwd>transcriptional similarity coefficient</kwd>
<kwd>estrogen receptor</kwd>
</kwd-group>
<counts>
<fig-count count="4"/>
<table-count count="0"/>
<equation-count count="2"/>
<ref-count count="30"/>
<page-count count="9"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1">
<title>Introduction</title>
<p>Unsupervised clustering of breast tumor samples based on high-throughput expression profiles enabled the identification of HER2+ breast cancer subtype (<xref ref-type="bibr" rid="B5">Carey et al., 2006</xref>; <xref ref-type="bibr" rid="B27">Wirapati et al., 2008</xref>; <xref ref-type="bibr" rid="B19">Onitilo et al., 2009</xref>). Breast cancer survival differed by subtype (<italic>p</italic> &#x003C; 0.001), with shortest survival among HER2+ and basal-like subtypes. To treat HER2+ tumors, trastuzumab and lapatinib have been designed to target the EGFR/ERBB2 pathway and yielded a 30% rate of clinical response in the NeoALTTO clinical trial for patients with HER2+ breast tumors (<xref ref-type="bibr" rid="B1">Baselga et al., 2012</xref>). This led to their adoption as standard-of-care therapies for HER2+ breast cancer patients (<xref ref-type="bibr" rid="B1">Baselga et al., 2012</xref>). To increase the response rate of HER2+ tumors, the biomedical research community sought to design combination therapies. Concurrent treatment with these drugs with a response rate of almost 60% proved a higher efficacy of combination therapy with respect to lapatinib and trastuzumab as monotherapies (<xref ref-type="bibr" rid="B1">Baselga et al., 2012</xref>). However, there is a need to identify the non-responders to further improve the rate of treatment response in HER2+ breast cancer patients.</p>
<p>Stratification of responders and non-responders to lapatinib, trastuzumab, and their combination therapies was conducted using either gene-based or pathway (or gene set)-based approaches. In gene-based approaches, mutation, amplification, or expression of individual genes with known association with HER2+ breast tumor biology was investigated as potential biomarkers of response to these therapies (<xref ref-type="bibr" rid="B9">Gomez et al., 2007</xref>; <xref ref-type="bibr" rid="B2">Bianchini et al., 2011</xref>; <xref ref-type="bibr" rid="B6">Dave et al., 2011</xref>; <xref ref-type="bibr" rid="B13">Loibl et al., 2014</xref>; <xref ref-type="bibr" rid="B20">Schneeweiss et al., 2014</xref>; <xref ref-type="bibr" rid="B24">Vici et al., 2014</xref>; <xref ref-type="bibr" rid="B15">Menyh&#x00E1;rt et al., 2015</xref>). In spite of successful studies as part of gene-based analysis, they could not capture the whole picture of resistance mechanisms to targeted therapies in HER2+ breast cancer patients due to the complexity of HER2+ tumor biology (<xref ref-type="bibr" rid="B17">Nahta, 2012</xref>; <xref ref-type="bibr" rid="B7">de Melo Gagliato et al., 2016</xref>; <xref ref-type="bibr" rid="B30">Zhang et al., 2017</xref>). Hence, pathway (or gene set) approaches were conducted trying to identify the association of multiple genes or pathways to therapy response in HER2+ breast cancer patients.</p>
<p>The association of biological pathways to treatment response in HER2+ breast cancer patients has been conducted based on pathway enrichment analysis using individual genes identified either (1) based on higher activity in responders (or non-responders) versus non-responders (or responders) (<xref ref-type="bibr" rid="B29">Wu et al., 2012</xref>; <xref ref-type="bibr" rid="B3">Boulbes et al., 2015</xref>; <xref ref-type="bibr" rid="B18">Nam et al., 2015</xref>) or (2) as individual or multigene biomarkers of response (<xref ref-type="bibr" rid="B12">Harris et al., 2007</xref>; <xref ref-type="bibr" rid="B26">Willis et al., 2018</xref>). Here we present an alternative approach where pathways are used directly as features for predicting treatment response in HER2+ breast cancer patients.</p>
<p>In this study, we used our recent Similarity Identification in Gene Expression (SIGN) approach (<xref ref-type="bibr" rid="B14">Madani Tonekaboni et al., 2019</xref>) as a classifier relying on expression patterns in biological pathways for the classification of patient tumor samples to predict the response of cancer patients in each arm of the NeoALTTO clinical trial. We showed that the transcriptional similarity coefficient (TSC), identified comparing each patient tumor sample to responders versus non-responders, can be used to identify new pathway-based biomarkers of drug response in HER2+ breast cancer patients.</p>
</sec>
<sec id="S2" sec-type="materials|methods">
<title>Materials and Methods</title>
<p>The overall design of our study is illustrated in <xref ref-type="fig" rid="F1">Figure 1</xref>. In brief, we used the similarity of patterns of gene expression in biological pathways from patients responding to lapatinib, trastuzumab, and their combination to predict therapy response using our SIGN methodology (<xref ref-type="bibr" rid="B14">Madani Tonekaboni et al., 2019</xref>). In this framework, a leave-one-out cross-validation was used to assess the performance of each biomarker in predicting the response of cancer patients in each treatment category (<xref ref-type="fig" rid="F1">Figure 1</xref>). The data and detailed methods are described below.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Design of the study regarding identification of responders to lapatinib, trastuzumab, and their combination therapy in the NeoALTTO clinical trial. BP, MF, and CC stand for the Gene Ontology (GO) terms for biological processes, molecular functions, and cellular components, respectively.</p></caption>
<graphic xlink:href="fgene-11-01016-g001.tif"/>
</fig>
<sec id="S2.SS1">
<title>Gene Expression Profiles of Tumor Samples</title>
<p>RNA-seq raw data of tumor samples in the NeoALTTO clinical trial were quantified with Kallisto (<xref ref-type="bibr" rid="B4">Bray et al., 2016</xref>) in Toil pipeline (<xref ref-type="bibr" rid="B25">Vivian et al., 2017</xref>) using the GENCODE version 23 (ALL version) transcriptome annotation. Transcript level abundances are summarized to gene level using the same approach as described in <xref ref-type="bibr" rid="B23">Soneson et al. (2015)</xref>.</p>
</sec>
<sec id="S2.SS2">
<title>Clinical Definition of Responders Versus Non-responders</title>
<p>Responders and non-responders in the NeoALTTO clinical trial were determined using the rate of pathological complete response (pCR) (<xref ref-type="bibr" rid="B1">Baselga et al., 2012</xref>). Any patient without a recorded pCR was regarded as a non-responder. A pathological complete response is defined as no invasive cancer in the breast or only non-invasive <italic>in situ</italic> cancer in the breast specimen. Surgical breast and axillary node resection specimens were evaluated for pathologic tumor response according to the National Surgical Adjuvant Breast and Bowel Project (NSABP) guidelines<sup><xref ref-type="fn" rid="footnote1">1</xref></sup>.</p>
</sec>
<sec id="S2.SS3">
<title>Unsupervised Clustering</title>
<p>Similarities of samples within each arm of the NeoALTTO clinical trial were identified using Spearman&#x2019;s rank-order correlation (<xref ref-type="bibr" rid="B28">Wissler, 1905</xref>). The hierarchical clustering was then implemented on the similarity matrix between the samples using Euclidean distance and Ward&#x2019;s minimum variance method (<xref ref-type="bibr" rid="B16">Murtagh and Legendre, 2014</xref>).</p>
</sec>
<sec id="S2.SS4">
<title>Univariate Biomarker Discovery Using Genes</title>
<p>Concordance indices between the expression of each gene and the binarized vector of drug response were calculated as the prediction performance of each gene as a univariate biomarker. The significance of each identified C-index was calculated using a permutation test. The observations were randomly permuted, and the C-index between the expression of each gene and the observed classes for the tumor samples was calculated. Then the fraction of times in which the C-index of the gene expression with real observed classes was lower than the C-indices identified with permuted observed classes was considered as the significance (or FDR) of the C-index identified for that gene.</p>
</sec>
<sec id="S2.SS5">
<title>Concordance Index</title>
<p>We used the concordance index (C-index) to quantify the predictive value of our drug response predictors. The C-index estimates the probability that, for a pair of randomly chosen comparable samples, the sample with the higher predicted value will experience an event before the other sample or belongs to a higher binary class (<xref ref-type="bibr" rid="B11">Harrell et al., 1982</xref>). We used the implementation of the concordance index available in the survcomp R package (version 1.34.0) (<xref ref-type="bibr" rid="B21">Schr&#x00F6;der et al., 2011</xref>).</p>
</sec>
<sec id="S2.SS6">
<title>Transcriptional Similarity Coefficient</title>
<p>The transcriptional similarity coefficient (TSC) between each sample and the responders and non-responders were identified using the TSC function in the SIGN R package (version 0.1.0) (<xref ref-type="bibr" rid="B14">Madani Tonekaboni et al., 2019</xref>). Let <italic>P</italic> be the matrix of expression of genes within a pathway for a set of biological samples where rows are genes and columns are samples. Then the TSC is defined as follows:</p>
<disp-formula id="S2.Ex1">
<mml:math id="M1">
<mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>S</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>C</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mpadded width="+1.7pt">
<mml:mn>2</mml:mn>
</mml:mpadded>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mo largeop="true" symmetric="true">&#x2211;</mml:mo>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mn>10</mml:mn>
</mml:msub>
<mml:mo>&#x00D7;</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mn>20</mml:mn>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:msqrt>
<mml:mrow>
<mml:msqrt>
<mml:mrow>
<mml:msub>
<mml:mo largeop="true" symmetric="true">&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msubsup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mn>10</mml:mn>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:msqrt>
<mml:mo>&#x2062;</mml:mo>
<mml:msqrt>
<mml:mrow>
<mml:msub>
<mml:mo largeop="true" symmetric="true">&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msubsup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mn>20</mml:mn>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:msqrt>
</mml:mrow>
</mml:msqrt>
</mml:mfrac>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <italic>P</italic><sub>1</sub> and <italic>P</italic><sub>2</sub> represent the matrix of gene expressions of a given pathway in two sets of samples (populations 1 and 2), <italic>i</italic> is the row index (i.e., gene index) within each matrix, <italic>j</italic> is the column index (i.e., sample index) within each matrix, and <italic>P</italic><sub>m</sub><sub>0</sub> (either <italic>P</italic><sub>10</sub> or <italic>P</italic><sub>20</sub>)</p>
<disp-formula id="S2.Ex2">
<mml:math id="M2">
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>m</mml:mi>
</mml:msub>
<mml:mo>&#x00D7;</mml:mo>
<mml:msubsup>
<mml:mi>P</mml:mi>
<mml:mi>m</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msubsup>
</mml:mrow>
<mml:mo>-</mml:mo>
<mml:mrow>
<mml:mi>D</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>a</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>g</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>n</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>a</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>l</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>m</mml:mi>
</mml:msub>
<mml:mo>&#x00D7;</mml:mo>
<mml:msubsup>
<mml:mi>P</mml:mi>
<mml:mi>m</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msubsup>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <italic>m</italic> is either 1 for population 1 or 2 for population 2. Deducting the diagonal elements in the above equation was initially proposed to the bioinformatics community for analyzing genomics data (<xref ref-type="bibr" rid="B22">Smilde et al., 2009</xref>). This term will make sure that the identified similarities do not depend on the number of samples compared between the datasets.</p>
<p>The TSC captures the similarity of the pathway expression pattern between two samples and/or sample sets that is in the range [&#x2212;1,1].</p>
</sec>
<sec id="S2.SS7">
<title>Identifying Responders Using TSC</title>
<p>The TSC for each pathway was identified between one sample and the remaining samples, divided into two groups of responders and non-responders (<xref ref-type="bibr" rid="B1">Baselga et al., 2012</xref>). GO terms in level C5 with 10 to 30 genes are used in this study to identify the similarity between samples based on their gene expression pattern (<xref ref-type="bibr" rid="B14">Madani Tonekaboni et al., 2019</xref>). We limited the number of genes in GO terms to exclude large GO terms (at the top of the GO term hierarchy) that are parents of the GO terms in our study (at the bottom of the GO hierarchy). If the TSC for similarity for the responders was higher than that for the non-responders, the given sample was considered as a responder and <italic>vice versa</italic>. This process was repeated for every given sample in each arm of the trial. The method&#x2019;s performance for predicting the response of cancer patients was assessed using the concordance index.</p>
</sec>
<sec id="S2.SS8">
<title>Cross-Validation in Predictive Models</title>
<p>Each model was validated using leave-one-out cross-validation. In this setting, a target sample was put aside, and the rest of the samples were used for the prediction of drug response in the target sample. The TSC of each pathway between the set-aside sample and the randomly selected five samples from responders and non-responders were calculated. Then the median of the TSCs of all the pathways was calculated to assess if the sample has a higher similarity to responders or non-responders. This process was repeated 100 times for each sample, and majority votes of the 100 times were considered as the predicted class of the sample to be responder or non-responder.</p>
</sec>
</sec>
<sec id="S3">
<title>Results</title>
<p>We leveraged the gene expression and clinical information of HER2+ breast cancer patients in the NeoALTTO clinical trial to identify biomarkers of drug response. The NeoALTTO clinical trial was a phase three randomized clinical trial designed to assess the efficacy of anti-HER2 monoclonal antibody trastuzumab, the tyrosine kinase inhibitor lapatinib, and their combination therapies on HER2-overexpressing breast cancer patients. The response (pCR) rate was significantly higher in the group given lapatinib and trastuzumab (51.3%) than in the group given trastuzumab alone (29.5%; <italic>p</italic> &#x003C; 0.05). However, no significant difference in pCR between the lapatinib (24.7%) and the trastuzumab (<italic>p</italic> = 0.34) groups was observed.</p>
<p>We identified correlations of tumor samples based on their gene expression profiles in three arms of the clinical trial separated based on the treatment type including trastuzumab alone, lapatinib alone, and their combination therapies (<xref ref-type="fig" rid="F1">Figure 1</xref>). The unsupervised clustering of samples could not stratify the patients based on their responses, relying on the rate of pathological complete response (<xref ref-type="fig" rid="F2">Figure 2A</xref>).</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>Identifying responders to lapatinib, trastuzumab, and their combination therapy in the NeoALTTO clinical trial using genes as univariate predictors of response. <bold>(A)</bold> Clustering of samples based on their similarity, defined as the Spearman correlation between gene expression profiles of the sample. <bold>(B)</bold> Top 10 genes as univariate biomarkers of drug response in ER+ and ER&#x2013; cohorts within each arm of the NeoALTTO clinical trial.</p></caption>
<graphic xlink:href="fgene-11-01016-g002.tif"/>
</fig>
<p>We further computed the C-index of genes as univariate biomarkers of drug response in each arm of the NeoALTTO trial. Relying on the common knowledge on ER being one of the main drivers in breast cancer development and progression (<xref ref-type="bibr" rid="B8">Fuqua, 1997</xref>), we stratified our analyses based on the ER status. Top predictors of response yield a C-index of 0.68 (<xref ref-type="fig" rid="F2">Figure 2B</xref>), while the C-index of ERBB2 as a univariate biomarker of response in all the arms does not exceed 0.59 (the full list can be found in the <xref ref-type="supplementary-material" rid="DS1">Supplementary Material</xref>). The low performance of univariate modeling could be due to high correlation of patient tumor samples, as more than 90% of the tumor sample pairs had Pearson correlation of more than 0.9 using their gene expression profiles.</p>
<p>We recently showed the high performance of a new method called SIGN in predicting the survival rate of breast cancer patients under different therapeutic regimens (<xref ref-type="bibr" rid="B14">Madani Tonekaboni et al., 2019</xref>). We sought to use SIGN to predict the drug response of patients in each arm of the NeoALTTO trial. We used C-indices of the pathways, identified between the TSC of the pathways and the drug response in each arm of the trial, to cluster the arms. Trastuzumab alone and the combination therapy arms were clustered more closely compared to the lapatinib alone arm using the C-indices of the pathways, although the difference is not significant (<italic>p</italic> &#x003E; 0.05) (<xref ref-type="fig" rid="F3">Figure 3A</xref>). Moreover, the pathway biomarkers of ER&#x2212; and ER+ patient tumors showed low commonality revealing differences in the mechanism of response caused by the ER status of the patient tumors (absolute Spearman correlation &#x003C;0.08) (<xref ref-type="fig" rid="F3">Figure 3B</xref>). Top pathway biomarkers for patients with the same treatment regimen and ER status had C-indices of more than 0.8 except for ER&#x2212; patients under trastuzumab alone therapy (<xref ref-type="fig" rid="F3">Figure 3C</xref>).</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Identifying responders to lapatinib, trastuzumab, and their combination therapy in the NeoALTTO clinical trial using the transcriptional similarity coefficient (TSC) of pathways. <bold>(A)</bold> Concordance indices of delta TSCs of GO terms, comparing each sample with responders and non-responders, in predicting the response of patients to lapatinib, trastuzumab, and their combination. <bold>(B)</bold> Clustering of groups of patients based on Concordance indices of delta TSC of GO terms <bold>(A)</bold>. <bold>(C)</bold> Top pathways as predictors of lapatinib, trastuzumab, and their combination in ER+ and ER&#x2013; tumor samples in the NeoALTTO clinical trial.</p></caption>
<graphic xlink:href="fgene-11-01016-g003.tif"/>
</fig>
<p>Although the biological function of the identified pathways as biomarkers of drug response requires experimental validation, we found some evidence on their biological relevance. For example, among top identified biomarkers of drug response, there are REGULATION OF INTERFERON GAMMA BIOSYNTHETIC PROCESS and NEGATIVE T-CELL SELECTION for ER&#x2212; and ER+ cancer patients under lapatinib treatment, respectively. These are in agreement with previous literature on the importance of immune signaling in lapatinib response in cancer patients (<xref ref-type="bibr" rid="B10">Griguolo et al., 2019</xref>).</p>
<sec id="S3.SS1">
<title>Comparison With Other Machine Learning Models</title>
<p>We compared the top seven biomarkers identified in each arm of the NeoALTTO clinical trial for patients with ER&#x2212; or ER+ status with the performance of 35 machine learning models (<xref ref-type="fig" rid="F4">Figure 4</xref>). These models were built using five different machine learning algorithms, including logistic regression, k-nearest-neighbor (k-NN), naive Bayes, random forest, and support vector machine (SVM), and seven different feature selection approaches (<xref ref-type="fig" rid="F4">Figure 4</xref>). SIGN-based biomarkers outperformed all 35 models in all treatment categories. We used the same leave-one-out cross-validation strategy as used for SIGN to compare the performance of these models.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>Comparison of performance of top seven biomarkers of drug response using Similarity Identification in Gene Expression (SIGN) and 35 machine learning models built combining five machine learning methods and seven different feature selection approaches.</p></caption>
<graphic xlink:href="fgene-11-01016-g004.tif"/>
</fig>
</sec>
</sec>
<sec id="S4">
<title>Discussion</title>
<p>We propose SIGN as a new approach to identify biomarkers of drug response in other subtypes of breast cancer or other tumor types. We showed the utility of SIGN in predicting the response of HER2+ breast cancer patients to lapatinib, trastuzumab, and their combination therapies using transcription patterns within biological pathways. Our results further emphasize the information gained upon using genes within biological pathways instead of individual markers of drug response. Furthermore, it suggests transcriptional similarity coefficient (TSC) as a new measure of similarity between tumor samples to be used in predicting their response to drug response. SIGN-based biomarkers outperformed 35 different machine learning models in predicting drug response in each treatment category. Moreover, the SIGN approach provides us with highly interpretable pathway-based biomarkers of drug response. Although SIGN showed promising performance for predicting response to lapatinib, trastuzumab, and their combination in HER2+ breast cancer patients, this approach needs further validation to ensure its generalizability in new clinical datasets. Upon having access to further clinical data of HER2+ patients in each one of these treatment categories, our findings in this study can be further assessed and validated.</p>
</sec>
<sec id="S5">
<title>Data Availability Statement</title>
<p>The datasets generated for this study can be found in the <ext-link ext-link-type="uri" xlink:href="https://clinicaltrials.gov">ClinicalTrials.gov</ext-link> Identifier: NCT00553358.</p>
</sec>
<sec id="S6">
<title>Author Contributions</title>
<p>SM led the project and performed the computational analysis of the work under supervision of BH-K. GB and SM collected and curated the data. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="financial-disclosure">
<p><bold>Funding.</bold> This study was conducted with the support of the Cancer Research Society and the Ontario Institute for Cancer Research through funding provided by the Government of Ontario. SM was supported by Connaught the International Scholarships for Doctoral Students, Genome Canada, and the Ontario Research Funds. BH-K was supported by the Gattuso&#x2013;Slaight Personalized Cancer Medicine Fund at Princess Margaret Cancer Centre, the Natural Sciences and Engineering Research Council, and the Canadian Institutes of Health Research.</p>
</fn>
</fn-group>
<ack>
<p>The authors thank the organizers of the NeoALTTO clinical trial for sharing their valuable data with the scientific community.</p>
</ack>
<sec id="S9" sec-type="supplementary material"><title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fgene.2020.01016/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fgene.2020.01016/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.ZIP" id="DS1" mimetype="application/zip" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Data_Sheet_2.ZIP" id="DS2" mimetype="application/zip" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baselga</surname> <given-names>J.</given-names></name> <name><surname>Bradbury</surname> <given-names>I.</given-names></name> <name><surname>Eidtmann</surname> <given-names>H.</given-names></name> <name><surname>Di Cosimo</surname> <given-names>S.</given-names></name> <name><surname>de Azambuja</surname> <given-names>E.</given-names></name> <name><surname>Aura</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Lapatinib with trastuzumab for HER2-positive early breast cancer (NeoALTTO): a randomised, open-label, multicentre, phase 3 trial.</article-title> <source><italic>Lancet</italic></source> <volume>379</volume> <fpage>633</fpage>&#x2013;<lpage>640</lpage>. <pub-id pub-id-type="doi">10.1016/s0140-6736(11)61847-3</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bianchini</surname> <given-names>G.</given-names></name> <name><surname>Prat</surname> <given-names>A.</given-names></name> <name><surname>Pickl</surname> <given-names>M.</given-names></name> <name><surname>Belousov</surname> <given-names>A.</given-names></name> <name><surname>Koehler</surname> <given-names>A.</given-names></name> <name><surname>Semiglazov</surname> <given-names>V.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Response to neoadjuvant trastuzumab and chemotherapy in ER and ER- HER2-positive breast cancers: gene expression analysis.</article-title> <source><italic>J. Clin. Oncol.</italic></source> <volume>29</volume> <fpage>529</fpage>&#x2013;<lpage>529</lpage>. <pub-id pub-id-type="doi">10.1200/jco.2011.29.15_suppl.529</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boulbes</surname> <given-names>D. R.</given-names></name> <name><surname>Chauhan</surname> <given-names>G. B.</given-names></name> <name><surname>Jin</surname> <given-names>Q.</given-names></name> <name><surname>Bartholomeusz</surname> <given-names>C.</given-names></name> <name><surname>Esteva</surname> <given-names>F. J.</given-names></name></person-group> (<year>2015</year>). <article-title>CD44 expression contributes to trastuzumab resistance in HER2-positive breast cancer cells.</article-title> <source><italic>Breast Cancer Res. Treat.</italic></source> <volume>151</volume> <fpage>501</fpage>&#x2013;<lpage>513</lpage>. <pub-id pub-id-type="doi">10.1007/s10549-015-3414-3</pub-id> <pub-id pub-id-type="pmid">25971596</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bray</surname> <given-names>N. L.</given-names></name> <name><surname>Pimentel</surname> <given-names>H.</given-names></name> <name><surname>Melsted</surname> <given-names>P.</given-names></name> <name><surname>Pachter</surname> <given-names>L.</given-names></name></person-group> (<year>2016</year>). <article-title>Erratum: Near-optimal probabilistic RNA-seq quantification.</article-title> <source><italic>Nat. Biotechnol.</italic></source> <volume>34</volume>:<issue>888</issue>. <pub-id pub-id-type="doi">10.1038/nbt0816-888d</pub-id> <pub-id pub-id-type="pmid">27504780</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carey</surname> <given-names>L. A.</given-names></name> <name><surname>Perou</surname> <given-names>C. M.</given-names></name> <name><surname>Livasy</surname> <given-names>C. A.</given-names></name> <name><surname>Dressler</surname> <given-names>L. G.</given-names></name> <name><surname>Cowan</surname> <given-names>D.</given-names></name> <name><surname>Conway</surname> <given-names>K.</given-names></name><etal/></person-group> (<year>2006</year>). <article-title>Race, breast cancer subtypes, and survival in the carolina breast cancer study.</article-title> <source><italic>JAMA</italic></source> <volume>295</volume> <fpage>2492</fpage>&#x2013;<lpage>2502</lpage>.</citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dave</surname> <given-names>B.</given-names></name> <name><surname>Migliaccio</surname> <given-names>I.</given-names></name> <name><surname>Carolina Gutierrez</surname> <given-names>M.</given-names></name> <name><surname>Wu</surname> <given-names>M.-F.</given-names></name> <name><surname>Chamness</surname> <given-names>G. C.</given-names></name> <name><surname>Wong</surname> <given-names>H.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Loss of phosphatase and tensin homolog or phosphoinositol-3 kinase activation and response to trastuzumab or lapatinib in human epidermal growth factor receptor 2-overexpressing locally advanced breast cancers.</article-title> <source><italic>J. Clin. Oncol.</italic></source> <volume>29</volume> <fpage>166</fpage>&#x2013;<lpage>173</lpage>. <pub-id pub-id-type="doi">10.1200/jco.2009.27.7814</pub-id> <pub-id pub-id-type="pmid">21135276</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Melo Gagliato</surname> <given-names>D.</given-names></name> <name><surname>Jardim</surname> <given-names>D. L. F.</given-names></name> <name><surname>Marchesi</surname> <given-names>M. S. P.</given-names></name> <name><surname>Hortobagyi</surname> <given-names>G. N.</given-names></name></person-group> (<year>2016</year>). <article-title>Mechanisms of resistance and sensitivity to anti-HER2 therapies in HER2+ breast cancer.</article-title> <source><italic>Oncotarget</italic></source> <volume>7</volume> <fpage>64431</fpage>&#x2013;<lpage>64446</lpage>. <pub-id pub-id-type="doi">10.18632/oncotarget.7043</pub-id> <pub-id pub-id-type="pmid">26824988</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fuqua</surname> <given-names>S. A. W.</given-names></name></person-group> (<year>1997</year>). <article-title>The role of the estrogen receptor in breast cancer development and progression.</article-title> <source><italic>Steroids</italic></source> <volume>11.62</volume>:<issue>723</issue>.</citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gomez</surname> <given-names>H. L.</given-names></name> <name><surname>Chavez</surname> <given-names>M. A.</given-names></name> <name><surname>Doval</surname> <given-names>D. C.</given-names></name> <name><surname>Franco</surname> <given-names>S.</given-names></name> <name><surname>Arbushites</surname> <given-names>M.</given-names></name> <name><surname>Berger</surname> <given-names>M. S.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>Investigation of tumor biomarkers as response predictors in a monotherapy study with lapatinib (L) as a first line treatment in ErbB2 amplified women with breast cancer.</article-title> <source><italic>J. Clin. Oncol.</italic></source> <volume>25</volume> <fpage>10562</fpage>&#x2013;<lpage>10562</lpage>. <pub-id pub-id-type="doi">10.1200/jco.2007.25.18_suppl.10562</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Griguolo</surname> <given-names>G.</given-names></name> <name><surname>Pascual</surname> <given-names>T.</given-names></name> <name><surname>Dieci</surname> <given-names>M. V.</given-names></name> <name><surname>Guarneri</surname> <given-names>V.</given-names></name> <name><surname>Prat</surname> <given-names>A.</given-names></name></person-group> (<year>2019</year>). <article-title>Interaction of host immunity with HER2-targeted treatment and tumor heterogeneity in HER2-positive breast cancer.</article-title> <source><italic>J. Immunother. Cancer</italic></source> <volume>7</volume>:<issue>90</issue>.</citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Harrell</surname> <given-names>F. E.</given-names> <suffix>Jr.</suffix></name> <name><surname>Califf</surname> <given-names>R. M.</given-names></name> <name><surname>Pryor</surname> <given-names>D. B.</given-names></name> <name><surname>Lee</surname> <given-names>K. L.</given-names></name> <name><surname>Rosati</surname> <given-names>R. A.</given-names></name></person-group> (<year>1982</year>). <article-title>Evaluating the yield of medical tests.</article-title> <source><italic>JAMA</italic></source> <volume>247</volume> <fpage>2543</fpage>&#x2013;<lpage>2546</lpage>. <pub-id pub-id-type="doi">10.1001/jama.247.18.2543</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Harris</surname> <given-names>L. N.</given-names></name> <name><surname>You</surname> <given-names>F.</given-names></name> <name><surname>Schnitt</surname> <given-names>S. J.</given-names></name> <name><surname>Witkiewicz</surname> <given-names>A.</given-names></name> <name><surname>Lu</surname> <given-names>X.</given-names></name> <name><surname>Sgroi</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>Predictors of resistance to preoperative trastuzumab and vinorelbine for HER2-positive early breast cancer.</article-title> <source><italic>Clin. Cancer Res.</italic></source> <volume>13</volume> <fpage>1198</fpage>&#x2013;<lpage>1207</lpage>. <pub-id pub-id-type="doi">10.1158/1078-0432.ccr-06-1304</pub-id> <pub-id pub-id-type="pmid">17317830</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Loibl</surname> <given-names>S.</given-names></name> <name><surname>von Minckwitz</surname> <given-names>G.</given-names></name> <name><surname>Schneeweiss</surname> <given-names>A.</given-names></name> <name><surname>Paepke</surname> <given-names>S.</given-names></name> <name><surname>Lehmann</surname> <given-names>A.</given-names></name> <name><surname>Rezai</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>PIK3CA mutations are associated with lower rates of pathologic complete response to anti-human epidermal growth factor receptor 2 (HER2) therapy in primary HER2-overexpressing breast cancer.</article-title> <source><italic>J. Clin. Oncol.</italic></source> <volume>32</volume> <fpage>3212</fpage>&#x2013;<lpage>3220</lpage>. <pub-id pub-id-type="doi">10.1200/jco.2014.55.7876</pub-id> <pub-id pub-id-type="pmid">25199759</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Madani Tonekaboni</surname> <given-names>S. A.</given-names></name> <name><surname>Kumar Manem</surname> <given-names>V. S.</given-names></name> <name><surname>El-Hachem</surname> <given-names>N.</given-names></name> <name><surname>Haibe-Kains</surname> <given-names>B.</given-names></name></person-group> (<year>2019</year>). <article-title>SIGN: similarity identification in gene expression.</article-title> <source><italic>Bioinformatics</italic></source> <volume>35</volume> <fpage>4830</fpage>&#x2013;<lpage>4833</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btz485</pub-id> <pub-id pub-id-type="pmid">31198954</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Menyh&#x00E1;rt</surname> <given-names>O.</given-names></name> <name><surname>Santarpia</surname> <given-names>L.</given-names></name> <name><surname>Gy&#x00F5;rffy</surname> <given-names>B.</given-names></name></person-group> (<year>2015</year>). <article-title>A comprehensive outline of trastuzumab resistance biomarkers in HER2 overexpressing breast cancer.</article-title> <source><italic>Curr. Cancer Drug Targets</italic></source> <volume>15</volume> <fpage>665</fpage>&#x2013;<lpage>683</lpage>. <pub-id pub-id-type="doi">10.2174/156800961508151001101742</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Murtagh</surname> <given-names>F.</given-names></name> <name><surname>Legendre</surname> <given-names>P.</given-names></name></person-group> (<year>2014</year>). <article-title>Ward&#x2019;s hierarchical agglomerative clustering method: which algorithms implement ward&#x2019;s criterion?</article-title> <source><italic>J. Classif.</italic></source> <volume>31</volume> <fpage>274</fpage>&#x2013;<lpage>295</lpage>. <pub-id pub-id-type="doi">10.1007/s00357-014-9161-z</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nahta</surname> <given-names>R.</given-names></name></person-group> (<year>2012</year>). <article-title>molecular mechanisms of trastuzumab-based treatment in HER2-overexpressing breast cancer.</article-title> <source><italic>ISRN Oncol.</italic></source> <volume>2012</volume>:<issue>428062</issue>.</citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nam</surname> <given-names>S.</given-names></name> <name><surname>Chang</surname> <given-names>H. R.</given-names></name> <name><surname>Jung</surname> <given-names>H. R.</given-names></name> <name><surname>Gim</surname> <given-names>Y.</given-names></name> <name><surname>Kim</surname> <given-names>N. Y.</given-names></name> <name><surname>Grailhe</surname> <given-names>R.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>A pathway-based approach for identifying biomarkers of tumor progression to trastuzumab-resistant breast cancer.</article-title> <source><italic>Cancer Lett.</italic></source> <volume>356</volume> <fpage>880</fpage>&#x2013;<lpage>890</lpage>. <pub-id pub-id-type="doi">10.1016/j.canlet.2014.10.038</pub-id> <pub-id pub-id-type="pmid">25449779</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Onitilo</surname> <given-names>A. A.</given-names></name> <name><surname>Engel</surname> <given-names>J. M.</given-names></name> <name><surname>Greenlee</surname> <given-names>R. T.</given-names></name> <name><surname>Mukesh</surname> <given-names>B. N.</given-names></name></person-group> (<year>2009</year>). <article-title>Breast cancer subtypes based on ER/PR and Her2 expression: comparison of clinicopathologic features and survival.</article-title> <source><italic>Clin. Med. Res.</italic></source> <volume>7</volume> <fpage>4</fpage>&#x2013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.3121/cmr.2008.825</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schneeweiss</surname> <given-names>A.</given-names></name> <name><surname>Chia</surname> <given-names>S.</given-names></name> <name><surname>Hegg</surname> <given-names>R.</given-names></name> <name><surname>Tausch</surname> <given-names>C.</given-names></name> <name><surname>Deb</surname> <given-names>R.</given-names></name> <name><surname>Ratnayake</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Evaluating the predictive value of biomarkers for efficacy outcomes in response to pertuzumab- and trastuzumab-based therapy: an exploratory analysis of the TRYPHAENA study.</article-title> <source><italic>Breast Cancer Res.</italic></source> <volume>16</volume> <issue>bcr3690</issue>. <pub-id pub-id-type="doi">10.1186/bcr3690</pub-id> <pub-id pub-id-type="pmid">25005255</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schr&#x00F6;der</surname> <given-names>M. S.</given-names></name> <name><surname>Culhane</surname> <given-names>A. C.</given-names></name> <name><surname>Quackenbush</surname> <given-names>J.</given-names></name> <name><surname>Haibe-Kains</surname> <given-names>B.</given-names></name></person-group> (<year>2011</year>). <article-title>survcomp: an R/Bioconductor package for performance assessment and comparison of survival models.</article-title> <source><italic>Bioinformatics</italic></source> <volume>27</volume> <fpage>3206</fpage>&#x2013;<lpage>3208</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btr511</pub-id> <pub-id pub-id-type="pmid">21903630</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smilde</surname> <given-names>A. K.</given-names></name> <name><surname>Kiers</surname> <given-names>H. A. L.</given-names></name> <name><surname>Bijlsma</surname> <given-names>S.</given-names></name> <name><surname>Rubingh</surname> <given-names>C. M.</given-names></name> <name><surname>van Erk</surname> <given-names>M. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Matrix correlations for high-dimensional data: the modified RV-coefficient.</article-title> <source><italic>Bioinformatics</italic></source> <volume>25</volume> <fpage>401</fpage>&#x2013;<lpage>405</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btn634</pub-id> <pub-id pub-id-type="pmid">19073588</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Soneson</surname> <given-names>C.</given-names></name> <name><surname>Love</surname> <given-names>M. I.</given-names></name> <name><surname>Robinson</surname> <given-names>M. D.</given-names></name></person-group> (<year>2015</year>). <article-title>Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences.</article-title> <source><italic>F1000Research</italic></source> <volume>4</volume>:<issue>1521</issue>. <pub-id pub-id-type="doi">10.12688/f1000research.7563.1</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vici</surname> <given-names>P.</given-names></name> <name><surname>Mottolese</surname> <given-names>M.</given-names></name> <name><surname>Pizzuti</surname> <given-names>L.</given-names></name> <name><surname>Barba</surname> <given-names>M.</given-names></name> <name><surname>Sperati</surname> <given-names>F.</given-names></name> <name><surname>Terrenato</surname> <given-names>I.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>The Hippo transducer TAZ as a biomarker of pathological complete response in HER2-positive breast cancer patients treated with trastuzumab-based neoadjuvant therapy.</article-title> <source><italic>Oncotarget</italic></source> <volume>5</volume> <fpage>9619</fpage>&#x2013;<lpage>9625</lpage>. <pub-id pub-id-type="doi">10.18632/oncotarget.2449</pub-id> <pub-id pub-id-type="pmid">25294813</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vivian</surname> <given-names>J.</given-names></name> <name><surname>Rao</surname> <given-names>A. A.</given-names></name> <name><surname>Nothaft</surname> <given-names>F. A.</given-names></name> <name><surname>Ketchum</surname> <given-names>C.</given-names></name> <name><surname>Armstrong</surname> <given-names>J.</given-names></name> <name><surname>Novak</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>Toil enables reproducible, open source, big biomedical data analyses.</article-title> <source><italic>Nat. Biotechnol.</italic></source> <volume>35</volume> <fpage>314</fpage>&#x2013;<lpage>316</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.3772</pub-id> <pub-id pub-id-type="pmid">28398314</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Willis</surname> <given-names>S.</given-names></name> <name><surname>Polydoropoulou</surname> <given-names>V.</given-names></name> <name><surname>Sun</surname> <given-names>Y.</given-names></name> <name><surname>Young</surname> <given-names>B.</given-names></name> <name><surname>Tsourti</surname> <given-names>Z.</given-names></name> <name><surname>Karlis</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Exploratory analysis of single-gene predictive biomarkers in HERA DASL cohort reveals that C8A mRNA expression is prognostic of outcome and predictive of benefit of trastuzumab.</article-title> <source><italic>JCO Prec. Oncol.</italic></source> <volume>2</volume> <fpage>1</fpage>&#x2013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.1200/po.18.00016</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wirapati</surname> <given-names>P.</given-names></name> <name><surname>Sotiriou</surname> <given-names>C.</given-names></name> <name><surname>Kunkel</surname> <given-names>S.</given-names></name> <name><surname>Farmer</surname> <given-names>P.</given-names></name> <name><surname>Pradervand</surname> <given-names>S.</given-names></name> <name><surname>Haibe-Kains</surname> <given-names>B.</given-names></name><etal/></person-group> (<year>2008</year>). <article-title>Meta-analysis of gene expression profiles in breast cancer: toward a unified understanding of breast cancer subtyping and prognosis signatures.</article-title> <source><italic>Breast Cancer Res.</italic></source> <volume>10</volume>:<issue>124</issue>. <pub-id pub-id-type="doi">10.1186/bcr2124</pub-id> <pub-id pub-id-type="pmid">18662380</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wissler</surname> <given-names>C.</given-names></name></person-group> (<year>1905</year>). <article-title>The spearman correlation formula.</article-title> <source><italic>Science</italic></source> <volume>22</volume> <fpage>309</fpage>&#x2013;<lpage>311</lpage>. <pub-id pub-id-type="doi">10.1126/science.22.558.309</pub-id> <pub-id pub-id-type="pmid">17836577</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>Y.</given-names></name> <name><surname>Ginther</surname> <given-names>C.</given-names></name> <name><surname>Kim</surname> <given-names>J.</given-names></name> <name><surname>Mosher</surname> <given-names>N.</given-names></name> <name><surname>Chung</surname> <given-names>S.</given-names></name> <name><surname>Slamon</surname> <given-names>D.</given-names></name></person-group> (<year>2012</year>). <article-title>Expression of Wnt3 activates Wnt/b-catenin pathway and promotes EMT-like phenotype in trastuzumab-resistant HER2-overexpressing breast cancer cells.</article-title> <source><italic>Mol. Cancer Res.</italic></source> <volume>10</volume> <fpage>1597</fpage>&#x2013;<lpage>1606</lpage>. <pub-id pub-id-type="doi">10.1158/1541-7786.mcr-12-0155-t</pub-id> <pub-id pub-id-type="pmid">23071104</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Huang</surname> <given-names>Y.</given-names></name> <name><surname>Zhuo</surname> <given-names>W.</given-names></name> <name><surname>Zhu</surname> <given-names>Y.</given-names></name> <name><surname>Zhu</surname> <given-names>B.</given-names></name> <name><surname>Chen</surname> <given-names>Z.</given-names></name></person-group> (<year>2017</year>). <article-title>Identification and characterization of biomarkers and their functions for Lapatinib-resistant breast cancer.</article-title> <source><italic>Med. Oncol.</italic></source> <volume>34</volume>:<issue>89</issue>.</citation></ref>
</ref-list>
<fn-group>
<fn id="footnote1">
<label>1</label>
<p><ext-link ext-link-type="uri" xlink:href="https://clinicaltrials.gov/ct2/show/results/NCT00553358">https://clinicaltrials.gov/ct2/show/results/NCT00553358</ext-link></p></fn>
</fn-group>
</back>
</article>
