<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="review-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Artif. Intell.</journal-id>
<journal-title>Frontiers in Artificial Intelligence</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Artif. Intell.</abbrev-journal-title>
<issn pub-type="epub">2624-8212</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/frai.2021.659037</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Artificial Intelligence</subject>
<subj-group>
<subject>Review</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Applications of Topological Data Analysis in Oncology</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Bukkuri</surname> <given-names>Anuraag</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1059851/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Andor</surname> <given-names>Noemi</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Darcy</surname> <given-names>Isabel K.</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1245672/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Integrated Mathematical Oncology, Moffitt Cancer Center</institution>, <addr-line>Tampa, FL</addr-line>, <country>United States</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Mathematics, University of Iowa</institution>, <addr-line>Iowa City, IA</addr-line>, <country>United States</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Umberto Lupo, &#x000C9;cole Polytechnique F&#x000E9;d&#x000E9;rale de Lausanne, Switzerland</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Bala Krishnamoorthy, Washington State University Vancouver, United States; Pablo G. Camara, University of Pennsylvania, United States</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Anuraag Bukkuri  <email>anuraag.bukkuri&#x00040;moffitt.org</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Machine Learning and Artificial Intelligence, a section of the journal Frontiers in Artificial Intelligence</p></fn></author-notes>
<pub-date pub-type="epub">
<day>13</day>
<month>04</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>4</volume>
<elocation-id>659037</elocation-id>
<history>
<date date-type="received">
<day>26</day>
<month>01</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>16</day>
<month>03</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2021 Bukkuri, Andor and Darcy.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Bukkuri, Andor and Darcy</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license> </permissions>
<abstract><p>The emergence of the information age in the last few decades brought with it an explosion of biomedical data. But with great power comes great responsibility: there is now a pressing need for new data analysis algorithms to be developed to make sense of the data and transform this information into knowledge which can be directly translated into the clinic. Topological data analysis (TDA) provides a promising path forward: using tools from the mathematical field of algebraic topology, TDA provides a framework to extract insights into the often high-dimensional, incomplete, and noisy nature of biomedical data. Nowhere is this more evident than in the field of oncology, where patient-specific data is routinely presented to clinicians in a variety of forms, from imaging to single cell genomic sequencing. In this review, we focus on applications involving persistent homology, one of the main tools of TDA. We describe some recent successes of TDA in oncology, specifically in predicting treatment responses and prognosis, tumor segmentation and computer-aided diagnosis, disease classification, and cellular architecture determination. We also provide suggestions on avenues for future research including utilizing TDA to analyze cancer time-series data such as gene expression changes during pathogenesis, investigation of the relation between angiogenic vessel structure and treatment efficacy from imaging data, and experimental confirmation that geometric and topological connectivity implies functional connectivity in the context of cancer.</p></abstract>
<kwd-group>
<kwd>topological data analysis</kwd>
<kwd>persistent homology</kwd>
<kwd>oncology</kwd>
<kwd>single cell analysis</kwd>
<kwd>imaging</kwd>
<kwd>clonal evolution</kwd>
<kwd>tumor heterogeneity</kwd>
</kwd-group>
<counts>
<fig-count count="5"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="138"/>
<page-count count="14"/>
<word-count count="12866"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>1. Introduction</title>
<p>With the advent of next-generation high-throughput sequencing (Roychowdhury et al., <xref ref-type="bibr" rid="B109">2011</xref>; Reuter et al., <xref ref-type="bibr" rid="B108">2015</xref>), improved medical imaging (Wang, <xref ref-type="bibr" rid="B128">2016</xref>; Tahmassebi et al., <xref ref-type="bibr" rid="B119">2018</xref>; Aiello et al., <xref ref-type="bibr" rid="B4">2019</xref>), and an increased focus on personalized medicine (Dilsizian and Siegel, <xref ref-type="bibr" rid="B40">2014</xref>; Gu and Taylor, <xref ref-type="bibr" rid="B60">2014</xref>; Alyass et al., <xref ref-type="bibr" rid="B7">2015</xref>; Suwinski et al., <xref ref-type="bibr" rid="B118">2019</xref>), more data is being collected than ever before. Efficient data analysis techniques are critically needed to convert this data into meaningful, clinically translatable information. Topological data analysis (TDA) focuses on the shape of data, identifying both local and global structures at multiple scales. Consider a trivial example: suppose data points lie on a circle. The data points could represent customers&#x00027; preferences or patient gene expression. In this case if a product or drug were targeted to the average person, the target would be the center of the circle and would thus miss the data set entirely. While this is a simple made-up example, it illustrates the importance of understanding the shape of data. TDA can be applied to high-dimensional and noisy data. While the output of TDA can be affected by incomplete data, it is still effective at distinguishing between data sets that have different shapes.</p>
<p>TDA has been successfully applied in a variety of medical contexts including to discover phenotype-biomarker associations in traumatic brain injury (Nielson et al., <xref ref-type="bibr" rid="B95">2017</xref>), identify diagnostic factors for pulmonary embolism (Rucco et al., <xref ref-type="bibr" rid="B110">2015</xref>), discriminate between healthy patients and those with diabetic retinopathy from retinal imaging (Garside et al., <xref ref-type="bibr" rid="B52">2019</xref>), map human recombination at fine scales (Camara et al., <xref ref-type="bibr" rid="B24">2016</xref>), identify novel pathological phenotypes of asthma (Siddiqui et al., <xref ref-type="bibr" rid="B112">2018</xref>), and characterize the structure of chromatin conformation inside the nucleus (Emmett et al., <xref ref-type="bibr" rid="B45">2016</xref>). In this review, we shall focus our attention on some recent applications of persistent homology, a main tool of TDA, to oncology. We specifically discuss treatment responses, clinical outcomes, disease classification, biomarker identification, and cellular architecture in cancer. We will also provide insights into possible future fruitful avenues of research, including analysis of time-series data to help with disease classification and identification of selection events, investigation of the relation between angiogenic vessel structure and treatment efficacy from imaging data, and experimental confirmation that geometric and topological connectivity implies functional connectivity in the context of cancer. Though we focus on persistent homology here, it is worth noting that there have been many notable successes of the application of other TDA methods, such as the <italic>Mapper</italic> algorithm (Singh et al., <xref ref-type="bibr" rid="B113">2007</xref>). For example, <italic>Mapper</italic> was recently used to extract information from high-throughput microarray data and define a new subtype of breast cancer, c-MYB&#x0002B;, characterized by high c-MYB expression and low levels of innate inflammatory genes, with corresponding patients exhibiting 100% survival and no metastasis (Nicolau et al., <xref ref-type="bibr" rid="B94">2007</xref>). In another study, <italic>Mapper</italic> was used to discover 38 new cancer-associated genes across tumor types, some of which were then confirmed to play a key role in tumorigenesis in mouse models (Rabad&#x000E1;n et al., <xref ref-type="bibr" rid="B104">2020</xref>). Before delving into the applications of persistent homology in cancer, we introduce some of the key mathematical underpinnings needed to understand these results.</p>
</sec>
<sec id="s2">
<title>2. What Is Persistent Homology?</title>
<p>The mathematical definition of homology/homologous is very precise and often differs from the English common usage. Homology uses algebra to detect topological shapes. Topology is sometimes called rubber sheet geometry as two objects are topologically equivalent to each other if one can be deformed into the other without tearing or puncturing the objects. For example, the spherical and cubical surfaces are topologically equivalent per <xref ref-type="fig" rid="F1">Figure 1A</xref>. The sphere is topologically different from the 3-dimensional ball that the sphere bounds. Homology detects this difference by noting that the 2-dimensional spherical surface bounds a void while the 3-dimensional ball is solid and thus does not bound any voids.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>(A)</bold> The solid ball and solid cube are topologically equivalent and thus have the same homology. Their surface boundaries also have the same homology since these surfaces are topologically equivalent. The solid ball has one connected component and thus &#x003B2;<sub>0</sub> &#x0003D; 1. The solid ball does not contain any voids, and thus &#x003B2;<sub><italic>i</italic></sub> &#x0003D; 0 for all <italic>i</italic> &#x0003E; 0. The sphere, which is the boundary of the ball, has &#x003B2;<sub>0</sub> &#x0003D; 1 since it is connected, and &#x003B2;<sub>2</sub> &#x0003D; 1 since the 2-dimensional sphere bounds a void, while &#x003B2;<sub><italic>i</italic></sub> &#x0003D; 0 for all other <italic>i</italic> since there are no lower or higher dimensional voids. For an <italic>n</italic> &#x0002B; 1-dimensional ball (for example, all points of distance less than or equal to 1 from the origin in <italic>R</italic><sup><italic>n</italic>&#x0002B;1</sup>), &#x003B2;<sub><italic>i</italic></sub> &#x0003D; 0 for all <italic>i</italic> &#x0003E; 0 since it does not contain any voids. The <italic>n</italic>-dimensional sphere, which is the boundary of the <italic>n</italic> &#x0002B; 1-dimensional ball, has &#x003B2;<sub><italic>i</italic></sub> &#x0003D; 1 for <italic>i</italic> &#x0003D; 0, <italic>n</italic> and &#x003B2;<sub><italic>i</italic></sub> &#x0003D; 0 for all other <italic>i</italic>. Since the <italic>n</italic>-dimensional sphere contains a void, it is the <italic>n</italic>-dimensional object that generates &#x003B2;<sub><italic>n</italic></sub>. <bold>(B)</bold> A surface with boundary that is topologically equivalent to an annulus. The annulus is a 2-dimensional surface that has the same homology as a 1-dimensional circle. Since this object has one connected component, &#x003B2;<sub>0</sub> &#x0003D; 1. We can use addition to represent a cycle. The cycle <italic>e</italic><sub>5</sub> &#x0002B; <italic>e</italic><sub>9</sub> &#x0002B; <italic>e</italic><sub>12</sub> &#x0002B; <italic>e</italic><sub>8</sub> &#x0003D; <italic>e</italic><sub>5</sub> &#x0002B; <italic>e</italic><sub>8</sub> &#x0002B; <italic>e</italic><sub>9</sub> &#x0002B; <italic>e</italic><sub>12</sub> is homologous to 0 since it bounds a surface (the light green waning crescent moon). Since all 1-dimensional cycles are either homologous to 0 or to (a multiple of) the rectangle cycle <italic>e</italic><sub>1</sub> &#x0002B; <italic>e</italic><sub>2</sub> &#x0002B; <italic>e</italic><sub>3</sub> &#x0002B; <italic>e</italic><sub>4</sub>, &#x003B2;<sub>1</sub> &#x0003D; 1. Since this object lives in the 2-dimensional plane, &#x003B2;<sub><italic>i</italic></sub> &#x0003D; 0 for all <italic>i</italic> &#x0003E; 1. <bold>(C)</bold> The solid torus has &#x003B2;<sub><italic>i</italic></sub> &#x0003D; 1 for <italic>i</italic> &#x0003D; 0, 1 and &#x003B2;<sub><italic>i</italic></sub> &#x0003D; 0 for all other <italic>i</italic> while its boundary, the torus, has &#x003B2;<sub><italic>i</italic></sub> &#x0003D; 1 for <italic>i</italic> &#x0003D; 0, 2, &#x003B2;<sub>1</sub> &#x0003D; 2, and &#x003B2;<sub><italic>i</italic></sub> &#x0003D; 0 for all other <italic>i</italic>. The thick blue cycle is a 1-dimensional homology generator for both the solid torus and its boundary. The thiner black cycle is a homologous to 0 in the solid torus as it bounds a meridinal disk, while this black circle is a homology generator in the torus which is not homologous to the blue circle. The torus surface generates the 2-dimensional homology.</p></caption>
<graphic xlink:href="frai-04-659037-g0001.tif"/>
</fig>
<p>To describe homology, we will first focus on two quantities: &#x003B2;<sub>0</sub> = the number of connected components and &#x003B2;<sub>1</sub> = the number of 1-dimensional holes (a circle that has not been filled in). One does not need to understand the algebra of homology in order to understand the basics of persistent homology, thus we will only briefly introduce some concepts for the interested reader. Two points are homologous if they are in the same connected component. Thus, &#x003B2;<sub>0</sub> = 1 if the object is connected. To describe &#x003B2;<sub>1</sub>, we will focus on <xref ref-type="fig" rid="F1">Figure 1B</xref>. We can use addition to represent topological objects. For example, the rectangle in <xref ref-type="fig" rid="F1">Figure 1</xref> is represented by the sum of edges: <italic>e</italic><sub>1</sub> &#x0002B; <italic>e</italic><sub>2</sub> &#x0002B; <italic>e</italic><sub>3</sub> &#x0002B; <italic>e</italic><sub>4</sub>. Two 1-dimensional cycles are homologous to each other if they form the boundary of a surface. Thus, the rectangle is homologous to the cycle <italic>e</italic><sub>5</sub> &#x0002B; <italic>e</italic><sub>8</sub> &#x0002B; <italic>e</italic><sub>10</sub> &#x0002B; <italic>e</italic><sub>11</sub> since these two cycles bound the green surface. The cycles <italic>e</italic><sub>5</sub> &#x0002B; <italic>e</italic><sub>6</sub> &#x0002B; <italic>e</italic><sub>7</sub> &#x0002B; <italic>e</italic><sub>8</sub> and <italic>e</italic><sub>9</sub> &#x0002B; <italic>e</italic><sub>10</sub> &#x0002B; <italic>e</italic><sub>11</sub> &#x0002B; <italic>e</italic><sub>12</sub> are also homologous since they bound the light green surface consisting of two crescent moons. In fact all these cycles are homologous to the rectangle <italic>e</italic><sub>1</sub> &#x0002B; <italic>e</italic><sub>2</sub> &#x0002B; <italic>e</italic><sub>3</sub> &#x0002B; <italic>e</italic><sub>4</sub>. One can see that this object contains many cycles, many of which are homologous to the rectangle (or a multiple of the rectangle, for example, <inline-formula><mml:math id="M1"><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>5</mml:mn></mml:mrow><mml:mrow><mml:mn>12</mml:mn></mml:mrow></mml:munderover></mml:mstyle><mml:msub><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is homologous to <inline-formula><mml:math id="M2"><mml:mn>2</mml:mn><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:munderover></mml:mstyle><mml:msub><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>). A 1-dimensional cycle is homologous to 0 if it bounds a surface. Thus the cycles <italic>e</italic><sub>5</sub> &#x0002B; <italic>e</italic><sub>9</sub> &#x0002B; <italic>e</italic><sub>12</sub> &#x0002B; <italic>e</italic><sub>8</sub> and <italic>e</italic><sub>6</sub> &#x0002B; <italic>e</italic><sub>7</sub> &#x0002B; <italic>e</italic><sub>11</sub> &#x0002B; <italic>e</italic><sub>10</sub> are both homologous to 0 since they each form the boundary of a surface (the two crescent moons, waning or waxing, respectively). Since each of the cycles in this figure are homologous to 0 or to a multiple of the rectangle, its homology is generated by a single cycle (for example, the rectangle) and thus &#x003B2;<sub>1</sub> &#x0003D; 1.</p>
<p>The intuitive definition of homology is that &#x003B2;<sub><italic>n</italic></sub> equals the number of <italic>n</italic>&#x02212;dimensional holes<xref ref-type="fn" rid="fn0001"><sup>1</sup></xref>. Per the <xref ref-type="fig" rid="F1">Figure 1</xref> caption, homology can be used to distinguish the following objects from each other: solid ball, sphere, higher dimensional balls and spheres, solid torus, and torus. Homology cannot distinguish all objects that are topologically different. For example,the 1-dimensional circle, the 2-dimensional surface in <xref ref-type="fig" rid="F1">Figure 1B</xref>, and the 3-dimensional solid torus (<xref ref-type="fig" rid="F1">Figure 1C</xref>) all have the same homology. For more on the mathematical definition of homology (please see Munkres, <xref ref-type="bibr" rid="B91">1984</xref>; Hatcher, <xref ref-type="bibr" rid="B64">2002</xref>; Ghrist, <xref ref-type="bibr" rid="B54">2014</xref>).</p>
<p>We will illustrate with an elementary example how persistent homology can detect shape at multiple scales by noting the birth and death of topological features. Our dataset will consist of 5 points from a circle as shown in <xref ref-type="fig" rid="F2">Figure 2</xref>. To detect the circle, we need to connect these points in some manner. For example, we could connect all points whose distance is less than some fixed &#x003F5;. If one can visualize the data set, then the choice of &#x003F5; may be clear. But more often, there is no obvious choice, so instead we analyze the data at multiple scales using persistent homology. The first box in <xref ref-type="fig" rid="F2">Figure 2</xref> shows the five data points. At this stage, we have five components, one for each data point (&#x003B2;<sub>0</sub> &#x0003D; 5). These components are represented by the five red lines in the top part of this figure. These five red lines along with the blue segment is called the barcode for the data set. The barcode keeps track of the number of components (red bars) and number of 1-dimensional holes (blue bar) as the threshold for connecting data points increases. We can visualize the increasing threshold (or proximity parameter) by growing balls around each data point and connecting pairs of points as soon as their respective balls intersect. Thus, in the second box, an edge joins the two closest points, reducing the number of connected components by one. Thus, one bar ends (dies), and only 4 bars (&#x003B2;<sub>0</sub> &#x0003D; 4) continue past this threshold. Observe that every time an edge joins two components, a bar dies (and &#x003B2;<sub>0</sub> reduces by one). In the timepoint just before 1.5 (box labeled 5), two edges are added. One connects two components, but the third forms a triangle with two previously created edges. These three edges surround a small hole, but we fill in this hole (shaded in pink) as we only want to detect large holes. We are forming a Rips complex where whenever a triangle is formed, it is immediately filled in and thus triangles do not contribute to &#x003B2;<sub>1</sub>. In the timepoint after 1.5 (box labeled 6), a cycle containing four edges is formed. This is indicated in the barcode by the start (birth) of the blue bar. As more edges are added, eventually this region is divided into two triangles and the blue bar dies at timepoint close to 2 (corresponding to box labeled 7). Note we have one infinitely long bar (top red bar with arrow) since after time 1.5 we have one connected component.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>A barcode captures topological features in a dataset at multiple scales. The topology of a dataset at a fixed scale is determined by joining pairs of data points with an edge if the distances between the pair of points is less than the fixed scale. If three edges form a triangle, then the triangle is filled in. This process is shown in the seven boxes as the scale for joining vertices increases from box 1 to box 7. The corresponding barcode is shown at the top of the figure. The persistence of a feature over multiple scales determines the length of the bar corresponding to that feature. The number of components (&#x003B2;<sub>0</sub>) that exist at a particular scale is represented by the number of red bars that exists at the corresponding Rips diameter. The creation of the 1-dimensional cycle in box 6 is represented by the birth of the blue bar. The blue bar dies when this cycle is filled in with triangles (box 7). This figure was created by modifying the output of the R package TDAstat (Wadhwa et al., <xref ref-type="bibr" rid="B127">2018</xref>) and latex code written by Catalina Betancourt.</p></caption>
<graphic xlink:href="frai-04-659037-g0002.tif"/>
</fig>
<p>To summarize, this example of a TDA pipeline consists of taking a dataset, creating a sequence of Rips complexes, and outputting a barcode (Edelsbrunner et al., <xref ref-type="bibr" rid="B44">2002</xref>; Carlsson et al., <xref ref-type="bibr" rid="B26">2005</xref>; Zomorodian and Carlsson, <xref ref-type="bibr" rid="B138">2005</xref>). A Rips complex is a generalization of a graph. While in our example we only looked at adding edges and triangles, we can also add higher dimensional simplices. A <italic>n</italic>-simplex in a Rips complex is a collection of <italic>n</italic> &#x0002B; 1 points where each pair of points is connected by an edge. Thus an edge is a 1-simplex, a triangle is a 2-simplex, and a tetrahedron is a 3-simplex. In our circle example, when all pairs of the 5 points are connected by edges, we add a 4-simplex even though the data set lives in 2-dimensions. The existence of an <italic>n</italic>-simplex means that (all pairs of) <italic>n</italic> &#x0002B; 1 points are close together according to a given threshold. The Rips complex is also called a clique complex, the latter term coming from graph theory where a clique is a graph where every pair of vertices is connected. Thus, our simplices correspond to clique subgraphs. Other names for Rips complex include Vietoris-Rips complex and flag complex.</p>
<p>There are other ways to form a simplicial complex from data. For the Rips complex, an <italic>n</italic>-simplex is formed at threshold <italic>r</italic> when all pairs of <italic>n</italic> &#x0002B; 1 points are of distance less than <italic>r</italic> (so that each pair of points is connected by an edge). This is equivalent to requiring every pair of balls of radius <italic>r</italic> centered around the <italic>n</italic> &#x0002B; 1 points to intersect. If we require the intersection of all these balls to be nonempty in order to form an <italic>n</italic>-simplex, we instead form the &#x0010C;ech complex. Thus, to form a 2-simplex (triangle), the Rips complex only requires non-empty pairwise intersection of three balls while the &#x0010C;ech complex requires the intersection of all three balls to be nonempty. Thus, the &#x0010C;ech complex is similar to the Rips complex, but an <italic>n</italic>-simplex is formed at a slightly larger threshold in the &#x0010C;ech complex. Under certain conditions, the &#x0010C;ech complex is guaranteed to have the same homology as the union of all balls of radius <italic>r</italic> centered around data points (Hatcher, <xref ref-type="bibr" rid="B64">2002</xref>). But the Rips complex has much smaller computer memory requirements as only the edges need to be stored to determine the Rips complex, and thus the Rips complex is normally used when calculating persistent homology. A very different TDA technique called Mapper uses a completely different method to create a simplicial complex from data (Singh et al., <xref ref-type="bibr" rid="B113">2007</xref>). For Mapper, each vertex represents a cluster of data points. If <italic>n</italic> &#x0002B; 1 of these clusters have a common intersection, then an <italic>n</italic>-simplex is formed. Mapper can be used to reduce the size of a data set and to visualize it.</p>
<p>The example in <xref ref-type="fig" rid="F2">Figure 2</xref> focused on &#x003B2;<sub>0</sub> and &#x003B2;<sub>1</sub>. For data that lives in a higher dimensional space, we can similarly calculate &#x003B2;<sub><italic>n</italic></sub> = the number of n-dimensional holes. For example, &#x003B2;<sub>2</sub> &#x0003D; 1 for both the sphere and torus as these are 2-dimensional surfaces that bound voids in space. For more details regarding persistent homology and barcodes (please see Ghrist, <xref ref-type="bibr" rid="B53">2008</xref>; Carlsson, <xref ref-type="bibr" rid="B25">2009</xref>; Edelsbrunner and Harer, <xref ref-type="bibr" rid="B43">2010</xref>; Otter et al., <xref ref-type="bibr" rid="B98">2017</xref>).</p>
<p>In order to use persistent homology in machine learning, we need a distance between barcodes. We first convert barcodes to persistence diagrams as described in the next section and use these diagrams to define a distance between barcodes. In this section, we show how persistent homology is stable with respect to noise: small perturbations in the data have only a small effect on the barcode (Cohen-Steiner et al., <xref ref-type="bibr" rid="B35">2007</xref>). In section 2.2, we discuss the advantages/disadvantages of persistent homology with regard to how it handles noise, incomplete data, and computational complexity. In section 2.3, we discuss one method (persistent images) of converting a persistence diagram into a vector that can be used in machine learning. We also give references to many other methods for using persistent homology in machine learning.</p>
<p>While we have discussed the basic method for converting Euclidean data into barcodes, there are a number of other methods for obtaining barcodes from data. All one needs is a method to determine when to add an edge between pairs of data points. Thus, the data do not need to live in Euclidean space. We also assumed that small holes correspond to noise, but there are applications where the point of using persistent homology is to detect small holes (Bendich et al., <xref ref-type="bibr" rid="B15">2016</xref>). We also had only one infinite bar corresponding to the one connected component we obtained when all our data points were connected by edges. If one is working with Euclidean data, eventually all holes will be filed in and thus eventually a Rips complex with only one component and no holes will be formed. But in other applications, holes may persist forever, resulting in infinite bars. One can also obtain additional information by looking at the group structure of the filtered homology groups, and prove stability properties using interleaving distance (Bauer and Lesnick, <xref ref-type="bibr" rid="B14">2014</xref>; Bubenik and Scott, <xref ref-type="bibr" rid="B22">2014</xref>; Oudot, <xref ref-type="bibr" rid="B99">2015</xref>; Chazal et al., <xref ref-type="bibr" rid="B29">2016</xref>).</p>
<sec>
<title>2.1. Persistence Diagrams and Stability</title>
<p>While barcodes are useful for visualizing changes in homology, barcodes are generally converted into persistence diagrams for statistical and machine learning analysis (Edelsbrunner et al., <xref ref-type="bibr" rid="B44">2002</xref>; Mileyko et al., <xref ref-type="bibr" rid="B88">2011</xref>). The start of a bar represents the birth of a cycle while the end represents its death. The plot of the points (birth time, death time) in 2-dimensional space is called the persistent diagram (PD). The persistent diagram corresponding to the barcode in <xref ref-type="fig" rid="F2">Figure 2</xref> is shown in <xref ref-type="fig" rid="F3">Figure 3</xref>. A persistence diagram also includes the diagonal as shown in this figure as the diagonal is used when computing distances between PDs. A PD can be a multiset if multiple bars have the same birth time <italic>b</italic> and death time <italic>d</italic>, so that the point (<italic>b, d</italic>) occurs multiple times in the PD.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>A barcode can be converted into a persistent diagram. Each bar with finite length in a barcode is represented by a point in the persistent diagram. If a bar is born at time <italic>b</italic> and dies at time <italic>d</italic>, then the bar is represented by the point (<italic>b, d</italic>). In <xref ref-type="fig" rid="F2">Figure 2</xref>, there are four finite red bars plus one infinite red bar. These bars are all born at time 0. In the persistent diagram, the four finite red bars are represented by the four red points all of which have <italic>b</italic> &#x0003D; 0. The one blue bar in <xref ref-type="fig" rid="F2">Figure 2</xref> is represented by the blue triangle in this persistent diagram. This figure was created using the R package TDAstat (Wadhwa et al., <xref ref-type="bibr" rid="B127">2018</xref>).</p></caption>
<graphic xlink:href="frai-04-659037-g0003.tif"/>
</fig>
<p>The formula for the bottleneck distance for a fixed &#x003B2;<sub><italic>i</italic></sub> between two persistence diagrams, <italic>P</italic><sub>1</sub> and <italic>P</italic><sub>2</sub>, is <inline-formula><mml:math id="M8"><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>B</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>:</mml:mo><mml:mo>=</mml:mo><mml:munder class="msub"><mml:mrow><mml:mo class="qopname">inf</mml:mo></mml:mrow><mml:mrow><mml:mi>&#x003B3;</mml:mi><mml:mo>:</mml:mo><mml:msub><mml:mrow><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x02192;</mml:mo><mml:msub><mml:mrow><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:munder><mml:mtext>&#x000A0;</mml:mtext><mml:munder class="msub"><mml:mrow><mml:mo class="qopname">sup</mml:mo></mml:mrow><mml:mrow><mml:mi>x</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msub><mml:mrow><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:munder><mml:mo>&#x02225;</mml:mo><mml:mi>x</mml:mi><mml:mo>-</mml:mo><mml:mi>&#x003B3;</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:msub><mml:mrow><mml:mo>&#x02225;</mml:mo></mml:mrow><mml:mrow><mml:mi>&#x0221E;</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. To compute this distance we first create a matching &#x003B3; between these diagrams for the fixed &#x003B2;<sub><italic>i</italic></sub> as shown in <xref ref-type="fig" rid="F4">Figure 4</xref>. In this figure the blue triangles represent features with the fixed &#x003B2;<sub><italic>i</italic></sub> from one data set while the purple stars represent features from a different data set for the same &#x003B2;<sub><italic>i</italic></sub>. A matching &#x003B3;:<italic>P</italic><sub>1</sub> &#x02192; <italic>P</italic><sub>2</sub> is a bijective function from <italic>P</italic><sub>1</sub> to <italic>P</italic><sub>2</sub> where both persistence diagrams include the diagonal. Features that are close to the diagonal get matched to the diagonal unless they are closer to another feature that does not have a better matching than to the diagonal. If <italic>x</italic> &#x0003D; (<italic>b, d</italic>) &#x02208; <italic>P</italic><sub>1</sub> is matched to the point (&#x003B2;, &#x003B4;), then the distance between these features is &#x02016;<italic>x</italic> &#x02212; &#x003B3;(<italic>x</italic>)&#x02016;<sub>&#x0221E;</sub> &#x0003D; max(|<italic>b</italic> &#x02212; &#x003B2;|, |<italic>d</italic> &#x02212; &#x003B4;|). To find the distance for a particular matching &#x003B3;, we calculate <inline-formula><mml:math id="M9"><mml:msub><mml:mrow><mml:mtext>sup</mml:mtext></mml:mrow><mml:mrow><mml:mi>x</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msub><mml:mrow><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mo>&#x02225;</mml:mo><mml:mi>x</mml:mi><mml:mo>-</mml:mo><mml:mi>&#x003B3;</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:msub><mml:mrow><mml:mo>&#x02225;</mml:mo></mml:mrow><mml:mrow><mml:mi>&#x0221E;</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo></mml:math></inline-formula> the largest distance between a point <italic>x</italic> in <italic>P</italic><sub>1</sub> and its match &#x003B3;(<italic>x</italic>) in <italic>P</italic><sub>2</sub>. The bottleneck distance is obtained by taking the infimum of this distance over all possible matchings. In <xref ref-type="fig" rid="F4">Figure 4</xref>, red dotted lines indicate best matches between features from <italic>P</italic><sub>1</sub> and <italic>P</italic><sub>2</sub>.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>Two persistence diagram, <italic>P</italic><sub>1</sub> and <italic>P</italic><sub>2</sub>, are shown for a single dimension (for example, &#x003B2;<sub>1</sub>). The blue triangles correspond to <italic>P</italic><sub>1</sub> while purple stars are used for <italic>P</italic><sub>2</sub>. Both persistence diagrams include the diagonal. A matching between <italic>P</italic><sub>1</sub> and <italic>P</italic><sub>2</sub> is shown where the red dotted lines indicate features that have been matched where some of the features are matched to the diagonal. The length of the thicker dark red dotted lines indicate the distance between matched features. The distance between a feature and the diagonal is the persistence of the feature, <italic>d</italic> &#x02212; <italic>b</italic>, where <italic>b</italic> = birth time and <italic>d</italic> = death time of that feature. If feature (<italic>b, d</italic>) is matched with feature (&#x003B2;, &#x003B4;), then the distance between these features is max(|<italic>b</italic> &#x02212; &#x003B2;|, |<italic>d</italic> &#x02212; &#x003B4;|). Since the best matching is shown, <italic>d</italic><sub><italic>B</italic></sub>(<italic>P</italic><sub>1</sub>, <italic>P</italic><sub>2</sub>) equals the length of the longest of the thick dark red dotted lines. Any other matching would have a matched pair of features with larger distance.</p></caption>
<graphic xlink:href="frai-04-659037-g0004.tif"/>
</fig>
<p>If <italic>P</italic><sub>1</sub> is the PD for the data set <italic>X</italic> and <italic>P</italic><sub>2</sub> is the PD for the data set <italic>Y</italic>, the stability theorem states that <italic>d</italic><sub><italic>B</italic></sub>(<italic>P</italic><sub>1</sub>, <italic>P</italic><sub>2</sub>) &#x02264; <italic>d</italic><sub><italic>H</italic></sub>(<italic>X, Y</italic>) &#x0003D; inf{&#x003B5; &#x02265; 0; <italic>X</italic> &#x02286; <italic>Y</italic><sub>&#x003B5;</sub> and <italic>Y</italic> &#x02286; <italic>X</italic><sub>&#x003B5;</sub>} where <inline-formula><mml:math id="M12"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>&#x003B5;</mml:mi></mml:msub><mml:mo>:</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mo>=</mml:mo><mml:mstyle displaystyle='true'><mml:munder><mml:mo>&#x0222A;</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>X</mml:mi></mml:mrow></mml:munder><mml:mrow><mml:mo>&#x0007B;</mml:mo><mml:mi>z</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>M</mml:mi><mml:mo>;</mml:mo><mml:mi>d</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>z</mml:mi><mml:mo>,</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x02264;</mml:mo><mml:mi>&#x003B5;</mml:mi><mml:mo>&#x0007D;</mml:mo></mml:mrow></mml:mstyle></mml:mrow></mml:math></inline-formula> (Cohen-Steiner et al., <xref ref-type="bibr" rid="B35">2007</xref>). In other words, if each data point is perturbed by at most a distance &#x003F5;, then the persistence of a feature will change by at most 2&#x003F5; since the birth and death times can change by at most &#x003F5;. Features with persistence &#x0003C;2&#x003F5; may disappear, while new features with persistence less than 2&#x003F5; may be created.</p>
</sec>
<sec>
<title>2.2. Benefits and Limitations of Persistent Homology</title>
<p>That persistent homology is stable with respect to noise is, of course, a major advantage. But any method that uses Euclidean distance is affected by the curse of dimensionality due to the effect of noise on distance. For example, suppose a data point should be at the origin, but due to noise, each coordinate is perturbed by 0.01 units, then the point which should be at the origin is now <inline-formula><mml:math id="M10"><mml:msubsup><mml:mrow><mml:mo>&#x003A3;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msubsup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>.</mml:mo><mml:mn>0</mml:mn><mml:msup><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula> units away from the origin if the data lives in &#x0211D;<sup><italic>n</italic></sup>. Thus, for example if <italic>n</italic> &#x0003D; 10, 000, then the data point is perturbed by a distance <inline-formula><mml:math id="M11"><mml:msubsup><mml:mrow><mml:mo>&#x003A3;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>10</mml:mn><mml:mo>,</mml:mo><mml:mn>000</mml:mn></mml:mrow></mml:msubsup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>.</mml:mo><mml:mn>0</mml:mn><mml:msup><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo></mml:math></inline-formula> 1. While the change in persistent homology is bounded by the distance between the original data set and the perturbed data set, the latter can be quite large, depending on the amount of noise and the dimension of the dataset. Thus, performing PCA or t-SNE or other dimension reduction technique first may lead to stronger results.</p>
<p>In order to recover the shape of an object, one must have sufficient coverage. Some holes detected by persistent homology may be due to incomplete data. If these are small, then they only result in short bars which may be considered noise. But in high dimensional spaces, one has many degrees of freedom, so even recovering the shape of simple objects in high dimensions can be impossible as obtaining a sufficient number of data points may not be feasible. However, differences between data sets may still be detected even if coverage is lacking. For example, one may have insufficient coverage to recover the topology of a torus if one uniformly under-samples data points from a torus. However, the resulting barcode will likely be very different than the barcode obtained from uniformly under-sampling points from a sphere. Also, coverage can be less of an issue if you have some information regarding the shape of the data such as periodicity (for example, Dequeant et al., <xref ref-type="bibr" rid="B38">2008</xref>). Thus, in practice, topological data analysis has proven to be quite robust. For more on complexity and topological inference (see Weinberger, <xref ref-type="bibr" rid="B129">2014</xref>).</p>
<p>Due to computational complexity, most analysis using TDA restricts to the use of &#x003B2;<sub><italic>i</italic></sub> for <italic>i</italic> &#x02264; 4. Often only &#x003B2;<sub>0</sub> and &#x003B2;<sub>1</sub> are used, but faster algorithms such as Ripser (Bauer, <xref ref-type="bibr" rid="B13">2019</xref>) are becoming available. To calculate persistent homology of a point cloud, one first needs to create simplicial complexes. The number of simplices grows rapidly with the number of data points as well as the homology dimension (not the dimension of the data set, but the dimension of the holes one wishes to detect&#x02014;in order to calculate &#x003B2;<sub><italic>i</italic></sub>, one needs <italic>i</italic>-dimensional and <italic>i</italic> &#x0002B; 1 dimensional simplices). The TDA pipeline also requires the computation of distances between data points. The dimension in which the data lives can affect this step, but after distances are calculated, it is the shape of the data that can have the largest effect, sometimes even larger than the number of data points as there are several algorithms that can greatly simplify the simplicial complex (Zomorodian, <xref ref-type="bibr" rid="B137">2010</xref>; Mischaikow and Nanda, <xref ref-type="bibr" rid="B89">2013</xref>; Wilkerson et al., <xref ref-type="bibr" rid="B130">2014</xref>; Boissonnat and Pritam, <xref ref-type="bibr" rid="B18">2020</xref>). The effectiveness of these simplification algorithms depends on both the topology and geometry of the data set. For example, suppose one takes <italic>n</italic> data points equally spaced on a straight line. The topology of the line is the same as the topology of a point. Thus, to calculate the homology of the line, one can remove all simplices except for a single vertex. For more on computational complexity of persistent homology (see Otter et al., <xref ref-type="bibr" rid="B98">2017</xref>).</p>
<p>If all the data points enter at time 0, the &#x003B2;<sub>0</sub> bars all start at time 0. Thus the barcode for &#x003B2;<sub>0</sub> can be created from a single linkage hierarchical clustering dendrogram as the merge heights of the dendrogram become the lengths of the &#x003B2;<sub>0</sub> bars. Hence the &#x003B2;<sub>0</sub> barcode contains less information than a single linkage hierarchical clustering dendrogram. However, there are applications where the data points enter at different times such as time series data. Thus, the &#x003B2;<sub>0</sub> barcode can be applied to a wider variety of applications than standard clustering techniques. Clustering also cannot capture holes and voids; the higher dimensional barcodes capture structure that other methods such as clustering miss.</p>
</sec>
<sec>
<title>2.3. Persistent Homology and Machine Learning</title>
<p>The barcode can be used as a topological signature to identify structure in data. While homology is built to detect topology and not geometry, persistent homology can be implemented in a variety of ways to distinguish geometrical shapes (e.g., Turner et al., <xref ref-type="bibr" rid="B124">2014</xref>; Li et al., <xref ref-type="bibr" rid="B81">2018</xref>; Bubenik et al., <xref ref-type="bibr" rid="B21">2020</xref>). Machine learning can be applied to a collection of persistent diagrams to distinguish between data sets with different structures. Many machine learning algorithms take a vector as input. There are many ways to create a vector from persistent homology. A pipeline to create a vector using persistence images (Adams et al., <xref ref-type="bibr" rid="B3">2017</xref>) is illustrated in <xref ref-type="fig" rid="F5">Figure 5</xref>. A persistent diagram is first rotated by 45&#x000B0; so that the diagonal becomes the horizontal axis (2nd panel of <xref ref-type="fig" rid="F5">Figure 5</xref>). Thus the horizontal axis represents the birth time, while the vertical axis represents persistence = death - birth. A heat map is then created using a Gaussian distribution (or other weight function) about each point (3rd panel). The height of the Gaussian distribution is indicated with color in the heat map and is dependent on the persistence of the feature. Points closest to the diagonal are considered to be the result of noise and are thus given no intensity. Hence the bottom of the heat map will always have the color corresponding to zero intensity, in this case blue. In other words, points close to the diagonal have no effect on the heat map. Observe that the point furthest from the diagonal in the first panel corresponds to a feature with the largest persistence per second panel. Thus, in the heat map in the 3rd panel, the color at this point is given the highest intensity (yellow). As shown in the fourth panel, the heat map is discretized by partitioning the heat map into <italic>n</italic> &#x000D7; <italic>n</italic> squares where the color of each square corresponds to the average value of the corresponding square in the heat map. In the discretized heat map (4th panel), the yellowish region from the 3rd panel corresponding to the most persistent feature is partitioned between two squares with the yellow square in the top row of this heat map containing a larger portion than the pinkish square next to it in the same row. In the final panel, an <italic>n</italic><sup>2</sup>-dimensional vector is created by concatenating the rows of the discretized heat map.</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p>Pipeline for vectorizing a persistent diagram using persistent images. This figure is a modification of <xref ref-type="fig" rid="F1">Figure 1</xref> from Adams et al. (<xref ref-type="bibr" rid="B3">2017</xref>) which is licensed under <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY 4.0</ext-link>.</p></caption>
<graphic xlink:href="frai-04-659037-g0005.tif"/>
</fig>
<p>Other methods for using persistent homology in machine learning include persistent landscapes (Bubenik, <xref ref-type="bibr" rid="B19">2015</xref>, <xref ref-type="bibr" rid="B20">2020</xref>), persistent curves (Chung and Lawson, <xref ref-type="bibr" rid="B33">2019</xref>), and kernel functions (for example, Reininghaus et al., <xref ref-type="bibr" rid="B107">2015</xref>; Kusano et al., <xref ref-type="bibr" rid="B77">2016</xref>; Carri&#x000E8;re et al., <xref ref-type="bibr" rid="B27">2017</xref>; Chazal et al., <xref ref-type="bibr" rid="B30">2017</xref>).</p>
</sec>
</sec>
<sec id="s3">
<title>3. Treatment Responses and Prognosis</title>
<p>What impedes the success of cancer therapies is often the coexistence of therapy resistant cells along with therapy sensitive tumor cell populations. When administered separately, all currently adopted therapeutic strategies&#x02014;ranging from cytotoxic chemotherapies to molecular targeted therapies&#x02014;impose a dramatic, yet homogeneous selective pressure on an often heterogeneous group of tumor cells. Despite varying resistance mechanisms contingent upon therapy-type and tumor composition, every therapeutic intervention inevitably selects for resistant cells, which expand and become the dominant cell type of recurrent tumors, that cease to respond to therapy (Maley and Reid, <xref ref-type="bibr" rid="B85">2005</xref>; Aparicio and Caldas, <xref ref-type="bibr" rid="B10">2013</xref>; Bukkuri, <xref ref-type="bibr" rid="B23">2020</xref>). The increased resolution on the clonal architecture of intermixed tumor cell populations that has just now become available calls for prognostic and therapeutic benefits. High intra-tumor diversity in pre-malignant lesions has been shown to predict progression to malignant growths and poor outcome (Maley et al., <xref ref-type="bibr" rid="B84">2006</xref>; Laurie et al., <xref ref-type="bibr" rid="B78">2012</xref>). The therapeutic significance of intratumoral heterogeneity (ITH) is exemplified in a recent study that measured genetic and transcriptional diversity of breast cancer tumors before and after therapy based on four genetic markers and two transcriptional markers. The study provided proof-of-principle that therapy-induced phenotypic changes can be predicted based on the characterization of coexisting tumor subpopulations (Almendro et al., <xref ref-type="bibr" rid="B6">2014</xref>). Another recent study used RNA interference to model heterogeneous tumors and tested the efficacy of predicted drug combinations in eliminating coexisting tumor subpopulations (Zhao et al., <xref ref-type="bibr" rid="B135">2014</xref>). Their findings suggest that the most effective drug combination for a given tumor cannot be achieved by targeting the predominant subpopulation alone, but requires detailed characterization of the genetic makeup of branched subpopulations and their contribution to the tumor bulk.</p>
<p>Techniques from computational homology have been used to develop a new algorithm to characterize comparative genomic hybridization (CGH) profiles and identify the frequency of cancer recurrence in early stage breast cancer patients through identification of recurrent copy number aberrations (CNAs) in cancer (DeWoskin et al., <xref ref-type="bibr" rid="B39">2010</xref>), which serve as markers of genomic instability and thus cancer prognosis (Hanahan and Weinberg, <xref ref-type="bibr" rid="B63">2000</xref>; Han et al., <xref ref-type="bibr" rid="B62">2006</xref>). Specifically, the method uses a sliding window algorithm to associate a set of point clouds to each array CGH. Different window sizes allow one to analyze the data at various scales by considering different dimensional point clouds. Then, persistent homology is applied to these point clouds for classification. It was found, in accordance with prior results (Climent et al., <xref ref-type="bibr" rid="B34">2007</xref>), that the Betti numbers of the zero dimensional homology groups (&#x003B2;<sub>0</sub>) can distinguish between recurrent and non-recurrent groups in patients who did not receive anthracycline-based chemotherapy after surgery but not in patients who were treated with anthracycline. Note that, in this approach, no segmentation of the data was required.</p>
<p>In another study, a novel statistic called the smooth Euler characteristic transform (SECT), which allows shape information to be integrated into traditional statistical models, was developed and applied to predict disease free survival in glioblastoma multiforme (GBM) based on tumor shape from post-contrast T1 axial magnetic resonance imaging (MRI) (Crawford et al., <xref ref-type="bibr" rid="B36">2020</xref>). SECT is a variation of the persistent homology transform (PHT) introduced in Turner et al. (<xref ref-type="bibr" rid="B124">2014</xref>) that was created to overcome the difficulties in integration with traditional statistical models. Specifically, the output of SECT is a collection of smooth vectors, while the output of PHT is a collection of persistence diagrams (Edelsbrunner et al., <xref ref-type="bibr" rid="B44">2002</xref>), thus having a complicated representation and geometry which does not lend itself easily into integration with statistical models. In the GBM application, the statistical model used was a Bayesian linear mixed model (BLMM) (Ishwaran and Rao, <xref ref-type="bibr" rid="B71">2005</xref>; Guan and Stephens, <xref ref-type="bibr" rid="B61">2011</xref>; Zhou et al., <xref ref-type="bibr" rid="B136">2013</xref>). When this topological approach was applied to the GBM MRI data, it was found to outperform gene expression, volumetric, and morphological summaries in predicting disease free survival.</p>
<p>Clinically, there is a great importance in the identification of biomarkers which can serve as predictors for metastasis and patient prognosis in cancer. To this end, researchers have recently used persistent homology techniques, in an exploratory data analysis fashion, to identify biologically meaningful geometric properties of single cell data (Lockwood and Krishnamoorthy, <xref ref-type="bibr" rid="B82">2015</xref>). In this method, data was first transposed and analyzed in its dual space with each gene represented in a much lower dimensional sample space, thus circumventing the problem of high dimensionality that is typical of single cell data. A small set of genes (120&#x02013;200) were then selected as landmarks (De Silva and Carlsson, <xref ref-type="bibr" rid="B37">2004</xref>) and a family of nested simplicial complexes was constructed, indexed by a proximity parameter. Unlike many other methods which focus on the analysis of zero dimensional homology groups (DeWoskin et al., <xref ref-type="bibr" rid="B39">2010</xref>; Nicolau et al., <xref ref-type="bibr" rid="B93">2011</xref>), thus performing analyses which are topologically equivalent to clustering, this study focused their efforts on identifying loops of one dimensional homology groups which persist over a large range of values of the proximity parameter, hypothesizing that connections around holes imply nontrivial interactions among genes and biological functions which could have implications for tumorigenesis. Repeating this process for various landmarks, features which remain stable over large ranges of both the proximity parameter and number of landmarks could be detected. Applying these techniques to five different cancer data sets from brain, breast, ovarian, and acute myeloid leukemia cancers, many members of the significant loops in the one dimensional homology groups that were found have been previously shown to be accurate biomarkers for cancer biogenesis, while others serve as potential new markers which have yet to be experimentally validated.</p>
</sec>
<sec id="s4">
<title>4. Tumor Segmentation and Computer-Aided Diagnosis</title>
<p>Computerized methods can efficiently and effectively identify quantitative image features that are otherwise difficult to spot by manual inspection (Yu et al., <xref ref-type="bibr" rid="B132">2016</xref>). Quantitative morphological features extracted from H&#x00026;E stained slides, such as Zernike shape features, have been shown to predict survival in lung adeno- and squamous cell carcinoma (Yu et al., <xref ref-type="bibr" rid="B132">2016</xref>). Recent advances in next-generation sequencing technologies gave rise to a plethora of approaches that quantify and characterize the genotypic diversity within a given tumor. Evidence supporting a quantitative relation between genotypic and morphological ITH followed. A quantitative image analysis approach that complements genomic profiling with geographical information was developed (Yuan et al., <xref ref-type="bibr" rid="B133">2012</xref>; Andor et al., <xref ref-type="bibr" rid="B8">2016</xref>). Furthermore, the authors characterized cellular heterogeneity by distinguishing between well-defined cell-populations (stromal cells, lymphocytes, cancer cells). However, so far qualitative details of how this diversity in morphology is structured (i.e., how many subpopulations are present and what their geographical boundaries are on the H&#x00026;E slide) are unknown.</p>
<p>As a step toward a computer-aided cancer diagnosis system, persistent homology has been used to develop an automated tumor segmentation approach for Hematoxylin &#x00026; Eosin (H&#x00026;E) stained colorectal cancer histology whole slide images (WSI) (Qaiser et al., <xref ref-type="bibr" rid="B103">2016</xref>). The authors exploit the fact that nuclei in tumor regions have atypical characteristics such as non-uniform chromatin texture, irregularity in shape and size, and clustering of nuclei, and use persistent homology profiles to characterize the degree of connectivity among nuclei and to classify cancerous regions based on this information. Specifically, once a WSI has been obtained, it is first divided into patches, each of which has a persistent homology profile. Given two patches, the symmetrized Kullback-Leibler divergence (KLD) can be computed between the respective persistent homology profiles, which serves as a metric for interpatch distance. Then an input patch is classified as cancerous or non-cancerous by a kNN classifier, based on KLD distances between its persistent homology patch and those of each representative patches. These exemplar patches are chosen by training a CNN and selecting patches whose activation during training is large (separately for cancerous and non-cancerous classes). The benefit of this approach over previous approaches is that only the subset of highly activated patches from the convolutional layers are used as exemplars rather than the set of all patches in the training data. This method was compared against standard CNN and HyMaP (Khan et al., <xref ref-type="bibr" rid="B73">2013</xref>) approaches on 74 H&#x00026;E stained WSIs of colorectal cancers; in addition to being computationally less expensive than the other two methods, it was also shown to have better precision and segmentation accuracy.</p>
<p>Another example of tumor segmentation and algorithmic diagnosis is a recent study which aimed to segment a diseased area of skin and classify the type of skin lesion into one of seven classes in a given dermatoscopic image (Tschandl et al., <xref ref-type="bibr" rid="B123">2018</xref>) using persistent homology (Chung et al., <xref ref-type="bibr" rid="B32">2018</xref>). Like the colorectal image segmentation study (Qaiser et al., <xref ref-type="bibr" rid="B103">2016</xref>), the segmentation algorithm used is a concept similar to persistent homology (Edelsbrunner et al., <xref ref-type="bibr" rid="B44">2002</xref>). Linear support vector machines (SVMs) were used for classification on the persistence statistics (Chung et al., <xref ref-type="bibr" rid="B32">2018</xref>) and persistence curves (Chung and Lawson, <xref ref-type="bibr" rid="B33">2019</xref>) were derived from persistence diagrams. Specifically, given an image, a segmentation algorithm was first implemented to obtain an image mask: a binary image in which each pixel is colored either white (if it part of the healthy skin) or black (if it is part of a lesion). Once the mask was applied to the original image, the RGB color space is transformed into an RGB, HSV, or XYZ color space and each channel was extracted. Persistent homology software was then used to compute persistence diagrams for each channel; from each diagram, persistence statistics and curves were computed as features. Finally, a multi-class SVM was used to classify the input into one of the seven types of skin lesions. When this approach was applied to a validation set of 5,000 images, the highest resulting accuracy scores were 65.6, 66, and 67.2%.</p>
<p>Similar persistent homology techniques were used to classify H&#x00026;E stained stage T3 and stage T4 colorectal adenocarcinomas images as benign or malignant (Chittajallu et al., <xref ref-type="bibr" rid="B31">2018</xref>). To do this, given an image, it was first color normalized (Reinhard et al., <xref ref-type="bibr" rid="B106">2001</xref>) and the nuclear stain and minimum cross entropy thresholding (Li and Tam, <xref ref-type="bibr" rid="B80">1998</xref>) for nuclear foreground segmentation were extracted using an unsupervised color deconvolution method (Macenko et al., <xref ref-type="bibr" rid="B83">2009</xref>). Then, a fast difference-of-Gaussian implementation of the scale-adaptive Laplacian-of-Gaussian filter of Al-Kofahi et al. (<xref ref-type="bibr" rid="B5">2010</xref>) was performed to detect nuclei centroids. Then, by considering the set of nuclei centroids as a point cloud, the persistence diagram of its Vietoris-Rips filtration for the one dimensional homology groups (loops) was computed using a fast multiscale approach (Doyle et al., <xref ref-type="bibr" rid="B42">2008</xref>). Then, persistence landscape (Bubenik, <xref ref-type="bibr" rid="B19">2015</xref>) and image (Adams et al., <xref ref-type="bibr" rid="B3">2017</xref>) representations were computed and used as features to characterize loops formed by glandular epithelial cell nuclei. Then given training images with benign/malignant labels, a random forest classifier was trained using these topological features. PCA was used to reduce the dimensionality of each feature group so as to preserve 99% of the variance. Hyperparameter optimization was also performed via cross-validation using a tree-structured parzen estimator (Bergstra et al., <xref ref-type="bibr" rid="B16">2011</xref>). When this method was applied to testing data consisting of 80 images, an accuracy of 85%, AUC of 0.85, precision of 78%, and recall of 95% was obtained, an improvement over the traditional cell graph property approach in all areas (Doyle et al., <xref ref-type="bibr" rid="B42">2008</xref>).</p>
</sec>
<sec id="s5">
<title>5. Disease Classification</title>
<p>Cancers of unknown primary represent 3&#x02013;5% of all cancer cases, whereby physicians find one or multiple metastases but fail to locate the primary tumor. Pathologic evaluation of a metastatic biopsy often does not provide a definitive answer. Molecular data ranging from gene expression to somatic mutations have been shown to significantly aid classification of metastatic biopsies to their corresponding primary tumor site (Ferracin et al., <xref ref-type="bibr" rid="B50">2011</xref>; Marquard et al., <xref ref-type="bibr" rid="B86">2015</xref>; Vikes&#x000E5; et al., <xref ref-type="bibr" rid="B126">2015</xref>; Moran et al., <xref ref-type="bibr" rid="B90">2016</xref>; S&#x000F8;ndergaard et al., <xref ref-type="bibr" rid="B115">2017</xref>).</p>
<p>One study used persistent homology on 150 non-contrast-enhanced fat-suppressed 3D T1-weighted magnetic resonance (MR) images to classify hepatic tumors into three classes: hepatocellular carcinomas (HCC), metastatic tumors (MT), and hepatic hemangiomas (HH) (Oyama et al., <xref ref-type="bibr" rid="B100">2019</xref>). To do this, for each image, a 3D region of interest (ROI) in the shape of a rectangular solid enclosing the entire lesion was created by an experienced radiologist. Then, gray-scale values of the voxels in each ROI were normalized and persistence diagrams were created for dimensions 0, 1, and 2 using HomCloud (Kimura et al., <xref ref-type="bibr" rid="B75">2018</xref>; Obayashi and Hiraoka, <xref ref-type="bibr" rid="B97">2018</xref>). These diagrams were vectorized into persistence images (Adams et al., <xref ref-type="bibr" rid="B2">2015</xref>). Feature vectors were then obtained from these images and inputted into logistic regression with an elastic net penalty and extreme gradient boosting machine learning models for classification. The results from classification showed that dimension 1 persistence images had the highest accuracy rates: 85% for classifying HCC and MT, 84% for HCC and HH, and 74% for HH and MT.</p>
<p>An alternative method to accurately classify tumor subtypes is through the use of high throughput genomics (Nutt et al., <xref ref-type="bibr" rid="B96">2003</xref>; Freije et al., <xref ref-type="bibr" rid="B51">2004</xref>). Aiming to produce more robust algorithms than traditional classification methods, given gene expression profile data, researchers used statistical invariants and persistent homology to identify core patient groups associated with the classical, mesenchymal, and proneural subtypes of GBM and a compact set of genes most useful for this partitioning (Seemann et al., <xref ref-type="bibr" rid="B111">2012</xref>). To do this, a sufficient, but compact, panel of genes to be used for clustering was predetermined using non-dimensionalized standard deviation (to ensure bimodality of gene expression distribution across patient samples; Phillips et al., <xref ref-type="bibr" rid="B102">2006</xref>; Verhaak et al., <xref ref-type="bibr" rid="B125">2010</xref>) and persistent homology (to find groups of genes whose expression levels change coherently among patient samples; Carlsson, <xref ref-type="bibr" rid="B25">2009</xref>; Horak et al., <xref ref-type="bibr" rid="B68">2009</xref>). Then, a hierarchical partitioning of patient samples based on gene expression levels is performed using persistent homology; specifically, samples are repeatedly bisected until further partitioning is not possible, thus obtaining the number of clusters that exists and some notion of genetic proximity of the clusters. Each bisection was implemented using 30 genes. A predictive model was then implemented to assign cancer subtypes to each cluster. Applying this approach to the 20 GBM test samples, fifteen predictions were in accordance with results from standard clustering calculations (Verhaak et al., <xref ref-type="bibr" rid="B125">2010</xref>), five of which were unassigned by both algorithms. Of the remaining five samples, four were classified as &#x0201C;neural&#x0201D; by the clustering algorithm, but were unassigned by this approach since the neural group was not found in a single cluster.</p>
<p>Another example of the use of persistent and computational homology on gene expression data is in Arsuaga et al. (<xref ref-type="bibr" rid="B11">2012</xref>), whereby, upon application to a breast cancer gene expression dataset, the algorithm was able to distinguish among most breast cancer subtypes. This paper extended the work of DeWoskin et al. (<xref ref-type="bibr" rid="B39">2010</xref>) to gene expression data, under the assumption that gene expression is a measure of the underlying copy number changes (Neve et al., <xref ref-type="bibr" rid="B92">2006</xref>; Horlings et al., <xref ref-type="bibr" rid="B69">2010</xref>). Before applying the sliding window algorithm developed in DeWoskin et al. (<xref ref-type="bibr" rid="B39">2010</xref>) to gene expression data, theoretical work was done to show that under idealized conditions, the point cloud defined by the algorithm is a good representation of the original data. Hence, analysis of the point cloud is applicable to the original data set. This was done using Taken&#x00027;s embedding theorem, an extension of Whitney&#x00027;s embedding theorem to dynamical systems theory, and a circularization technique. To apply the sliding window algorithm to gene expression data, instead of pre-selecting differentially expressed genes like traditional clustering algorithms, all genes were ordered by their location in the genomes. Then, the sliding window algorithm was applied to generate point clouds, upon which topological and statistical analysis was performed. It was shown that when only &#x003B2;<sub>0</sub> was used, the algorithm could distinguish between less aggressive subtypes, like normal and luminal-A, and more aggressive ones, such as luminal B, basal-like, and Her2. It was also noted that the algorithm could not distinguish luminal B from Her2 and basal-like, implying the close similarities among these subtypes. Thus, it was noted that breast cancer subtypes can not only be classified by specific sets of genes, but also by certain global relationships among all genes.</p>
</sec>
<sec id="s6">
<title>6. Cellular Architecture</title>
<p>Imaging is an essential part of cancer clinical protocols, providing physicians with morphological, structural, and metabolic information about patient tumors, thereby assisting in clinical decision making and treatment planning (Fass, <xref ref-type="bibr" rid="B49">2008</xref>). The development of new image segmentation tools (Zhang et al., <xref ref-type="bibr" rid="B134">2001</xref>; Hong and Brady, <xref ref-type="bibr" rid="B67">2003</xref>; Xiaohua et al., <xref ref-type="bibr" rid="B131">2004</xref>) and quantitative multiplex immunofluorescence (Stack et al., <xref ref-type="bibr" rid="B116">2014</xref>; Dimitriou et al., <xref ref-type="bibr" rid="B41">2019</xref>; Abousamra et al., <xref ref-type="bibr" rid="B1">2020</xref>) have set the stage for topological data analysis and persistent homology techniques to be harnessed for interpretation of high-dimensional information in histopathological imaging data.</p>
<p>One example of this is using persistent homology techniques to investigate architectural characteristics of cellular organization and nuclear arrangements from microarray tissue samples to distinguish among genetically derived breast cancer subtypes (Basal, Luminal A, Luminal B, and HER2; Singh et al., <xref ref-type="bibr" rid="B114">2014</xref>). This was done through distinct topological characterizations such as nuclear connectivity (generators of zero dimensional homology groups) and loops (generators of one dimensional homology groups) based on Vietoris-Rips filtration of nuclei centers (Mischaikow and Nanda, <xref ref-type="bibr" rid="B89">2013</xref>). When its performance was compared to a standard distance weighted discrimination classifier (Marron and Todd, <xref ref-type="bibr" rid="B87">2007</xref>), nearly a four times improvement in classification accuracy was noted. Furthermore, for certain combinations of feature weightings, it was shown that topological features provide complementary information to patch based image appearance features. By using such topological features, they solve/address two main challenges in obtaining accurate cellular architectural characterization: the heterogeneity of spatial arrangements, both among patients and within single tumor samples, and differences in stain intensity which require manually determined phenotypic thresholds (Engers, <xref ref-type="bibr" rid="B46">2007</xref>; Truesdale et al., <xref ref-type="bibr" rid="B120">2011</xref>; Goodman et al., <xref ref-type="bibr" rid="B58">2012</xref>; Helpap et al., <xref ref-type="bibr" rid="B65">2012</xref>; Truong et al., <xref ref-type="bibr" rid="B121">2013</xref>; Epstein et al., <xref ref-type="bibr" rid="B47">2016</xref>; Evans et al., <xref ref-type="bibr" rid="B48">2016</xref>). This improves performance over existing standard classifiers, which are more sensitive to noise, cannot model stain concentration variations, and have issues with larger cell arrangements (Aukerman et al., <xref ref-type="bibr" rid="B12">2020</xref>).</p>
<p>In another paper, researchers used TDA to cluster prostate cancer histology into architectural groups consistent with the continuum of Gleason patterns, the most widely accepted system for evaluating prostate cancer architecture (Humphrey, <xref ref-type="bibr" rid="B70">2004</xref>; Lawson et al., <xref ref-type="bibr" rid="B79">2019</xref>). Persistent homology was used to compute persistence intensity diagrams (of zero and one dimensional components) of purely graded prostate cancer histopathology images of Gleason patterns 3&#x02013;5. This revealed key insights into characteristics such as nuclei density, glandular shape, and inter-glandular arrangement. Furthermore, persistent homology was able to cluster these images into architectural groups through a rank descending persistence vector&#x02013;the six resulting clusters provided a stable architectural continuum from well differentiated to poorly differentiated adenocarcinoma at an even finer level than the standard Gleason scale.</p>
<p>Persistent homology has also been used to characterize the spatial arrangement of immune and epithelial (tumor) cells within the breast cancer immune microenvironment from quantitative multiplex immunofluorescence (qmIF) imaging (Aukerman et al., <xref ref-type="bibr" rid="B12">2020</xref>). Stain intensities and spatial coordinates of individual cells were collected from qmIF through nuclear segmentation, cytoplasmic definition, and stain quantification. In order to incorporate these stain intensities, instead of directly using a Rips or Cech filtration on the point cloud data (Chazal et al., <xref ref-type="bibr" rid="B28">2009</xref>), a discretization process was first implemented to convert the point cloud data with stain intensity values into an image. Then, persistence diagrams were created from these images by using the opposite of the pixel stain intensity as the filter function. These diagrams were assessed as potential biomarkers of cancer subtype and prognostic biomarkers of overall survival using kernel mean embeddings (Gretton et al., <xref ref-type="bibr" rid="B59">2012</xref>) with the sliced Wasserstein kernel (Carri&#x000E8;re et al., <xref ref-type="bibr" rid="B27">2017</xref>) and were shown to outperform the standard nearest neighbor analysis with a standard Gaussian kernel. Furthermore, a correlation analysis using constrained covariance (Herbrich et al., <xref ref-type="bibr" rid="B66">2005</xref>) showed that the correlation between nearest neighbor and persistence diagrams were always &#x0003C;0.1, implying the features are nearly statistically independent and thus complementary.</p>
</sec>
<sec sec-type="discussion" id="s7">
<title>7. Discussion</title>
<p>As we have seen in this paper, TDA has proven to be a powerful tool, yielding critical insights in the treatment prognosis, tumor segmentation and diagnosis, disease classification, and cellular architecture of cancer. But despite the many recent successes of TDA in the field of oncology, it is still a nascent field with much fruitful work yet to be done. Experimentally, to biologically validate the TDA methodology and results, it would be worth performing thorough studies to assess whether geometric and topological connectivity implies functional connectivity. Computationally, one area which deserves further exploration is the use of TDA to analyze time-series data (Ravishanker and Chen, <xref ref-type="bibr" rid="B105">2019</xref>) in cancer. This has been done extensively in several other fields including climate analysis (Berwald et al., <xref ref-type="bibr" rid="B17">2014</xref>), tracking stability of dynamical systems (Khasawneh and Munch, <xref ref-type="bibr" rid="B74">2016</xref>), clustering populations of Tribolium flour beetles (Pereira and de Mello, <xref ref-type="bibr" rid="B101">2015</xref>), analyzing motion sensor data during sports activities (Stolz et al., <xref ref-type="bibr" rid="B117">2017</xref>), and financial time series data (Gidea, <xref ref-type="bibr" rid="B55">2017</xref>; Truong, <xref ref-type="bibr" rid="B122">2017</xref>; Gidea and Katz, <xref ref-type="bibr" rid="B57">2018</xref>; Gidea et al., <xref ref-type="bibr" rid="B56">2020</xref>). Though time series oncological data have been analyzed with varying degrees of success (Aoto et al., <xref ref-type="bibr" rid="B9">2018</xref>; Kourou et al., <xref ref-type="bibr" rid="B76">2020</xref>), TDA techniques of any sort have yet to be applied. Applying persistent homology techniques to time series microarray, cell anatomy imaging, or gene/pathway expression data, for example, may further help in disease classification, identifying intra-tumoral selection events, and contribute to a greater understanding of tumorigenesis. Another possible avenue of research is to investigate the process of angiogenesis, an inherently geometric and spatially dependent process, using persistent homology techniques. Specifically, we anticipate that TDA will help us understand the changes that occur in tumor vasculature morphology during cancer progression and under treatments. More importantly, we hope that connections between cancer vessel network and treatment prognosis can be found, such as by testing vessel normalization theory (Jain, <xref ref-type="bibr" rid="B72">2005</xref>). In addition to the ideas presented above, it is worth noting that research into the use of TDA in oncology is sparse and, as such, there is much important and clinically relevant work to be done in simply applying well-understood persistent homology algorithms to broader classes of cancer data sets (note that most TDA analyses have been concentrated in just melanoma, brain, breast, and colorectal cancers) and in performing longitudinal studies across several cancer types.</p>
</sec>
<sec id="s8">
<title>Author Contributions</title>
<p>AB conceptualized the project and wrote the sections 3&#x02013;7. AB and ID wrote the section 1. ID wrote the section 2. NA wrote the sections 1, 3, 4, and 5. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack><p>The authors would like to thank Ethan Rooke and Hind Benmerabet for their insightful comments on a draft of this manuscript.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abousamra</surname> <given-names>S.</given-names></name> <name><surname>Fassler</surname> <given-names>D.</given-names></name> <name><surname>Hou</surname> <given-names>L.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Gupta</surname> <given-names>R.</given-names></name> <name><surname>Kurc</surname> <given-names>T.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>&#x0201C;Weakly-supervised deep stain decomposition for multiplex IHC images,&#x0201D;</article-title> in <source>2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI)</source>, <fpage>481</fpage>&#x02013;<lpage>485</lpage>. <pub-id pub-id-type="doi">10.1109/ISBI45749.2020.9098652</pub-id></citation></ref>
<ref id="B2">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Adams</surname> <given-names>H.</given-names></name> <name><surname>Chepushtanova</surname> <given-names>S.</given-names></name> <name><surname>Emerson</surname> <given-names>T.</given-names></name> <name><surname>Hanson</surname> <given-names>E.</given-names></name> <name><surname>Kirby</surname> <given-names>M.</given-names></name> <name><surname>Motta</surname> <given-names>F.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Persistence images: a stable vector representation of persistent homology</article-title>. <source>J. Mach. Learn. Res</source>. <volume>18</volume>, <fpage>1</fpage>&#x02013;<lpage>35</lpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://jmlr.org/papers/v18/16-337.html">http://jmlr.org/papers/v18/16-337.html</ext-link></citation></ref>
<ref id="B3">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Adams</surname> <given-names>H.</given-names></name> <name><surname>Emerson</surname> <given-names>T.</given-names></name> <name><surname>Kirby</surname> <given-names>M.</given-names></name> <name><surname>Neville</surname> <given-names>R.</given-names></name> <name><surname>Peterson</surname> <given-names>C.</given-names></name> <name><surname>Shipman</surname> <given-names>P.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Persistence images: a stable vector representation of persistent homology</article-title>. <source>J. Mach. Learn. Res</source>. <volume>18</volume>, <fpage>1</fpage>&#x02013;<lpage>35</lpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://jmlr.org/papers/v18/16-337.html">http://jmlr.org/papers/v18/16-337.html</ext-link></citation></ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aiello</surname> <given-names>M.</given-names></name> <name><surname>Cavaliere</surname> <given-names>C.</given-names></name> <name><surname>D&#x00027;Albore</surname> <given-names>A.</given-names></name> <name><surname>Salvatore</surname> <given-names>M.</given-names></name></person-group> (<year>2019</year>). <article-title>The challenges of diagnostic imaging in the era of big data</article-title>. <source>J. Clin. Med</source>. <volume>8</volume>:<fpage>316</fpage>. <pub-id pub-id-type="doi">10.3390/jcm8030316</pub-id><pub-id pub-id-type="pmid">30845692</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Al-Kofahi</surname> <given-names>Y.</given-names></name> <name><surname>Lassoued</surname> <given-names>W.</given-names></name> <name><surname>Lee</surname> <given-names>W.</given-names></name> <name><surname>Roysam</surname> <given-names>B.</given-names></name></person-group> (<year>2010</year>). <article-title>Improved automatic detection and segmentation of cell nuclei in histopathology images</article-title>. <source>IEEE Trans. Bio-Med. Eng</source>. <volume>57</volume>, <fpage>841</fpage>&#x02013;<lpage>852</lpage>. <pub-id pub-id-type="doi">10.1109/TBME.2009.2035102</pub-id><pub-id pub-id-type="pmid">19884070</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Almendro</surname> <given-names>V.</given-names></name> <name><surname>Cheng</surname> <given-names>Y. K.</given-names></name> <name><surname>Randles</surname> <given-names>A.</given-names></name> <name><surname>Itzkovitz</surname> <given-names>S.</given-names></name> <name><surname>Marusyk</surname> <given-names>A.</given-names></name> <name><surname>Ametller</surname> <given-names>E.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Inference of tumor evolution during chemotherapy by computational modeling and in situ analysis of genetic and phenotypic cellular diversity</article-title>. <source>Cell Rep</source>. <volume>6</volume>, <fpage>514</fpage>&#x02013;<lpage>527</lpage>. <pub-id pub-id-type="doi">10.1016/j.celrep.2013.12.041</pub-id><pub-id pub-id-type="pmid">24462293</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alyass</surname> <given-names>A.</given-names></name> <name><surname>Turcotte</surname> <given-names>M.</given-names></name> <name><surname>Meyre</surname> <given-names>D.</given-names></name></person-group> (<year>2015</year>). <article-title>From big data analysis to personalized medicine for all: challenges and opportunities</article-title>. <source>BMC Med. Genomics</source> <volume>8</volume>:<fpage>33</fpage>. <pub-id pub-id-type="doi">10.1186/s12920-015-0108-y</pub-id><pub-id pub-id-type="pmid">26112054</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Andor</surname> <given-names>N.</given-names></name> <name><surname>Graham</surname> <given-names>T. A.</given-names></name> <name><surname>Jansen</surname> <given-names>M.</given-names></name> <name><surname>Xia</surname> <given-names>L. C.</given-names></name> <name><surname>Aktipis</surname> <given-names>C. A.</given-names></name> <name><surname>Petritsch</surname> <given-names>C.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Pan-cancer analysis of the extent and consequences of intratumor heterogeneity</article-title>. <source>Nat. Med</source>. <volume>22</volume>, <fpage>105</fpage>&#x02013;<lpage>113</lpage>. <pub-id pub-id-type="doi">10.1038/nm.3984</pub-id><pub-id pub-id-type="pmid">26618723</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aoto</surname> <given-names>Y.</given-names></name> <name><surname>Okumura</surname> <given-names>K.</given-names></name> <name><surname>Hachiya</surname> <given-names>T.</given-names></name> <name><surname>Hase</surname> <given-names>S.</given-names></name> <name><surname>Wakabayashi</surname> <given-names>Y.</given-names></name> <name><surname>Ishikawa</surname> <given-names>F.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Time-series analysis of tumorigenesis in a murine skin carcinogenesis model</article-title>. <source>Sci. Rep</source>. <volume>8</volume>:<fpage>12994</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-018-31349-x</pub-id><pub-id pub-id-type="pmid">30158594</pub-id></citation></ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aparicio</surname> <given-names>S.</given-names></name> <name><surname>Caldas</surname> <given-names>C.</given-names></name></person-group> (<year>2013</year>). <article-title>The implications of clonal genome evolution for cancer medicine</article-title>. <source>N. Engl. J. Med</source>. <volume>368</volume>, <fpage>842</fpage>&#x02013;<lpage>851</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMra1204892</pub-id><pub-id pub-id-type="pmid">23445095</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arsuaga</surname> <given-names>J.</given-names></name> <name><surname>Baas</surname> <given-names>N. A.</given-names></name> <name><surname>Daniel DeWoskin Mizuno</surname> <given-names>H.</given-names></name> <name><surname>Pankov</surname> <given-names>A.</given-names></name> <name><surname>Park</surname> <given-names>C.</given-names></name></person-group> (<year>2012</year>). <article-title>Topological analysis of gene expression arrays identifies high risk molecular subtypes in breast cancer</article-title>. <source>Applicable Algebra in Engineering, Communications and Comput</source>. <volume>23</volume>, <fpage>3</fpage>&#x02013;<lpage>15</lpage>. <pub-id pub-id-type="doi">10.1007/s00200-012-0166-8</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Aukerman</surname> <given-names>A.</given-names></name> <name><surname>Carri&#x000E9;re</surname> <given-names>M.</given-names></name> <name><surname>Chen</surname> <given-names>C.</given-names></name> <name><surname>Gardner</surname> <given-names>K.</given-names></name> <name><surname>Rabad&#x000E1;n</surname> <given-names>R.</given-names></name> <name><surname>Vanguri</surname> <given-names>R.</given-names></name></person-group> (<year>2020</year>). <article-title>&#x0201C;Persistent homology based characterization of the breast cancer immune microenvironment: a feasibility study,&#x0201D;</article-title> in <source>36th International Symposium on Computational Geometry, Vol. 11</source> (<publisher-loc>Dagstuhl</publisher-loc>), <fpage>1</fpage>&#x02013;<lpage>11</lpage>.</citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bauer</surname> <given-names>U.</given-names></name></person-group> (<year>2019</year>). <article-title>Ripser: efficient computation of Vietoris-Rips persistence barcodes</article-title>. <source>arXiv: 1908.02518v1</source>.</citation></ref>
<ref id="B14">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Bauer</surname> <given-names>U.</given-names></name> <name><surname>Lesnick</surname> <given-names>M.</given-names></name></person-group> (<year>2014</year>). <article-title>&#x0201C;Induced matchings of barcodes and the algebraic stability of persistence,&#x0201D;</article-title> in <source>Computational Geometry (SoCG&#x00027;14)</source> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>ACM</publisher-name>), <fpage>355</fpage>&#x02013;<lpage>364</lpage>. <pub-id pub-id-type="doi">10.1145/2582112.2582168</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bendich</surname> <given-names>P.</given-names></name> <name><surname>Marron</surname> <given-names>J. S.</given-names></name> <name><surname>Miller</surname> <given-names>E.</given-names></name> <name><surname>Pieloch</surname> <given-names>A.</given-names></name> <name><surname>Skwerer</surname> <given-names>S.</given-names></name></person-group> (<year>2016</year>). <article-title>Persistent homology analysis of brain artery trees</article-title>. <source>Ann. Appl. Stat</source>. <volume>10</volume>, <fpage>198</fpage>&#x02013;<lpage>218</lpage>. <pub-id pub-id-type="doi">10.1214/15-AOAS886</pub-id><pub-id pub-id-type="pmid">27642379</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Bergstra</surname> <given-names>J.</given-names></name> <name><surname>Bardenet</surname> <given-names>R.</given-names></name> <name><surname>Bengio</surname> <given-names>Y.</given-names></name> <name><surname>K&#x000E9;gl</surname> <given-names>B.</given-names></name></person-group> (<year>2011</year>). <article-title>Algorithms for hyper-parameter optimization</article-title>. <source>Adv. Neural Inform. Process. Syst</source>. <volume>24</volume>, <fpage>1</fpage>&#x02013;<lpage>9</lpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="https://proceedings.neurips.cc/paper/2011/file/86e8f7ab32cfd12577bc2619bc635690-Paper.pdf">https://proceedings.neurips.cc/paper/2011/file/86e8f7ab32cfd12577bc2619bc635690-Paper.pdf</ext-link></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Berwald</surname> <given-names>J. J.</given-names></name> <name><surname>Gidea</surname> <given-names>M.</given-names></name> <name><surname>Vejdemo-Johansson</surname> <given-names>M.</given-names></name></person-group> (<year>2014</year>). <article-title>Automatic recognition and tagging of topologically different regimes in dynamical systems</article-title>. <source>Discont. Nonlin. Complex</source>. <volume>3</volume>, <fpage>413</fpage>&#x02013;<lpage>426</lpage>. <pub-id pub-id-type="doi">10.5890/DNC.2014.12.004</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Boissonnat</surname> <given-names>J.-D.</given-names></name> <name><surname>Pritam</surname> <given-names>S.</given-names></name></person-group> (<year>2020</year>). <article-title>&#x0201C;Edge collapse and persistence of flag complexes,&#x0201D;</article-title> in <source>36th International Symposium on Computational Geometry (SoCG 2020), Vol. 164 of Leibniz International Proceedings in Informatics (LIPIcs)</source>, eds S. Cabello and D. Z. Chen (<publisher-loc>Dagstuhl</publisher-loc>: <publisher-name>Schloss Dagstuhl-Leibniz-Zentrum f&#x000FC;r Informatik</publisher-name>), <volume>19</volume>:<fpage>1</fpage>&#x02013;<lpage>19:15</lpage>.</citation></ref>
<ref id="B19">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Bubenik</surname> <given-names>P.</given-names></name></person-group> (<year>2015</year>). <article-title>Statistical topological data analysis using persistence landscapes</article-title>. <source>J. Mach. Learn. Res</source>. <volume>16</volume>, <fpage>77</fpage>&#x02013;<lpage>102</lpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://jmlr.org/papers/v16/bubenik15a.html">http://jmlr.org/papers/v16/bubenik15a.html</ext-link></citation></ref>
<ref id="B20">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Bubenik</surname> <given-names>P.</given-names></name></person-group> (<year>2020</year>). <article-title>&#x0201C;The persistence landscape and some of its properties,&#x0201D;</article-title> in <source>Topological Data Analysis</source>, eds N. Baas, G. Carlsson, G. Quick, M. Szymik, M. Thaule (<publisher-loc>Geiranger</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>97</fpage>&#x02013;<lpage>117</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-030-43408-3_4</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bubenik</surname> <given-names>P.</given-names></name> <name><surname>Hull</surname> <given-names>M.</given-names></name> <name><surname>Patel</surname> <given-names>D.</given-names></name> <name><surname>Whittle</surname> <given-names>B.</given-names></name></person-group> (<year>2020</year>). <article-title>Persistent homology detects curvature</article-title>. <source>Inverse Probl</source>. <volume>36</volume>:<fpage>025008</fpage>. <pub-id pub-id-type="doi">10.1088/1361-6420/ab4ac0</pub-id></citation></ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bubenik</surname> <given-names>P.</given-names></name> <name><surname>Scott</surname> <given-names>J. A.</given-names></name></person-group> (<year>2014</year>). <article-title>Categorification of persistent homology</article-title>. <source>Discrete Comput. Geom</source>. <volume>51</volume>, <fpage>600</fpage>&#x02013;<lpage>627</lpage>. <pub-id pub-id-type="doi">10.1007/s00454-014-9573-x</pub-id></citation></ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bukkuri</surname> <given-names>A.</given-names></name></person-group> (<year>2020</year>). <article-title>Optimal control analysis of combined chemotherapy-immunotherapy treatment regimens in a PKPD cancer evolution model</article-title>. <source>Biomath</source> <volume>9</volume>, <fpage>1</fpage>&#x02013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.11145/j.biomath.2020.02.137</pub-id></citation></ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Camara</surname> <given-names>P. G.</given-names></name> <name><surname>Rosenbloom</surname> <given-names>D. I.</given-names></name> <name><surname>Emmett</surname> <given-names>K. J.</given-names></name> <name><surname>Levine</surname> <given-names>A. J.</given-names></name> <name><surname>Rabadan</surname> <given-names>R.</given-names></name></person-group> (<year>2016</year>). <article-title>Topological data analysis generates high-resolution, genome-wide maps of human recombination</article-title>. <source>Cell Syst</source>. <volume>3</volume>, <fpage>83</fpage>&#x02013;<lpage>94</lpage>. <pub-id pub-id-type="doi">10.1016/j.cels.2016.05.008</pub-id><pub-id pub-id-type="pmid">27345159</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carlsson</surname> <given-names>G.</given-names></name></person-group> (<year>2009</year>). <article-title>Topology and data</article-title>. <source>Bull. Am. Math. Soc</source>. <volume>46</volume>, <fpage>255</fpage>&#x02013;<lpage>308</lpage>. <pub-id pub-id-type="doi">10.1090/S0273-0979-09-01249-X</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carlsson</surname> <given-names>G.</given-names></name> <name><surname>Zomorodian</surname> <given-names>A.</given-names></name> <name><surname>Collins</surname> <given-names>A.</given-names></name> <name><surname>Guibas</surname> <given-names>L. J.</given-names></name></person-group> (<year>2005</year>). <article-title>Persistence barcodes for shapes</article-title>. <source>Int. J. Shape Model</source>. <volume>11</volume>, <fpage>149</fpage>&#x02013;<lpage>187</lpage>. <pub-id pub-id-type="doi">10.1142/S0218654305000761</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Carri&#x000E9;re</surname> <given-names>M.</given-names></name> <name><surname>Cuturi</surname> <given-names>M.</given-names></name> <name><surname>Oudot</surname> <given-names>S.</given-names></name></person-group> (<year>2017</year>). <article-title>&#x0201C;Sliced Wasserstein kernel for persistence diagrams,&#x0201D;</article-title> in <source>Proceedings of Machine Learning Research</source> (<publisher-loc>Sydney, NSW</publisher-loc>).</citation></ref>
<ref id="B28">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chazal</surname> <given-names>F.</given-names></name> <name><surname>Cohen-Steiner</surname> <given-names>D.</given-names></name> <name><surname>Glisse</surname> <given-names>M.</given-names></name> <name><surname>Guibas</surname> <given-names>L.</given-names></name> <name><surname>Oudot</surname> <given-names>S.</given-names></name></person-group> (<year>2009</year>). <article-title>&#x0201C;Proximity of persistence modules and their diagrams,&#x0201D;</article-title> in <source>Proceedings of the Twenty-Fifth Annual Symposium on Computational Geometry</source> (<publisher-loc>Aarhus</publisher-loc>: <publisher-name>ACM</publisher-name>), <fpage>237</fpage>&#x02013;<lpage>246</lpage>. <pub-id pub-id-type="doi">10.1145/1542362.1542407</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chazal</surname> <given-names>F.</given-names></name> <name><surname>de Silva</surname> <given-names>V.</given-names></name> <name><surname>Glisse</surname> <given-names>M.</given-names></name> <name><surname>Oudot</surname> <given-names>S.</given-names></name></person-group> (<year>2016</year>). <source>The Structure and Stability of Persistence Modules</source>. SpringerBriefs in Mathematics. <publisher-loc>Cham</publisher-loc>: <publisher-name>Springer</publisher-name>. <pub-id pub-id-type="doi">10.1007/978-3-319-42545-0_2</pub-id></citation></ref>
<ref id="B30">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Chazal</surname> <given-names>F.</given-names></name> <name><surname>Fasy</surname> <given-names>B.</given-names></name> <name><surname>Lecci</surname> <given-names>F.</given-names></name> <name><surname>Michel</surname> <given-names>B.</given-names></name> <name><surname>Rinaldo</surname> <given-names>A.</given-names></name> <name><surname>Wasserman</surname> <given-names>L.</given-names></name></person-group> (<year>2017</year>). <article-title>Robust topological inference: distance to a measure and kernel distance</article-title>. <source>J. Mach. Learn. Res</source>. <volume>18</volume>:<fpage>40</fpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://jmlr.org/papers/v18/15-484.html">http://jmlr.org/papers/v18/15-484.html</ext-link></citation></ref>
<ref id="B31">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chittajallu</surname> <given-names>D. R.</given-names></name> <name><surname>Siekierski</surname> <given-names>N.</given-names></name> <name><surname>Lee</surname> <given-names>S.</given-names></name> <name><surname>Gerber</surname> <given-names>S.</given-names></name> <name><surname>Beezley</surname> <given-names>J.</given-names></name> <name><surname>Manthey</surname> <given-names>D.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>&#x0201C;Vectorized persistent homology representations for characterizing glandular architecture in histology images,&#x0201D;</article-title> in <source>2018 IEEE 15th International Symposium on Biomedical Imaging</source> (<publisher-loc>Washington, DC</publisher-loc>). <pub-id pub-id-type="doi">10.1109/ISBI.2018.8363562</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chung</surname> <given-names>Y.-M.</given-names></name> <name><surname>Hu</surname> <given-names>C.-S.</given-names></name> <name><surname>Lawson</surname> <given-names>A.</given-names></name> <name><surname>Smyth</surname> <given-names>C.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;Topological approaches to skin disease image analysis,&#x0201D;</article-title> in <source>IEEE International Conference on Big Data (Big Data)</source> (<publisher-loc>Seattle, WA</publisher-loc>), <fpage>100</fpage>&#x02013;<lpage>105</lpage>. <pub-id pub-id-type="doi">10.1109/BigData.2018.8622175</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chung</surname> <given-names>Y.-M.</given-names></name> <name><surname>Lawson</surname> <given-names>A.</given-names></name></person-group> (<year>2019</year>). <article-title>Persistence curves: a canonical framework for summarizing persistence diagrams</article-title>. <source>arXiv: 1904.07768</source>.</citation></ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Climent</surname> <given-names>J.</given-names></name> <name><surname>Dimitrow</surname> <given-names>P.</given-names></name> <name><surname>Fridlyand</surname> <given-names>J.</given-names></name> <name><surname>Palacios</surname> <given-names>J.</given-names></name> <name><surname>Siebert</surname> <given-names>R.</given-names></name> <name><surname>Albertson</surname> <given-names>D. G.</given-names></name> <etal/></person-group>. (<year>2007</year>). <article-title>Deletion of chromosome 11q predicts response to anthracycline-based chemotherapy in early breast cancer</article-title>. <source>Cancer Res</source>. <volume>67</volume>, <fpage>818</fpage>&#x02013;<lpage>826</lpage>. <pub-id pub-id-type="doi">10.1158/0008-5472.CAN-06-3307</pub-id></citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cohen-Steiner</surname> <given-names>D.</given-names></name> <name><surname>Edelsbrunner</surname> <given-names>H.</given-names></name> <name><surname>Harer</surname> <given-names>J.</given-names></name></person-group> (<year>2007</year>). <article-title>Stability of persistence diagrams</article-title>. <source>Discrete Comput. Geom</source>. <volume>37</volume>, <fpage>103</fpage>&#x02013;<lpage>120</lpage>. <pub-id pub-id-type="doi">10.1007/s00454-006-1276-5</pub-id></citation></ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Crawford</surname> <given-names>L.</given-names></name> <name><surname>Monod</surname> <given-names>A.</given-names></name> <name><surname>Chen</surname> <given-names>A. X.</given-names></name> <name><surname>Mukherjee</surname> <given-names>S.</given-names></name> <name><surname>Rabad&#x000E1;n</surname> <given-names>R.</given-names></name></person-group> (<year>2020</year>). <article-title>Predicting clinical outcomes in glioblastoma: an application of topological and functional data analysis</article-title>. <source>J. Am. Stat. Assoc</source>. <volume>115</volume>, <fpage>1139</fpage>&#x02013;<lpage>1150</lpage>. <pub-id pub-id-type="doi">10.1080/01621459.2019.1671198</pub-id></citation></ref>
<ref id="B37">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>De Silva</surname> <given-names>V.</given-names></name> <name><surname>Carlsson</surname> <given-names>G.</given-names></name></person-group> (<year>2004</year>). <article-title>&#x0201C;Topological estimation using witness complexes,&#x0201D;</article-title> in <source>Eurographics Symposium on Point-Based Graphics</source> (<publisher-loc>Zurich</publisher-loc>), <fpage>157</fpage>&#x02013;<lpage>166</lpage>. <pub-id pub-id-type="doi">10.2312/SPBG/SPBG04/157-166</pub-id></citation></ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dequeant</surname> <given-names>M.-L.</given-names></name> <name><surname>Ahnert</surname> <given-names>S.</given-names></name> <name><surname>Edelsbrunner</surname> <given-names>H.</given-names></name> <name><surname>Fink</surname> <given-names>T. M.</given-names></name> <name><surname>Glynn</surname> <given-names>E. F.</given-names></name> <name><surname>Hattem</surname> <given-names>G.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>Comparison of pattern detection methods in microarray time series of the segmentation clock</article-title>. <source>PLoS ONE</source> <volume>3</volume>:<fpage>e2856</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0002856</pub-id><pub-id pub-id-type="pmid">18682743</pub-id></citation></ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>DeWoskin</surname> <given-names>D.</given-names></name> <name><surname>Climent</surname> <given-names>J.</given-names></name> <name><surname>Cruz-White</surname> <given-names>I.</given-names></name> <name><surname>Vazquez</surname> <given-names>M.</given-names></name> <name><surname>Park</surname> <given-names>C.</given-names></name> <name><surname>Arsuaga</surname> <given-names>J.</given-names></name></person-group> (<year>2010</year>). <article-title>Applications of computational homology to the analysis of treatment response in breast cancer patients</article-title>. <source>Topol. Appl</source>. <volume>157</volume>, <fpage>157</fpage>&#x02013;<lpage>164</lpage>. <pub-id pub-id-type="doi">10.1016/j.topol.2009.04.036</pub-id></citation></ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dilsizian</surname> <given-names>S. E.</given-names></name> <name><surname>Siegel</surname> <given-names>E. L.</given-names></name></person-group> (<year>2014</year>). <article-title>Artificial intelligence in medicine and cardiac imaging: harnessing big data and advanced computing to provide personalized medical diagnosis and treatment</article-title>. <source>Curr. Cardiol. Rep</source>. <volume>16</volume>:<fpage>441</fpage>. <pub-id pub-id-type="doi">10.1007/s11886-013-0441-8</pub-id><pub-id pub-id-type="pmid">24338557</pub-id></citation></ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dimitriou</surname> <given-names>N.</given-names></name> <name><surname>Arandjelovi&#x00107;</surname> <given-names>O.</given-names></name> <name><surname>Caie</surname> <given-names>P. D.</given-names></name></person-group> (<year>2019</year>). <article-title>Deep learning for whole slide image analysis: an overview</article-title>. <source>Front. Med</source>. <volume>6</volume>:<fpage>264</fpage>. <pub-id pub-id-type="doi">10.3389/fmed.2019.00264</pub-id><pub-id pub-id-type="pmid">32974358</pub-id></citation></ref>
<ref id="B42">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Doyle</surname> <given-names>S.</given-names></name> <name><surname>Agner</surname> <given-names>S.</given-names></name> <name><surname>Madabhushi</surname> <given-names>A.</given-names></name> <name><surname>Feldman</surname> <given-names>M.</given-names></name> <name><surname>Tomaszewski</surname> <given-names>J.</given-names></name></person-group> (<year>2008</year>). <article-title>&#x0201C;Automated grading of breast cancer histopathology using spectral clustering with textural and architectural image features,&#x0201D;</article-title> in <source>5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro</source> (<publisher-loc>Paris</publisher-loc>), <fpage>496</fpage>&#x02013;<lpage>499</lpage>. <pub-id pub-id-type="doi">10.1109/ISBI.2008.4541041</pub-id></citation></ref>
<ref id="B43">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Edelsbrunner</surname> <given-names>H.</given-names></name> <name><surname>Harer</surname> <given-names>J.</given-names></name></person-group> (<year>2010</year>). <source>Computational Topology: An Introduction</source>. <publisher-loc>Providence, RI</publisher-loc>: <publisher-name>American Mathematical Society</publisher-name>. <pub-id pub-id-type="doi">10.1090/mbk/069</pub-id></citation></ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Edelsbrunner</surname> <given-names>H.</given-names></name> <name><surname>Letscher</surname> <given-names>D.</given-names></name> <name><surname>Zomorodian</surname> <given-names>A.</given-names></name></person-group> (<year>2002</year>). <article-title>Topological persistence and simplification</article-title>. <source>Discr. Comput. Geom</source>. <volume>28</volume>, <fpage>511</fpage>&#x02013;<lpage>533</lpage>. <pub-id pub-id-type="doi">10.1007/s00454-002-2885-2</pub-id></citation></ref>
<ref id="B45">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Emmett</surname> <given-names>K.</given-names></name> <name><surname>Schweinhart</surname> <given-names>B.</given-names></name> <name><surname>Rabadan</surname> <given-names>R.</given-names></name></person-group> (<year>2016</year>). <article-title>&#x0201C;Multiscale topology of chromatin folding,&#x0201D;</article-title> in <source>Proceedings of the 9th EAI Conference on Bio-inspired Information and Communications Technologies</source> (<publisher-loc>New York, NY</publisher-loc>), <fpage>177</fpage>&#x02013;<lpage>180</lpage>. <pub-id pub-id-type="doi">10.4108/eai.3-12-2015.2262453</pub-id></citation></ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Engers</surname> <given-names>R.</given-names></name></person-group> (<year>2007</year>). <article-title>Reproducibility and reliability of tumor grading in urological neoplasms</article-title>. <source>World J. Urol</source>. <volume>25</volume>, <fpage>595</fpage>&#x02013;<lpage>605</lpage>. <pub-id pub-id-type="doi">10.1007/s00345-007-0209-0</pub-id><pub-id pub-id-type="pmid">17828603</pub-id></citation></ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Epstein</surname> <given-names>J. I.</given-names></name> <name><surname>Zelefsky</surname> <given-names>M. J.</given-names></name> <name><surname>Sjoberg</surname> <given-names>D. D.</given-names></name> <name><surname>Nelson</surname> <given-names>J. B.</given-names></name> <name><surname>Egevad</surname> <given-names>L.</given-names></name> <name><surname>Magi-Galluzzi</surname> <given-names>C.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>A contemporary prostate cancer grading system: a validated alternative to the Gleason score</article-title>. <source>Eur. Urol</source>. <volume>69</volume>, <fpage>428</fpage>&#x02013;<lpage>435</lpage>. <pub-id pub-id-type="doi">10.1016/j.eururo.2015.06.046</pub-id><pub-id pub-id-type="pmid">26166626</pub-id></citation></ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Evans</surname> <given-names>S. M.</given-names></name> <name><surname>Patabendi Bandarage</surname> <given-names>V.</given-names></name> <name><surname>Kronborg</surname> <given-names>C.</given-names></name> <name><surname>Earnest</surname> <given-names>A.</given-names></name> <name><surname>Millar</surname> <given-names>J.</given-names></name> <name><surname>Clouston</surname> <given-names>D.</given-names></name></person-group> (<year>2016</year>). <article-title>Gleason group concordance between biopsy and radical prostatectomy specimens: a cohort study from Prostate Cancer Outcome Registry-Victoria</article-title>. <source>Prost. Int</source>. <volume>4</volume>, <fpage>145</fpage>&#x02013;<lpage>151</lpage>. <pub-id pub-id-type="doi">10.1016/j.prnil.2016.07.004</pub-id><pub-id pub-id-type="pmid">27995114</pub-id></citation></ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fass</surname> <given-names>L.</given-names></name></person-group> (<year>2008</year>). <article-title>Imaging and cancer: a review</article-title>. <source>Mol. Oncol</source>. <volume>2</volume>, <fpage>115</fpage>&#x02013;<lpage>152</lpage>. <pub-id pub-id-type="doi">10.1016/j.molonc.2008.04.001</pub-id></citation></ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ferracin</surname> <given-names>M.</given-names></name> <name><surname>Pedriali</surname> <given-names>M.</given-names></name> <name><surname>Veronese</surname> <given-names>A.</given-names></name> <name><surname>Zagatti</surname> <given-names>B.</given-names></name> <name><surname>Gaf&#x000E1;</surname> <given-names>R.</given-names></name> <name><surname>Magri</surname> <given-names>E.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>MicroRNA profiling for the identification of cancers with unknown primary tissue-of-origin</article-title>. <source>J. Pathol</source>. <volume>225</volume>, <fpage>43</fpage>&#x02013;<lpage>53</lpage>. <pub-id pub-id-type="doi">10.1002/path.2915</pub-id><pub-id pub-id-type="pmid">21630269</pub-id></citation></ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Freije</surname> <given-names>W. A.</given-names></name> <name><surname>Castro-Vargas</surname> <given-names>F. E.</given-names></name> <name><surname>Fang</surname> <given-names>Z.</given-names></name> <name><surname>Horvath</surname> <given-names>S.</given-names></name> <name><surname>Cloughesy</surname> <given-names>T.</given-names></name> <name><surname>Liau</surname> <given-names>L. M.</given-names></name> <etal/></person-group>. (<year>2004</year>). <article-title>Gene expression profiling of gliomas strongly predicts survival</article-title>. <source>Cancer Res</source>. <volume>64</volume>, <fpage>6503</fpage>&#x02013;<lpage>6510</lpage>. <pub-id pub-id-type="doi">10.1158/0008-5472.CAN-04-0452</pub-id><pub-id pub-id-type="pmid">15374961</pub-id></citation></ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Garside</surname> <given-names>K.</given-names></name> <name><surname>Henderson</surname> <given-names>R.</given-names></name> <name><surname>Makarenko</surname> <given-names>I.</given-names></name> <name><surname>Masoller</surname> <given-names>C.</given-names></name></person-group> (<year>2019</year>). <article-title>Topological data analysis of high resolution diabetic retinopathy images</article-title>. <source>PLoS ONE</source> <volume>14</volume>:<fpage>e217413</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0217413</pub-id><pub-id pub-id-type="pmid">31125372</pub-id></citation></ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ghrist</surname> <given-names>R.</given-names></name></person-group> (<year>2008</year>). <article-title>Barcodes: the persistent topology of data</article-title>. <source>Bull. Am. Math. Soc</source>. <volume>45</volume>, <fpage>61</fpage>&#x02013;<lpage>75</lpage>. <pub-id pub-id-type="doi">10.1090/S0273-0979-07-01191-3</pub-id></citation></ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ghrist</surname> <given-names>R. W.</given-names></name></person-group> (<year>2014</year>). <source>Elementary Applied Topology, Vol. 1</source>. Createspace Seattle.</citation></ref>
<ref id="B55">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Gidea</surname> <given-names>M.</given-names></name></person-group> (<year>2017</year>). <article-title>&#x0201C;Topology data analysis of critical transitions in financial networks,&#x0201D;</article-title> in <source>3rd International Winter School and Conference on Network Science</source> (<publisher-loc>Tel Aviv</publisher-loc>), <fpage>47</fpage>&#x02013;<lpage>59</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-319-55471-6_5</pub-id></citation></ref>
<ref id="B56">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gidea</surname> <given-names>M.</given-names></name> <name><surname>Goldsmith</surname> <given-names>D.</given-names></name> <name><surname>Katz</surname> <given-names>Y.</given-names></name> <name><surname>Roldan</surname> <given-names>P.</given-names></name> <name><surname>Shmalo</surname> <given-names>Y.</given-names></name></person-group> (<year>2020</year>). <article-title>Topological recognition of critical transitions in time series of cryptocurrencies</article-title>. <source>Phys. A</source> <volume>548</volume>:<fpage>123843</fpage>. <pub-id pub-id-type="doi">10.1016/j.physa.2019.123843</pub-id></citation></ref>
<ref id="B57">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gidea</surname> <given-names>M.</given-names></name> <name><surname>Katz</surname> <given-names>Y.</given-names></name></person-group> (<year>2018</year>). <article-title>Topological data analysis of financial time series: landscapes of crashes</article-title>. <source>Phys. A</source> <volume>491</volume>, <fpage>820</fpage>&#x02013;<lpage>834</lpage>. <pub-id pub-id-type="doi">10.1016/j.physa.2017.09.028</pub-id></citation></ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goodman</surname> <given-names>M.</given-names></name> <name><surname>Ward</surname> <given-names>K. C.</given-names></name> <name><surname>Osunkoya</surname> <given-names>A. O.</given-names></name> <name><surname>Datta</surname> <given-names>M. W.</given-names></name> <name><surname>Luthringer</surname> <given-names>D.</given-names></name> <name><surname>Young</surname> <given-names>A. N.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Frequency and determinants of disagreement and error in gleason scores: a population-based study of prostate cancer</article-title>. <source>Prostate</source> <volume>72</volume>, <fpage>1389</fpage>&#x02013;<lpage>1398</lpage>. <pub-id pub-id-type="doi">10.1002/pros.22484</pub-id><pub-id pub-id-type="pmid">22228120</pub-id></citation></ref>
<ref id="B59">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Gretton</surname> <given-names>A.</given-names></name> <name><surname>Borgwardt</surname> <given-names>K. M.</given-names></name> <name><surname>Rasch</surname> <given-names>M. J.</given-names></name> <name><surname>Smola</surname> <given-names>A.</given-names></name> <name><surname>Sch&#x000F6;lkopf</surname> <given-names>B.</given-names></name> <name><surname>Smola Gretton</surname> <given-names>A.</given-names></name></person-group> (<year>2012</year>). <article-title>A kernel two-sample test</article-title>. <source>J. Mach. Learn. Res</source>. <volume>13</volume>, <fpage>723</fpage>&#x02013;<lpage>773</lpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://jmlr.org/papers/v13/gretton12a.html">http://jmlr.org/papers/v13/gretton12a.html</ext-link></citation></ref>
<ref id="B60">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gu</surname> <given-names>J.</given-names></name> <name><surname>Taylor</surname> <given-names>C. R.</given-names></name></person-group> (<year>2014</year>). <article-title>Practicing pathology in the era of big data and personalized medicine</article-title>. <source>Appl. Immunohistochem. Mol. Morphol</source>. <volume>22</volume>, <fpage>1</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1097/PAI.0000000000000022</pub-id><pub-id pub-id-type="pmid">24326463</pub-id></citation></ref>
<ref id="B61">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guan</surname> <given-names>Y.</given-names></name> <name><surname>Stephens</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>Bayesian variable selection regression for genome-wide association studies and other large-scale problems</article-title>. <source>Ann. Appl. Stat</source>. <volume>5</volume>, <fpage>1780</fpage>&#x02013;<lpage>1815</lpage>. <pub-id pub-id-type="doi">10.1214/11-AOAS455</pub-id></citation></ref>
<ref id="B62">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Han</surname> <given-names>W.</given-names></name> <name><surname>Han</surname> <given-names>M. R.</given-names></name> <name><surname>Kang</surname> <given-names>J. J.</given-names></name> <name><surname>Bae</surname> <given-names>J. Y.</given-names></name> <name><surname>Lee</surname> <given-names>J. H.</given-names></name> <name><surname>Bae</surname> <given-names>Y. J.</given-names></name> <etal/></person-group>. (<year>2006</year>). <article-title>Genomic alterations identified by array comparative genomic hybridization as prognostic markers in tamoxifen-treated estrogen receptor-positive breast cancer</article-title>. <source>BMC Cancer</source> <volume>6</volume>:<fpage>92</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2407-6-92</pub-id><pub-id pub-id-type="pmid">16608533</pub-id></citation></ref>
<ref id="B63">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hanahan</surname> <given-names>D.</given-names></name> <name><surname>Weinberg</surname> <given-names>R. A.</given-names></name></person-group> (<year>2000</year>). <source>The Hallmarks of Cancer</source>. Technical report. <volume>100</volume>, <fpage>57</fpage>&#x02013;<lpage>70</lpage>. <pub-id pub-id-type="doi">10.1016/S0092-8674(00)81683-9</pub-id></citation></ref>
<ref id="B64">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Hatcher</surname> <given-names>A.</given-names></name></person-group> (<year>2002</year>). <source>Algebraic Topology</source>. <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation></ref>
<ref id="B65">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Helpap</surname> <given-names>B.</given-names></name> <name><surname>Kristiansen</surname> <given-names>G.</given-names></name> <name><surname>Beer</surname> <given-names>M.</given-names></name> <name><surname>K&#x000F6;llermann</surname> <given-names>J.</given-names></name> <name><surname>Oehler</surname> <given-names>U.</given-names></name> <name><surname>Pogrebniak</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Improving the reproducibility of the gleason scores in small foci of prostate cancer - Suggestion of diagnostic criteria for glandular fusion</article-title>. <source>Pathol. Oncol. Res</source>. <volume>18</volume>, <fpage>615</fpage>&#x02013;<lpage>621</lpage>. <pub-id pub-id-type="doi">10.1007/s12253-011-9484-6</pub-id><pub-id pub-id-type="pmid">22179685</pub-id></citation></ref>
<ref id="B66">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Herbrich</surname> <given-names>R.</given-names></name> <name><surname>Smola</surname> <given-names>A.</given-names></name> <name><surname>Bousquet</surname> <given-names>O.</given-names></name> <name><surname>Sch&#x000F6;lkopf Bernhardschoelkopf</surname> <given-names>B.</given-names></name> <name><surname>Gretton</surname> <given-names>A.</given-names></name> <name><surname>Sch&#x000F6;lkopf Gretton</surname> <given-names>B.</given-names></name></person-group> (<year>2005</year>). <article-title>Kernel methods for measuring independence</article-title>. <source>J. Mach. Learn. Res</source>. <volume>6</volume>, <fpage>2075</fpage>&#x02013;<lpage>2129</lpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://jmlr.org/papers/v6/gretton05a.html">http://jmlr.org/papers/v6/gretton05a.html</ext-link></citation></ref>
<ref id="B67">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Hong</surname> <given-names>B.-W.</given-names></name> <name><surname>Brady</surname> <given-names>M.</given-names></name></person-group> (<year>2003</year>). <article-title>&#x0201C;A topographic representation for mammogram segmentation,&#x0201D;</article-title> in <source>International Conference on Medical Image Computing and Computer-Assisted Intervention</source> (<publisher-loc>Montreal, QC</publisher-loc>), <fpage>730</fpage>&#x02013;<lpage>737</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-540-39903-2_89</pub-id></citation></ref>
<ref id="B68">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Horak</surname> <given-names>D.</given-names></name> <name><surname>Maleti&#x00107;</surname> <given-names>S.</given-names></name> <name><surname>Rajkovi&#x00107;</surname> <given-names>M.</given-names></name></person-group> (<year>2009</year>). <article-title>Persistent homology of complex networks</article-title>. <source>J. Stat. Mech</source>. <volume>2009</volume>:<fpage>P03034</fpage>. <pub-id pub-id-type="doi">10.1088/1742-5468/2009/03/P03034</pub-id></citation></ref>
<ref id="B69">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Horlings</surname> <given-names>H. M.</given-names></name> <name><surname>Lai</surname> <given-names>C.</given-names></name> <name><surname>Nuyten</surname> <given-names>D. S.</given-names></name> <name><surname>Halfwerk</surname> <given-names>H.</given-names></name> <name><surname>Kristel</surname> <given-names>P.</given-names></name> <name><surname>Van Beers</surname> <given-names>E.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Integration of DNA copy number alterations and prognostic gene expression signatures in breast cancer patients</article-title>. <source>Clin. Cancer Res</source>. <volume>16</volume>, <fpage>651</fpage>&#x02013;<lpage>663</lpage>. <pub-id pub-id-type="doi">10.1158/1078-0432.CCR-09-0709</pub-id><pub-id pub-id-type="pmid">20068109</pub-id></citation></ref>
<ref id="B70">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Humphrey</surname> <given-names>P. A.</given-names></name></person-group> (<year>2004</year>). <article-title>Gleason grading and prognostic factors in carcinoma of the prostate</article-title>. <source>Modern Pathol</source>. <volume>17</volume>, <fpage>292</fpage>&#x02013;<lpage>306</lpage>. <pub-id pub-id-type="doi">10.1038/modpathol.3800054</pub-id><pub-id pub-id-type="pmid">14976540</pub-id></citation></ref>
<ref id="B71">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ishwaran</surname> <given-names>H.</given-names></name> <name><surname>Rao</surname> <given-names>J. S.</given-names></name></person-group> (<year>2005</year>). <article-title>Spike and slab variable selection: frequentist and bayesian strategies</article-title>. <source>Ann. Stat</source>. <volume>33</volume>, <fpage>730</fpage>&#x02013;<lpage>773</lpage>. <pub-id pub-id-type="doi">10.1214/009053604000001147</pub-id></citation></ref>
<ref id="B72">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jain</surname> <given-names>R. K.</given-names></name></person-group> (<year>2005</year>). <article-title>Normalization of tumor vasculature: an emerging concept in antiangiogenic therapy</article-title>. <source>Sci. Rev</source>. <volume>307</volume>, <fpage>58</fpage>&#x02013;<lpage>62</lpage>. <pub-id pub-id-type="doi">10.1126/science.1104819</pub-id><pub-id pub-id-type="pmid">15637262</pub-id></citation></ref>
<ref id="B73">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Khan</surname> <given-names>A.</given-names></name> <name><surname>El-Daly</surname> <given-names>H.</given-names></name> <name><surname>Simmons</surname> <given-names>E.</given-names></name> <name><surname>Rajpoot</surname> <given-names>N.</given-names></name></person-group> (<year>2013</year>). <article-title>HyMaP: A hybrid magnitude-phase approach to unsupervised segmentation of tumor areas in breast cancer histology images</article-title>. <source>J. Pathol. Inform</source>. <volume>4</volume>(<supplement>Suppl</supplement>):<fpage>S1</fpage>. <pub-id pub-id-type="doi">10.4103/2153-3539.109802</pub-id><pub-id pub-id-type="pmid">23766931</pub-id></citation></ref>
<ref id="B74">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Khasawneh</surname> <given-names>F. A.</given-names></name> <name><surname>Munch</surname> <given-names>E.</given-names></name></person-group> (<year>2016</year>). <article-title>Chatter detection in turning using persistent homology</article-title>. <source>Mech. Syst. Signal Process</source>. <fpage>70</fpage>&#x02013;<lpage>71</lpage>, <fpage>527</fpage>&#x02013;<lpage>541</lpage>. <pub-id pub-id-type="doi">10.1016/j.ymssp.2015.09.046</pub-id></citation></ref>
<ref id="B75">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kimura</surname> <given-names>M.</given-names></name> <name><surname>Obayashi</surname> <given-names>I.</given-names></name> <name><surname>Takeichi</surname> <given-names>Y.</given-names></name> <name><surname>Murao</surname> <given-names>R.</given-names></name> <name><surname>Hiraoka</surname> <given-names>Y.</given-names></name></person-group> (<year>2018</year>). <article-title>Non-empirical identification of trigger sites in heterogeneous processes using persistent homology</article-title>. <source>Sci. Rep</source>. <volume>8</volume>:<fpage>3553</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-018-21867-z</pub-id><pub-id pub-id-type="pmid">29476108</pub-id></citation></ref>
<ref id="B76">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kourou</surname> <given-names>K.</given-names></name> <name><surname>Rigas</surname> <given-names>G.</given-names></name> <name><surname>Papaloukas</surname> <given-names>C.</given-names></name> <name><surname>Mitsis</surname> <given-names>M.</given-names></name> <name><surname>Fotiadis</surname> <given-names>D. I.</given-names></name></person-group> (<year>2020</year>). <article-title>Cancer classification from time series microarray data through regulatory Dynamic Bayesian Networks</article-title>. <source>Comput. Biol. Med</source>. <volume>116</volume>:<fpage>103577</fpage>. <pub-id pub-id-type="doi">10.1016/j.compbiomed.2019.103577</pub-id><pub-id pub-id-type="pmid">32001012</pub-id></citation></ref>
<ref id="B77">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Kusano</surname> <given-names>G.</given-names></name> <name><surname>Hiraoka</surname> <given-names>Y.</given-names></name> <name><surname>Fukumizu</surname> <given-names>K.</given-names></name></person-group> (<year>2016</year>). <article-title>&#x0201C;Persistence weighted Gaussian kernel for topological data analysis,&#x0201D;</article-title> in <source>International Conference on Machine Learning</source> (<publisher-loc>New York, NY</publisher-loc>), <fpage>2004</fpage>&#x02013;<lpage>2013</lpage>.</citation></ref>
<ref id="B78">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Laurie</surname> <given-names>C. C.</given-names></name> <name><surname>Laurie</surname> <given-names>C. A.</given-names></name> <name><surname>Rice</surname> <given-names>K.</given-names></name> <name><surname>Doheny</surname> <given-names>K. F.</given-names></name> <name><surname>Zelnick</surname> <given-names>L. R.</given-names></name> <name><surname>McHugh</surname> <given-names>C. P.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Detectable clonal mosaicism from birth to old age and its relationship to cancer</article-title>. <source>Nat. Genet</source>. <volume>44</volume>, <fpage>642</fpage>&#x02013;<lpage>650</lpage>. <pub-id pub-id-type="doi">10.1038/ng.2271</pub-id><pub-id pub-id-type="pmid">22561516</pub-id></citation></ref>
<ref id="B79">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lawson</surname> <given-names>P.</given-names></name> <name><surname>Sholl</surname> <given-names>A. B.</given-names></name> <name><surname>Brown</surname> <given-names>J. Q.</given-names></name> <name><surname>Fasy</surname> <given-names>B. T.</given-names></name> <name><surname>Wenk</surname> <given-names>C.</given-names></name></person-group> (<year>2019</year>). <article-title>Persistent homology for the quantitative evaluation of architectural features in prostate cancer histology</article-title>. <source>Sci. Rep</source>. <volume>9</volume>:<fpage>1139</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-018-36798-y</pub-id><pub-id pub-id-type="pmid">30718811</pub-id></citation></ref>
<ref id="B80">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>C. H.</given-names></name> <name><surname>Tam</surname> <given-names>P. K.</given-names></name></person-group> (<year>1998</year>). <article-title>An iterative algorithm for minimum cross entropy thresholding</article-title>. <source>Pattern Recogn. Lett</source>. <volume>19</volume>, <fpage>771</fpage>&#x02013;<lpage>776</lpage>. <pub-id pub-id-type="doi">10.1016/S0167-8655(98)00057-9</pub-id></citation></ref>
<ref id="B81">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>M.</given-names></name> <name><surname>An</surname> <given-names>H.</given-names></name> <name><surname>Angelovici</surname> <given-names>R.</given-names></name> <name><surname>Bagaza</surname> <given-names>C.</given-names></name> <name><surname>Batushansky</surname> <given-names>A.</given-names></name> <name><surname>Clark</surname> <given-names>L.</given-names></name> <name><surname>Coneva</surname> <given-names>V.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Topological data analysis as a morphometric method: Using persistent homology to demarcate a leaf morphospace</article-title>. <source>Front. Plant Sci</source>. <volume>9</volume>:<fpage>553</fpage>. <pub-id pub-id-type="doi">10.3389/fpls.2018.00553</pub-id><pub-id pub-id-type="pmid">29922307</pub-id></citation></ref>
<ref id="B82">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Lockwood</surname> <given-names>S.</given-names></name> <name><surname>Krishnamoorthy</surname> <given-names>B.</given-names></name></person-group> (<year>2015</year>). <article-title>&#x0201C;Topological features in cancer gene expression data,&#x0201D;</article-title> in <source>Pacific Symposium on Biocomputing</source> (<publisher-loc>Kohala Coast</publisher-loc>).<pub-id pub-id-type="pmid">25592573</pub-id></citation></ref>
<ref id="B83">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Macenko</surname> <given-names>M.</given-names></name> <name><surname>Niethammer</surname> <given-names>M.</given-names></name> <name><surname>Marron</surname> <given-names>J.</given-names></name> <name><surname>Borland</surname> <given-names>D.</given-names></name> <name><surname>Woosley</surname> <given-names>J. T.</given-names></name> <name><surname>Guan</surname> <given-names>X.</given-names></name></person-group> (<year>2009</year>). <article-title>&#x0201C;A method for normalizing histology slides for quantitative analysis,&#x0201D;</article-title> in <source>IEEE International Symposium on Biomedical Imaging: From Nano to Macro</source> (<publisher-loc>Boston, MA</publisher-loc>), <fpage>1107</fpage>&#x02013;<lpage>1110</lpage>. <pub-id pub-id-type="doi">10.1109/ISBI.2009.5193250</pub-id></citation></ref>
<ref id="B84">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maley</surname> <given-names>C. C.</given-names></name> <name><surname>Galipeau</surname> <given-names>P. C.</given-names></name> <name><surname>Finley</surname> <given-names>J. C.</given-names></name> <name><surname>Wongsurawat</surname> <given-names>V. J.</given-names></name> <name><surname>Li</surname> <given-names>X.</given-names></name> <name><surname>Sanchez</surname> <given-names>C. A.</given-names></name> <etal/></person-group>. (<year>2006</year>). <article-title>Genetic clonal diversity predicts progression to esophageal adenocarcinoma</article-title>. <source>Nat. Genet</source>. <volume>38</volume>, <fpage>468</fpage>&#x02013;<lpage>473</lpage>. <pub-id pub-id-type="doi">10.1038/ng1768</pub-id><pub-id pub-id-type="pmid">16565718</pub-id></citation></ref>
<ref id="B85">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maley</surname> <given-names>C. C.</given-names></name> <name><surname>Reid</surname> <given-names>B. J.</given-names></name></person-group> (<year>2005</year>). <article-title>Natural selection in neoplastic progression of Barrett&#x00027;s esophagus</article-title>. <source>Semin. Cancer Biol</source>. <volume>15</volume>, <fpage>474</fpage>&#x02013;<lpage>483</lpage>. <pub-id pub-id-type="doi">10.1016/j.semcancer.2005.06.004</pub-id><pub-id pub-id-type="pmid">16043360</pub-id></citation></ref>
<ref id="B86">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marquard</surname> <given-names>A. M.</given-names></name> <name><surname>Birkbak</surname> <given-names>N. J.</given-names></name> <name><surname>Thomas</surname> <given-names>C. E.</given-names></name> <name><surname>Favero</surname> <given-names>F.</given-names></name> <name><surname>Krzystanek</surname> <given-names>M.</given-names></name> <name><surname>Lefebvre</surname> <given-names>C.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>TumorTracer: a method to identify the tissue of origin from the somatic mutations of a tumor specimen</article-title>. <source>BMC Med. Genomics</source> <volume>8</volume>:<fpage>58</fpage>. <pub-id pub-id-type="doi">10.1186/s12920-015-0130-0</pub-id><pub-id pub-id-type="pmid">26429708</pub-id></citation></ref>
<ref id="B87">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marron</surname> <given-names>J. S.</given-names></name> <name><surname>Todd</surname> <given-names>M.</given-names></name></person-group> (<year>2007</year>). <article-title>Distance-weighted discrimination</article-title>. <source>J. Am. Stat. Assoc</source>. <volume>102</volume>, <fpage>1267</fpage>&#x02013;<lpage>1271</lpage>. <pub-id pub-id-type="doi">10.1198/016214507000001120</pub-id></citation></ref>
<ref id="B88">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mileyko</surname> <given-names>Y.</given-names></name> <name><surname>Mukherjee</surname> <given-names>S.</given-names></name> <name><surname>Harer</surname> <given-names>J.</given-names></name></person-group> (<year>2011</year>). <article-title>Probability measures on the space of persistence diagrams</article-title>. <source>Inverse Probl</source>. <volume>27</volume>:<fpage>124007</fpage>. <pub-id pub-id-type="doi">10.1088/0266-5611/27/12/124007</pub-id></citation></ref>
<ref id="B89">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mischaikow</surname> <given-names>K.</given-names></name> <name><surname>Nanda</surname> <given-names>V.</given-names></name></person-group> (<year>2013</year>). <article-title>Morse theory for filtrations and efficient computation of persistent homology</article-title>. <source>Discr. Comput. Geom</source>. <volume>50</volume>, <fpage>330</fpage>&#x02013;<lpage>353</lpage>. <pub-id pub-id-type="doi">10.1007/s00454-013-9529-6</pub-id></citation></ref>
<ref id="B90">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moran</surname> <given-names>S.</given-names></name> <name><surname>Mart&#x000ED;nez-Card&#x000FA;s</surname> <given-names>A.</given-names></name> <name><surname>Sayols</surname> <given-names>S.</given-names></name> <name><surname>Musul&#x000E9;n</surname> <given-names>E.</given-names></name> <name><surname>Bala n&#x000E1;</surname> <given-names>C.</given-names></name> <name><surname>Estival-Gonzalez</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Epigenetic profiling to classify cancer of unknown primary: a multicentre, retrospective analysis</article-title>. <source>Lancet Oncol</source>. <volume>17</volume>, <fpage>1386</fpage>&#x02013;<lpage>1395</lpage>. <pub-id pub-id-type="doi">10.1016/S1470-2045(16)30297-2</pub-id><pub-id pub-id-type="pmid">27575023</pub-id></citation></ref>
<ref id="B91">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Munkres</surname> <given-names>J. R.</given-names></name></person-group> (<year>1984</year>). <source>Elements of Algebraic Topology</source>. <publisher-loc>Menlo Park, CA</publisher-loc>: <publisher-name>Addison-Wesley Publishing Company</publisher-name>.</citation></ref>
<ref id="B92">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Neve</surname> <given-names>R. M.</given-names></name> <name><surname>Chin</surname> <given-names>K.</given-names></name> <name><surname>Fridlyand</surname> <given-names>J.</given-names></name> <name><surname>Yeh</surname> <given-names>J.</given-names></name> <name><surname>Baehner</surname> <given-names>F. L.</given-names></name> <name><surname>Fevr</surname> <given-names>T.</given-names></name> <etal/></person-group>. (<year>2006</year>). <article-title>A collection of breast cancer cell lines for the study of functionally distinct cancer subtypes</article-title>. <source>Cancer Cell</source> <volume>10</volume>, <fpage>515</fpage>&#x02013;<lpage>527</lpage>. <pub-id pub-id-type="doi">10.1016/j.ccr.2006.10.008</pub-id><pub-id pub-id-type="pmid">17157791</pub-id></citation></ref>
<ref id="B93">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nicolau</surname> <given-names>M.</given-names></name> <name><surname>Levine</surname> <given-names>A. J.</given-names></name> <name><surname>Carlsson</surname> <given-names>G.</given-names></name></person-group> (<year>2011</year>). <article-title>Topology based data analysis identifies a subgroup of breast cancers with a unique mutational profile and excellent survival</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>108</volume>, <fpage>7265</fpage>&#x02013;<lpage>7270</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1102826108</pub-id><pub-id pub-id-type="pmid">21482760</pub-id></citation></ref>
<ref id="B94">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nicolau</surname> <given-names>M.</given-names></name> <name><surname>Tibshirani</surname> <given-names>R.</given-names></name> <name><surname>B&#x000F8;rresen-Dale</surname> <given-names>A. L.</given-names></name> <name><surname>Jeffrey</surname> <given-names>S. S.</given-names></name></person-group> (<year>2007</year>). <article-title>Disease-specific genomic analysis: identifying the signature of pathologic biology</article-title>. <source>Bioinformatics</source> <volume>23</volume>, <fpage>957</fpage>&#x02013;<lpage>965</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btm033</pub-id><pub-id pub-id-type="pmid">17277331</pub-id></citation></ref>
<ref id="B95">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nielson</surname> <given-names>J. L.</given-names></name> <name><surname>Cooper</surname> <given-names>S. R.</given-names></name> <name><surname>Yue</surname> <given-names>J. K.</given-names></name> <name><surname>Sorani</surname> <given-names>M. D.</given-names></name> <name><surname>Inoue</surname> <given-names>T.</given-names></name> <name><surname>Yuh</surname> <given-names>E. L.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Uncovering precision phenotype-biomarker associations in traumatic brain injury using topological data analysis</article-title>. <source>PLoS ONE</source> <volume>12</volume>:<fpage>e169490</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0169490</pub-id><pub-id pub-id-type="pmid">28257413</pub-id></citation></ref>
<ref id="B96">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nutt</surname> <given-names>C. L.</given-names></name> <name><surname>Mani</surname> <given-names>D. R.</given-names></name> <name><surname>Betensky</surname> <given-names>R. A.</given-names></name> <name><surname>Tamayo</surname> <given-names>P.</given-names></name> <name><surname>Cairncross</surname> <given-names>J. G.</given-names></name> <name><surname>Ladd</surname> <given-names>C.</given-names></name> <etal/></person-group>. (<year>2003</year>). <article-title>Gene expression-based classification of malignant gliomas correlates better with survival than histological classification</article-title>. <source>Cancer Res</source>. <volume>63</volume>, <fpage>1602</fpage>&#x02013;<lpage>1607</lpage>.<pub-id pub-id-type="pmid">12670911</pub-id></citation></ref>
<ref id="B97">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Obayashi</surname> <given-names>I.</given-names></name> <name><surname>Hiraoka</surname> <given-names>Y.</given-names></name></person-group> (<year>2018</year>). <article-title>Persistence diagrams with linear machine learning models</article-title>. <source>J. Appl. Comput. Topol</source>. <volume>1</volume>, <fpage>421</fpage>&#x02013;<lpage>449</lpage>. <pub-id pub-id-type="doi">10.1007/s41468-018-0013-5</pub-id></citation></ref>
<ref id="B98">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Otter</surname> <given-names>N.</given-names></name> <name><surname>Porter</surname> <given-names>M. A.</given-names></name> <name><surname>Tillmann</surname> <given-names>U.</given-names></name> <name><surname>Grindrod</surname> <given-names>P.</given-names></name> <name><surname>Harrington</surname> <given-names>H. A.</given-names></name></person-group> (<year>2017</year>). <article-title>A roadmap for the computation of persistent homology</article-title>. <source>EPJ Data Sci</source>. <volume>6</volume>:<fpage>17</fpage>. <pub-id pub-id-type="doi">10.1140/epjds/s13688-017-0109-5</pub-id><pub-id pub-id-type="pmid">32025466</pub-id></citation></ref>
<ref id="B99">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Oudot</surname> <given-names>S. Y.</given-names></name></person-group> (<year>2015</year>). <source>Persistence Theory: From Quiver Representations to Data Analysis, Vol. 209 of Mathematical Surveys and Monographs</source>. <publisher-loc>Providence, RI</publisher-loc>: <publisher-name>American Mathematical Society</publisher-name>. <pub-id pub-id-type="doi">10.1090/surv/209</pub-id></citation></ref>
<ref id="B100">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oyama</surname> <given-names>A.</given-names></name> <name><surname>Hiraoka</surname> <given-names>Y.</given-names></name> <name><surname>Obayashi</surname> <given-names>I.</given-names></name> <name><surname>Saikawa</surname> <given-names>Y.</given-names></name> <name><surname>Furui</surname> <given-names>S.</given-names></name> <name><surname>Shiraishi</surname> <given-names>K.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Hepatic tumor classification using texture and topology analysis of non-contrast-enhanced three-dimensional T1-weighted MR images with a radiomics approach</article-title>. <source>Sci. Rep</source>. <volume>9</volume>:<fpage>8764</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-019-45283-z</pub-id><pub-id pub-id-type="pmid">31217445</pub-id></citation></ref>
<ref id="B101">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pereira</surname> <given-names>C. M.</given-names></name> <name><surname>de Mello</surname> <given-names>R. F.</given-names></name></person-group> (<year>2015</year>). <article-title>Persistent homology for time series and spatial data clustering</article-title>. <source>Expert Syst. Appl</source>. <volume>42</volume>, <fpage>6026</fpage>&#x02013;<lpage>6038</lpage>. <pub-id pub-id-type="doi">10.1016/j.eswa.2015.04.010</pub-id></citation></ref>
<ref id="B102">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Phillips</surname> <given-names>H. S.</given-names></name> <name><surname>Kharbanda</surname> <given-names>S.</given-names></name> <name><surname>Chen</surname> <given-names>R.</given-names></name> <name><surname>Forrest</surname> <given-names>W. F.</given-names></name> <name><surname>Soriano</surname> <given-names>R. H.</given-names></name> <name><surname>Wu</surname> <given-names>T. D.</given-names></name> <etal/></person-group>. (<year>2006</year>). <article-title>Molecular subclasses of high-grade glioma predict prognosis, delineate a pattern of disease progression, and resemble stages in neurogenesis</article-title>. <source>Cancer Cell</source> <volume>9</volume>, <fpage>157</fpage>&#x02013;<lpage>173</lpage>. <pub-id pub-id-type="doi">10.1016/j.ccr.2006.02.019</pub-id><pub-id pub-id-type="pmid">16530701</pub-id></citation></ref>
<ref id="B103">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Qaiser</surname> <given-names>T.</given-names></name> <name><surname>Sirinukunwattana</surname> <given-names>K.</given-names></name> <name><surname>Nakane</surname> <given-names>K.</given-names></name> <name><surname>Tsang</surname> <given-names>Y. W.</given-names></name> <name><surname>Epstein</surname> <given-names>D.</given-names></name> <name><surname>Rajpoot</surname> <given-names>N.</given-names></name></person-group> (<year>2016</year>). <article-title>&#x0201C;Persistent homology for fast tumor segmentation in whole slide histology images,&#x0201D;</article-title> in <source>Procedia Computer Science, Vol. 90</source> (<publisher-loc>Loughborough</publisher-loc>: <publisher-name>Elsevier B.V.</publisher-name>), <fpage>119</fpage>&#x02013;<lpage>124</lpage>. <pub-id pub-id-type="doi">10.1016/j.procs.2016.07.033</pub-id></citation></ref>
<ref id="B104">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rabad&#x000E1;n</surname> <given-names>R.</given-names></name> <name><surname>Mohamedi</surname> <given-names>Y.</given-names></name> <name><surname>Rubin</surname> <given-names>U.</given-names></name> <name><surname>Chu</surname> <given-names>T.</given-names></name> <name><surname>Alghalith</surname> <given-names>A. N.</given-names></name> <name><surname>Elliott</surname> <given-names>O.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>Identification of relevant genetic alterations in cancer using topological data analysis</article-title>. <source>Nat. Commun</source>. <volume>11</volume>, <fpage>1</fpage>&#x02013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1101/2020.01.30.922310</pub-id><pub-id pub-id-type="pmid">32732999</pub-id></citation></ref>
<ref id="B105">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ravishanker</surname> <given-names>N.</given-names></name> <name><surname>Chen</surname> <given-names>R.</given-names></name></person-group> (<year>2019</year>). <article-title>Topological data analysis (TDA) for time series</article-title>. <source>arXiv: 1909.10604</source>.</citation></ref>
<ref id="B106">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Reinhard</surname> <given-names>E.</given-names></name> <name><surname>Ashikhmin</surname> <given-names>M.</given-names></name> <name><surname>Gooch</surname> <given-names>B.</given-names></name> <name><surname>Shirley</surname> <given-names>P.</given-names></name></person-group> (<year>2001</year>). <article-title>Color transfer between images</article-title>. <source>IEEE Comput. Graph. Appl</source>. <volume>21</volume>, <fpage>34</fpage>&#x02013;<lpage>41</lpage>. <pub-id pub-id-type="doi">10.1109/38.946629</pub-id></citation></ref>
<ref id="B107">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Reininghaus</surname> <given-names>J.</given-names></name> <name><surname>Huber</surname> <given-names>S.</given-names></name> <name><surname>Bauer</surname> <given-names>U.</given-names></name> <name><surname>Kwitt</surname> <given-names>R.</given-names></name></person-group> (<year>2015</year>). <article-title>&#x0201C;A stable multi-scale kernel for topological machine learning,&#x0201D;</article-title> in <source>Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition</source> (<publisher-loc>Boston, MA</publisher-loc>), <fpage>4741</fpage>&#x02013;<lpage>4748</lpage>. <pub-id pub-id-type="doi">10.1109/CVPR.2015.7299106</pub-id></citation></ref>
<ref id="B108">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Reuter</surname> <given-names>J. A.</given-names></name> <name><surname>Spacek</surname> <given-names>D. V.</given-names></name> <name><surname>Snyder</surname> <given-names>M. P.</given-names></name></person-group> (<year>2015</year>). <article-title>High-throughput sequencing technologies</article-title>. <source>Mol. Cell</source> <volume>58</volume>, <fpage>586</fpage>&#x02013;<lpage>597</lpage>. <pub-id pub-id-type="doi">10.1016/j.molcel.2015.05.004</pub-id></citation></ref>
<ref id="B109">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roychowdhury</surname> <given-names>S.</given-names></name> <name><surname>Iyer</surname> <given-names>M. K.</given-names></name> <name><surname>Robinson</surname> <given-names>D. R.</given-names></name> <name><surname>Lonigro</surname> <given-names>R. J.</given-names></name> <name><surname>Wu</surname> <given-names>Y. M.</given-names></name> <name><surname>Cao</surname> <given-names>X.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Personalized oncology through integrative high-throughput sequencing: a pilot study</article-title>. <source>Sci. Transl. Med</source>. <volume>3</volume>, <fpage>1</fpage>&#x02013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.1126/scitranslmed.3003161</pub-id><pub-id pub-id-type="pmid">22133722</pub-id></citation></ref>
<ref id="B110">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rucco</surname> <given-names>M.</given-names></name> <name><surname>Merelli</surname> <given-names>E.</given-names></name> <name><surname>Herman</surname> <given-names>D.</given-names></name> <name><surname>Ramanan</surname> <given-names>D.</given-names></name> <name><surname>Petrossian</surname> <given-names>T.</given-names></name> <name><surname>Falsetti</surname> <given-names>L.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Using Topological Data Analysis for diagnosis pulmonary embolism</article-title>. <source>arXiv:1409.5020v1</source>. <volume>9</volume>, <fpage>41</fpage>&#x02013;<lpage>55</lpage>.<pub-id pub-id-type="pmid">26515513</pub-id></citation></ref>
<ref id="B111">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Seemann</surname> <given-names>L.</given-names></name> <name><surname>Shulman</surname> <given-names>J.</given-names></name> <name><surname>Gunaratne</surname> <given-names>G. H.</given-names></name></person-group> (<year>2012</year>). <article-title>A robust topology-based algorithm for gene expression profiling</article-title>. <source>ISRN Bioinform</source>. <volume>2012</volume>:<fpage>381023</fpage>. <pub-id pub-id-type="doi">10.5402/2012/381023</pub-id><pub-id pub-id-type="pmid">25969748</pub-id></citation></ref>
<ref id="B112">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Siddiqui</surname> <given-names>S.</given-names></name> <name><surname>Shikotra</surname> <given-names>A.</given-names></name> <name><surname>Richardson</surname> <given-names>M.</given-names></name> <name><surname>Doran</surname> <given-names>E.</given-names></name> <name><surname>Choy</surname> <given-names>D.</given-names></name> <name><surname>Bell</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Airway pathological heterogeneity in asthma: visualization of disease microclusters using topological data analysis</article-title>. <source>J. Aller. Clin. Immunol</source>. <volume>142</volume>, <fpage>1457</fpage>&#x02013;<lpage>1468</lpage>. <pub-id pub-id-type="doi">10.1016/j.jaci.2017.12.982</pub-id><pub-id pub-id-type="pmid">29550052</pub-id></citation></ref>
<ref id="B113">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Singh</surname> <given-names>G.</given-names></name> <name><surname>M&#x000E9;moli</surname> <given-names>F.</given-names></name> <name><surname>Carlsson</surname> <given-names>G.</given-names></name></person-group> (<year>2007</year>). <article-title>&#x0201C;Topological methods for the analysis of high dimensional data sets and 3D object recognition,&#x0201D;</article-title> in <source>Eurographics Symposium on Point-Based Graphics</source> (<publisher-loc>Prague</publisher-loc>).</citation></ref>
<ref id="B114">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Singh</surname> <given-names>N.</given-names></name> <name><surname>Couture</surname> <given-names>H. D.</given-names></name> <name><surname>Marron</surname> <given-names>J. S.</given-names></name> <name><surname>Perou</surname> <given-names>C.</given-names></name> <name><surname>Niethammer</surname> <given-names>M.</given-names></name></person-group> (<year>2014</year>). <article-title>&#x0201C;Topological descriptors of histology images,&#x0201D;</article-title> in <source>Machine Learning in Medical Imaging</source> (<publisher-loc>Boston, MA</publisher-loc>). <pub-id pub-id-type="doi">10.1007/978-3-319-10581-9_29</pub-id></citation></ref>
<ref id="B115">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>S&#x000F8;ndergaard</surname> <given-names>D.</given-names></name> <name><surname>Nielsen</surname> <given-names>S.</given-names></name> <name><surname>Pedersen</surname> <given-names>C. N.</given-names></name> <name><surname>Besenbacher</surname> <given-names>S.</given-names></name></person-group> (<year>2017</year>). <article-title>Prediction of primary tumors in cancers of unknown primary</article-title>. <source>J. Integr. Bioinform</source>. <volume>14</volume>:<fpage>20170013</fpage>. <pub-id pub-id-type="doi">10.1515/jib-2017-0013</pub-id><pub-id pub-id-type="pmid">28686574</pub-id></citation></ref>
<ref id="B116">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stack</surname> <given-names>E. C.</given-names></name> <name><surname>Wang</surname> <given-names>C.</given-names></name> <name><surname>Roman</surname> <given-names>K. A.</given-names></name> <name><surname>Hoyt</surname> <given-names>C. C.</given-names></name></person-group> (<year>2014</year>). <article-title>Multiplexed immunohistochemistry, imaging, and quantitation: a review, with an assessment of Tyramide signal amplification, multispectral imaging and multiplex analysis</article-title>. <source>Methods</source> <volume>70</volume>, <fpage>46</fpage>&#x02013;<lpage>58</lpage>. <pub-id pub-id-type="doi">10.1016/j.ymeth.2014.08.016</pub-id><pub-id pub-id-type="pmid">25242720</pub-id></citation></ref>
<ref id="B117">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stolz</surname> <given-names>B. J.</given-names></name> <name><surname>Harrington</surname> <given-names>H. A.</given-names></name> <name><surname>Porter</surname> <given-names>M. A.</given-names></name></person-group> (<year>2017</year>). <article-title>Persistent homology of time-dependent functional networks constructed from coupled time series</article-title>. <source>Chaos</source> <volume>27</volume>:<fpage>047410</fpage>. <pub-id pub-id-type="doi">10.1063/1.4978997</pub-id><pub-id pub-id-type="pmid">28456167</pub-id></citation></ref>
<ref id="B118">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Suwinski</surname> <given-names>P.</given-names></name> <name><surname>Ong</surname> <given-names>C. K.</given-names></name> <name><surname>Ling</surname> <given-names>M. H.</given-names></name> <name><surname>Poh</surname> <given-names>Y. M.</given-names></name> <name><surname>Khan</surname> <given-names>A. M.</given-names></name> <name><surname>Ong</surname> <given-names>H. S.</given-names></name></person-group> (<year>2019</year>). <article-title>Advancing personalized medicine through the application of whole exome sequencing and big data analytics</article-title>. <source>Front. Genet</source>. <volume>10</volume>:<fpage>49</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2019.00049</pub-id><pub-id pub-id-type="pmid">30809243</pub-id></citation></ref>
<ref id="B119">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Tahmassebi</surname> <given-names>A.</given-names></name> <name><surname>Schulte</surname> <given-names>M. H.</given-names></name> <name><surname>Gandomi</surname> <given-names>A. H.</given-names></name> <name><surname>Goudriaan</surname> <given-names>A. E.</given-names></name> <name><surname>McCann</surname> <given-names>I.</given-names></name> <name><surname>Meyer-Baese</surname> <given-names>A.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;Deep learning in medical imaging: FMRI big data analysis via convolutional neural networks,&#x0201D;</article-title> in <source>ACM International Conference Proceeding Series</source> (<publisher-loc>Pittsburgh, PA</publisher-loc>: <publisher-name>Association for Computing Machinery</publisher-name>), <fpage>1</fpage>&#x02013;<lpage>4</lpage>. <pub-id pub-id-type="doi">10.1145/3219104.3229250</pub-id></citation></ref>
<ref id="B120">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Truesdale</surname> <given-names>M. D.</given-names></name> <name><surname>Cheetham</surname> <given-names>P. J.</given-names></name> <name><surname>Turk</surname> <given-names>A. T.</given-names></name> <name><surname>Sartori</surname> <given-names>S.</given-names></name> <name><surname>Hruby</surname> <given-names>G. W.</given-names></name> <name><surname>Dinneen</surname> <given-names>E. P.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Gleason score concordance on biopsy-confirmed prostate cancer: is pathological re-evaluation necessary prior to radical prostatectomy?</article-title> <source>BJU Int</source>. <volume>107</volume>, <fpage>749</fpage>&#x02013;<lpage>754</lpage>. <pub-id pub-id-type="doi">10.1111/j.1464-410X.2010.09570.x</pub-id><pub-id pub-id-type="pmid">20840549</pub-id></citation></ref>
<ref id="B121">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Truong</surname> <given-names>M.</given-names></name> <name><surname>Slezak</surname> <given-names>J. A.</given-names></name> <name><surname>Lin</surname> <given-names>C. P.</given-names></name> <name><surname>Iremashvili</surname> <given-names>V.</given-names></name> <name><surname>Sado</surname> <given-names>M.</given-names></name> <name><surname>Razmaria</surname> <given-names>A. A.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Development and multi-institutional validation of an upgrading risk tool for Gleason 6 prostate cancer</article-title>. <source>Cancer</source> <volume>119</volume>, <fpage>3992</fpage>&#x02013;<lpage>4002</lpage>. <pub-id pub-id-type="doi">10.1002/cncr.28303</pub-id><pub-id pub-id-type="pmid">24006289</pub-id></citation></ref>
<ref id="B122">
<citation citation-type="thesis"><person-group person-group-type="author"><name><surname>Truong</surname> <given-names>P.</given-names></name></person-group> (<year>2017</year>). <source>An exploration of topological properties of high-frequency onedimensional financial time series data using TDA</source> (Ph.D. thesis). KTH Royal Institute of Technology, Stockholm, Sweden.</citation></ref>
<ref id="B123">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tschandl</surname> <given-names>P.</given-names></name> <name><surname>Rosendahl</surname> <given-names>C.</given-names></name> <name><surname>Kittler</surname> <given-names>H.</given-names></name></person-group> (<year>2018</year>). <article-title>Data descriptor: the HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions</article-title>. <source>Sci. Data</source> <volume>5</volume>:<fpage>180161</fpage>. <pub-id pub-id-type="doi">10.1038/sdata.2018.161</pub-id></citation></ref>
<ref id="B124">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Turner</surname> <given-names>K.</given-names></name> <name><surname>Mukherjee</surname> <given-names>S.</given-names></name> <name><surname>Boyer</surname> <given-names>D. M.</given-names></name></person-group> (<year>2014</year>). <article-title>Persistent homology transform for modeling shapes and surfaces</article-title>. <source>Inf. Inference</source> <volume>3</volume>, <fpage>310</fpage>&#x02013;<lpage>344</lpage>. <pub-id pub-id-type="doi">10.1093/imaiai/iau011</pub-id></citation></ref>
<ref id="B125">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Verhaak</surname> <given-names>R. G.</given-names></name> <name><surname>Hoadley</surname> <given-names>K. A.</given-names></name> <name><surname>Purdom</surname> <given-names>E.</given-names></name> <name><surname>Wang</surname> <given-names>V.</given-names></name> <name><surname>Qi</surname> <given-names>Y.</given-names></name> <name><surname>Wilkerson</surname> <given-names>M. D.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1</article-title>. <source>Cancer Cell</source> <volume>17</volume>, <fpage>98</fpage>&#x02013;<lpage>110</lpage>. <pub-id pub-id-type="doi">10.1016/j.ccr.2009.12.020</pub-id><pub-id pub-id-type="pmid">20129251</pub-id></citation></ref>
<ref id="B126">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vikes&#x000E5;</surname> <given-names>J.</given-names></name> <name><surname>M&#x000F8;ller</surname> <given-names>A. K.</given-names></name> <name><surname>Kaczkowski</surname> <given-names>B.</given-names></name> <name><surname>Borup</surname> <given-names>R.</given-names></name> <name><surname>Winther</surname> <given-names>O.</given-names></name> <name><surname>Henao</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Cancers of unknown primary origin (CUP) are characterized by chromosomal instability (CIN) compared to metastasis of know origin</article-title>. <source>BMC Cancer</source> <volume>15</volume>:<fpage>151</fpage>. <pub-id pub-id-type="doi">10.1186/s12885-015-1128-x</pub-id><pub-id pub-id-type="pmid">25885340</pub-id></citation></ref>
<ref id="B127">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wadhwa</surname> <given-names>R. R.</given-names></name> <name><surname>Williamson</surname> <given-names>D. F. K.</given-names></name> <name><surname>Dhawan</surname> <given-names>A.</given-names></name> <name><surname>Scott</surname> <given-names>J. G.</given-names></name></person-group> (<year>2018</year>). <article-title>TDAstats: R pipeline for computing persistent homology in topological data analysis</article-title>. <source>J. Open Source Softw</source>. <volume>3</volume>:<fpage>860</fpage>. <pub-id pub-id-type="doi">10.21105/joss.00860</pub-id><pub-id pub-id-type="pmid">33381678</pub-id></citation></ref>
<ref id="B128">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>G.</given-names></name></person-group> (<year>2016</year>). <article-title>A perspective on deep imaging</article-title>. <source>IEEE Access</source> <volume>4</volume>, <fpage>8914</fpage>&#x02013;<lpage>8924</lpage>. <pub-id pub-id-type="doi">10.1109/ACCESS.2016.2624938</pub-id></citation></ref>
<ref id="B129">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Weinberger</surname> <given-names>S.</given-names></name></person-group> (<year>2014</year>). <article-title>The complexity of some topological inference problems</article-title>. <source>Found. Comput. Math</source>. <volume>14</volume>, <fpage>1277</fpage>&#x02013;<lpage>1285</lpage>. <pub-id pub-id-type="doi">10.1007/s10208-013-9152-1</pub-id></citation></ref>
<ref id="B130">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Wilkerson</surname> <given-names>A. C.</given-names></name> <name><surname>Chintakunta</surname> <given-names>H.</given-names></name> <name><surname>Krim</surname> <given-names>H.</given-names></name></person-group> (<year>2014</year>). <article-title>&#x0201C;Computing persistent features in big data: a distributed dimension reduction approach,&#x0201D;</article-title> in <source>2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</source> (<publisher-loc>Florence</publisher-loc>), <fpage>11</fpage>&#x02013;<lpage>15</lpage>. <pub-id pub-id-type="doi">10.1109/ICASSP.2014.6853548</pub-id></citation></ref>
<ref id="B131">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Xiaohua</surname> <given-names>C.</given-names></name> <name><surname>Brady</surname> <given-names>M.</given-names></name> <name><surname>Rueckert</surname> <given-names>D.</given-names></name></person-group> (<year>2004</year>). <article-title>&#x0201C;Simultaneous segmentation and registration for medical image,&#x0201D;</article-title> in <source>International Conference on Medical Image Computing and Computer-Assisted Intervention</source> (<publisher-loc>Saint-Malo</publisher-loc>), <fpage>663</fpage>&#x02013;<lpage>670</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-540-30135-6_81</pub-id></citation></ref>
<ref id="B132">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname> <given-names>K. H.</given-names></name> <name><surname>Zhang</surname> <given-names>C.</given-names></name> <name><surname>Berry</surname> <given-names>G. J.</given-names></name> <name><surname>Altman</surname> <given-names>R. B.</given-names></name> <name><surname>R&#x000E9;</surname> <given-names>C.</given-names></name> <name><surname>Rubin</surname> <given-names>D. L.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features</article-title>. <source>Nat. Commun</source>. <volume>7</volume>, <fpage>1</fpage>&#x02013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1038/ncomms12474</pub-id><pub-id pub-id-type="pmid">27527408</pub-id></citation></ref>
<ref id="B133">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yuan</surname> <given-names>Y.</given-names></name> <name><surname>Failmezger</surname> <given-names>H.</given-names></name> <name><surname>Rueda</surname> <given-names>O. M.</given-names></name> <name><surname>Raza Ali</surname> <given-names>H.</given-names></name> <name><surname>Gr&#x000E4;f</surname> <given-names>S.</given-names></name> <name><surname>Chin</surname> <given-names>S. F.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Quantitative image analysis of cellular heterogeneity in breast tumors complements genomic profiling</article-title>. <source>Sci. Transl. Med</source>. <volume>4</volume>:<fpage>157r</fpage>a143. <pub-id pub-id-type="doi">10.1126/scitranslmed.3004330</pub-id><pub-id pub-id-type="pmid">23100629</pub-id></citation></ref>
<ref id="B134">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Brady</surname> <given-names>M.</given-names></name> <name><surname>Smith</surname> <given-names>S.</given-names></name></person-group> (<year>2001</year>). <article-title>Segmentation of brain MR images through a hidden markov random field model and the expectation-maximization algorithm</article-title>. <source>IEEE Trans. Med. Imaging</source> <volume>20</volume>:<fpage>45</fpage>. <pub-id pub-id-type="doi">10.1109/42.906424</pub-id><pub-id pub-id-type="pmid">11293691</pub-id></citation></ref>
<ref id="B135">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname> <given-names>B.</given-names></name> <name><surname>Pritchard</surname> <given-names>J. R.</given-names></name> <name><surname>Lauffenburger</surname> <given-names>D. A.</given-names></name> <name><surname>Hemann</surname> <given-names>M. T.</given-names></name></person-group> (<year>2014</year>). <article-title>Addressing genetic tumor heterogeneity through computationally predictive combination therapy</article-title>. <source>Cancer Discov</source>. <volume>4</volume>, <fpage>166</fpage>&#x02013;<lpage>174</lpage>. <pub-id pub-id-type="doi">10.1158/2159-8290.CD-13-0465</pub-id><pub-id pub-id-type="pmid">24318931</pub-id></citation></ref>
<ref id="B136">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>X.</given-names></name> <name><surname>Carbonetto</surname> <given-names>P.</given-names></name> <name><surname>Stephens</surname> <given-names>M.</given-names></name></person-group> (<year>2013</year>). <article-title>Polygenic modeling with bayesian sparse linear mixed models</article-title>. <source>PLoS Genet</source>. <volume>9</volume>:<fpage>e1003264</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1003264</pub-id><pub-id pub-id-type="pmid">23408905</pub-id></citation></ref>
<ref id="B137">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Zomorodian</surname> <given-names>A.</given-names></name></person-group> (<year>2010</year>). <article-title>&#x0201C;The tidy set: a minimal simplicial set for computing homology of clique complexes [extended abstract],&#x0201D;</article-title> in <source>Computational Geometry (SCG&#x00027;10)</source> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>ACM</publisher-name>), <fpage>257</fpage>&#x02013;<lpage>266</lpage>. <pub-id pub-id-type="doi">10.1145/1810959.1811004</pub-id></citation></ref>
<ref id="B138">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zomorodian</surname> <given-names>A.</given-names></name> <name><surname>Carlsson</surname> <given-names>G.</given-names></name></person-group> (<year>2005</year>). <article-title>Computing persistent homology</article-title>. <source>Discrete Comput. Geom</source>. <volume>33</volume>, <fpage>249</fpage>&#x02013;<lpage>274</lpage>. <pub-id pub-id-type="doi">10.1007/s00454-004-1146-y</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn0001"><p><sup>1</sup>While the intuitive definition will suffice for this paper, we have left out a number of details. For example if we use addition with &#x02124;<sub>2</sub> coefficients, we can detect the Klein bottle surface (&#x003B2;<sub>2</sub> &#x0003D; 1), while if we use &#x02124; coefficients, &#x003B2;<sub>2</sub> &#x0003D; 0 since the Klein bottle does not bound a void. For computational speed, &#x02124;<sub>2</sub> coefficients are frequently used when computing persistent homology.</p></fn>
</fn-group>
<fn-group>
<fn fn-type="financial-disclosure"><p><bold>Funding.</bold> AB was supported by the National Science Foundation Graduate Research Fellowship Program under Grant No. 1746051. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author and do not necessarily reflect the views of the National Science Foundation.</p>
</fn>
</fn-group>
</back>
</article>