<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Bioinform.</journal-id>
<journal-title>Frontiers in Bioinformatics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Bioinform.</abbrev-journal-title>
<issn pub-type="epub">2673-7647</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">1090730</article-id>
<article-id pub-id-type="doi">10.3389/fbinf.2023.1090730</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Bioinformatics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Bootstrap confidence for molecular evolutionary estimates from tumor bulk sequencing data</article-title>
<alt-title alt-title-type="left-running-head">Huzar et al.</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fbinf.2023.1090730">10.3389/fbinf.2023.1090730</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Huzar</surname>
<given-names>Jared</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1589720/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Shenoy</surname>
<given-names>Madelyn</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Sanderford</surname>
<given-names>Maxwell D.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Kumar</surname>
<given-names>Sudhir</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/413179/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Miura</surname>
<given-names>Sayaka</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1240951/overview"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>Institute for Genomics and Evolutionary Medicine</institution>, <institution>Temple University</institution>, <addr-line>Philadelphia</addr-line>, <addr-line>PA</addr-line>, <country>United States</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Department of Biology</institution>, <institution>Temple University</institution>, <addr-line>Philadelphia</addr-line>, <addr-line>PA</addr-line>, <country>United States</country>
</aff>
<aff id="aff3">
<sup>3</sup>
<institution>Center for Excellence in Genomic Medicine Research</institution>, <institution>King Abdulaziz University</institution>, <addr-line>Jeddah</addr-line>, <country>Saudi Arabia</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1548109/overview">Wuming Gong</ext-link>, University of Minnesota Twin Cities, United States</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/912585/overview">Anoosha Paruchuri</ext-link>, The Ohio State University, United States</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/918017/overview">Dake Zhang</ext-link>, Beihang University, China</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Sayaka Miura, <email>sayaka.miura@temple.edu</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Genomic Analysis, a section of the journal Frontiers in Bioinformatics</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>16</day>
<month>05</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>3</volume>
<elocation-id>1090730</elocation-id>
<history>
<date date-type="received">
<day>05</day>
<month>11</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>28</day>
<month>03</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2023 Huzar, Shenoy, Sanderford, Kumar and Miura.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Huzar, Shenoy, Sanderford, Kumar and Miura</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Bulk sequencing is commonly used to characterize the genetic diversity of cancer cell populations in tumors and the evolutionary relationships of cancer clones. However, bulk sequencing produces aggregate information on nucleotide variants and their sample frequencies, necessitating computational methods to predict distinct clone sequences and their frequencies within a sample. Interestingly, no methods are available to measure the statistical confidence in the variants assigned to inferred clones. We introduce a bootstrap resampling approach that combines clone prediction and statistical confidence calculation for every variant assignment. Analysis of computer-simulated datasets showed the bootstrap approach to work well in assessing the reliability of predicted clones as well downstream inferences using the predicted clones (e.g., mapping metastatic migration paths). We found that only a fraction of inferences have good bootstrap support, which means that many inferences are tentative for real data. Using the bootstrap approach, we analyzed empirical datasets from metastatic cancers and placed bootstrap confidence on the estimated number of mutations involved in cell migration events. We found that the numbers of driver mutations involved in metastatic cell migration events sourced from primary tumors are similar to those where metastatic tumors are the source of new metastases. So, mutations with driver potential seem to keep arising during metastasis. The bootstrap approach developed in this study is implemented in software available at <ext-link ext-link-type="uri" xlink:href="https://github.com/SayakaMiura/CloneFinderPlus">https://github.com/SayakaMiura/CloneFinderPlus</ext-link>.</p>
</abstract>
<kwd-group>
<kwd>tumor evolution</kwd>
<kwd>bootstrap</kwd>
<kwd>bulk sequencing</kwd>
<kwd>metastasis</kwd>
<kwd>driver mutation</kwd>
</kwd-group>
<contract-num rid="cn001">LM013385-03</contract-num>
<contract-sponsor id="cn001">National Institutes of Health<named-content content-type="fundref-id">10.13039/100000002</named-content>
</contract-sponsor>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Genomic Analysis</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<p>Tumors are characterized by a mixture of cell populations in which many distinct clones arise due to somatic mutations (<xref ref-type="bibr" rid="B10">Gerlinger et al., 2012</xref>; <xref ref-type="bibr" rid="B24">Nik-Zainal et al., 2012</xref>; <xref ref-type="bibr" rid="B6">de Bruin et al., 2014</xref>; <xref ref-type="bibr" rid="B33">Zhao et al., 2016</xref>). These clones may increase in frequency during tumor progression, and they may spread to other locations resulting in metastasis (<xref ref-type="bibr" rid="B10">Gerlinger et al., 2012</xref>; <xref ref-type="bibr" rid="B24">Nik-Zainal et al., 2012</xref>; <xref ref-type="bibr" rid="B6">de Bruin et al., 2014</xref>; <xref ref-type="bibr" rid="B33">Zhao et al., 2016</xref>). Genetic variation in tumors is commonly profiled by bulk sequencing of tumor samples. In bulk sequencing, many cells in the sample are sequenced together to produce somatic variants and their population frequencies. This information informs the degree of genetic heterogeneity in tumors, but not the number of distinct clones present or the sequences of these clones. Knowledge of individual clone sequences is necessary to reconstruct the evolutionary relationship of tumor cells, the dynamics of mutational processes, and the history of metastatic cell migrations (<xref ref-type="bibr" rid="B11">Gundem et al., 2015</xref>; <xref ref-type="bibr" rid="B31">Wei et al., 2017</xref>; <xref ref-type="bibr" rid="B30">Turajlic et al., 2018</xref>; <xref ref-type="bibr" rid="B1">Alves et al., 2019</xref>; <xref ref-type="bibr" rid="B15">Kumar et al., 2020</xref>; <xref ref-type="bibr" rid="B4">Chen et al., 2022</xref>; <xref ref-type="bibr" rid="B5">Chroni et al., 2022</xref>; <xref ref-type="bibr" rid="B20">Miura et al., 2022</xref>).</p>
<p>For this reason, several methods for analyzing bulk sequencing data are available (<xref ref-type="bibr" rid="B2">Beerenwinkel et al., 2015</xref>; <xref ref-type="bibr" rid="B21">Miura et al., 2020</xref>). Some methods are designed to identify clusters of genetic variants with similar variant allele frequencies (VAFs) indicative of their co-presence in the same genotype (<xref ref-type="bibr" rid="B27">Roth et al., 2014</xref>; <xref ref-type="bibr" rid="B17">Malikic et al., 2015</xref>; <xref ref-type="bibr" rid="B25">Popic et al., 2015</xref>; <xref ref-type="bibr" rid="B9">El-Kebir et al., 2018</xref>; <xref ref-type="bibr" rid="B32">Xiao et al., 2020</xref>). This strategy is commonly used when bulk sequencing data from only a single sample is available. More accurate clone predictions can be achieved when multiple tumor samples are sequenced from a patient, which enables the inference of clone genotypes and their evolutionary relationships (<xref ref-type="bibr" rid="B22">Murugaesu et al., 2015</xref>; <xref ref-type="bibr" rid="B12">Hao et al., 2016</xref>; <xref ref-type="bibr" rid="B13">Harbst et al., 2016</xref>; <xref ref-type="bibr" rid="B26">Reiter et al., 2017</xref>; <xref ref-type="bibr" rid="B18">Martinez et al., 2018</xref>; <xref ref-type="bibr" rid="B19">Miura et al., 2018</xref>; <xref ref-type="bibr" rid="B14">Hu et al., 2019</xref>).</p>
<p>Inferred clone sequences from bulk sequencing data are estimates. However, none of the current clone prediction methods provide an assessment of the uncertainty associated with these estimates. Uncertainty in clone inferences should occur because they are based on the similarities of VAFs that are calculated from observed sequencing reads with and without variants. Especially when the number of reads is small, the variance of VAF can be large (<xref ref-type="fig" rid="F1">Figure 1</xref>). Thus, single nucleotide variants (SNVs) with small read counts are expected to affect the accuracy of clone prediction more strongly than those with large read counts. Unfortunately, all current methods comparing VAFs ignore this variance and simply present inferred clones and variation assignments without presenting the assignment variance. Here, we suggest using a bootstrap resampling approach to overcome this shortcoming. We have implemented this idea for use with the CloneFinder method (<xref ref-type="bibr" rid="B19">Miura et al., 2018</xref>) to demonstrate the usefulness of the bootstrap resampling in assessing the uncertainty of clone inference and embracing it in the downstream analysis such as the mapping of metastatic migration histories. We apply the bootstrap approach to analyze an empirical dataset, which yields insights into driver mutations and metastasis migrations.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>The impact of sequencing read counts on the variance of variant allele frequencies (VAFs). A variant with 20% VAF was simulated with 20 and 100 total reads. Reads were randomly sampled to generate 30 bootstrap replicates for each dataset.</p>
</caption>
<graphic xlink:href="fbinf-03-1090730-g001.tif"/>
</fig>
</sec>
<sec sec-type="materials|methods" id="s2">
<title>2 Materials and methods</title>
<sec id="s2-1">
<title>2.1 Bootstrap approach for tumor evolution estimates</title>
<p>Our bootstrap approach samples sequencing reads with and without variants at genomic positions (<xref ref-type="fig" rid="F2">Figures 2A, B</xref>). The total number of reads sampled at a given position remains the same as in the original dataset, but reads are sampled with replacement. Since the same read can be sampled multiple times, some reads and even variants will be missing at that position in the bootstrap replicate dataset. Reads are resampled for each position, and a bootstrap replicate data is generated for each tumor sample. A pseudo-multi-tumor dataset is then generated by combining bootstrap replicates of tumor samples, and this pseudo-multi-tumor dataset is analyzed using the desired clone prediction method (e.g., CloneFinder) to infer clones (<xref ref-type="fig" rid="F2">Figure 2C</xref>). Similarly, more pseudo-multi-tumor datasets are generated, and clones are inferred in many bootstrap replicates.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Overview of the bootstrap approach for tumor bulk sequencing data analysis. A bootstrap replicate dataset is generated by sampling reads <bold>(A,B)</bold>. Using a clone inference method, clones are generated from this bootstrap replicate dataset <bold>(C)</bold>. The presence and absence of variants are indicated with &#x201c;1&#x201d; and &#x201c;0,&#x201d; respectively. The procedure is repeated by generating many bootstrap replicates, each producing clones. All the clones from all the bootstrap replicate datasets are pooled together and consensus clones can be produced <bold>(D)</bold>. If a study intends to use the bootstrap approach for downstream analysis (e.g., mapping metastatic migration histories), we infer the migration history for every bootstrap replicate dataset using the phylogeny of clones inferred in that replicate <bold>(E)</bold>. All the migration paths inferred using bootstrap datasets are pooled and consensus migration history with bootstrap support can be generated <bold>(F)</bold>. The tumor sampling site is written inside the box, and the arrows indicate cell migration events. The blue boxes and blue arrows are the primary tumor sites and migration events from the primary tumor, respectively. The metastatic tumor sites and migrations from them are shown in red. A number next to an arrow is the bootstrap support (%). The dotted arrows are those with &#x3c;60% bootstrap support.</p>
</caption>
<graphic xlink:href="fbinf-03-1090730-g002.tif"/>
</fig>
<p>The bootstrap approach can be used to build consensus clone sequences. Predicted clones from all replicates are pooled together and (nearly) identical clone genotypes (for a user-supplied SNV count cutoff) are grouped (<xref ref-type="fig" rid="F2">Figure 2D</xref>). Each clone group is then represented by a consensus clone and the proportion of bootstrap replicates in which a clone appears is the bootstrap support for detecting that consensus clone. To construct a consensus clone sequence, a base reconstructed in a greater proportion of bootstrap clones than the desired threshold is selected for each variant position. The base assignment is marked ambiguous when none of the bases have received the minimum desired bootstrap support.</p>
<p>We lastly describe an application of the bootstrap approach to infer consensus tumor evolution estimates of metastatic migration history using predicted clones. As described above, a pseudo-multi-tumor dataset is first generated by sampling reads, and a set of bootstrap clones is inferred (<xref ref-type="fig" rid="F2">Figures 2A&#x2013;C</xref>). These bootstrap clones are subsequently analyzed (<xref ref-type="fig" rid="F2">Figure 2E</xref>). For example, we infer bootstrap cell migration history using the bootstrap clones from every bootstrap replicate dataset. Then, each migration history is expressed as a collection of individual migration paths. All the migration paths are pooled to build a consensus migration history, where paths with the highest frequency are connected with each other first, followed by others with lower frequencies until the minimum desired bootstrap support requirement is met (<xref ref-type="fig" rid="F2">Figure 2F</xref>).</p>
</sec>
<sec id="s2-2">
<title>2.2 Advanced CloneFinder (CloneFinder&#x2b;)</title>
<p>We implemented and tested the bootstrap approach in our CloneFinder method which is known to perform well for inferring clones using bulk-sequencing datasets from multi-tumor samples (<xref ref-type="bibr" rid="B19">Miura et al., 2018</xref>; <xref ref-type="bibr" rid="B21">Miura et al., 2020</xref>). We also took this opportunity to advance CloneFinder by adding a step to preprocess the sequencing datasets by FastClone (<xref ref-type="bibr" rid="B32">Xiao et al., 2020</xref>) software that clusters SNVs based on VAFs. This allows CloneFinder to start with a larger collection of tumor genotypes than the original approach. Thus, the advanced CloneFinder, CloneFinder&#x2b;, begins with the analysis of VAFs using FastClone (<xref ref-type="fig" rid="F3">Figure 3A</xref>). For a tumor sample, FastClone clusters SNVs based on their VAF similarities and predicts relationships of SNV clusters, i.e., ancestor-descendant, sibling, or monoclonal (<xref ref-type="fig" rid="F3">Figure 3B</xref>). CloneFinder&#x2b; constructs candidate clone sequences by accumulating all predicted SNVs from the root cluster to a target cluster. These clones are the candidate clones for a given tumor sample. This analysis is done for every tumor sample individually, and candidate clones are inferred for each tumor sample. All candidate clones from all the tumor samples are then pooled and duplicate clones are removed. Also, potentially spurious candidate clone sequences are filtered, e.g., those with many ambiguous base assignments (Supplementary Note for details). Lastly, a candidate clone sequence matrix, M, is constructed (<xref ref-type="fig" rid="F3">Figure 3C</xref>). This is a binary matrix, where M<sub>
<italic>ij</italic>
</sub> &#x3d; 0/1 represents the absence/presence of a SNV at the <italic>j</italic>th variant in the <italic>i</italic>th candidate clone.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Overview of CloneFinder&#x2b;. CloneFinder&#x2b; first clusters genetic variants by their VAF similarities using FastClone and then identifies candidate clones for each tumor sample <bold>(A,B)</bold>. Next, all candidate clones are pooled and missing clones are searched using the CloneFinder method <bold>(C&#x2013;G)</bold>.</p>
</caption>
<graphic xlink:href="fbinf-03-1090730-g003.tif"/>
</fig>
<p>Next, we apply CloneFinder to infer clone genotypes (<xref ref-type="bibr" rid="B19">Miura et al., 2018</xref>) (<xref ref-type="fig" rid="F3">Figures 3C&#x2013;G</xref>). A phylogeny of candidate clones is first built using the matrix M. This phylogeny is used to identify missing ancestral clones that have persisted without being replaced by their descendant clones. Ancestral clone sequences are inferred at internal nodes of the phylogeny, and all unique ancestral clones are added to the collection of candidate clones, M (<xref ref-type="fig" rid="F3">Figure 3E</xref>). Then, the presence of these candidate clones in the bulk-sequencing dataset is assessed using VAFs. A matrix of VAFs, V, is constructed, where the number of rows is equal to the number of tumor samples and the number of columns is equal to the number of SNVs (<xref ref-type="fig" rid="F3">Figure 3D</xref>). Next, tumor frequencies of all candidate clones are estimated through regression analysis. Here, V, M, and f, a two-dimensional matrix of estimated clone frequencies of the tumor samples, have the following relationship,<disp-formula id="e1">
<mml:math id="m1">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi mathvariant="normal">f</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">M</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mi mathvariant="normal">V</mml:mi>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
<label>(1)</label>
</disp-formula>
</p>
<p>This relationship is valid only when variants are not affected by copy number alterations (CNAs). Thus, variants that are affected by CNAs need to be excluded from the dataset; or VAFs should be adjusted before the analysis using estimated cancer cell fraction (CCF), i.e., VAF is CCF divided by two. When an estimated clone frequency within a given tumor sample is lower than the desired threshold, that clone is assumed to be absent from the tumor sample. Candidate clones predicted to be absent from all tumor samples are removed.</p>
<p>Next, CloneFinder &#x2b; decomposes potential hybrid sequences using the strategy in CloneFinder (<xref ref-type="bibr" rid="B19">Miura et al., 2018</xref>) (<xref ref-type="fig" rid="F3">Figure 3F</xref>). Briefly, SNVs with the same expected VAF are subclassified into two groups based on their similarity to observed VAFs. Alternate candidate clone sequences are constructed by combining SNV groups. All possible combinations of SNV groups are added to the M and are evaluated using the regression analysis (Eq. <xref ref-type="disp-formula" rid="e1">1</xref>). All combinations estimated to be present in appreciable frequency are retained. Accordingly, M is updated, and this process of searching ancestral and hybrid clones is repeated until no new clone sequences are added. Lastly, CloneFinder&#x2b; finalizes the predicted clone sequences, where variants not assigned to any clone sequences are added to a clone based on their VAF similarities (<xref ref-type="fig" rid="F3">Figure 3G</xref>).</p>
</sec>
<sec id="s2-3">
<title>2.3 Assembly of computer-simulated data</title>
<p>We obtained previously simulated datasets from <ext-link ext-link-type="uri" xlink:href="https://github.com/raphael-group/machina">https://github.com/raphael-group/machina</ext-link>. These datasets were generated by modeling the evolution of primary and metastatic tumors (<xref ref-type="bibr" rid="B9">El-Kebir et al., 2018</xref>). In this simulation, metastatic tumors were founded by one or more than one cancer cells that migrated from another tumor site (primary or another metastatic tumor). Clones were defined as a group of cells with the same sequences. Thus, a new clone could arise even from a single mutation. Each clone phylogeny was unique, and the number of clones was 6&#x2013;26. Each dataset contained 9&#x2013;99 SNVs. We excluded five datasets that contained tumor samples with only one variant because this is unrealistic in empirical data. In total, we analyzed 75 datasets.</p>
</sec>
<sec id="s2-4">
<title>2.4 Assembly of empirical data</title>
<p>We obtained metastatic cancer datasets from <xref ref-type="bibr" rid="B33">Zhao et al. (2016)</xref>, who performed bulk sequencing (exome sequencing) and identified SNVs. For each SNV, these datasets contained total sequencing read counts and counts of reads with altered bases (SNVs). In total, we obtained 40 datasets with three to eight tumors.</p>
<p>For each SNV, we predicted if it was a driver mutation using the Cancer Genome Interpreter (CGI) web tool (<ext-link ext-link-type="uri" xlink:href="http://www.cancergenomeinterpreter.org/">http://www.cancergenomeinterpreter.org</ext-link>), which uses the OncodriveMUT method (<xref ref-type="bibr" rid="B28">Tamborero et al., 2018</xref>). We also used the CRAVAT web tool (<ext-link ext-link-type="uri" xlink:href="http://www/">http://www</ext-link>.cravat.us), which performs the CHASM prediction (<xref ref-type="bibr" rid="B3">Carter et al., 2009</xref>; <xref ref-type="bibr" rid="B7">Douville et al., 2013</xref>). Driver mutations were predicted without specifying a cancer type.</p>
<p>To map mutations at branches of the phylogeny, we analyzed predicted clone sequences and reconstructed ancestral clone sequences using MEGA (<xref ref-type="bibr" rid="B16">Kumar et al., 2012</xref>; <xref ref-type="bibr" rid="B29">Tamura et al., 2021</xref>).</p>
</sec>
<sec id="s2-5">
<title>2.5 Data analysis with the bootstrap approach</title>
<p>Each simulated and empirical dataset was analyzed using CloneFinder&#x2b;, and the reliability of the inferences was assessed using the bootstrap approach. In the bootstrap analysis, we generated 30 replicates because using more (100) replicates essentially produced the same result (<xref ref-type="sec" rid="s10">Supplementary Figure S1</xref>). In the CloneFinder &#x2b; analysis, we clustered variants without giving the tumor purity (the step of FastClone analysis) and used variants with at least 50 reference read counts and two mutant read counts to assess the quality of candidate clones. Note that CloneFinder&#x2b; does not require the value of tumor purity and the maximum number of clones to be inferred. During the analysis, candidate clone genotypes with &#x3c;1% clone frequencies for all tumor samples were discarded. We grouped identical bootstrap clones to derive consensus clone sequences while allowing at most one base assignment difference. We selected a base in &#x3e;90% of bootstrap clones for each variant position.</p>
<p>We inferred cell migration history using the PathFinder method (<xref ref-type="bibr" rid="B15">Kumar et al., 2020</xref>), and the reliability was assessed using the bootstrap approach. PathFinder was performed by providing the correct primary tumor sites, sequences of CloneFinder&#x2b; clones that were predicted with &#x3e;5% clone frequencies, and tumor sites that contained each clone. Note that PathFinder does not require the value of tumor purity.</p>
</sec>
<sec id="s2-6">
<title>2.6 Accuracy measurements</title>
<p>To evaluate the accuracy in inferring correct clones, we paired each simulated clone sequence with its most similar inferred clone sequence. We allowed an inferred clone to be paired with more than one simulated clone. We counted the number of sequence differences between inferred and simulated clones paired. We calculated the average when more than one inferred clone was paired with a given simulated clone, which was divided by the sequence length to estimate genotype error (GE) for a given simulated clone.</p>
<p>To evaluate the accuracy of inferred migration history, we counted the number of migration paths that were correctly inferred, those not identified, and incorrect paths following a previous study (<xref ref-type="bibr" rid="B15">Kumar et al., 2020</xref>).</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>3 Results</title>
<sec id="s3-1">
<title>3.1 Bootstrap confidence for predicted clones</title>
<p>The bootstrap approach is widely employed in molecular evolutionary and phylogenetic analyses to estimate variances and confidence limits (<xref ref-type="bibr" rid="B8">Efron and Tibshirani, 1994</xref>; <xref ref-type="bibr" rid="B23">Nei and Kumar, 2000</xref>). Our bootstrap approach samples sequencing reads. So, VAFs will be perturbed, with greater perturbations experienced by VAFs computed from a small number of reads (<xref ref-type="fig" rid="F1">Figure 1</xref>). Since the number of reads is often highly variable among SNVs in empirical data [e.g., 8&#x2013;1,436 for ATP401 from <xref ref-type="bibr" rid="B33">Zhao et al. (2016)</xref>], the variances of their VAFs are also variable. We first show the GEs of inferred clones and their bootstrap supports.</p>
<p>We found that the median GE of the clones with higher bootstrap supports (&#x3e;50%) was zero (<xref ref-type="fig" rid="F4">Figure 4A</xref>). Therefore, clones with &#x3e;50% bootstrap support can be considered to be reliably inferred. We also found that many bootstrap consensus clones (77%) received low bootstrap supports (&#x2264;10%), and these clones contained many genotype errors (GE &#x3e;5%; <xref ref-type="fig" rid="F4">Figure 4A</xref>). Interestingly, these clones had a wide range of GEs, even overlapping with those from higher bootstrap supports. In an extreme case, a few consensus clones with &#x3c;10% bootstrap support had correct clone sequences, i.e., GE &#x3d; 0. Therefore, some simulated (correct) clones were not repeatedly detected among bootstrap replicates, reducing bootstrap support.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Genotype error (GE) and bootstrap supports. CloneFinder&#x2b; was used to infer clones, and the bootstrap confidence limit was calculated using the bootstrap approach. GE was computed for each clone. All datasets <bold>(A)</bold>, datasets with polyclonal seeding <bold>(B)</bold>, those with monoclonal seeding <bold>(C)</bold>, those with five tumors <bold>(D)</bold>, and those with eight tumors <bold>(E)</bold> were used. The number at the top of a box plot is the number of inferred clones, and the proportion of clones with &#x3e;5% GE was shown at the bottom. <italic>T</italic>-test was performed, and the <italic>p</italic>-values were given, i.e., &#x2a; for <italic>p</italic> &#x3c; 0.05 and &#x2a;&#x2a; for <italic>p</italic> &#x3c;&#x3c; 0.01. The bootstrap confidence was the proportion of replicates that produced the clone. All the simulated datasets were used.</p>
</caption>
<graphic xlink:href="fbinf-03-1090730-g004.tif"/>
</fig>
<p>We next tested if different scenarios of the cell migration history affected the performance of the bootstrap approach. These simulated datasets were generated by modeling the evolution of primary and metastatic tumors, where metastatic tumors were seeded by clones that migrated from either a primary or a metastatic tumor. We classified the datasets into (1) those with metastatic tumors that received a single seeding clone from another tumor site, i.e., monoclonal seeding, and (2) those with more than one seeding event, i.e., polyclonal seeding. Thus, intra-tumor heterogeneity of tumors from polyclonal seeding is higher, while inter-tumor heterogeneity among tumor sites is smaller than those with monoclonal seeding.</p>
<p>We found that the bootstrap approach performed well on both types of datasets, as inferred clones with greater bootstrap confidence tended to have more accurately inferred clone sequences (lower GE) (<xref ref-type="fig" rid="F4">Figures 4B, C</xref>). We also found that GE tended to be slightly better for datasets with monoclonal seeding, indicating that inferred clone sequences were slightly more accurate for data with monoclonal seeding, i.e., lower intra-tumor heterogeneity with higher-inter tumor heterogeneity. This pattern was consistent with previous studies (<xref ref-type="bibr" rid="B21">Miura et al., 2020</xref>).</p>
<p>We also tested the impact of the number of tumors on the performance of the bootstrap approach. These simulated datasets were composed of five or eight tumors, i.e., four or seven metastatic tumors with a primary tumor per dataset. For both datasets with five and eight tumors, we similarly found that clones with greater bootstrap confidence tended to have more accurate clone sequences, i.e., a lower GE (<xref ref-type="fig" rid="F4">Figures 4D, E</xref>). Therefore, the number of tumors in a dataset did not affect the performance of the bootstrap approach to place the reliability, while inferred clone sequences were more accurate for datasets with a larger number of tumors (lower GE). This was consistent with previous studies (<xref ref-type="bibr" rid="B21">Miura et al., 2020</xref>). Overall, these results suggested that the bootstrap approach is useful for assessing the reliability of inferred clone sequences.</p>
</sec>
<sec id="s3-2">
<title>3.2 Analysis of empirical data</title>
<p>Next, we tested the performance of the bootstrap approach using 40 empirical datasets from various cancer types. As observed for the simulation study, most of the bootstrap consensus clones had low bootstrap supports (&#x3c;10%), and few clones were identified with good (&#x3e;50%) bootstrap supports (<xref ref-type="fig" rid="F5">Figure 5A</xref>). We did not find a clear association between cancer types and bootstrap support values, as the distribution of bootstrap values was similar among different cancer types (<xref ref-type="fig" rid="F5">Figure 5B</xref>). We also did not observe an association between tumor mutation burden (number of mutations) and bootstrap values for any cancer types (<xref ref-type="fig" rid="F5">Figures 5C&#x2013;F</xref>). Therefore, the performance of the bootstrap approach was not affected by either cancer type or tumor mutation burden.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>Empirical data analysis. The 40 empirical datasets from various cancer types were analyzed with the bootstrap approach. <bold>(A)</bold> Bootstrap support of consensus clones. All consensus clones from each dataset were pooled. The number in the parenthesis is the number of clones with &#x2264;10% bootstrap support. <bold>(B)</bold> Bootstrap supports of clones for each cancer type. The cancer type is the primary tumor site. Eight datasets do not have information on the primary tumor site, so the cancer type is &#x201c;unknown.&#x201d; The number at the top of each box plot is the number of patients, and the total number of bootstrap consensus clones with &#x3e;10% bootstrap support is shown in parenthesis. <bold>(C&#x2013;F)</bold> The relationship between tumor mutation burden and bootstrap support of a clone for lung <bold>(C)</bold>, pancreas <bold>(D)</bold>, and other cancer with low <bold>(E)</bold> and high <bold>(F)</bold> tumor mutation burden. The tumor mutation burden is the number of total variants in a dataset. The trend line was <bold>(C)</bold> <italic>y</italic> &#x3d; 0.0049<italic>x</italic> &#x2b; 66.03 (<italic>R</italic>
<sup>2</sup> &#x3d; 0.0026), <bold>(D)</bold> <italic>y</italic> &#x3d; 0.097<italic>x</italic> &#x2b; 48.035 (<italic>R</italic>
<sup>2</sup> &#x3d; 0.018), <bold>(E)</bold> <italic>y</italic> &#x3d; 0.092<italic>x</italic> &#x2b; 59.50 (<italic>R</italic>
<sup>2</sup> &#x3d; 0.024), and <bold>(F)</bold> <italic>y</italic> &#x3d; 54.17 (<italic>R</italic>
<sup>2</sup> &#x3d; 0). Clones with &#x2264;10% bootstrap support were excluded <bold>(B&#x2013;F)</bold>.</p>
</caption>
<graphic xlink:href="fbinf-03-1090730-g005.tif"/>
</fig>
</sec>
<sec id="s3-3">
<title>3.3 Bootstrap confidence for inferred cell migration histories</title>
<p>Since the bootstrap approach performed well to place a confidence limit on inferred clones, we next tested if the bootstrap approach is also useful to assess the reliability of a downstream inference of predicted clones. As an example of a downstream analysis of predicted clones, we inferred metastatic cell migration histories using the same simulated datasets.</p>
<p>We found that correct paths often had a high bootstrap support (a median bootstrap support &#x3d; 86.5%), while incorrect paths tended to have low bootstrap supports (a median of 10%) (<xref ref-type="fig" rid="F6">Figure 6</xref>). However, as observed in the analysis of inferred clone sequences, a few correct paths were not well supported. Actually, bootstrap support for correct paths varied considerably, indicating that these paths were not repeatedly found in many bootstrap replicate datasets. Thus, some migration paths are difficult to reconstruct, which is consistent with previous findings (<xref ref-type="bibr" rid="B15">Kumar et al., 2020</xref>). Overall, these results indicated that the reliability of inferred migration path from predicted clones could be assessed using the bootstrap approach.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Performance of bootstrap approach to infer metastatic cell migration history. Inferred migration paths were classified into true-positive (TP; correct path) and false-positive (FP; incorrect path). All the simulated datasets were used.</p>
</caption>
<graphic xlink:href="fbinf-03-1090730-g006.tif"/>
</fig>
</sec>
<sec id="s3-4">
<title>3.4 Patterns of metastatic cell migrations and driver mutation occurrences</title>
<p>Since point estimates of clones and cell migration paths have limited accuracy, placing bootstrap confidence on inferences is essential in real empirical data analysis. As an example, we illustrate empirical data analysis for revealing the patterns of metastatic cell migrations and driver mutation occurrences. In this analysis, we excluded eight datasets without primary tumors because the information on the primary tumor site was necessary for the inference of migration history.</p>
<p>First, we show the result of a lung cancer patient with three metastatic tumors in the heart, liver, and GEJ (ATP401 patient). The inferred clone phylogeny using CloneFinder&#x2b; (without bootstrap reliability assessment) indicated that clone C1 originated from the root clone (the most recent common ancestor of all clones), and a metastatic tumor in the liver contained this clone (<xref ref-type="fig" rid="F7">Figure 7A</xref>). The PathFinder analysis (without bootstrap reliability assessment) using these predicted clones and phylogeny produced by CloneFinder&#x2b; predicted that clone C1 migrated from the primary tumor site (lung) to the liver. While 128 mutations were mapped on the branch leading to the C1 (B1 branch in <xref ref-type="fig" rid="F7">Figure 7A</xref>), the phylogeny does not tell us whether C1 acquired all, any, or a subset of new mutations in the lung. Nevertheless, seven B1 mutations were predicted to be drivers according to CGI (<xref ref-type="bibr" rid="B28">Tamborero et al., 2018</xref>), which may be important for metastasis. On the other hand, we also found that some migration events were not associated with any detected mutations as no mutations were mapped at corresponding branches (<xref ref-type="fig" rid="F7">Figures 7A, B</xref>). For example, clone C2.1 from metastasis GEJ seeded a heart metastasis without associated mutations, as the same clone is found in both locations. Overall, we found a greater average number of driver mutations in clones that moved from the primary tumor than those from metastatic tumors for this patient (<xref ref-type="fig" rid="F7">Figure 7B</xref>).</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>Clone phylogeny and metastatic cell migration history of a lung cancer patient. <bold>(A)</bold> Inferred clone phylogeny using CloneFinder&#x2b; without the bootstrap assessment. Grey circles represent tumor clones, and their predicted tumor sites (&#x3e;0% clone frequency) are shown within boxes below the clone IDs. Tumor sites shown at internal nodes are predicted sites by PathFinder. Letters along branches are the branch ID, and branches are colored based on predicted tumor sites. All mutations are mapped at branches of the phylogeny through ancestral sequence reconstruction. When a cell migration event is inferred at a branch, the number of drivers and total mutations are shown. <bold>(B)</bold> Inferred cell migration history by PathFinder using CloneFinder&#x2b; clones without the bootstrap assessment. The numbers of drivers and total mutations are shown for each migration path. The primary and metastatic tumors are shown in blue and red boxes, respectively. <bold>(C)</bold> Bootstrap consensus migration history. The number along a path is bootstrap support (%). Dotted arrows indicate paths with &#x3c;40% bootstrap support. <bold>(D)</bold> Driver mutation count and <bold>(E)</bold> driver mutation rates were compared between the paths originating from primary and metastatic tumors. The <italic>p</italic> values were computed using <italic>t</italic>-test. ATP401 patient was used. CGI was used for driver mutation prediction.</p>
</caption>
<graphic xlink:href="fbinf-03-1090730-g007.tif"/>
</fig>
<p>We next demonstrate how this observed pattern could be validated using the bootstrap approach. We found that all of the predicted cell migration paths in the single point PathFinder inference (<xref ref-type="fig" rid="F7">Figure 7B</xref>) were supported with high bootstrap confidences (&#x3e;60%), validating the PathFinder inference (<xref ref-type="fig" rid="F7">Figure 7C</xref>). In the bootstrap analysis, we mapped mutations on each bootstrap migration path and calculated the average driver mutation count per path for each replicate. We found that the number of driver mutations per path was significantly greater for those from the primary tumor than those from a metastatic tumor (<italic>p</italic> &#x3c; 0.01; <italic>t</italic>-test) (<xref ref-type="fig" rid="F7">Figure 7D</xref>). Since migration paths with a larger number of associated mutations may simply result in a larger number of driver mutations, we normalized the count of driver mutation, i.e., we computed the driver mutation rate by dividing it by the total number of mutations for a path. We excluded paths without any associated mutations. Similarly, the driver mutation rate was significantly greater for those from the primary tumor than those from a metastatic tumor (<italic>p</italic> &#x3c; 0.01 by <italic>t</italic>-test) (<xref ref-type="fig" rid="F7">Figure 7E</xref>). Therefore, driver mutations occurred more frequently at migrations from the primary tumor than those from metastatic tumors for this patient.</p>
<p>To test if most of the patients similarly had higher driver mutation rates for migration paths from the primary tumor than metastatic tumors, we analyzed 32 datasets of metastatic cancer patients. Similar to the ATP401 patient, many paths were from metastatic tumors (<xref ref-type="fig" rid="F8">Figure 8A</xref>), indicating that migration events from metastatic tumors were not rare, consistent with previous studies (<xref ref-type="bibr" rid="B15">Kumar et al., 2020</xref>; <xref ref-type="bibr" rid="B5">Chroni et al., 2022</xref>). We found that only six patients showed significant differences in driver mutation rates between those from primary tumors and from metastatic tumors (<italic>p</italic> &#x3c; 0.01 by <italic>t</italic>-test for both driver prediction methods), and all of them had higher driver mutation rates for paths from the primary tumors, which was similar to the ATP401 patient (<xref ref-type="fig" rid="F8">Figure 8B</xref>). However, most of the patients (26) did not show a significant difference in driver mutation rates between migration paths from primary and metastatic tumors (<italic>p</italic> &#x3e; 0.01 by <italic>t</italic>-test for at least one driver prediction method). Therefore, the numbers of driver mutations were often not significantly different between metastatic cell migration events sourced from primary tumors and those from metastatic tumors.</p>
<fig id="F8" position="float">
<label>FIGURE 8</label>
<caption>
<p>The number of cell migration paths and driver mutation rate. <bold>(A)</bold> The number of migration paths originating from the primary and the metastatic tumor sites. The average number among bootstrap replicates was calculated. An Error bar is the standard deviation. <bold>(B)</bold> The driver mutation rate of migration paths originating from the primary versus the metastatic tumor sites. The average rate among bootstrap replicates was calculated. An Error bar is the standard deviation. <italic>t</italic>-test was performed for each patient to test if the driver mutation rate was significantly different between those from the primary tumor and from metastatic tumors. Patients were sorted by the number of migration paths.</p>
</caption>
<graphic xlink:href="fbinf-03-1090730-g008.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>4 Discussion</title>
<p>In this study, we showed the potential of bootstrap resampling procedures to place confidence limits on estimates obtained from tumor sequencing data. We found that incorrect inferences tended to receive low bootstrap support. Overall, the bootstrap approach performed well to distinguish spurious inferences.</p>
<p>Although the primary usage of the bootstrap approach is to place a confidence limit on inferred clones and downstream analysis (e.g., cell migration inferences), consensus clone sequences and consensus cell migration history can be also built by aggregating all bootstrap inferences. It is important to note that consensus inferences are not expected to be extensively more accurate than the point estimates, because the analysis of bootstrap replicates should not repeatedly produce correct inferences that are not found in the point estimate (<xref ref-type="fig" rid="F9">Figure 9</xref>; <xref ref-type="sec" rid="s10">Supplementary Figure S2</xref>).</p>
<fig id="F9" position="float">
<label>FIGURE 9</label>
<caption>
<p>The accuracy of inferred clones by CloneFinder and single-point and bootstrap consensus inferences (ConsenGeno) by CloneFinder&#x2b;. For the consensus clones, clones with &#x3c;10% bootstrap support were removed. The genotype error (GE) was computed for each simulated dataset, i.e., the average over the pairs of inferred and simulated clones for a dataset. The number at the top of a box plot is the proportion of datasets with &#x3e;5% GE. <italic>t</italic>-test was performed, and the <italic>p</italic>-values were given, i.e., &#x2a; for <italic>p</italic> &#x3c; 0.05 and &#x2a;&#x2a; for <italic>p</italic> &#x3c;&#x3c; 0.01. See Supplementary Note for the parameter settings for CloneFinder. All simulated datasets were used.</p>
</caption>
<graphic xlink:href="fbinf-03-1090730-g009.tif"/>
</fig>
<p>In this study, we also developed CloneFinder&#x2b; by advancing CloneFinder, which now additionally analyzes the similarity of VAFs to cluster SNVs. The accuracy of CloneFinder&#x2b; was slightly better than the original version of CloneFinder (<xref ref-type="fig" rid="F9">Figure 9</xref>). In conclusion, we showed that the bootstrap approach performed well to place a confidence limit on tumor evolution inference. The bootstrap approach can be coupled with any clone prediction method. Therefore, it will be useful for method developers to add a bootstrapping option.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s5">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="sec" rid="s10">Supplementary Material</xref>, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s6">
<title>Author contributions</title>
<p>SM conceived the idea and supervised the project. JH and SM acquired and processed data. JH and SM developed methods and performed computational experiments. JH, SM, and MDS implemented the method. SM and MS performed empirical data analysis. All authors discussed the results. JH, SM, and SK wrote the manuscript. All authors read and approved the final manuscript.</p>
</sec>
<sec id="s7">
<title>Funding</title>
<p>This research was supported by the National Institutes of Health to SK (LM013385-03) and SM (LM014005-02).</p>
</sec>
<ack>
<p>We thank Dr. Antonia Chroni, Tracy Vu, Vivian Aly, Jiyeong Choi, and Nandita Nagendra for their technical support. Publication of this article was funded in part by the Temple University Libraries Open Access Publishing Fund.</p>
</ack>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s10">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fbinf.2023.1090730/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fbinf.2023.1090730/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="DataSheet1.PDF" id="SM1" mimetype="application/PDF" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Alves</surname>
<given-names>J. M.</given-names>
</name>
<name>
<surname>Prado-L&#xf3;pez</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Cameselle-Teijeiro</surname>
<given-names>J. M.</given-names>
</name>
<name>
<surname>Posada</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Rapid evolution and biogeographic spread in a colorectal cancer</article-title>. <source>Nat. Commun.</source> <volume>10</volume>, <fpage>5139</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-019-12926-8</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Beerenwinkel</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Schwarz</surname>
<given-names>R. F.</given-names>
</name>
<name>
<surname>Gerstung</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Markowetz</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Cancer evolution: Mathematical models and computational inference</article-title>. <source>Syst. Biol.</source> <volume>64</volume>, <fpage>e1</fpage>&#x2013;<lpage>e25</lpage>. <pub-id pub-id-type="doi">10.1093/sysbio/syu081</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Carter</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Isik</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Tyekucheva</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Velculescu</surname>
<given-names>V. E.</given-names>
</name>
<name>
<surname>Kinzler</surname>
<given-names>K. W.</given-names>
</name>
<etal/>
</person-group> (<year>2009</year>). <article-title>Cancer-specific high-throughput annotation of somatic mutations: Computational prediction of driver missense mutations</article-title>. <source>Cancer Res.</source> <volume>69</volume>, <fpage>6660</fpage>&#x2013;<lpage>6667</lpage>. <pub-id pub-id-type="doi">10.1158/0008-5472.can-09-1133</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>H.-N.</given-names>
</name>
<name>
<surname>Shu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Liao</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Liao</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Qin</surname>
<given-names>Y.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Genomic evolution and diverse models of systemic metastases in colorectal cancer</article-title>. <source>Gut</source> <volume>71</volume>, <fpage>322</fpage>&#x2013;<lpage>332</lpage>. <pub-id pub-id-type="doi">10.1136/gutjnl-2020-323703</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chroni</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Miura</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Hamilton</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Vu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Gaffney</surname>
<given-names>S. G.</given-names>
</name>
<name>
<surname>Aly</surname>
<given-names>V.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Clone phylogenetics reveals metastatic tumor migrations, maps, and models</article-title>. <source>Cancers</source> <volume>14</volume>, <fpage>4326</fpage>. <pub-id pub-id-type="doi">10.3390/cancers14174326</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>de Bruin</surname>
<given-names>E. C.</given-names>
</name>
<name>
<surname>McGranahan</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Mitter</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Salm</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Wedge</surname>
<given-names>D. C.</given-names>
</name>
<name>
<surname>Yates</surname>
<given-names>L.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Spatial and temporal diversity in genomic instability processes defines lung cancer evolution</article-title>. <source>Science</source> <volume>346</volume>, <fpage>251</fpage>&#x2013;<lpage>256</lpage>. <pub-id pub-id-type="doi">10.1126/science.1253462</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Douville</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Carter</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Niknafs</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Diekhans</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Stenson</surname>
<given-names>P. D.</given-names>
</name>
<etal/>
</person-group> (<year>2013</year>). <article-title>Cravat: Cancer-related analysis of variants toolkit</article-title>. <source>Bioinformatics</source> <volume>29</volume>, <fpage>647</fpage>&#x2013;<lpage>648</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btt017</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Efron</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Tibshirani</surname>
<given-names>R. J.</given-names>
</name>
</person-group> (<year>1994</year>). <source>An introduction to the bootstrap</source>. <publisher-loc>Florida</publisher-loc>: <publisher-name>CRC Press</publisher-name>.</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>El-Kebir</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Satas</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Raphael</surname>
<given-names>B. J.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Inferring parsimonious migration histories for metastatic cancers</article-title>. <source>Nat. Genet.</source> <volume>50</volume>, <fpage>718</fpage>&#x2013;<lpage>726</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-018-0106-z</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gerlinger</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Rowan</surname>
<given-names>A. J.</given-names>
</name>
<name>
<surname>Horswell</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Math</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Larkin</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Endesfelder</surname>
<given-names>D.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>Intratumor heterogeneity and branched evolution revealed by multiregion sequencing</article-title>. <source>N. Engl. J. Med.</source> <volume>366</volume>, <fpage>883</fpage>&#x2013;<lpage>892</lpage>. <pub-id pub-id-type="doi">10.1056/nejmoa1113205</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gundem</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Van Loo</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Kremeyer</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Alexandrov</surname>
<given-names>L. B.</given-names>
</name>
<name>
<surname>Tubio</surname>
<given-names>J. M. C.</given-names>
</name>
<name>
<surname>Papaemmanuil</surname>
<given-names>E.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>The evolutionary history of lethal metastatic prostate cancer</article-title>. <source>Nature</source> <volume>520</volume>, <fpage>353</fpage>&#x2013;<lpage>357</lpage>. <pub-id pub-id-type="doi">10.1038/nature14347</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hao</surname>
<given-names>J.-J.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>D.-C.</given-names>
</name>
<name>
<surname>Dinh</surname>
<given-names>H. Q.</given-names>
</name>
<name>
<surname>Mayakonda</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>Y.-Y.</given-names>
</name>
<name>
<surname>Chang</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Spatial intratumoral heterogeneity and temporal clonal evolution in esophageal squamous cell carcinoma</article-title>. <source>Nat. Genet.</source> <volume>48</volume>, <fpage>1500</fpage>&#x2013;<lpage>1507</lpage>. <pub-id pub-id-type="doi">10.1038/ng.3683</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Harbst</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Lauss</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Cirenajwis</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Isaksson</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Rosengren</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>T&#xf6;rngren</surname>
<given-names>T.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Multiregion whole-exome sequencing uncovers the genetic evolution and mutational heterogeneity of early-stage metastatic melanoma</article-title>. <source>Cancer Res.</source> <volume>76</volume>, <fpage>4765</fpage>&#x2013;<lpage>4774</lpage>. <pub-id pub-id-type="doi">10.1158/0008-5472.can-15-3476</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hu</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Fujimoto</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Ying</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Fukuoka</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Ashizawa</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>W.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Multi-region exome sequencing reveals genomic evolution from preneoplasia to lung adenocarcinoma</article-title>. <source>Nat. Commun.</source> <volume>10</volume>, <fpage>2978</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-019-10877-8</pub-id>
</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Chroni</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Tamura</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Sanderford</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Oladeinde</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Aly</surname>
<given-names>V.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>PathFinder: Bayesian inference of clone migration histories in cancer</article-title>. <source>Bioinformatics</source> <volume>36</volume>, <fpage>i675</fpage>&#x2013;<lpage>i683</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btaa795</pub-id>
</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Stecher</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Peterson</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Tamura</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>MEGA-CC: Computing core of molecular evolutionary genetics analysis program for automated and iterative data analysis</article-title>. <source>Bioinformatics</source> <volume>28</volume>, <fpage>2685</fpage>&#x2013;<lpage>2686</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bts507</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Malikic</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>McPherson</surname>
<given-names>A. W.</given-names>
</name>
<name>
<surname>Donmez</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Sahinalp</surname>
<given-names>C. S.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Clonality inference in multiple tumor samples using phylogeny</article-title>. <source>Bioinformatics</source> <volume>31</volume>, <fpage>1349</fpage>&#x2013;<lpage>1356</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btv003</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Martinez</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Mallo</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Paulson</surname>
<given-names>T. G.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Sanchez</surname>
<given-names>C. A.</given-names>
</name>
<name>
<surname>Reid</surname>
<given-names>B. J.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Evolution of Barrett&#x2019;s esophagus through space and time at single-crypt and whole-biopsy levels</article-title>. <source>Nat. Commun.</source> <volume>9</volume>, <fpage>794</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-017-02621-x</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Miura</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Gomez</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Murillo</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Huuki</surname>
<given-names>L. A.</given-names>
</name>
<name>
<surname>Vu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Buturla</surname>
<given-names>T.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Predicting clone genotypes from tumor bulk sequencing of multiple samples</article-title>. <source>Bioinformatics</source> <volume>34</volume>, <fpage>4017</fpage>&#x2013;<lpage>4026</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bty469</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Miura</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Vu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Choi</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Townsend</surname>
<given-names>J. P.</given-names>
</name>
<name>
<surname>Karim</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>A phylogenetic approach to study the evolution of somatic mutational processes in cancer</article-title>. <source>Commun. Biol.</source> <volume>5</volume>, <fpage>617</fpage>. <pub-id pub-id-type="doi">10.1038/s42003-022-03560-0</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Miura</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Vu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Deng</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Buturla</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Oladeinde</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Choi</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Power and pitfalls of computational methods for inferring clone phylogenies and mutation orders from bulk sequencing data</article-title>. <source>Sci. Rep.</source> <volume>10</volume>, <fpage>3498</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-020-59006-2</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Murugaesu</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Wilson</surname>
<given-names>G. A.</given-names>
</name>
<name>
<surname>Birkbak</surname>
<given-names>N. J.</given-names>
</name>
<name>
<surname>Watkins</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>McGranahan</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Tracking the genomic evolution of esophageal adenocarcinoma through neoadjuvant chemotherapy</article-title>. <source>Cancer Discov.</source> <volume>5</volume>, <fpage>821</fpage>&#x2013;<lpage>831</lpage>. <pub-id pub-id-type="doi">10.1158/2159-8290.cd-15-0412</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Nei</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2000</year>). <source>Molecular Evolution and phylogenetics</source>. <publisher-loc>New York</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nik-Zainal</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Van Loo</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Wedge</surname>
<given-names>D. C.</given-names>
</name>
<name>
<surname>Alexandrov</surname>
<given-names>L. B.</given-names>
</name>
<name>
<surname>Greenman</surname>
<given-names>C. D.</given-names>
</name>
<name>
<surname>Lau</surname>
<given-names>K. W.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>The life history of 21 breast cancers</article-title>. <source>Cell</source> <volume>149</volume>, <fpage>994</fpage>&#x2013;<lpage>1007</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2012.04.023</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Popic</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Salari</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Hajirasouliha</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Kashef-Haghighi</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>West</surname>
<given-names>R. B.</given-names>
</name>
<name>
<surname>Batzoglou</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Fast and scalable inference of multi-sample cancer lineages</article-title>. <source>Genome Biol.</source> <volume>16</volume>, <fpage>91</fpage>. <pub-id pub-id-type="doi">10.1186/s13059-015-0647-8</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Reiter</surname>
<given-names>J. G.</given-names>
</name>
<name>
<surname>Makohon-Moore</surname>
<given-names>A. P.</given-names>
</name>
<name>
<surname>Gerold</surname>
<given-names>J. M.</given-names>
</name>
<name>
<surname>Bozic</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Chatterjee</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Iacobuzio-Donahue</surname>
<given-names>C. A.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Reconstructing metastatic seeding patterns of human cancers</article-title>. <source>Nat. Commun.</source> <volume>8</volume>, <fpage>14114</fpage>. <pub-id pub-id-type="doi">10.1038/ncomms14114</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Roth</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Khattra</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Yap</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Wan</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Laks</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Biele</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>PyClone: Statistical inference of clonal population structure in cancer</article-title>. <source>Nat. Methods</source> <volume>11</volume>, <fpage>396</fpage>&#x2013;<lpage>398</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.2883</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tamborero</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Rubio-Perez</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Deu-Pons</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Schroeder</surname>
<given-names>M. P.</given-names>
</name>
<name>
<surname>Vivancos</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Rovira</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Cancer Genome Interpreter annotates the biological and clinical relevance of tumor alterations</article-title>. <source>Genome Med.</source> <volume>10</volume>, <fpage>25</fpage>. <pub-id pub-id-type="doi">10.1186/s13073-018-0531-8</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tamura</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Stecher</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>MEGA11: Molecular evolutionary genetics analysis version 11</article-title>. <source>Mol. Biol. Evol.</source> <volume>38</volume>, <fpage>3022</fpage>&#x2013;<lpage>3027</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msab120</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Turajlic</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Litchfield</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Rowan</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Chambers</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Lopez</surname>
<given-names>J. I.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Tracking cancer evolution reveals constrained routes to metastases: TRACERx renal</article-title>. <source>Cell</source> <volume>173</volume>, <fpage>581</fpage>&#x2013;<lpage>594.e12</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2018.03.057</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wei</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Ye</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Zhong</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Myers</surname>
<given-names>R. E.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Multiregion whole-exome sequencing of matched primary and metastatic tumors revealed genomic heterogeneity and suggested polyclonal seeding in colorectal cancer metastasis</article-title>. <source>Ann. Oncol.</source> <volume>28</volume>, <fpage>2135</fpage>&#x2013;<lpage>2141</lpage>. <pub-id pub-id-type="doi">10.1093/annonc/mdx278</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xiao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Ulintz</surname>
<given-names>P. J.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Guan</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>FastClone is a probabilistic tool for deconvoluting tumor heterogeneity in bulk-sequencing samples</article-title>. <source>Nat. Commun.</source> <volume>11</volume>, <fpage>4469</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-020-18169-2</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname>
<given-names>Z.-M.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Bai</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Iamarino</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Gaffney</surname>
<given-names>S. G.</given-names>
</name>
<name>
<surname>Schlessinger</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Early and multiple origins of metastatic lineages within primary tumors</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>113</volume>, <fpage>2140</fpage>&#x2013;<lpage>2145</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1525677113</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>