<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="editorial">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Big Data</journal-id>
<journal-title>Frontiers in Big Data</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Big Data</abbrev-journal-title>
<issn pub-type="epub">2624-909X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fdata.2022.972726</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Big Data</subject>
<subj-group>
<subject>Editorial</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Editorial: Big Data and machine learning in cancer theranostics</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Silva</surname> <given-names>Fabricio Alves Barbosa da</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/618982/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Tuszynski</surname> <given-names>Jack Adam</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/35442/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Nakaya</surname> <given-names>Helder</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/24735/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Paller</surname> <given-names>Channing J.</given-names></name>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/669250/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Scientific Computing Program, Oswaldo Cruz Foundation (Fiocruz)</institution>, <addr-line>Rio de Janeiro</addr-line>, <country>Brazil</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Physics, University of Alberta</institution>, <addr-line>Edmonton, AB</addr-line>, <country>Canada</country></aff>
<aff id="aff3"><sup>3</sup><institution>Hospital Israelita Albert Einstein</institution>, <addr-line>S&#x000E3;o Paulo</addr-line>, <country>Brazil</country></aff>
<aff id="aff4"><sup>4</sup><institution>The Johns Hopkins Hospital, Johns Hopkins Medicine</institution>, <addr-line>Baltimore, MD</addr-line>, <country>United States</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited and reviewed by: Thomas Hartung, Johns Hopkins University, United States</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Fabricio Alves Barbosa da Silva <email>fabricio.silva&#x00040;fiocruz.br</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Medicine and Public Health, a section of the journal Frontiers in Big Data</p></fn></author-notes>
<pub-date pub-type="epub">
<day>19</day>
<month>07</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>5</volume>
<elocation-id>972726</elocation-id>
<history>
<date date-type="received">
<day>18</day>
<month>06</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>30</day>
<month>06</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2022 Silva, Tuszynski, Nakaya and Paller.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Silva, Tuszynski, Nakaya and Paller</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<related-article id="RA1" related-article-type="commentary-article" xlink:href="https://www.frontiersin.org/research-topics/14968/big-data-and-machine-learning-in-cancer-theranostics" ext-link-type="uri">Editorial on the Research Topic <article-title>Big Data and machine learning in cancer theranostics</article-title></related-article>
<kwd-group>
<kwd>Big Data</kwd>
<kwd>machine learning</kwd>
<kwd>cancer</kwd>
<kwd>systems biology</kwd>
<kwd>computational biology</kwd>
</kwd-group>
<counts>
<fig-count count="0"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="0"/>
<page-count count="0"/>
<word-count count="1099"/>
</counts>
</article-meta>
</front>
<body>
<p>Cancer is a growing global health problem. The World Health Organization (WHO) predicts 27 million new cancer cases worldwide by 2030. Cancer treatment includes several standard protocols that may cause harmful side effects. Nevertheless, after two decades of genomics technological breakthroughs, personalized medicine is being used to improve treatment outcomes and mitigate side effects.</p>
<p>Machine learning is a consolidated methodology for extracting knowledge from omics Big Data. Essentially, machine learning algorithms on biology reveal underlying patterns, design models, and make statistical predictions using omics data as training datasets. We currently see the application of machine learning algorithms in several domains, such as genomics, proteomics, and systems biology. For example, machine learning algorithms for cancer research have been used to identify potential therapeutic targets, propose adjuvant therapies, and define measures to minimize side effects.</p>
<p>The pace of growth of omics data in recent years is astonishing. This speedy growth of omics data volume is particularly true for single-cell RNA-seq datasets. Therefore, innovative machine learning strategies using Big Data are needed to generate new knowledge from ever-growing amounts of cancer Big Data. Furthermore, it is of utmost importance to deploy high-performance and high-throughput computing platforms and tools to achieve this objective accurately and efficiently. Therefore, we proposed the Research Topic &#x0201C;<italic>Big Data and Machine Learning in Cancer Theranostics</italic>&#x0201D; to gather contributions describing the current state of the art in Big Data and Machine Learning methods in cancer theranostics. One main focus is on analyzing large omics datasets on scalable computational infrastructures.</p>
<p>Four papers were accepted for publication in this Research Topic from 11 submissions. The evaluation process was highly selective, and we acknowledge all reviewers that participated in that process. Below we present a summary of the four papers included in this Research Topic.</p>
<p>The original research paper &#x0201C;<italic>Data-Driven Modeling of Breast Cancer Tumors Using Boolean Networks</italic>&#x0201D; by <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fdata.2021.656395">Sgariglia et al.</ext-link> proposed a methodology for building data-driven boolean networks that model the dynamics of breast cancer tumors. The authors defined network components and structure based on RNA-seq data of breast cancer cell lines. They used a Boolean logic formalism to describe the network dynamics by identifying basins of attraction of the epigenetic landscape related to cancer cells. Single-cell RNA-seq data from breast cancer tumors and interactome information enabled the study of the dynamics of malignant subnetworks of up-regulated genes by identifying basins of attractions related to tumor cells on a personalized medicine approach.</p>
<p>The opinion article &#x0201C;<italic>Towards Machine-Readable (Meta) Data and the FAIR Value for Artificial Intelligence Exploration of COVID-19 and Cancer Research Data</italic>&#x0201D; by <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fdata.2021.656553">Campos et al.</ext-link> debates how the explosion of omics Big Data has considerably changed the landscape of cancer research. In recent years, heterogeneous data integration has been more common in dealing with complex biological problems, including cancer. According to the authors, it is a consensus that individual research centers cannot produce enough data to generate accurate prognostic and predictive models. Therefore, data integration in precision oncology is a mandatory step to generate novel results. Finally, the authors discuss how adopting the FAIR (Findable, Accessible, Interoperable, and Reusable) data principles in cancer research impacted data management in other areas, such as COVID-19 research.</p>
<p>In the review article &#x0201C;<italic>A Survey on Human Cancer Categorization Based on Deep Learning</italic>&#x0201D; by <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/frai.2022.884749">Ibrahim et al.</ext-link>, the authors assess deep learning concepts related to medical imaging research and survey several contributions in the field. In addition, it covers the main categories of imaging procedures related to cancer. The survey also comprises the usage of deep learning for cancer diagnosis, detection, and prognosis. Furthermore, the authors discuss challenges and trends for upcoming research.</p>
<p>In the opinion article &#x0201C;<italic>Digital Slide Assessment for Programmed Death-Ligand 1 Combined Positive Score in Head and Neck Squamous Carcinoma: Focus on Validation and Vision</italic>,&#x0201D; by <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/frai.2021.684034">Eccher et al.</ext-link>, the authors discuss how the introduction of immunotherapy drugs targeting the interaction between programmed death-1 protein (PD-1) expressed on T-helper lymphocytes, and the programmed death-ligand-1 (PD-L1) expressed on cancer cells represented a turning point in the therapy of several aggressive cancers in an advanced stage, including Head and Neck Squamous Carcinoma (HNSCC). The authors also anticipate deploying Artificial Intelligence tools in clinical practice to assess PD-L1 Combined Positive Score in HNSCC and select suitable immunotherapy patients.</p>
<sec id="s1">
<title>Author contributions</title>
<p>All authors listed have made a substantial, direct, and intellectual contribution to the work and approved it for publication.</p>
</sec>
<sec id="s2">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s3">
<title>Publisher&#x00027;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ack><p>The Research Topic Editors acknowledge Specialty Chief Editor for Medicine and Public Health Thomas Hartung&#x00027;s support along the papers&#x00027; evaluation process.</p>
</ack>
</back>
</article>