<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="editorial" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Big Data</journal-id>
<journal-title>Frontiers in Big Data</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Big Data</abbrev-journal-title>
<issn pub-type="epub">2624-909X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">727856</article-id>
<article-id pub-id-type="doi">10.3389/fdata.2021.727856</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Big Data</subject>
<subj-group>
<subject>Editorial</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Editorial: ML and AI Safety, Effectiveness and Explainability in Healthcare</article-title>
<alt-title alt-title-type="left-running-head">Benrimoh et&#x20;al.</alt-title>
<alt-title alt-title-type="right-running-head">Editorial: ML/AI Safety, Effectiveness and Explainability in Healthcare</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Benrimoh</surname>
<given-names>David</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/755942/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Israel</surname>
<given-names>Sonia</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Fratila</surname>
<given-names>Robert</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Armstrong</surname>
<given-names>Caitrin</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Perlman</surname>
<given-names>Kelly</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Rosenfeld</surname>
<given-names>Ariel</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Kapelner</surname>
<given-names>Adam</given-names>
</name>
<xref ref-type="aff" rid="aff4">
<sup>4</sup>
</xref>
</contrib>
</contrib-group>
<aff id="aff1">
<label>
<sup>1</sup>
</label>Department of Psychiatry, McGill University, <addr-line>Montreal</addr-line>, <addr-line>QC</addr-line>, <country>Canada</country>
</aff>
<aff id="aff2">
<label>
<sup>2</sup>
</label>Aifred Health, Inc., <addr-line>Montreal</addr-line>, <addr-line>QC</addr-line>, <country>Canada</country>
</aff>
<aff id="aff3">
<label>
<sup>3</sup>
</label>Department of Information Science, Bar-Ilan University, <addr-line>Ramat Gan</addr-line>, <country>Israel</country>
</aff>
<aff id="aff4">
<label>
<sup>4</sup>
</label>Department of Mathematics, Queens College (CUNY), <addr-line>New York City</addr-line>, <addr-line>NY</addr-line>, <country>United&#x20;States</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited and reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/12494/overview">Thomas Hartung</ext-link>, Johns Hopkins University, United&#x20;States</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: David Benrimoh, <email>david.benrimoh@mail.mcgill.ca</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Medicine and Public Health, a section of the journal Frontiers in Big&#x20;Data</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>12</day>
<month>07</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>4</volume>
<elocation-id>727856</elocation-id>
<history>
<date date-type="received">
<day>19</day>
<month>06</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>30</day>
<month>06</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2021 Benrimoh, Israel, Fratila, Armstrong, Perlman, Rosenfeld and Kapelner.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Benrimoh, Israel, Fratila, Armstrong, Perlman, Rosenfeld and Kapelner</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#x20;terms.</p>
</license>
</permissions>
<related-article id="RA1" related-article-type="commentary-article" xlink:href="https://www.frontiersin.org/researchtopic/11327" ext-link-type="uri">Editorial on the Research Topic <article-title>ML and AI Safety, Effectiveness and Explainability in Healthcare</article-title>
</related-article>
<kwd-group>
<kwd>healthcare</kwd>
<kwd>artificial intelligence</kwd>
<kwd>machine learning</kwd>
<kwd>safety</kwd>
<kwd>effectiveness</kwd>
<kwd>explainability</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<p>The increasing performance of machine learning and artificial intelligence (ML/AI) models has made them a more frequent presence in daily life, including in clinical medicine (<ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://doi.org/10.3389/frai.2020.507973">Bruckert et&#x20;al.</ext-link>; <xref ref-type="bibr" rid="B8">Rosenfeld et&#x20;al., 2021</xref>). While concerns about the opaque &#x201c;black box&#x201d; nature of ML/AI tools are not new, the need for practical solutions to the interpretability problem has become more pressing as ML/AI devices move from the laboratory, through regulatory processes that have yet to fully catch up to the state of the art (<xref ref-type="bibr" rid="B2">Benrimoh et&#x20;al., 2018a</xref>), and to the bedside. This special edition targets three key domains in which innovation and clearer best practices are required for the implementation of ML/AI approaches in healthcare: ensuring safety, demonstrating effectiveness, and providing explainability. Notably, the first two have long been staples in the evaluation of drugs and medical devices (i.e.,&#x20;in order to be approved for human use, products must be shown to be safe and effective, often relative to a reasonable comparator) (<xref ref-type="bibr" rid="B11">Sp&#x142;awi&#x144;ski and Ku&#x17a;niar, 2004</xref>). The third requirement, explainability, appears to be unique to ML/AI, due to the challenge of explaining how models arrive at their increasingly accurate conclusions. Yet, upon closer examination, one might argue that the explainability criterion has been implied in the past: mechanisms of action of drugs and devices are generally described in their product documentation (<xref ref-type="bibr" rid="B4">Health Canada, 2014</xref>). However, this can be misleading. 
For instance, many drugs have known receptor binding profiles and putative mechanisms of action, yet the precise mechanisms by which they produce their effects remain unclear despite their widespread use in clinical practice. Prime examples of this are lithium (<xref ref-type="bibr" rid="B10">Shaldubina et&#x20;al., 2001</xref>) and electroconvulsive therapy (<xref ref-type="bibr" rid="B9">Scott, 2011</xref>), both longstanding and highly effective treatments whose mechanisms of action remain controversial. Indeed, even the precise mechanism of general anesthesia is a subject of debate (<xref ref-type="bibr" rid="B7">Pleuvry, 2008</xref>). As such, we must consider a compromise: that of <italic>sufficient</italic> explainability (<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fdata.2020.572134">Clarke and Kapelner</ext-link>). This involves answering the question: how much must we know about a model in order to determine that it is safe to use in clinical practice? The articles in this special edition begin to explore possible answers to this question, as well as to other key questions in the application of ML/AI to healthcare contexts.</p>
<p>
<ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://doi.org/10.3389/frai.2020.507973">Bruckert et&#x20;al.</ext-link> propose a Comprehensible Artificial Intelligence (cAI) framework, which they describe as a &#x201c;cookbook&#x201d; approach for integrating explainability into ML/AI systems intended to support medical decision-making. Notably, the authors do not limit explainability to an understanding of the general rules a model might use to make predictions, but rather extend it to an example-level approach in which human-interpretable semantic information is passed from the machine to the human user. They also discuss systems that not only provide an explanation to the user, but also receive feedback on this explanation in order to learn the implicit rules that experts may use as part of their routine decision-making. Future research could examine the potential biases that a machine might learn from experts, and how these could be mitigated.</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fdata.2020.572134">Clarke and Kapelner</ext-link> present a Bayesian ML/AI model that predicts outcomes after lens implant (a surgical treatment for cataracts). Their approach to explainability entails generating a list of the most important features used by the model. This method coheres with pre-ML/AI approaches in ophthalmology, which relied on traditional linear equations with clear variables (e.g., <xref ref-type="bibr" rid="B5">Dang and Raj, 1989</xref>), an example of explainability already standard in the field. Because their approach is Bayesian, their predictions come with uncertainty intervals: the more uncertain the prediction, the wider the interval. Similar to <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://doi.org/10.3389/frai.2020.507973">Bruckert et&#x20;al.</ext-link>, the authors insist upon a &#x201c;human in the loop,&#x201d; noting that surgeons ought to use the algorithm&#x2019;s predictions and uncertainty intervals as a guide within the context of their clinical judgement. Further, they note that the consequences of an incorrect prediction are relatively minor and simply require corrective medical procedures already employed in standard practice. It is interesting to consider that these kinds of predictions, in which a correct prediction provides a benefit but a failed prediction carries low risk, are ideal first applications of ML/AI while they remain novel technologies (<xref ref-type="bibr" rid="B1">Benrimoh et&#x20;al., 2018b</xref>).</p>
<p>
<ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://doi.org/10.3389/frai.2021.561528">Desai et&#x20;al.</ext-link> introduce an ML/AI model for the identification of suicidal ideation in the general population. Using a sensitivity analysis, they demonstrate that a deep learning model, which is traditionally difficult to interpret, can be queried using standard statistical approaches to determine whether relationships between variables and outcomes identified by the model are coherent with the literature. Doing so not only makes deep learning a more viable model architecture for use in healthcare but, in&#x20;situations where a large body of literature exists, also allows for a &#x201c;sanity check&#x201d; of each model produced to ensure it has not learned &#x201c;quirks&#x201d; of a biased or non-representative dataset. This could help address <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://doi.org/10.3389/frai.2020.507973">Bruckert et&#x20;al.&#x2019;s</ext-link> concern about poor explainability arising from biased data and models.</p>
<p>Finally, <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fdata.2020.579774">Wong et&#x20;al.</ext-link> offer a comprehensive discussion of challenges and opportunities for machine learning approaches in acute respiratory failure. Their discussion is a prime example of the level of granularity that content experts need to provide when building ML/AI models in healthcare. In addition to a general review of ML/AI concepts, the authors discuss domain-specific sources of bias as well as difficulties in operationalizing the outcomes that models should be trained to predict. Their article serves as a useful entry point to the special edition for the clinically oriented reader who is less familiar with ML/AI approaches. It also reminds us that each healthcare domain has unique challenges, meaning that a one-size-fits-all approach to explainability and the evaluation of safety and effectiveness is unlikely to succeed.</p>
<p>This special edition provides the reader with both a survey of current approaches to integrating ML/AI into healthcare and in-depth discussions of how to determine model safety, measure model effectiveness, and provide model explainability that will be of use to clinicians and regulators.</p>
</body>
<back>
<sec id="s1">
<title>Author Contributions</title>
<p>DB led writing of the manuscript. SI, RF, CA, KP, AR, and AK contributed to the writing and review of the manuscript.</p>
</sec>
<sec sec-type="COI-statement" id="s2">
<title>Conflict of Interest</title>
<p>DB, SI, RF, CA, and KP are employees, founders, or shareholders of Aifred Health. AR and AK have collaborated with Aifred Health and received honoraria from Aifred Health.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Benrimoh</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Fratila</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Israel</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Perlman</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Mirchi</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Desai</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2018b</year>). &#x201c;<article-title>Aifred Health, a Deep Learning Powered Clinical Decision Support System for Mental Health</article-title>,&#x201d; in <source>The NIPS &#x2019;17 Competition: Building Intelligent Systems</source>. Editors <person-group person-group-type="editor">
<name>
<surname>Escalera</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Weimer</surname>
<given-names>M.</given-names>
</name>
</person-group> (<publisher-loc>Heidelberg, Germany</publisher-loc>: <publisher-name>Springer International Publishing</publisher-name>), <fpage>251</fpage>&#x2013;<lpage>287</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-319-94042-7_13</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Benrimoh</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Israel</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Perlman</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Fratila</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Krause</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2018a</year>). &#x201c;<article-title>Meticulous Transparency&#x2014;An Evaluation Process for an Agile AI Regulatory Scheme</article-title>,&#x201d; in <source>Recent Trends and Future Technology in Applied Intelligence</source>. Editors <person-group person-group-type="editor">
<name>
<surname>Mouhoub</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Sadaoui</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Ait Mohamed</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Ali</surname>
<given-names>M.</given-names>
</name>
</person-group> (<publisher-loc>Heidelberg, Germany</publisher-loc>: <publisher-name>Springer International Publishing</publisher-name>), <fpage>869</fpage>&#x2013;<lpage>880</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-319-92058-0_83</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="web">
<person-group person-group-type="author">
<collab>Health Canada</collab>
</person-group> (<year>2014</year>). <article-title>Guidance Document&#x2014;Product Monograph (Guidance)</article-title>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://www.canada.ca/en/health-canada/services/drugs-health-products/drug-products/applications-submissions/guidance-documents/product-monograph/product-monograph.html">https://www.canada.ca/en/health-canada/services/drugs-health-products/drug-products/applications-submissions/guidance-documents/product-monograph/product-monograph.html</ext-link>
</comment> (<comment>Accessed June 12, 2021</comment>) </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dang</surname>
<given-names>M. S.</given-names>
</name>
<name>
<surname>Raj</surname>
<given-names>P. P.</given-names>
</name>
</person-group> (<year>1989</year>). <article-title>SRK II Formula in the Calculation of Intraocular Lens Power</article-title>. <source>Br. J.&#x20;Ophthalmol.</source> <volume>73</volume> (<issue>10</issue>), <fpage>823</fpage>&#x2013;<lpage>826</lpage>. <pub-id pub-id-type="doi">10.1136/bjo.73.10.823</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pleuvry</surname>
<given-names>B. J.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Mechanism of Action of General Anaesthetic Drugs</article-title>. <source>Anaesth. Intensive Care Med.</source> <volume>9</volume> (<issue>4</issue>), <fpage>152</fpage>&#x2013;<lpage>153</lpage>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://www.sciencedirect.com/science/article/pii/S1472029907001774">https://www.sciencedirect.com/science/article/pii/S1472029907001774</ext-link>
</comment>. <comment>ISSN 1472-0299</comment>. <pub-id pub-id-type="doi">10.1016/j.mpaic.2007.08.004</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Rosenfeld</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Benrimoh</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Armstrong</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Mirchi</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Langlois-Therrien</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Rollins</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <source>Big Data Analytics and AI in Mental Healthcare. Applications of Big Data in Healthcare</source>. <publisher-loc>Amsterdam, Netherlands</publisher-loc>: <publisher-name>Academic Press</publisher-name>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Scott</surname>
<given-names>A. I. F.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Mode of Action of Electroconvulsive Therapy: An Update</article-title>. <source>Adv. Psychiatr. Treat.</source> <volume>17</volume> (<issue>1</issue>), <fpage>15</fpage>&#x2013;<lpage>22</lpage>. <pub-id pub-id-type="doi">10.1192/apt.bp.109.007039</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shaldubina</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Agam</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Belmaker</surname>
<given-names>R. H.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>The Mechanism of Lithium Action: State of the Art, Ten Years Later</article-title>. <source>Prog. Neuro-Psychopharmacology Biol. Psychiatry</source> <volume>25</volume> (<issue>4</issue>), <fpage>855</fpage>&#x2013;<lpage>866</lpage>. <pub-id pub-id-type="doi">10.1016/s0278-5846(01)00154-3</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sp&#x142;awi&#x144;ski</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Ku&#x17a;niar</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Clinical Trials: Active Control vs Placebo-What Is Ethical?</article-title>. <source>Sci. Eng. Ethics</source> <volume>10</volume> (<issue>1</issue>), <fpage>73</fpage>&#x2013;<lpage>79</lpage>. <pub-id pub-id-type="doi">10.1007/s11948-004-0065-x</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>