- 1Faculty of Nursing, Université Laval, Québec, QC, Canada
- 2Faculty of Arts and Sciences, Université de Montréal, Montreal, QC, Canada
- 3Sciences Po, Paris, France
- 4Faculty of Medicine, Université Laval, Québec, QC, Canada
- 5Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
Introduction: Artificial intelligence systems (AIS) powered by big data (BD) are more and more common in the healthcare sector and many anticipate that they will have a substantial effect on population health. Facing the disruptive potential of these transformations, there is a need to keep the pace with the ethical reflection accompanying the uses of AIS and the BD systems enabling such innovations.
Methods: To carry out this task, we conducted a scoping review of the ethical issues of AIS and BD, in population health, based on 243 scholarly articles.
Results: Our results show the explosion of publications on the subject in recent years. Our qualitative analysis of this literature highlights the potential issues of AIS and BD on the three components of population health: (1) the health outcomes and their distribution in the population and between populations; (2) the patterns of health determinants; (3) the policies and interventions developed to connect the previous components.
Discussion: Our conclusions show the uncertainty of the positive outcomes of these technologies and their potential for unequal distribution. Authors consider that AIS and BD will affect determinants of health either in their understanding and by transforming the structure of these determinants. At last, this review points that the policies and interventions developed to attain population health goals will have to answer to numerous ethical expectations. This review offers a comprehensive mapping of ethical issues raised by the uses of AIS in the global field of population health.
1 Introduction
Artificial intelligence systems (AIS) and big data (BD) are of special interest for population health (Mooney and Pejaver, 2018; World Health Organization, 2021). First, they promise an unprecedented capacity to treat and analyze large sets of data coming from vast social assemblages such as populations (Bellazzi, 2014). Second, they generate the possibility for developing large scale health interventions targeting populations or social groups because of their capacity for automation and their potential autonomy from limited human workforce (OECD, 2019; UNESCO, 2024; Dolley, 2018). Beside these promises, it is not clear on which ethical landscape these systems will be deployed (Floridi et al., 2018). To clarify this situation, our aim was to synthetize the state of the ethical reflection on the main ethical challenges raised by the introduction of systems at the intersection of artificial intelligence (AI) and BD from the perspective of population health.
For this task, we apply the definition of population health suggested by (Kindig and Stoddart, 2003). There is no consensus on what “population health” is, but Kindig and Stoddart’s definition offers an accepted base offering the common features implied by this extension of public health. According to these authors, “population health” can be defined as the “the health outcomes of a group of individuals, including the distribution of such outcomes within the group” (Kindig and Stoddart, 2003). It encompasses three interacting components. The first refers to health outcomes and their distribution. The second considers the patterns of health determinants (e.g., healthcare, social environment, physical environment). The third is the interventions and policies connecting the previous components.
In complement, we used the largest definitions of BD and AI to make sure no relevant article was excluded with regards to our research question. That said, both the definitions of BD and AI are porous and somewhat debated. To categorize the particularity of BD, many authors refer to the “three Vs” definition: volume, variety and velocity (Vogel et al., 2019; Stylianou and Talias, 2017; Tanti, 2015; Thorpe and Gray, 2015a; Dolley, 2018). A fourth and fifth V are sometimes added for “veracity” (Andanda, 2019; Bellazzi, 2014; Cahan et al., 2019; Liyanage et al., 2014) and “value” (Docherty and Lone, 2015; Lajonchere, 2018; Colloc, 2015; Salas-Vega et al., 2015). Sources of BD for population health include medical (Lee and Yoon, 2017; Wyllie and Davies, 2015; Cheung et al., 2019; Wang and Alexander, 2020) and medical-health data collected in various ways and by multiple devices (Vogel et al., 2019; Andanda, 2019; Mooney and Pejaver, 2018; Leyens et al., 2017; Benke and Benke, 2018; Alemayehu and Berger, 2016; Timmins et al., 2018; Barreto and Rodrigues, 2018; Kern et al., 2016), e.g., electronic health records (EHR) (Gossec et al., 2020), social media (Gossec et al., 2020; Aiello et al., 2020), wearable devices (Gossec et al., 2020), the internet of things (Fornasier, 2019), among others. Data can be personal or proprietary, controlled by the government or available in open data commons (Heitmueller et al., 2014).
BD is used to train and feed AIS. A very general definition of AI designates technologies that can execute tasks by imitating human intelligence (Gossec et al., 2020; Tang et al., 2018; Kerr et al., 2018; Xie et al., 2020). AI includes various approaches such as machine learning (supervised or unsupervised), deep learning, and neural networks (Mooney and Pejaver, 2018; Tang et al., 2018; Xie et al., 2020; Noorbakhsh-Sabet et al., 2019; Lajonchere, 2018; Galetsi et al., 2019; Mohr et al., 2017; Wang and Alexander, 2020; Lanier et al., 2020; Sparrow and Hatherley, 2019). It can take many forms, including some visible on computer screens and others as complex as robots (Fulmer, 2019; Kernaghan, 2014). Together, BD and AI are used in multiple ways to study or improve population health, e.g., health decision-making (Hunt et al., 2020; Conrad et al., 2020; Brill et al., 2019), surveillance (Mbunge, 2020; Budd et al., 2020; Larkin and Hystad, 2017), data analysis and research (Sparrow and Hatherley, 2019; Sanchez M and Sarria-Santamera, 2019; Ladner and Ben Abdelaziz, 2018), and assistive technologies (Kernaghan, 2014; Bennett, 2019; de Graaf et al., 2015; Grigorovich and Kontos, 2020; Miller, 2020; Vollmer Dahlke and Ory, 2020; Althobaiti, 2021; Jiang and Cheng, 2021).
In the next sections, we will defend that the use of AIS fueled by BD may affect paradoxically the three components of population health. It is still uncertain if the benefits of these AIS will balance the numerous risks that these technologies pose for the main goal of population health. We can still doubt whether these expectations will match reality. Hence, our knowledge synthesis offers a roadmap for future ethical assessment of AIS in population health.
2 Materials and methods
To achieve our aim, we followed the five stages of the scoping review methodology (Arksey and O’Malley, 2005; Levac et al., 2010; Tricco et al., 2018), starting with the identification of the research question which is: “what are the ethical issues of AIS using BD in population health?”
This question guided us for the next stage which was the identification of relevant studies. With the help of a librarian specialized in reviewing health research evidence, we developed the following research strategy. We conceived a search equation including terms related to the three concepts of our research question: (1) “ethical, legal, and social issues (ELSI),” (2) “population health,” and (3) “AIS and BD technologies” (see Table 1). We selected two databases because of their integration of articles in health sciences and bioethics (Medline) as well as social science and multidisciplinary research (Web of Science). Articles in English and French were included. No restrictions were used for publication date because of the novelty of the topic.
Once the strategy was determined, we started the study inclusion stage. For this purpose, we developed selection criteria (see Table 2) to optimize the search and followed the selection process suggested by the PRISMA flowchart (see Figure 1). The first search was conducted June 20, 2020, and it was updated November 24, 2021. The combined searches led to the identification of 5,173 records by screening their title and abstract. Each step of the screening was done by two reviewers (either MCR and JCBP or VC) for each record. After removing duplicates and analyzing the full text, we obtained a final sampling of 243 articles.
For the fourth stage of the review, we charted the data to have a global picture of the literature. Specifically, we look at the year of publication, the region where the first author is located, the academic domain of the article, and the type of technological application described in the article (see Supplementary material).
For the last stage, the articles were qualitatively analyzed following thematic analysis (Braun and Clarke, 2012). With the help of NVivo 12 (QSR International, 2017), we used inductive and deductive coding. Prior to coding, the principles for governing AI mapped in Fjeld et al. (2019) were used as an initial matrix. The codebook was updated as the coding was carried on. To assess intercoder reliability and to produce a first codebook, a subset (5%) of the articles retrieved in the first search were coded by three researchers (VC, JCBP, MCR). Codes were grouped into themes that we discussed within the definition of “population health” suggested by Kindig and Stoddart (2003).
3 Results
According to the literature, AIS using BD will generate ethical issues affecting each of the three components of population health: (1) health outcomes and their distribution, (2) the patterns of health determinants, (3) as well as the interventions and policies working on health determinants to create positive outcomes. Table 3 summarize these results.
3.1 Health outcomes and distribution
The literature is mostly speculative and ambivalent regarding AIS using BD capacity to generate positive health outcomes (Horvitz and Mulligan, 2015). The major threat of these systems may be the unfair distribution of these outcomes in the population and between populations.
3.1.1 Uncertain outcomes
3.1.1.1 Positive health outcomes
Many authors speculate that these technologies will create positive health outcomes for populations (Althobaiti, 2021; Cheng et al., 2020; Abramoff et al., 2021; Castagno and Khalifa, 2020; Kelly et al., 2020). Some of these positive expectations have been associated with specific optimization of various health services. Authors have mentioned the gain in terms of accessibility (Fornasier, 2019; Xie et al., 2020; Bates et al., 2018; Jones et al., 2020). The combined use of AI and BD opens a new scalability and the possibility to treat an unimaginable quantity of patients in comparison to what the actual workforce can offer (Abramoff et al., 2021). In that sense, AIS can offer a response to the actual health workers shortage that many health systems are facing. In parallel, these technologies could reduce the cost of health services (Kern et al., 2016; Grigorovich and Kontos, 2020) and make resource allocation more efficient (Stylianou and Talias, 2017; Galetsi et al., 2019; Sparrow and Hatherley, 2019; Bates et al., 2018; Machluf et al., 2017; Canaway et al., 2019; Peters and Buntrock, 2014). These benefits could be significant for low- and middle-income countries (LMICs) (Alami et al., 2020), where AIS could complement existing health services (Schwalbe and Wahl, 2020).
Authors have identified specific interventions that could be optimized with the integration of AI and BD such as helping to manage disease (Kerr et al., 2018), faster (Fornasier, 2019) and with more precision (Althobaiti, 2021), or otherwise facilitate diagnosis (Noorbakhsh-Sabet et al., 2019; Sparrow and Hatherley, 2019), determining appropriate treatments (Yang and Chen, 2018), e.g., with the use of precision medicine (Ahmed et al., 2020), and improving patient outcomes more generally (Canaway et al., 2019). Robots more specifically could help reduce loneliness (Miller, 2020) and otherwise induce positive emotions in older patients (Ienca et al., 2016), enhance their autonomy and thus reduce the burden on the healthcare system (de Graaf et al., 2015; Ienca et al., 2016). At the population level, AI and BD can support proactive interventions, particularly in populations of lower socioeconomic status (Machluf et al., 2017; Eng, 2004; Zhang et al., 2017), improve the prevention, prediction and treatment of chronic diseases (Lajonchere, 2018; Kern et al., 2016; Cool, 2016), make disease screening more efficient (Morgenstern et al., 2021), and facilitate epidemics surveillance (Galetsi et al., 2019; Cheng et al., 2020; Bates et al., 2018; Roberts, 2019) and the decision-making in cases of global health emergencies (Galetsi et al., 2019). AIS can offer more targeted populational interventions through so-called “precision public health” (Johnson, 2020; Dolley, 2018). Authors also noted benefits for healthcare systems including analyzing their inefficiencies (Manrique de Lara and Pelaez-Ballestas, 2020; Ho et al., 2020), detecting problems in health laboratories (Yang and Chen, 2018), facilitating the assessment of health technologies and drugs (Lajonchere, 2018) and streamlining the workflow (Thorpe and Gray, 2015a; Tang et al., 2018; Sparrow and Hatherley, 2019; Kostkova et al., 2016). Finally, the AI and BD technologies could optimize the research process at the very core of healthcare (Mooney and Pejaver, 2018; Sparrow and Hatherley, 2019; Balas et al., 2015; Joda et al., 2018) and facilitate the distribution of its benefits (Ahmed et al., 2020).
3.1.1.2 Negative health outcomes
Conversely, authors have identified numerous negative health outcome that could be aggregated into two clusters. The first one focusses on the errors that could be introduced by AI and BD. System dysfunction or malfunction are part of the game (Satava, 2002) and an error in AIS used systemically in healthcare could lead to harming 1,000 of patients (Sparrow and Hatherley, 2019). There is a possibility of misdiagnosis because of bugs or the overreliance of healthcare professionals (HCP) on AIS (Morley et al., 1982). The efficiency of AIS can lead to lower the human scrutiny on the system and diminish human capacity to control the system (Sarbadhikari and Pradhan, 2020). Another risks is the use of an AIS for a purpose other than what it was designed for (Ahmed et al., 2020). In the same vein, the vulnerability of these systems for cyber-attacks could disrupt the use of AI devices and affect populations (Abdulkareem and Petersen, 2021). Errors do not only pertain to the systems and HCPs can misleadingly interpret the results (Stylianou and Talias, 2017; Satava, 2002), misleadingly interpret the results of AIS because of their reluctance or distrust AI predictions (Ahmed et al., 2020; Horgan and Ricciardi, 2017; Nebeker et al., 2019).
The second cluster highlights the reductionist view of health introduced by these systems and the risks that something important will be missed (Dolley, 2018). Careless use may lead to wrong results, harming populations (Alemayehu and Berger, 2016) and wasting resources (Green and Vogt, 2016). The central role of BD for AIS risk reducing populations to numbers, narrowing the whole human experience (Sparrow and Hatherley, 2019) characterized by, inter alia, its irrationality, unpredictability and vulnerability (Kerr et al., 2018; Brill et al., 2019; Ahmed et al., 2020; Delpierre and Kelly-Irving, 2018; Prainsack, 2020), as well as its cultural dimension, situatedness, and its reliance on values, preferences and beliefs (Mentis et al., 2018; Kee and Taylor-Robinson, 2020; van Deursen and Mossberger, 2018). This form of dehumanization (Althobaiti, 2021; Ienca et al., 2016) can be detrimental to the therapeutic relationship (Sparrow and Hatherley, 2019) by providing services devoid of human contact (Fulmer, 2019; Kernaghan, 2014; Miller, 2020; Cordeiro, 2021), the empathy and the compassion normally offered by HCP (Kerr et al., 2018; Morley et al., 1982; Manrique de Lara and Pelaez-Ballestas, 2020). Also, these technologies are seen as ways to ground the personalization of medicine based on the individual’s genetic background. For some, it is feared that this narrow use will divert the focus away from public health interventions, and from upstream determinants of health (Kee and Taylor-Robinson, 2020; Kenney and Mamo, 2019).
3.1.2 Fair distribution of the outcomes
In parallel to the ambivalent outcomes of AIS for population health, many authors suggest that a central issue of these technologies will be to the inequitable distribution of their outcomes (Vollmer Dahlke and Ory, 2020; Althobaiti, 2021; Ienca et al., 2016; Conway, 2014; Ossorio, 2014; Rosen et al., 2020; Samuel and Derrick, 2020; Terrasse et al., 2019; Amann et al., 2020; Xafis et al., 2019; Car et al., 2019). They fear that these technologies’ health benefits will be concentrated in the hands of the more privileged groups while the burdens will be transferred to the less privileged. Five areas of reflections regarding the fair distribution have been scrutinized.
3.1.2.1 Increase of health disparities
Because of the scale at which it is used (Abramoff et al., 2021), some hope that AIS used in population health interventions will contribute to reducing health disparities (Zhang et al., 2017; Genevieve et al., 2019), but the reverse effect is anticipated by many (Nebeker et al., 2019; Terrasse et al., 2019; Breen et al., 2019; Holzmeyer, 2021; Luk et al., 2021). Some fear that these technologies will affect disproportionately parts of the population (Zhang et al., 2017; Montgomery et al., 2018) such as people with disabilities (Jones et al., 2020), vulnerable populations (Rosen et al., 2020), and marginalized communities (Hu et al., 2017). This could be partly due to interventions (e.g., precision public health) narrowly focused on biomedical factors and surveillance instead of taking into consideration social determinants of health (Johnson, 2020; Mentis et al., 2018; Kenney and Mamo, 2019; Backholer et al., 2021; Trein and Wagner, 2021). Conversely, public health surveillance programs may unduly focus on vulnerable populations because they may have less control over their “digital footprint” (Rosen et al., 2020), be insufficiently prepared to represent their interests (van Deursen and Mossberger, 2018) and lack time to manage their virtual identity (Montgomery et al., 2018). In parallel, there is a risk that the technology be used with bad intentions, perpetuating social prejudices and therefore increase health disparities (Abdulkareem and Petersen, 2021). For example, discriminatory uses of BD and AIS, such as selecting who has access to healthcare (Sparrow and Hatherley, 2019), identifying noncompliant patients (Moutel et al., 2018) and cherry-picking patients (Zhang et al., 2017; Terrasse et al., 2019), could increase health disparities by depriving populations who need it most from access to health services (Cahan et al., 2019).
3.1.2.2 Discrimination and stigmatization
Another type of justice consideration regarding BD and AIS relates to discrimination and stigmatization. Data breaches; loss of privacy; public information on social media; the identification of individuals, falsely or not, with a medical condition, a particular genotype, or as the source of an infection (Raza and Luheshi, 2016; Shachar et al., 2020); and the inclusion of social determinants in electronic health records (Goodman, 2020) and tracing apps (Mbunge, 2020); all these situations raise risks of stigmatizing individuals and communities (Aiello et al., 2020; Galetsi et al., 2019; Hunt et al., 2020; Genevieve et al., 2019; Breen et al., 2019; Luk et al., 2021; Baldassarre et al., 2020; Mikal et al., 2016; Vayena et al., 2015; Yang and Chen, 2018; Celedonia et al., 2021; Ngan and Kelmenson, 2021; Straw, 2021; Xing et al., 2021) as well as risks of discrimination (Cordeiro, 2021; Salerno et al., 2017; Yeung, 2018; Gilbert et al., 2019; Chen and See, 2020; Jalal et al., 2020) by insurance companies and employers (Stylianou and Talias, 2017; Cahan et al., 2019; Docherty and Lone, 2015; Colloc, 2015; Salas-Vega et al., 2015; Benke and Benke, 2018; Sparrow and Hatherley, 2019; Brill et al., 2019; van Deursen and Mossberger, 2018; Manrique de Lara and Pelaez-Ballestas, 2020; Montgomery et al., 2018; Salerno et al., 2017; Sun et al., 2020; Babyar, 2019; Ajunwa et al., 2016; Adkins, 2017; Casanovas et al., 2017; Ienca et al., 2018; Mootz et al., 2020; Tigard, 2019; Rajam, 2020). These risks apply even to individuals who have not participated in research activities (Kim et al., 2017) (e.g., when members of a group have shared identifiers) (Manrique de Lara and Pelaez-Ballestas, 2020) and when data has been anonymized (Dolley, 2018; Sanchez M and Sarria-Santamera, 2019). Discrimination could also occur on the basis of race, sex (Abdulkareem and Petersen, 2021; Manrique de Lara and Pelaez-Ballestas, 2020), gender (Zou and Schiebinger, 2021), income, age (Abramoff et al., 2021), and it can take many forms such as “invisibility, exclusion, or complacency employed to avoid detection, critique, or questioning” (Dankwa-Mullan et al., 2021). At the clinical level, protocols based on population statistics may exclude the individual preferences of patients (Dagi, 2017).
3.1.2.3 Digital colonialism
One distribution consideration relates to the fair return of results of technology development. Authors highlight the risk of “digital colonialism” where privileged populations benefit from the development of technology while the less privileged are left apart. This issue can take many forms that are mostly illustrated by the unequal relationships between high-income countries and LMICs. One fear is that researchers from high-income countries take advantage of data collected by researchers in LMICs for their own advantage and without acknowledging the latter’s work (Ballantyne, 2019; Car et al., 2019; Howe and Elenberg, 2020). At the population level, some worry that health data be analyzed in high-income settings with no possibility for LMIC to control how it is used (Andanda, 2019) and to benefit from it (Dolley, 2018; Li and Cong, 2021). Digital colonialism can also take the form of AIS being developed with data from high-income countries that will lead to detrimental and discriminatory effects on health care in LMICs. For example, these AIS may recommend a health intervention that is not feasible locally or only available at a significative costs outside the country (Alami et al., 2020; Bhattacharya et al., 2021; Demuro et al., 2020). Another consideration is that there might be socioeconomical barriers that prevent the implementation, in a LMIC, of an algorithm created in a high-income country (Liu and Bressler, 2020). A corollary is “ethics dumping,” which is “exporting unethical research practices, for example, unethical data processing […] to countries where research ethics committee oversight is lacking” (Samuel and Derrick, 2020). Some could justify this “ethics dumping” with the fact that the access to healthcare can be difficult in some LMICs. In the same vein, there is a concern that non-compliant technologies could bypass security and privacy vulnerabilities since informal healthcare is more prevalent in LMICs countries (Alami et al., 2020). However, this could lead us to a new “medicine for the poor” in the same way that most of the medical equipment being sent to LMICs fail or do not work (Alami et al., 2020).
3.1.2.4 Digital divide
The “digital divide” argument offers a variation on the unfair distribution of outcomes issue (Zhang et al., 2017). It describes inequalities in access to data (van Heerden et al., 2020) and technologies (Bennett, 2019) caused either by of a lack of resources (Mbunge, 2020) or knowledge (Aiello et al., 2020; Galetsi et al., 2019; Bennett, 2019; Vollmer Dahlke and Ory, 2020; Genevieve et al., 2019; Lodders and Paterson, 2020). The increased use of BD and AI in health could worsen the digital divide (Eng, 2004) and perpetuate health inequities (Genevieve et al., 2019; Murphy et al., 2021) by leaving out people who cannot or do not want to use those technologies (Kerr et al., 2018; Cordeiro, 2021; Mootz et al., 2020; Fleming, 2021). This can particularly affect people in LMICs (Brill et al., 2019; Mentis et al., 2018; Manrique de Lara and Pelaez-Ballestas, 2020; Mbunge, 2020) but also populations with lower socioeconomic status in high-income countries (Budd et al., 2020). The digital divide could have multiple consequences. First, it could lead to unrepresentative data sets by excluding populations who have least access to technologies (Cahan et al., 2019; Docherty and Lone, 2015; Benke and Benke, 2018; Aiello et al., 2020; Vollmer Dahlke and Ory, 2020; Dolley, 2018; Delpierre and Kelly-Irving, 2018; Ossorio, 2014; Genevieve et al., 2019; Breen et al., 2019; Mikal et al., 2016; Yeung, 2018; Gilbert et al., 2019; Hodgson et al., 2020). Second, these populations have higher burdens of disease (e.g., advanced age, lower economic status, etc.) but have less resources to benefit from BD and AI innovations (Larkin and Hystad, 2017; Sun et al., 2020; Strang, 2020). Third, the digital divide could also create inequities in digital surveillance (Aiello et al., 2020) and be exacerbated by the uses of the technologies at the international level (Manrique de Lara and Pelaez-Ballestas, 2020). However, populations with lower digital literacy could also be overrepresented because they “may be more likely to unknowingly imply consent” (Mikal et al., 2016; Demuro et al., 2020). Programs aiming to curb the digital divide could create a “privacy divide” if they require that vulnerable populations trade their personal data in exchange for products and services (Montgomery et al., 2018).
3.1.2.5 Biases in datasets and algorithms
An important concern relates to the presence of biases in datasets and the coding of algorithms that may lead to an unfair distribution of the benefits and burdens of the technology in the population or between populations. Biases may have different sources such as the obliteration of certain groups in the datasets used to train AI. This could come from observational, and sampling bias at the basis of data gathering (Cahan et al., 2019; Howe and Elenberg, 2020; Bhattacharya et al., 2021; Strang, 2020; Goldsmith et al., 2021; Tan et al., 2020) or missing data from less represented populations (Cahan et al., 2019; Lanier et al., 2020; Howe and Elenberg, 2020; Bhattacharya et al., 2021; Strang, 2020; Goldsmith et al., 2021; Tan et al., 2020). Biases in programming (Cahan et al., 2019; Wang and Alexander, 2020; Xie et al., 2020; Ahmed et al., 2020), for its part, may come from the amplification of previous biases and the failure to recognize them in subsequent stages (Morgenstern et al., 2021; Zou and Schiebinger, 2021; Baclic et al., 2020; Thomasian et al., 2021). They could also come from the erroneous decision to apply data from one population to another (Tang et al., 2018; Sparrow and Hatherley, 2019; Zhang et al., 2017; Morley et al., 1982; Delpierre and Kelly-Irving, 2018) or developers’ incorrect assumptions and beliefs (Terrasse et al., 2019; Yeung, 2018). All this will result in biased results, or to what authors refer to with the expression “garbage in, garbage out” (GIGO), meaning that biased data leads to biased results (Cahan et al., 2019; Howe and Elenberg, 2020; Evans et al., 2020). The risk at this stage is the perpetuation of biases, as biased algorithms could exacerbate already present racial and socioeconomic inequalities and vulnerabilities (Sarbadhikari and Pradhan, 2020; Luk et al., 2021; Couch et al., 2020). This may affect the health of individual patients (Tang et al., 2018; Morley et al., 1982; Sarbadhikari and Pradhan, 2020) and, moreover, the wellbeing of the global population (Docherty and Lone, 2015; Galetsi et al., 2019; Breen et al., 2019; Yeung, 2018) in terms of the perpetuation of discriminatory racial and social practices (Cahan et al., 2019; Kee and Taylor-Robinson, 2020; van Deursen and Mossberger, 2018; Altenburger and Ho, 2019) or health inequities (Cahan et al., 2019; Lanier et al., 2020; Sparrow and Hatherley, 2019; Zhang et al., 2017; Manrique de Lara and Pelaez-Ballestas, 2020; Yeung, 2018; Ballantyne, 2019; Villongco and Khan, 2020; Carney and Kong, 2017). Biased AIs seem unavoidable (Goodman, 2020), or hard to minimize (Gilbert et al., 2019), because of the black-box nature of many AIS (Lanier et al., 2020; Sparrow and Hatherley, 2019; Rajam, 2020), and the ubiquitous nature of AI (van Deursen and Mossberger, 2018). The mistake may be to consider data as pure and objective realities (Holzmeyer, 2021) although they are determined (like health and wellbeing) by economic, social, and political dynamics (Johnson, 2020; Delpierre and Kelly-Irving, 2018) as well as generating social consequences (Alami et al., 2020).
3.2 Health determinants
Aside from discussing the health outcomes and their distribution in the population and between populations, the literature reflects on how AIS and BD will affect three important health determinants: health behaviors, healthcare functioning, and data infrastructure.
3.2.1 Promotion of healthy behaviors
Looking at how the technologies will affect health-related behaviors, the literature is dubious by both acknowledging their potential for individual empowerment as well as their possibility to undermine the individuals’ autonomy (Snell, 2019). Digital literacy appears to be an important condition to obtain such positive outcomes.
3.2.1.1 Empowerment and disempowerment
AIS using BD may affect positively individual behaviors by empowering patients in taking care of their own health (Fornasier, 2019; Lajonchere, 2018; Fulmer, 2019; Cordeiro, 2021; Manrique de Lara and Pelaez-Ballestas, 2020; Snell, 2019; Montgomery et al., 2018; Prosperi et al., 2018). These technologies could help individuals to monitor their own health (Wang and Alexander, 2020; Kenney and Mamo, 2019; Car et al., 2019), offer pertinent health information (Prosperi et al., 2018; Kostkova et al., 2016), contribute to decision-making (Bellazzi, 2014; van Deursen and Mossberger, 2018), assist in the management of their health and illness (Eng, 2004), and open the possibility of robotic assistance and interactions (Bennett, 2019; de Graaf et al., 2015; Vollmer Dahlke and Ory, 2020; Belk, 2020; Kernaghan, 2014). All this could be of great use for chronic disease management (Kenney and Mamo, 2019; Car et al., 2019), and supporting disabled people (Jones et al., 2020) or elderly people’s autonomy (Manzeschke et al., 2016). The autonomy offered by these systems may modify the power relationship with the HCP in favor of the patient (Galetsi et al., 2019; Sparrow and Hatherley, 2019; Terrasse et al., 2019; Gilbert et al., 2019), thus diminishing medical authority (Lupton and Jutel, 2015). This conception of empowerment and engagement is a strong dimension of the digital health rhetoric (Lupton and Jutel, 2015).
The combined use AI and BD can also have positive effects on collective behaviors. Some anticipate that these technologies offer platforms for collective engagement, for example in disease surveillance (Genevieve et al., 2019; Sun et al., 2020) and in the research process (Manrique de Lara and Pelaez-Ballestas, 2020; Manrique de Lara and Pelaez-Ballestas, 2020; Gossec et al., 2020; Anisetti et al., 2018; Katapally, 2020). In that vein, some see the possibility of citizen engagement in the development of these very same technologies (Casanovas et al., 2017; Altenburger and Ho, 2019), yet a lot has still to be done (Conrad et al., 2020; Breen et al., 2019; Gilbert et al., 2019; Pepin et al., 2020; Nichol et al., 2021; Tang et al., 2018; Manzeschke et al., 2016). This possibility raises its very own ethical issues regarding the authenticity of the engagement of citizens, patients or populations (Bennett, 2019; Vollmer Dahlke and Ory, 2020; Ballantyne, 2019; Evans et al., 2020).
Conversely, many speculate that the technologies will promote disempowerment. For some, patients may feel a loss of agency toward the decision taken by HCPs and AIS (Morley et al., 1982; van Deursen and Mossberger, 2018), particularly if the AIS is opaque (Amann et al., 2020), and create forms of nudging (van Deursen and Mossberger, 2018). Aside, the pervasiveness of the technology may discourage individuals to engage in their own health and leave this task to the technology (Kasperbauer, 2021). On the other hand, they may feel responsible for their health, creating “individuals on alert” (van Deursen and Mossberger, 2018; Samerski, 2018). Despite presenting themselves as patient-empowering, self-diagnosis apps still recommend users to seek medical advice, challenging patient empowerment in face of medical authority (Lupton and Jutel, 2015).
3.2.1.2 Digital and ethical literacy
To sustain the empowering of populations and attain positive health outcomes, digital and ethical literacy appears to be an essential precondition for the stakeholders of population health (Benke and Benke, 2018; Fulmer, 2019; Delpierre and Kelly-Irving, 2018; Ballantyne, 2019). First, there is a need to educate the public regarding digital technologies using BD and AI (Galetsi et al., 2019; van Deursen and Mossberger, 2018) and their various pitfalls such the limitations of the technology (Grigorovich and Kontos, 2020; van Deursen and Mossberger, 2018; Lupton and Jutel, 2015), the complexity of privacy protection (Mikal et al., 2016; Lodders and Paterson, 2020), the risks of cybersecurity (van Deursen and Mossberger, 2018) and the inherent biases of the technology (Mikal et al., 2016). The same necessity for digital and ethical literacy by the general population has been said for policymakers (Lanier et al., 2020), HCP, and researchers (Stylianou and Talias, 2017; Leyens et al., 2017; Aiello et al., 2020; Galetsi et al., 2019; Brill et al., 2019; Machluf et al., 2017; Satava, 2002; Babyar, 2019; Gossec et al., 2020; Pepin et al., 2020; Hemingway et al., 2018; Godfrey et al., 2020; Ho and Caals, 2021). At last, ethical literacy may be critical for data scientists to achieve their aim (Dolley, 2018; Kern et al., 2016; Zhang et al., 2017; van Deursen and Mossberger, 2018; Goodman, 2020; Nebeker et al., 2019).
3.2.2 Efficient healthcare functioning
Aside from health behaviors, authors have dissected the effects of AIS on more structural health determinants such as healthcare accessibility and quality. Regarding that pattern of determinants, it is anticipated that AIS will transform healthcare working conditions (Noorbakhsh-Sabet et al., 2019). Some speculate the potential of the technologies to maximize HCPs’ workforce, others suggest an increased workload and a devaluation of their work.
On the positive side AIS could assist HCPs in their work through numerous tasks such as removing repetitive tasks (Ahmed et al., 2020; Jalal et al., 2020), improving workflow (Tang et al., 2018), managing patients (Kerr et al., 2018; Pagliari, 2021), keeping pace with the medical literature (Conrad et al., 2020), supporting diagnostic and treatment decisions (Benke and Benke, 2018; Jones et al., 2020; Adkins, 2017; Wang et al., 2021), personalizing treatment (Brill et al., 2019), and possibly even reducing misdiagnosis (Xie et al., 2020). They could also support communication between HCPs and patients, maximizing the short time given for clinical consultations (Mootz et al., 2020). Thus, AIS, instead of dehumanizing care, would help rehumanize (Cahan et al., 2019; Conrad et al., 2020; Belk, 2020) and reinforce the relationship (Brill et al., 2019).
There is no consensus on the potential benefits of AIS. Many fear an increase in HCPs’ workload (Grigorovich and Kontos, 2020; Xing et al., 2021). The necessity for HCPs to adapt to new AIS by learning how to use the technology (Kerr et al., 2018; Grigorovich and Kontos, 2020; Godfrey et al., 2020) and the incentive to collect and manage more data, will all add to their workload (Sparrow and Hatherley, 2019; Grigorovich and Kontos, 2020). For example, electronic-health records add administrative burdens for HCPs (Sparrow and Hatherley, 2019). Also, the optimization of the services may lead to treat more patients instead of allowing more time for clinical consultations (Sparrow and Hatherley, 2019). Furthermore, the use of apps for health self-monitoring may lead to increased and unnecessary referrals to HCPs, also adding to their workload (Ienca et al., 2018).
In the long term, many authors raise the concern that AIS could change the healthcare workforce (Benke and Benke, 2018) by devaluating their expertise (Kasperbauer, 2021). Some anticipate that doctors (Stylianou and Talias, 2017; Galetsi et al., 2019; Fulmer, 2019; Ahmed et al., 2020; Satava, 2002; Manrique de Lara and Pelaez-Ballestas, 2020; Conrad et al., 2020; Conrad et al., 2020; Samerski, 2018), and nurses (Goodman, 2020) be replaced by AIS (Althobaiti, 2021), although it is not unanimously supported (Cahan et al., 2019; Lajonchere, 2018; Tang et al., 2018; Adkins, 2017; Belk, 2020). The replacement of HCP by AIS could lead to diminished professional autonomy (Lajonchere, 2018; Sparrow and Hatherley, 2019; Dagi, 2017), dependence to AIS (Abramoff et al., 2021), deskilling of HCPs (Sparrow and Hatherley, 2019; Satava, 2002; Goodman, 2020), and unemployment (Kerr et al., 2018). Authors also highlighted the risk of increasing the surveillance of workers (Tang et al., 2018; Sparrow and Hatherley, 2019; Yang and Chen, 2018).
3.2.3 Data control
From a perspective of health technologies more and more dependent of big data, the control over data plays a strategical role. Who controls data may direct the benefits downstream and affect the health of entire populations. For that reason, issues of data control shape a specific pattern of health determinants. In relation to that concern, three groups of issues play a preponderant role in the literature: issues over data ownership, data management, and data accessibility.
3.2.3.1 Data ownership
The question of data ownership asks the question of who can exercise power over the data that will be used to train and fed AI. The question of ownership is a complex one (Gilbert et al., 2019). Data are created by many people and all have some rights over the data (Mooney and Pejaver, 2018; Tang et al., 2018) while promoting different agendas (Alemayehu and Berger, 2016). BD derived technologies amplify this situation with their capacity, sometime furtive, to aggregate numerous sources of data. These sources of data may be as diverse as ordinary internet-connected object (Miller, 2020; van Deursen and Mossberger, 2018; Ajunwa et al., 2016) to public health surveillance interventions (Hodgson et al., 2020). All this may be complicated by the limited knowledge of the data individuals implicitly share (Horvitz and Mulligan, 2015).
A strong line of thought suggests that there is an information asymmetry between individual and corporation in the favor of the latter (Sun et al., 2020). Health data can be seen as a profitable investment for corporations (Canaway et al., 2019; Cheung, 2020). There is the possibility that private corporation owns sensitive health information (Tigard, 2019) and that they capture health data coming from public health interventions (Aiello et al., 2020). Although they might be regulated (Ballantyne, 2019), they might be less accountable for the use of data (Andanda, 2019) while caring less for the social good (Terrasse et al., 2019) than the protection of intellectual property (Salas-Vega et al., 2015) and developing monopolies (Ladner and Ben Abdelaziz, 2018; Roberts, 2019; Satava, 2003). This situation opens fear of abuses (Benke and Benke, 2018; Delpierre and Kelly-Irving, 2018; Prainsack, 2020) which makes some believe that the deployment of AIS will benefit the corporation rather than the populations (Adkins, 2017) and perpetuate social inequalities (Andanda, 2019).
While corporations play a central role in data economies, the control of individuals over their own data also need to be considered (Andanda, 2019; Colloc, 2015; Salas-Vega et al., 2015; Benke and Benke, 2018; Tang et al., 2018). Policies may play an important role in protecting this form of control (Montgomery et al., 2018) to respond to constant risk of reidentification (Delpierre and Kelly-Irving, 2018) and commodification (Conrad et al., 2020). Traditionally, patients have not been able to control their healthcare data (Bates et al., 2018), but, because of the strategic role data plays. There is an increasing demand from individuals to have access to their own data (Stylianou and Talias, 2017; Hodgson et al., 2020).
An alternative to previous mode of property could be find in collective ownership of data such as “data sovereignty” which could be defined as the “rights of a nation to govern the collection, ownership and use of its own data” (Ballantyne, 2019). It is argued that people using AIS should have the chance to have some control over their data (Cordeiro, 2021; Casanovas et al., 2017), particularly if we consider that the data provided for the development of BD and AIS is a community investment (Oravec, 2019). As a community investment, it may warrant financial returns or a stake in the decision-making (Oravec, 2019). Differences in data systems between countries raise challenges and opportunities for State-bodies (Machluf et al., 2017). Governmental control can be seen as a more secure (Snell, 2019) alternative to commercial management. Governance innovations include “data custodians and/or indigenous data governance bodies” (Ballantyne, 2019). This control of data by communities may contribute to guarantee the inclusion of diverse dimensions and include social determinants of health (Dankwa-Mullan et al., 2021). Although, this community control may be illusory if, at the end, data are stored in the cloud through a network of foreign servers (Colloc, 2015).
3.2.3.2 Data management
A related set of issues to the ones of ownership relates to data management (Stylianou and Talias, 2017; Salas-Vega et al., 2015; Galetsi et al., 2019; Wang and Alexander, 2020; Ladner and Ben Abdelaziz, 2018; Ahmed et al., 2020; Young, 2018). Authors ask what ethical data management would look like (Samuel and Derrick, 2020)? Data management is an important consideration because of the increasing number of people involved in data collection (Vogel et al., 2019), and the enormous amount of data generated (Bellazzi, 2014). Data management implies a long continuum from data production, storage, curation, analysis, protection and circulation. It raises the issue of who has the power to manage the data and the risk of centralized or commercial data control (Manrique de Lara and Pelaez-Ballestas, 2020; Tupasela et al., 2020). On the contrary, centralization may be replaced by over fragmentation and make it difficult to locate when data are used as part of large platforms or by many entities, e.g., in research settings (O’Doherty et al., 2016). In terms of population health, the more acute concern is to optimize their use (Cahan et al., 2019), because of their medical importance (Casanovas et al., 2017), and their role in eliminating health disparities (Carney and Kong, 2017). Authors sometimes talk about the stewardship of data (Sanchez M and Sarria-Santamera, 2019) which includes the “safeguards, audits and operational protocols” (Sanchez M and Sarria-Santamera, 2019).
One risk associated with data management are conflicts of interests (COI) that can arise if data belong to actors who have diverging interests. For example, corporations, governments, the public, healthcare systems, HCPs and researchers may all have diverging needs, interests and goals, raising risks of COI (Salas-Vega et al., 2015; van Deursen and Mossberger, 2018; Manrique de Lara and Pelaez-Ballestas, 2020; Salerno et al., 2017; Casanovas et al., 2017; Car et al., 2019; Altenburger and Ho, 2019). This situation may be more patent for regulators who want to promote, at the same time, commercial and public interests (Tupasela et al., 2020). At last, COIs can be hidden within the programming of their algorithms (Car et al., 2019).
3.2.3.3 Data accessibility and sharing
Corollary issues regard the accessibility of the data and data sharing (Stylianou and Talias, 2017; Wang and Alexander, 2020; Benke and Benke, 2018; Galetsi et al., 2019; Ladner and Ben Abdelaziz, 2018; Ahmed et al., 2020; Nebeker et al., 2019; Goodman, 2020; Sun et al., 2020; Ienca et al., 2018; Tigard, 2019; Hodgson et al., 2020). Publicly funded data and data of public utility may have a stronger obligation for being accessible (Tang et al., 2018; Goodman, 2020; Gossec et al., 2020). Access to data is necessary in order to realize BD and AI’s potential for improving global (Li and Cong, 2021; Galetsi et al., 2019) and individual health (Zhang et al., 2017; Kostkova et al., 2016; Raza and Luheshi, 2016; Li and Cong, 2021; Mahlmann et al., 2017). The accessibility of data can be essential for public health, and become critical during infectious outbreaks and (Budd et al., 2020; Ballantyne, 2019). Data sharing is also strategic for research activities (Galetsi et al., 2019; Zhang et al., 2017). For example, easier access to publicly funded clinical datasets could help reduce data-access inequities between researchers (Zhang et al., 2017), and enable reproducible research (Gossec et al., 2020).
Because of its benefits, some people believe that individuals have a duty to share their data in order to advance health goals. Some authors defend the idea that it is a societal responsibility to act accordingly (Green and Vogt, 2016). In other words, individuals have a duty to share their information for the sake of their own treatment (Salerno et al., 2017), for epidemiological reasons (Salerno et al., 2017), for the advancement of health research (Tsai and Junod, 2018) or for the learning health system (Sparrow and Hatherley, 2019). Individuals will benefit at a certain point in life from these goods (Tsai and Junod, 2018) or they will contribute to the common good (Snell, 2019). Otherwise, it may be considered as selfishness or free riding (Snell, 2019).
However, this imperative to share data may face several barriers that may be practical (Leyens et al., 2017; Wang and Alexander, 2020; Ahmed et al., 2020; Balas et al., 2015; Lee and Yoon, 2017), cultural (Leyens et al., 2017; Tang et al., 2018; Car et al., 2019), economical (Tang et al., 2018; Lee and Yoon, 2017; Machluf et al., 2017), technical (Lajonchere, 2018; Noorbakhsh-Sabet et al., 2019; Zhang et al., 2017; Raza and Luheshi, 2016; Yang and Chen, 2018; Dagi, 2017; Hemingway et al., 2018; Salerno et al., 2017; Deshpande et al., 2019), political (Machluf et al., 2017; Car et al., 2019; Carney and Kong, 2017), ethical (Bates et al., 2018; Machluf et al., 2017; Balas et al., 2015; Yang and Chen, 2018; Gilbert et al., 2019; Prosperi et al., 2018; van Heerden et al., 2020), and regulatory (Salas-Vega et al., 2015; Lajonchere, 2018; Zhang et al., 2017; Manrique de Lara and Pelaez-Ballestas, 2020; Rosen et al., 2020; Raza and Luheshi, 2016; Yang and Chen, 2018; Casanovas et al., 2017; Mahlmann et al., 2017; Deshpande et al., 2019). Many stakeholders may have an interest in accessing data, e.g., researchers, health-policy makers, HCPs, insurances (Stylianou and Talias, 2017). It raises numerous questions. Who should be given access to the data? For which aim? In which conditions? (Stylianou and Talias, 2017; Casanovas et al., 2017) With which safeguards? (Andanda, 2019; Lajonchere, 2018) In which sustainable infrastructure? (Raza and Luheshi, 2016; Pepin et al., 2020) How should benefits and risks of data sharing should be distributed equitably? (Brill et al., 2019; Ballantyne, 2019) These questions are entangled in the web of issues at the intersection of privacy protection, control over data access, and protecting informed consent (Sanchez M and Sarria-Santamera, 2019).
3.3 Interventions and policies
So far, we have seen the ethical tension raised by AIS and their reliance on BD from the perspective of their effect on health outcomes and patterns of health determinants. For this last part, we will look at their effect on intervention and policies. Intervention and policies are seen as ways to work on health determinants to produce greater health outcomes for the population. Looking at the means of population health, the discussion may be summarized as how the uses of the technologies may infringe common ethical and legal obligations in terms of privacy, consent, responsibility, transparency, trust and social acceptability.
3.3.1 Privacy protection
Privacy could be defined as “the right to be left alone” (Ienca et al., 2016). Sun and collaborators argue that, in the context of health, privacy refers to one’s right to decide what identifiable data is collected, how it is used and disclosed (Sun et al., 2020). AIS and BD in population health raise various multidimensional privacy issues (Bellazzi, 2014; Benke and Benke, 2018; Tang et al., 2018; Kerr et al., 2018; Noorbakhsh-Sabet et al., 2019; Althobaiti, 2021; Abramoff et al., 2021; Ahmed et al., 2020; van Deursen and Mossberger, 2018; Cordeiro, 2021; Conway, 2014; Gilbert et al., 2019; Prosperi et al., 2018; Pepin et al., 2020; Hemingway et al., 2018; Joda et al., 2018; Kayaalp, 2018; Shahid et al., 2021) which are seen as an important concern for the public (Mooney and Pejaver, 2018; Comess et al., 2020) and HCPs (Castagno and Khalifa, 2020) because of the significant importance of health data (Heitmueller et al., 2014). However, as we will see, empirical data may mitigate the importance accorded by the public to privacy issues (Esmaeilzadeh, 2020). Two dimensions are of particular interest for ethics: privacy breaches and the difficult operationalization of privacy standards.
3.3.1.1 Privacy breaches
Privacy issues are central to the ethics of AIS and BD because of the informational nature of these technologies. They refer mostly to wrongful uses of data (Colloc, 2015; Balas et al., 2015; Shah and Khan, 2020), accidental disclosure (Mooney and Pejaver, 2018; Balas et al., 2015; Conway, 2014; Casanovas et al., 2017), data crossing (Sparrow and Hatherley, 2019; Horvitz and Mulligan, 2015) or unintentional disclosure of sensitive information (Xing et al., 2021; Murphy et al., 2021; Gilbert et al., 2020). These are frequently analyzed through the lens of cybersecurity issues (Stylianou and Talias, 2017; Tanti, 2015; Dolley, 2018; Liyanage et al., 2014; Lajonchere, 2018; Salas-Vega et al., 2015; Wang and Alexander, 2020; Heitmueller et al., 2014; Kerr et al., 2018; Xie et al., 2020; Galetsi et al., 2019; Sparrow and Hatherley, 2019; Fulmer, 2019; Althobaiti, 2021; Bates et al., 2018; Machluf et al., 2017; Ahmed et al., 2020; Canaway et al., 2019; Ienca et al., 2016; Kostkova et al., 2016; Balas et al., 2015; Prainsack, 2020; van Deursen and Mossberger, 2018; Ossorio, 2014; Rosen et al., 2020; Montgomery et al., 2018; Salerno et al., 2017; Gilbert et al., 2019; Sun et al., 2020; Ajunwa et al., 2016; Casanovas et al., 2017; Ienca et al., 2018; Mootz et al., 2020; Tigard, 2019; Dagi, 2017; Hodgson et al., 2020; Lupton and Jutel, 2015; Pepin et al., 2020; Manzeschke et al., 2016; Belk, 2020; Snell, 2019; O’Doherty et al., 2016; Deshpande et al., 2019; van Heerden et al., 2020; Cutrona et al., 2012; Fornasier, 2019; Hoffman and Podgurski, 2013; Terry, 2014; Torous and Haim, 2018; Tsai and Junod, 2018; Veiga and Ward, 2016). Privacy breaches are increasingly observed in the health sector (Sun et al., 2020; Ajunwa et al., 2016; Dagi, 2017) and they have been highlighted at different phases of health data circulation from collecting (Mohr et al., 2017), transferring between linked services (Shah and Khan, 2020), sharing (Salerno et al., 2017), storing (Stylianou and Talias, 2017; Larkin and Hystad, 2017; Kostkova et al., 2016; Conway, 2014; Gossec et al., 2020; O’Doherty et al., 2016), training AIS (Murphy et al., 2021) to destructing data (Wang et al., 2021).
The main harm of privacy breaches may be the risks of re-identification. Even if data are anonymized, many studies have shown that individuals can often be re-identified (Lee et al., 2016). Re-identification can be done by linking anonymous data, meta data (Gilbert et al., 2019) and datasets (Docherty and Lone, 2015) and is made easier with interoperable datasets (Delpierre and Kelly-Irving, 2018). Many authors in this review agree that re-identification risks are high with BD and related technologies. The re-identification risk increases with data’s dimensionality, i.e., the number of variables of data (e.g., age, location, weight, any other physiological trait, genetic information, etc.) (Bellazzi, 2014; Cahan et al., 2019; Mooney and Pejaver, 2018). Re-identification risks also increase with the low prevalence of the variable (e.g., rare medical conditions) (Docherty and Lone, 2015; Ballantyne, 2019; Demuro et al., 2020), the quantity of personal data in the public domain (Tsai and Junod, 2018), data linkage (Manrique de Lara and Pelaez-Ballestas, 2020), combination of data (Rennie et al., 2020), the improvement of data mining methods (Lee et al., 2016), and who has access to it, at the end, creating various degrees of de-identification (Ballantyne, 2019).
Surveillance activities raise particular concerns in terms of privacy. They are troubling considering the staggering amounts of data held by health organizations, corporations (Celedonia et al., 2021; Lodders and Paterson, 2020) and governments that can be used against the interest of individuals (Andanda, 2019; Baldassarre et al., 2020; Evans et al., 2020). The risk of surveillance is an unavoidable trade-off of the of BD (Ngan and Kelmenson, 2021; Howe and Elenberg, 2020) and AIS in health-related activities and one that attenuates its possible benefits (Galetsi et al., 2019; Mootz et al., 2020). For example, passive technologies such as imbedded sensors are less intrusive than direct observation (Grigorovich and Kontos, 2020), but nonetheless imply the collection of immense quantities of data. In the context of the COVID-19 pandemic, citizen populations were watched in order to prevent the spread of the disease, but many expressed concerns that this information be used for other purposes (Sarbadhikari and Pradhan, 2020; Shachar et al., 2020; Naudé, 2020; Shen and Wang, 2021).
3.3.1.2 Operationalization of privacy standards
The operationalization of privacy standards faces several challenges. It is not clear how to use the polysemic concept of privacy (Mooney and Pejaver, 2018; Conway, 2014; Casanovas et al., 2017; Snell, 2019). Some suggest to distinguish different forms of privacy, which certain forms are more at risk with BD such as informational privacy (Ienca et al., 2016) or physical privacy through surveillance (Bennett, 2019). The complexity may also arise because of the overlapping of privacy with a large spectrum of ethical values such as trust, transparency, security and property over who has access to the data and for what uses (Canaway et al., 2019). Contexts may also influence the definition and operationalization of privacy. For example, different areas of research have various methodologies and tools, complicating the protection of privacy in interdisciplinary health research (Casanovas et al., 2017).
Culture could also influence how privacy is understood, raising the question of whether a core definition should be used across all settings or not (Vayena et al., 2015). Also, in some political and economic contexts, citizens may consider that privacy concerns are irrelevant because of the level of surveillance already imposed by the State (Liu and Graham, 2021). Authors also note regularly the paradox between the perceived lack of concern of people toward sharing identifiable information on internet platforms (Aiello et al., 2020; Mikal et al., 2016; Yang and Chen, 2018; Snell, 2019; Young, 2018) and, at the same time, the fear of privacy breach related to participation in research project (de Graaf et al., 2015; Montgomery et al., 2018; Mootz et al., 2020; Tsai and Junod, 2018; Wongkoblap et al., 2017), public health interventions (O’Doherty et al., 2016) or any other health activities (Yang and Chen, 2018).
Common strategies have been proposed to protect privacy such as de-identification (Comess et al., 2020; Aebi et al., 2021), anonymization, (Comess et al., 2020; Gilbert et al., 2020; Aebi et al., 2021) and geo-masking (Comess et al., 2020; Aebi et al., 2021). However, these strategies face several limitations such as the complex language of privacy policies (Aiello et al., 2020), the lack of transparency about the protection mechanism used (Casanovas et al., 2017), the overall cost of the protection mechanisms (Zhang et al., 2017; Kayaalp, 2018), the use of protection mechanism more adequate for “small data” rather than BD (Wang and Alexander, 2020; Sun et al., 2020), and the ambiguous status of sensible data shared on social media (Celedonia et al., 2021; Gilbert et al., 2020).
The value of privacy conflicts with the possible benefits associated with using BD and AI in health-related contexts (Sparrow and Hatherley, 2019; Cool, 2016; Goodman, 2020; Salerno et al., 2017; Gilbert et al., 2019; van Heerden et al., 2020; Yeung, 2018; Igual et al., 2013). During the COVID pandemics, empirical data have shown that, for certain people, the loss of privacy was perceived as a trade-off for public health (Liu and Graham, 2021; Degeling et al., 2020). Aside from greater public health outcomes and prevention (Dolley, 2018; Alemayehu and Berger, 2016; Aiello et al., 2020; Horvitz and Mulligan, 2015; Roberts, 2019; Raza and Luheshi, 2016; Adkins, 2017; Mootz et al., 2020; Hodgson et al., 2020; Mahlmann et al., 2017), authors suggest that the promotion of scientific innovation could outweigh privacy (Heitmueller et al., 2014; Balas et al., 2015; Manrique de Lara and Pelaez-Ballestas, 2020; Salerno et al., 2017; Liu and Bressler, 2020; Comess et al., 2020; Terry, 2014; Wyllie and Davies, 2015).
3.3.2 Consent
The use of BD and AI in health-related contexts raises issues of free and informed consent (Stylianou and Talias, 2017; Wang and Alexander, 2020; Galetsi et al., 2019; Sparrow and Hatherley, 2019; Vollmer Dahlke and Ory, 2020; Ahmed et al., 2020; Canaway et al., 2019; Zhang et al., 2017; Manrique de Lara and Pelaez-Ballestas, 2020; Ossorio, 2014; Samuel and Derrick, 2020; Xafis et al., 2019; Casanovas et al., 2017; Gossec et al., 2020; Pepin et al., 2020; Deshpande et al., 2019). Using the populations’ data without their consent could weaken trust in institutions and researchers (Tsai and Junod, 2018). Conversely, transparent consent practices could foster trust, especially in underrepresented groups (Zou and Schiebinger, 2021). Paradoxically, there may be too few or too many moments for consent in BD and AI technologies (van Deursen and Mossberger, 2018; Montgomery et al., 2018). Also, consent regulations vary between countries and cultures (Sanchez M and Sarria-Santamera, 2019). Consent is linked to issues of accessibility, as it can enable individuals to control the use of their data (Andanda, 2019). However, informed consent does not necessarily grant people control over their data (Ienca et al., 2018). Thus, the question of control over one’s data may be more important than questions regarding consent (Kostkova et al., 2016).
Several situations compromising consent have been identified in the literature. Consent issues may arise when data is used for purposes that have not been consented to by individuals (Bellazzi, 2014) because the intervention is aiming at large populations (Thorpe and Gray, 2015a; Sanchez M and Sarria-Santamera, 2019; Ienca et al., 2018; Gilbert et al., 2020), such as public health surveillance (Aiello et al., 2020; Conway, 2014; Samuel and Derrick, 2020; Genevieve et al., 2019; Gilbert et al., 2019; Thorpe and Gray, 2015b; Park, 2021), the creation of integrated databases (Wyllie and Davies, 2015), electronic healthcare predictive analysis (Mootz et al., 2020), the linkage of data (Vogel et al., 2019; Bates et al., 2018; Salerno et al., 2017; Joda et al., 2018), biobanking (Docherty and Lone, 2015; Sanchez M and Sarria-Santamera, 2019; Cool, 2016; Mootz et al., 2020; Tigard, 2019; Snell, 2019; O’Doherty et al., 2016; Shah and Khan, 2020; Wyllie and Davies, 2015), and public health emergencies (Shachar et al., 2020). Another difficulty may come to consent for data already publicly available (Rosen et al., 2020). Passive data collection with sensors in the environment or assistive technologies (Kernaghan, 2014; Bennett, 2019; Grigorovich and Kontos, 2020; Miller, 2020; Ienca et al., 2016; Ienca et al., 2018) may also prevent consent mechanism (Manzeschke et al., 2016; van Heerden et al., 2020) and make individual unaware that personal data are collected. Registries, health data record and electronic health records raise the issue of the difficulty to opt-out of these platforms or to be aware of their secondary use (Balas et al., 2015; Joda et al., 2018; Tsai and Junod, 2018; Nakada et al., 2020) by third parties (Kerr et al., 2018; O’Doherty et al., 2016). This situation is complicated if data have already been anonymized (Joda et al., 2018).
Social networks are also sensible platforms for obtaining authentic informed consent. Personal data on these platforms can be of great interest for different actors such as HCPs (Terrasse et al., 2019), healthcare systems (Young, 2018), data brokers (Horvitz and Mulligan, 2015) and researchers (Althobaiti, 2021; Conway, 2014). In principle, public domains are open to data mining (e.g., public health research), but what constitutes a public domain is less clear regarding social media (Vayena et al., 2015; Young, 2018; Wyllie and Davies, 2015). Consent processes on these platforms can be difficult to understand (Aiello et al., 2020; Nebeker et al., 2019; Villongco and Khan, 2020; Gilbert et al., 2020) and people may be nudged to consent mechanically (Terrasse et al., 2019; Mikal et al., 2016).
For some, respecting individual rights implies consent mechanisms (Vayena et al., 2015), but the inability to use data from some populations limits its utility (Cahan et al., 2019). This raises the more general question as to whether individual consent should be sought before using BD and AIS given their potential benefits (Gilbert et al., 2019) or if we should incentivize for the voluntary donations of sensitive data (Tigard, 2019). Some authors argue that, at least, some data should be available without individuals’ consent because of its utility for efficient public health interventions (Balas et al., 2015; Gilbert et al., 2019). Thus, it may be justified to do public health surveillance without consent (Aiello et al., 2020). Also, “[i]nsistence on formal consent for big data research could cause wider societal harm, as the participation bias which might arise could skew the data to such an extent as to make results inaccurate or meaningless” (Docherty and Lone, 2015). In fact, patients may not be aware of the potential of their medical data for research and of the barriers to access it (Machluf et al., 2017) or they may consent only if they feel it is in their interest (Yang and Chen, 2018). Broadly, some laws may allow the divulgation of health information for public health activities without requiring individual consent (Thorpe and Gray, 2015b).
To respond to these issues raised by AIS and BD, new forms of consent are needed (Andanda, 2019; Zhang et al., 2017; Genevieve et al., 2019; Tigard, 2019). Broad consent is an option explored (Hemingway et al., 2018), but its universal applicability is questioned (Howe and Elenberg, 2020; van Heerden et al., 2020). Other options include meta-consent (Sanchez M and Sarria-Santamera, 2019), opt-out and dynamic consent (Andanda, 2019), a trust-based approach to consent (Pickering, 2021), and e-consent (Genevieve et al., 2019). The latter has many drawbacks: users may not read or understand the information provided in the e-consent form; there is no interaction between them and the researcher; and it is difficult to ascertain the individual’s identity (Genevieve et al., 2019). Another type of consent, opt-in consent, may promote informed consent but may result in selection bias, particularly with vulnerable populations (Bates et al., 2018; Evans et al., 2020).
3.3.3 Responsibility, accountability, and liability
AIS raises several issues at the intersection responsibility, accountability and liability (Samuel and Derrick, 2020). Authors ask who is responsible (Ladner and Ben Abdelaziz, 2018), and who is responsible for ensuring the reliability of AIS and their data (Goodman, 2020)? Accountability is connected with “quality, standards, and ethics” (Goodman, 2020) and can conflict with other public health values such as the maximization of benefits (Rosen et al., 2020). In the literature, the term “responsibility” can be used interchangeably with “accountability” and “liability.” In the most general sense, “responsibility” means to hold someone responsible for an act (Cornock, 2011). For its part, “accountability” “simply means to be called to account” (Cornock, 2011). Liability can be seen as a legal accountability which implies to the obligation of giving an account the possibility of sanction (Cornock, 2011). Although different concepts, it is not clear if such distinctions are maintained in the literature.
For the authors, it is clear that AIS in healthcare blur the notion of professional responsibility (Manrique de Lara and Pelaez-Ballestas, 2020). Who should be held accountable and who should be responsible in case an intervention based on AIS harms individuals (Sparrow and Hatherley, 2019; Jones et al., 2020; Manrique de Lara and Pelaez-Ballestas, 2020)? This problem of responsibility comes from the capacity for AI to have an agency or not (Sparrow and Hatherley, 2019). The main tendency is to make HCPs “in charge” when using medical AIS (Manrique de Lara and Pelaez-Ballestas, 2020). Because there is always a human in the loop, humans are responsible for adverse consequences (Lanier et al., 2020; Lupton and Jutel, 2015). In case of an adverse consequence resulting from the use of an AIS, we can always assess whether the HCP’s choice to use this technology was reasonable (Sparrow and Hatherley, 2019) and AIS should be held to the same degree of accountability and effectiveness as other medications and devices (Tang et al., 2018).
Outside the narrow medical field, the literature points toward several example of unclear responsibility (Carney and Kong, 2017). For example, carebots interacting with people with dementia implies agents that are not fully competent (Ienca et al., 2016). Social networks have their share of ambiguity. They can offer health related services but are not considered responsible HCP (Celedonia et al., 2021); they offer data for researchers, but they are not responsible for protecting users privacy (Andanda, 2019). Also, it is not clear who should be held liable for a device malfunction and adverse consequences (Kerr et al., 2018; Sparrow and Hatherley, 2019), or for data lack of quality and security (Kern et al., 2016; Casanovas et al., 2017): the HCP, researchers (Samuel and Derrick, 2020; Ballantyne, 2019), the developers (van Deursen and Mossberger, 2018), the manufacturer, corporations owning the technology (Andanda, 2019), the designer, purchaser of the AI, shareholders, or the AI itself (Sparrow and Hatherley, 2019)? This led Mahlmann and collaborators (Mahlmann et al., 2017) to argue that accountability needs to be at multiple levels because data used in health come from different fields with different legal responsibilities with different forms of access.”
3.3.4 Transparency
Making AIS (and the reasons for their use) transparent is a central issue in the literature (Lee and Yoon, 2017; Althobaiti, 2021; Horvitz and Mulligan, 2015; Li and Cong, 2021; Godfrey et al., 2020; Machluf et al., 2017; Tupasela et al., 2020; Vayena et al., 2015; Kirtley and O’Connor, 2020). Transparency is an important value for both AIS and population health (Kim et al., 2017; Rosen et al., 2020) as it is an essential mechanism to guarantee accountability, public support, inclusion, and trust (Sanchez M and Sarria-Santamera, 2019; Cordeiro, 2021; Ballantyne, 2019; Kostkova et al., 2016; Vayena et al., 2015). Transparency implies “openness to public scrutiny of decision-making, processes, and actions.” (Xafis et al., 2019) Transparency issues are critical at two different levels.
First, the opacity of BD-based technologies can make it impossible for external actors to understand the value of the information (Roberts, 2019). This uncertainty regarding data may occur at each step of data processing: from data collection (Morgenstern et al., 2021; Murphy et al., 2021; Evans et al., 2020; van Deursen and Mossberger, 2018; Manzeschke et al., 2016; Liu and Graham, 2021; Ajunwa et al., 2016), its storage (Manzeschke et al., 2016; Ajunwa et al., 2016), its ownership (Kostkova et al., 2016; Ajunwa et al., 2016), its sharing (Murphy et al., 2021; Manzeschke et al., 2016; Cool, 2016; Deshpande et al., 2019) to its uses (Canaway et al., 2019; Li and Cong, 2021; Evans et al., 2020; van Deursen and Mossberger, 2018). Data transparency is important for health organizations (Leyens et al., 2017) as well for patients (Ahmed et al., 2020; Ienca et al., 2016) and is seen as responsible data management (Cordeiro, 2021). However, data transparency must be balanced with other values such as confidentiality (Straw, 2021; Raza and Luheshi, 2016), privacy (Kostkova et al., 2016; Mohr et al., 2017), and innovation (Horgan and Ricciardi, 2017; Babyar, 2019).
Second, a common aspect of the transparency issue is AI’s black box problem; in other words, the fact that its results are not explainable (Ahmed et al., 2020; Morgenstern et al., 2021; Kee and Taylor-Robinson, 2020; Terrasse et al., 2019; Liu and Bressler, 2020; Murphy et al., 2021; Thomasian et al., 2021; Couch et al., 2020; Montgomery et al., 2018; Kasperbauer, 2021; Lanier et al., 2020; Pepin et al., 2020; Wongkoblap et al., 2017; Delpierre and Kelly-Irving, 2018). In the clinical context, this may impair an HCP’s capacity to identify and mitigate risks for patients, and to discuss and interpret the results (Luk et al., 2021; Sparrow and Hatherley, 2019). More generally, the black box problem may cause a loss of control for data scientists and the population (Delpierre and Kelly-Irving, 2018). When data is not made transparent, algorithmic outcomes cannot be reproduced and checked for accuracy (Tan et al., 2020). Many authors argue that AIS should be more transparent and explainable (Ahmed et al., 2020; Lodders and Paterson, 2020; Fulmer, 2019; Benke and Benke, 2018) and that developers should be transparent about the evidence supporting their product (Kirtley and O’Connor, 2020; Ienca et al., 2018), their underlying assumptions (Delpierre and Kelly-Irving, 2018), theirs ends (Delpierre and Kelly-Irving, 2018), the product’s risks, and its benefits (Kirtley and O’Connor, 2020). However, others argue that making all AIS transparent could be unrealistic because of its complexity and its understandability by only few experts (Terrasse et al., 2019). Yet others emphasize that the health sector is already full of “black boxes” (Sparrow and Hatherley, 2019) leading to the question if we may be able, 1 day, to trust black box healthcare (Manrique de Lara and Pelaez-Ballestas, 2020).
3.3.5 Trust
A lack of transparency can lead to trust issues (Sparrow and Hatherley, 2019; Straw, 2021) at different levels. At the clinical level, the incapacity to explain the results of an AIS may lead an HCP to lose trust in the system (Chen and See, 2020). The deterioration of the patient-HCP relationship can reduce the quality of healthcare services (Cordeiro, 2021) and deter patients from disclosing certain information and participating in research (Manrique de Lara and Pelaez-Ballestas, 2020; Shah and Khan, 2020; Nageshwaran et al., 2021; Rehman et al., 2022). Furthermore, trust helps clinicians and patients approve of the conclusion of an AIHT (Sparrow and Hatherley, 2019; de Graaf et al., 2015; Noorbakhsh-Sabet et al., 2019). Conversely, automatic decision-making processes could be perceived as trustworthy because of their accuracy and impartiality (Araujo et al., 2020).
At the population level, trust is a relational notion bonding citizens and institutions (Ballantyne, 2019). It facilitates the social acceptability of technologies or health practices (Bellazzi, 2014; Sanchez M and Sarria-Santamera, 2019; Balas et al., 2015), the engagement and involvement of communities in AIS development (Hunt et al., 2020; Dankwa-Mullan et al., 2018) and the cooperation of citizens in health initiatives (Ballantyne, 2019; Naudé, 2020), such as public health surveillance systems (Gilbert et al., 2019; Lodders and Paterson, 2020), and biobanking (Colloc, 2015). Trust may also be necessary to address discrimination concerns related to technologies using personal and genetic data (Trein and Wagner, 2021).
For these reasons, trustworthiness is an important ethical value for the implementation of these technologies (Althobaiti, 2021; Rosen et al., 2020; Samuel and Derrick, 2020; Samuel and Derrick, 2020; Xing et al., 2021; Prosperi et al., 2018; Mahlmann et al., 2017). More specifically, patients and the public must trust that their data is used according to their wishes (Andanda, 2019; Lodders and Paterson, 2020), that their privacy is respected (Balas et al., 2015; Abdulkareem and Petersen, 2021; Ienca et al., 2018; Thorpe and Gray, 2015b; Mohr et al., 2017) that data is safe (Fornasier, 2019; Salerno et al., 2017; Shahid et al., 2021; Tsai and Junod, 2018; Wyllie and Davies, 2015; Ienca et al., 2018) and that there are regulations governing the use of data (Tan et al., 2020). However, building and maintaining public trust is challenging (Aiello et al., 2020; Conrad et al., 2020; Hemingway et al., 2018), especially for minority groups (Zhang et al., 2017). Trust can be weakened when organizations sell data to third parties (pharmaceutical, insurance, etc.) for financial gain (Canaway et al., 2019; Gilbert et al., 2019; Kostkova et al., 2016; Tupasela et al., 2020). Weak oversight of such data-sharing (Sanchez and Sarria-Santamera, 2019; Villongco and Khan, 2020), lack of data accuracy, biases or misleading conclusions (Aiello et al., 2020; Grigorovich and Kontos, 2020; Dolley, 2018; Goodman, 2020; Vayena et al., 2015; Gilbert et al., 2019; Igual et al., 2013) and bad communication strategies (Nebeker et al., 2019) can also lead to a crisis of confidence in the technologies (Heitmueller et al., 2014). Rebuilding trust after a loss from the public can be challenging (Bates et al., 2018).
3.3.6 Social acceptability
As discussed above, trust facilitates social acceptability, which is a “primary concern” related to using AIS and BD (Tang et al., 2018). This notion is associated with popular support, which is necessary for data collection (Katapally, 2020), the successful implementation of AIBD technologies (Mootz et al., 2020; Prosperi et al., 2018; Esmaeilzadeh, 2020; Salas-Vega et al., 2015) and the viability of product development or research endeavors (Canaway et al., 2019; Cool, 2016). Little research has explored users’ acceptability of AIS and BD technologies (Wongkoblap et al., 2017; Igual et al., 2013), but some articles have shown that public attitudes toward these technologies may vary depending of their aim (Nakada et al., 2020), data ownership (Ienca et al., 2018) and the perception of subpopulations (Heitmueller et al., 2014). Furthermore, people might be more willing to tolerate data sharing and privacy breaches if they consider that it is for the common good (Gilbert et al., 2020) and if they understand what AIS can offer them personally in terms of health outcomes (Kelly et al., 2020). On the HCP’s side, various factors can influence their support for AIS such as the characteristics of the technology, their knowledge, their opinions, external factors (e.g., patient and health professional interaction), and the organizational capacity to implement it (Kelly et al., 2020). During the COVID-19 pandemic, the fear of infection and death affecting individuals and their families has led to a growing understanding of the importance of public health and therefore contributed to increasing the acceptability of health surveillance (Couch et al., 2020). The pandemic also contributed to an acquired familiarity with telemedicine services and digital health platforms (Ho et al., 2020). However, if AIS do not meet ethical standards, stakeholders might be opposed to their implementation and therefore those technologies will not reach the populations for which they were designed (Abramoff et al., 2021).
4 Discussion
This review synthesized the state of knowledge on the ethical issues of the combined use of AIS and BD in the context of population health. The literature suggests that these technologies may affect every component of population health. At this stage, the literature still debates if the technologies will lead to positive or negative outcomes. Positive outcomes are mostly conceived as an optimization of existent health and research activities. Those who focus on negative outcomes are concerned about communities potentially becoming overly reliant on digital systems as a result of the anticipated AI revolution. An important challenge will be distribution of the benefits and burdens of these technological transformations. There are strong voices anticipating that this distribution will be unfair between populations and inside populations and that it will reinforce prevailing inequities.
This synthesis reveals the need for a balanced perspective, as the potential benefits of AIS and BD, such as precision public health and improved decision-making, are accompanied by substantial ethical risks. A more nuanced approach to interpreting results is essential, particularly one that explicitly addresses both benefits and risks with real-world examples. For instance, initiatives like the “AI for Good” projects by global organizations highlight pathways for leveraging AI ethically, particularly in underrepresented communities.
Aside from these outcomes, we can expect that AIS and BD will affect upstream determinants of health. Because of the ubiquitous nature of BD and AI (Benke and Benke, 2018), these technologies may penetrate every aspect of our existence and, by extension, every element contributing to the overall health of communities. Regardless of this baffling projection, our review encourages to look at specific patterns of health determinant that are considered, to this day, more sensitive to the influence of AI and BD technologies. However, these upstream effects also raise critical concerns about data access and ownership, particularly in the context of global inequities. For example, data collected in LMICs often benefits high-income settings disproportionately, perpetuating patterns of digital colonialism. Interventions addressing these disparities might include creating localized data governance frameworks that empower LMIC stakeholders to oversee and benefit from the use of their data. Developing equitable access to AI training and infrastructure is another pathway to mitigate these issues.
If we look in more detail to the effect of AIS and BD on the determinants of health, the first pattern of health determinants our review identified relates to healthy behaviors. Authors are dubious if the technologies will assist individuals in adopting health behaviors personalized to their conditions. To attain this goal, developing digital and ethical literacy in all segments of the population appears to be an inevitable avenue. A similar doubt persists in the discussion on AIS and BD effects regarding the access and quality of healthcare, the second pattern of health determinants identified in the review. On the one side, the literature argue that the technologies will assist HCPs in their daily tasks, while on the other side, they will accentuate the workload of HCPs and contribute to their deskilling because of their increased dependency on the technology. Further, the impact on health behaviors highlights the importance of patient trust and engagement. Enhancing transparency in AIS can improve trust and empower patients. For example, using explainable AI (XAI) systems in clinical decision-making could foster a stronger relationship between healthcare professionals (HCPs) and patients, as it allows for clearer communication of how decisions are reached. Implementing dynamic consent models could also enhance patients’ control over their data, addressing trust and autonomy concerns simultaneously.
The third pattern deals with the idea that, with the growing recourse to digital health apparatuses, data infrastructures will become a new determinant of population health. Who control data and has access to it will shape profoundly how the benefits and burdens of the technologies will be distributed globally. To ensure equitable outcomes, international data-sharing agreements must incorporate ethical safeguards. For instance, mechanisms for broad but controlled access to non-proprietary datasets, akin to the open science movement, could promote collaboration while protecting sensitive information. Moreover, innovative models like “data trusts,” where communities collectively manage their data, could provide an ethical way to balance privacy, transparency, and accessibility.
The last component of population health relates to interventions and policies. From an ethical perspective, population health interventions are essentially examined on their capacity to generate a complex trade-off between health goals, economic profit, scientific innovation, and collective moral values. The literature advise that we should give a particular attention to how any intervention or policy value privacy protection, free and informed consent, responsibility, and transparency. Respecting these values will contribute to two other inextricable values that are trust and social acceptability, which are essential in the implementation of all population health interventions and policies. Transparency is particularly critical in overcoming the “black box” issue prevalent in many AIS. Embedding requirements for explainability in AI regulatory frameworks could improve not only clinical decision-making but also public trust. Policymakers should look to best practices from other domains, such as the EU’s General Data Protection Regulation (GDPR), which could inspire guidelines on managing data and ensuring accountability.
An additional domain warranting attention involves the epistemological assumptions underpinning AI and BD systems and the statistical fragilities embedded in data-driven models. Much of the literature we reviewed does not critically engage with the capacity of BD and AI to produce valid insights through sheer volume, pattern recognition, and algorithmic refinement. Yet, epistemologically, these systems often prioritize correlation over causation, prediction over explanation, and model fit over interpretive depth; raising foundational questions about what kind of knowledge they generate and how it should inform population health decisions (Leonelli, 2019). Furthermore, the statistical reliability of these systems is subject to multiple threats, including overfitting, selection bias, spurious correlations, and algorithmic opacity (Stiglic et al., 2020), which can lead to “hallucinations,” especially with large language models, which can have extremely significant impacts in high-stake setting such as medicine (Bélisle-Pipon, 2024). In population health, where interventions rest on population-level inferences, such errors may propagate systemic misclassifications or misleading policy signals. A theory-driven approach, integrating causal inference, domain expertise, and interpretive reasoning, remains critical to counterbalance the limits of purely data-driven methods (Cavique, 2024; Pearl and Mackenzie, 2018). The absence of this epistemic reflection risks reinforcing technocratic approaches that obscure value-laden judgments beneath a veneer of objectivity. Future ethical appraisals must scrutinize not only what AI and BD do, but also how they know.
Overall, the literature speculates that AIS using BD will affect population health in an unprecedent manner and with ethical consequences. There are no components of population health that will be immune to the penetration of these technologies in the numerous activities of the actors in the field. It is anticipated that the technologies will shape the determinants of health as well as the interventions and policies aimed at working positively on these determinants.
4.1 Engaging with actionable insights
To move beyond theoretical considerations, actionable recommendations may support stakeholder engagement in answering these questions. Policymakers, developers, healthcare professionals, and researchers each have a role in ensuring the ethical deployment of AIS and BD. Table 4 outlines specific actions for these groups, aligned with key ethical principles and lifecycle phases (Collins et al., 2024).
Table 4 seeks to supplement the review findings with a structured summary of ethical governance strategies, organized by stakeholder group, type of intervention, and the specific lifecycle phase of AIS and BD systems. This kind of lifecycle mapping has been increasingly recommended to operationalize ethical principles across the development, implementation, and decommissioning of AI technologies (Floridi et al., 2018; Collins et al., 2024). The table foregrounds concrete roles (from data stewardship and explainability enforcement to bias audits and participatory co-design) offering a modular governance framework adapted to both institutional and technical contexts (Pacia et al., 2024; Morley et al., 2020). Developers, clinicians, patients, policymakers, and civil society actors are presented not as passive recipients of ethical guidance, but as active agents responsible for aligning technological deployment with public values (Vayena et al., 2018; Bélisle-Pipon and Victor, 2024). Crucially, we emphasize that governance must extend beyond static principle-based declarations, incorporating iterative accountability mechanisms throughout the system’s operational life (Mittelstadt, 2019). Table 4 is designed as both a synthesis and a practical entry point for translating ethics into targeted interventions at specific moments in the AI and BD lifecycle.
The review results resonate with other reviews on the ethics of AI and BD in healthcare (Morley et al., 1982; Murphy et al., 2021; Bélisle-Pipon et al., 2021; d’Elia et al., 2022). However, our results take their distance from a perspective centered on individual, medical and clinical care, to adopt the more global perspective of population health and upstream determinants of health. There are no clearcut demarcations between individual and population health, but technologies such as AI and BD generate their own blurring of these distinctions by offering the technological means to move from set of data pertaining to large group of individuals to conclusion applying to a specific individual. This blurring, or what Shipton and Vitale refer to a “politic of avoidance” (Shipton and Vitale, 2024), should not obscure that the technologies may affect entire populations and health determinants in a subtle manner as suggested by the present review.
4.2 Limits
While this review provides a comprehensive synthesis of the ethical issues surrounding AI and big data in population health, it is important to acknowledge certain limitations that could impact the breadth and applicability of the findings. One of the most significant limitations is the temporal scope of the literature considered. The review synthesizes articles published up to November 2021, meaning that it does not account for advancements, challenges, or ethical insights that have emerged in the last 4 years—a period characterized by rapid technological evolution and significant global events.
The exclusion of literature beyond 2021 omits critical developments in the field, such as the rise of generative AI systems, including large language models like GPT (e.g., ChatGPT’s GPT-4), which have revolutionized AI applications across industries, including healthcare. These systems have introduced new ethical dimensions, such as the propagation of misinformation, explainability issues, and risks of misuse in clinical and public health contexts. These topics, largely absent from the pre-2021 literature, represent key areas of concern that would likely require attention in an updated analysis. Additionally, the review does not address the broader implications of post-pandemic technological advancements. The COVID-19 pandemic significantly accelerated the adoption of AI technologies for public health surveillance, vaccine distribution, remote patient monitoring, and digital contact tracing. The normalization of such technologies has raised new ethical questions around privacy, consent, and equity, particularly in how these tools have been used to monitor populations at scale. These shifts are likely underexplored in the reviewed literature due to the timing of the search.
Since 2021, there have also been important regulatory and ethical developments, such as the European Union’s Artificial Intelligence Act and a growing emphasis on data sovereignty globally. These developments reflect a shift toward formalized governance frameworks that seek to address many of the concerns raised in this review. However, the analysis in this study predates these frameworks, which limits its ability to reflect the current regulatory landscape and its implications for population health. Equity and inclusion have also emerged as prominent themes in recent AI research. Advances in methodologies for debiasing algorithms, participatory AI design, and equity audits have provided tools to promote fairness and inclusivity in AI systems. These tools, while critical to addressing disparities in healthcare, are underrepresented in the body of literature included in this review. Similarly, the environmental impact of AI, particularly the carbon footprint of training large-scale models, has become an increasingly important ethical consideration that was likely not a major focus of studies published before 2022.
This temporal limitation risks presenting an incomplete or outdated understanding of the ethical landscape of AI and big data in population health. Omitting key developments from recent years could lead to an overemphasis on challenges identified in earlier stages of technological maturity while neglecting the ethical issues arising from newer applications and regulatory responses. It also limits the capacity to provide actionable insights for addressing contemporary ethical dilemmas in the field. To address this limitation, future research must prioritize updating the review to include studies published since 2021. Incorporating more recent developments will ensure that the findings remain relevant and responsive to current trends. Additionally, establishing a mechanism for periodic review updates, such as every two to 3 years, could help maintain the relevance of the synthesis over time. Engaging with practitioners and experts working on the front lines of AI ethics in healthcare could further complement the literature, adding real-world insights into the ongoing evolution of these technologies.
4.3 Future research
Considering the limitations of our review process, we would like to conclude by pointing avenues of research on the ethics of AIS and BD in population health that have been discussed since the end of our data analysis (Couture and Bélisle-Pipon, 2023).
Future research will have to integrate the effect of AIS and BD on other important health determinants. For example, policymakers will have to recognize the environmental cost of AIS and BD infrastructures and their consequences on the health of communities (Couture et al., 2023). The disinformation capacity of AI represents another serious threat for the implementation of any health interventions, but also for the stability political institutions (Federspiel et al., 2023). The use of AIS in warfare will also have to be considered as well as the health outcomes of the global transformation of employment and workplace conditions that are taking place with the diffusion of AIS (Federspiel et al., 2023).
To complete this task, AI ethics will need to widen its scope and follow the lead of population health in evaluating the deployment of AI and BD. Future research will need to answer three essential ethical questions: Do the interventions and policies using these technologies have a positive effect on patterns of health determinants? Do this positive outcome is obtained while sufficiently respecting collective moral values? Do the amalgamation of all these specific interventions and policies contribute, at the end, to a just society?
In answering these questions, a deeper integration of cross-disciplinary frameworks is essential. For example, justice-oriented approaches from bioethics could be combined with data science methodologies to develop predictive models that prioritize fairness and equity. Stakeholder engagement, especially involving marginalized populations, should become a cornerstone of both research and implementation to ensure that technologies align with societal values.
Data availability statement
The data analyzed in this study is subject to the following licenses/restrictions: The dataset is mostly qualitative. Please contact the corresponding author. Requests to access these datasets should be directed to dmluY2VudC5jb3V0dXJlQHVtb250cmVhbC5jYQ==.
Author contributions
VC: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. M-CR: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review and editing. ED: Data curation, Formal analysis, Investigation, Validation, Writing – original draft, Writing – review and editing. FT: Data curation, Formal analysis, Investigation, Writing – original draft, Writing – review and editing. J-CB-P: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review and editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. We received a seeding grant and publication grant from the Réseau de recherche en santé des populations du Québec (RRSPQ).
Acknowledgments
We would like to thank the Quebec Population Health Research Network (RRSPQ) for its financial support. Vincent Couture would like to recognize the intellectual support of Professor Anne-Marie Turcotte-Tremblay.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fsoc.2025.1536389/full#supplementary-material
References
Abdulkareem, M. , and Petersen, S. E. (2021). The promise of AI in detection, diagnosis, and epidemiology for combating COVID-19: beyond the hype. Front. Artif. Intell. 4:652669. doi: 10.3389/frai.2021.652669
Abramoff, M. D. , Cunningham, B. , Patel, B., et al. (2021). Foundational considerations for artificial intelligence using ophthalmic images. Ophthalmology 129, e14–e32. doi: 10.1016/j.ophtha.2021.08.023
Adkins, D. E. (2017). Machine learning and electronic health records: a paradigm shift. Am. J. Psychiatry 174, 93–94. doi: 10.1176/appi.ajp.2016.16101169
Aebi, N. J. , De Ridder, D. , Ochoa, C., et al. (2021). Can big data be used to monitor the mental health consequences of COVID-19? Int. J. Public Health 66:633451. doi: 10.3389/ijph.2021.633451
Ahmed, Z. , Mohamed, K. , Zeeshan, S. , and Dong, X. (2020). Artificial intelligence with multi-functional machine learning platform development for better healthcare and precision medicine. Database 2020:baaa010. doi: 10.1093/database/baaa010
Aiello, A. E. , Renson, A. , and Zivich, P. N. (2020). Social media- and internet-based disease surveillance for public health. Annu. Rev. Public Health 41, 101–118. doi: 10.1146/annurev-publhealth-040119-094402
Ajunwa, I. , Crawford, K. , and Ford, J. S. (2016). Health and big data: an ethical framework for health information collection by corporate wellness programs. J. Law Med. Ethics 44, 474–480. doi: 10.1177/1073110516667943
Alami, H. , Rivard, L. , Lehoux, P. , Hoffman, S. J. , Cadeddu, S. B. M. , Savoldelli, M., et al. (2020). Artificial intelligence in health care: laying the Foundation for Responsible, sustainable, and inclusive innovation in low- and middle-income countries. Glob. Health 16:52. doi: 10.1186/s12992-020-00584-1
Alemayehu, D. , and Berger, M. L. (2016). Big data: transforming drug development and health policy decision making. Health Serv. Outcome Res. Methodol. 16, 92–102. doi: 10.1007/s10742-016-0144-x
Altenburger, K. M. , and Ho, D. E. (2019). When algorithms import private bias into public enforcement: the promise and limitations of statistical debiasing solutions. J. Inst. Theor. Econ. 175, 98–122. doi: 10.1628/jite-2019-0001
Althobaiti, K. (2021). Surveillance in next-generation personalized healthcare: science and ethics of data analytics in healthcare. New Bioeth. 27, 295–319. doi: 10.1080/20502877.2021.1993055
Amann, J. , Blasimme, A. , Vayena, E. , Frey, D. , and Madai, V. I.Precise4Q consortium (2020). Explainability for artificial intelligence in healthcare: a multidisciplinary perspective. BMC Med. Inform. Decis. Mak. 20:310. doi: 10.1186/s12911-020-01332-6
Andanda, P. (2019). Towards a paradigm shift in governing data access and related intellectual property rights in big data and health-related research. IIC 50, 1052–1081. doi: 10.1007/s40319-019-00873-2
Anisetti, M. , Ardagna, C. , Bellandi, V. , Cremonini, M. , Frati, F. , and Damiani, E. (2018). Privacy-aware big data analytics as a service for public health policies in smart cities. Sustain. Cities Soc. 39, 68–77. doi: 10.1016/j.scs.2017.12.019
Araujo, T. , Helberger, N. , Kruikemeier, S. , and de Vreese, C. (2020). In AI we trust? Perceptions about automated decision-making by artificial intelligence. AI Soc. 35, 611–623. doi: 10.1007/s00146-019-00931-w
Arksey, H. , and O’Malley, L. (2005). Scoping studies: towards a methodological framework. Int. J. Soc. Res. Methodol. 8, 19–32. doi: 10.1080/1364557032000119616
Babyar, J. (2019). Conversations and connections: improving real-time health data on behalf of public interest. Health Technol. 9, 245–249. doi: 10.1007/s12553-019-00296-6
Backholer, K. , Baum, F. , Finlay, S. M. , Friel, S. , Giles-Corti, B. , Jones, A., et al. (2021). Australia in 2030: what is our path to health for all? Med. J. Aust. 214, S5–S40. doi: 10.5694/mja2.51020
Baclic, O. , Tunis, M. , Young, K. , Doan, C. , Swerdfeger, H. , and Schonfeld, J. (2020). Challenges and opportunities for public health made possible by advances in natural language processing. Can. Commun. Dis. Rep. 46, 161–168. doi: 10.14745/ccdr.v46i06a02
Balas, E. A. , Vernon, M. , Magrabi, F. , Gordon, L. T. , and Sexton, J. (2015). “Big data clinical research: validity, ethics, and regulation” in Medinfo 2015 Ehealth-enabled health. eds. I. N. Sarkar , A. Georgiou , and P. M. D. Marques (Amsterdam: Ios Press), 448–452.
Baldassarre, A. , Mucci, N. , Padovan, M. , Pellitteri, A. , Viscera, S. , Lecca, L. I., et al. (2020). The role of electrocardiography in occupational medicine, from Einthoven’s invention to the digital era of wearable devices. Int. J. Environ. Res. Public Health 17, 1–23. doi: 10.3390/ijerph17144975
Ballantyne, A. (2019). Adjusting the focus: a public health ethics approach to data research. Bioethics 33, 357–366. doi: 10.1111/bioe.12551
Barreto, M. L. , and Rodrigues, L. C. (2018). Linkage of administrative datasets: enhancing longitudinal epidemiological studies in the era of “big data.”. Curr. Epidemiol. Rep. 5, 317–320. doi: 10.1007/s40471-018-0177-5
Bates, D. W. , Heitmueller, A. , Kakad, M. , and Saria, S. (2018). Why policymakers should care about “big data” in healthcare. Health Policy Technol. 7, 211–216. doi: 10.1016/j.hlpt.2018.04.006
Bélisle-Pipon, J.-C. (2024). Why we need to be careful with LLMs in medicine. Front. Med. 11:1495582. doi: 10.3389/fmed.2024.1495582
Bélisle-Pipon, J.-C. , Couture, V. , Roy, M.-C. , Ganache, I. , Goetghebeur, M. , and Cohen, I. G. (2021). What makes artificial intelligence exceptional in health technology assessment? Front. Artif. Intell. 4:736697. doi: 10.3389/frai.2021.736697
Bélisle-Pipon, J.-C. , and Victor, G. (2024). Ethics dumping in artificial intelligence. Front. Artif. Intell. 7:1426761. doi: 10.3389/frai.2024.1426761
Belk, R. (2020). Ethical issues in service robotics and artificial intelligence. Serv. Ind. J. 41, 860–876. doi: 10.1080/02642069.2020.1727892
Bellazzi, R. (2014). Big data and biomedical informatics: a challenging opportunity. Yearb. Med. Inform. 23, 08–13. doi: 10.15265/IY-2014-0024
Benke, K. , and Benke, G. (2018). Artificial intelligence and big data in public health. Int. J. Environ. Res. Public Health 15:2796. doi: 10.3390/ijerph15122796
Bennett, B. (2019). Technology, ageing and human rights: challenges for an ageing world. Int. J. Law Psychiatry 66:101449. doi: 10.1016/j.ijlp.2019.101449
Bhattacharya, S. , Hossain, M. M. , Juyal, R. , Sharma, N. , Pradhan, K. B. , and Singh, A. (2021). Role of public health ethics for responsible use of artificial intelligence technologies. Indian J. Community Med. 46, 178–181. doi: 10.4103/ijcm.IJCM_62_20
Braun, V. , and Clarke, V. (2012). “Thematic analysis” in APA handbook of research methods in psychology, vol. 2: research designs: quantitative, qualitative, neuropsychological, and biological. eds. H. Cooper , P. M. Camic , D. L. Long , A. T. Panter , D. Rindskopf , and K. J. Sher (Washington, DC: American Psychological Association), 57–71.
Breen, N. , Berrigan, D. , Jackson, J. S. , Wong, D. W. S. , Wood, F. B. , Denny, J. C., et al. (2019). Translational health disparities research in a data-rich world. Health Equity 3, 588–600. doi: 10.1089/heq.2019.0042
Brill, S. B. , Moss, K. O. , and Prater, L. (2019). Transformation of the doctor-patient relationship: big data, accountable care, and predictive health analytics. HEC Forum 31, 261–282. doi: 10.1007/s10730-019-09377-5
Budd, J. , Miller, B. S. , Manning, E. M. , Lampos, V. , Zhuang, M. , Edelstein, M., et al. (2020). Digital technologies in the public-health response to COVID-19. Nat. Med. 26, 1183–1192. doi: 10.1038/s41591-020-1011-4
Cahan, E. M. , Hernandez-Boussard, T. , Thadaney-Israni, S. , and Rubin, D. L. (2019). Putting the data before the algorithm in big data addressing personalized healthcare. NPJ Digit Med 2:78. doi: 10.1038/s41746-019-0157-2
Canaway, R. , Boyle, D. I. R. , Manski-Nankervis, J.-A. E. , Bell, J. , Hocking, J. S. , Clarke, K., et al. (2019). Gathering data for decisions: best practice use of primary care electronic records for research. Med. J. Aust. 210, S12–S16. doi: 10.5694/mja2.50026
Car, J. , Sheikh, A. , Wicks, P. , and Williams, M. S. (2019). Beyond the hype of big data and artificial intelligence: building foundations for knowledge and wisdom. BMC Med. 17:143. doi: 10.1186/s12916-019-1382-x
Carney, T. J. , and Kong, A. Y. (2017). Leveraging health informatics to foster a smart systems response to health disparities and health equity challenges. J. Biomed. Inform. 68, 184–189. doi: 10.1016/j.jbi.2017.02.011
Casanovas, P. , Mendelson, D. , and Poblet, M. (2017). A linked democracy approach for regulating public health data. Health Technol. 7, 519–537. doi: 10.1007/s12553-017-0191-5
Castagno, S. , and Khalifa, M. (2020). Perceptions of artificial intelligence among healthcare staff: a qualitative survey study. Front. Artif. Intell. 3:578983. doi: 10.3389/frai.2020.578983
Cavique, L. (2024). Implications of causality in artificial intelligence. Front. Artif. Intell. 7:1439702. doi: 10.3389/frai.2024.1439702
Celedonia, K. L. , Corrales Compagnucci, M. , Minssen, T. , and Lowery Wilson, M. (2021). Legal, ethical, and wider implications of suicide risk detection systems in social media platforms. J. Law Biosci. 8:lsab021. doi: 10.1093/jlb/lsab021
Chen, J. , and See, K. C. (2020). Artificial intelligence for COVID-19: rapid review. J. Med. Internet Res. 22:e21476. doi: 10.2196/21476
Cheng, C.-Y. , Soh, Z. D. , Majithia, S. , Thakur, S. , Rim, T. H. , Tham, Y. C., et al. (2020). Big data in ophthalmology. Asia-Pac. J. Ophthalmol. 9, 291–298. doi: 10.1097/APO.0000000000000304
Cheung, S. (2020). Disambiguating the benefits and risks from public health data in the digital economy. Big Data Soc. 7:205395172093392. doi: 10.1177/2053951720933924
Cheung, K.-S. , Leung, W. K. , and Seto, W.-K. (2019). Application of big data analysis in gastrointestinal research. World J. Gastroenterol. 25, 2990–3008. doi: 10.3748/wjg.v25.i24.2990
Collins, B. X. , Bélisle-Pipon, J.-C. , Evans, B. J. , Ferryman, K. , Jiang, X. , Nebeker, C., et al. (2024). Addressing ethical issues in healthcare artificial intelligence using a lifecycle-informed process. JAMIA Open 7:ooae108. doi: 10.1093/jamiaopen/ooae108
Colloc, J. (2015). Health and big data: the state and the individuals, powerless in front of powers of networks. Espace Polit. :3493. doi: 10.4000/espacepolitique.3493
Comess, S. , Akbay, A. , Vasiliou, M. , Hines, R. N. , Joppa, L. , Vasiliou, V., et al. (2020). Bringing big data to bear in environmental public health: challenges and recommendations. Front Artif Intell. 3:31. doi: 10.3389/frai.2020.00031
Conrad, K. , Shoenfeld, Y. , and Fritzler, M. J. (2020). Precision health: a pragmatic approach to understanding and addressing key factors in autoimmune diseases. Autoimmun. Rev. 19:102508. doi: 10.1016/j.autrev.2020.102508
Conway, M. (2014). Ethical issues in using twitter for public health surveillance and research: developing a taxonomy of ethical concepts from the research literature. J. Med. Internet Res. 16:e290. doi: 10.2196/jmir.3617
Cool, A. (2016). Detaching data from the state: biobanking and building big data in Sweden. BioSocieties 11, 277–295. doi: 10.1057/biosoc.2015.25
Cordeiro, J. V. (2021). Digital technologies and data science as health enablers: an outline of appealing promises and compelling ethical, legal, and social challenges. Front. Med. 8:647897. doi: 10.3389/fmed.2021.647897
Cornock, M. (2011). Legal definitions of responsibility, accountability and liability: Marc Cornock clarifies the use of terms that are sometimes used interchangeably but have distinct ramifications in law. Nurs. Child. Young People 23, 25–26. doi: 10.7748/ncyp2011.04.23.3.25.c8417
Couch, D. L. , Robinson, P. , and Komesaroff, P. A. (2020). COVID-19-extending surveillance and the panopticon. J. Bioethical. Inq. 17, 809–814. doi: 10.1007/s11673-020-10036-5
Couture, V. , and Bélisle-Pipon, J.-C. (2023). Artificial intelligence as a threat for global health. BMJ Glob. Health. https://gh.bmj.com/content/artificial-intelligence-threat-global-health
Couture, V. , Roy, M.-C. , Dez, E. , Laperle, S. , and Bélisle-Pipon, J.-C. (2023). Ethical implications of artificial intelligence in population health and the public’s role in its governance: perspectives from a citizen and expert panel. J. Med. Internet Res. 25:e44357. doi: 10.2196/44357
Cutrona, S. L. , Toh, S. , Iyer, A. , Foy, S. , Cavagnaro, E. , Forrow, S., et al. (2012). Design for validation of acute myocardial infarction cases in Mini-sentinel. Pharmacoepidemiol. Drug Saf. 21, 274–281. doi: 10.1002/pds.2314
d’Elia, A. , Gabbay, M. , Rodgers, S. , Kierans, C. , Jones, E. , Durrani, I., et al. (2022). Artificial intelligence and health inequities in primary care: a systematic scoping review and framework. Fam. Med. Community Health. 10:e001670. doi: 10.1136/fmch-2022-001670
Dagi, T. F. (2017). Seven ethical issues affecting neurosurgeons in the context of health care reform. Neurosurgery 80, S83–S91. doi: 10.1093/neuros/nyx017
Dankwa-Mullan, I. , Rivo, M. , Sepulveda, M. , Park, Y. , Snowdon, J. , and Rhee, K. (2018). Transforming diabetes care through artificial intelligence: the future is here. Popul. Health Manag. 22, 229–242. doi: 10.1089/pop.2018.0129
Dankwa-Mullan, I. , Scheufele, E. , Matheny, M. , Quintana, Y. , Chapman, W. , Jackson, G., et al. (2021). A proposed framework on integrating health equity and racial justice into the artificial intelligence development lifecycle. J. Health Care Poor Underserved 32, 300–317. doi: 10.1353/hpu.2021.0065
de Graaf, M. M. A. , Ben Allouch, S. , and Klamer, T. (2015). Sharing a life with Harvey: exploring the acceptance of and relationship-building with a social robot. Comput. Hum. Behav. 43, 1–14. doi: 10.1016/j.chb.2014.10.030
Degeling, C. , Chen, G. , Gilbert, G. , Brookes, V. , Thai, T. , Wilson, A., et al. (2020). Changes in public preferences for technologically enhanced surveillance following the COVID-19 pandemic: a discrete choice experiment. BMJ Open 10:e041592. doi: 10.1136/bmjopen-2020-041592
Delpierre, C. , and Kelly-Irving, M. (2018). Big data and the study of social inequalities in health: expectations and issues. Front. Public Health 6:312. doi: 10.3389/fpubh.2018.00312
Demuro, P. , Petersen, C. , and Turner, P. (2020). Health “big data” value, benefit, and control: the patient ehealth equity gap. Stud Health Technol Inform 270, 1123–1127. doi: 10.3233/SHTI200337
Deshpande, P. , Rasin, A. , Furst, J. , Raicu, D. , and Antani, S. (2019). Diis: a biomedical data access framework for aiding data driven research supporting FAIR principles. Data 4:54. doi: 10.3390/data4020054
Docherty, A. B. , and Lone, N. I. (2015). Exploiting big data for critical care research. Curr. Opin. Crit. Care 21, 467–472. doi: 10.1097/MCC.0000000000000228
Dolley, S. (2018). Big data’s role in precision public health. Front. Public Health 6:68. doi: 10.3389/fpubh.2018.00068
Eng, T. R. (2004). Population health technologies - emerging innovations for the health of the public. Am. J. Prev. Med. 26, 237–242. doi: 10.1016/j.amepre.2003.12.004
Esmaeilzadeh, P. (2020). Use of AI-based tools for healthcare purposes: a survey study from consumers’ perspectives. BMC Med. Inform. Decis. Mak. 20:170. doi: 10.1186/s12911-020-01191-1
Evans, E. A. , Delorme, E. , Cyr, K. , and Goldstein, D. M. (2020). A qualitative study of big data and the opioid epidemic: recommendations for data governance. BMC Med. Ethics 21:101. doi: 10.1186/s12910-020-00544-9
Federspiel, F. , Mitchell, R. , Asokan, A. , Umana, C. , and McCoy, D. (2023). Threats by artificial intelligence to human health and human existence. BMJ Glob. Health 8:e010435. doi: 10.1136/bmjgh-2022-010435
Fjeld, J. , Hilligoss, H. , Achten, N. , Daniel, M. L. , Feldman, J. , and Kagay, S. (2019) Principled artificial intelligence: a map of ethical and rights-based approaches. Berkman Klein Center for Internet and Society, Harvard University.
Fleming, M. N. (2021). Considerations for the ethical implementation of psychological assessment through social media via machine learning. Ethics Behav. 31, 181–192. doi: 10.1080/10508422.2020.1817026
Floridi, L. , Cowls, J. , Beltrametti, M. , Chatila, R. , Chazerand, P. , Dignum, V., et al. (2018). AI4People—an ethical framework for a good AI society: opportunities, risks, principles, and recommendations. Minds Mach 28, 689–707. doi: 10.1007/s11023-018-9482-5
Fornasier, M. d. O. (2019). The applicability of the internet of things (IoT) between fundamental rights to health and to privacy. Rev. Invest. Constit. 6, 297–321. doi: 10.5380/rinc.v6i2.67592
Fulmer, R. (2019). Artificial intelligence and counseling: four levels of implementation. Theor. Psychol. 29, 807–819. doi: 10.1177/0959354319853045
Galetsi, P. , Katsaliaki, K. , and Kumar, S. (2019). Values, challenges and future directions of big data analytics in healthcare: a systematic review. Soc. Sci. Med. 241:112533. doi: 10.1016/j.socscimed.2019.112533
Genevieve, L. D. , Martani, A. , Wangmo, T., et al. (2019). Participatory disease surveillance systems: ethical framework. J. Med. Internet Res. 21:e12273. doi: 10.2196/12273
Gilbert, G. L. , Degeling, C. , and Johnson, J. (2019). Communicable disease surveillance ethics in the age of big data and new technology. Asian Bioeth Rev. 11, 173–187. doi: 10.1007/s41649-019-00087-1
Gilbert, J.-P. , Ng, V. , Niu, J. , and Rees, E. E. (2020). A call for an ethical framework when using social media data for artificial intelligence applications in public health research. Can. Commun. Dis. Rep. 46, 169–173. doi: 10.14745/ccdr.v46i06a03
Godfrey, A. , Goldsack, J. C. , Tenaerts, P. , Coravos, A. , Aranda, C. , Hussain, A., et al. (2020). BioMeT and algorithm challenges: a proposed digital standardized evaluation framework. IEEE J. Transl. Eng. Health Med. 8, 1–8. doi: 10.1109/JTEHM.2020.2996761
Goldsmith, J. , Sun, Y. , Fried, L. P. , Wing, J. , Miller, G. W. , and Berhane, K. (2021). The emergence and future of public health data science. Public Health Rev. 42:1604023. doi: 10.3389/phrs.2021.1604023
Goodman, K. W. (2020). Ethics in health informatics. Yearb. Med. Inform. 29, 026–031. doi: 10.1055/s-0040-1701966
Gossec, L. , Kedra, J. , Servy, H. , Pandit, A. , Stones, S. , Berenbaum, F., et al. (2020). EULAR points to consider for the use of big data in rheumatic and musculoskeletal diseases. Ann. Rheum. Dis. 79, 69–76. doi: 10.1136/annrheumdis-2019-215694
Green, S. , and Vogt, H. (2016). Personalizing medicine: disease prevention in silico and in socio. Humana Mente 30, 105–145. https://www.humanamente.eu/index.php/HM/article/view/62
Grigorovich, A. , and Kontos, P. (2020). Towards responsible implementation of monitoring Technologies in Institutional Care. The Gerontologist 60, 1194–1201. doi: 10.1093/geront/gnz190
Heitmueller, A. , Henderson, S. , Warburton, W. , Elmagarmid, A. , Pentland, A. S. , and Darzi, A. (2014). Developing public policy to advance the use of big data in health care. Health Aff. 33, 1523–1530. doi: 10.1377/hlthaff.2014.0771
Hemingway, H. , Asselbergs, F. W. , Danesh, J. , Dobson, R. , Maniadakis, N. , Maggioni, A., et al. (2018). Big data from electronic health records for early and late translational cardiovascular research: challenges and potential. Eur. Heart J. 39, 1481–1495. doi: 10.1093/eurheartj/ehx487
Ho, C. W.-L. , and Caals, K. (2021). A call for an ethics and governance action plan to harness the power of artificial intelligence and digitalization in nephrology. Semin. Nephrol. 41, 282–293. doi: 10.1016/j.semnephrol.2021.05.009
Ho, C. W.-L. , Caals, K. , and Zhang, H. (2020). Heralding the digitalization of life in post-pandemic east Asian societies. J. Bioethical. Inq. 17, 657–661. doi: 10.1007/s11673-020-10050-7
Hodgson, S. , Fecht, D. , Gulliver, J. , Daby, H. I. , Piel, F. B. , Yip, F., et al. (2020). Availability, access, analysis and dissemination of small-area data. Int. J. Epidemiol. 49, I4–I14. doi: 10.1093/ije/dyz051
Hoffman, S. , and Podgurski, A. (2013). The use and misuse of biomedical data: is bigger really better? Am. J. Law Med. 39, 497–538. doi: 10.1177/009885881303900401
Holzmeyer, C. (2021). Beyond “AI for social good” (AI4SG): social transformations-not tech-fixes-for health equity. Interdiscip. Sci. Rev. 46, 94–125. doi: 10.1080/03080188.2020.1840221
Horgan, D. , and Ricciardi, W. (2017). Leviathan, or the rudder of public health. Biomed Hub 2, 87–94. doi: 10.1159/000479490
Horvitz, E. , and Mulligan, D. (2015). Data, privacy, and the greater good. Science 349, 253–255. doi: 10.1126/science.aac4520
Howe, I. E. G. , and Elenberg, F. (2020). Ethical challenges posed by big data. Innov. Clin. Neurosci. 17, 24–30.
Hu, H. , Galea, S. , Rosella, L. , and Henry, D. (2017). Big data and population health focusing on the health impacts of the social, physical, and economic environment. Epidemiology 28, 759–762. doi: 10.1097/EDE.0000000000000711
Hunt, X. , Tomlinson, M. , Sikander, S. , Skeen, S. , Marlow, M. , du Toit, S., et al. (2020). Artificial intelligence, big data, and mHealth: the Frontiers of the prevention of violence against children. Front Artif Intell 3:543305. doi: 10.3389/frai.2020.543305
Ienca, M. , Jotterand, F. , Vica, C. , and Elger, B. (2016). Social and assistive robotics in dementia care: ethical recommendations for research and practice. Int. J. Soc. Robot. 8, 565–573. doi: 10.1007/s12369-016-0366-7
Ienca, M. , Vayena, E. , and Blasimme, A. (2018). Big data and dementia: charting the route ahead for research, ethics, and policy. Front. Med. 5:13. doi: 10.3389/fmed.2018.00013
Igual, R. , Medrano, C. , and Plaza, I. (2013). Challenges, issues and trends in fall detection systems. Biomed. Eng. Online 12:66. doi: 10.1186/1475-925X-12-66
Jalal, S. , Parker, W. , Ferguson, D. , and Nicolaou, S. (2020). Exploring the role of artificial intelligence in an emergency and trauma radiology department. Can. Assoc. Radiol. J. 72:846537120918338. doi: 10.1177/0846537120918338
Jiang, H. , and Cheng, L. (2021). Public perception and reception of robotic applications in public health emergencies based on a questionnaire survey conducted during COVID-19. Int. J. Environ. Res. Public Health 18, 1–20. doi: 10.3390/ijerph182010908
Joda, T. , Waltimo, T. , Pauli-Magnus, C. , Probst-Hensch, N. , and Zitzmann, N. U. (2018). Population-based linkage of big data in dental research. Int. J. Environ. Res. Public Health 15, 1–5. doi: 10.3390/ijerph15112357
Johnson, W. G. (2020). Using precision public health to manage climate change: opportunities, challenges, and health justice. J. Law Med. Ethics 48, 681–693. doi: 10.1177/1073110520979374
Jones, M. , DeRuyter, F. , and Morris, J. (2020). The digital health revolution and people with disabilities: perspective from the United States. Int. J. Environ. Res. Public Health 17:381. doi: 10.3390/ijerph17020381
Kasperbauer, T. J. (2021). Conflicting roles for humans in learning health systems and AI-enabled healthcare. J. Eval. Clin. Pract. 27, 537–542. doi: 10.1111/jep.13510
Katapally, T. R. (2020). A global digital citizen science policy to tackle pandemics like COVID-19. J. Med. Internet Res. 22:e19357. doi: 10.2196/19357
Kayaalp, M. (2018). Patient privacy in the era of big data. Balkan Med. J. 35, 8–17. doi: 10.4274/balkanmedj.2017.0966
Kee, F. , and Taylor-Robinson, D. (2020). Scientific challenges for precision public health. J. Epidemiol. Community Health 74, 311–314. doi: 10.1136/jech-2019-213311
Kelly, J. T. , Campbell, K. L. , Gong, E. , and Scuffham, P. (2020). The internet of things: impact and implications for health care delivery. J. Med. Internet Res. 22:e20135. doi: 10.2196/20135
Kenney, M. , and Mamo, L. (2019). The imaginary of precision public health. Med. Humanit. 46, 192–203. doi: 10.1136/medhum-2018-011597
Kern, H. P. , Reagin, M. J. , and Reese, B. S. (2016). Priming the pump for big data at Sentara healthcare. Front. Health Serv. Manag. 32, 15–26. doi: 10.1097/01974520-201604000-00003
Kernaghan, K. (2014). The rights and wrongs of robotics: ethics and robots in public organizations. Can. Public Adm. 57, 485–506. doi: 10.1111/capa.12093
Kerr, D. , Axelrod, C. , Hoppe, C. , and Klonoff, D. C. (2018). Diabetes and technology in 2030: a utopian or dystopian future? Diabet. Med. 35, 498–503. doi: 10.1111/dme.13586
Kim, S. J. , Marsch, L. A. , Hancock, J. T. , and Das, A. K. (2017). Scaling up research on drug abuse and addiction through social media big data. J. Med. Internet Res. 19:e353. doi: 10.2196/jmir.6426
Kindig, D. , and Stoddart, G. (2003). What is population health? Am. J. Public Health 93, 380–383. doi: 10.2105/ajph.93.3.380
Kirtley, O. J. , and O’Connor, R. C. (2020). Suicide prevention is everyone’s business: challenges and opportunities for Google. Soc. Sci. Med. 262:112691. doi: 10.1016/j.socscimed.2019.112691
Kostkova, P. , Brewer, H. , de Lusignan, S. , Fottrell, E. , Goldacre, B. , Hart, G., et al. (2016). Who owns the data? Open data for healthcare. Front. Public Health 4:7. doi: 10.3389/fpubh.2016.00007
Ladner, J. , and Ben Abdelaziz, A. (2018). Public health issues in the 21st century: national challenges and shared challenges for the Maghreb countries. Tunis. Med. 96, 847–857.
Lajonchere, J.-P. (2018). Role of big data in evolution of the medical practice. Bull. Acad. Natl Med. 202, 225–238. doi: 10.1016/S0001-4079(19)30353-X
Lanier, P. , Rodriguez, M. , Verbiest, S. , Bryant, K. , Guan, T. , and Zolotor, A. (2020). Preventing infant maltreatment with predictive analytics: applying ethical principles to evidence-based child welfare policy. J. Fam. Violence 35, 1–13. doi: 10.1007/s10896-019-00074-y
Larkin, A. , and Hystad, P. (2017). Towards personal exposures: how technology is changing air pollution and Health Research. Curr Environ Health Rep 4, 463–471. doi: 10.1007/s40572-017-0163-y
Lee, E. C. , Asher, J. M. , Goldlust, S. , Kraemer, J. D. , Lawson, A. B. , and Bansal, S. (2016). Mind the scales: harnessing spatial big data for infectious disease surveillance and inference. J. Infect. Dis. 214, S409–S413. doi: 10.1093/infdis/jiw344
Lee, C. H. , and Yoon, H.-J. (2017). Medical big data: promise and challenges. Kidney Res Clin Pract 36, 3–11. doi: 10.23876/j.krcp.2017.36.1.3
Leonelli, S. (2019). Data-centric biology: a philosophical study. Chicago: University of Chicago Press.
Levac, D. , Colquhoun, H. , and O’Brien, K. K. (2010). Scoping studies: advancing the methodology. Implement. Sci. 5:69. doi: 10.1186/1748-5908-5-69
Leyens, L. , Reumann, M. , Malats, N. , and Brand, A. (2017). Use of big data for drug development and for public and personal health and care. Genet. Epidemiol. 41, 51–60. doi: 10.1002/gepi.22012
Li, X. , and Cong, Y. (2021). A systematic literature review of ethical challenges related to medical and public health data sharing in China. J. Empir. Res. Hum. Res. Ethics 16, 537–554. doi: 10.1177/15562646211040299
Liu, T. Y. A. , and Bressler, N. M. (2020). Controversies in artificial intelligence. Curr. Opin. Ophthalmol. 31, 324–328. doi: 10.1097/ICU.0000000000000694
Liu, C. , and Graham, R. (2021). Making sense of algorithms: relational perception of contact tracing and risk assessment during COVID-19. Big Data Soc. 8, 1–13. doi: 10.1177/2053951721995218
Liyanage, H. , de Lusignan, S. , Liaw, S.-T. , Kuziemsky, C. E. , Mold, F. , Krause, P., et al. (2014). Big data usage patterns in the health care domain: a use case driven approach applied to the assessment of vaccination benefits and risks. Contribution of the IMIA primary healthcare working group. Yearb. Med. Inform. 9, 27–35. doi: 10.15265/IY-2014-0016
Lodders, A. , and Paterson, J. (2020). Scrutinising COVIDSafe: frameworks for evaluating digital contact tracing technologies. Altern. Law J. 45, 153–161. doi: 10.1177/1037969X20948262
Luk, J. W. , Pruitt, L. D. , Smolenski, D. J. , Tucker, J. , Workman, D. E. , and Belsher, B. E. (2021). From everyday life predictions to suicide prevention: clinical and ethical considerations in suicide predictive analytic tools. J. Clin. Psychol. 78, 137–148. doi: 10.1002/jclp.23202
Lupton, D. , and Jutel, A. (2015). “It’s like having a physician in your pocket!” a critical analysis of self-diagnosis smartphone apps. Soc. Sci. Med. 133, 128–135. doi: 10.1016/j.socscimed.2015.04.004
Machluf, Y. , Tal, O. , Navon, A. , and Chaiter, Y. (2017). From population databases to research and informed health decisions and policy. Front. Public Health 5:230. doi: 10.3389/fpubh.2017.00230
Mahlmann, L. , Reumann, M. , Evangelatos, N. , and Brand, A. (2017). Big data for public health policy-making: policy empowerment. Public Health Genomics 20, 312–320. doi: 10.1159/000486587
Manrique de Lara, A. , and Pelaez-Ballestas, I. (2020). Big data and data processing in rheumatology: bioethical perspectives. Clin. Rheumatol. 39, 1007–1014. doi: 10.1007/s10067-020-04969-w
Manzeschke, A. , Assadi, G. , and Viehoever, W. (2016). The role of big data in ambient assisted living. Int. Rev. Inf. Ethics 24, 22–31. doi: 10.29173/irie149
Mbunge, E. (2020). Integrating emerging technologies into COVID-19 contact tracing: opportunities, challenges and pitfalls. Diabetes Metab. Syndr. 14, 1631–1636. doi: 10.1016/j.dsx.2020.08.029
Mentis, A.-F. A. , Pantelidi, K. , Dardiotis, E. , Hadjigeorgiou, G. M. , and Petinaki, E. (2018). Precision medicine and global health: the good, the bad, and the ugly. Front. Med. 5:67. doi: 10.3389/fmed.2018.00067
Mikal, J. , Hurst, S. , and Conway, M. (2016). Ethical issues in using twitter for population-level depression monitoring: a qualitative study. BMC Med. Ethics 17:22. doi: 10.1186/s12910-016-0105-5
Miller, L. F. (2020). Human rights of users of humanlike care automata. Hum. Rights Rev. 21, 181–205. doi: 10.1007/s12142-020-00581-2
Mittelstadt, B. (2019). Principles alone cannot guarantee ethical AI. Nat. Mach. Intell. 1, 501–507. doi: 10.1038/s42256-019-0114-4
Mohr, D. C. , Zhang, M. , and Schueller, S. M. (2017). Personal sensing: understanding mental health using ubiquitous sensors and machine learning. Annu. Rev. Clin. Psychol. 13, 23–47. doi: 10.1146/annurev-clinpsy-032816-044949
Montgomery, K. , Chester, J. , and Kopp, K. (2018). Health wearables ensuring fairness, preventing discrimination, and promoting equity in an emerging internet-of-things environment. J. Inf. Policy 8, 34–77. doi: 10.5325/jinfopoli.8.2018.0034
Mooney, S. J. , and Pejaver, V. (2018). Big data in public health: terminology, machine learning, and privacy. Annu. Rev. Public Health 39, 95–112. doi: 10.1146/annurev-publhealth-040617-014208
Mootz, J. J. , Evans, H. , Tocco, J. , Ramon, C. V. , Gordon, P. , Wainberg, M. L., et al. (2020). Acceptability of electronic healthcare predictive analytics for HIV prevention: a qualitative study with men who have sex with men in new York City. mHealth 6:11. doi: 10.21037/mhealth.2019.10.03
Morgenstern, J. D. , Rosella, L. C. , Daley, M. J. , Goel, V. , Schunemann, H. J. , and Piggott, T. (2021). “AI’S gonna have an impact on everything in society, so it has to have an impact on public health”: a fundamental qualitative descriptive study of the implications of artificial intelligence for public health. BMC Public Health 21:40. doi: 10.1186/s12889-020-10030-x
Morley, J. , Floridi, L. , Kinsey, L. , and Elhalal, A. (2020). From what to how: an initial review of publicly available AI ethics tools, methods and research to translate principles into practices. Sci. Eng. Ethics 26, 2141–2168. doi: 10.1007/s11948-019-00165-5
Morley, J. , Machado, C. C. V. , Burr, C. , Cowls, J. , Joshi, I. , and Taddeo, M. (1982). The ethics of AI in health care: a mapping review. Soc. Sci. Med. 260:113172. doi: 10.1016/j.socscimed.2020.113172
Moutel, G. , Grandazzi, G. , Duchange, N. , and Darquy, S. (2018). The digital pill, between beneficence and vigilance: ethical stakes.). Le médicament connecté, entre bienveillance et surveillance. Med. Sci. 34, 717–722. doi: 10.1051/medsci/20183408019
Murphy, K. , Di Ruggiero, E. , Upshur, R. , Willison, D. J. , Malhotra, N. , Cai, J. C., et al. (2021). Artificial intelligence for good health: a scoping review of the ethics literature. BMC Med. Ethics 22:14. doi: 10.1186/s12910-021-00577-8
Nageshwaran, G. , Harris, R. C. , and Guerche-Seblain, C. E. (2021). Review of the role of big data and digital technologies in controlling COVID-19 in Asia: public health interest vs. privacy. Digit. Health 7:20552076211002953. doi: 10.1177/20552076211002953
Nakada, H. , Inoue, Y. , Yamamoto, K. , Matsui, K. , Ikka, T. , and Tashiro, S. (2020). Public attitudes toward the secondary uses of patient Records for Pharmaceutical Companies’ activities in Japan. Ther. Innov. Regul. Sci. 54, 701–708. doi: 10.1007/s43441-019-00105-2
Naudé, W. (2020). Artificial intelligence vs COVID-19: limitations, constraints and pitfalls. AI Soc. 35, 761–765. doi: 10.1007/s00146-020-00978-0
Nebeker, C. , Torous, J. , and Bartlett Ellis, R. J. (2019). Building the case for actionable ethics in digital health research supported by artificial intelligence. BMC Med. 17:137. doi: 10.1186/s12916-019-1377-7
Ngan, O. M. Y. , and Kelmenson, A. M. (2021). Using big data tools to analyze digital footprint in the COVID-19 pandemic: some public health ethics considerations. Asia Pac. J. Public Health 33, 129–130. doi: 10.1177/1010539520984360
Nichol, A. A. , Bendavid, E. , Mutenherwa, F. , Patel, C. , and Cho, M. K. (2021). Diverse experts’ perspectives on ethical issues of using machine learning to predict HIV/AIDS risk in sub-Saharan Africa: a modified Delphi study. BMJ Open 11:e052287. doi: 10.1136/bmjopen-2021-052287
Noorbakhsh-Sabet, N. , Zand, R. , Zhang, Y. , and Abedi, V. (2019). Artificial intelligence transforms the future of health care. Am. J. Med. 132, 795–801. doi: 10.1016/j.amjmed.2019.01.017
O’Doherty, K. C. , Christofides, E. , Yen, J. , Bentzen, H. B. , Burke, W. , Hallowell, N., et al. (2016). If you build it, they will come: unintended future uses of organised health data collections. BMC Med. Ethics 17:54. doi: 10.1186/s12910-016-0137-x
Oravec, J. A. (2019). Artificial intelligence, automation, and social welfare: some ethical and historical perspectives on technological overstatement and hyperbole. Ethics Soc. Welfare 13, 18–32. doi: 10.1080/17496535.2018.1512142
Ossorio, P. N. (2014). The ethics of translating high-throughput science into clinical practice. Hast. Cent. Rep. 44, 8–9. doi: 10.1002/hast.351
Pacia, D. M. , Ravitsky, V. , Hansen, J. N. , Lundberg, E. , Schulz, W. , and Bélisle-Pipon, J.-C. (2024). Early AI lifecycle co-reasoning: ethics through integrated and diverse team science. Am. J. Bioeth. 24, 86–88. doi: 10.1080/15265161.2024.2377106
Pagliari, C. (2021). Digital health and primary care: past, pandemic and prospects. J. Glob. Health 11:01005. doi: 10.7189/jogh.11.01005
Park, J. (2021). Governing a pandemic with data on the contactless path to AI: personal data, public health, and the digital divide in South Korea, Europe and the United States in tracking of COVID-19. Partecip E Conflitto 14:79. doi: 10.1285/i20356609v14i1p79
Pearl, J. , and Mackenzie, D. (2018). The book of why: The new science of cause and effect. New York: Basic books.
Pepin, J.-L. , Bailly, S. , and Tamisier, R. (2020). Big data in sleep apnoea: opportunities and challenges. Respirology 25, 486–494. doi: 10.1111/resp.13669
Peters, S. G. , and Buntrock, J. D. (2014). Big data and the electronic health record. J. Ambul. Care Manage. 37, 206–210. doi: 10.1097/JAC.0000000000000037
Pickering, B. (2021). Trust, but verify: informed consent, AI technologies, and public health emergencies. Future Internet 13, 1–20. doi: 10.3390/fi13050132
Prainsack, B. (2020). The value of healthcare data: to nudge, or not? Policy Stud. 41, 547–562. doi: 10.1080/01442872.2020.1723517
Prosperi, M. , Min, J. S. , Bian, J. , and Modave, F. (2018). Big data hurdles in precision medicine and precision public health. BMC Med. Inform. Decis. Mak. 18:139. doi: 10.1186/s12911-018-0719-2
Rajam, N. (2020). Policy strategies for personalising medicine “in the data moment”. Health Policy Technol. 9, 379–383. doi: 10.1016/j.hlpt.2020.07.003
Raza, S. , and Luheshi, L. (2016). Big data or bust: realizing the microbial genomics revolution. Microb. Genomics 2:000046. doi: 10.1099/mgen.0.000046
Rehman, A. , Naz, S. , and Razzak, I. (2022). Leveraging big data analytics in healthcare enhancement: trends, challenges and opportunities. Multimedia Systems 28, 1339–1371. doi: 10.1007/s00530-020-00736-8
Rennie, S. , Buchbinder, M. , Juengst, E. , Brinkley-Rubinstein, L. , Blue, C. , and Rosen, D. L. (2020). Scraping the web for public health gains: ethical considerations from a “big data” research project on HIV and incarceration. Public Health Ethics 13, 111–121. doi: 10.1093/phe/phaa006
Roberts, S. L. (2019). Big data, algorithmic governmentality and the regulation of pandemic risk. Eur. J. Risk Regul. 10, 94–115. doi: 10.1017/err.2019.6
Rosen, D. L. , Buchbinder, M. , Juengst, E. , and Rennie, S. (2020). Public Health Research, practice, and ethics for justice-involved persons in the big data era. Am. J. Public Health 110, S37–S38. doi: 10.2105/AJPH.2019.305456
Salas-Vega, S. , Haimann, A. , and Mossialos, E. (2015). Big data and health care: challenges and opportunities for coordinated policy development in the EU. Health Syst Reform 1, 285–300. doi: 10.1080/23288604.2015.1091538
Salerno, J. , Knoppers, B. M. , Lee, L. M. , Hlaing, W. M. , and Goodman, K. W. (2017). Ethics, big data and computing in epidemiology and public health. Ann. Epidemiol. 27, 297–301. doi: 10.1016/j.annepidem.2017.05.002
Samerski, S. (2018). Individuals on alert: digital epidemiology and the individualization of surveillance. Life Sci Soc Policy 14:13. doi: 10.1186/s40504-018-0076-z
Samuel, G. , and Derrick, G. (2020). Defining ethical standards for the application of digital tools to population health research. Bull. World Health Organ. 98, 239–244. doi: 10.2471/BLT.19.237370
Sanchez, C. , and Sarria-Santamera, A. (2019). Unlocking data: where is the key? Bioethics. 33, 367–376. doi: 10.1111/bioe.12565
Sarbadhikari, S. N. , and Pradhan, K. B. (2020). The need for developing technology-enabled, safe, and ethical workforce for healthcare delivery. Saf. Health Work 11, 533–536. doi: 10.1016/j.shaw.2020.08.003
Satava, R. M. (2002). Laparoscopic surgery, robots, and surgical simulation: moral and ethical issues. Semin. Laparosc. Surg. 9, 230–238. doi: 10.1177/155335060200900408
Satava, R. M. (2003). Biomedical, ethical, and moral issues being forced by advanced medical technologies. Proc. Am. Philos. Soc. 147, 246–258.
Schwalbe, N. , and Wahl, B. (2020). Artificial intelligence and the future of global health. Lancet 395, 1579–1586. doi: 10.1016/S0140-6736(20)30226-9
Shachar, C. , Gerke, S. , and Adashi, E. Y. (2020). AI surveillance during pandemics: ethical implementation imperatives. Hast. Cent. Rep. 50, 18–21. doi: 10.1002/hast.1125
Shah, S. , and Khan, R. (2020). Secondary use of electronic health record: opportunities and challenges. IEEE Access 8, 136947–136965. doi: 10.1109/ACCESS.2020.3011099
Shahid, A. , Nguyen, T.-A. N. , and Kechadi, M.-T. (2021). Big data warehouse for healthcare-sensitive data applications. Sensors 21, 1–28. doi: 10.3390/s21072353
Shen, T. , and Wang, C. (2021). Big data technology applications and the right to health in China during the COVID-19 pandemic. Int. J. Environ. Res. Public Health 18, 1–15. doi: 10.3390/ijerph18147325
Shipton, L. , and Vitale, L. (2024). Artificial intelligence and the politics of avoidance in global health. Soc. Sci. Med. 359:117274. doi: 10.1016/j.socscimed.2024.117274
Snell, K. (2019). Health as the moral principle of post-genomic society: data-driven arguments against privacy and autonomy. Camb. Q. Healthc. Ethics 28, 201–214. doi: 10.1017/S0963180119000057
Sparrow, R. , and Hatherley, J. (2019). The promise and perils of Al in medicine. Int. J. Chin. Comp. Philos. Med. 17, 79–109. doi: 10.48550/arXiv.2505.06971
Stiglic, G. , Kocbek, P. , Fijacko, N. , Zitnik, M. , Verbert, K. , and Cilar, L. (2020). Interpretability of machine learning-based prediction models in healthcare. WIREs Data Min. Knowl. Discov. 10:e1379. doi: 10.1002/widm.1379
Strang, K. (2020). Problems with research methods in medical device big data analytics. Int. J. Data Sci. Anal. 9, 229–240. doi: 10.1007/s41060-019-00176-2
Straw, I. (2021). Ethical implications of emotion mining in medicine. Health Policy Technol. 10, 191–195. doi: 10.1016/j.hlpt.2020.11.006
Stylianou, A. , and Talias, M. A. (2017). Big data in healthcare: a discussion on the big challenges. Health Technol. 7, 97–107. doi: 10.1007/s12553-016-0152-4
Sun, Z. , Strang, K. D. , and Pambel, F. (2020). Privacy and security in the big data paradigm. J. Comput. Inf. Syst. 60, 146–155. doi: 10.1080/08874417.2017.1418631
Tan, M. , Hatef, E. , Taghipour, D. , Vyas, K. , Kharrazi, H. , Gottlieb, L., et al. (2020). Including social and behavioral determinants in predictive models: trends, challenges, and opportunities. JMIR Med. Inform. 8:e18084. doi: 10.2196/18084
Tang, A. , Tam, R. , Cadrin-Chenevert, A., et al. (2018). Canadian Association of Radiologists White Paper on artificial intelligence in radiology. J Assoc Can Radiol 69, 120–135. doi: 10.1016/j.carj.2018.02.002
Tanti, M. (2015) Exploitation of “big data”: the experience feedback of the French military health service on sanitary data. 2015 6th International Conference on Information Systems and Economic Intelligence, Hammamet, Tunisia, 1–4. doi: 10.1109/ISEI.2015.7358716
Terrasse, M. , Gorin, M. , and Sisti, D. (2019). Social media, e-health, and medical ethics. Hast. Cent. Rep. 49, 24–33. doi: 10.1002/hast.975
Terry, N. (2014). Health privacy is difficult but not impossible in a post-HIPAA data-driven world. Chest 146, 835–840. doi: 10.1378/chest.13-2909
Thomasian, N. M. , Eickhoff, C. , and Adashi, E. Y. (2021). Advancing health equity with artificial intelligence. J. Public Health Policy 42, 602–611. doi: 10.1057/s41271-021-00319-5
Thorpe, J. H. , and Gray, E. A. (2015a). Big data and ambulatory care breaking down legal barriers to support effective use. J Ambulatory Care Manage 38, 29–38. doi: 10.1097/JAC.0000000000000059
Thorpe, J. H. , and Gray, E. A. (2015b). Big data and public health: navigating privacy laws to maximize potential. Public Health Rep. 130, 171–175. doi: 10.1177/003335491513000211
Tigard, D. (2019). Changing the mindset for precision medicine: from incentivized biobanking models to genomic data. Genet. Res. 101:e10. doi: 10.1017/S0016672319000077
Timmins, K. A. , Green, M. A. , Radley, D. , Morris, M. A. , and Pearce, J. (2018). How has big data contributed to obesity research? A review of the literature. Int. J. Obes. 42, 1951–1962. doi: 10.1038/s41366-018-0153-7
Torous, J. , and Haim, A. (2018). Dichotomies in the development and implementation of digital mental health tools. Psychiatr. Serv. 69, 1204–1206. doi: 10.1176/appi.ps.201800193
Trein, P. , and Wagner, J. (2021). Governing personalized health: a scoping review. Front. Genet. 12:650504. doi: 10.3389/fgene.2021.650504
Tricco, A. C. , Lillie, E. , Zarin, W. , O'Brien, K. K. , Colquhoun, H. , Levac, D., et al. (2018). PRISMA extension for scoping reviews (PRISMA-ScR): checklist and explanation. Ann. Intern. Med. 169, 467–473. doi: 10.7326/M18-0850
Tsai, F.-J. , and Junod, V. (2018). Medical research using governments’ health claims databases: with or without patients’ consent? J. Public Health 40, 871–877. doi: 10.1093/pubmed/fdy034
Tupasela, A. , Snell, K. , and Tarkkala, H. (2020). The Nordic data imaginary. Big Data Soc. 7:2053951720907107, 1–13. doi: 10.1177/2053951720907107
van Deursen, A. J. A. M. , and Mossberger, K. (2018). Any thing for anyone? A new digital divide in internet-of-things skills. Policy Internet 10, 122–140. doi: 10.1002/poi3.171
van Heerden, A. , Wassenaar, D. , Essack, Z. , Vilakazi, K. , and Kohrt, B. A. (2020). In-home passive sensor data collection and its implications for social media research: perspectives of community women in rural South Africa. J. Empir. Res. Hum. Res. Ethics 15, 97–107. doi: 10.1177/1556264619881334
Vayena, E. , Blasimme, A. , and Cohen, I. G. (2018). Machine learning in medicine: addressing ethical challenges. PLoS Med. 15:e1002689. doi: 10.1371/journal.pmed.1002689
Vayena, E. , Salathe, M. , Madoff, L. C. , and Brownstein, J. S. (2015). Ethical challenges of big data in public health. PLoS Comput Biol 11:e1003904. doi: 10.1371/journal.pcbi.1003904
Veiga, J. J. D. , and Ward, T. E. (2016). Data collection requirements for Mobile connected health: an end user development approach. Mobile16 Proc 1st Int Workshop Mob Dev, 23–30. doi: 10.1145/3001854.3001856
Villongco, C. , and Khan, F. (2020). “Sorry I didn’t hear you.” the ethics of voice computing and AI in high risk mental health populations. AJOB Neurosci. 11, 105–112. doi: 10.1080/21507740.2020.1740355
Vogel, C. , Zwolinsky, S. , Griffiths, C. , Hobbs, M. , Henderson, E. , and Wilkins, E. (2019). A Delphi study to build consensus on the definition and use of big data in obesity research. Int. J. Obes. 43, 2573–2586. doi: 10.1038/s41366-018-0313-9
Vollmer Dahlke, D. , and Ory, M. G. (2020). Emerging issues of intelligent assistive technology use among people with dementia and their caregivers: a U.S. perspective. Front. Public Health 8:191. doi: 10.3389/fpubh.2020.00191
Wang, L. , and Alexander, C. A. (2020). Big data analytics in medical engineering and healthcare: methods, advances and challenges. J. Med. Eng. Technol. 44, 267–283. doi: 10.1080/03091902.2020.1769758
Wang, Q. , Su, M. , Zhang, M. , and Li, R. (2021). Integrating digital technologies and public health to fight Covid-19 pandemic: key technologies, applications, challenges and outlook of digital healthcare. Int. J. Environ. Res. Public Health 18, 1–50. doi: 10.3390/ijerph18116053
Wongkoblap, A. , Vadillo, M. A. , and Curcin, V. (2017). Researching mental health disorders in the era of social media: systematic review. J. Med. Internet Res. 19:e228. doi: 10.2196/jmir.7215
World Health Organization (Ed.) (2021). Ethics and governance of artificial intelligence for health: WHO guidance. 1st Edn. Geneva: World Health Organization.
Wyllie, D. , and Davies, J. (2015). Role of data warehousing in healthcare epidemiology. J. Hosp. Infect. 89, 267–270. doi: 10.1016/j.jhin.2015.01.005
Xafis, V. , Schaefer, G. , Labude, M. , Schaefer, G. O. , Labude, M. K. , Brassington, I., et al. (2019). An ethics framework for big data in health and research. Asian Bioeth. Rev. 11, 227–254. doi: 10.1007/s41649-019-00099-x
Xie, G. , Chen, T. , Li, Y. , Chen, T. , Li, X. , and Liu, Z. (2020). Artificial intelligence in nephrology: how can artificial intelligence augment nephrologists’ intelligence? Kidney Dis 6, 1–6. doi: 10.1159/000504600
Xing, F. , Peng, G. , Zhang, B. , Li, S. , and Liang, X. (2021). Socio-technical barriers affecting large-scale deployment of AI-enabled wearable medical devices among the ageing population in China. Technol. Forecast. Soc. Change 166, 1–11. doi: 10.1016/j.techfore.2021.120609
Yang, Y. T. , and Chen, B. (2018). Precision medicine and sharing medical data in real time: opportunities and barriers. Am. J. Manag. Care 24, 356–358.
Yeung, D. (2018). Social media as a catalyst for policy action and social change for health and well-being: viewpoint. J. Med. Internet Res. 20:e94. doi: 10.2196/jmir.8508
Young, S. D. (2018). Social media as a new vital sign: commentary. J. Med. Internet Res. 20:e161. doi: 10.2196/jmir.8563
Zhang, X. , Perez-Stable, E. J. , Bourne, P. E., et al. (2017). Big data science: opportunities and challenges to address minority health and health disparities in the 21st century. Ethn. Dis. 27, 95–106. doi: 10.18865/ed.27.2.95
Keywords: artificial intelligence, big data, ethics, population health, public health
Citation: Couture V, Roy M-C, Dez E, Tremblay F and Bélisle-Pipon J-C (2025) Ethical issues raised by artificial intelligence and big data in population health: a scoping review. Front. Sociol. 10:1536389. doi: 10.3389/fsoc.2025.1536389
Edited by:
Kira Allmann, College of William and Mary, United StatesReviewed by:
Xiaoya Xu, Guangdong University of Finance and Economics, ChinaAbdallah Al-Ani, King Hussein Cancer Center, Jordan
Copyright © 2025 Couture, Roy, Dez, Tremblay and Bélisle-Pipon. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Vincent Couture, VmluY2VudC5jb3V0dXJlQHVtb250cmVhbC5jYQ==
†ORCID: Vincent Couture, https://orcid.org/0000-0002-8811-0524
Marie-Christine Roy, https://orcid.org/0000-0002-1803-4079
Emma Dez, https://orcid.org/0000-0002-2496-2920
Jean-Christophe Bélisle-Pipon, https://orcid.org/0000-0002-8965-8153
 Emma Dez3†
Emma Dez3† 
   
   
   
  