Adapting data science competencies by role and purpose: Voice AI

Dorr, David A.; Krussel, Andrea; Hauck, Rachel; Jackson, Christie; Dalal, Abhijeet; Bedrick, Steven; Payne, Philip R. O.; , Bridge2AI-Voice Consortium; Hersh, William; Bensoussan, Yael; Elemento, Olivier; Rameau, Anais; Sigaras, Alexandros; Ghosh, Satrajit; Powell, Maria; Ravitsky, Vardit; Christophe Belisle-Pipon, Jean; Dorr, David; Payne, Phillip; Johnson, Alistair; Bahr, Ruth; Bolser, Donald; Rudzicz, Frank; Lerner-Ellis, Jordan; Jenkins, Kathy; Awan, Shaheen; Boyer, Micah; Hersh, William; Krussel, Andrea; Bedrick, Steven; Ahmed Syed, Toufeeq; Toghranegar, Jamie; Anibal, James; Sutherland, Duncan; Diaz-Ocampo, Enrique; Costello, John; Gelbard, Alexander; Vinson, Kimberly; Neal, Tempestt; Jayachandran, Lochana; Ng, Evan; Casalino, Selina; Abdel-Aty, Yassmeen; Hanna, Karim; Zesiewicz, Theresa; Moothedan, Elijah; Evangelista, Emily; Salvi Cruz, Samantha; Zhao, Robin; Ebraheem, Mohamed; Newberry, Karlee; De Santiago, Iris; Eiseman, Ellie; Rahman, JM; Jo, Stacy; Goldenberg, Anna

doi:10.3389/fdgth.2025.1610253

CURRICULUM, INSTRUCTION, AND PEDAGOGY article

Front. Digit. Health, 21 October 2025

Sec. Connected Health

Volume 7 - 2025 | https://doi.org/10.3389/fdgth.2025.1610253

This article is part of the Research TopicAdvancing Vocal Biomarkers and Voice AI in Healthcare: Multidisciplinary Focus on Responsible and Effective Development and UseView all 12 articles

Adapting data science competencies by role and purpose: Voice AI

David A. Dorr^1*^†

Andrea Krussel^2*^†

Rachel Hauck²

Christie Jackson¹

Abhijeet Dalal¹

Steven Bedrick¹

Philip R. O. Payne²Bridge2AI-Voice Consortium and

William Hersh¹

¹Division of Informatics, Clinical Epidemiology, and Translational Data Science, Department of Medicine, Oregon Health & Science University, Portland, OR, United States
²Institute for Informatics, Data Science and Biostatistics, Washington University School of Medicine in St. Louis, St. Louis, MO, United States

Competencies help define the skills and knowledge needed by learners. Often broad, educators integrate competencies to provide a framework for curricula or professional standards. For data science, the rate of change in the field, role variations, and specificity in key applications can be challenging. Our objective was to adapt general data science competencies for different learner roles in an emerging area: the clinical utility of Voice, Language, and Speech-based Artificial Intelligence/Machine Learning (AI/ML). Using a persona-inductive approach, we adapted competencies to support learners from varying professional and educational backgrounds and implemented these adaptations in a multi-institutional summer school. Results from these pilot efforts demonstrated feasibility, highlighted the importance of cross-role collaboration, and provided lessons for scaling to broader audiences. Our frameworks show that competency adaptation is necessary and practical in rapidly evolving AI domains.

1 Introduction

The rapid pace of data science and artificial intelligence (AI) development has outstripped the ability of education and training systems to keep up, especially in areas requiring the integration of multimodal data such as text, audio, images, and video (1, 2). Among these, voice and speech hold particular promise: advances in machine learning are enabling their use as digital biomarkers for detecting and monitoring a wide range of health conditions, from neurological and psychiatric disorders to cancers and cardiometabolic diseases (3, 4). However, realizing this potential requires not only technical expertise but also ethical oversight, clinical integration, and engagement from a broad set of stakeholders—including clinicians, researchers, administrators, policymakers, and patients (5).

These demands highlight a central challenge: traditional competency frameworks in biomedical informatics and AI education are often too rigid or slow to adapt to the speed of technological innovation. Emerging risks—including bias, reproducibility concerns, explainability, and safety—further underscore the need for new educational approaches that are flexible, inclusive, and ethically grounded (3).

This paper responds to these challenges by:

• Reviewing the current landscape of data science and AI competency frameworks, with particular attention to their application in healthcare and biomedical informatics.

• Identifying gaps in existing models when applied to rapidly evolving domains such as Voice AI, where technical advances and clinical applications are moving quickly, but educational frameworks lag.

• Our work within the Bridge2AI initiative, specifically within the Training, Recruitment, and Mentoring (TRM) group, provides a broader framework for cross-disciplinary, ethically grounded AI education.

• Presenting implications for future curriculum design, highlighting how adaptive, competency-based approaches can prepare learners to responsibly develop, implement, and use Voice AI tools in clinical and biomedical contexts and have the potential to provide a framework for other clinical AI domains.

2 Literature review

Meeting diverse learners' educational and professional needs is challenging, reflecting the field of AI's interdisciplinary and rapidly evolving nature. Competency development in Biomedical Informatics (BMI) is an excellent touch point; BMI is the “interdisciplinary field that studies and pursues the effective uses of biomedical data, information, and knowledge for scientific inquiry, problem-solving, and decision making, motivated by efforts to improve human health” (6). One of the primary objectives of informatics and data science is to develop technical proficiency among data science students, including foundational skills in machine learning (1). Clinicians must also know how to use and understand tools, including those with AI fields (7). Likewise, researchers must also be facile with data sources and tools for analyzing data in their modern work (8, 9).

To help prepare this diversity of learners for these needs, several groups have defined competencies in data science (10, 11) and AI for clinicians (12, 13). Aligning these competencies with learners' needs in application areas, like voice, is challenging due to the rapid development of new analysis methods and the strong desire to implement them in care. Goodman et al. (10) and Topol (14) emphasize readiness challenges for clinicians adopting AI tools. Our work builds upon these frameworks by focusing on personas, iterative adaptation, and curricular implementation. AI competency, in contrast, denotes practical proficiency in using, engaging, interacting, developing, or managing AI systems and specific tasks that are relevant in real-world contexts (15).

Historical approaches to align competency models have included consensus-based deductive methods from experts and educators in the field. The deductive approach requires substantial time, and new methods and requirements often outpace the updates to the frameworks. However, more pragmatic approaches to data science competencies have started to take an inductive approach focused on the broader competency elements targeted to specific areas. For instance, training in new models of AI/ML may include broad concepts like “Data exploration and inference generation,” with key principles supplemented with self-directed and interactive problem-solving. The benefit of the inductive approach is that it matches knowledge of adult learning standards and enables lifelong learning.

Rapid development in Machine Learning has been a constant, but new and highly complex models have opened a brand-new set of requirements for learners. This rapid development includes incorporating multimodal data, new coding approaches, and new application areas. Developing, testing, and implementing ML/AL models for the voice, speech, and language continuum is a near-perfect example of this triumvirate of new capabilities. Extraction of key information from voice and respiration has shown promise in multiple health conditions across otolaryngology, neurology, psychiatry, infectious disease, and cardiology. The techniques to process these models have evolved quickly, and the ability to change the voice into language can be near instantaneous, allowing for rapid, complex model development. Large Language Models—already transforming language-based analyses—show promise in concept and feature extraction across many data types, including voice and speech. One key aspect of these models is the strong demand for immediate implementation in clinical care, even when evidence for their safe and effective use is limited (3).

These new capabilities come with new risks and biases. While the issues of fairness, accuracy, verification, explainability, and safety have long been known, the behavior of the models has changed the manifestations and implications of these risks. For instance, the ability to emulate any voice or image makes verifiability challenging; the persistence of large models across federated spaces with active learning significantly impacts data retention; and the promiscuous incorporation of biased data in unpredictable models presents major ethical concerns.

As part of an effort to build an ethically sourced dataset for training models across the Voice AI continuum, the Voice as a Biomarker of Health (NIH) program, one of four data generation projects (DGPs) funded by the National Institute of Health's Bridge2AI program, has been launched. Voice, speech, and breathing sounds can reveal valuable information about a patient's health. With advances in AI, these audio signals are being studied as potential digital biomarkers to support early detection of conditions ranging from voice disorders and neurological diseases to head and neck cancers and diabetes. Clinicians across multiple fields—including otolaryngology, neurology, speech-language pathology, and internal medicine—bring complementary expertise critical to advancing this emerging area of research and practice (4).

3 Methods

The Bridge2AI initiative has emphasized the importance of curriculum innovation to close gaps in AI education. Its TRM working group developed a cross-disciplinary curriculum to address deficits in accessibility, reproducibility, integration with clinical practice, and stakeholder engagement. Their model prioritizes ethically sourced data, fosters collaboration across domains, and cultivates professional skills that support accountability and adaptability in biomedical and healthcare settings (16). The model provides a valid comparison point for our work on the Voice AI TRM team, which has been tasked with building competency-based education programs to teach multiple personas how to develop and implement standardized methods for voice data collection to fuel scientific discovery and ethical development of AI/ML models.

1. We used a persona-based inductive approach to adapt existing core data science competencies to the domain of Voice AI. First, we reviewed and synthesized multiple frameworks, including the EDISON Data Science Framework (17), AMIA informatics competencies (9), and recent AI competencies for clinicians (12, 13).

2. We then developed personas based on the NIH CD2H framework, focusing on two broad categories: (1) clinical learners (e.g., clinicians, speech-language pathologists, medical trainees) and (2) technical learners (e.g., informaticians, data scientists, engineers) illustrated in Table 1 below. Personas were refined through expert consensus with members of the NIH Bridge2AI-Voice consortium.

Table 1

Table 1. Personas.

3. Table 2 reveals how the Competencies were adapted iteratively in working groups, with feedback from interdisciplinary educators and domain experts. Adapted competencies included foundational knowledge (e.g., data lifecycle, AI model building) and contextual considerations (e.g., ethics, FAIR/CARE frameworks, societal implications).

4. We implemented the adapted competencies in a multi-site summer school program in 2024 across four institutions (OHSU, Washington University in St. Louis, Weill Cornell Medicine, and the University of South Florida) to test them. The program included didactic instruction, workshops, and a culminating hackathon event where interdisciplinary teams addressed challenges and integrated their AI competencies with clinical voice datasets. The evaluation included questions surrounding program experience, learner feedback, and feasibility across sites.

Table 2

Table 2. Core elements of competencies’ adaptations for AI.

We have illustrated the detailed methods below:

a. Core Data Science CompetenciesTo create the adaptation, we reviewed the data science competencies defined by several initiatives, including the Data Science Initiative, an NIH program that seeks to build health science capacity in Africa. Several authors of this work are investigators on this project and have been building competency-based data science programs in partnership with universities across Sub-Saharan Africa. The training program integrates three core interdisciplinary areas: Computer Science/Informatics, Statistics/Mathematics, and Domain-specific knowledge with diverse mentorship from experts across basic sciences to community-based research initiatives (18). We utilized additional adaptations from EDISON, which identified several key competencies, including Data Analytics, which encompasses statistical methods, machine learning, and business analytics, all of which are essential for extracting insights and making data-driven decisions; engineering competencies, including software development and infrastructure management; and competencies in scientific or research methods to ensure data-driven research meets high standards of validity and reliability (17). IBM Analytics was also utilized and identified the following competencies for their data science apprenticeship model: Statistics and programming foundation, data science foundation, data preparation, model building, model deployment, big data foundation, and leadership and professional development.The list below highlights the six high-level competencies identified from over 60 individual competencies across twelve domains. Cognizant of the strong pressure for use of these models, we leveraged a paper by Russell et al. (12) that focused on the end-users of AI development: clinicians. After conducting semi-structured interviews with 15 healthcare experts, six clinical competencies and 25 sub-competencies were identified. This focus—while practical—also enables easy adaptation. For instance, the “basic knowledge” of techniques may be deep and hands-on for data scientists, while clinicians may learn what to look for in descriptions of model development. Similarly, core foundational ethical considerations can be with specific adaptations for implementations and developers related to their professional roles.

• Basic knowledge of data science techniques, including AI

• Ethical considerations

• Data exploration and inference generation

• Evidence-based evaluation

• Implementation of tools

• Societal issues

b. Personas: Defining Key Learner Types and RolesWe used personas to adapt the competencies further and apply them to specific areas. Personas are representations of potential groups of learners with information about their general needs and goals. We based our personas on roles created for clinical and translational science through the National Center for Data to Health (CD2H). We focused on two categories and four roles: clinical Investigators, including experts and trainees, and technical roles, including informatics and data science experts and trainees. We then divided these initial four roles into Voice AI-specific groupings and identified vital needs and curricular products for these groups. For instance, clinician professionals such as speech-language pathologists may be the front line for collecting data for AI use, understanding the results, and describing the impacts of AI models. In contrast, clinician trainees need a broader sense of Voice AI applications and AI workflow. Technical learners need a more fundamental approach to ML/AI development and testing specific to Voice AI. They also require a background in the clinical problems and their current diagnosis, prognosis, and treatment to develop targeted, practical solutions. All groups require deep core ethical discussions and implications, carefully considering role-based needs.

c. Adapted CompetenciesOnce we had developed the personas, we engaged with experts and educators from the Voice AI collaborative to adapt each core competency to the needs of the learners. For each, we took the core elements of the competencies and iterated on the common needs of learners and the specific elements required for voice. For instance, basic knowledge includes the data science lifecycle and core model building, as well as how to learn about particular methods and features for the voice. In addition, guidance is given on how to review the potential applications and current evidence for the use of AI specific to voice. Ethical concerns have a foundation in key frameworks, such as the need to make data and models Findable, Accessible, Interoperable, and Reusable (FAIR), as well as the core concerns related to voice, especially for communities facing historical discrimination, such as Indigenous peoples. Voice AI can potentially extend the historical theft of voice and language; these considerations require frameworks like CARE—working with affected communities for Community benefit, granting Authority to control, defining Responsibility, and exploring the Ethical implications. The adaptations are intended to give deeper expertise in this area and empower learners through the inductive model by giving them the skills and framework to step through for other areas over time.

d. Curricular Design: Adaptation for topic: Voice AITable 3 demonstrates the curricula developed from the adapted competencies, including specific, available curricular resources (links, in black) in informatics and data science.

e. Cross-pollination of roles: team challenges

Table 3

Table 3. Curricular resources.

One key aspect of this approach is that learning must be cross-pollinated across roles; in essence, learning to work as a team so that the specific learned competencies of each group can complement each other. To this end, we developed team challenges to allow interdisciplinary teams to work together to improve their understanding of others' competencies. Table 4, below, highlights examples of challenges and critical competencies they address. Teams must communicate and problem-solve effectively; success is defined, in part, by recognizing the diverse experiences that each brings. Thus, the personas have key areas to explain, offering their expertise to ensure the team achieves the best possible outcomes. In exploring data science techniques for key clinical areas, clinicians can help define the need and guide the interpretation of results. At the same time, the technical personas can decide on the ensemble method, identify features related to the need, and explain the technical aspects of the results. Similarly, for ethical concerns about identifying current and former smokers through voice analysis, clinicians can help give examples of historical biases and risks to health. At the same time, the technical team can consider potential requirements for the accuracy and reliability of these models and their possible implementations.

Table 4

Table 4. Challenges and personas.

4 Results

To address the growing need to apply AI techniques to acoustic data (voice and sounds), we established a new training activity: an interdisciplinary (medical, nursing, engineering, and science) summer school program for undergraduate and graduate students from four different universities already funded for a Data Generation Project (DGP) “Voice as a Biomarker of Health” as part of the NIH Bridge2AI consortium. The training activity aimed to train students from diverse backgrounds to create computer programs and machine-learning models that use acoustic data for medical applications. The training activities involved identifying areas of unmet medical needs where voice or sounds may help create new solutions, acquiring, managing, and analyzing existing acoustic datasets, and building, testing, and validating AI models that leverage state-of-the-art AI methods (e.g., deep learning), creating user interfaces and if feasible, deploying the applications in a real-world setting.

Our application for supplemental funding was accepted, and as of this writing, the inaugural Voice AI summer school program launched in 2024 at Oregon Health and Science University (OHSU), Washington University in St. Louis, Weill Cornell Medicine, and the University of South Florida (USF). 50 graduate and undergraduate students (selected from a highly competitive and deep pool of interested applicants) participated across the four sites in a 5-week-long course culminating in a hackathon event. The curriculum included an online platform for individual learning, didactic in-person lectures, and workshops with case studies using voice datasets. In the hackathon event, interdisciplinary teams (clinical and informatics students) competed against each other to develop the best models to answer tangible clinical questions using voice data from the Voice DGP. Developed AI models have been made publicly available.

5 Discussion

Competency frameworks help develop curricula and define professional needs. Still, in data science and AI, the rate of change and shifts in paradigms make deductive approaches to competencies challenging. This manuscript describes a method for taking broad competency areas and refining them for specific new areas of analysis (Voice-Speech-Language). We then discuss how we developed specific curricular adaptations to help learners understand the specific concerns of Voice AI while still understanding the core framework needed to ethically develop, evaluate, and implement models for any particular challenge.

Limitations of this work include its focus on one domain (Voice AI) and short-term evaluation in a summer program. Longitudinal assessment of learner outcomes and expansion to additional modalities are the necessary next steps.

6 Conclusion

Adapting data science competencies to emerging domains such as Voice AI requires theoretical grounding and pragmatic curricular design. By combining inductive adaptation with role-based personas, we developed a competency framework that flexibly addresses the needs of learners from different backgrounds. Our pilot summer school demonstrated feasibility and revealed pathways for scaling to broader audiences, including clinicians, data scientists, and interdisciplinary teams. While limited to one application domain, this work provides a transferable model for adapting competencies in other rapidly evolving AI subfields. Future directions include multi-institutional testing, expansion to additional modalities, and long-term evaluation of educational outcomes.

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found here: PhysioNet Repository, Bridge2AI-Voice Dataset v1.1, doi: 10.13026/6rcx-na48, https://physionet.org/content/b2ai-voice/2.0.1/.

Author contributions

DD: Writing – review & editing, Writing – original draft. AK: Writing – original draft, Writing – review & editing. RH: Writing – review & editing. CJ: Writing – original draft. AD: Writing – review & editing, Writing – original draft. SB: Writing – review & editing, Writing – original draft. PP: Writing – review & editing, Writing – original draft. WH: Writing – review & editing, Writing – original draft.

Group member of Bridge2AI-Voice Consortium

University of South Florida, Tampa, FL, US: Yael Bensoussan. Weill Cornell Medicine, New York, NY, USA: Olivier Elemento. Weill Cornell Medicine, New York, NY, USA: Anais Rameau. Weill Cornell Medicine, New York, NY, USA: Alexandros Sigaras. Massachusetts Institute of Technology, Boston, MA, USA: Satrajit Ghosh. Vanderbilt University Medical Center, Nashville, TN, USA: Maria Powell. University of Montreal, Montreal, Quebec, Canada: Vardit Ravitsky. Simon Fraser University, Burnaby, BC, Canada: Jean Christophe Belisle-Pipon. Oregon Health & Science University, Portland, OR, USA: David Dorr. Washington University in St. Louis, St. Louis, MO, USA: Phillip Payne. University of Toronto, Toronto, Ontario, Canada: Alistair Johnson. University of South Florida, Tampa, FL, USA: Ruth Bahr. University of Florida, Gainesville, FL, USA: Donald Bolser. Dalhousie University, Toronto, ON, Canada: Frank Rudzicz. Mount Sinai Hospital, Sinai Health, University of Toronto, Toronto, ON, Canada: Jordan Lerner-Ellis. Boston Children's Hospital, Boston, MA, USA: Kathy Jenkins. University of Central Florida, Orlando, FL, USA: Shaheen Awan. University of South Florida, Tampa, FL, USA: Micah Boyer. Oregon Health & Science University, Portland, OR, USA: William Hersh. Washington University in St. Louis, St. Louis, MO, USA: Andrea Krussel. Oregon Health & Science University, Portland, OR, USA: Steven Bedrick. UT Health, Houston, TX, USA: Toufeeq Ahmed Syed. University of South Florida, Tampa, FL, USA: Jamie Toghranegar. University of South Florida, Tampa, FL, USA: James Anibal. New York, NY, USA: Duncan Sutherland. University of South Florida, Tampa, FL, USA: Enrique Diaz-Ocampo. University of South Florida, Tampa, FL, USA: Elizabeth Silberhoz Boston Children's Hospital, Boston, MA, USA: John Costello. Vanderbilt University Medical Center, Nashville, TN, USA: Alexander Gelbard. Vanderbilt University Medical Center, Nashville, TN, USA: Kimberly Vinson. University of South Florida, Tampa, FL, USA: Tempestt Neal. Mount Sinai Health, Toronto, ON, Canada: Lochana Jayachandran. The Hospital for Sick Children, Toronto, ON, Canada: Evan Ng. Mount Sinai Health, Toronto, ON, Canada: Selina Casalino. University of South Florida, Tampa, FL, USA: Yassmeen Abdel-Aty. University of South Florida, Tampa, FL, USA: Karim Hanna. University of South Florida, Tampa, FL, USA: Theresa Zesiewicz. Florida Atlantic University, Boca Raton, FL, USA: Elijah Moothedan. University of South Florida, Tampa, FL, USA: Emily Evangelista. Vanderbilt University Medical Center, Nashville, TN, USA: Samantha Salvi Cruz. Weill Cornell Medicine, New York, NY, USA: Robin Zhao. University of South Florida, Tampa, FL, USA: Mohamed Ebraheem. University of South Florida, Tampa, FL, USA: Karlee Newberry. University of South Florida, Tampa, FL, USA: Iris De Santiago. University of South Florida, Tampa, FL, USA: Ellie Eiseman. University of South Florida, Tampa, FL, USA: JM Rahman. Boston Children's Hospital, Boston, MA, USA: Stacy Jo. Hospital for Sick Children, Toronto, ON, Canada: Anna Goldenberg.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This study was funded in part by the NIH Common Fund through the Bridge2AI program, award OT2OD032720.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Dhar V. Data science and prediction. Commun ACM. (2013) 56(12):64–73. doi: 10.1145/2500499

Crossref Full Text | Google Scholar

2. Meyer MA. Healthcare data scientist qualifications, skills, and job focus: a content analysis of job postings. J Am Med Inform Assoc. (2019) 26(5):383–91. doi: 10.1093/jamia/ocy181

PubMed Abstract | Crossref Full Text | Google Scholar

3. Idrisoglu A, Dallora AL, Anderberg P, Berglund JS. Applied machine learning techniques to diagnose voice-affecting conditions and disorders: systematic literature review. J Med Internet Res. (2023) 25:e46105. doi: 10.2196/46105

PubMed Abstract | Crossref Full Text | Google Scholar

4. Bensoussan Y, Elemento O, Rameau A. Voice as an AI biomarker of health—introducing audiomics. JAMA Otolaryngology–Head & Neck Surgery. (2024) 150(4):283–4. doi: 10.1001/jamaoto.2023.4807

Crossref Full Text | Google Scholar

5. Hersh W. Competencies and Curricula Across the Spectrum of Learners for Biomedical and Health Informatics. IOS Press (2022).

Google Scholar

6. Kulikowski CA, Shortliffe EH, Currie LM, Elkin PL, Hunter LE, Johnson TR, et al. AMIA Board white paper: definition of biomedical informatics and specification of core competencies for graduate education in the discipline. J Am Med Inform Assoc. (2012) 19(6):931–8. doi: 10.1136/amiajnl-2012-001053

PubMed Abstract | Crossref Full Text | Google Scholar

7. Hersh WR, Gorman PN, Biagioli FE, Mohan V, Gold JA, Mejicano GC. Beyond information retrieval and electronic health record use: competencies in clinical informatics for medical education. Adv Med Educ Pract. (2014) 5:205–12. doi: 10.2147/AMEP.S63903

PubMed Abstract | Crossref Full Text | Google Scholar

8. Moore JH, Boland MR, Camara PG, Chervitz H, Gonzalez G, Himes BE, et al. Preparing next-generation scientists for biomedical big data: artificial intelligence approaches. Per Med. (2019) 16(3):247–57. doi: 10.2217/pme-2018-0145

PubMed Abstract | Crossref Full Text | Google Scholar

9. Valenta AL, Berner ES, Boren SA, Deckard GJ, Eldredge C, Fridsma DB, et al. AMIA Board white paper: AMIA 2017 core competencies for applied health informatics education at the master’s degree level. J Am Med Inform Assoc. (2018) 25(12):1657–68. doi: 10.1093/jamia/ocy132

PubMed Abstract | Crossref Full Text | Google Scholar

10. Goodman KE, Rodman AM, Morgan DJ. Preparing physicians for the clinical algorithm era. N Engl J Med. (2023) 389(6):483–7. doi: 10.1056/NEJMp2304839

PubMed Abstract | Crossref Full Text | Google Scholar

11. Seth P, Hueppchen N, Miller SD, Rudzicz F, Ding J, Parakh K, et al. Data science as a core competency in undergraduate medical education in the age of artificial intelligence in health care. JMIR Med Educ. (2023) 9:e46344. doi: 10.2196/46344

PubMed Abstract | Crossref Full Text | Google Scholar

12. Russell RG, Lovett Novak L, Patel M, Garvey KV, Craig KJT, Jackson GP, et al. Competencies for the use of artificial intelligence–based tools by health care professionals. Acad Med. (2023) 98(3):348–56. doi: 10.1097/ACM.0000000000004963

PubMed Abstract | Crossref Full Text | Google Scholar

13. Liaw W, Kueper JK, Lin S, Bazemore A, Kakadiaris I. Competencies for the use of artificial intelligence in primary care. Ann Fam Med. (2022) 20(6):559–63. doi: 10.1370/afm.2887

PubMed Abstract | Crossref Full Text | Google Scholar

14. Topol EJ. As artificial intelligence goes multimodal, medical applications multiply. Science. (2023) 381(6663):eadk6139. doi: 10.1126/science.adk6139

PubMed Abstract | Crossref Full Text | Google Scholar

15. Chiu TKF. AI Literacy and competency: definitions, frameworks, development and future research directions. Interactive Learning Environments. (2025) 33(5):3225–9. doi: 10.1080/10494820.2025.2514372

Crossref Full Text | Google Scholar

16. Rincon J, Pelletier AR, Gilliland D, Wang W, Wang D, Sankar B, et al. Bridge2AI: Building a cross-disciplinary curriculum towards AI-enhanced biomedical and clinical care. Unknown article. Bridge center of BRIDGE2AI at UCLA. 2023 (2023).

Google Scholar

17. In: Demchenko Y, Belloum A, Los W, Wiktorski T, Manieri A, Brocks H, et al. editors. EDISON Data Science Framework: A Foundation for Building Data Science Profession for Research and Industry. 2016 IEEE International Conference on Cloud Computing Technology and Science (CloudCom); 2016 12-15 Dec (2016).

Google Scholar

18. Beyene J, Harrar SW, Altaye M, Astatkie T, Awoke T, Shkedy Z, et al. A roadmap for building data science capacity for health discovery and innovation in Africa. Front Public Health. (2021) 9. doi: 10.3389/fpubh.2021.710961

Crossref Full Text | Google Scholar

Keywords: artificial intelligence, voice, speech, language, competency frameworks, machine learning, data science, personas

Citation: Dorr DA, Krussel A, Hauck R, Jackson C, Dalal A, Bedrick S, Payne PRO, Bridge2AI-Voice Consortium and Hersh W (2025) Adapting data science competencies by role and purpose: Voice AI. Front. Digit. Health 7:1610253. doi: 10.3389/fdgth.2025.1610253

Received: 11 April 2025; Accepted: 26 August 2025;
Published: 21 October 2025.

Edited by:

Siti Anom Ahmad, Putra Malaysia University, Malaysia

Reviewed by:

Jing Shao, Hong Kong Baptist University, Hong Kong SAR, China
Adedoyin Odumabo, Trinity University, Nigeria

Copyright: © 2025 Dorr, Krussel, Hauck, Jackson, Dalal, Bedrick, Payne, Bridge2AI-Voice Consortium and Hersh. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: David A. Dorr, ZG9ycmRAb2hzdS5lZHU=; Andrea Krussel, a3J1c3NlbGFAd3VzdGwuZWR1

^†These authors share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.