Skip to main content

EDITORIAL article

Front. Public Health, 30 May 2022
Sec. Digital Public Health
This article is part of the Research Topic Voice Technology and Conversational Agents in Health Care Delivery View all 6 articles

Editorial: Voice Technology and Conversational Agents in Health Care Delivery

  • 1NORC at the University of Chicago, Chicago, IL, United States
  • 2Nationwide Children's Hospital, Columbus, OH, United States
  • 3Kids Speech Labs, Dublin, Ireland

Conversational agents (CA) are artificial intelligence (AI) driven applications that can process naturalist text and speech to deliver personalized interactions with users, i.e., chatbots, voice assistants, embodied conversational agents, intelligent virtual assistants, dialogue systems, and interactive voice recognition systems (IVRS). The inclusion of automatic speech recognition, natural language processing, and machine learning algorithms help to understand and react to natural conversations and learn from each interaction. Deloitte reported that the global conversational AI market is expected to reach ~US$14 billion by 2025, and chatbots are mostly used AI solutions, the top use of AI in the industry (1). Similarly, the adoption of smart speakers is increasing, and by 2021, ~94 million U.S. adults own at least one smart speaker (2).

The range of applications and demand for CA within healthcare is also growing, for patient-facing and healthcare providers-facing applications. During the pandemic, CA has been highly utilized tools by CDC and WHO to provide COVID19 screening services for populations and add value to be a remote care delivery tool (3). Furthermore, conversational sentiments, vocal attributes, and speech biomarkers are promising fields that will potentially help improve preventative care and remote patient monitoring, enabling early detection of symptoms, and promoting PGHD and PRO (47). Overall, CA provides an exciting opportunity to develop new approaches for interactions that can be delivered in a multimodal and naturalistic way to enhance healthcare. Yet, it is important to consider the additional design and deployment requirements when delivering CA solutions for healthcare. The articles submitted to this Research Topic highlight the excellent work being carried out to understand the human interaction required to deliver effective solutions via CA and the challenges that uniquely face healthcare applications.

With the advancement of NLU/NLP, CA has improved over time to understand the language and maintain natural conversations. Palanica and Fossat present a study to compare the comprehension accuracy of intelligent virtual assistants, which is a subset of CA (i.e., voice assistants, namely, Amazon Alexa, Google Assistant, and Apple Siri), to recognize commonly used medication names in the U.S. They replicate their earlier study to observe the changes between 2019 and 2021. Their findings show approximately 10% to 24% increase in performances of Amazon Alexa and Apple Siri. Among all, Google Assistant is found to be the best intelligent virtual assistant that comprehends the brand name and generic name of the medications. However, in comparison to the first study, the authors find slight differences in the performance of Apple Siri and no difference in the performance of Amazon Alexa and Google Assistant. They provide further insight on overall improvement in these intelligent virtual assistants toward becoming more usable in everyday life across different user demographics, emphasizing the value of intelligent virtual assistants in remote healthcare delivery and telemedicine.

The voice interactive CA increases the convenience in care management. Sezgin et al. investigate the feasibility of using voice interaction and automatic speech recognition enabled on smartphones to keep medical notes and track symptoms and medications for the patients. They conduct a pre-post study with caregivers of children with special healthcare needs (pre-n = 41, post-n = 24) to observe the caregiver preferences before and after using the voice interactive research app (SpeakHealth). The results are descriptively analyzed. The authors report that the majority of the participants kept voice interactive notes <20 s in length focusing on symptoms and conditions, medications, treatment, and therapies, and patient behaviors. The caregivers respond that the voice interaction and transcribed notes positively change their preference of technology to use and methods for tracking symptoms and health events at home. The findings support that voice interactive tools are feasible and effective in remote care management. The authors further discuss the implementations with voice interactive tools, including interoperable and integrated medical records, improving patient-reported outcomes through the rich unstructured patient-generated health data (through transcriptions), and improving speech biomarkers with generated real-world data.

The technology supporting CA and interactive technologies has accelerated significantly in recent years but the user research to ensure engagement, and adoption is still lagging. Ollier et al. investigate the potential impact of components of language on the acceptability of CA. Their study provides insight for CA designers to consider linguacultures when developing healthcare solutions employing CA. The authors examined the Tu/Vous distinction across German and French in a healthcare CA application and found that French speakers prefer a more formal form of address (vous) compared to Germans. The authors additionally report that the term of address can influence users' evaluation outcomes when age and gender are considered. This work is important to demonstrate the underlying lingua-cultural factors that can affect the acceptance of a CA. Deploying CAs in the healthcare setting requires accurate and complete data collection for successful application. This manuscript presents evidence of additional user experience that should be tested when developing CA applications.

Health and wellbeing is the biggest growing market segment in the voice technology market (8). With so many potential applications and ease of deployments through the skills marketplace, it is important to establish a framework for user experience and validating solutions for conversational health agents (CHA). Maharjan et al. investigates 2,741 critical reviews of 485 Amazon Alexa Skills listed under the “health and fitness” category using structural topic modeling technique. They align the top 15 subjects of user critique with a well-established hierarchy of user needs. Mapping commonly found critique subjects of the skills to themes, such as “Functional,” “Reliable,” “Usable,” and “Pleasurable,” helps to identify the important design components related to user experience for CHA. This analysis of user critiques is important for designers of healthcare apps. The authors also highlight the foundational needs of CHA design and outline the pitfalls that designers can avoid for ensuring better engagement and outcomes. The analysis approach is recommended since individual reviews can highlight opportunities for new features, identify bugs, highlight pain points and provide rich insight about the user's emotional experiences and broader social context.

One of the biggest challenges for any healthcare innovation is the behavior change associated with moving away from an attitude of “what we are doing now is good enough.” Grimm et al. identified the importance of a clear strategy to show potential users the added value of adopting new tools. In their study, the authors report the development of a CA for promoting capacity building to enhance the ability of organizations to address health issues and concerns. The scope of their work is toward developing digital formats for the delivery of training and intervention for team leaders. The addition of machine learning to enhance the capabilities (from an “if-then-else” rule-based system to a more intuitive, personalized, and responsive system) is proposed to deliver a scalable and efficient solution. The authors present a step-by-step description of the development of a chatbot from theory to initial prototype and to evaluation with actual users. The authors capture the challenges in trading off initially increased cognitive load with eventual improvements in working conditions. This work illustrates good practices on how to develop solutions by integrating the right users or stakeholders at the right stage of development.

In this Research Topic, the studies present evidence and insight about the implementations of CA toward individual wellbeing and healthy lifestyle, the increasing value of data collected through CA in remote care management and patient-generated health data, the value of user experience and design of CA, and improvements and challenges on natural language understanding and linguistic approaches (Figure 1).

FIGURE 1
www.frontiersin.org

Figure 1. Special issue's content topics at a glance. NLU, natural language understanding; PGHD, patient-generated health data. The figure has been designed using an image from Freepik.com.

Author Contributions

All authors listed have made a substantial, direct, and intellectual contribution to the work and approved it for publication.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Comes S, Schatsky D, Chauhan R. Conversational AI [Internet]. Deloitte Insights. Deloitte (2021). Available online at: https://www2.deloitte.com/us/en/insights/focus/signals-for-strategists/the-future-of-conversational-ai.html (accessed February 26, 2022).

Google Scholar

2. Alcántara A-M. Smart Speakers Go Beyond Waiting to Be Asked. (2021) Available online at: https://www.wsj.com/articles/smart/speakers/go/beyond/waiting/to/be/asked/11620640802 (accessed February 26, 2022).

Google Scholar

3. Sezgin E, Huang Y, Ramtekkar U, Lin S. Readiness for voice assistants to support healthcare delivery during a health crisis and pandemic. NPJ Digit Med. (2020) 3:122. doi: 10.1038/s41746-020-00332-0

CrossRef Full Text | Google Scholar

4. Anthes E. Alexa, do I have COVID-19? Nature. (2020) 586:22–5. doi: 10.1038/d41586-020-02732-4

CrossRef Full Text | Google Scholar

5. Sezgin E, Militello LK, Huang Y, Lin S. A scoping review of patient-facing, behavioral health interventions with voice assistant technology targeting self-management and healthy lifestyle behaviors. Transl Behav Med. (2020) 10:606–28. doi: 10.1093/tbm/ibz141

CrossRef Full Text | Google Scholar

6. Tudor Car L, Dhinagaran DA, Kyaw BM, Kowatsch T, Joty S, Theng Y-L, et al. Conversational agents in health care: scoping review and conceptual analysis. J Med Internet Res. (2020) 22:e17158. doi: 10.2196/17158

CrossRef Full Text | Google Scholar

7. Robin J, Harrison JE, Kaufman LD, Rudzicz F, Simpson W, Yancheva M. Evaluation of speech-based digital biomarkers: review and recommendations. Digit Biomark. (2020) 4:99–108. doi: 10.1159/000510820

CrossRef Full Text | Google Scholar

8. van der Straten S. Voice Tech Landscape: 150+ Infrastructure, Horizontal Vertical Startups Mapped Analysed [Internet]. Point Nine Land (2017). Available online at: https://medium.com/point-nine-news/voice-tech-landscape-150-startups-mapped-and-analysed-82c5adaf710 (accessed March 1, 2022).

Google Scholar

Keywords: voice technology, chatbot, conversational agent (CA), healthcare, digital health (eHealth)

Citation: Sezgin E and D'Arcy S (2022) Editorial: Voice Technology and Conversational Agents in Health Care Delivery. Front. Public Health 10:887492. doi: 10.3389/fpubh.2022.887492

Received: 01 March 2022; Accepted: 16 May 2022;
Published: 30 May 2022.

Edited and reviewed by: Mitsunori Ogihara, University of Miami, United States

Copyright © 2022 Sezgin and D'Arcy. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Emre Sezgin, esezgin1@gmail.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.