Abstract
Introduction:
Digital Health Interventions (DHIs) have been identified as a solution to the United Nations Sustainable Development Goals (SDG3) for health promotion and prevention. However, DHIs face criticism for shallow and transactional engagement and retention challenges. Integrating DHIs with health coaching represents a promising solution that might address these issues by combining the scalable and accessible nature of DHIs with the meaningful and engaging nature of health coaching. This systematic review aims to synthesise existing peer-reviewed research on coach-facilitated DHIs to understand how digital health coaching is being used in DHIs and the impact it has on engagement and lifestyle outcomes.
Methods:
Studies examining DHIs with a coaching component addressing lifestyle outcomes were included. A search of APA PsychINFO, Medline, Web of Science, and Scopus was performed from inception to February 2025. Three authors conducted the study selection, quality appraisal using the Mixed Methods Appraisal Tool (MMAT), and data extraction. Data extraction captured study characteristics, coaching features, participant engagement, and lifestyle outcomes.
Results:
Thirty-five studies were identified and synthesised using a narrative synthesis approach. This review highlights three coaching modalities in DHIs: digital human coaching, Artificial Intelligence (AI) coaching, and hybrid (human-AI) coaching. All coaching modalities demonstrated feasibility and acceptability.
Discussion:
While both human and AI coaching have shown a positive impact on both engagement and lifestyle outcomes, hybrid approaches need further refinement to harness AI's scalability and the depth of human coaching. However, the variability of engagement metrics and coaching protocols limited study comparability. Standardising how engagement and coaching delivery are measured and contextualised is crucial for advancing evidence-based digital health coaching. This review followed PRISMA guidelines and was registered in PROSPERO (Registration number: CRD42022363279). The Irish Research Council supported this work.
Systematic Review Registration:
https://www.crd.york.ac.uk/PROSPERO/view/CRD42022363279, identifier: CRD42022363279.
1 Introduction
The United Nations (UN) has developed a universal agenda for sustainable development, with the overarching vision for all human beings to thrive and to reach their full potential in dignity and equality in a healthy environment (1). The third goal (SDG3) of this agenda is to āensure healthy lives and promote wellbeing for all at all agesā (1). Under this goal, the UN emphasised reducing noncommunicable disease (NCD)-related mortality rates by one-third through prevention and treatment and promoting mental health and wellbeing (1). Digital health has been identified as a feasible, accessible and affordable solution to the UN goals of preventing NCDs and promoting health and wellbeing (2). Digital health is a complex and multifaceted field of both knowledge and practice, focused on the development and use of various technologies (i.e., smart watches, digital tracking tools, robotics, artificial intelligence and machine learning) for improving health (2ā4). There has been substantial growth in the field of digital health research, especially in terms of evidence-based digital health interventions (DHIs) (5).
DHIs create new opportunities for medical doctors, health professionals and other caregivers to scale and tailor health and lifestyle interventions at a lower cost (5). DHIs are particularly effective for promoting healthy lifestyle behaviours like healthy eating, physical activity, stress reduction and psychological wellbeing (6ā10). These behaviours, in turn, can promote healthier living and aid in the prevention and management of lifestyle-related NCDs such as cardiovascular disease, diabetes, certain cancers and chronic respiratory diseases (4, 5, 11ā13). However, challenges exist in DHI implementation, particularly with self-guided or automated interventions. Despite their low cost and scalability, these interventions are often perceived as shallow, impersonal, and transactional, with participants preferring human support for more meaningful engagement (4, 14ā17). Retention and engagement also pose significant issues, with a pooled estimated dropout rate of 43% across DHIs (18ā20).
Integrating health coaching with DHIs may offer a solution to the limitation of passive or automated interventions by providing live and engaging support to participants. Health coaching is a non-clinical, evidence-based health promotion intervention that promotes sustainable health behaviours and improves lifestyle outcomes through personalised, solution-focused, person-centred support. By tailoring interventions to individual needs and fostering a collaborative relationship, health coaching empowers clients to make informed decisions about their health and facilitates behaviour change (21ā23). Health coaching is already recognised as a valuable asset to healthcare systems. Specifically, for the health promotion and prevention of NCDs (24, 25). As such, health coach-facilitated DHIs could create meaningful yet scalable and accessible digital health solutions for health promotion and NCD prevention (26ā28).
This integration paves the way for a new field of research relating to digital health coaching. There has already been a substantial surge of professional coaches transitioning to digital spaces, with 93.3% of coaches globally transitioning to online coaching (29). In the digital space, health coaching is mediated through technology, where coaches interact with coachees via digital platforms to guide behaviour change (30). This allows coaches and clients to interact regardless of geographical location and through different modalities (i.e., through text, video call or phone call) (31). While the conversation between coaches and clients remains central to digital health coaching, it can be supported by a variety of technologies, including digital tracking tools (i.e., smart watches, glucose monitors), self-guided educational modules, habit tracking and automated reminders. Additionally, recent advancements in artificial intelligence (AI) have spurred research into conversational agent coaching or chatbot coaches as flexible, accessible and cost-efficient alternatives to human-facilitated digital health coaching (32, 33). Chatbots are AI-driven programmes designed to interact with individuals using natural language that mimics human dialogue through algorithm-generated responses (34). However, the active engagement between the coach (AI or human) and coachee through personalised and real-time interaction is central to digital health coaching.
While there has been increasing interest in the integration of coaching within DHIs, a clear gap remains in understanding the impact of different coaching modalities (e.g., human, AI, and hybrid) on participant engagement and health outcomes. Although some studies have investigated coach-facilitated DHIs, comprehensive analyses and synthesis regarding the effectiveness of these modalities, particularly their role in enhancing lifestyle behaviours and preventing NCDs, are limited. This systematic review aims to synthesise existing research on coach-facilitated DHIs to understand how digital health coaching is being used in DHIs and the impact it has on lifestyle outcomes and engagement. Ultimately, this review seeks to provide evidence to inform the development of future digital health interventions and research.
2 Materials and methods
The methodology for this systematic review complies with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines for systematic reviews (35) (Supplementary File S1). This protocol has been registered on PROSPERO (Registration number: CRD42022363279). After registration, amendments were made to enhance the comprehensiveness of the review. The review timeline was updated to incorporate a more recent search date. Additionally, further details were added to the inclusion criteria to clarify modifications. Lastly, the search strategy was revised to reflect the updated search parameters.
2.1 Inclusion and exclusion criteria
The PICO framework guided the definition of eligibility for this review (TableĀ 1). However, in this review, the traditional āComparatorā element in PICO was replaced with āContextā, as this review focuses on the integration of coaching within DHIs without the need for a direct comparator.
Table 1
| PICOS heading | Inclusion and exclusion criteria |
|---|---|
| Population | Studies involving adults aged 18 or older, including adults with or at risk for NCDs (i.e., cancer, diabetes, cardiovascular disease). Studies involving children or adolescents were excluded |
| Intervention | DHIs delivered online through a web-based platform or mobile application tailored towards one or more of the six pillars of lifestyle medicine. DHIs were excluded if the DHI was a medical intervention or used as a part of medical or mental health treatments |
| Comparator | Studies that explored coach-facilitated DHIs with an active coaching component, meaning that there is two-way interaction between coach and participants via text, video, or phone calls. Non-interactive coaching interventions were excluded. This coaching component could be health, lifestyle, or wellbeing coaching delivered by a human coach or conversational agent. Studies focusing on in-person, career/executive, or athletic coaching were excluded |
| Outcomes | Studies were included if they reported at least one of the pillars of lifestyle as its primary outcome. Studies were excluded if their primary outcomes were clinical or biomedical |
| Study design | Only peer-reviewed studies were included, with no publication date restrictions. Non-English studies were excluded due to language limitations |
PICOS framework for study inclusion and exclusion.
This review seeks to gather evidence of coach-facilitated DHIs implemented to increase the health and wellbeing of populations. While this largely falls into preventative interventions to help reduce the risk of NCDs, substantial studies are using coach-facilitated DHIs to help increase the health and wellbeing of cohorts living with NCDs (predominantly cancers and diabetes). It would be remiss to exclude these cohorts because of their disease status, despite the intervention aligning with inclusion criteria. As a result, we included adults living with NCDs once the primary outcomes were focused on lifestyle outcomes and not clinical or medical outcomes or markers (i.e., HbA1c, Cholesterol, BMI, weight reduction or waist circumference). This review concentrated on the adult population (18 years and older). As such, studies involving adolescents or children were excluded.
Peer-reviewed studies were considered if they explored digital coaching in DHIs and addressed at least one of the lifestyle medicine pillars: sleep, physical activity, psychological wellbeing and stress management, substance use, or healthy eating. Studies were excluded if they focused on medical interventions or used coaching as a part of medical or mental health treatments (i.e., coaching for patient education, trust building for treatments, pain management, medication adherence) that were not directly aimed at increasing lifestyle or wellbeing outcomes. Similarly, studies with clinical primary outcomes or biological markers as the primary outcome were excluded.
Studies that incorporated human, AI or hybrid (AI and human) coaching components alongside DHIs were included. For the inclusion criteria, digital health coaching was defined as a coach-coachee partnership facilitated online with the purpose of providing tailored support to enhance the DHI and promote behaviour change and health outcomes among participants. The coaching component of the DHI must align with this definition, meaning that coaching through ad hoc non-interactive messages or notifications was excluded. Studies that explored in-person or face-to-face interventions only, as well as career, executive or athletic coaching, were excluded. Finally, all peer-reviewed studies from any country were included, and no publication date limit was applied, given the limited literature on coaching, especially digital coaching. Studies that were not available in English were excluded.
2.2 Search strategy
The search strategy used keywords identified through an initial review of the literature. Keywords were grouped using Boolean operators and truncations. The PICO Framework also guided the formation of the final search strategy (TableĀ 2).
Table 2
| Population | Intervention | Context | Outcome |
|---|---|---|---|
| Adult* | ādigital healthā OR āeHealthā OR āmHealthā OR āmobile healthā OR ādigital interventionā OR āmobile applicationā OR āsmartphone applicationā OR āhealth appā OR āinternet interventionā OR āonline programā OR āweb-based interventionā | ācoachingā OR āhealth coachingā OR ādigital coachingā OR ālifestyle coachingā OR āmotivational coachingā OR āchatbotā OR āartificial intelligenceā āAI chatbotā OR āvirtual coachingā OR āAI Coachā OR āautomated coachingā OR āhealth coachā OR ācoaching psychologyā OR ādigital health coachā OR āconversational agentā | āengagementā OR āadherenceā OR āparticipationā OR āuser engagementā OR āuser adherenceā OR āretentionā OR āsatisfactionā OR āuser satisfactionā OR āacceptabilityā OR āusabilityā OR ālifestyleā OR āhealthy lifestyleā ālifestyle changeā OR āwellbeingā OR āhealthy livingā OR ābehaviour changeā OR āhabit*ā OR āHealthy eatingā OR āphysical activityā OR ānutritionā OR āweight managementā OR āstress managementā OR āsleep qualityā OR āactive minutesā |
| And not | āAdolescent*ā OR āchild*ā OR āclinical outcomeā OR ātreatmentā OR ārehab*ā OR āathlet*ā OR āsports coach*ā OR āperformance coach*ā OR āclinical treatmentā | ||
Final search strategy using the PICO framework.
* denotes a truncation operator, used to search for word variations or endings.
2.3 Informational sources
We conducted searches in electronic databases, including APA PsychINFO, Medline, Web of Science, and Scopus, on February 1, 2025. In addition to the electronic database searches, backward and forward citation searching of included studies was conducted to identify any additional relevant studies. Furthermore, forward searching of protocols deemed relevant during screening was carried out to ensure the inclusion of studies that may have been missed in the initial search.
2.4 Study screening
Covidence, an online specialised systematic review website, was used to screen studies. One reviewer (CL) screened the titles and abstracts of the identified studies based on the eligibility criteria. Three reviewers (CL, ROD, and JL) then independently reviewed the identified full-text studies. Reviewers met to discuss and resolve any conflicts or disagreements. If consensus could not be reached, a fourth reviewer (PJD) was designated to assess the relevant records. Three reviewers are qualified health coaches accredited by the European Mentoring and Coaching Council (EMCC). Two reviewers hold psychology degrees (CL, ROD), and the fourth reviewer holds a PhD in immunology and a degree in counselling and psychotherapy (PJD). The third reviewer (JL) has a biomedical engineering degree with specific expertise in digital health research.
2.5 Data extraction process
Two reviewers (CL & ROD) independently extracted the data using the Covidence Data Extraction template. To resolve discrepancies in the data extracted, reviewers came together to review data extraction and any existing conflicts. Conflicts that couldn't be resolved were referred to a third reviewer for resolution (PJD). However, this was not needed. The following characteristics were recorded: author, year, study aim, participant description and inclusion and exclusion status, total participant number, study design, lifestyle focus, measures taken, key outcomes, coaching delivery and intensity, coaching theory, role of coaching, length of intervention, and limitations.
2.6 Quality assessment
The Mixed Methods Appraisal Tool assessed the quality of all studies included in the systematic review. This was conducted independently by two reviewers (CL and ROD), who then jointly reviewed independent quality appraisal for any conflicts. Unresolved conflicts were referred to a third reviewer (PJD), though this step was not needed. The final MMAT results are included in the supplementary files (Supplementary File S2).
2.7 Study synthesis
The identified studies were synthesised using a narrative synthesis in relation to the study question: How is digital health coaching used in DHIs, and what is the impact of lifestyle outcomes and engagement? The synthesis followed the steps outlined by Popay (36), which included: becoming familiar with the studies, organising them into logical categories, comparing and synthesising the studies, exploring the relationships within and between the studies and synthesising the data under the relevant themes. Studies included in this review were first grouped by coaching modality (i.e., human, AI, or hybrid coaching) and then further categorised by targeted lifestyle outcomes (physical activity, psychological wellbeing, stress management, healthy eating, sleep, and substance use) and engagement outcomes. We evaluated the consistency of the findings across studies to assess the certainty of evidence. Missing or inconsistent data that was not a criterion for inclusion (i.e., engagement statistics, coaching theories, and length of intervention) was noted during synthesis and presented in the results. Studies were not excluded based on these missing data, but it was considered as a limitation during synthesis. A meta-analysis could not be performed due to significant variations in sample populations, outcome measures, and study designs across the included studies.
3 Results
The database search yielded a total of 4,894 studies. After removing 1,748 duplicates, a further 2,936 studies were removed after title and abstract screening. One hundred and seventy-six studies were removed during the full-text review. In total, 35 studies were included in this review. The complete screening process is illustrated through the PRISMA diagram (FigureĀ 1).
Figure 1

PRISMA flow diagram for systematic reviews illustrating the study selection process. Adapted with permission from āPRISMA 2020 flow diagram template for systematic reviewsā, by Page et al., licensed under CC BY 4.0.
3.1 Quality assessment
No articles were excluded from the review based on the MMAT quality appraisal score (Supplementary File S2). Regarding the methodological rigour of the included articles, 11 studies scored 100% (5/5), and 23 scored 80% (4/5), indicating high quality. Only one study scored 60% (3/5), indicating medium quality. This demonstrates that the majority of studies included in the review met a high standard of methodological rigor.
3.2 Description of the included studies
Thirty-five studies were included for full review and data extraction. TableĀ 3 summarises the study characteristics, including year of publication, country, length of intervention, number of participants, study design, intervention description, lifestyle areas addressed, measures used, and key outcomes. Most of the studies were pilot, feasibility, or early-stage studies, with only seven studies (20.6%) being full-scale randomised controlled trials (37ā43). Intervention lengths ranged from 1 week to 12 months, and sample sizes ranged from 7 to 3,629 participants (TableĀ 3).
Table 3
| Author | Year | Country | Length | Participants | Study design | Intervention | Lifestyle pillars | Measures | Outcomes |
|---|---|---|---|---|---|---|---|---|---|
| Alley et al. (44) | 2,016 | Australia | 8 weeks | 84 adults | Randomised controlled trial | Web-based physical activity interventionā+āvideo-based coaching |
|
Self-report measures:
|
Increased physical activity: Tailoringā+ācoaching (+150ā min/week), Tailoring only (+123ā min/week), Waitlist (+34ā min/week). Significant mental health improvement in tailoringā+āvideo-coaching group only (Pā=ā.01) |
| Aymerich-Franch and Ferrer (17) | 2022 | Spain | 3 weeks | 32 adults | Nonrandomised controlled trial | Coaching programme delivered by a speech-based conversational agent (CAC) |
|
Self-report measures:
|
Significant increase in PGI (Pilot: tā=ā5.28, Pā=ā.013; main study: tā=ā3.84, Pā=ā.001). Moderate and significant increase in SLS (Pilot: tā=ā2.12, Pā=ā.124; main study: tā=ā4.99, Pā<ā.001). Moderate and significant decrease in negative affect (pilot: tā=ā2.37, Pā=ā.098; main study: tā=ā4.31, Pā<ā.001) |
| Bakas et al. (45) | 2018 | USA | 3 weeks | 22 older adults | Quasi-experimental design | A nurse-led intervention delivered through a telepresence robot for healthy independent living |
|
|
Intervention group saw a decline in unhealthy days (3.6 to 1.0 vs. control: 3.4 to 2.3). Slight decline in depressive symptoms (10.1 to 9.6 vs. control worsening to 10.6). Self-efficacy increased (8.2 to 9.1 vs. control: 8.8 to 8.4). Physical activity decreased slightly in both groups. Physical activity minutes declined for the intervention group (163.6 to 128.6ā min/week) and slightly decreased for the waitlist control group (159 to 152.8ā min/week) |
| Blair et al. (46) | 2021 | USA | 13 weeks | 54 older adults | Randomised controlled trial | Jawbone UP2 activity monitor paired with a smartphone appā+ātech support (control) or health coaching via phone call (intervention) |
|
Wearable device measures:
|
No significant change in sedentary time: tech support group changed by 6.0ā min (95% CI ā39.5 to 51.6; Pā=ā.79), health coaching by 7.9ā min (95% CI ā30.8 to 46.6; Pā=ā.68). Steps/day increased significantly in the health coaching group (+1,675, Pā=ā.009) but not in the tech support group (+654, Pā=ā.37). Moderate-intensity physical activity increased significantly in the coaching group (Pā=ā.008), compared to control group (Pā=ā.33). No significant difference in SPPB scores |
| Chang et al. (47) | 2023 | USA | 3 weeks | 15 mothers | A single-group pre-test post-test study | Web-based lessonsā+āhealth coaching sessions |
|
Self-report measures:
|
Sugar intake decreased (21.07 to 12.53 tsp, dā=ā0.48, Pā=ā0.126). Fruit/vegetable intake increased (4.73ā5.55 cups, dā=ā0.49, Pā=ā0.138). Physical activity showed minor improvement (107ā171.5 METs, dā=ā0.13, Pā=ā0.67). Stress reduced (18.33 to 14.67, dā=ā-0.52, Pā=ā0.097). Emotional control significantly improved (38.50ā42.58, dā=ā0.71, Pā=ā0.032). Motivation and self-efficacy improved across all domains. Motivation improved across all domains (healthy eating: Pā=ā0.025; physical activity: Pā=ā0.021; stress: Pā=ā0.011). Self-efficacy increased for healthy eating only (Pā=ā0.058) |
| Chew et al. (48) | 2024 | Singapore | 1 week | 251 adults | A single-group pre-test post-test study | AIāassisted weight management app with a chatbot-based check-in system, food-based computer vision image recognition and automated nudges |
|
Self-report measures:
|
Statistically significant improvements in the overeating habit (Pā<ā.001), snacking habit (Pā=ā.02 -ā<ā.001), self-regulation of eating behaviour (Pā=ā.007), depression (pā=ā.007), and physical activity (Pā<ā.001). Qualitative themes: increased self-monitoring, personalised reminders, food logging with image recognition, and engaging user interface |
| Chow et al. (27) | 2020 | USA | 6 weeks | 19 women cancer survivors | A single-group pre-test post-test study | iCanThrive, an app app-based intervention with eight educational modulesā+āphone coaching |
|
Self-report measures:
|
Significant reduction in depressive symptoms (tā=ā2.22 and Pā=ā0.04), slight increase in emotional self-efficacy (tā=ā1.33, Pā=ā0.20), significant reduction in sleep disturbance (tā=ā3.41 and Pā=ā.003.). Continued significant difference in sleep disruption from baseline to the 4-week follow-up (tā=ā3.71; Pā=ā.002) |
| Daley et al. (49) | 2020 | Brazil | 1 month | 3,629 adults | A single-group pre-test post-test study | Vitalk, a mental health chatbot |
|
Self-report measures:
|
Significant reduction in depressive symptoms (dā=ā0.91, Pā=ā0.001). 46.3% of users moved below the clinical cut off for PHQ-9. Significant anxiety reduction (dā=ā-0.85, Pā=ā0.001) and stress reduction (dā=ā-0.81, Pā=ā0.001), 49.0% of users moved from above to below the clinical cut-off for anxiety (GAD-7 scoreāā¤ā8) |
| Damschroder et al. (37) | 2020 | USA | 12 months | 358 veterans | Randomised controlled trial | StayStrong app, activity monitoring using wearable devicesā+ātelephone health coaching |
|
Wearable device measures:
|
Both groups showed a decline in active minutes (intervention: ā41ā min; control: ā65ā min) and step count (intervention: ā1933; control: ā2427). No significant difference between intervention and control at 12 months for active Minutes (Pā=ā.48) or step counts (Pā=ā.08) |
| D'Avolio et al. (50) | 2023 | USA | 8 weeks | 26 Dyads | Randomised controlled trial | Telehealth coaching program including nutritional education, stress-reduction material and coaching materials |
|
Self-report measures:
|
Protein intake increased significantly in the coached group (1.00ā±ā0.17ā1.35ā±ā0.23ā g/kg) vs. the not-coached (0.91ā±ā0.19ā1.01ā±ā0.33ā g/kg, Pā=ā.01, η²ā=ā.24). No significant effect of protein intake in FMWD. No significant changes in MCSI, SF36 physical component scale and mental component scale, or fatigue |
| Dhinagaran et al. (51) | 2021 | Singapore | 4 weeks | 52 adults | Single-arm feasibility study | Conversational agent promoting healthy lifestyle changes |
|
Self-report measures:
|
Vegetable intake increased from 27% to 29% consuming vegetables at least once a day. Fruit intake increased from 3% to 7% (at least three portions). Participants who never consumed sweetened beverages increased from 38% to 45%. Participants who never consumed fried food and snacks increased from 25% to 30%. The stress score was reduced from 17 to 16. Sleep quality did not change significantly. Physical activity increased from 30 to 50ā min/week. Time spent sitting reduced from 439 to 406, and METs per week score went from 857 to 765. Time in moderate to vigorous activity increased from 30ā min per week at baseline to 50ā min at follow-up |
| Foran et al. (38) | 2024 | USA | 1 month | 1,345 adults | Randomised controlled trial | Zennyāconversational agent designed to enhance wellbeing |
|
Self-report measures:
|
Significant improvements in wellbeing (control: dā=ā0.24, Pā<ā.001; intervention: dā=ā0.26, Pā<ā.001), psychological flourishing (intervention: dā=ā0.19, Pā<ā.001; control: dā=ā0.18, Pā<ā.001), and positive psychological health (intervention: dā=ā0.17, Pā=ā.001; control: dā=ā0.24, Pā<ā.001). No significant differences in effectiveness between groups |
| Gabrielli (52) | 2021 | Italy | 4 weeks | 71 students | Proof of concept study | Atenaāa psychoeducational chatbot supporting stress management |
|
Self-report measures:
|
Significant reduction in anxiety symptoms (t39ā=ā0.94; Pā=ā.009), stress symptoms (t39ā=ā2.00; Pā=ā.05), and increase in mindfulness (Pā<ā.001) |
| Gudenkauf et al. (53) | 2024 | USA | 8 weeks | 13 caregivers | Single-arm pilot feasibility trial | DHI involving monitoring and visualising health-promoting behaviours and health coaching |
|
Self-report measures:
|
Most participants showed improvement or maintenance of QOL (15% and 62%), sleep quality (23% and 62%), social engagement (23% and 69%), and general self-efficacy (23% and 62%). Physical activity outcomes were not reported |
| Han et al. (39) | 2024 | Singapore | 6 months | 148 adults | Randomised controlled trial | nBuddy Diabetes app, including features for diet, physical activity, behaviour change, blood glucose monitoring and health coaching |
|
Self-report measures:
|
Significant improvement in diet quality by 6.2 points (95% CI, 3.8ā8.7; Pā<ā.001) in the intervention group compared with control. Significant reduction in intake of sugar-sweetened beverages (ā0.5 servings/day, 95% CI, ā0.8, ā0.2; Pā<ā.001) and sodium (ā726ā mg/day, 95% CI, ā983, ā468; Pā<ā.001) |
| Heber et al. (40) | 2016 | Germany | 7 weeks | 264 adults | Randomised controlled trial | Guided web and mobile-based stress management trainingā+āwritten feedback on every completed session from an e-coach |
|
Self-report measures:
|
Large effects observed for perceived stress (Pā<ā.001, Cohenās dā=ā0.83), depression (dā=ā0.95, Pā<ā.001), anxiety (dā=ā0.84, Pā<ā.001), and worry (dā=ā0.63, Pā<ā.001). Mental health scores improved significantly (dā=ā0.68, Pā<ā.001). Effects maintained at 12-month follow-up |
| Horn et al. (54) | 2023 | USA | 12 months | 26 women | Qualitative study | Balance after babyāweb-based educational modules and lifestyle coaching |
|
|
Participants reported changes in diet and physical activity as a result of the intervention. Online modules and support from the lifestyle coach were perceived to have had a positive effect on personal and familial lifestyle change. Other components were less utilised, including the community forum, gym memberships, and pedometers |
| Ly, Ly and Andersson (55) | 2017 | Sweden | 2 weeks | 28 adults | Randomised controlled trial | Smartphone app with an automated chatbot |
|
Self-report measures:
|
Participants who adhered to the intervention (nā=ā13): significant effects for FS (F1, 27ā=ā5.12, Pā=ā0.032), PSS-10 (F1, 27ā=ā4.30, Pā=ā0.048). No significant effect for SWLS [F (1, 27)ā=ā2.83, Pā=ā0.10]. Within-group effects: PSS-10 [t (12)ā=ā2.22, Pā=ā0.046, dā=ā0.87] and SWLS [t (12)ā=ā-2.25, Pā=ā0.044, dā=ā0.41] showed medium-to-large improvements in the intervention group |
| Maher et al. (56) | 2020 | Australia | 12 weeks | 31 adults | Case-control study | Artificial intelligence virtual health coach-led physical activity and diet intervention |
|
Self-report measures:
|
Participants increased physical activity by 109.8ā min/week [F (2, 29)ā=ā6.45, Pā=ā.005]. Mediterranean diet scores improved from 3.8 to 9.6 [F(2,29)ā=ā44.56, Pā<ā.001] |
| Marler et al. (57) | 2019 | USA | 18.5 weeks | 319 adults | Prospective open-label single-arm study | Digital smoking cessation program incorporating a Food and Drug Administration, cleared carbon monoxide breath sensor, and text-based human coaching |
|
Self-report measures:
|
Positive changes in attitudes were observed from baseline to Mobilise (pre-Quit): increased confidence to quit (4.2ā7.4, Pā<ā.001) and decreased expected difficulty maintaining quit (3.1ā6.8, Pā<ā.001). Quit attempt rate: 79.4% (216/272, completer). 7-day PPA: 32.0%, 30-day PPA: 27.6%. 25.9% achieved ā„50% smoking reduction |
| McGuire et al. (58) | 2022 | Australia | 8 Weeks | 43 midlife adults | Two-arm parallel group feasibility study | GroWell for Health ProgramāeHealth intervention consisting of an interactive ebook and nurse coaching |
|
Self-report measures:
|
Significant improvements in fruit intake, physical activity stage of change, and exercise habits for arm A (ebook and coaching) and arm b (ebook only) (pā<ā.05).Alcohol frequency decreased for both groups, but this was not significant (arm A: 1.9 to 1.8 days a week, pā=ā0.65; arm B: 2.7 to 2.5 days a week, Pā=ā0.62) |
| Moreno-Blance et al. (59) | 2019 | Spain | 3 weeks | 20 adults | Feasibility study | mHealth platform with an AI chatbotāPaola, educational material, recipes, diet and activity log |
|
Self-report measures:
|
Participants met sleep goals 47.3% of days and logged moderate to vigorous activity 130% over the target. 10,000-step goal met 122% of the time. Adherence to a Mediterranean diet was not reported |
| Olliers et al. (60) | 2023 | Switzerland | Approx. 6 months | 7,135 Adults | Single-arm interventional study | chatbot-led digital health app designed to provide lifestyle coaching across seven health areas to mitigate the collateral damage of the COVID-19 pandemic |
|
|
Significant decrease in anxiety levels between assessments [t(54)ā=ā3.7, Pā<ā0.001, dā=ā0.499; Intent-to-treat: t(416)ā=ā3.4, pā<ā0.001, Cohen dā=ā0.165]. significant change in depression scores occurred (within group): F2,38ā=ā7.01, Pā=ā0.003, with a large effect size. Physical activity, sleep and healthy eating did not report significant changes in outcomes between assessments (Pā=ā.847; Pā=ā.208; Pā=ā.837 respectively) |
| Partridge et al. (41) | 2015 | Australia | 12 weeks | 250 adults | Two-arm, parallel-group randomised controlled trial | Mobile app with education, self-monitoring and support resources, motivational text messages and coaching calls |
|
Self-report measures:
|
Intervention group increased vegetable intake (Pā=ā.009), reduced sugary drinks (Pā=ā.002), and improved physical activity (Pā=ā.05). They also increased their total physical activity by 252.5 MET-minutes (95% CI 1.2ā503.8, Pā=ā.05) and total physical activity by 1.3 days (95% CI 0.5ā2.2, Pā=ā.003) compared to controls |
| Price and Brunet (61) | 2022 | Canada | 12 weeks | 7 young adults | Single-arm feasibility trial | Telehealth behaviour change intervention |
|
Self-report measures:
|
Participants reported significant increase in physical activity (T0ā=ā30; T1ā=ā150; Pā=ā0.018), and fruit and vegetable intake (T0ā=ā2; T2ā=ā4.71; Pā=ā0.018). There was also a significant increase in sense of autonomy (Pā=ā0.046), competence (Pā=ā0.018) and relatedness (Pā=ā0.028) relating to fruit and vegetable intake, and a significant increase in autonomy (Pā=ā0.027) and competence (pā=ā0.28) relating to physical activity. Qualitative data suggests the health coach created an autonomous supported environment, developed capacity for positive connections, and increased motivation |
| Sacher et al. (62) | 2024 | Netherlands | 12 months | 107 adults | Prospective single-arm study | Health coach-led, asynchronous, text-based, Digital Behaviour Change Coaching Intervention (DBCCI) |
|
Self-report measures:
|
WEMWBS scores significantly improved at Month 6 and remained significant at Month 12 (Pā=ā.02). GAD-7 scores significantly decreased at Month 1 (Pā=ā.009). LOCES significantly decreased at month 1 (Pā<ā.001) and remained statistically significant at months 6 and 12 (Pā<ā.001). BBAQ scores significantly increased at Month 6 (Pā<ā.001) but did not remain significant at Month 12 (Pā=ā.45) |
| Santini et al. (63) | 2023 | Italy/Netherlands | 10 weeks | 91 adults | Longitudinal Mixed Methods Study | Digital coaching intervention |
|
Self-report measures:
|
Perceived worsened physical health (SF-12-PCS ā¤50) increased from T0 to T2 (pā=ā0.002), while perceived mental well-being (SF-12-MCS ā¤42) remained stable (Pā<ā0.001). Physical activity significantly increased from T0 to T1, then stabilised (Pā=ā0.012), with low activity rates decreasing from 8.2% to 6% during the human coach-supported phase but rising to 8.3% when using the system independently. Self-efficacy and socialisation levels fluctuated over time |
| Smart et al. (64) | 2022 | USA | 8 weeks | 28 adults | A single-group pre-test post-test study | Text-based coaching and Fitbit program |
|
Wearable device measures:
|
There was no significant change in the average weekly steps (mean difference 7.26, SD 6209.3; Pā=ā.99), sedentary minutes (mean difference ā17.6, 95% CI ā67.8 to 32.6), or light (mean difference ā3.37, 95% CI ā28.8 to 22.1) and moderate to vigorous physical activity (mean difference 6.79, 95% CI ā3.4 to 17.0) |
| Spring et al. (42) | 2018 | USA | 9 months | 212 adults | Randomised controlled trial | Multicomponent intervention integrating mHealth, modest incentives, and remote coaching |
|
Self-report measures:
|
Sustained improvement in composite diet and activity score at 3, 6, and 9 months (Pā<ā.001). Sequential treatment showed slightly greater improvement at 6 months (Pā=ā.03), but no difference at 3 or 9 months compared to simultaneous treatment |
| Terblanche et al. (65) | 2022 | South Africa | 6 months | 168 students | Randomised controlled trial | AI Chatbot Coach |
|
Self-report measures:
|
Psychological wellbeing, resilience and stress showed a medium-sized correlation with each other, ranging from rā=ā.50 to rā=ā-.64 (correlations with stress being negative). The experimental group showed an increase of 55% on their goal attainment compared to 24% in the control group. No statistical significance was found between psychological wellbeing, resilience, stress and time and group (resilience, Pā=ā0.80; psychological wellbeing, Pā=ā0.89; and stress, Pā=ā0.91) |
| To, Green and Vandelanotte (66) | 2021 | Australia | 6 weeks | 116 adults | Quasi-experimental design without control group | Physical activity chatbot and a connected wearable device |
|
Self-report measures:
|
Participants reported an increase in steps from baseline (increase of 627, 95% CI 219ā1035 steps/day; Pā<ā0.01) and total physical activity (increase of 154.2ā min/week; 3.58 times higher at follow-up; 95% CI 2.28ā5.63; Pā<ā0.001). Participants were also more likely to meet the physical activity guidelines (odds ratio 6.37, 95% CI 3.31ā12.27; Pā<ā0.001) at follow-up |
| Ulrich et al. (67) | 2024 | Switzerland | 7 weeks | 230 students | Randomised controlled trial | Conversational agentādelivered stress management coaching intervention |
|
Self-report measures:
|
Significant reduction in perceived stress (Cohen dā=ā-0.60, Pā=ā.001), depressive and psychosomatic symptoms (Cohen dā=ā-0.50, Pā=ā.003; Cohen dā=āā0.36, Pā=ā.010). No significant change in anxiety and active coping (Cohen dā=āā0.29, Pā=ā.08; Cohen dā=ā0.13, Pā=ā.06) |
| Wiegand et al. (68) | 2010 | USA | 14 weeks | 562 women | Longitudinal Study | Internet-based online coaching, education programme and olfactive-based personal care products |
|
Self-report measures:
|
Group 1 (coaching, education programme, and olfactive-based products) had statistically significant improvement in the PSS score vs. Group 3 (control) (Pā<ā0.01). Group 2 (coaching and education programme) demonstrated significantly greater reductions vs. baseline (Pā<ā0.001), but there were no statistically significant differences vs. Group 3. Significant improvements in other efficacy outcomes such as POMS total mood disturbance, TICS work overload and social responsibility subscales, STAI and awakenings, (Pā<ā0.05) |
| Wijsman et al. (43) | 2013 | Netherlands | 3 months | 226 older adults | Randomised controlled trial | Philips DirectLife: monitoring and feedback by accelerometer and digital coaching |
|
Wearable device measures:
|
At the ankle, activity counts increased by 46% [standard error (SE) 7%] in the intervention group, compared to 12% (SE 3%) in the control group (P=<.001). Measured at the wrist, activity counts increased by 11% (SE 3%) in the intervention group and 5% (SE 2%) in the control group (Pā=ā.11). After processing of the data, this corresponded to a daily increase of 11ā min in moderate-to-vigorous activity in the intervention group vs. 0ā min in the control group (Pā=ā.001) |
| Williams et al. (69) | 2021 | New Zealand | 21 days | 64 students | Open trial single-arm study | Digital mental health intervention delivered by a chatbot |
|
Self-report measures:
|
WHO-5 scores improved significantly (SDā=ā15.07; Pā<ā0.001). Mean reduction of the PSS-10ā=ā1.77 (SDā=ā4.69; Pā=ā0.004) equating to effect sizes of 0.49 and 0.38, respectively. Those who were clinically anxious at baseline (nā=ā25) experienced a greater reduction of GAD-7 symptoms than those (nā=ā39) who started the study without clinical anxiety (1.56, SDā=ā3.31 vs. 0.67, SDā=ā3.30; Pā=ā0.011) |
Characteristics of included studies.
To assess the heterogeneity of the participants, demographic information, including sex, age, race, and education, was collected and is presented in TableĀ 4. Participants were predominantly Caucasian, though African American/Black, Asian, multiracial, and other racial groups were represented in smaller proportions. Notably, four studies had a larger representation of African American/Black (47) and Asian participants (39, 48, 51). The mean age of participants varied widely. Participants were largely aged between 40 and 60 years, with the exception of outlying studies focusing on younger adults and students (18ā34 years) (47ā49, 51, 52, 55, 61, 65, 67, 69) and older populations (60ā+āyears) (38, 43, 45, 46, 50). Women were more frequently recruited for digital health interventions, with female participation ranging from 25.2% to 100% across studies. Education levels were inconsistently reported, but in studies that provided this information, the percentage of participants with a college degree ranged from 20.7% to 95%.
Table 4
| Author | Gender %female | Age (mean) | Age (range) | Race % | College degree (%) | ||||
|---|---|---|---|---|---|---|---|---|---|
| Caucasian | African American/Black | Asian | Multiracial | Other | |||||
| Alley et al. (2016) (44) | 76 | 54.21 | 41.66 | ||||||
| Aymerich-Franch and Ferrer (2022) (17) | 53.3 | 40.47 | 76.7 | ||||||
| Bakas et al. (2018) (45) | 71.4 | 84 | 100 | ||||||
| Blair et al. (2021) (46) | 56 | 69.6 | 57 | 7 | 57 | ||||
| Chang et al. (2023) (47) | 100 | 32.9 | 33.33 | 46.66 | 6.66 | 13.33 | 60 | ||
| Chew et al. (2024) (48) | 52 | 31.25 | 97 | 3 | 95 | ||||
| Chow et al. (2020) (27) | 100 | 59.6 | 94 | 3 | 3 | ||||
| Daley et al. (2020) (49) | 76 | 18ā24 | |||||||
| Damschroder et al. (2020) (37) | 25.2 | 39.8 | 64.7 | 13.2 | 22.1 | 93.6 | |||
| D'Avolio et al. (2023) (50) | 92.3 | 66.2 | 84.6 | 3.8 | 3.8 | 7.7 | |||
| Dhinagaran et al. (2021) (51) | 62 | 33.7 | 3 | 93 | 2 | 87 | |||
| Foran et al. (2024) (38) | 80.3 | 47.17 | |||||||
| Gabrielli et al. (2021) (52) | 67.6 | 18ā34 | |||||||
| Gudenkauf et al. (2024) (53) | 61.5 | 52 | 28ā70 | 69.2 | 23.1 | 7.7 | 76.9 | ||
| Han et al. (2024) (39) | 40 | 53.1 | 92.5 | 7.5 | |||||
| Heber et al. (2016) (40) | 73.1 | 43.3 | 83.3 | 16.7 | 76.9 | ||||
| Horn et al. (2023) (54) | 100 | 34.5 | 65 | 23 | 12 | 23 | 62 | ||
| Ly, Ly and Andersson (2017) (55) | 53.5 | 23.3 | |||||||
| Maher et al. (2020) (56) | 67.7 | 56.2 | |||||||
| Marler et al. (2019) (57) | 57.7 | 30ā39 | 82.8 | 6.9 | 1.6 | 81.5 | |||
| McGuire et al. (2022) (58) | 89.2 | 50.6 | 77.7 | ||||||
| Moreno-Blanco et al. (2019) (59) | 33.30 | 30 | |||||||
| Ollier et al. (2023) (60) | 71.3 | 46.3 | |||||||
| Partridge et al. (2015) (41) | 27.7 | 61.3 | 80.6 | ||||||
| Price and Brunet (2022) (61) | 85.7 | 33.8 | 71.4 | ||||||
| Sacher et al. (2024) (62) | 89.7 | 41.8 | 59.9 | 1.9 | 10.3 | ||||
| Santini et al. (2023) (63) | 35.5 | 46.8 | |||||||
| Smart et al. (2022) (64) | 77 | 47.1 | 3 | 80 | 17 | 60 | |||
| Spring et al. (2018) (42) | 76.4 | 40.8 | 41 | 46.7 | 3.8 | 8.5 | 69.3 | ||
| Terblanche et al. (2022) (65) | 56 | 22 | |||||||
| To, Green and Vandelanotte (66) | 81.9 | 49.1 | 87.1 | 12.9 | |||||
| Ulrich et al. (2024) (67) | 73.6 | 26.7 | 26.4 | ||||||
| Wiegand et al. (2010) (68) | 100 | 35.7 | 68.3 | 21.7 | 9.7 | 20.7 | |||
| Wijsman et al. (2013) (43) | 40.9 | 64.8 | 56.7 | ||||||
| Williams et al. (2021) (69) | 81 | 18ā23 | 2.4 | ||||||
Participant demographics.
3.3 Presence of coaching in DHIs
This review focused on the role and impact of health coaching in DHIs. TableĀ 5 summarises the key characteristics of coaching that are evidenced across included studies. Coaching was exhibited through two crucial elements: (1) an active coaching component and (2) evidence-based coaching theories. Theories not only guided the coaching process (27, 40ā42, 45, 46, 50, 53, 57, 58, 61, 62, 68) but also influenced the overall design of DHIs and AI chatbots (17, 38, 48, 49, 51, 52, 55, 56, 60, 65ā67, 69). The active coaching component of DHIs was evident through three different delivery modes: (1) digital human coaching, (2) AI-powered coaching, or (3) a mix of both, which we refer to as hybrid coaching (TableĀ 5). Using the three modes of coaching identified, this review presents the features of coaching, followed by the trends of lifestyle outcomes and engagement that emerged from the included studies.
Table 5
| Author | Coaching approach | Theories/frameworks | Coach qualification disclosed | Number of sessions | Frequency of sessions | Session length | Mode of communicationb | Role in digital health |
|---|---|---|---|---|---|---|---|---|
| Alley et al. (2016) (44) | Hybrid |
|
No | 4 | Bi-weekly | 10ā15ā min | Synchronous | Supportive |
| Aymerich-Franch and Ferrer (2022) (17) | AI |
|
N/Aa | 3 | Weekly | 10ā14ā min | Synchronous | Central |
| Bakas et al. (2018) (45) | Human |
|
Yes | 3 | Not stated | Not stated | Synchronous | Central |
| Blair et al. (2021) (46) | Human |
|
No | 5 | Not stated | 15ā20ā min | Synchronous | Supportive |
| Chang et al. (2023) (47) | Human |
|
No | 3 | Not Stated | Not stated | Synchronous | Central |
| Chew et al. (2024) (48) | AI |
|
N/Aa | Not stated | Not stated | Not stated | Asynchronous | Central |
| Chow et al. (2020) (27) | Human |
|
No | 2 | Monthly | 30 and 5ā min | Synchronous/asynchronous | Supportive |
| Daley et al. (2020) (49) | AI |
|
N/Aa | 4ā5 | Not stated | 5ā min | Synchronous | Central |
| Damschroder et al. (2020) (37) | Hybrid |
|
No | 3 | Over 9 weeks | 30ā min | Synchronous | Supportive |
| D'Avolio et al. (2023) (50) | Human |
|
No | 8 | Weekly | 45ā60ā min | Synchronous | Central |
| Dhinagaran et al. (2021) (51) | AI |
|
N/Aa | Not stated | Not stated | Not stated | Asynchronous | Central |
| Foran et al. (2024) (38) | ĆI |
|
N/A | Not stated | Daily | Not stated | Asynchronous | Central |
| Gabrielli et al. (2021) (52) | AI |
|
N/Aa | 8 | Bi-weekly | 10ā min | Synchronous | Central |
| Gudenkauf et al. (2024) (53) | Human |
|
Yes | 8 | Weekly | 15ā20ā min | Synchronous | Central |
| Han et al. (2024) (39) | Human | Not stated | Yes | Not stated | ad hoc | Not stated | Asynchronous | Supportive |
| Heber et al. (2016) (40) | Human |
|
Yes | 8 | After educational sessions | Not stated | Asynchronous | Supportive |
| Horn et al. (2023) (54) | Human |
|
Yes | 24 | Weekly for 12 weeks, bi-weekly for 12 weeks and monthly thereafter | Not stated | Synchronous | Supportive |
| Ly et al. (2017) (55) | AI |
|
N/Aa | Not stated | Not stated | Not stated | Synchronous | Central |
| Maher et al. (2020) (56) | AI |
|
N/Aa | Unlimited | Unlimited | Not stated | Ā Note stated | Central |
| Marler et al. (2019) (57) | Human |
|
No | Not stated | 3 times weekly for the first 30 days, 1 weekly for the next 30 days, and biweekly for the last 30 days | Not stated | Asynchronous | Central |
| McGuire et al. (2022) (58) | Human |
|
Yes | 3 | Week 1, 4, and 8 | 30ā60ā min | Synchronous | Supportive |
| Moreno-Blanco et al. (2019) (59) | Hybrid |
|
No | Not stated | Not stated | Not stated | Synchronous asynchronous | Central |
| Ollier et al. (2023) (60) | AI |
|
N/A | Not stated | Not stated | 5ā10ā min | Synchronous | Central |
| Partridge et al. (2015) (41) | Human |
|
Yes | 5 | Not stated | 10ā25ā min | Synchronous | Central |
| Price and Brunet (2022) (61) | Human |
|
No | 12 | Weekly | 60ā min | Synchronous | Central |
| Sacher et al. (2024) (62) | Human |
|
Yes | Not states | Not stated | Not stated | Asynchronous | Central |
| Santini et al. (2023) (63) | Hybrid | Not stated | No | Not stated | Daily | Not stated | Asynchronous/synchronous | Central |
| Smart et al. (2022) (64) | Human |
|
Yes | Not stated | Not stated | Not stated | Asynchronous | Central |
| Spring et al. (2018) (42) | Human |
|
Yes | 22 | Weekly (week 1ā12), bi-weekly (13ā24), monthly (25ā40) | 10ā15ā min | Synchronous | Central |
| Terblanche et al., (2022) (65) | AI |
|
N/Aa | Unlimited | Unlimited | Not stated | Synchronous | Central |
| To et al. (66) | AI |
|
N/Aa | 12 | Every 2 to 4 days | Not stated | Synchronous | Central |
| Ulrich et al. (2024) (67) | AI |
|
N/Aa | Unlimited | Unlimited | Not stated | Asynchronous | Central |
| Wiegand et al. (2010) (68) | Human |
|
No | Not stated | Not stated | Not stated | Synchronous | Central |
| Wijsman et al. (2013) (43) | Human |
|
No | Not stated | Not stated | Not stated | Asynchronous | Central |
| Williams et al. (2021) (69) | AI |
|
N/Aa | 21 | Daily | 3ā5ā min | Synchronous | Central |
Characteristics of coaching.
Not Applicable for AI studies.
Mode of communication is catagorised into synchronous, meaning real-time communication between coach and participant, and asynchronous, meaning communication which is not time-bound or instant.
3.3.1 Digital human coaching
Digital human coaching refers to any study that used a human coach alongside DHIs. Eighteen of the included studies (51%) employed human coaching as a part of their interventions (27, 39ā43, 45ā47, 50, 53, 54, 57, 58, 61, 62, 64, 68). Coaching was often enhanced with digital tracking tools (e.g., smart scales, watches, accelerometers, and breath sensors) (41ā43, 46, 53, 57, 58, 62, 64) and educational resources or modules (27, 39ā41, 47, 50, 54, 58, 61, 62, 68). One study used a telepresence robot controlled remotely by practitioners to facilitate communication with participants (45). A consistent finding emerged from these studies: the human coach played a crucial role in engaging participants and enhancing motivation, adherence, behaviour change, and personalisation (39, 46, 58, 61, 62). However, the intensity of human coaching delivered varied. In most cases (67%), the coach was central to the intervention, interacting regularly with participants. In contrast, six studies assigned coaches a more supportive role, primarily supplementing educational modules, training, and activity monitors (27, 39, 40, 46, 54, 58). In these instances, participants primarily engaged with the digital components and the coaches provided additional support to reinforce engagement and adherence, focusing only on issues related to the main intervention (27, 39, 40, 46, 54, 58).
3.3.2 AI-powered coaching
Thirteen included studies (37%) analysed the use of AI-powered coaching using conversational agents (i.e., chatbots) to deliver DHIs (17, 38, 48, 49, 51, 52, 55, 56, 60, 65ā67, 69). The development of conversational agents and chatbots in these studies was primarily based on Natural Language Processing (NLP) models (17, 49, 51, 56) or rule-based approaches, including decision-tree algorithms (38, 51). Most AI coaching interventions were delivered remotely through a digital platform, available 24/7 (38, 45, 48, 49, 51, 52, 55, 56, 60, 65, 66, 69). However, one study used on-site AI coaching in a controlled environment (17). Eleven studies mimicked human coaching sessions through quick text-based sessions with the AI chatbot. One chatbot, however, was primarily voice-based, with text-based options for certain activities. This chatbot could interpret speech and synthesise voices to respond to participants (17). Three studies provided unlimited access to the AI chatbots, enabling ad hoc questions, as well as guidance, monitoring and feedback outside of structured coaching sessions (56, 65, 66). Unlike human coaches, AI coaches were central to all DHIs. Given the artificial nature of the coaching design, no coaching qualifications or accreditations were reported in AI coaching studies.
3.3.3 Hybrid coaching
Four studies (11%) investigated hybrid coaching in DHIs (37, 44, 59, 63). Hybrid coaching refers to any study that integrates human coaches with AI-powered features within a DHI. Three studies investigated the impact of combining human coaching (delivered via video or text) with automated personalisation, support, advice, and motivation delivered by in-app messages, nudges or knowledge pills (short tips or pieces of advice) (37, 44, 59). The AI in these studies typically performed administrative tasks such as (a) sending personalised interventions and educational content (44, 59, 63), (b) monitoring activities, progress, engagement and adherence (37, 63), (c) sending motivational messages or nudges (37, 63), and (d) administering questionnaires (59). While two studies used AI and human coaching in parallel, the third study employed a āhuman-in-the-loopā approach for delivering hybrid coaching, where the human coach monitored, modified, and validated the AI coach's recommendations (59). The fourth study combined human coaching with a digital coach (similar to AI-powered coaching). The digital coach was an avatar that participants could interact with in-app through text-based prompts (63). This resulted in the AI-powered coach taking on more interactive and meaningful tasks like goal-setting with participants (63).
3.4 Lifestyle outcomes across coaching modalities
The reviewed studies assessed changes in physical activity, psychological wellbeing, stress management, healthy eating, sleep, and substance use across human-delivered, AI, and hybrid coaching interventions (TableĀ 3). Among these lifestyle domains, physical activity was the most commonly addressed, covered by 55.8% of studies (nā=ā19), followed by psychological wellbeing (nā=ā18; 52.9%). Stress management was examined in nine studies and was closely associated with psychological wellbeing, with many studies addressing both constructs together. Given its interrelated role, stress management was discussed as a subset of psychological wellbeing in this review. Healthy eating was explored in 13 studies, while sleep (nā=ā4) and substance use (nā=ā2) were less commonly examined.
3.4.1 Physical activity
Physical activity was assessed through a variety measures, including active minutes, Metabolic Equivalent Tasks (METs), daily steps, or sedentary time measured through biometric feedback (37, 42ā44, 59, 64, 66) and self-reported physical activity assessments (41, 45ā48, 51, 56, 58, 61, 63, 66), and qualitative interviews (54). Studies examining human coaching reported mixed findings regarding its impact on physical activity. Several studies found improvements in physical activity measured through changes in METs (41, 47) and moderate-to-vigorous activity (MVPA) minutes (42, 43). However, other studies showed no meaningful changes or a decline in physical activity (45, 46, 64). Some studies also found that human-coach-facilitated DHIs promoted shifts in participants' awareness, motivation, and readiness to change, leading to active change in physical activity (46, 47), along with an increase in purposeful movement, like taking the stairs or walking instead of driving (54).
AI coaching generally showed consistent positive effects on physical activity. Studies reported improvements in physical activity measures, including METs (48), step count (66), active minutes (56, 66), MVPA minutes, and reduced sitting time (51). AI coaching was also associated with improved adherence and increased motivation (56). All four hybrid studies investigated the impact of this approach on physical activity, reporting variable results. Two studies found significant to moderate improvements in physical activity through active minutes (44, 63). However, a third study found decreased active minutes and steps among both intervention and control groups (37). The fourth study did not assess changes in physical activity from baseline but did report high rates of adherence and goal completion (i.e., achieving step targets and weekly active minute goals) (59).
3.4.2 Psychological wellbeing
Psychological wellbeing was primarily assessed through the reduction of depression and anxiety symptoms (27, 40, 45, 49, 52, 60, 62, 67) and general psychological wellbeing using a variety of wellbeing and mental health scales (50, 53, 62, 63, 65, 69). Other aspects of psychological wellbeing, such as worry, emotional control, mindfulness, personal growth, and life satisfaction, were also assessed to a lesser extent (17, 27, 47, 52, 55). Human coaching interventions were associated with reductions in depressive and anxiety symptoms (27, 40, 45, 62), along with reduced worry (40) and increased emotional self-efficacy and emotional control (47, 62). However, this was not reflected in one study among coached and non-coached groups (50) and another which aimed to improve health and wellbeing but did not use scales to assess wellbeing changes (53). AI coaching generally reported positive improvements in psychological wellbeing, including reductions in depressive symptoms (49, 60, 67) and anxiety (49, 52, 60), as well as improvement in general psychological wellbeing (69). AI coaching interventions also improved other wellbeing constructs like mindfulness (52), personal growth (17, 55), and life satisfaction (55). However, two AI-coaching interventions failed to report differences in psychological wellbeing (65, 67). Only one hybrid coaching study assessed psychological wellbeing (63]. While psychological wellbeing improved from baselines, this was not sustained after the human-coaching element of the intervention ended (63).
3.4.2.1 Stress management
All human coaching interventions examining perceived stress reported positive outcomes (40, 47, 68), with some showing greater improvement compared to control groups (40, 68). However, one study observed only minimal reductions in stress levels from baseline (47). Likewise, AI coaching interventions generally led to reductions in perceived stress, ranging from significant (52, 55, 67, 69) to minor improvements (49). One AI intervention reported no positive changes in stress levels (65). Hybrid coaching interventions did not assess stress management outcomes.
3.4.3 Healthy eating
Healthy eating was examined in AI and human coaching interventions only. The most common measure used to assess dietary improvements was daily fruit and vegetable intake (41, 51, 58, 61). Other measures included overall diet quality assessed via questionnaires (39, 42), adherence to the Mediterranean diet (56), protein intake (50), behaviours related to overeating and self-regulation (48). Intervention utilising human coaching demonstrated significant improvements in protein intake (50), fruit and vegetable consumption (41, 58, 61), and overall diet quality (39, 42). Two studies explored behavioural factors influencing healthy eating, reporting increased autonomy and competence (61) and improved control over eating habits (62). AI coaching interventions also yielded positive effects on healthy eating, though to varying degrees. While all three studies showed positive changes (48, 51, 56), only one found significant improvement in their diet scores (56). The latter study focused on eating habits rather than food quality, reporting significant reduction in overeating and snacking habits as well as self-regulation in eating behaviour (48).
3.4.4 Sleep
Sleep was assessed through self-reported sleep quality (27, 51, 53), and sleep duration measured via wearable devices (59). Studies examining human coaching generally reported improvements in sleep quality. One study found that most participants either improved or maintained their sleep quality (53), while another reduced sleep disturbances, with improvements sustained at follow-up (27). In contrast, AI coaching showed minimal impact, with no significant improvements in sleep quality or sleep scores (51). Hybrid coaching interventions provided limited findings on sleep. While one hybrid study assessed sleep outcomes, baseline comparisons were not available. However, results indicated that participants met the recommended seven to eight hours of sleep on nearly half of the recorded days (59).
3.4.5 Substance use
Substance use was assessed by two studies using human coaching. Both studies showed that coaching interventions had a positive impact on decreasing alcohol consumption (58), tobacco use, and smoking quit rates (57). The human-coach-led smoking cessation intervention also enhanced participants' confidence to quit smoking and reduced perceived difficulty in maintaining abstinence (57).
3.5 Engagement and satisfaction
Engagement was measured inconsistently across studies, with definitions and measures varying widely (TableĀ 6). The most commonly reported metric was retention or intervention completion rate. For human coaching interventions, completion and retention rates varied from 80% to 100% (27, 39ā41, 46, 47, 53, 57). AI coaching interventions reported similar completion and retention rates of 90% to 93%, with three outlying studies reporting significantly lower completion rates of 58% (52), 20.3%ā45.4% (49), and 9.8% (60). Hybrid coaching interventions had retention rates of 55%ā56.5% (37, 44). However, one hybrid study found that the human coaching component improved adherence to syncing data from wearable devices (37).
Table 6
| Author | Engagement | Satisfaction |
|---|---|---|
| Alley et al. (2016) (44) | Total Completion rate: 55% (83/151). 47% completed ā„3 modules, coaching completers: 82% vs. 43% others. Retention: No group difference. Week 9 survey completion: 73% (coaching) vs. 53% (others). Average website visits: 7.53; Average time spent: 87.07ā min. Participants who completed coaching spent significantly more time on the website (174.64ā min vs. 77.84ā min) | 68% satisfied with the program; 77% with website usability; 76% with tailored advice; 91% with module questions. Coaching completers reported higher satisfaction (88% vs. 64%, not significant) |
| Aymerich-Franch and Ferrer (2022) (17) | Completion rate: 92% (30/32 participants) | Medium-high satisfaction with the coaching program. satisfaction with the coaching program at 6.06 out of 7 |
| Bakas et al. (2018) (45) | N/A | N/A |
| Blair et al. (2021) (46) | 79% participants checked the app daily. 93% completed all 5 coaching calls. Retention rate: 87% (47/54 participants) | N/A |
| Chang et al. (2023) (47) | Retention rateā80% (12 of 15 participants). All participants attended all three online health coaching sessions | N/A |
| Chew et al. (2024) (48) | Completion rate: 91.6% | N/A |
| Chow et al. (2020) (27) | The app was launched 21.5 times over 6 weeks. Completion rate: 87% | Mean satisfaction: 5.19/7. Usefulness of coaching calls: 4.22/5 |
| Daley et al. (2020) (49) | Completion rate: 20.34%ā45.4%, depending on the program. Higher engagement correlated with lower anxiety and depression. Average response rate: 8.17 responses/day | N/A |
| Damschroder et al. (2020) (37) | Engagement was high initially but declined. 64.4% provided synched data at 6 months, 35.6% at 12 months. 70.8% completed ā„2 coaching calls, 56.7% completed all 3 calls. The coaching group correlated with better adherence to synched data (68.5% vs. 60.3%, difference not sustained) | N/A |
| D'Avolio et al. (2023) (50) | N/A | N/A |
| Dhinagaran (2021) (51) | Completion rate: 93%. 50% of participants completed all conversations. 40% responded immediately 75% of the time | 92% moderately satisfied. 54% likely to recommend, 57% likely to use again |
| Foran et al. (2024) (38) | Participants engaged with the intervention 4.26 days over 30 days, started 3.68 modules, completed 2.78, and sent 51.09 messages on average | 3.21/5 satisfaction score with modules. User satisfaction and participation/engagement were significantly associated with greater improvements in all primary outcomes (Pā=ā.04 toā<ā.001). Participants with more unfinished modules (modules started but not completed) showed less improvement in positive psychological health |
| Gabrielli et al. (2021) (52) | Completion rate: 86% (61 out of 71 participants). By the end of the study, 58% (41 out of 71) of participants completed the postintervention questionnaire, representing an attrition rate of 42%. Engagement and willingness to complete a session were higher during the first and last weeks of the study | N/A |
| Gudenkauf et al. (2024) (53) | Participants completed 6.9/8 weekly assessments and attended 6.9/8 coaching sessions (86.5%). Wore wearable device 79.9% of study days. 100% baseline completers did an 8-week follow-up | Satisfaction: 4.7/5. 85% rated satisfaction as 5/5 |
| Han et al. (2024) (39) | Completion rate: 95%. App utilisation: 87% (first 3 months), 92% (4ā6 months). Two-way interaction: 3 days/week (first 3 months), 2 days/week (4ā6 months) | N/A |
| Heber et al. (2016) (40) | Participants completed 5.7/7 sessions (81.4%) and used the intervention for 8.27 weeks. 43.6% of participants preferred light coaching interaction, and 56.4% preferred intensive coaching. 76.5% requested message support | 92.2% satisfied with overall intervention |
| Horn et al. (2023) (54) | N/A | N/A |
| Ly et al. (2017) (55) | During the 2-week period, 78.6% of participants were active for at least 50% of the days. Active for more than half of the interventions 14 days. (average 8.21 days). Participants opened the app 1.27 times a day | N/A |
| Maher et al. (2020) (56) | Out of the maximum of 11 possible check-ins with chatbot, participants completed an average of 6.9 check-ins (64%). Engagement varied across the intervention. 70% of participants completed check-ins in weeks 2, 3, 4, and 12. Engagement gradually decreased to around 50% through weeks 8 and 9. Participants who completed the first weekly check-in had higher engagement across the intervention period than those who didnāt. completion rate was 90% (28/31) | N/A |
| Marler et al. (2019) (57) | Retention rate: 97.3% (183/188 participants). Completion rate: 95.2% (179/188 participants). The intervention group opened the app an average of 157.9 (vs. 86.5 in control, pā<ā.001) times. High weekly login rates: 86%-98% (intervention), 85%ā97% (control) | N/A |
| McGuire et al. (2022) (58) | The nurse coaching group had better adherence and lower attrition (35% vs. 50%) compared to the eBook group | N/A |
| Moreno-Blanco et al. (2019) (59) | Users read 88.09% of knowledge pills and reported following advice for 65.9% | Usability score (SUS): 81.5. indicating usability and high satisfaction |
| Ollier et al. (2023) (60) | Completion of at least one topic: 9.8% (nā=ā698). 7,135 downloaded the app, and 3,928 opened the app (55.8%) | The net promoter score increased as individuals progressed between periods 1 and 2. Ease of use and usefulness also increased. However, only marginally |
| Partridge et al. (2015) (41) | Completion rate: 85.6%. The mean number of coaching calls completed was 4.6/5, with 82.4% completing all 5 calls. Over half of the intervention participants replied to 8 or more of the 16 SMS text messages, with 20.3% replying to all. Most control participants replied to 2 or more of the 4 text messages, with 62.4% replying to all 4 | N/A |
| Price et al. (2022) (61) | The engagement for this intervention was high, with participants attending 95.2% of the sessions. Session attendance ranged between 66.7% (8 out of 12 sessions) and 100% (12 out of 12 sessions) | N/A |
| Sacher et al. (2024) (62) | Out of the 122 eligible participants who provided consent, 119 were enrolled, and 107 were included in the analysis | 81.9% found the health coaching useful/helpful |
| Santini et al. (2023) (63) | Completion rate: 68% (62/91 participants) | Average SUS score: 59 (below the average score of 68), indicating usability problems with the system |
| Smart et al. (2022) (64) | 73% set at least 7 goals over 8 weeks, and 47% set goals every week. Coaches sent 3ā4 more messages/week than participants. Completion rate: 93% (28/30) | N/A |
| Spring et al. (2018) (42) | Retention at 9 months: 82.1%. Self-monitoring adherence declined but remained substantial (96.3% at baseline, 54.6% at 9 months). Coaching calls declined from 66.0% to 57.7% | N/A |
| Terblanche et al. 2022 (65) | Experimental group retention rate: 56% (75 out of 134), and for the control group: 70% (94 out of 134). Participants who used the AI coaching chatbot more frequently (more than 6 sessions) had a higher average increase in goal attainment (37.62) compared to those who used it less frequently (17.62) | N/A |
| To et al. (66) | 60% of the intervention group completed the post-intervention survey. 45% completed all 13 sessions. Engagement ratio: 74.3% (297 responses/400 messages) | N/A |
| Ulrich et al. (2024) (67) | On average, participants sent 6.7 messages per week to the chatbot and spent. Most participants (93.8%) read the messages sent by the chatbot. About half of the participants sent messages to the chatbot at least once a day | The average usability score for the chatbot was 61.6, with the majority of participants rating the chatbot as āOKā (78.8%) or āGoodā (10.6%). Less than half of the participants (43.4%) would recommend the chatbot to others |
| Wiegand et al. (2010) (68) | N/A | N/A |
| Wijsman et al. (2013) (43) | Completion rate: 97% (226/235 participants). 95.6% of participants in the intervention group started the program after the initial assessment week. 91.2% of participants in the intervention group completed the 12-week intervention program | N/A |
| Williams et al. (2021) (69) | Completion rate: 27.3% (30/110 participants). Adherence: 11/21 days (Mā=ā11.3, SDā=ā7.8) | Satisfaction: 6.61/10 (SDā=ā1.78), 63% rated ā„7/10. 81% found chatbot easy to use |
Engagement and satisfaction.
Several studies reported correlations between engagement levels and outcomes. Studies investigating the addition of a human-coaching component in DHIs found that participants who completed coaching spent more time on the DHI, completed more educational modules (44), had better adherence to the intervention (37, 58), improved retention (61) and increased wellbeing (63). Additionally, higher engagement with AI chatbots correlated with lower anxiety and depressive symptoms (49), improved wellbeing (38), increased physical activity (66), and a higher increase in goal attainment (65). Finally, one study found that early engagement (within the first week) predicted sustained engagement throughout the intervention (56), while another reported that participants who completed the full intervention had lower stress levels (40).
3.5.1 Satisfaction
Satisfaction with the DHI interventions was reported in 37% of the included studies (nā=ā13) (TableĀ 6). Among hybrid studies, 75% (3 out of 4) reported satisfaction outcomes. However, AI coaching interventions had the highest proportion of studies reporting satisfaction, accounting for 46% (6/13), compared to 22% (4/18) of human coaching studies. Overall satisfaction rates were high across studies using human coaching in their DHIs, with 81.9% to 92.5% of participants reporting satisfaction for coaching interventions (40, 62). Additional studies reporting satisfaction through mean scores reported that 85% of participants rated a full score for satisfaction (5/5) (53) and a mean reporting score of 5.19/7 (27). AI coaching interventions showed mixed satisfaction outcomes, generally ranging from moderate to high satisfaction rates (17, 38, 51, 69). Three studies reported on the likelihood of participants recommending the intervention. One study found a significant change in the Net Promoter Score (60), while two studies reported that 43.4% and 54% of participants would recommend the AI intervention. Additionally, 57% of participants indicated they would use it again (51, 66). Hybrid studies reported varied satisfaction rates varying from 67% to 81.5% (44, 59), with one study reporting usability below the average threshold, indicating low satisfaction (63). Interestingly, one hybrid study found that satisfaction rates were higher among participants who completed the coaching component of the intervention (88% compared to 64%) (44).
3.6 Working alliance
Seven studies (20.6%) examined working alliance and connection between participants and coaches. While one study used a working alliance scale, most explored working alliance through qualitative feedback from participants. Human coaching showed a high working alliance. Participants experienced authentic and strong connections, social support, and accountability (64). They also described a sense of investment and warmth from their coach (61), even in text-based coaching (64). This allowed participants to feel comfortable, motivated, and honest about their progress (61). Participants perceived AI coaches as engaging and lifelike, often viewing their interactions as relational (55, 60, 69). Participants viewed chatbots as a positive addition, appreciating its non-judgemental nature, finding it easier to share information (17) and feeling validated in their experiences (69). On a 7-point working alliance scale, the overall alliance with AI and human coaches was rated 4.23, with the bond component scoring 4.20 (67). Another study using the Session Alliance Inventory (ISA) found a minor but insignificant increase in scores (60). Despite positive connections made, some participants found chatbots patronising and preferred to connect to a real person. Others reported feelings of loneliness, disconnection and a lack of warmth while engaging with chatbots (17, 69). One study noted that chatbot interactions felt repetitive, contributing to feelings of disconnection (55). In one hybrid coaching intervention, participants valued human support alongside the AI coaching and expressed a desire for continued human support alongside the AI interventions, particularly for motivation, confidence, and technical support (63).
4 Discussion
This review explored how digital health coaching is integrated into DHIs and its impact on lifestyle outcomes and engagement. We identified three primary coaching models: human, AI, and hybrid (a combination of both human and AI coaches). Our findings suggest that both human- and AI-delivered coaching are generally perceived as acceptable and satisfactory components of DHIs, with trends indicating positive effects on health and wellbeing. Engagement and retention were generally high across all coaching models, with higher engagement linked to improvements in lifestyle outcomes. However, engagement and satisfaction were typically higher with human-delivered coaching. While working alliance was strong across all coaching models, participants reported a stronger sense of connection with human-delivered coaching, including within hybrid interventions.
Despite advancements in digital health coaching, studies were predominantly exploratory, early-intervention studies, focusing on feasibility and acceptability of coach-facilitated DHIs. While individual studies reported significant findings, the lack of consistency in study designs and outcome measures prevented clear inferences about broader trends in lifestyle and engagement. Likewise, there was an imbalanced representation of human (nā=ā18), AI (nā=ā13), and hybrid (nā=ā4) coaching interventions. This disparity, along with inconsistencies in outcome measures, intervention designs and coaching characteristics, limited direct comparisons and the generalisability of our findings; therefore, caution is warranted when interpreting the results in this review.
4.1 The role of coaching: delivery, standards, and trends
This review examined the integration of health coaching with DHIs, focusing on delivery methods, coaching roles, and standards for protocols and coach engagement (TableĀ 5). Findings revealed a lack of consistency in reporting delivery methods, which contributed to ambiguity in the coaching protocols used. Only six studies provided comprehensive descriptions of all the extracted coaching characteristics that represent delivery (17, 42, 52, 53, 58, 69). The lack of transparency in coaching delivery methods made it difficult to determine the optimal frequency and intensity of coach-participant engagement for best outcomes. The total number of coaching sessions varied widely, ranging from 2 to 24, with a mean of 8.85 sessions per intervention (TableĀ 5). Additionally, coaching sessions were most commonly conducted weekly (17, 50, 53, 61) and biweekly (44, 52), but no clear pattern emerged linking coaching frequency to delivery mode. Among long-term studies (ranging from 3 to 12 months), coaching was typically staggered, starting with weekly coaching sessions before transitioning to bi-weekly and monthly as the intervention progressed (42, 54, 57). Very few studies reported the qualifications of the coaches, making it difficult to determine who was delivering the coaching. Nonetheless, some trends emerged from the studies that did provide details on coaching delivery methods. Most studies indicated that both AI and human coaches played a central role in DHIs, though eight studies employed coaches in a supportive capacity (27, 37, 39, 40, 44, 46, 54, 58). In these cases, the role of the coach typically focused on supporting participants' technology use and enhancing usability through education around app features (27, 37, 46), reviewing and monitoring participant data (46), providing feedback to participants (39, 40, 44), adding accountability and promoting the implementation of skills gathered from the main intervention (i.e., educational modules and training) (27, 44, 54, 58).
In several studies, the role of the coach extended beyond the traditional role of facilitating health behaviour change and providing lifestyle support (27, 37, 46). Coaches were often tasked with providing technological and lifestyle support. The burden of supporting participants in navigating DHI technology has been recognised (27, 37). However, only one study provided a separate technology support channel for participants (46). In this instance, technology support was carried out by coaches via telephone calls but conducted outside of the coaching sessions. Moreover, communication predominantly occurred synchronously, with human coaches or conversational agents speaking in real time with participants. However, some studies supplemented this mode of interaction with asynchronous communication between coaching sessions (i.e., check-ins and ad hoc questions and feedback) (27, 59, 63).
4.1.2 Coaching standards
Professional and accredited coaches should play a central role in the development of digital health coaching interventions, whether by directly providing coaching or by informing the creation of coaching chatbots. This ensures that the high standards of practice paved by professional bodies like the International Coaching Federation (ICF), the European Mentoring and Coaching Council (EMCC) and the Association for Coaching (AC) are upheld in DHI research. In this review, we identified two key components for maintaining standards: (1) the qualifications and training of the coaches involved and (2) the evidence-based theories and models that inform coaching practices in DHIs. Only ten studies reported the qualifications or training of the human coaches involved in the interventions (39ā42, 45, 53, 54, 58, 62, 64). These coaches were typically dietitians (39, 41), nutritional coaches (54), psychologists (40, 64), nurse practitioners (45, 58), and trained health coaches (53). Other studies mentioned that coaches received training from qualified professionals to deliver the intervention (42, 62). None of the hybrid coaching studies reported the qualifications or accreditation of the coaches working alongside AI.
Likewise, coaching theories played a crucial role in underpinning DHIs across all coaching models. Cognitive behavioural therapy, behaviour change theories, and positive psychology were the most commonly used (TableĀ 5). All AI coaching interventions presented theories that underpinned the development of their conversational agents. Although six studies using human coaches failed to report theories used, two of which were hybrid coaching interventions (43, 47, 53, 54, 59, 63). This suggests that coaching theories were not only vital for active coaching but also influenced the design of DHIs and AI chatbots. The use of validated, evidence-based theories in the development of chatbots demonstrates that studies are moving towards validated, evidence-based approaches for chatbot creation. However, it also underscores the need for standardised quality and ethical guidelines for AI coaching.
The Designing AI Coach (DAIC) framework developed by Terblanche (70) provides a structured approach for assessing AI standards in coaching. The framework emphasises adapting human-efficacy elements and theoretical models for specific and narrow coaching tasks (i.e., goal setting), in line with ethical codes of conduct from accrediting coaching bodies (i.e., ICF, EMCC, AC). In this review, we found that all AI coaching studies, either intentionally or unintentionally, followed the first two principles of the DAIC framework: adapting human-efficacy and incorporating theoretical models in chatbot development (TableĀ 5). However, the third principle, ethical conduct (covering privacy, autonomy, liability, and bias), was only addressed in three studies (49, 55, 65). The fourth principle, using AI chatbots for narrow, specialised tasks, was met in all but two studies. Notably, Maher et al. (56) used AI for both nutritional and physical activity interventions. Likewise, Dhinagaran et al. (51) and Ollier et al. (60) extended their chatbot's role to cover a wide range of lifestyle areas. Dhinagaran et al. (51) included diet, exercise, sleep, and stress support, while Ollier et al. (60) focused on psychological well-being, healthy eating, physical activity, and sleep. The DAIC framework is a valuable framework for guiding the development of AI coaching chatbots. Further research and standardisation are essential to ensure that AI coaching aligns with ethical and professional standards. This is critical for ensuring high-quality and safe AI interventions in digital health.
4.2 Tends in engagement, satisfaction, and working alliance
Engagement and retention rates have traditionally been considered low for DHIs (71), suggesting that this is a common issue in digital health research. However, a recent review by Boucher and Raiker (72) argued that engagement rates are not inherently low but rather vary widely across studies. They also highlighted a high degree of variability in how engagement is defined and measured (72). Our findings align with those of Boucher and Raiker (72), revealing that engagement was inconsistently reported, causing limitations for comparisons across studies. Retention and completion rates were the most consistently reported metrics. These metrics inferred higher retention and completion rates for human coaching interventions compared to AI and hybrid interventions. While AI retention rates were similar to those for human coaching, hybrid interventions showed a comparable difference. Engagement metrics offered valuable insight into adherence and retention but did not clarify the nature of meaningful interactions that may influence adherence. Boucher & Raiker (72) recommended moving away from the one-size-fits-all approach to assessing engagement and instead encourage researchers to focus on patterns of engagement.
Engagement as behaviour (i.e., quantitative engagement measures like app logins, text messages sent, and coaching sessions attended) provides valuable insight into the dynamics of DHIs and engagement trends (73). However, this construct was also inconsistently reported across studies, making it challenging to draw definitive conclusions (TableĀ 6). Engagement as behaviour suggested that engagement with both human and AI coaches yielded positive outcomes. Among studies that reported engagement as behaviour, trends indicated early engagement and increased engagement with both human and AI coaches led to better outcomes, increased retention, adherence and goal attainment (37, 38, 40, 44, 49, 56, 58, 61, 63, 65, 66). Satisfaction rates offered further insight into participants' experiences with different coaching modalities. Overall, satisfaction was highest in interventions using human-facilitated coaching interventions (27, 40, 53, 62). AI coaching showed more variable satisfaction rates (17, 38, 51, 66, 69). While hybrid approaches demonstrated higher rates of satisfaction over AI-only, they still lagged behind human-only interventions (44, 59, 63).
Participants' experiences were further uncovered through the exploration of working alliances developed between coaches and participants. Prior research has consistently linked strong working alliances with positive coaching outcomes (74). A positive working alliance was formed between participants and AI and human coaches during DHI studies (17, 55, 60, 61, 63, 64, 67, 69). However, AI chatbots reported limitations to the alliances formed, with some participants reporting a sense of disconnection, repetitiveness and lack of warmth with chatbots (17, 55, 69). This trend in working alliance, favouring human-facilitated coaching interventions, was reflected by one hybrid coaching intervention (63). This study reported that participants valued human support along with AI to increase motivation and confidence and support technology navigation (63). Overall, while engagement, satisfaction, and working alliance remained consistent across both human and AI coaching interventions, human-facilitated coaching interventions foster stronger connections and higher satisfaction. Although hybrid and AI models could promote engagement and offer positive participant experiences, they could not replicate the interpersonal elements of human-delivered interventions that enhanced the overall individual experience. While engagement remains a key indicator of adherence, Boucher and Raiker (72) suggested that the true measure of success in DHIs should focus on the ability to foster meaningful lifestyle changes rather than being based on the frequency or duration of engagement.
4.3 Effectiveness of coaching for lifestyle change
The review findings suggest that human and AI-facilitated coaching could produce positive effects on lifestyle to varying degrees. While AI-powered coaching demonstrated more consistent positive trends on physical activity levels (48, 51, 56, 66), human-facilitated coaching yielded more consistent positive outcomes for psychological wellbeing and stress management (27, 40, 45, 47, 62, 68). Likewise, both human and AI coaching reported overall improvements in healthy eating (39, 41, 42, 48, 50, 51, 56, 58, 61, 62). Human coaching reported improvements in sleep, whereas AI did not (27, 51, 53, 59). Finally, substance use was only assessed with human coaches, precluding direct comparisons (57, 58). However, the varying effects of lifestyle outcomes between human and AI delivery systems were minimal, indicating that both AI and human coaching are acceptable and feasible interventions for supporting lifestyle changes. This finding aligns with existing literature suggesting that AI can be equally effective in providing support, particularly when deployed for narrow, specific tasks (32). Such findings suggest that AI could be a promising avenue for delivering scalable and effective lifestyle interventions (75, 76). However, studies using hybrid coaching approaches failed to produce consistent, robust findings compared to AI- and human-only interventions. Although some hybrid studies showed promising effects in physical activity, psychological wellbeing, and sleep, one study indicated that these benefits did not persist after human support ended (63). This suggests that further refinement of the integration between human and AI elements is needed to optimise long-term behaviour change.
4.4 Exploring the hybrid AI-human coaching model: benefits, limitations, and future potential
The lack of significant engagement and lifestyle outcomes in hybrid studies was unexpected. AI pioneers have established the value and complementary nature of combining AI and humans (77), with a growing body of literature on AI-human symbiosis indicating the shifting division of work between humans and machines (77ā79). This comes from longstanding research asserting that computers plus humans do better than either alone (77, 79, 80). AI and humans hold different values and capabilities. While AI has superior computational and analytical skills, humans can better deal with uncertainty and have greater aptitude for intuition, creativity, and holistic work (77, 79). By harnessing human-AI symbioses, it becomes possible to pull the best traits of AI and humans and compensate for the limitations of both (77, 80).
In digital health coaching, hybrid models have been proposed as a promising way to combine the scalability of AI with the relational depth of human coaching (31, 81). This hybrid approach draws on the strengths of AI and human coaches to create high-quality, meaningful, accessible, and scalable coaching conversations (31, 81). Despite this theoretical promise, hybrid coaching studies included in this review did not reflect this. Several factors may explain this, including methodical constraints, the limited role of human coaches, and inconsistencies in AI implementation. Firstly, only four studies in this review examined hybrid coaching, making it difficult to assess its full impact. Secondly, human coaching was commonly offered as a supportive or optional component of hybrid DHIs (37, 82). Three studies highlighted this as a limitation, acknowledging greater outcomes when coaching was the central component of DHIs (37). Participants also expressed a preference for periodic human interaction alongside AI-driven support, reinforcing the importance of sustained human involvement (44, 63). Furthermore, these studies indicated that consistent human support was important for maintaining improvements in lifestyle and sustaining engagement (37, 44, 63). This aligns with existing literature, which suggests that AI should extend rather than replace human coaching capabilities (83).
The AI components used in hybrid coaching varied across studies. Two studies employed basic automation for delivering tailored messages and advice (37, 44), while another used a decision-support system to provide personalised recommendations (59). The fourth study integrated a digital coach similar to a chatbot but with limited interactive capabilities (63). The study that most closely aligned with AI coaching models showed significant improvements in physical activity; however, these benefits diminished once human support was removed (TableĀ 3). This reflects broader findings that AI is effective for structured tasks like goal-setting but lacks the relational depth and working alliance needed for long-term behavioural change (83). This suggests that hybrid coaching is still an emerging area with limited empirical evaluation. While these studies confirmed the feasibility, more research is needed to refine hybrid coaching models and optimise their effectiveness. Additionally, trends suggest that hybrid models should prioritise human connection as the foundation of coaching while leveraging AI to enhance efficiency and scalability. Future hybrid interventions should, therefore, prioritise AI as an enhancement to human coaching rather than a substitute.
4.5 Strengths and limitations
This systematic review provided a comprehensive exploration of coach-facilitated digital DHIs, a novel and growing field. By synthesising existing evidence, this review highlighted the feasibility, acceptability, and impact of human, AI and hybrid coaching modalities on engagement and lifestyle outcomes. Adherence to PRISMA guidelines and the use of the MMAT ensured a rigorous and transparent methodological approach. There are well-established, foundational reviews that have significantly advanced digital health research in the context of health promotion and disease prevention (6, 84, 85). This review builds on this existing foundation to synthesise the existing body of knowledge on the integration of different health coaching modalities within DHIs. The insights gained from this review can inform the development and optimisation of future coach-led DHIs.
However, several limitations of this review should be noted. First, while full-text screening, data-extraction and quality appraisal was done in duplicate, the title and abstract screening was conducted by one reviewer only, increasing the risk of reviewer bias. Additionally, due to the use of broad search terms with multiple definitions, it is difficult to confirm that all relevant articles have been included. Furthermore, Significant heterogeneity in reporting coaching protocol, engagement metrics, and lifestyle outcomes across studies affects the ability to compare findings directly. This heterogeneity hindered the synthesis of results, making it difficult to draw consistent conclusions. Many of the included studies primarily focused on assessing the acceptability and feasibility of coach-facilitated DHIs rather than investigating the impact of specific components of the DHIs (e.g., active coaching, educational modules, tracking tools) and their interactions in influencing lifestyle outcomes. As a result, this review cannot definitively establish the full impact of coaching within DHIs, and its findings should be interpreted with caution. The review's generalisability is also constrained by the heterogeneity of participant demographics. Finally, the inclusion of only peer-reviewed studies may have introduced publication bias, potentially skewing the findings. Despite these limitations, this review offers valuable insights into the current state and potential future directions of digital health coaching in DHIs, providing a foundation for advancing scalable, high-quality, and evidence-based interventions.
5 Conclusion
The studies included in this review contribute to our understanding of how digital health coaching can be effectively integrated into DHIs, highlighting the trends, opportunities, and challenges related to its impact on lifestyle outcomes and engagement. Despite inconsistencies in reporting coaching delivery methods, engagement metrics, satisfaction, and lifestyle outcomes, this review emphasises the potential positive impact of coach-facilitated DHIs on participant engagement and lifestyle outcomes. We confirm the acceptability and feasibility of integrating AI and human coaching into DHIs and identify a gap in the research on hybrid coaching approaches. Digital health coaching in DHIs is complex and multifaceted, making it difficult to isolate components of the intervention to understand their effect (i.e., coaching, wearables, environment). Further research is necessary to better understand these complexities to advance the development of quality, evidence-based, coach-led DHIs that promote participant engagement and positive lifestyle changes. Additionally, it is crucial for future research to explore how the benefits of both AI and human coaching, as reported in this review, can be leveraged through hybrid approaches. Such strategies could help overcome the barriers of meaningful engagement in AI coaching and the scalability and accessibility of human coaching to create a scalable coach-led DHI without decreasing the quality of health coaching.
Statements
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.
Author contributions
CL: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Validation, Visualization, Writing ā original draft, Writing ā review & editing. JL: Formal analysis, Data curation, Validation, Investigation, Writing ā review & Editing. RO: Data curation, Formal analysis, Investigation, Methodology, Software, Supervision, Validation, Writing ā review & editing. PD: Funding acquisition, Investigation, Project administration, Supervision, Validation, Visualization, Writing ā review & editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by the Irish Research Council (grant number: EBPPG/2022/122).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Publisherās note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fdgth.2025.1536416/full#supplementary-material
References
1.
Nations U. Global Sustainable Development Report 2015. New York: United Nations Department of Economic and Social Affairs (2015).
2.
Organization WH. Global Strategy on Digital Health 2020ā2025. Geneva: World Health Organisation (WHO) (2021).
3.
Zanaboni P Ngangue P Mbemba GIC Schopf TR Bergmo TS Gagnon M-P . Methods to evaluate the effects of internet-based digital health interventions for citizens: systematic review of reviews. J Med Internet Res. (2018) 20(6):e10202. 10.2196/10202
4.
Murray E Hekler EB Andersson G Collins LM Doherty A Hollis C et al Evaluating digital health interventions: key questions and approaches. Am J Prev Med. (2016) 51(5):843ā51. 10.1016/j.amepre.2016.06.008
5.
Mair JL Salamanca-Sanabria A Augsburger M Frese BF Abend S Jakob R et al Effective behavior change techniques in digital health interventions for the prevention or management of noncommunicable diseases: an umbrella review. Ann Behav Med. (2023) 57(10):817ā35. 10.1093/abm/kaad041
6.
Castro R Ribeiro-Alves M Oliveira C Romero CP Perazzo H Simjanoski M et al What are we measuring when we evaluate digital interventions for improving lifestyle? A scoping meta-review. Front Public Health. (2022) 9:735624. 10.3389/fpubh.2021.735624
7.
Groot J MacLellan A Butler M Todor E Zulfiqar M Thackrah T et al The effectiveness of fully automated digital interventions in promoting mental well-being in the general population: systematic review and meta-analysis. JMIR Ment Health. (2023) 10(1):e44658. 10.2196/44658
8.
Indra B Palmasutra V Setyawan FA . Effectiveness of digital interventions in reducing occupational stress: a systematic review. Port J Public Health. (2024) 42(3):252ā65. 10.1159/000540748
9.
Zheng S Edney SM Goh CH Tai BC Mair JL Castro O et al Effectiveness of holistic mobile health interventions on diet, and physical, and mental health outcomes: a systematic review and meta-analysis. EClinicalMedicine. (2023) 66:102309. 10.1016/j.eclinm.2023.102309
10.
Duan Y Shang B Liang W Du G Yang M Rhodes RE . Effects of eHealth-based multiple health behavior change interventions on physical activity, healthy diet, and weight in people with noncommunicable diseases: systematic review and meta-analysis. J Med Internet Res. (2021) 23(2):e23786. 10.2196/23786
11.
Widmer RJ Collins NM Collins CS West CP Lerman LO Lerman A , editors. Digital health interventions for the prevention of cardiovascular disease: a systematic review and meta-analysis. Mayo Clinic Proceedings; Elsevier; (2015). 10.1016/j.mayocp.2014.12.026
12.
Howarth A Quesada J Silva J Judycki S Mills PR . The impact of digital health interventions on health-related outcomes in the workplace: a systematic review. Digit Health. (2018) 4:1ā18. 10.1177/2055207618770861
13.
Balwan WK Kour S . Lifestyle diseases: the link between modern lifestyle and threat to public health. Saudi J Med Pharm Sci. (2021) 7(4):179ā84. 10.36348/sjmps.2021.v07i04.003
14.
Willems SH Rao J Bhambere S Patel D Biggins Y Guite JW . Digital solutions to alleviate the burden on health systems during a public health care crisis: COVID-19 as an opportunity. JMIR Mhealth Uhealth. (2021) 9(6):e25021. 10.2196/25021
15.
Oāconnor S Hanlon P Oādonnell CA Garcia S Glanville J Mair FS . Understanding factors affecting patient and public engagement and recruitment to digital health interventions: a systematic review of qualitative studies. BMC Med Inform Decis Mak. (2016) 16:1ā15. 10.1186/s12911-016-0359-3
16.
Fadhil A Wang Y Reiterer H . Assistive conversational agent for health coaching: a validation study. Methods Inf Med. (2019) 58(1):9ā23. 10.1055/s-0039-1688757
17.
Aymerich-Franch L Ferrer I . Investigating the use of speech-based conversational agents for life coaching. Int J Hum Comput Stud. (2022) 159:102745. 10.1016/j.ijhcs.2021.102745
18.
Meyerowitz-Katz G Ravi S Arnolda L Feng X Maberly G Astell-Burt T . Rates of attrition and dropout in app-based interventions for chronic disease: systematic review and meta-analysis. J Med Internet Res. (2020) 22(9):e20283. 10.2196/20283
19.
Kernebeck S Busse TS Ehlers JP Vollmar HC . Adherence to digital health interventions: definitions, methods, and open questions. Bundesgesundheitsblatt-Gesundheitsforschung-Gesundheitsschutz. (2021) 64(10):1278ā84. 10.1007/s00103-021-03415-9
20.
Torous J Michalak EE OāBrien HL . Digital health and engagementālooking behind the measures and methods. JAMA Netw Open. (2020) 3(7):e2010918. 10.1001/jamanetworkopen.2020.10918
21.
Silvestri F Vital P Saboga-Nunes L Bittlingmayer U . Health coaching: the node of health literacy & health promotion. Eur J Public Health. (2024) 34(Supplement_3):ckae144.833. 10.1093/eurpub/ckae144.833
22.
Miller BR Klein AA Leheste JR . Health coaching: Americaās overlooked personalized health solution. Med Res Arch. (2024) 12(8):1ā10. 10.18103/mra.v12i8.5662
23.
Olsen JM , editor. Health Coaching: A Concept Analysis. Nursing Forum. Wiley Online Library (2014). 10.1111/nuf.12042
24.
Thom DH . Keeping pace with the expanding role of health coaching. J Gen Intern Med. (2019) 34:5ā6. 10.1007/s11606-018-4730-1
25.
McGlynn A OāCallaghan C McDougall B Osborne J Harris-Roxas B . Translating health coaching training into clinical practice. Int J Environ Res Public Health. (2022) 19(23):16075. 10.3390/ijerph192316075
26.
Castro Sweet CM Chiguluri V Gumpina R Abbott P Madero EN Payne M et al Outcomes of a digital health program with human coaching for diabetes risk reduction in a medicare population. J Aging Health. (2018) 30(5):692ā710. 10.1177/0898264316688791
27.
Chow PI Drago F Kennedy EM Cohn WF . A novel mobile phone app intervention with phone coaching to reduce symptoms of depression in survivors of womenās cancer: pre-post pilot study. JMIR Cancer. (2020) 6(1):e15750. 10.2196/15750
28.
Serio C Gabarda A Uyar-Morency F Silfee V Ludwig J Szigethy E et al Strengthening the impact of digital cognitive behavioral interventions through a dual intervention: proficient motivational interviewingābased health coaching plus in-application techniques. JMIR Form Res. (2022) 6(5):e34552. 10.2196/34552
29.
Passmore J Liu Q Tewald S . Future trends in coaching: results from a global coach survey, 2021. Coach Psychol. (2021) 17(2):41ā51. 10.2196/34552
30.
GeiĆler H . E-coaching: an overview. In: Greif S, Mƶller H, Scholl W, Passmore J, Müller F, editors. International Handbook of Evidence-Based Coaching: Theory, Research and Practice. New York: Springer International Publishing (2022):269ā80. 10.1007/978-3-030-81938-5
31.
Kanatouri S . Digital coaching: a conceptually distinct form of coaching? In: Wegener R, Ackermann S, Amstutz J, Deplazes S, Künzli H, Ryter A, editors. Coaching im Digitalen Wandel. Gƶttingen, Germany: Vandenhoeck & Ruprecht. (2020). p. 40ā50. 10.13109/9783666407420.40
32.
Terblanche NH . Artificial intelligence (AI) coaching: redefining people development and organizational performance. J Appl Behav Sci. (2024) 60(4):631ā8. 10.1177/00218863241283919
33.
Diller SJ Stenzel L-C Passmore J . The coach bots are coming: exploring global coachesā attitudes and responses to the threat of AI coaching. Hum Resour Dev Int. (2024) 27(4):597ā621. 10.1080/13678868.2024.2375934
34.
Hussain S Ameri Sianaki O Ababneh N , editors. A survey on conversational agents/chatbots classification and design techniques. Web, Artificial Intelligence and Network Applications: Proceedings of the Workshops of the 33rd International Conference on Advanced Information Networking and Applications (WAINA-2019). Springer (2019). 33. 10.1007/978-3-030-15035-8_93
35.
Page MJ McKenzie JE Bossuyt PM Boutron I Hoffmann TC Mulrow CD et al The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ. (2021) 372:1ā9. 10.1136/bmj.n71
36.
Popay J . Guidance on the Conduct of Narrative Synthesis in Systematic Review: A Product from the ESRC Methods Programme. Lancaster, England: Lancaster University, Institute for Health Research (2006). 10.13140/2.1.1018.4643
37.
Damschroder LJ Buis LR McCant FA Kim HM Evans R Oddone EZ et al Effect of adding telephone-based brief coaching to an mHealth APP (stay strong) for promoting physical activity among veterans: randomized controlled trial. J Med Internet Res. (2020) 22(8):e19216. 10.2196/19216
38.
Foran HM Kubb C Mueller J Poff S Ung M Li M et al An automated conversational agent self-help program: randomized controlled trial. J Med Internet Res. (2024) 26:e53829. 10.2196/53829
39.
Han CY Lim SL Ong KW Johal J Gulyani A . Behavioral lifestyle intervention program using mobile application improves diet quality in adults with prediabetes (DāLITE study): a randomized controlled trial. J Acad Nutr Diet. (2024) 124(3):358ā71. 10.1016/j.jand.2023.10.005
40.
Heber E Lehr D Ebert DD Berking M Riper H . Web-based and mobile stress management intervention for employees: a randomized controlled trial. J Med Internet Res. (2016) 18(1):e5112. 10.2196/jmir.5112
41.
Partridge SR McGeechan K Hebden L Balestracci K Wong AT Denney-Wilson E et al Effectiveness of a mHealth lifestyle program with telephone support (TXT2BFiT) to prevent unhealthy weight gain in young adults: randomized controlled trial. JMIR Mhealth Uhealth. (2015) 3(2):e4530. 10.2196/mhealth.4530
42.
Spring B Pellegrini C McFadden HG Pfammatter AF Stump TK Siddique J et al Multicomponent mHealth intervention for large, sustained change in multiple diet and activity risk behaviors: the make better choices 2 randomized controlled trial. J Med Internet Res. (2018) 20(6):e10528. 10.2196/10528
43.
Wijsman CA Westendorp RG Verhagen EA Catt M Slagboom PE de Craen AJ et al Effects of a web-based intervention on physical activity and metabolism in older adults: randomized controlled trial. J Med Internet Res. (2013) 15(11):e2843. 10.2196/jmir.2843
44.
Alley S Jennings C Plotnikoff RC Vandelanotte C . Web-based video-coaching to assist an automated computer-tailored physical activity intervention for inactive adults: a randomized controlled trial. J Med Internet Res. (2016) 18(8):e5664. 10.2196/jmir.5664
45.
Bakas T Sampsel D Israel J Chamnikar A Bodnarik B Clark JG et al Using telehealth to optimize healthy independent living for older adults: a feasibility study. Geriatr Nurs (Minneap). (2018) 39(5):566ā73. 10.1016/j.gerinurse.2018.04.002
46.
Blair CK Harding E Wiggins C Kang H Schwartz M Tarnower A et al A home-based mobile health intervention to replace sedentary time with light physical activity in older cancer survivors: randomized controlled pilot trial. JMIR Cancer. (2021) 7(2):e18819. 10.2196/18819
47.
Chang M-W Tan A Wegener DT Lee RE . A pilot goal-oriented episodic future thinking weight loss intervention for low-income overweight or obese young mothers. Nutrients. (2023) 15(13):3023. 10.3390/nu15133023
48.
Chew HSJ Chew NW Loong SSE Lim SL Tam WSW Chin YH et al Effectiveness of an artificial intelligence-assisted app for improving eating behaviors: mixed methods evaluation. J Med Internet Res. (2024) 26:e46036. 10.2196/46036
49.
Daley K Hungerbuehler I Cavanagh K Claro HG Swinton PA Kapps M . Preliminary evaluation of the engagement and effectiveness of a mental health chatbot. Front Digit Health. (2020) 2:576361. 10.3389/fdgth.2020.576361
50.
DāAvolio D Gropper SS Appelbaum M Thiengtham S Holt J Newman D . The impact of a pilot telehealth coaching intervention to improve caregiver stress and well-being and to increase dietary protein intake of caregivers and their family members with dementiaāinterrupted by COVID-19. Dementia. (2023) 22(6):1241ā58. 10.1177/14713012231177491
51.
Dhinagaran DA Sathish T Soong A Theng Y-L Best J Car LT . Conversational agent for healthy lifestyle behavior change: web-based feasibility study. JMIR Form Res. (2021) 5(12):e27956. 10.2196/27956
52.
Gabrielli S Rizzi S Bassi G Carbone S Maimone R Marchesoni M et al Engagement and effectiveness of a healthy-coping intervention via chatbot for university students during the COVID-19 pandemic: mixed methods proof-of-concept study. JMIR Mhealth Uhealth. (2021) 9(5):e27965. 10.2196/27965
53.
Gudenkauf LM Li X Hoogland AI Oswald LB Lmanirad I Permuth JB et al Feasibility and acceptability of C-PRIME: a health promotion intervention for family caregivers of patients with colorectal cancer. Support Care Cancer. (2024) 32(3):198. 10.1007/s00520-024-08395-5
54.
Horn CE Seely EW Levkoff SE Isley BC Nicklas JM . Postpartum womenās experiences in a randomized controlled trial of a web-based lifestyle intervention following gestational diabetes: a qualitative study. J Matern Fetal Neonatal Med. (2023) 36(1):2194012. 10.1080/14767058.2023.2194012
55.
Ly K Ly A Andersson G . A fully automated conversational agent for promoting mental well-being: a pilot RCT using mixed methods. Internet Interv. (2017) 10:39ā46. 10.1016/j.invent.2017.10.002
56.
Maher CA Davis CR Curtis RG Short CE Murphy KJ . A physical activity and diet program delivered by artificially intelligent virtual health coach: proof-of-concept study. JMIR Mhealth Uhealth. (2020) 8(7):e17558. 10.2196/17558
57.
Marler JD Fujii CA Utley DS Tesfamariam LJ Galanko JA Patrick H . Initial assessment of a comprehensive digital smoking cessation program that incorporates a mobile app, breath sensor, and coaching: cohort study. JMIR Mhealth Uhealth. (2019) 7(2):e12609. 10.2196/12609
58.
McGuire AM Porter-Steele J McDonald N Burgess SC Anderson DJ Seib C . Reducing noncommunicable disease risk in midlife adults using eHealth: the GroWell for health program feasibility study. Collegian. (2022) 29(3):328ā36. 10.1016/j.colegn.2021.09.003
59.
Moreno-Blanco D Solana-SƔnchez J SƔnchez-GonzƔlez P JimƩnez-Hernando M Cattaneo G Roca A et al Intelligent coaching assistant for the promotion of healthy habits in a multidomain mHealth-based intervention for brain health. Int J Environ Res Public Health. (2021) 18(20):10774. 10.3390/ijerph182010774
60.
Ollier J Suryapalli P Fleisch E Wangenheim FV Mair JL Salamanca-Sanabria A et al Can digital health researchers make a difference during the pandemic? Results of the single-arm, chatbot-Led Elena+: care for COVID-19 interventional study. Front Public Health. (2023) 11:1185702. 10.3389/fpubh.2023.1185702
61.
Price J Brunet J . Understanding rural-living young adult cancer survivorsā motivation during a telehealth behavior change intervention within a single-arm feasibility trial. Health Informatics J. (2022) 28(1):1ā20. 10.1177/14604582221075560
62.
Sacher PM Fulton E Rogers V Wilson J Gramatica M Dent JE et al Impact of a health coachāled, text-based digital behavior change intervention on weight loss and psychological well-being in patients receiving a procedureless intragastric balloon program: prospective single-arm study. JMIR Form Res. (2024) 8:e54723. 10.2196/54723
63.
Santini S Fabbietti P Galassi F Merizzi A Kropf J HungerlƤnder N et al The impact of digital coaching intervention for improving healthy ageing dimensions among older adults during their transition from work to retirement. Int J Environ Res Public Health. (2023) 20(5):4034. 10.3390/ijerph20054034
64.
Smart MH Nabulsi NA Gerber BS Gupta I Di Eugenio B Ziebart B et al A remote health coaching, text-based walking program in ethnic minority primary care patients with overweight and obesity: feasibility and acceptability pilot study. JMIR Form Res. (2022) 6(1):e31989. 10.2196/31989
65.
Terblanche N Molyn J De Haan E Nilsson VO . Coaching at scale: investigating the efficacy of artificial intelligence coaching. Int J Evid Based Coach Mentor. (2022) 20(2):20ā26. 10.24384/5cgf-ab69
66.
To QG Green C Vandelanotte C . Feasibility, usability, and effectiveness of a machine learningābased physical activity chatbot: quasi-experimental study. JMIR Mhealth Uhealth. (2021) 9(11):e28577. 10.2196/28577
67.
Ulrich S Lienhard N Künzli H Kowatsch T . A chatbot-delivered stress management coaching for students (MISHA app): pilot randomized controlled trial. JMIR Mhealth Uhealth. (2024) 12:e54945. 10.2196/54945
68.
Wiegand B Luedtke K Friscia D Nair M Aleles M McCloskey R . Efficacy of a comprehensive program for reducing stress in women: a prospective, randomized trial. Curr Med Res Opin. (2010) 26(4):991ā1002. 10.1185/03007991003688193
69.
Williams R Hopkins S Frampton C Holt-Quick C Merry SN Stasiak K . 21-day stress detox: open trial of a universal well-being chatbot for young adults. Soc Sci. (2021) 10(11):416. 10.3390/socsci10110416
70.
Terblanche N . A design framework to create artificial intelligence coaches. Int J Evid Based Coach Mentor. (2020) 18(2):152165. 10.24384/b7gs-3h05
71.
Eysenbach G . The law of attrition. J Med Internet Res. (2005) 7(1):e402. 10.2196/jmir.7.1.e11
72.
Boucher EM Raiker JS . Engagement and retention in digital mental health interventions: a narrative review. BMC Digit Health. (2024) 2(1):52. 10.1186/s44247-024-00105-9
73.
Ryan C Bergin M Wells JS . Theoretical perspectives of adherence to web-based interventions: a scoping review. Int J Behav Med. (2018) 25:17ā29. 10.1007/s12529-017-9678-8
74.
GraĆmann C Schƶlmerich F Schermuly CC . The relationship between working alliance and client outcomes in coaching: a meta-analysis. Hum Relat. (2020) 73(1):35ā58. 10.1177/0018726718819725
75.
Fadhil A Gabrielli S , editors. Addressing challenges in promoting healthy lifestyles. Proceedings of the 11th EAI International Conference on Pervasive Computing Technologies for Healthcare, PervasiveHealth; (2017). 10.1145/3154862.3154914
76.
Aggarwal A Tam CC Wu D Li X Qiao S . Artificial intelligenceābased chatbots for promoting health behavioral changes: systematic review. J Med Internet Res. (2023) 25:e40789. 10.2196/40789
77.
Jarrahi MH . Artificial intelligence and the future of work: human-AI symbiosis in organizational decision making. Bus Horiz. (2018) 61(4):577ā86. 10.1016/j.bushor.2018.03.007
78.
Nagao K Nagao K . Symbiosis between humans and artificial intelligence. In: Nagao E, editor. Artificial Intelligence Accelerates Human Learning: Discussion Data Analytics. Singapore: Springer (2019). p. 135ā51. 10.1007/978-981-13-6175-3_6
79.
Grigsby SS , editor. Artificial intelligence for advanced human-machine symbiosis. Augmented Cognition: Intelligent Technologies: 12th International Conference, AC 2018, Held as Part of HCI International 2018; July 15ā20, 2018; Las Vegas, NV, USA (2018). Springer. Proceedings, Part I. 10.1007/978-3-319-91470-1_22
80.
Licklider JC . Man-computer symbiosis. IRE Trans Hum Factors Electron. (1960) 1:4ā11. 10.1109/THFE2.1960.4503259
81.
Clutterbuck D . The challenges of coaching and mentoring in a digitally connected world. In: Wegener R, Ackermann S, Amstutz J, Deplazes S, Künzli H, Ryter A, editors. Coaching im Digitalen Wandel. Gƶttingen, Germany: Vandenhoeck & Ruprecht (2020):19ā29. 10.13109/9783666407420.19
82.
Godino JG Merchant G Norman GJ Donohue MC Marshall SJ Fowler JH et al Using social and mobile tools for weight loss in overweight and obese young adults (project SMART): a 2 year, parallel-group, randomised, controlled trial. Lancet Diabetes Endocrinol. (2016) 4(9):747ā55. 10.1016/S2213-8587(16)30105-X
83.
GraĆmann C Schermuly CC . Coaching with artificial intelligence: concepts and capabilities. Hum Resour Dev Rev. (2021) 20(1):106ā26. 10.1177/1534484320982891
84.
Chatterjee A Prinz A Gerdes M Martinez S . Digital interventions on healthy lifestyle management: systematic review. J Med Internet Res. (2021) 23(11):e26931. 10.2196/26931
85.
Laranjo L Dunn AG Tong HL Kocaballi AB Chen J Bashir R et al Conversational agents in healthcare: a systematic review. J Am Med Inform Assoc. (2018) 25(9):1248ā58. 10.1093/jamia/ocy072
Summary
Keywords
digital health interventions, positive health, health promotion, disease prevention, coaching, health coaching
Citation
Loughnane C, Laiti J, OāDonovan R and Dunne PJ (2025) Systematic review exploring human, AI, and hybrid health coaching in digital health interventions: trends, engagement, and lifestyle outcomes. Front. Digit. Health 7:1536416. doi: 10.3389/fdgth.2025.1536416
Received
28 November 2024
Accepted
24 March 2025
Published
24 April 2025
Volume
7 - 2025
Edited by
Ben Singh, University of South Australia, Australia
Reviewed by
Oscar Castro, ETH Zürich, Singapore
Anna Dysart, Virginia Tech, United States
Ana Catarina Martins, New University of Lisbon, Portugal
Updates
Copyright
Ā© 2025 Loughnane, Laiti, OāDonovan and Dunne.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: CroĆa Loughnane croialoughnane22@rcsi.com
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.