Your new experience awaits. Try the new design now and help us make it even better

REVIEW article

Front. Digit. Health, 17 June 2025

Sec. Digital Mental Health

Volume 7 - 2025 | https://doi.org/10.3389/fdgth.2025.1595243

This article is part of the Research TopicDigital Health Past, Present, and FutureView all 27 articles

The implementation of digital biomarkers in the diagnosis, treatment and monitoring of mood disorders: a narrative review


Andrea P. Garzn-PartidaAndrea P. Garzón-Partida1Citlali B. Padilla-GmezCitlali B. Padilla-Gómez1Diana Emilia Martínez-FernndezDiana Emilia Martínez-Fernández2Joaquín García-EstradaJoaquín García-Estrada1Sonia LuquinSonia Luquin1David Fernndez-Quezada

David Fernández-Quezada1*
  • 1Departamento de Neurociencias, Instituto de Neurociencias Traslacionales, Centro Universitario de Ciencias de la Salud (CUCS), Universidad de Guadalajara (UdeG), Guadalajara, México
  • 2Departamento de Farmacobiología, Centro Universitario de Ciencias Exactas e Ingenierías (CUCEI), Universidad de Guadalajara (UdeG), Guadalajara, México

Mood Disorders are a group of mental health conditions characterized by a disruption of the emotional state that affects the quality of life of the people living with them. Mental Disorders are difficult to diagnose and treat due to the complex processes involved and limitations of the healthcare system. Digital biomarkers have created accessible, long-term, non-invasive, and user-friendly alternatives for the diagnosis, treatment, and monitoring of these conditions. The use of everyday devices like smartphones and smartwatches and specialized tools like actigraphy, in conjunction with powerful statistical tools, artificial intelligence, and machine learning, represents a promising avenue for the implementation of personalized strategies to monitor and treat Mood Disorders, and potentially higher adherence to treatment. We conducted several studies that implement a variety of methodologies and tools to better understand Mood Disorders, using a patient-focused approach with the ultimate goal of identifying better strategies to improve their quality of life.

Introduction

Mood disorders (MD) encompass a category of psychiatric conditions characterized by significant disturbances in emotional regulation. These disorders manifest as prolonged or cyclical episodes of depression, hypomania, or mania, impairing daily functioning and overall well-being. The MD spectrum includes Major Depressive Disorder (MDD), Bipolar Disorder (BD), Cyclothymia, Hypomania, Disruptive Mood Dysregulation Disorder, and Premenstrual Dysphoric Disorder (1).

The diagnosis and treatment of MD remain complex due to barriers such as limited access to mental health services and low long-term treatment adherence, often influenced by socio-demographic factors (2). Studies indicate that in developed countries, only 50% of individuals with MD receive diagnoses and follow-up care. This figure drops to 30% globally when developing countries are considered, where stigma surrounding mental health further hinders access to care (3).

Search method

This article is a narrative review which employed a systematic search process. Articles were collected from previously published original articles in The Web of Science and Pubmed databases, until August 2024. A comprehensive literature search was conducted using a combination of Medical Subject Headings (MeSH) and free-text terms to identify relevant studies examining the use of digital biomarkers in patients with Mood Disorders. The search strategy was designed to capture studies that addressed digital biomarkers in the context of diagnosis, assessment, or monitoring of mood-related conditions. Search terms included combinations of keywords and MeSH terms related to (“digital biomarkers” OR “digital phenotyping” OR “passive sensing” OR “wearable devices” OR “mobile health” OR “mHealth” OR “machine learning”) AND [“mood disorders” [MeSH Terms] OR “depressive disorder” [MeSH Terms] OR “depression” OR “bipolar disorder”] AND (“diagnosis” OR “assessment” OR “monitoring” OR “detection”). These terms were searched within the title, abstract, and indexing fields of each database. Inclusion criteria focused on peer-reviewed original research studies involving human participants, with outcomes relevant to the use of digital biomarkers in mood disorder diagnosis or management. Exclusion criteria included review articles, editorials, conference abstracts, and studies not available in English. Upon full text examination of relevant articles, we included studies that used digital measurements obtained through the use of smartphone devices, wearables or online platforms, on individuals with MD, both depressive and bipolar. After the selection process, data from the studies was compiled on an Excel spreadsheet, coding for variables of interest: authors, year, objectives, MDs, number of participants, duration, dispositive, data type, metrics used and results. Finally, the studies were categorized into categories in accordance to their objective and methodology.

Clinical and digital biomarkers

Biomarkers are among the most widely used tools for assessing physical and mental health due to their precision and reproducibility (4). Traditionally, psychiatric diagnoses rely on standardized psychological tests (5) and neuroimaging techniques such as electroencephalography (6).

Digital biomarkers, derived from data collected through smart devices, offer an alternative to conventional diagnostic methods (7). These biomarkers provide continuous, longitudinal, and objective metrics of physiological, psychological, behavioral, and environmental variables (8). Wearable technologies, such as smart rings, actigraphy devices, smartwatches, and smart patches, along with smartphones and tablets, leverage the Internet of Things (IoT) to enable seamless data collection and integration. This connectivity allows real-time data transmission through Bluetooth or Wi-Fi, facilitating AI- and ML-driven analysis (9, 10).

Machine learning models (MLMs) process large-scale, complex datasets with high precision, identifying patterns that enhance the understanding, diagnosis, and prediction of symptom severity in psychiatric conditions, including MD (11, 12). These models utilize both passive (automated) and active (user-initiated) data collection, with validation achieved through comparisons with standardized clinical assessments (13, 14).

The assessment of MD relies primarily on self-reported questionnaires and validated clinical scales, including the Patient Health Questionnaire (PHQ-9) for depression, the Generalized Anxiety Disorder (GAD-7) scale for anxiety, and the Young Mania Rating Scale (YMRS) for manic symptoms in BD. Additionally, Ecological Momentary Assessment (EMA) and cognitive performance tests have been widely used for continuous mood and functionality evaluation.

Given the heterogeneity of MD research, studies incorporating digital biomarkers are categorized based on their primary objectives: diagnostic tools, therapeutic interventions, ML-based classification models for symptom monitoring, and predictive analytics for prognosis.

Digital therapeutic interventions

Clinical trials evaluating digital interventions for MD report mild to favorable outcomes. The predominant approach involves adapting traditional therapeutic strategies, such as cognitive-behavioral therapy (CBT), into self-guided digital formats. Studies examining the efficacy of smartphone-based self-reporting applications (Table 1) indicate overall improvements in depression and anxiety symptoms. For instance, the Vida Health app integrates digital CBT with symptom monitoring, yielding positive results in clinical settings compared to traditional therapy (15). Another study compared standard CBT with blended CBT, assessing its effectiveness through participants' self-reported therapeutic alliance and mood states at a regular basis, resulting on a higher effectivity of blended CBT after mood states were compared between groups (16).

Table 1
www.frontiersin.org

Table 1. Therapeutic approaches with the use of digital technologies and digital biomarkers.

Additional intervention aimed to teach self-management techniques through specialized activities and self-reports, complementing standard therapeutic approaches. Participants recorded the severity of their symptoms and mood states while receiving coaching on self-management strategies through the LiveWell and IntelliCare apps (17, 18). Also, the Juli app collected passive smartphone data alongside self-reported symptom severity, mood fluctuations, and quality of life indicators, all of which showed improvement following the study (19). A fully remote, app-based intervention tool called CORE demonstrated significant therapeutic effectiveness across participant groups at both the 30-day and 60-day marks, with a 50% adherence rate (20).

Two web-based platforms were evaluated for their usability and acceptability among individuals with depression. MoodFX, a therapeutic tool that also functioned as a tracker for cognitive function, symptom severity, quality of life, medication adherence, and general health status, exhibited high usability, with participants reporting notable improvements in depressive symptoms (21). The Dario Health platform, which incorporated guided coaching interactions for the management of depression and anxiety, demonstrated a reduction in mood disorder symptoms based on user-reported effectiveness scores (22).

The Step-by-Step platform was tested as an intervention tool during Lebanon's economic, environmental, political, and humanitarian crisis amidst the COVID-19 pandemic. Notably, it reported a higher treatment response rate, 1 in 4 participants, compared to traditional therapy, which had a response rate of 1 in 7, even at the three-month follow-up (23). Life Flex, designed for individuals with MDD and BD, was effective in reducing anxiety, depression, psychological distress, and emotional dysregulation. The intervention was well-accepted by users, with adherence rates extending up to 90 days (20).

Correlation models

The Correlation models are statistical techniques used to analyze linear relationships between variables, even when they appear dissimilar (25). Several studies have explored correlations between digital biomarkers, such as physical activity levels, sleep patterns, geolocation data, and speech markers, derived from wearable devices and self-reported clinical scales measuring symptom severity. Study protocols varied in duration, with those lasting ≤14 days classified as short-term and those exceeding 14 days categorized as long-term.

Exploratory short-term studies

Short-term correlation studies (Table 2) included medium-to-large participant samples (n) observed over brief periods. MLMs processed extensive datasets obtained from wearable devices and specialized applications to generate digital profiles of participants. Despite methodological diversity across studies, all utilized smartphone-based data collection.

Table 2
www.frontiersin.org

Table 2. Correlations between digital data and self-report scores for diagnosis in short-term studies.

In studies employing secondary devices, actigraphs, smartwatches, smart patches, and smart glasses were utilized. The DynaPort Hybrid accelerometer was used to assess gait characteristics, including abnormalities and movement patterns, correlating these features with depression scores from two databases of adult and elderly participants. The results revealed a negative correlation between regular gait behavior and depression scores, allowing for a passive assessment through regular behavior (26). Similarly, the research-grade Actigraph Motion Watch 8 and the commercial-grade Galaxy Watch Active 2 were employed to develop a depression phenotype by measuring psychomotor retardation and agitation, with variations correlating to different depressive episode presentations regardless of the dispositive used for measurements (27).

Physiological measurements were collected using the Philips Respironics Actigraph and Proteus smart patches, which detect electrodermal activity, physical activity (PA), and sleep patterns. These studies identified associations between PA levels, energy expenditure, and activity duration with the mood states of participants with BD (28). Additionally, integrating the Proteus patch with the Mood Zoom app allowed researchers to analyze heart rate (HR), PA, sleep, and circadian rhythms, establishing a link between diurnal activity patterns and mood fluctuations in participants with BD and BPD (29).

Specialized wearables, such as Emteq's ECOsense™ smart glasses, demonstrated the versatility of digital biomarkers. Using optomyography-based sensor data, a one-session protocol assessed emotional states in participants with depression, comparing them to healthy controls (HCs) and self-reported measurements of arousal and valence after exposure to visual stimuli. Results indicated a correlation between muscle activation and self-reported emotional scores, distinguishing depressed participants from HCs based on facial muscle response intensity during emotional stimuli (30).

Smartphone-based studies utilized both passive and active data collection methods to characterize participant profiles. One study implemented a mobile app that leveraged Global Positioning System (GPS) data to assess mobility, activity at specific locations, and travel patterns, correlating these behaviors with depressive symptoms, confirming a negative correlation between lowered physical activity and a higher depression scores (31). Additionally, GPS data was used to determine participants' approximate geographical locations, facilitating the acquisition of air quality index (AQI) ratings and weather conditions throughout the day, particularly the approximated air pollution of the areas visited, based on the levels reported by meteorological services. This study found a significant relationship between AQI levels and mood disorder symptoms in individuals with BD, with a one-point increase in AQI correlating with a 0.11-point rise in depression severity scores, suggesting a potential link between air pollution exposure and worsening mood symptoms (32).

Within the same category of short-term monitoring, researchers analyzed passive smartphone-recorded social interactions in individuals with depression, individuals with anxiety disorders, and healthy controls. The collected auditory data, including speech characteristics and environmental noise, was used to examine correlations with depression and anxiety severity, individual functionality, and physiological measures such as PA and sleep patterns. The analysis revealed a negative correlation between social interactions and depression severity, though no significant relationship was observed for anxiety symptoms (33). These findings reinforce the potential of alternative digital biomarkers as valuable tools for monitoring and treating mood disorders, further validating the well-established relationship between reduced social interactions and increased depressive symptoms (34).

Exploratory long-term studies

In the second category (Table 3), monitoring durations ranged from 89 to 391 days, with the largest variation in sample sizes (n), ranging from 30 to 10,718 participants. Digital sensor-based and self-reported measurements were used to monitor participants in real-world environments over extended periods, allowing for real-time data collection and deeper insights into the lived experiences of individuals with mood disorders (35).

Table 3
www.frontiersin.org

Table 3. Correlation models through long-term monitoring of symptoms in MD.

Actigraphy and smartwatches have proven to be viable tools for collecting long-term data, with study protocols extending up to 336 days. In one study, circadian rhythms, mood fluctuations, and symptoms in participants with mood disorders were monitored using a combination of Fitbit Charge HR 2 and Fitbit Charge HR 3, smartphone-delivered self-assessment EMA, and standardized clinical scales. Results demonstrated a discernible relationship between circadian phase disturbances, mood fluctuations, and symptom severity in individuals with BD, highlighting circadian rhythms as potential variables for digital phenotyping of BD mood state (36). Similarly, the effect of circadian rhythms on individuals with MDD was investigated using the ActiCal actigraph measurements of PA, analyzed for variability. This study examined medication effects in both remitters and non-remitters, revealing a significant correlation between symptom remission and circadian rhythm stability, allowing for a potential use of circadian rhythm stability as a potential biomarker for treatment effectivity (37).

In addition to passive data collection and self-assessments, cognitive function was monitored in participants with BD and MDD using smartwatches. A study integrating Apple Watch physiological data, smartphone-delivered cognitive tests, and self-reported symptom tracking evaluated long-term changes in mood, cognition, PA, and HR in participants with MDD. Findings indicated a correlation between mood fluctuations, PA levels, and HR, as well as a positive relationship between increased PA and improved mood (38). Another study assessed cognition and symptom fluctuations in individuals with BD using smartwatch-delivered EMA, while also evaluating usability and engagement with the LiveWell self-management app. Results showed a significant impact of cognitive performance on depression and mania scores (39). Showing a viability of cognitive performance assessment in non-clinical settings for a greater insight into variability, emotional state, and environmental factors.

Furthermore, PA and stress levels were monitored through passive data collected via the Monsenso smartphone app in a study assessing mood stability in individuals with BD. Results indicated that greater mood stability was positively correlated with stable PA levels and perceived stress regulation, while negatively correlated with quality-of-life indicators. This data was collected through daily self-reports for up to 266 days, (40). A similar relationship was observed in another 28 days-long study examining individuals with MDs and anxiety disorders, where symptoms and mood stability were measured through passive data and self-reports using the ReMAP app, which was consistent with clinical scales of depression and anxiety, accounting for effectiveness in shorter periods of times (41).

Language and speech patterns have also been utilized in long-term correlation models through at least two distinct approaches. The first approach involved analyzing acoustic content-free speech features and behavioral trends via the VoiceSense app to classify speech patterns and vocal features indicative of depression. This method was particularly effective in individuals with high PHQ-9 depression scores, with results also highlighting the influence of age and educational level on speech characteristics (42). The second approach involved analyzing the content of text-based therapy sessions from the Talkspace web-based therapy platform. In this study, behavioral activation words, Linguistic Inquiry and Word Count (LIWC) markers, and time-related metadata from interactions were successfully correlated with established depression markers and clinical scores, allowing for an assessment of therapy efficacy and depression improvement through time (43).

Classification models of MD diagnosis

MLMs can be implemented using various algorithms, each designed for specific tasks based on the type of data and the underlying model architecture. Among the two primary types of MLMs, supervised and unsupervised, the reviewed literature primarily focused on supervised models, which were used to classify data, establish regressions, or perform both tasks simultaneously (44).

For studies employing classification models, data were categorized to identify and group information into predefined categories (45). By recognizing patterns, these models allowed diverse datasets to be assigned to specific classes, effectively distinguishing between HCs and individuals with MDs, as well as identifying disorder-specific characteristics (Table 4).

Table 4
www.frontiersin.org

Table 4. Classification MLMs of MD diagnosis.

Wearable devices have proven instrumental in providing data to train MLMs and develop phenotypic classifications for MDs. The Actiwatch actigraph, for instance, successfully detected changes in depressive symptoms, enabling the diagnosis of participants with MDD and BD by identifying movement patterns and activity characteristics (46). Similarly, the GENEActiv Original actigraph was used to examine the variability of circadian rhythms, correlating mood instability and symptom severity with sleep variability, which effectively distinguished between BD, BPD, and HCs (47). The validated precision of these devices further strengthens PA and circadian rhythm as reliable biomarkers in the assessment of MDs and symptoms intensity.

Smartbands with different levels of precision have also been employed for this purpose, with the Empatica E4 collecting physiological data later used to train deep-learning models and MLMs that outperformed traditional psychological scales such as the Hamilton Depression Rating Scale (HDRS) and the YMRS (48). Additionally, Fitbit Charge HR 2 and HR 3 were used to obtain physiological and behavioral data, which, through MLM analysis, successfully correlated circadian disturbances with MDD and BD-I, leading to the development of a diagnostic models for both conditions, where a personalized model outperformed a generalized one by 23.8% accuracy (49). A less commonly utilized wearable, the Oura smart ring, which integrates mobile usage data and GPS capabilities, was employed in a study where participant mobility patterns, including the duration and type of locations visited, along with sleep characteristics, such as sleep onset and daily variability, served as reliable markers for accurately classifying depression through multi-level models. Combined models where data from all devices and self-report measures, obtained the best predictive performance (50).

Another key physiological measure used in MLM-based classification models was electrodermal activity (EDA). Three separate studies explored EDA variations among participants with MDD, BD, and HCs, aiming to classify mood states and symptoms for diagnostic purposes. In individuals with BD, the Empatica E4 detected mood-related EDA variations, with peaks per minute correlating negatively with depressive episodes and positively with manic episodes, thereby distinguishing between BD participants' emotional states (51). Additionally, another study utilizing the same device determined that wakefulness EDA was the most reliable classifier for differentiating participant groups, with symptoms, body temperature, and medication use emerging as the primary variables influencing the model. Nightly EDA. All MLM, regardless of the circadian phase, were significantly affected by mixed symptoms, skin temperature, anticholinergic medications. Sex was significant for tonic phases of sleep, while age didn't affect any model (52). The ADI Study Watch, which functioned as both a physiological monitoring tool and a digital symptom diary, detected significant positive correlations between increased skin conductance responses and depressive symptoms such as anhedonia, guilt, sleep disturbances, and appetite changes. Centrality predictive strength was found with a higher intensity for suicide, followed by skin conductance and depressive moods. These findings contributed to the development of a model capable of accurately identifying individuals with MDD (53).

Smartphone-based diagnostic MLMs for MDs

Smartphone GPS data and specialized mobile applications have been leveraged to develop diagnostic MLMs for BD. One study distinguished BD participant from HCs by analyzing variations in location entropy and mobility patterns. GPS data with PA and sleep parameters were integrated, enabling the detection of BD and classification of symptom states. The highest accuracy (F1 = 0.86) was achieved for the detection of depressive symptoms (54). Furthermore, an MLM integrating EMA data successfully distinguished BD, schizophrenia, and schizoaffective disorder, revealing small to moderate correlations between participant location data and negative symptoms, functional outcomes, and active behaviors, while positive symptoms, depression, and anxiety exhibited lower correlations (55).

MLMs have also been applied in the multidimensional assessment of cognition in MDs. One study combined Magnetic Resonance Imaging (MRI), motor characteristics, PA monitoring using the Open Lab: Open Movement 1, memory and attention tests, and standardized depression scales to examine late-life depression. Results revealed significant associations between late-life depression and slower PA, decreased fine motor skills, impaired learning, and reduced quality of life, leading to the development of a classification tool based on depression severity scores (56, 57). Another study employed the Fitbit Ionic smartwatch alongside the Monsenso system to monitor cognition, PA, sleep, and geolocation, enabling the identification of depressive, manic, and euthymic states in BD participants. Notably, the frequency of phone calls per day was found to correlate significantly with mood states. While significant correlations were found, the MLM couldn't be validated (AUC = 0.42) (40). Similarly, an MLM analyzing smartphone-based socialization patterns and cognitive assessment activities successfully diagnosed depression using standard clinical outcome measures, achieving an accuracy of 80%–85% when the NeuroCart battery for eye movement was used as classifier (57).

One study utilized multimedia data in diagnostic MLMs, incorporating headsets and cameras to analyze facial and vocal features for distinguishing MDs from other psychiatric disorders. The study found that muscle and voice activity variations could discriminate between schizophrenia and BD while also quantifying symptom severity, achieving a performance up to AUROC = 0.81 in the prediction of the symptom of blunted affect by voice features, and down to AUROC = 0.61 for worthlessness (58). The Sonde Health smartphone app, complemented by self-assessment data, developed a model capable of stratifying individuals by mental health risk, further refining classification accuracy by integrating self-reported symptomatology and protocol completion rates. Accuracy of assessment was dependent of the level of engagement with the App, with a 38% of participants achieving the best predictor level (high) (59). Additionally, the BiAffect smartphone app successfully detected depression severity changes through typing behavior analysis, with key parameters including typing speed, variability, accuracy, session duration, autocorrect usage, and overall device interaction patterns. Age, typing session duration and one or two-hand typing were factors that affected the typing speed, variability, use of autocorrect and pauses (60).

A high-performance MLM was developed by integrating multisource data from the Empatica E4 wristband, a smartphone, and a specialized mobile application. Monitoring was carried out using sensors on both hands, with a variance of daily data from 17 to 15.5 h. This model incorporated self-reported symptom assessments, EDA, sleep patterns, motion data, location tracking, and social interaction metrics (e.g., phone calls and SMS frequency), demonstrating superior classification accuracy for mood disorder diagnosis (61).

Web-based platforms for MD diagnosis

While most studies reviewed in this section relied on wearables and smartphones, two investigations utilized web-based platforms. The first study compared MDD symptomatology with subthreshold depressive symptoms using digital assessments. The platform effectively distinguished between clinical MDD and subthreshold depression, improving classification accuracy through transdiagnostic symptom analysis. However, results indicated a tendency toward overdiagnosis in PHQ-9-based assessments (62). Another study applied an Internet-Based Cognitive Assessment Tool (ICAT) to evaluate cognitive function in BD, successfully linking cognitive impairments with poorer functional outcomes and symptom severity (63).

Predictive models of MD prognosis

Predictive models utilize mathematical and computational techniques to estimate the likelihood of positive, negative, or stable clinical outcomes based on behavioral, physiological, and environmental data (64, 65). As outlined in Table 5, these models have been employed to predict remission and relapse rates, symptom severity trajectories, and cognitive functioning changes across varying timeframes.

Table 5
www.frontiersin.org

Table 5. Predictive models of MD prognosis.

Chronobiological data obtained from Fitbit smartwatches, smartphones, and a specialized app successfully predicted three-day mood states with 65% accuracy for manic and hypomanic episodes, particularly for a personalized model, which outperformed the general by 23.8% (66). A dynamic network analysis of PA, measured via Actiwatch actigraphs and clinical assessments, predicted worsening depressive symptoms in BD-I, with declines in PA serving as an early indicator. Interestingly, the dynamic network structures were constructed using multilevel vector autoregressive (VAR) analysis, where each affect item and physical activity feature was classified as a node (67).

Furthermore, by implementing activity anomaly detection frameworks (N-of-1) algorithms, one study predicted MDD relapse with ≥71% accuracy, up to 2–3 weeks in advance, from data of patients from two different cohorts of people with and without at least one relapse (68). Another study utilized sleep data and EMA self-assessments to create a one-day mood forecast MLM, in which past mood was identified as the most accurate and least error-prone predictor. Lineal and ordinal hierarchical models were evaluated, finding lineal models to be better at catching distribution tails, and the Bayesian ordinal model better at data grouping and predicting mood levels (69).

Patterns of movement, tracked using smartphone-integrated GPS, were incorporated into predictive MLMs. The Mood Triggers app monitored symptoms and correlated them with PA and pre-defined locations, enabling the creation of a model that predicted the onset of depressed moods in individuals with MDD up to one hour before onset (70). Additionally, self-reported sleep quality, mood, and passively collected PA data were identified as reliable predictors of next-week mood and symptom outcomes in BD participants. In this study, data was approached through distinct time frames: repeated, daily, weekly and monthly, allowing a deeper correlation analysis (71). Another MLM achieved higher precision using a group-personalized regression model, correlating diurnal movement patterns, transition time between locations, total traveled distance, variability of location, and time spent at home with symptom severity in BD (72).

MLMs for predicting mood episodes were developed using circadian rhythm data. Passively collected daily activity patterns, obtained from a Fitbit Charge HR smartwatch, a smartphone app, and self-reported mood assessments, contributed to models that accurately predicted depressive, manic, and hypomanic episodes by linking low sleep quality, abnormal PA, and circadian disruptions with an increased likelihood of manic or hypomanic episodes (73). Similarly, a high-accuracy predictive model of depressive symptom severity, capable of forecasting symptoms up to three weeks in advance, was developed using circadian rhythm data, socialization patterns, and digital behavior collected via the AWARE framework smartphone app (74).

Predictive typing behavior-centered MLMs were developed by analyzing typing speed, accuracy, correction patterns, and autocorrect use, combined with EMA-derived mood and cognitive performance data. Typing speed was identified as a predictor of attention and processing skills, correlating with depressive symptoms. Three mixed longitudinal MLMs were implemented, with all of them accounting for age and time of the day, and a possible practice effect on keystroke behavior (75). The BiAffect app further demonstrated predictive capabilities by analyzing time, quality, and duration of typing sessions, successfully forecasting depression scores in BD participants (76).

Smartphone usability features have been explored as potential markers of communication behaviors and digital activity. One study using the Carat app monitored screen time and app usage, identifying positive correlations between screen activity patterns and depressive symptoms. MLMs trained on these data successfully predicted depression severity based on screen and internet connectivity metrics (50). Additionally, complex models integrating metadata from the HelloBetter app, symptom self-assessments via the MAIKI app, and phone usage data from the Insight app generated an MLM capable of assessing depression severity. However, this model exhibited a notable sex-based bias, overrepresenting female participants (77). Similarly, correlating passive smartphone PA and sleep data (collected via the MovisensXS app) with phone activity data from the ChronoRecord system apps, allowed for the successful prediction of same-day manic and depressive episodes in BD participants (78).

Multimodal data collection, incorporating multimedia records, has been employed to predict mania and depression severity in BD patients, testing seven MLMs. The most effective at predicting were the Least Absolute Shrinkage And Selection Operator (LASSO) and ElasticNet regularization moels, which were improved through a greater combination of heterogeneous data (79). By integrating phone and app usage data, MLMs were able to forecast mood variability in BD, with step count and sleep patterns emerging as the most accurate predictive metrics, whereas app usage data exhibited lower predictive accuracy. In all combinations, Random Forest models outperformed others (80). When data from the Oura smart ring was combined with phone usage data, multilevel MLMs achieved higher accuracy in predicting depression and anxiety symptom severity in participants with MDD (81). Further, physiological data, such as heart rate (HR) collected using multiple actigraphs in the same protocol with the D-MOMO app, was found to be a unimodal predictor of MDD, while integrating facial expression analysis, vocal recordings, and self-reported mood significantly improved MLM performance (82).

A multimodal approach has also been employed to assess socialization patterns. By passively collecting Bluetooth-based proximity data from smartphones, researchers calculated socialization time and interaction levels. MLMs successfully correlated reductions in social interactions with worsening depressive symptoms, highlighting the role of circadian rhythms, social engagement, and location frequency in mood prediction (83).

Finally, online-based platforms have been utilized for training, validating, and verifying predictive models. Longitudinal data collected via the True Colors platform established a link between insomnia and mood instability, influenced by demographic and clinical factors. The weekly inertia of sleep quality emerged as a predictor of hypomanic and depressive symptoms in BD (84). Additionally, an analysis of Facebook posts spanning six years was used to train MLMs capable of predicting future depression onset using linguistic markers. The models identified seven out of ten language topics previously associated with depression as predictors of future depressive episodes (85).

Limitations of digital biomarkers

The use of digital biomarkers has expanded in both research and clinical settings as diagnostic, therapeutic, and disease-monitoring tools for individuals with physical and mental health conditions. These biomarkers also offer enhanced self-management opportunities for patients (86). However, as their implementation grows, several methodological, technological, and ethical challenges have emerged.

A key methodological limitation is heterogeneity in study protocols. Luo et al. (87) proposed standardized guidelines for developing MLMs in biomedical research, highlighting the challenge of determining optimal data collection durations. Study lengths have ranged from less than 30 min (60) to two years (65), raising concerns about underfitting or overfitting when developing statistical and machine learning models.

Another critical issue is sample size selection. Rykov et al. (88) proposed a formula for estimating appropriate sample sizes in biomedical research. While some researchers advocate for larger sample sizes to enhance model accuracy (89), others suggest that smaller, high-quality datasets may improve clinical applicability and reduce computational biases (90).

Additionally, sensor accuracy and feature detection reliability have been scrutinized (91, 92). Given the variability in sensor quality, especially between commercial and research-grade devices (93), there is a growing demand for greater transparency regarding measurement accuracy and participant characteristics (95). Validation efforts should focus on comparing digital biomarkers against established clinical measures and standardizing data reporting across studies (93).

Ethical considerations surrounding privacy and data security have also been raised. Given the sensitive nature of digital health data, concerns have emerged about data collection, storage, and processing practices. A great concern regarding privacy and informed consent has been addressed, particularly regarding understanding the true magnitude of the data collected through digital tools, within age and cognitive performance groups, particularly with elderly participants and in cases where participants present a condition where cognition can be affected (96).

While international regulatory bodies have established privacy guidelines for the private sector (97), biomedical research regulations for digital biomarkers remain underdeveloped, necessitating clearer ethical frameworks (94, 95).

Finally, the widely discussed bias present in both digital sensors and algorithms, can be a limitation of digital health technologies applied to mental health. This ethical concern, in addition to the existing race and sex-related bias in the biomedical research, can potentially represent a risk of use of these technologies in underrepresented groups (98).

Conclusion

The integration of digital measurements, whether obtained through specialized sensors, passively collected metadata, or user interactions with technology, is rapidly transforming mental health research and clinical practice. These emerging methodologies offer non-invasive, cost-effective, and scalable solutions for the diagnosis, monitoring, and treatment of MDs, ultimately improving patients' quality of life.

Studies investigating digital biomarkers for MDs have demonstrated promising results, reducing clinician bias by moving beyond static, clinical assessments and incorporating real-world, dynamic data. However, these technologies also present methodological, accuracy-related, and ethical challenges that must be addressed to ensure their widespread acceptance and clinical validity.

Future perspectives in the field of digital health would benefit by an emphasis on the standardization of the digital tools used in studies, providing a better understanding of the accuracy of the tools and sensors used in the measurement of the metrics. Transparency on the methodology used for validation of datasets, disclosure of the quality of data and its processing could also strengthen possible weaknesses originated by the variability of participants' behaviors in highly heterogeneous environments.

Study protocols in this area can greatly benefit from an interdisciplinary approach. This is by the collaboration between healthcare and mental health professionals, as well as data and biotechnology professionals, to ensure better rigor through the research process.

Author contributions

AG-P: Writing – original draft, Data curation, Investigation, Conceptualization. CP-G: Data curation, Writing – original draft, Investigation. DM-F: Writing – original draft, Investigation. JG-E: Writing – original draft, Investigation. SL: Writing – review & editing, Writing – original draft. DF-Q: Supervision, Writing – review & editing, Conceptualization, Investigation, Funding acquisition, Validation.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. Apoyo para Pago de Publicación de Artículos Científicos APPAC-VII-CUCS-2025, Universidad de Guadalajara.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Sekhon S, Gupta V. Mood disorder. In: StatPearls. Treasure Island (FL): StatPearls Publishing (2024). p. 1–26. Available at: http://www.ncbi.nlm.nih.gov/books/NBK558911/ (Accessed July 9, 2024)

Google Scholar

2. World Health Organization. World Mental Health Report: Transforming Mental Health for All. 1st ed Geneva: World Health Organization (2022). p. 1.

Google Scholar

3. Latifian M, Abdi K, Raheb G, Islam SMS, Alikhani R. Stigma in people living with bipolar disorder and their families: a systematic review. Int J Bipolar Disord. (2023) 11:9. doi: 10.1186/s40345-023-00290-y

PubMed Abstract | Crossref Full Text | Google Scholar

4. Arya SS, Dias SB, Jelinek HF, Hadjileontiadis LJ, Pappa A-M. The convergence of traditional and digital biomarkers through AI-assisted biosensing: a new era in translational diagnostics? Biosens Bioelectron. (2023) 235:115387. doi: 10.1016/j.bios.2023.115387

PubMed Abstract | Crossref Full Text | Google Scholar

5. American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders : Fifth Edition Text Revision DSM-5-TRTM. (2022).

Google Scholar

6. Thapa S, Ghimire A, Adhikari S, Bhoi AK, Barsocchi P. Chapter 3—cognitive internet of things (IoT) and computational intelligence for mental well-being. In: Bhoi AK, de Albuquerque VHC, Srinivasu PN, Marques G, editors. Cognitive and Soft Computing Techniques for the Analysis of Healthcare Data. Intelligent Data-Centric Systems. Cambridge, MA: Academic Press (2022). p. 59–77. doi: 10.1016/B978-0-323-85751-2.00004-9

Crossref Full Text | Google Scholar

7. Alonso AKM, Hirt J, Woelfle T, Janiaud P, Hemkens LG. Definitions of digital biomarkers: a systematic mapping of the biomedical literature. BMJ Health Care Inform. (2024) 31:1–12. doi: 10.1136/bmjhci-2023-100914

Crossref Full Text | Google Scholar

8. Kyriazakos S, Pnevmatikakis A, Cesario A, Kostopoulou K, Boldrini L, Valentini V, et al. Discovering composite lifestyle biomarkers with artificial intelligence from clinical studies to enable smart eHealth and digital therapeutic services. Front Digit Health. (2021) 3:648190. doi: 10.3389/fdgth.2021.648190

PubMed Abstract | Crossref Full Text | Google Scholar

9. Sun M, Jung W, Koltermann K, Zhou G, Watson A, Blackwell G, et al. Parkinson’s disease action tremor detection with supervised-leaning models. IEEE Int Conf Connect Health Appl Syst Eng Technol. (2023) 2023:1–10. doi: 10.1145/3580252.3586977

PubMed Abstract | Crossref Full Text | Google Scholar

10. Vasudevan S, Saha A, Tarver ME, Patel B. Digital biomarkers: convergence of digital health technologies and biomarkers. npj Digit Med. (2022) 5:1–3. doi: 10.1038/s41746-022-00583-z

PubMed Abstract | Crossref Full Text | Google Scholar

11. TerKonda SP, TerKonda AA, Sacks JM, Kinney BM, Gurtner GC, Nachbar JM, et al. Artificial intelligence: singularity approaches. Plast Reconstr Surg. (2024) 153:204e. doi: 10.1097/PRS.0000000000010572

PubMed Abstract | Crossref Full Text | Google Scholar

12. Guthrie NL, Carpenter J, Edwards KL, Appelbaum KJ, Dey S, Eisenberg DM, et al. Emergence of digital biomarkers to predict and modify treatment efficacy: machine learning study. BMJ Open. (2019) 9:e030710. doi: 10.1136/bmjopen-2019-030710

PubMed Abstract | Crossref Full Text | Google Scholar

13. Islam M, Hassan S, Akter S, Jibon FA, Sahidullah M. A comprehensive review of predictive analytics models for mental illness using machine learning algorithms. Healthc Anal. (2024) 6:100350. doi: 10.1016/j.health.2024.100350

Crossref Full Text | Google Scholar

14. Nguyen TM, Leow AD, Ajilore O. A review on smartphone keystroke dynamics as a digital biomarker for understanding neurocognitive functioning. Brain Sci. (2023) 13:959. doi: 10.3390/brainsci13060959

PubMed Abstract | Crossref Full Text | Google Scholar

15. Venkatesan A, Rahimi L, Kaur M, Mosunic C. Digital cognitive behavior therapy intervention for depression and anxiety: retrospective study. JMIR Ment Health. (2020) 7:e21304. doi: 10.2196/21304

PubMed Abstract | Crossref Full Text | Google Scholar

16. Doukani A, Quartagno M, Sera F, Free C, Kakuma R, Riper H, et al. Comparison of the working alliance in blended cognitive behavioral therapy and treatment as usual for depression in Europe: secondary data analysis of the E-COMPARED randomized controlled trial. J Med Internet Res. (2024) 26:e47515. doi: 10.2196/47515

PubMed Abstract | Crossref Full Text | Google Scholar

17. Jonathan GK, Abitante G, McBride A, Bernstein-Sandler M, Babington P, Dopke CA, et al. Livewell, a smartphone-based self-management intervention for bipolar disorder: intervention participation and usability analysis. J Affect Disord. (2024) 350:926–36. doi: 10.1016/j.jad.2024.01.099

PubMed Abstract | Crossref Full Text | Google Scholar

18. Graham AK, Greene CJ, Kwasny MJ, Kaiser SM, Lieponis P, Powell T, et al. Coached mobile app platform for the treatment of depression and anxiety among primary care patients. JAMA Psychiatry. (2020) 77:1–9. doi: 10.1001/jamapsychiatry.2020.1011

Crossref Full Text | Google Scholar

19. Kandola A, Edwards K, Muller MA, Dührkoop B, Hein B, Straatman J, et al. Digitally managing depression: a fully remote randomised attention-placebo controlled trial. Digital Health. (2024) 10:20552076241260409. doi: 10.1177/20552076241260409

PubMed Abstract | Crossref Full Text | Google Scholar

20. Ben-Zeev D, Chander A, Tauscher J, Buck B, Nepal S, Campbell A, et al. A smartphone intervention for people with serious mental illness: fully remote randomized controlled trial of CORE. J Med Internet Res. (2021) 23:e29201. doi: 10.2196/29201

PubMed Abstract | Crossref Full Text | Google Scholar

21. Li VW, Sahota J, Dev DK, Gill DD, Evans VC, Axler A, et al. A randomized evaluation of MoodFX, a patient-centred e-health tool to support outcome measurement for depression: une évaluation randomisée de MoodFX, un outil de santé en ligne centré sur le patient pour soutenir la mesure du résultat dans la dépression. Can J Psychiatry. (2024) 69:493–502. doi: 10.1177/07067437241245331

PubMed Abstract | Crossref Full Text | Google Scholar

22. Fundoiano-Hershcovitz Y, Asher IB, Ritholz MD, Feniger E, Manejwala O, Goldstein P. Specifying the efficacy of digital therapeutic tools for depression and anxiety: retrospective, 2-cohort, real-world analysis. J Med Internet Res. (2023) 25:e47350. doi: 10.2196/47350

PubMed Abstract | Crossref Full Text | Google Scholar

23. Cuijpers P, Heim E, Ramia JA, Burchert S, Carswell K, Cornelisz I, et al. Guided digital health intervention for depression in Lebanon: randomised trial. Evid Based Ment Health. (2022) 25:e34–40. doi: 10.1136/ebmental-2021-300416

PubMed Abstract | Crossref Full Text | Google Scholar

24. Klein B, Nguyen H, McLaren S, Andrews B, Shandley K. A fully automated self-help biopsychosocial transdiagnostic digital intervention to reduce anxiety and/or depression and improve emotional regulation and well-being: pre–follow-up single-arm feasibility trial. JMIR Form Res. (2023) 7:e43385. doi: 10.2196/43385

PubMed Abstract | Crossref Full Text | Google Scholar

25. Thati RP, Dhadwal AS, Kumar P, Sainaba P. A novel multi-modal depression detection approach based on mobile crowd sensing and task-based mechanisms. Multimed Tools Appl. (2023) 82:4787–820. doi: 10.1007/s11042-022-12315-2

PubMed Abstract | Crossref Full Text | Google Scholar

26. Zhang Y, Folarin AA, Sun S, Cummins N, Vairavan S, Qian L, et al. Associations between depression symptom severity and daily-life gait characteristics derived from long-term acceleration signals in real-world settings: retrospective analysis. JMIR Mhealth Uhealth. (2022) 10:e40667. doi: 10.2196/40667

PubMed Abstract | Crossref Full Text | Google Scholar

27. Maruani J, Mauries S, Zehani F, Lejoyeux M, Geoffroy PA. Exploring actigraphy as a digital phenotyping measure: a study on differentiating psychomotor agitation and retardation in depression. Acta Psychiatr Scand. (2024) 151(3):401–11. doi: 10.1111/acps.13739

PubMed Abstract | Crossref Full Text | Google Scholar

28. Merikangas KR, Swendsen J, Hickie IB, Cui L, Shou H, Merikangas AK, et al. Real-time mobile monitoring of the dynamic associations among motor activity, energy, mood, and sleep in adults with bipolar disorder. JAMA Psychiatry. (2019) 76:190–8. doi: 10.1001/jamapsychiatry.2018.3546

PubMed Abstract | Crossref Full Text | Google Scholar

29. Carr O, Saunders KEA, Bilderbeck AC, Tsanas A, Palmius N, Geddes JR, et al. Desynchronization of diurnal rhythms in bipolar disorder and borderline personality disorder. Transl Psychiatry. (2018) 8:79. doi: 10.1038/s41398-018-0125-7

PubMed Abstract | Crossref Full Text | Google Scholar

30. Broulidakis MJ, Kiprijanovska I, Severs L, Stankoski S, Gjoreski M, Mavridou I, et al. Optomyography-based sensing of facial expression derived arousal and valence in adults with depression. Front Psychiatry. (2023) 14:1–8. doi: 10.3389/fpsyt.2023.1232433

Crossref Full Text | Google Scholar

31. Zhang Y, Folarin AA, Sun S, Cummins N, Vairavan S, Bendayan R, et al. Longitudinal relationships between depressive symptom severity and phone-measured mobility: dynamic structural equation modeling study. JMIR Ment Health. (2022) 9:e34898. doi: 10.2196/34898

PubMed Abstract | Crossref Full Text | Google Scholar

32. Kandola A, Hayes JF. Real-time air pollution and bipolar disorder symptoms: remote-monitored cross-sectional study. BJPsych Open. (2023) 9:e107. doi: 10.1192/bjo.2023.77

PubMed Abstract | Crossref Full Text | Google Scholar

33. Matteo DD, Fotinos K, Lokuge S, Yu J, Sternat T, Katzman MA, et al. The relationship between smartphone-recorded environmental audio and symptomatology of anxiety and depression: exploratory study. JMIR Form Res. (2020) 4:e18751. doi: 10.2196/18751

PubMed Abstract | Crossref Full Text | Google Scholar

34. Kupferberg A, Hasler G. The social cost of depression: investigating the impact of impaired social emotion regulation, social cognition, and interpersonal behavior on social functioning. J Affect Disord Rep. (2023) 14:100631. doi: 10.1016/j.jadr.2023.100631

Crossref Full Text | Google Scholar

35. Au R, Kolachalama VB, Paschalidis IC. Redefining and validating digital biomarkers as fluid, dynamic multi-dimensional digital signal patterns. Front Digit Health. (2022) 3:751629. doi: 10.3389/fdgth.2021.751629

PubMed Abstract | Crossref Full Text | Google Scholar

36. Song YM, Jeong J, de los Reyes AA, Lim D, Cho C-H, Yeom JW, et al. Causal dynamics of sleep, circadian rhythm, and mood symptoms in patients with major depression and bipolar disorder: insights from longitudinal wearable device data. eBioMedicine. (2024) 103:105094. doi: 10.1016/j.ebiom.2024.105094

PubMed Abstract | Crossref Full Text | Google Scholar

37. Ali FZ, Parsey RV, Lin S, Schwartz J, DeLorenzo C. Circadian rhythm biomarker from wearable device data is related to concurrent antidepressant treatment response. npj Digit Med. (2023) 6:1–11. doi: 10.1038/s41746-023-00827-6

PubMed Abstract | Crossref Full Text | Google Scholar

38. Cormack F, McCue M, Skirrow C, Cashdollar N, Taptiklis N, van Schaik T, et al. Characterizing longitudinal patterns in cognition, mood, and activity in depression with 6-week high-frequency wearable assessment: observational study. JMIR Ment Health. (2024) 11:e46895. doi: 10.2196/46895

PubMed Abstract | Crossref Full Text | Google Scholar

39. Bomyea JA, Parrish EM, Paolillo EW, Filip TF, Eyler LT, Depp CA, et al. Relationships between daily mood states and real-time cognitive performance in individuals with bipolar disorder and healthy comparators: a remote ambulatory assessment study. J Clin Exp Neuropsychol. (2021) 43:813–24. doi: 10.1080/13803395.2021.1975656

PubMed Abstract | Crossref Full Text | Google Scholar

40. Faurholt-Jepsen M, Busk J, Bardram JE, Stanislaus S, Frost M, Christensen EM, et al. Mood instability and activity/energy instability in patients with bipolar disorder according to day-to-day smartphone-based data—an exploratory post hoc study. J Affect Disord. (2023) 334:83–91. doi: 10.1016/j.jad.2023.04.139

PubMed Abstract | Crossref Full Text | Google Scholar

41. Goltermann J, Emden D, Leehr EJ, Dohm K, Redlich R, Dannlowski U, et al. Smartphone-based self-reports of depressive symptoms using the remote monitoring application in psychiatry (ReMAP): interformat validation study. JMIR Ment Health. (2021) 8:e24333. doi: 10.2196/24333

PubMed Abstract | Crossref Full Text | Google Scholar

42. Tonn P, Seule L, Degani Y, Herzinger S, Klein A, Schulze N. Digital content-free speech analysis tool to measure affective distress in mental health: evaluation study. JMIR Form Res. (2022) 6:e37061. doi: 10.2196/37061

PubMed Abstract | Crossref Full Text | Google Scholar

43. Burkhardt HA, Alexopoulos GS, Pullmann MD, Hull TD, Areán PA, Cohen T. Behavioral activation and depression symptomatology: longitudinal assessment of linguistic indicators in text-based therapy sessions. J Med Internet Res. (2021) 23:e28244. doi: 10.2196/28244

PubMed Abstract | Crossref Full Text | Google Scholar

44. Raikar G, Vedant S, Raikar AS, Somnache SN. Advancements in artificial intelligence and machine learning in revolutionising biomarker discovery. Braz J Pharm Sci. (2023) 59:e23146. doi: 10.1590/s2175-97902023e23146

Crossref Full Text | Google Scholar

45. Alnuaimi AFAH, Albaldawi THK. An overview of machine learning classification techniques. BIO Web Conf. (2024) 97:00133. doi: 10.1051/bioconf/20249700133

Crossref Full Text | Google Scholar

46. Jacobson NC, Weingarden H, Wilhelm S. Digital biomarkers of mood disorders and symptom change. npj Digit Med. (2019) 2:1–3. doi: 10.1038/s41746-019-0078-0

PubMed Abstract | Crossref Full Text | Google Scholar

47. McGowan Nm, Goodwin Gm, Bilderbeck Ac, Saunders Kea. Actigraphic patterns, impulsivity and mood instability in bipolar disorder, borderline personality disorder and healthy controls. Acta Psychiatr Scand. (2020) 141:374–84. doi: 10.1111/acps.13148

PubMed Abstract | Crossref Full Text | Google Scholar

48. Corponi F, Li BM, Anmella G, Mas A, Pacchiarotti I, Valentí M, et al. Automated mood disorder symptoms monitoring from multivariate time-series sensory data: getting the full picture beyond a single number. Transl Psychiatry. (2024) 14:1–9. doi: 10.1038/s41398-024-02876-1

PubMed Abstract | Crossref Full Text | Google Scholar

49. Langholm C, Breitinger S, Gray L, Goes F, Walker A, Xiong A, et al. Classifying, and clustering mood disorder patients using smartphone data from a feasibility study. npj Digit Med. (2023) 6:1–6. doi: 10.1038/s41746-023-00977-7

PubMed Abstract | Crossref Full Text | Google Scholar

50. Opoku Asare K, Moshe I, Terhorst Y, Vega J, Hosio S, Baumeister H, et al. Mood ratings and digital biomarkers from smartphone and wearable data differentiates and predicts depression status: a longitudinal data analysis. Pervasive Mob Comput. (2022) 83:101621. doi: 10.1016/j.pmcj.2022.101621

Crossref Full Text | Google Scholar

51. Anmella G, Mas A, Sanabra M, Valenzuela-Pascual C, Valentí M, Pacchiarotti I, et al. Electrodermal activity in bipolar disorder: differences between mood episodes and clinical remission using a wearable device in a real-world clinical setting. J Affect Disord. (2024) 345:43–50. doi: 10.1016/j.jad.2023.10.125

PubMed Abstract | Crossref Full Text | Google Scholar

52. Valenzuela-Pascual C, Mas A, Borràs R, Anmella G, Sanabra M, González-Campos M, et al. Sleep–wake variations of electrodermal activity in bipolar disorder. Acta Psychiatr Scand. (2025) 151(3):412–25. doi: 10.1111/acps.13718

PubMed Abstract | Crossref Full Text | Google Scholar

53. Whiston A, Igou ER, Fortune DG, Semkovska M. Longitudinal interactions between residual symptoms and physiological stress in the remitted symptom network structure of depression. Acta Psychol (Amst). (2023) 241:104078. doi: 10.1016/j.actpsy.2023.104078

PubMed Abstract | Crossref Full Text | Google Scholar

54. Palmius N, Tsanas A, Saunders KEA, Bilderbeck AC, Geddes JR, Goodwin GM, et al. Detecting bipolar depression from geographic location data. IEEE Trans Biomed Eng. (2017) 64:1761–71. doi: 10.1109/TBME.2016.2611862

PubMed Abstract | Crossref Full Text | Google Scholar

55. Raugh IM, James SH, Gonzalez CM, Chapman HC, Cohen AS, Kirkpatrick B, et al. Geolocation as a digital phenotyping measure of negative symptoms and functional outcome. Schizophr Bull. (2020) 46:1596–607. doi: 10.1093/schbul/sbaa121

PubMed Abstract | Crossref Full Text | Google Scholar

56. O’Brien JT, Gallagher P, Stow D, Hammerla N, Ploetz T, Firbank M, et al. A study of wrist-worn activity measurement as a potential real-world biomarker for late-life depression. Psychol Med. (2017) 47:93–102. doi: 10.1017/S0033291716002166

Crossref Full Text | Google Scholar

57. Sverdlov O, Curcic J, Hannesdottir K, Gou L, De Luca V, Ambrosetti F, et al. A study of novel exploratory tools, digital technologies, and central nervous system biomarkers to characterize unipolar depression. Front Psychiatry. (2021) 12:640741. doi: 10.3389/fpsyt.2021.640741

PubMed Abstract | Crossref Full Text | Google Scholar

58. Birnbaum ML, Abrami A, Heisig S, Ali A, Arenare E, Agurto C, et al. Acoustic and facial features from clinical interviews for machine learning–based psychiatric diagnosis: algorithm development. JMIR Ment Health. (2022) 9:e24699. doi: 10.2196/24699

PubMed Abstract | Crossref Full Text | Google Scholar

59. Larsen E, Murton O, Song X, Joachim D, Watts D, Kapczinski F, et al. Validating the efficacy and value proposition of mental fitness vocal biomarkers in a psychiatric population: prospective cohort study. Front Psychiatry. (2024) 15:1–22. doi: 10.3389/fpsyt.2024.1342835

Crossref Full Text | Google Scholar

60. Vesel C, Rashidisabet H, Zulueta J, Stange JP, Duffecy J, Hussain F, et al. Effects of mood and aging on keystroke dynamics metadata and their diurnal patterns in a large open-science sample: a BiAffect iOS study. J Am Med Inform Assoc. (2020) 27:1007–18. doi: 10.1093/jamia/ocaa057

PubMed Abstract | Crossref Full Text | Google Scholar

61. Pedrelli P, Fedor S, Ghandeharioun A, Howe E, Ionescu DF, Bhathena D, et al. Monitoring changes in depression severity using wearable and mobile sensors. Front Psychiatry. (2020) 11:1–11. doi: 10.3389/fpsyt.2020.584711

PubMed Abstract | Crossref Full Text | Google Scholar

62. Martin-Key NA, Mirea D-M, Olmert T, Cooper J, Han SYS, Barton-Owen G, et al. Toward an extended definition of major depressive disorder symptomatology: digital assessment and cross-validation study. JMIR Form Res. (2021) 5:e27908. doi: 10.2196/27908

PubMed Abstract | Crossref Full Text | Google Scholar

63. Miskowiak KW, Jespersen AE, Obenhausen K, Hafiz P, Hestbæk E, Gulyas L, et al. Internet-based cognitive assessment tool: sensitivity and validity of a new online cognition screening tool for patients with bipolar disorder. J Affect Disord. (2021) 289:125–34. doi: 10.1016/j.jad.2021.04.039

PubMed Abstract | Crossref Full Text | Google Scholar

64. Toma M, Wei OC. Predictive modeling in medicine. Encyclopedia. (2023) 3:590–601. doi: 10.3390/encyclopedia3020042

Crossref Full Text | Google Scholar

65. Shuaib A, Arian H, Shuaib A. The increasing role of artificial intelligence in health care: will robots replace doctors in the future? Int J Gen Med. (2020) 13:891–6. doi: 10.2147/IJGM.S268093

PubMed Abstract | Crossref Full Text | Google Scholar

66. Cho C-H, Lee T, Kim M-G, In HP, Kim L, Lee H-J. Mood prediction of patients with mood disorders by machine learning using passive digital phenotypes based on the circadian rhythm: prospective observational cohort study. J Med Internet Res. (2019) 21:e11029. doi: 10.2196/11029

PubMed Abstract | Crossref Full Text | Google Scholar

67. Curtiss J, Fulford D, Hofmann SG, Gershon A. Network dynamics of positive and negative affect in bipolar disorder. J Affect Disord. (2019) 249:270–7. doi: 10.1016/j.jad.2019.02.017

PubMed Abstract | Crossref Full Text | Google Scholar

68. Vairavan S, Rashidisabet H, Li QS, Ness S, Morrison RL, Soares CN, et al. Personalized relapse prediction in patients with major depressive disorder using digital biomarkers. Sci Rep. (2023) 13:18596. doi: 10.1038/s41598-023-44592-8

PubMed Abstract | Crossref Full Text | Google Scholar

69. Busk J, Faurholt-Jepsen M, Frost M, Bardram JE, Vedel Kessing L, Winther O. Forecasting mood in bipolar disorder from smartphone self-assessments: hierarchical Bayesian approach. JMIR Mhealth Uhealth. (2020) 8:e15028. doi: 10.2196/15028

PubMed Abstract | Crossref Full Text | Google Scholar

70. Jacobson NC, Chung YJ. Passive sensing of prediction of moment-to-moment depressed mood among undergraduates with clinical levels of depression sample using smartphones. Sensors. (2020) 20:3572. doi: 10.3390/s20123572

PubMed Abstract | Crossref Full Text | Google Scholar

71. Tseng Y-C, Lin EC, Wu CH, Huang H-L, Chen PS. Associations among smartphone app-based measurements of mood, sleep and activity in bipolar disorder. Psychiatry Res. (2022) 310:114425. doi: 10.1016/j.psychres.2022.114425

PubMed Abstract | Crossref Full Text | Google Scholar

72. Palmius N, Saunders KEA, Carr O, Geddes JR, Goodwin GM, Vos MD. Group-personalized regression models for predicting mental health scores from objective Mobile phone data streams: observational study. J Med Internet Res. (2018) 20:e10194. doi: 10.2196/10194

PubMed Abstract | Crossref Full Text | Google Scholar

73. Lee H-J, Cho C-H, Lee T, Jeong J, Yeom JW, Kim S, et al. Prediction of impending mood episode recurrence using real-time digital phenotypes in major depression and bipolar disorders in South Korea: a prospective nationwide cohort study. Psychol Med. (2023) 53:5636–44. doi: 10.1017/S0033291722002847

PubMed Abstract | Crossref Full Text | Google Scholar

74. Balliu B, Douglas C, Seok D, Shenhav L, Wu Y, Chatzopoulou D, et al. Personalized mood prediction from patterns of behavior collected with smartphones. npj Digit Med. (2024) 7:1–14. doi: 10.1038/s41746-024-01035-6

PubMed Abstract | Crossref Full Text | Google Scholar

75. Ross MK, Demos AP, Zulueta J, Piscitello A, Langenecker SA, McInnis M, et al. Naturalistic smartphone keyboard typing reflects processing speed and executive function. Brain Behav. (2021) 11:e2363. doi: 10.1002/brb3.2363

PubMed Abstract | Crossref Full Text | Google Scholar

76. Zulueta J, Piscitello A, Rasic M, Easter R, Babu P, Langenecker SA, et al. Predicting mood disturbance severity with mobile phone keystroke metadata: a BiAffect digital phenotyping study. J Med Internet Res. (2018) 20:e241. doi: 10.2196/jmir.9775

PubMed Abstract | Crossref Full Text | Google Scholar

77. Kathan A, Harrer M, Küster L, Triantafyllopoulos A, He X, Milling M, et al. Personalised depression forecasting using mobile sensor data and ecological momentary assessment. Front Digit Health. (2022) 4:1–15. doi: 10.3389/fdgth.2022.964582

Crossref Full Text | Google Scholar

78. Ebner-Priemer UW, Mühlbauer E, Neubauer AB, Hill H, Beier F, Santangelo PS, et al. Digital phenotyping: towards replicable findings with comprehensive assessments and integrative models in bipolar disorders. Int J Bipolar Disord. (2020) 8:35. doi: 10.1186/s40345-020-00210-4

PubMed Abstract | Crossref Full Text | Google Scholar

79. Wu C-H, Hsu J-H, Liou C-R, Su H-Y, Lin EC-L, Chen P-S. Automatic bipolar disorder assessment using machine learning with smartphone-based digital phenotyping. IEEE Access. (2023) 11:121845–58. doi: 10.1109/ACCESS.2023.3328342

Crossref Full Text | Google Scholar

80. Bai R, Xiao L, Guo Y, Zhu X, Li N, Wang Y, et al. Tracking and monitoring mood stability of patients with Major depressive disorder by machine learning models using passive digital data: prospective naturalistic multicenter study. JMIR Mhealth Uhealth. (2021) 9:e24365. doi: 10.2196/24365

PubMed Abstract | Crossref Full Text | Google Scholar

81. Moshe I, Terhorst Y, Opoku Asare K, Sander LB, Ferreira D, Baumeister H, et al. Predicting symptoms of depression and anxiety using smartphone and wearable data. Front Psychiatry. (2021) 12:1–12. doi: 10.3389/fpsyt.2021.625247

Crossref Full Text | Google Scholar

82. Chen J, Chan NY, Li C-T, Chan JWY, Liu Y, Li SX, et al. Multimodal digital assessment of depression with actigraphy and app in Hong Kong Chinese. Transl Psychiatry. (2024) 14:150. doi: 10.1038/s41398-024-02873-4

PubMed Abstract | Crossref Full Text | Google Scholar

83. Zhang Y, Folarin AA, Sun S, Cummins N, Ranjan Y, Rashid Z, et al. Predicting depressive symptom severity through Individuals’ nearby bluetooth device count data collected by Mobile phones: preliminary longitudinal study. JMIR Mhealth Uhealth. (2021) 9:e29840. doi: 10.2196/29840

PubMed Abstract | Crossref Full Text | Google Scholar

84. Lewis KJS, Tilling K, Gordon-Smith K, Saunders KEA, Florio AD, Jones L, et al. The dynamic interplay between sleep and mood: an intensive longitudinal study of individuals with bipolar disorder. Psychol Med. (2023) 53:3345–54. doi: 10.1017/S0033291721005377

PubMed Abstract | Crossref Full Text | Google Scholar

85. Eichstaedt JC, Smith RJ, Merchant RM, Ungar LH, Crutchley P, Preoţiuc-Pietro D, et al. Facebook Language predicts depression in medical records. Proc Natl Acad Sci USA. (2018) 115:11203–8. doi: 10.1073/pnas.1802331115

PubMed Abstract | Crossref Full Text | Google Scholar

86. Luo W, Phung D, Tran T, Gupta S, Rana S, Karmakar C, et al. Guidelines for developing and reporting machine learning predictive models in biomedical research: a multidisciplinary view. J Med Internet Res. (2016) 18:e5870. doi: 10.2196/jmir.5870

Crossref Full Text | Google Scholar

87. Rykov Y, Thach T-Q, Lee E, Christopoulos G, Car J. Digital biomarkers for depression screening with wearable devices: cross-sectional study with machine learning modeling—pMC. JMIR Mhealth Uhealth. (2021) 9:1–24. doi: 10.2196/24872

Crossref Full Text | Google Scholar

88. Biau DJ, Kernéis S, Porcher R. Statistics in brief: the importance of sample size in the planning and interpretation of medical research. Clin Orthop Relat Res. (2008) 466:2282–8. doi: 10.1007/s11999-008-0346-9

PubMed Abstract | Crossref Full Text | Google Scholar

89. Bailly A, Blanc C, Francis É, Guillotin T, Jamal F, Wakim B, et al. Effects of dataset size and interactions on the prediction performance of logistic regression and deep learning models. Comput Methods Programs Biomed. (2022) 213:106504. doi: 10.1016/j.cmpb.2021.106504

PubMed Abstract | Crossref Full Text | Google Scholar

90. Li K, Persaud D, Choudhary K, DeCost B, Greenwood M, Hattrick-Simpers J. Exploiting redundancy in large materials datasets for efficient machine learning with less data. Nat Commun. (2023) 14:7283. doi: 10.1038/s41467-023-42992-y

PubMed Abstract | Crossref Full Text | Google Scholar

91. Chen I-M, Chen Y-Y, Liao S-C, Lin Y-H. Development of digital biomarkers of mental illness via Mobile apps for personalized treatment and diagnosis. J Pers Med. (2022) 12:936. doi: 10.3390/jpm12060936

PubMed Abstract | Crossref Full Text | Google Scholar

92. Motahari-Nezhad H, Al-Abdulkarim H, Fgaier M, Abid MM, Péntek M, Gulácsi L, et al. Digital biomarker–based interventions: systematic review of systematic reviews. J Med Internet Res. (2022) 24:e41042. doi: 10.2196/41042

PubMed Abstract | Crossref Full Text | Google Scholar

93. Walzel S, Mikus R, Rafl-Huttova V, Rozanek M, Bachman TE, Rafl J. Evaluation of leading smartwatches for the detection of hypoxemia: comparison to reference oximeter. Sensors. (2023) 23:9164. doi: 10.3390/s23229164

PubMed Abstract | Crossref Full Text | Google Scholar

94. Landers C, Vayena E, Amann J, Blasimme A. Stuck in translation: stakeholder perspectives on impediments to responsible digital health. Front Digit Health. (2023) 5:1069410. doi: 10.3389/fdgth.2023.1069410

PubMed Abstract | Crossref Full Text | Google Scholar

95. Prakash S, Balaji JN, Joshi A, Surapaneni KM. Ethical conundrums in the application of artificial intelligence (AI) in healthcare—A scoping review of reviews. J Pers Med. (2022) 12:1914. doi: 10.3390/jpm12111914

PubMed Abstract | Crossref Full Text | Google Scholar

96. Andreoletti M, Haller L, Vayena E, Blasimme A. Mapping the ethical landscape of digital biomarkers: a scoping review. PLOS Digit Health. (2024) 3:e0000519. doi: 10.1371/journal.pdig.0000519

PubMed Abstract | Crossref Full Text | Google Scholar

97. European Union. GDPR Law. GDPR.eu Available at: https://gdpr.eu/tag/gdpr/ (Accessed January 11, 2025).

Google Scholar

98. Hurd TC, Cobb Payton F, Hood DB. Targeting machine learning and artificial intelligence algorithms in health care to reduce bias and improve population health. Milbank Q. (2024) 102:577–604. doi: 10.1111/1468-0009.12712

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: digital biomarkers, mood disorders, predictive model, machine learning, bipolar disorder, depression, major depressive disorder

Citation: Garzón-Partida AP, Padilla-Gómez CB, Martínez-Fernández DE, García-Estrada J, Luquin S and Fernández-Quezada D (2025) The implementation of digital biomarkers in the diagnosis, treatment and monitoring of mood disorders: a narrative review. Front. Digit. Health 7:1595243. doi: 10.3389/fdgth.2025.1595243

Received: 17 March 2025; Accepted: 4 June 2025;
Published: 17 June 2025.

Edited by:

Björn Wolfgang Schuller, Imperial College London, United Kingdom

Reviewed by:

Mihaela Dinsoreanu, Technical University of Cluj-Napoca, Romania
Francisco Maria Calisto, University of Lisbon, Portugal

Copyright: © 2025 Garzón-Partida, Padilla-Gómez, Martínez-Fernández, García-Estrada, Luquin and Fernández-Quezada. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: David Fernández-Quezada, ZGF2aWQuZmVybmFuZGV6QGFjYWRlbWljb3MudWRnLm14

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.