- 1Institute of Health Care (ICS-IAS), Girona, Spain
- 2Control Engineering and Intelligent Systems (eXiT), University of Girona, Girona, Spain
- 3Sant Joan de Deu-Numancia Health Park, Barcelona, Spain
- 4Ibero-American Cochrane Center (CCIb), Barcelona, Spain
- 5Department of Clinical Pharmacology, Valll d’Hebron Barcelona Hospital Campus, Barcelona, Spain
- 6TransLab Research Group, Department of Medical Sciences, University of Girona, Girona, Spain
Introduction: Clinical practice guidelines (CPGs) have several limitations, namely: obsolescence, lack of personalization, and insufficient patient participation. These factors may contribute to suboptimal treatment recommendation compliance and poorer clinical outcomes. APPRAISE-RS is an adaptation of the GRADE heuristic designed to generate CPG-like treatment recommendations that are automated, updated, personalized, participatory, and explanatory using a symbolic AI approach. TDApp is a clinical decision support system (CDSS) that implements APPRAISE-RS for ADHD.
Methods: Two clinical trials were conducted. In both studies a total of 33 and 32 ADHD patients, respectively, requiring treatment initiation or a major treatment change were enrolled. TDApp recommendations were compared to those of selected CPGs (American Academy of Pediatrics, National Institute for Health and Care Excellence, Spanish Health System, Canadian ADHD Resource Alliance, and the Australasian ADHD Professionals Association) CPGs. The diversity of treatment recommendations was analyzed using Blau’s index. Concordance between TDApp and CPGs recommendations was assessed by calculating the proportion of patients for whom TDApp recommended one drug that was also endorsed by CPGs. Dendrograms were plotted to compare the distance between treatment recommendations as calculated using the NbN nomenclature.
Results: The first study investigated eight methods that differed in how patient and clinician preferred outcomes were handled and the extent to which TDApp tailored the analysis of evidence. The method deemed most suitable was examined in the second study, which found that 50-75% of the patients received at least one favorable treatment recommendation. TDApp evaluated over 10 drugs, including recently marketed ones, with amphetamine derivatives emerging as the most frequently recommended interventions. TDApp generated 8–12 distinct treatment recommendations with a diversity index of 0.70-0.88, which was higher than those of CPGs. The proportion of patients for whom TDApp recommendations overlapped with at least one drug endorsed by CPGs ranged from 21.9% to 100%. Dendrogram analysis revealed that TDApp was positioned on one side of the tree, while CPGs clustered together on the opposite side.
Conclusions: TDApp is an advanced prototype of an CDSS offering automated, participatory, personalized, and explanatory treatment recommendations for ADHD. It represents a promising alternative to CPGs for aiding clinicians and patients in shared treatment decision-making.
Introduction
As medical research continues to expand, evidence-based medicine (EBM) — a model of care that integrates scientific evidence, clinical expertise, and patient values — has become a cornerstone in bridging the gap between the best available evidence, policy and practice in healthcare (1, 2). Clinical practice guidelines (CPGs) play a pivotal role in EBM, serving as systematic tools for appraising evidence and formulating recommendations. In the context of CPG creation, the Institute of Medicine (IOM) recommends using a series of sequential steps for developing CPGs, including formulating a clinical question using the PICO (Patient, Intervention, Comparison, and Outcome) format, searching for relevant studies, conducting a meta-analysis for each comparison and outcome, determining the confidence (quality) in the evidence of effects, assessing the risk-benefit relationship of each intervention, generating treatment recommendations based on the previous analyses, and assigning strength to these recommendations (3). More specifically, the GRADE (Grading of Recommendations Assessment, Development, and Evaluation) Working Group has developed a structured approach to assessing the certainty of evidence regarding effects and the strength of recommendations, as well as frameworks to support the process of translating evidence into decisions. These frameworks also incorporate the assessment of cost and resource use, the impact on inequity, the acceptability of the intervention, and its feasibility for implementation (4, 5).
Treatment recommendations in CPGs have several limitations. One key issue is obsolescence, with 20% of recommendations becoming outdated within a few years of a CPG’s publication (6, 7). Another limitation is oversimplification, as many guidelines adopt a “one-size-fits-all” approach, overlooking variations in disease severity and patient comorbidities when making recommendations (8–10). A further limitation is lack of prospective validation beyond face validity (11, 12), which is somewhat contradictory because CPGs aim to translate scientific evidence into bedside practice, yet their validity are seldom investigated. Centralization is a concern, as the clinician that makes the decision is not involved in the CPG development and because CPGs are often sponsored by governmental organizations or scientific societies and developed by experts with financial ties to companies that either market products, medical devices or tests related to the disease addressed in the CPGs (13–15). Furthermore, patient involvement in the development of CPGs, while desirable, remains limited (16), and their implementation often lacks formats that facilitate patient participation in shared decision-making (17). This centralization may lead to negative perceptions among clinicians and patients (18), who feel their priorities differ from those of CPG committees. As a result, CPGs may be seen as tools to control costs and reduce clinician autonomy, rather than enhance care (19). Altogether, these shortcomings might explain why studies frequently report clinicians’ non-compliance with CPGs as relatively common (20, 21). This lower compliance with CPGs recommendations has negative clinical consequences, as several studies across different specialties have shown that non-compliance leads to poorer treatment outcomes (22–24). Research has demonstrated that non-adherence to CPGs is linked to higher medical costs resulting from prolonged treatment durations, higher complication rates, and the need for more intensive medical interventions (25).
Our team has developed APPRAISE-RS (Automated, uPdated, Participatory, and peRsonAlISEd treatment Recommender System), which adapts the GRADE heuristic to generate clinical recommendations that are automated, updated, personalized, participatory, and explanatory using a symbolic AI approach (26). According to the patient’s characteristics, APPRAISE-RS identifies the RCTs for which the patient meets the inclusion criteria, applies the findings from these studies on the clinician and patient-preferred outcomes to evaluate the risk-benefit relationship of the investigated interventions, and generates treatment recommendations based on these assessments. Despite the importance of cost in treatment selection, APPRAISE-RS does not currently incorporate cost considerations due to significant price variations both within and across countries (27, 28), nor does it address the intervention’s impact on inequity or its feasibility of implementation, as these factors are more relevant to public health policy and healthcare organization than to clinical practice.
APPRAISE-RS addresses the previously mentioned shortcomings of CPGs. It tackles obsolescence by leveraging an updated dataset of studies and overcomes oversimplification by analyzing evidence from RCTs in which the patient meets the inclusion criteria, thereby generating evidence directly applicable to the individual patient. This is attained by cross-referencing patient characteristics with the inclusion and exclusion criteria of RCTs. Centralization is addressed by making EBM participatory and facilitating shared decision making (29). This is attained through integrating patient and clinician preferences, conducting evidence-based analyses of available treatment options based on these preferences, and generating tailored treatment recommendations at the point of care. Finally, APPRAISE-RS ensures that treatment recommendations are explanatory by producing a therapeutic report that presents the results of the analysis of the evidence for each intervention and critical preference. Other artificial intelligence (AI)-based systems/methods/tools have been explored to address some of the shortcomings of the CPGs, including obsolescence (30), lack of personalization in clinical recommendations (31), and insufficient patient participation (32), however, to our knowledge, no other system besides APPRAISE-RS addresses all of these challenges in such a comprehensive manner.
To test the clinical utility of APPRAISE-RS, we selected attention deficit hyperactivity disorder (ADHD) as a proof of concept before extending the approach to other conditions. APPRAISE-RS/TDApp (TDApp for simplicity, available at https://tdapp.org/en/) is a clinical decision support system (CDSS) that implements APPRAISE-RS for ADHD patients (26). ADHD was selected due to its high prevalence health and its substantial consequences, including academic failure, an increased risk of substance use, accidents, and even mortality (33). Furthermore, evidence suggests that providing suitable ADHD treatment is associated with a reduced risk of accidents (34), academic underachievement (35), and legal issues (36). However, ADHD CPGs are subject to the limitations described above, which may contribute to the observed suboptimal clinician adherence to treatment recommendations (37, 38), significant variability in treatment approaches both within and across regions in numerous countries (39–42), and low patient initiation, adherence, and persistence with prescribed treatments (43, 44).
A series of studies have been designed to validate TDApp. In a preliminary in silico validation study with simulated patients (26), TDApp generated more updated and personalized treatment recommendations compared to ADHD CPGs, highlighting its potential to address issues of obsolescence and oversimplification in CPGs. Following this in silico study, TDApp was clinically tested in ADHD patients. This article reports the findings from the first two clinical studies, conducted consecutively between 2020 and 2024, which aimed to compare TDApp treatment recommendations with those from selected CPGs in ADHD patients. In Study 1, multiple methods for analyzing the scientific evidence and treatment recommendations were studied, with recommendations hidden from clinicians during data collection. In Study 2, a single method was implemented, and TDApp recommendations were made visible to clinicians during clinical visits.
Study 1
Methodology:
Study design
The study is a non-randomized, non-controlled clinical trial.
Patients
The primary objective of the study was to compare TDApp treatment recommendations with those of CPGs by analyzing the diversity and similarity of the recommendations using Blau’s diversity index and dendrograms (see Statistical analysis section). As neither of these methods involves the formulation of an explicit null or alternative hypothesis, a formal sample size calculation was not applicable. Therefore, no power analysis was conducted, and the sample was set at 55 patients. Enrollment began in August 2020 and was halted in May 2022 after 33 patients were included, due to disruptions in standard healthcare operations caused by the COVID-19 pandemic, which impeded normal enrollment during the scheduled data collection period.
The study included male and female patients aged 6–65 years with a DSM-5 diagnosis of ADHD based on clinical assessment, who required initiation or modification of treatment. Exclusion criteria comprised patients already receiving adequate treatment or those requiring minor medication adjustments (e.g., a change in dose or formulation), as well as patients lacking an electronic device with internet access. Adult patients and parents of minors with ADHD provided signed informed consent, while adolescent patients provided assent. Recruitment occurred at 3 study sites in Girona (Catalonia, Spain).
Independent variables:
The independent variable was the treatment recommender system for ADHD patients. TDApp was compared with five relevant ADHD CPGs. We included the most recent versions from four internationally recognized institutions with global impact: the American Academy of Pediatrics (45), National Institute for Health and Care Excellence (46), Canadian ADHD Resource Alliance (47), and the Australasian ADHD Professionals Association (48). Also, the CPG from the Spanish Health System (SHS) (49) was included, as the clinical study was conducted in Spain.
TDApp (26) is an AI-based CDSS that generates automated, updated, personalized, participatory, and explanatory treatment recommendations for ADHD patients. The process starts with gathering patient’s demographic and clinical characteristics. Clinicians’ and patients’ or parents’ “preferences” on treatment goals are collected from a list of 18 treatment preferences. Patients and clinicians rate each preference from 1 (not important) to 9 (very important), with preferences rated 7 or higher classified as “critical”. To enhance patient and parent participation, TDApp includes an education module that provides information on ADHD, treatment options and treatment preferences. This module aims to improve health literacy surrounding ADHD and empower patients and parents in the treatment decision-making process by guiding the selection of critical outcomes that TDApp will analyze to assess the risk-benefit relationship of the interventions analyzed. TDApp uses patient characteristics and both patient and clinician critical preferences to formulate PICO questions. In this framework, “P” represents the patient seeking mental health care, “I” corresponds to the pharmacological interventions examined in RCTs, “C” refers to drug vs placebo comparisons, and “O” to the critical preferences selected by the patient and clinician. Next, TDApp answers the PICO question. By selecting RCTs from a curated dataset that match the patient’s demographic and clinical characteristics. TDApp then generates ad hoc evidence for each intervention and “critical preference” by applying meta-analysis techniques. The evidence quality is assessed based on the risk of bias (rated from “high” to “very low”). Using the selected critical efficacy and safety outcomes, TDApp makes the risk-benefit relationship judgment based on predefined rules (26) that can be summarized with the following rule of thumb: an intervention has a suitable risk-benefit relationship if, compared to a placebo, it improves the majority of critical efficacy outcomes and does not increase the risk of any critical safety outcome or treatment discontinuation. Conversely, an intervention has an unsuitable risk-benefit relationship if it fails to improve the majority of critical efficacy outcomes and increases the risk of any critical safety outcomes or treatment discontinuation. In all other cases, the risk-benefit judgment is considered “neither favorable nor unfavorable”. Simultaneously, the quality of this judgment is rated from “high” to “low”. Finally, a clinical recommendation is generated from the risk-benefit judgement and its quality, categorized as “strongly in favor”, “weakly in favor”, “weakly against”, or “strongly against” the intervention.
As this was the first TDApp clinical study, treatment recommendations were withheld from clinicians until data collection was completed. Eight methods were employed to integrate patient and clinician critical preferences and to group RCTs into meta-analysis. Regarding procedures for combining patient and clinician critical preferences, two methods were used: the “comprehensive” method, which considered all critical preferences selected by either patients and clinicians, and the “conjoint” method, which only used the preferences selected by both the patient and the clinician. With regard to procedures for combining data from different RCTs in the meta-analysis, four levels (L1, L2, L3 and L4) of data pooling were implemented (see ESM 1 and 2 for examples). In L1, a stringent patient-RCT match based on demographic and clinical characteristics was required, and meta-analyses were performed for each drug and dose. As the level of analysis advanced, these criteria were relaxed.
Outcome measures:
The study outcomes analyzed in this article are the treatment recommendations provided by TDApp to patients and those outlined by selected CPGs. The degree of concordance between TDApp’s recommendations and the treatment recommended by other CPGs was assessed using a measure of pharmacological distance, derived from the NbN nomenclature (50). This nomenclature categorizes drugs based on their pharmacological targets and mode of action within the central nervous system. The pharmacological distances in our study range from 0 to 33, with larger values observed between drugs that differ in their targets and modes of action. For instance, the pharmacological distance between methylphenidate and dexmethylphenidate, bupropion, atomoxetine, guanfacine and no treatment is 1, 7, 15, 19 and 33, respectively (see ESM 3 for more details and the full distance matrix, and https://tdapp.org/en/resources/ for future updates as new drugs are investigated).
Additionally, data on the treatment prescribed, patient/parent satisfaction with the participation in treatment decision-making, the educational materials, TDApp usability, treatment effectiveness, safety, physiological measures, and patient satisfaction with the treatment were collected and will be reported separately.
Procedures:
The study comprised four visits. During the first visit, informed consent/assent was requested. If the patient/parents agreed to participate, sociodemographic and clinical data were collected during the second visit. This included information on the type and severity of ADHD, clinical global impression, the history of pharmacological treatment for ADHD, and comorbidities and their treatment. Additionally, patients were encouraged to use TDApp’s educational material to help them make informed decisions about treatment goals. During the subsequent visit, which occurred 3–5 days later, patients/parents reported their preferences regarding therapeutic goals. At the end of the third visit, clinical investigators prescribed the treatment they deemed most appropriate. Data on patient/parent satisfaction with the health decision-making process, the educational materials, and TDApp usability and simplicity were also collected. The final visit was scheduled for week 3, during which ADHD symptom severity, patient/parent satisfaction with the treatment, and assessments of adverse events were collected.
The study was registered at clinicaltrials.gov (NCT04228094) and obtained approval from the Ethics Committee of Research with Medicines in Girona (Catalonia, Spain).
Statistical analysis:
A descriptive analysis was conducted on patients’ preferences, evidence-related variables, and treatment recommendations. The McNemar test was used to compare dichotomous variables, while the Wilcoxon signed rank and Friedman’s tests were employed to assess differences between two and more than two groups, respectively, as the continuous variables were non-normally distributed. To assesses the diversity in treatment recommendations, Blau’s index was calculated. This index captures both the number of categories (variety) and the proportional distribution of elements across those categories. Its values range from 0 (indicating no diversity, where all elements belong to a single category) to 1 (maximum diversity, where elements are evenly distributed across all categories). Furthermore, the proportion of patients for which TDApp recommended a drug that was also recommended by CPGs was determined. Dendrograms were plotted to compare the concordance between TDApp and CPG recommendations by examining the pharmacological distance between the recommended drugs. Dendrograms are hierarchical visual representations that showcase similarities (in treatment recommendations) between elements (in this study, recommender systems) through branching structures based on clustering algorithms. In this study, dendrograms group recommender systems according to varying levels of similarity in their treatment recommendations, as determined by the pharmacological intervention distance matrix. Recommender systems that cluster together at lower levels of the tree provide more similar (concordant) recommendations compared to those clustering at higher levels. All analyses were conducted using data from patients whose preferences were available (N=30), except for the patient baseline description, which included all patients (N=33).
Results
Patients
Thirty-three patients were enrolled, 3 of whom were lost at follow up. Table 1 shows the patients’ baseline characteristics. Most patients were children/adolescents, male, and had an ADHD combined subtype. Comorbidities were common, with one-third of patients having a comorbid neurodevelopmental disorder. Almost half of the patients were receiving psychological support for ADHD.

Table 1. Patient baseline characteristics in Study 1 with age is expressed as median and ADHD symptom severity as mean.
Preferences
The mean number of preferences rated as critical was high among patients (mean = 9.2) and moderate among clinicians (mean = 4.5), resulting in a mean of 11.0 critical preferences in the comprehensive method and 3.5 in the conjoint method. The most common critical preferences were symptom improvement, clinical impressions regarding efficacy, and safety concerns such as seizures and vasovagal syncope. Among clinicians, the most common preferences were improvement of symptoms, quality of life, vasovagal syncope and seizures. Overall, 31 distinct combinations of critical preferences for 33 patients were considered using the comprehensive method, and 25 using the conjoint method.
Meta-analysis and quality
The number of interventions analyzed ranged from 9 to 18 and varied across levels of analysis, while the number of meta-analyses and the number of studies included in each meta-analysis varied depending on both the level of analysis and the method used to handle clinician and patient preferences (ESM 4). The overall quality of the evidence generated through meta-analyses was low. In general, only minor differences in quality were observed based on the level of analysis and the method used to incorporate clinician and patient preferences. The number of quality points lost due to bias and heterogeneity differed across levels and no clear pattern emerged. Imprecision was the main factor contributing to loss of quality points, with the greatest impact observed at the lowest levels of analysis.
Recommendations and comparison with CPGs
The proportion of patients receiving treatment recommendations varied widely depending on the method used for combining the critical preferences selected by clinicians and patient/parents, as well as on the level of analysis (ESM 5). Fewer patients received recommendations “in favor” when employing the comprehensive method compared to the conjoint method. In contrast, no clear trend emerged for recommendations “against”. Regarding the level of analysis, higher levels of analysis were generally associated with a higher frequency of recommendations “against” when using the comprehensive method, whereas the conjoint method yielded mixed results.
TDApp considered over 10 drugs, including recently marketed medications such as viloxazine or serdexmethylphenidate, which were not evaluated by any CPG. Methylphenidate and amphetamine derivatives were the most frequently recommended interventions “in favor” (ESM 6 and 7), while atomoxetine and viloxazine were the most frequently recommended “against” (ESM 8 and 9). All treatment recommendations were classified “weak”. The AAP and SHS CPGs recommended both stimulant and non-stimulant drugs, whereas the NICE, CADDRA and AADPA primarily recommended stimulant drugs. The number of distinct recommendations generated with TDApp ranged from 2 to 13, with Blau’s index values ranging from 0.18 to 0.80, which were higher when using the conjoint method (ESM 10). In contrast, CPGs provided 2 to 5 distinct recommendations, with Blau’s index values ranging from 0.17 to 0.50. The proportion of patients for whom TDApp recommended a drug that was also recommended by CPGs was low when using the comprehensive method (ranging from 13.6% to 20.0%) and medium-high (ranging from 56.7% to 95.5%) when using the conjoint method (ESM 11). Dendrograms (Figure 1) illustrate that the CPG recommendations, particularly those from NICE, CADDRA and AADPA, clustered together on one side of the tree. In contrast, the recommendations generated by the comprehensive method formed a distinct cluster on the opposite side. Among these, the Level 4 branch connected to the CPG cluster at a shorter distance compared to the Level 1–3 branches. Furthermore, the Level 1–3 branches of the conjoint method clustered together near the center of the figure, while the Level 4 branch was positioned between the remaining conjoint method recommendations and the CPGs — though slightly closer to the latter.

Figure 1. Hierarchical clustering dendrogram depicting similarities on the pharmacological interventions recommended by several versions of TDApp and CPGs in Study 1 (see ESM 12 for an analysis limited to children and adolescents). AADPA, Australasian ADHD Professionals Association; AAP, American Academy of Pediatrics; CADDRA, Canadian ADHD Resource Alliance; CPG, clinical practice guideline; L, level of analysis; NICE, National Institute for Health and Care Excellence; SHS, Spanish Health System.
Study 2
Methods
Study design
The study is a non-randomized, non-controlled clinical trial.
Patients
As in study 1, no formal sample size calculation was performed, and the sample was set at 32 patients. The study included male and female patients aged 6 to 17-years with a DSM-5 diagnosis of ADHD confirmed through clinical assessment, who required initiation or modification of treatment. Exclusion criteria included patients already receiving adequate treatment or requiring only minor medication adjustments, as well as those without access to an electronic device capable of connecting to the Internet. Parents provided signed informed consent, while adolescent patients provided assent. Recruitment took place at 2 study sites in Girona (Catalonia, Spain).
Independent variables
The independent variable was the recommender system. The recommender systems compared included TDApp, and the CPGs from AAP, NICE, SHS, CADDRA, and the AADPA. In this study, the comprehensive-L3 method was adopted. Although, Study 1 found that the comprehensive method generated treatment recommendations for fewer patients than the conjoint method, the former was chosen for implementation in Study 2 for two reasons: 1) the conjoint method filtered out certain preferences that patients or clinicians deemed critical, which did not fully align with our approach to participatory decision-making. Furthermore, this method could potentially result in no critical preference being used if patients and users did not agree on any of them. 2) The lower number of patients receiving treatment recommendations with the comprehensive method in Study 1 was attributed to the large number of critical preferences TDApp had to consider, resulting in treatment recommendations being identified only for a minority of patients whose treatment goals could be met. To address this surplus of critical preferences in Study 2, patients and clinicians were instructed to be more discriminatory between critical and non-critical preferences, with the recommendation that the number of critical preferences be minimized, with a minimum of one. Level 3 was selected because, as demonstrated in Study 1, when the comprehensive method was used it provided the highest number of distinct recommendations with the greatest level of diversity. Nevertheless, a post hoc analysis was conducted to explore the implications of using the conjoint method.
Outcome measures
TDApp recommendations were compared to those of the selected CPGs using a measure of pharmacological distance based on the NbN nomenclature (50). Additionally, data on treatment effectiveness on ADHD symptom severity and clinical global impression, adverse events, physiological measures, sense of coherence, self-efficacy, health literacy, patient/parent satisfaction with TDApp, clinicians’ satisfaction with the treatment recommendations, and treatment efficacy and safety were also collected and will be reported in a separate article.
Procedures
The study consisted of 5 visits over 6 weeks. During the first study visit, informed consent and assent were requested. If the patient/parents agreed to participate, sociodemographic and clinical history were then collected. Additionally, the patient was encouraged to use TDApp’s educational materials on ADHD and its treatment to make informed decisions about treatment goals. At the next visit, patients/parents and clinicians reported their preferences regarding treatment goals and TDApp generated the treatment recommendations and therapeutic report, which were then presented to the clinician (see ESM 13 for screenshots on TDApp frontend during the study), who prescribed treatment if deemed necessary. During visits 3-5, measures of patient satisfaction were taken of both the educational material and their participation in decision-making, as well as patient empowerment and health literacy, clinician satisfaction with TDApp recommendations, treatment effectiveness and safety, and patient satisfaction with treatment outcomes.
The study was registered at clinicaltrials.gov (NCT05651685) and obtained approval from the Ethics Committee of Research with Medicines in Girona (Catalonia, Spain) before the start of patient enrollment.
Statistical analysis:
A descriptive analysis was conducted, and Blau’s index was calculated to assess the diversity in treatment recommendations. The McNemar and the Wilcoxon signed-rank tests were used to compare dichotomous and non-parametric continuous variables, respectively. Dendrograms were generated to evaluate the concordance between TDApp and CPG recommendations by analyzing the pharmacological distance between their suggested treatments.
Results
Patients
Thirty-two patients were enrolled, with one lost to follow up. The cohort was predominantly male, with an average age of 12 years, and exhibited moderate to severe ADHD, predominantly of the combined subtype (Table 2). The presence of comorbidities was common, with other neurodevelopmental disorders being the most prevalent. Nearly half of the patients were receiving psychotherapy for ADHD.
Preferences
The mean number of preferences rated as critical was 4–5 among both patients and clinicians resulting in an average of 6.4 critical preferences analyzed by TDApp per patient. The most common critical preferences were academic performance and improvement of ADHD symptoms on the efficacy side, and seizures and syncope on the safety side among patients, while clinicians prioritized improvement of symptoms, clinical impression, seizures, and appetite decrease. In total, 24 distinct combinations of critical preferences were considered.
Meta-analysis and quality:
Around a median of 7–8 interventions were analyzed per patient involving around 29 meta-analyses. The overall quality of the evidence was low, with imprecision being the main reason for reducing the quality of the evidence (ESM 14).
Recommendations and comparison with CPGs:
Half of patients received at least one treatment recommendation “in favor”, and half received at least one “against” (ESM 15). TDApp generated eight distinct recommendations “in favor” for the study participants, with a Blau’s index of 0.70, while each CPG analyzed generated two distinct recommendations with a Blau’s index of 0.06 (Table 3). TDApp considered over 10 drugs, including recently marketed ones. Amphetamine derivatives were the most frequently recommended interventions “in favor”, followed by methylphenidate derivatives and atomoxetine (Table 3). Viloxazine, followed by amphetamine derivatives were the interventions most frequently recommended “against” (ESM 16). All treatment recommendations were classified “weak”. The percentage of patients for whom TDApp recommendations overlapped with at least one drug recommended by CPGs ranged from 21.9% with the NICE CPG to 50.0% with the AAP and SHS CPGs. When limiting the analysis to only patients who received a TDApp recommendation “in favor” of treatment, the overlap ranged from 43.8% with the NICE CPG to 100% with the AAP and SHS CPGs (Figure 2). Dendrograms (Figure 3) showed TDApp recommendations to be positioned on one side of the tree while those from GPGs clustered together on the opposite one with the NICE, CADDRA and AADPA clustering at the lowest levels of the tree. Furthermore, TDApp recommendations from the conjoint method connected at a lower distance to CPGs branches than those form the comprehensive one.

Table 3. Comparison of the number, diversity and types of treatment recommended “in favor” by TDApp using the comprehensive and conjoint methods, and relevant CPGs in Study 2.

Figure 2. The percentage of patients with at least one overlapping pharmacological treatment recommendation between relevant CPGs and TDApp, using the comprehensive and conjoint methods on all patients (top), and all patients for whom TDApp recommended treatments in favor (bottom) in Study 2. AADPA, Australasian ADHD Professionals Association; AAP, American Academy of Pediatrics; CADDRA, Canadian ADHD Resource Alliance; CPG, clinical practice guideline; L, level of analysis; NICE, National Institute for Health and Care Excellence; SHS, Spanish Health System. The bottom table denotes the p-values for the comparison between comprehensive and conjoint methods for all patients.

Figure 3. Hierarchical clustering dendrogram depicting differences in the pharmacological interventions recommended by CPGs and TDApp using the comprehensive (left) and conjoint methods (right) in Study 2. AADPA, Australasian ADHD Professionals Association; AAP, American Academy of Pediatrics; CADDRA, Canadian ADHD Resource Alliance; CPG, clinical practice guideline; L, level of analysis; NICE, National Institute for Health and Care Excellence; SHS, Spanish Health System.
Post hoc analyses
Upon completion of the primary analysis, a post hoc analysis was performed by using the conjoint method for handling patient and clinician preferences. With this approach, the mean number of preferences rated as critical by both the patient and the clinician was 3.2. A median of 7–8 interventions were analyzed per patient involving 22–23 meta-analyses. The overall quality of the evidence was low (ESM 14). Most (75.0%) patients received at least one treatment recommendation “in favor” and 40.6% at least one “against” (ESM 14). TDApp generated 12 distinct recommendations “in favor” (Table 3), with a Blau’s diversity index of 0.88. Methylphenidate, amphetamine derivatives and atomoxetine were the most frequently recommended interventions “in favor” (46.9%, 40.6% and 40.6%, respectively), while the interventions most frequently recommended “against” (ESM 16) were viloxazine, atomoxetine and methylphenidate derivatives (25.0%, 12.5% and 6.3%, respectively). Three (9.4%) patients received at least one “strong” treatment recommendation. The proportion of patients for whom TDApp recommendations coincided with at least one drug ranged from 53.1% to 71.9% when considering all samples, and from 68.0% to 95.8% when analyzing only patients with TDApp recommendations (Figure 2). The dendrograms (Figure 3) reveal that TDApp recommendations are positioned on one side of the tree, while the other systems form a separate cluster on the opposite side.
Discussion
TDApp is a “first in class” CDSS designed to assist clinicians in shared treatment decision-making by generating up-to-date, personalized, participatory and explanatory treatment recommendations. Findings from the initial clinical study indicate that, while CPGs provided treatment recommendations for 100% of ADHD patients, the proportion of patients receiving recommendations from TDApp varied significantly depending on the methods employed. Modifying the method for matching patients with RCTs and for combining interventions in meta-analysis influenced the number of studies included in the meta-analysis, the evidence generated, its quality, and the number and type of interventions recommended. However, these changes had little impact on the proportion of patients receiving recommendations “in favor”. In contrast, adjustments to the methods for handling critical preferences significantly affected the number of endpoints considered in the evidence analysis. This, in turn, influenced the likelihood of identifying interventions with a favorable risk-benefit relationship, ultimately affecting the proportion of patients receiving recommendations “in favor”. In Study 2, the comprehensive-L3 method was implemented, and patients/clinicians were required to discriminate between critical and non-critical preferences. With this approach, TDApp generated recommendations “in favor” for half of the participants, representing an improvement over Study 1, but still falling short of our expectations. Consequently, the data were re-analyzed using the conjoint method, resulting in “in favor” treatment recommendations for the majority (75%) of patients. Based on these findings, it appears reasonable to implement the comprehensive-L3 method as the default approach while allowing for the use of the conjoint-L3 method when no treatment recommendation can be made. The challenges encountered in developing TDApp highlight its uniqueness as a CDSS designed to generate treatment recommendations, underscoring the complexities of developing participatory decision-making tools. The trial-and-error experience undertaken by the research offers valuable insights for future developers of clinical decision-making tools with a similar approach, potentially facilitating a more efficient path towards an optimal version.
Several findings support the validity of TDApp as a CDSS that makes up-to-date, personalized, participatory and explanatory treatment recommendations for ADHD patients. First, the meta-analyses results and their quality are consistent: higher levels of analyses tend to include more studies, more precise effect estimates, and greater statistical heterogeneity. Second, TDApp found that most meta-analyses were of low quality, which is consistent with other systematic reviews with meta-analysis investigating the efficacy and safety of interventions on ADHD (51–53). Third, the comprehensive method, which incorporates more critical preferences than the conjoint method, tends to generate fewer recommendations “in favor”. Fourth, TDApp fundamentally recommends FDA/EMA ADHD-approved drugs, particularly psychostimulants, aligning with drug prescriptions in clinical practice (54–56). In contrast, atomoxetine was one of the least prescribed interventions, which aligns with evidence indicating that atomoxetine has a questionable risk-benefit profile (51, 52), as well as data on ADHD drug use showing it is prescribed to a minority of patients (42, 54). Furthermore, it is frequently associated with treatment discontinuation and switching after initial prescription (57). Fifth, the proportion of patients for whom TDApp recommendations overlapped in at least one drug with CPGs was notably high. Sixth, since Level 4 provides the most general treatment recommendations, which are more similar to those found in CPGs, its branches in the dendrograms connect to the CPGs branches at a shorter distance than those from the lower levels. Similarly, the recommendations generated by the conjoint method—like CPGs, based on a smaller number of preferences—also cluster closer to the CPGs than those generated by the comprehensive method. Finally, the observation that TDApp does not recommend drug treatment for certain patients further supports its validity, as approximately one in four patients with ADHD are ineligible for RCTs (58).
TDApp offers up-to-date recommendations, demonstrated by its inclusion of newly investigated drugs for ADHD, such as viloxazine and serdexmethylphenidate, which are not included in the latest versions of the selected CPGs. TDApp’s capacity to offer up-to-date treatment recommendations is clinically relevant, as delays in guideline updates is known to be a reason for clinician non-adherence to CPGs (59). Another implication of TDApp up-to-date recommendations is its potential to bridge the gap between cutting edge research and clinical practice, a gap that is particularly pronounced with CPGs due to the time required for their updates.
TDApp makes participatory clinical recommendations by allowing patients to specify the treatment goals they want to achieve. This is particularly relevant as patient involvement has shown to improve treatment adherence in ADHD patients (60). This study provides additional data to gain insight into this issue by revealing large between-patient variability in their preferences, therefore challenging the conventional approach of centralized selection of clinical preferences in CPG development, which fails to account for patient variability in treatment goals. Another important finding related to patient participation is that many patients identified academic performance as a key treatment outcome. This is significant for two reasons. First, none of the selected CPGs considered academic achievement a critical preference, increasing the chances of misalignment between CPG recommendations and patients’ values and preferences. Second, it highlights the need for additional RCTs to assess treatment efficacy on improving academic performance, as this is a demand from patients and parents (61).
Personalization is achieved through an evidence-based analysis of individual patient characteristics and preferences, revealing significant variability and underscoring the need to tailor treatment recommendations to each patient. Unlike CPGs, where PICO questions are formulated by a centralized panel of experts, TDApp, constructs PICO questions based on patient characteristics while also considering critical preferences expressed by both patients and clinicians. Consequently, TDApp generates a broader range of recommendations than CPGs, as shown by a higher number of distinct recommendations and an increased Blau’s index, particularly in Study 2, as well as with the conjoint method in Study 1. Furthermore, the greater diversity of treatment recommendations provided by TDApp can also be attributed to the fact that it considers a wider range of interventions than CPGs. This includes newly marketed drugs, as well as medications approved for indications other than ADHD, such as modafinil or bupropion, though these are infrequently recommended “in favor”.
Finally, similar to CPGs, TDApp provides explanatory treatment recommendations, as each recommendation is supported by an automatically generated evidence-based analysis report, ensuring transparency in the methods used to reach these conclusions. This contrasts with recommender systems, particularly those employing deep learning, which often lack clinical explanations for their recommendations (62). Likewise, systems such as IBM Watson, which uses natural language processing, machine learning, and knowledge-based reasoning, but relies on opaque methods for generating recommendations (63). Both approaches present ethical challenges (64) and limitations in clinical usefulness, as parental trust in treatment decisions increased the likelihood of accepting ADHD medication (65).
It should be noted that the similarity between recommendations provided by TDApp and those from CPGs has been investigated using an ad hoc continuous measure of the pharmacological distance based on an external source of information on drug characteristics, namely the NbN nomenclature. This is the first instance of using pharmacological distance, based on an external source, to assess the degree of concordance between different recommender systems. It is noteworthy that that this approach can be implemented in different ways; for instance, our analysis assumes that the difference between two drugs acting on different targets is the same, regardless of the particular targets involved. Thus, a drug acting on serotonin is equally different to two drugs acting on dopamine or on norepinephrine. Future studies would benefit from replicating and modifying our approach in the context of ADHD and other mental health conditions. In addition to this novel approach, a traditional categorical method (66) was also studied by analyzing the proportion of patients who were recommended the same drug. These two methods provide complementary insights. While the quantitative analyses, shown in dendrograms, indicate that TDApp functions as a distinct recommender system when compared to CPGs, the categorical analysis demonstrates that overlap in at least one recommended drug is common, supporting the idea that TDApp is both a distinct and valid recommender system in relation to CPGs.
It is important to highlight that this represents the first step in clinically validating TDApp recommendations. This is a complex issue, as proper validation requires a gold standard for comparison. While ADHD CPGs could be considered this gold standard, CPGs themselves often lack prospective clinical evaluation. Furthermore, different ADHD CPGs may make conflicting recommendations, making it unclear which one should be used as the gold standard for comparison (12). Alternatively, the validity and clinical usefulness of TDApp could be examined through a RCT using clinically relevant outcomes, which should be conducted with the finalized version of TDApp.
The limitations of this study should be emphasized. The sample sizes of both studies are small, though likely adequate to study the concordance between TDApp and CPG recommendations and the differences in their diversity. No formal contrast of hypothesis was conducted to compare the diversity and concordance of recommendations in the dendrograms, as no appropriate methods are currently available. Combinations of drugs were not considered in these studies, but this is of little relevance, as combinations are rarely studied, and no drug combination has received FDA/EMA approval or is recommended by any of the CPGs included in this study. Multiple iterations were explored in Study 1, and post hoc analyses carried out in Study 2. This iterative process can result in models overly tailored to the idiosyncrasies of the available data, thereby limiting their generalizability and undermining replicability in broader clinical contexts (64). To prevent this, several transparency practices should be adopted. Protocol registration involves publicly posting a summary of the study’s design, usually including objectives, methodology, and primary outcomes, before the study begins. Study preregistration requires researchers to detail their hypotheses, methods, and analysis plans prior to data collection, enhancing accountability (67). Registered reports go further by having researchers submit a full study protocol for peer review before data collection (68, 69).
TDApp does not take cost into account, despite cost being a significant factor in treatment decision-making. However, this issue can be addressed when implementing TDApp in individual centers and, more specifically, within each country, given that costs vary within and between countries. TDApp recommendations “in favor” were compared to those from CPG; however, we did not compare the recommendations “against” treatment as they are not clearly reported in CPGs. At present, TDApp makes pharmacological treatment recommendations only. Non-pharmacological interventions will be included in the next iteration. Study 1 was conducted during the COVID-19 pandemic, which altered usual healthcare practices and may have introduced potential selection biases. Furthermore, the external validity of the study may be compromised by the fact participants were enrolled in public healthcare centers, which are likely to differ from private healthcare settings.
Conclusions
● TDApp recommendations vary depending on the methods used for matching patients and RCTs, combining interventions and management of critical preferences.
● TDApp comprehensive-Level 3 and conjoint-Level 3 methods produced diverse and valid treatment recommendations.
● TDApp stands as an advanced prototype of an innovative AI-based CDSS, offering automated, participatory, personalized, and explanatory treatment recommendations. It emerges as a promising alternative to CPGs for assisting clinicians and patients in shared treatment decision-making.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by Comitè d’Ètica d’Investigació Clínica de l’Hospital Universitari de Girona Dr. Josep Trueta. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.
Author contributions
EB: Conceptualization, Data curation, Funding acquisition, Investigation, Writing – review & editing. ÒR: Conceptualization, Investigation, Methodology, Software, Validation, Visualization, Writing – review & editing. CL: Data curation, Investigation, Writing – review & editing. BG: Data curation, Investigation, Writing – review & editing. IA: Data curation, Investigation, Writing – review & editing. DL: Data curation, Investigation, Writing – review & editing. TF: Data curation, Investigation, Writing – review & editing. RC: Data curation, Funding acquisition, Investigation, Methodology, Writing – review & editing. DS: Funding acquisition, Investigation, Writing – review & editing. DRi: Funding acquisition, Investigation, Methodology, Writing – review & editing. DRa: Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Writing – review & editing. BL: Formal Analysis, Funding acquisition, Visualization, Writing – review & editing, Software. XC: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Visualization, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research and/or publication of this article. This work received funding from Fundació Pascual i Prats and Campus-Salut de la Universitat de Girona under grant no. AIN2018E, the Carlos III Research Institute (ISCIII), under grant no. PI19/00375 and PI23/01690 and the Generalitat de Catalunya 2021 SGR 01125.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be constructed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2025.1582746/full#supplementary-material
References
1. Evidence-Based Medicine Working Group. Evidence-based medicine. A new approach to teaching the practice of medicine. JAMA. (1992) 268:2420–5. doi: 10.1001/jama.1992.03490170092032
2. Sur RL and Dahm P. History of evidence-based medicine. Indian J Urol. (2011) 27:487–9. doi: 10.4103/0970-1591.91438
3. Institute of Medicine (US) Committee on Standards for Developing Trustworthy Clinical Practice Guidelines. Clinical practice guidelines we can trust. Washington, DC: National Academies Press (2011). doi: 10.17226/13058
4. Atkins D, Best D, Briss PA, Eccles M, Falck-Ytter Y, Flottorp S, et al. Grading quality of evidence and strength of recommendations. BMJ. (2004) 328:1490. doi: 10.1136/bmj.328.7454.1490
5. Alonso-Coello P, Oxman AD, Moberg J, Brignardello-Petersen R, Akl EA, Davoli M, et al. GRADE Evidence to Decision (EtD) frameworks: a systematic and transparent approach to making well informed healthcare choices. 2: Clinical practice guidelines. BMJ. (2016) 353:i2089. doi: 10.1136/bmj.i2089
6. Alderson LJ, Alderson P, and Tan T. Median life span of a cohort of National Institute for Health and Care Excellence clinical guidelines was about 60 months. J Clin Epidemiol. (2014) 67:52–5. doi: 10.1016/j.jclinepi.2013.07.012
7. Martínez García L, Sanabria AJ, García Alvarez E, Trujillo-Martín MM, Etxeandia-Ikobaltzeta I, Kotzeva A, et al. The validity of recommendations from clinical guidelines: a survival analysis. CMAJ. (2014) 186:1211–9. doi: 10.1503/cmaj.140547
8. Steel N, Abdelhamid A, Stokes T, Edwards H, Fleetcroft R, Howe A, et al. A review of clinical practice guidelines found that they were often based on evidence of uncertain relevance to primary care patients. J Clin Epidemiol. (2014) 67:1251–7. doi: 10.1016/j.jclinepi.2014.05.020
9. Austad B, Hetlevik I, Mjølstad BP, and Helvik AS. Applying clinical guidelines in general practice: a qualitative study of potential complications. BMC Fam Pract. (2016) 17:92. doi: 10.1186/s12875-016-0490-3
10. Montori VM. Removing the blindfold: The centrality of care in caring for patients with multiple chronic conditions. Health Serv Res. (2021) 56 Suppl 1:969–72. doi: 10.1111/1475-6773.13865
11. Barry HC, Cosgrove L, and Slawson DC. Where clinical practice guidelines go wrong. Am Fam Physician. (2022) 105:350–2.
12. Pliszka SR, Pereira-Sanchez V, and Robles-ramamurthy B. A review of clinical practice guidelines in the diagnosis and treatment of attention-deficit/hyperactivity disorder. Child Adolesc Psychiatr Clin N Am. (2022) 31:569–81. doi: 10.1016/j.chc.2022.03.009
13. Cohen D and Carter P. Conflicts of interest. WHO and the pandemic flu “conspiracies. BMJ. (2010) 340:c2912. doi: 10.1136/bmj.c2912
14. Brems JH, Davis AE, and Clayton EW. Analysis of conflict of interest policies among organizations producing clinical practice guidelines. PloS One. (2021) 16:e0249267. doi: 10.1371/journal.pone.0249267
15. Pagano D, Home P, Irs A, Ledermann J, Curigiliano G, Iwatani T, et al. The path for medical associations to sponsor trustworthy guidelines: is it feasible? J R Soc Med. (2022) 115:252–6. doi: 10.1177/01410768221095240
16. Armstrong MJ and Bloom JA. Patient involvement in guidelines is poor five years after institute of medicine standards: review of guideline methodologies. Res Involv Engagem. (2017) 3:19. doi: 10.1186/s40900-017-0070-2
17. Maes-Carballo M, Muñoz-Núñez I, Martín-Díaz M, Mignini L, Bueno-Cavanillas A, and Khan KS. Shared decision making in breast cancer treatment guidelines: Development of a quality assessment tool and a systematic review. Health Expect. (2020) 23:1045–64. doi: 10.1111/hex.13112
18. Alonso-Coello P, Irfan A, Solà I, Gich I, Delgado-Noguera M, Rigau D, et al. The quality of clinical practice guidelines over the last two decades: a systematic review of guideline appraisal studies. Qual Saf Health Care. (2010) 19:e58. doi: 10.1136/qshc.2010.042077
19. Ratnani I, Fatima S, Abid MM, Surani Z, and Surani S. Evidence-based medicine: history, review, criticisms, and pitfalls. Cureus. (2023) 15:e35266. doi: 10.7759/cureus.35266
20. Arts DL, Voncken AG, Medlock S, Abu-Hanna A, and van Weert HC. Reasons for intentional guideline non-adherence: A systematic review. Int J Med Inform. (2016) 89:55–62. doi: 10.1016/j.ijmedinf.2016.02.009
21. Kriner P, Brieger P, Pogarell O, Schüle C, Mußmann L, Korbmacher J, et al. Treatment of bipolar depression: clinical practice vs. adherence to guidelines-data from a Bavarian drug surveillance project. Front Psychiatry. (2024) 15:1425549. doi: 10.3389/fpsyt.2024.1425549
22. Hines RB, Barrett A, Twumasi-Ankrah P, Broccoli D, Engelman KK, Baranda J, et al. Predictors of guideline treatment nonadherence and the impact on survival in patients with colorectal cancer. J Natl Compr Canc Netw. (2015) 13:51–60. doi: 10.6004/jnccn.2015.0008
23. Setkowski K, Boogert K, Hoogendoorn AW, Gilissen R, and van Balkom AJLM. Guidelines improve patient outcomes in specialised mental health care: A systematic review and meta-analysis. Acta Psychiatr Scand. (2021) 144:246–58. doi: 10.1111/acps.13332
24. Ricci-Cabello I, Vásquez-Mejía A, Canelo-Aybar C, Niño de Guzman E, Pérez-Bracchiglione J, Rabassa M, et al. Adherence to breast cancer guidelines is associated with better survival outcomes: a systematic review and meta-analysis of observational studies in EU countries. BMC Health Serv Res. (2020) 20:920. doi: 10.1186/s12913-020-05753-x
25. Castillo RC, Heins S, Feldman D, DuGoff EH, Roberts E, Staguhn ED, et al. The impact of adherence to clinical practice guidelines on medical costs. J Occup Environ Med. (2020) 62:712–7. doi: 10.1097/JOM.0000000000001938
26. López B, Raya O, Baykova E, Saez M, Rigau D, Cunill R, et al. APPRAISE-RS: Automated, updated, participatory, and personalized treatment recommender systems based on GRADE methodology. Heliyon. (2023) 9:e13074. doi: 10.1016/j.heliyon.2023.e13074
27. Benson NM and Song Z. Prices and cost sharing for psychotherapy in network versus out of network in the United States. Health Aff (Millwood). (2020) 39:1210–8. doi: 10.1377/hlthaff.2019.01468
28. Zhu JM, Huntington A, Haeder S, Wolk C, and McConnell KJ. Insurance acceptance and cash pay rates for psychotherapy in the US. Health Aff Sch. (2024) 2:qxae110. doi: 10.1093/haschl/qxae110
29. Fischer L, Scheibler F, Schaefer C, Karge T, Langer T, Schewe LV, et al. Fostering shared decision-making between patients and health care professionals in clinical practice guidelines: protocol for a project to develop and test a tool for guideline developers. JMIR Res Protoc. (2024) 13:e57611. doi: 10.2196/57611
30. Ge L, Agrawal R, Singer M, Kannapiran P, De Castro Molina JA, Teow KL, et al. Leveraging artificial intelligence to enhance systematic reviews in health research: advanced tools and challenges. Syst Rev. (2024) 13:269. doi: 10.1186/s13643-024-02682-2
31. Liu X, Chen CH, Karvela M, and Toumazou C. A DNA-based intelligent expert system for personalised skin-health recommendations. IEEE J BioMed Health Inform. (2020) 24:3276–84. doi: 10.1109/JBHI.2020.2978667
32. Kondylakis H, Bucur A, Crico C, Dong F, Graf N, Hoffman S, et al. Patient empowerment for cancer patients through a novel ICT infrastructure. J BioMed Inform. (2020) 101:103342. doi: 10.1016/j.jbi.2019.103342
33. Faraone SV, Banaschewski T, Coghill D, Zheng Y, Biederman J, Bellgrove MA, et al. The World Federation of ADHD International Consensus Statement: 208 Evidence-based conclusions about the disorder. Neurosci Biobehav Rev. (2021) 128:789–818. doi: 10.1016/j.neubiorev.2021.01.022
34. Dalsgaard S, Leckman JF, Mortensen PB, Nielsen HS, and Simonsen M. Effect of drugs on the risk of injuries in children with attention deficit hyperactivity disorder: a prospective cohort study. Lancet Psychiatry. (2015) 2:702–9. doi: 10.1016/S2215-0366(15)00271-0
35. Lu Y, Sjölander A, Cederlöf M, D’Onofrio BM, Almqvist C, Larsson H, et al. Association between medication use and performance on higher education entrance tests in individuals with attention-deficit/hyperactivity disorder. JAMA Psychiatry. (2017) 74:815–22. doi: 10.1001/jamapsychiatry.2017.1472
36. Mohr-Jensen C, Müller Bisgaard C, Boldsen SK, and Steinhausen HC. Attention- deficit/hyperactivity disorder in childhood and adolescence and the risk of crime in young adulthood in a danish nationwide study. J Am Acad Child Adolesc Psychiatry. (2019) 58:443–52. doi: 10.1016/j.jaac.2018.11.016
37. Ellis LA, Blakely B, Hazell P, Woolfenden S, Hiscock H, Sarkozy V, et al. Guideline adherence in the management of attention deficit hyperactivity disorder in children: An audit of selected medical records in three Australian states. PloS One. (2021) 16:e0245916. doi: 10.1371/journal.pone.0245916
38. Rosenau PT, Dietrich A, Matthijssen AM, Kleine-Deters R, van de Loo-Neus GHH, Buitelaar JK, et al. Adherence to clinical guidelines for dose finding and monitoring methylphenidate use: A medical record audit in child and adolescent mental health care and pediatric settings. J Child Adolesc Psychopharmacol. (2023) 33:183–9. doi: 10.1089/cap.2022.0095
39. Piper BJ, Ogden CL, Simoyan OM, Chung DY, Caggiano JF, Nichols SD, et al. Trends in use of prescription stimulants in the United States and Territories, 2006 to 2016. PloS One. (2018) 13:e0206100. doi: 10.1371/journal.pone.0206100
40. Wallach-Kildemoes H, Skovgaard AM, Thielen K, Pottegård A, and Mortensen LH. Social adversity and regional differences in prescribing of ADHD medication for school-age children. J Dev Behav Pediatr. (2015) 36:330–41. doi: 10.1097/DBP.0000000000000170
41. Mykletun A, Widding-Havneraas T, Chaulagain A, Lyhmann I, Bjelland I, Halmøy A, et al. Causal modelling of variation in clinical practice and long-term outcomes of ADHD using Norwegian registry data: the ADHD controversy project. BMJ Open. (2021) 11:e041698. doi: 10.1136/bmjopen-2020-041698
42. Akmatov MK, Holstiege J, and Bätzing J. Secular trends and regional variations in pharmacotherapy of attention-deficit/hyperactivity disorder (ADHD) among children and adolescents in Germany. BMC Psychiatry. (2021) 21:405. doi: 10.1186/s12888-021-03409-6
43. Balan-Moshe L and Gothelf D. Minding the gap between clinical guidelines and real-life clinical work. Eur Child Adolesc Psychiatry. (2021) 30:681–3. doi: 10.1007/s00787-021-01765-y
44. Khan MU and Aslani P. A review of factors influencing the three phases of medication adherence in people with attention-deficit/hyperactivity disorder. J Child Adolesc Psychopharmacol. (2019) 29:398–418. doi: 10.1089/cap.2018.0153
45. Wolraich ML, Hagan JF Jr, Allan C, Chan E, Davison D, Earls M, et al. Clinical practice guideline for the diagnosis, evaluation, and treatment of attention-deficit/hyperactivity disorder in children and adolescents. Pediatrics. (2019) 144:e20192528. doi: 10.1542/peds.2019-2528
46. National Institute for Health and Care Excellence (NICE). Attention deficit hyperactivity disorder: diagnosis and management (NICE Guideline No. NG87). London: NICE (2019). Available online at: https://www.nice.org.uk/guidance/ng87 (Accessed June 01, 2022).
47. CADDRA. Canadian ADHD resource alliance: canadian ADHD practice guidelines. 4.1 Edition. Toronto ON: CADDRA (2020).
48. ADHD Guideline Development Group. Australian evidence-based clinical practice guideline for Attention Deficit Hyperactivity. Melbourne: Australian ADHD Professionals Association (2022).
49. Working Group of the Clinical Practice Guide on therapeutic interventions in attention deficit hyperactivity disorder (ADHD). Clinical practice guide on therapeutic interventions in ADHD. Zaragoza (Spain): Ministry of Health, Social Services and Equality. Aragonese Institute of Health Sciences (IACS. (2017).
50. Zohar J, Stahl S, Moller HJ, Blier P, Kupfer D, Yamawaki S, et al. A review of the current nomenclature for psychotropic agents and an introduction to the Neuroscience-based Nomenclature. Eur Neuropsychopharmacol. (2015) 25:2318–25. doi: 10.1016/j.euroneuro.2015.08.019
51. Cunill R, Castells X, Tobias A, and Capellà D. Efficacy, safety and variability in pharmacotherapy for adults with attention deficit hyperactivity disorder: a meta-analysis and meta-regression in over 9000 patients. Psychopharmacol (Berl). (2016) 233:187–97. doi: 10.1007/s00213-015-4099-3
52. Riera M, Castells X, Tobias A, Cunill R, Blanco L, and Capellà D. Discontinuation of pharmacological treatment of children and adolescents with attention deficit hyperactivity disorder: meta-analysis of 63 studies enrolling 11,788 patients. Psychopharmacol (Berl). (2017) 234:2657–71. doi: 10.1007/s00213-017-4662-1
53. Storebø OJ, Storm MRO, Pereira Ribeiro J, Skoog M, Groth C, Callesen HE, et al. Methylphenidate for children and adolescents with attention deficit hyperactivity disorder (ADHD). Cochrane Database Syst Rev. (2023) 3:CD009885. doi: 10.1002/14651858.CD009885.pub3
54. Raman SR, Man KKC, Bahmanyar S, Berard A, Bilder S, Boukhris T, et al. Trends in attention-deficit hyperactivity disorder medication use: a retrospective observational study using population-based databases. Lancet Psychiatry. (2018) 5:824–35. doi: 10.1016/S2215-0366(18)30293-1
55. Öner Ö, Yilmaz EŞ, Karadağ H, Vural M, Vural EH, Akbulat A, et al. ADHD medication trends in Turkey: 2009-2013. J Atten Disord. (2017) 21:1192–7. doi: 10.1177/1087054714523129
56. Prieto Antolín B, Gutiérrez-Abejón E, Alberola López S, and Andrés de Llano JM. Tendencia del consumo de fármacos en el trastorno por déficit de atención e hiperactividad en niños y adolescentes (2010-2019) [Trend in medicines use for attention deficit hyperactivity disorder in children and adolescents (2010-2019). Rev Esp Salud Publica. (2022) 96:e202203033.
57. Liman C, Schein J, Wu A, Huang X, Thadani S, Childress A, et al. Real world analysis of treatment change and response in adults with attention-deficit/hyperactivity disorder (ADHD) alone and with concomitant psychiatric comorbidities: results from an electronic health record database study is the United States. BMC Psychiatry. (2024) 24:618. doi: 10.1186/s12888-024-05994-8
58. Garcia-Argibay M, Chang Z, Brikell I, Kuja-Halkola R, D’Onofrio BM, Lichtenstein P, et al. Evaluating ADHD medication trial representativeness: a Swedish population-based study comparing hypothetically trial-eligible and trial-ineligible individuals. Lancet Psychiatry. (2025) 12:131–39. doi: 10.1016/S2215-0366(24)00396-1
59. Stewart D, Al Hail M, Al-Shaibi S, Hussain TA, Abdelkader NN, Pallivalapila A, et al. A scoping review of theories used to investigate clinician adherence to clinical practice guidelines. Int J Clin Pharm. (2023) 45:52–63. doi: 10.1007/s11096-022-01490-9
60. Coletti DJ, Pappadopulos E, Katsiotas NJ, Berest A, Jensen PS, and Kafantaris V. Parent perspectives on the decision to initiate medication treatment of attention-deficit/hyperactivity disorder. J Child Adolesc Psychopharmacol. (2012) 22:226–37. doi: 10.1089/cap.2011.0090
61. Penn SCH, Yoo K, Glocko P, and Castells X. Effectiveness of pharmacological and non-pharmacological treatment on academic performance in patients with diagnosed ADHD: systematic review based on meta-analysis. Neurosci Appl. (2024) 3:S144. doi: 10.1016/j.nsa.2024.104371
62. Bharati S, Mondal MRH, and Podder P. A review on explainable artificial intelligence for healthcare: why, how, and when. IEEE Transactions on Artificial Intelligence (2023) New York (NY). doi: 10.1109/TAI.2023.3266418
63. Hamilton JG, Genoff Garzon M, Westerman JS, Shuk E, Hay JL, Walters C, et al. A tool, not a crutch”: patient perspectives about IBM watson for oncology trained by memorial sloan kettering. J Oncol Pract. (2019) 15:e277–88. doi: 10.1200/JOP.18.00417
64. Grote T and Berens P. On the ethics of algorithmic decision-making in healthcare. J Med Ethics. (2020) 46:205–11. doi: 10.1136/medethics-2019-105586
65. dosReis S and Myers MA. Parental attitudes and involvement in psychopharmacological treatment for ADHD: a conceptual model. Int Rev Psychiatry. (2008) 20:135–41. doi: 10.1080/09540260801933084
66. Tian Y, Liu X, Wang Z, Cao S, Liu Z, Ji Q, et al. Concordance between watson for oncology and a multidisciplinary clinical decision-making team for gastric cancer and the prognostic implications: retrospective study. J Med Internet Res. (2020) 22:e14122. doi: 10.2196/14122
67. Nosek BA, Ebersole CR, DeHaven AC, and Mellor DT. The preregistration revolution. Proc Natl Acad Sci U.S.A. (2018) 115:2600–6. doi: 10.1073/pnas.1708274114
68. Chambers CD and Tzavella L. The past, present and future of Registered Reports. Nat Hum Behav. (2022) 6:29–42. doi: 10.1038/s41562-021-01193-7
Keywords: Attention defcit hyperactivity disorder (ADHD), recommendation systems, evidence base for decision making, shared decision making, Artificial intelligence (AI), patient empowerment, clinical practice guidelines
Citation: Baykova E, Raya Ò, Lombardía C, Gonzalvo B, Andreu I, Losada D, Falkenhain T, Cunill R, Serrano D, Rigau D, Ramírez-Saco D, López B and Castells X (2025) Assessing TDApp: An AI-based clinical decision support system for ADHD treatment recommendations. Front. Psychiatry 16:1582746. doi: 10.3389/fpsyt.2025.1582746
Received: 24 February 2025; Accepted: 25 July 2025;
Published: 22 August 2025.
Edited by:
David Ma Mehler, University Hospital RWTH Aachen, GermanyReviewed by:
Soukaina Hattabi, University of Jendouba, TunisiaAhmed Farag, Faculty of Graduate Studies and Environmental Research Department of Humanities Ain Shams University, Egypt
Copyright © 2025 Baykova, Raya, Lombardía, Gonzalvo, Andreu, Losada, Falkenhain, Cunill, Serrano, Rigau, Ramírez-Saco, López and Castells. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xavier Castells, eGF2aWVyLmNhc3RlbGxzQHVkZy5lZHU=
‡ORCID: Òscar Raya, orcid.org/0000-0002-0420-3038