

Front. Psychiatry, 06 December 2021
Sec. Public Mental Health
Volume 12 - 2021

Digital Interventions for Generalized Anxiety Disorder (GAD): Systematic Review and Network Meta-Analysis

Pedro Saramago1* Lina Gega2,3,4 David Marshall5 Georgios F. Nikolaidis1,6 Dina Jankovic1 Hollie Melton5 Sarah Dawson7 Rachel Churchill5,7 Laura Bojke1
  • 1Centre for Health Economics, University of York, York, United Kingdom
  • 2Department of Health Sciences, University of York, York, United Kingdom
  • 3Hull York Medical School, University of York, Heslington, United Kingdom
  • 4Tees, Esk and Wear Valleys NHS Trust, Darlington, United Kingdom
  • 5Centre for Reviews and Dissemination, University of York, York, United Kingdom
  • 6IQVIA, London, United Kingdom
  • 7Common Mental Disorders Group, Cochrane Collaboration, York, United Kingdom

Background: Generalized anxiety disorder is the most common mental health condition based on weekly prevalence. Digital interventions have been used as alternatives or as supplements to conventional therapies to improve access, patient choice, and clinical outcomes. Little is known about their comparative effectiveness for generalized anxiety disorder.

Methods: We conducted a systematic review and network meta-analysis of randomized controlled trials comparing digital interventions with medication, non-digital interventions, non-therapeutic controls, and no intervention.

Results: We included 21 randomized controlled trials with a total of 2,350 participants from generalized anxiety disorder populations. Pooled outcomes using analysis of covariance and rankograms based on the surface under the cumulative ranking curves indicated that antidepressant medication and group therapy had a higher probability than digital interventions of being the “best” intervention. Supported digital interventions were not necessarily “better” than unsupported (pure self-help) ones.

Conclusions: Due to very wide confidence intervals, network meta-analysis results were inconclusive as to whether digital interventions are better than no intervention and non-therapeutic active controls, or whether they confer an additional benefit to standard therapy. Future research needs to compare digital interventions with one-to-one therapy and with manualized non-digital self-help and to include antidepressant medication as a treatment comparator and effect modifier.


Generalized Anxiety Disorder (GAD) is the most common mental health condition with 6% point-prevalence (measured over the preceding week) in the UK, nearly double that of depression (3.3%) (1). It is often confused with panic disorder or depression when self-reported by survey participants (2). GAD is characterized by excessive worry that persists for several months and leads to significant distress or impairment in everyday life and functioning (3). Other typical characteristics include free-floating anxiety and physical symptoms, such as muscle tension, headaches, restlessness, difficulty concentrating, irritability, or sleep problems. GAD is associated with low quality of life and high healthcare costs (4).

Psychological interventions can be effective for GAD, especially cognitive behavior therapy (CBT) (5) and applied relaxation (6). CBT helps the individual challenge or tolerate worrying thoughts and confront anxiety-provoking situations rather than avoiding them. Applied relaxation counteracts the physical symptoms of GAD through a series of tense-then-release muscle exercises that reduce muscle tension. Antidepressant medication can also be effective (7) and is often the first choice for treatment by clinicians in view of limited capacity to deliver psychological interventions. To improve access to psychological therapies and increase patient choice and therapist capacity, digital interventions have been used as alternatives or supplements to conventional face-to-face clinic-based therapy (8, 9).

Digital interventions are defined as software-based therapeutic activities accessed via technology platforms, such as the internet, virtual reality (VR), and mobile phones. According to the World Health Organization (WHO), the term “digital intervention” represents a discrete function of using technology to achieve health sector objectives (10). In the context of a specific condition, such as GAD, digital interventions (DIs) fulfil the discrete function of using software and digital media to deliver therapeutic activities that aim to improve symptoms associated with the condition in populations.

An example of a DI for GAD is a 10-week internet-based self-help programme consisting of psychoeducation (information about worry, stress, and anxiety, including its risk factors and treatments), CBT (dealing with the purpose, meaning, and content of worry, as well as modifying unhelpful responses to worry), relaxation (tensing then releasing body muscle groups and refocusing attention away from worry) and physical activity (11). Another example is a mobile app (12) that teaches diaphragmatic breathing in a series of mini-games, from sailing a boat down a river to flying balloons into the sky.

Previous reviews of the effectiveness of DIs for GAD (13, 14) included mixed populations of anxiety disorders and depression without reporting outcomes separately for GAD subgroups within these mixed samples. Reporting a disorder-specific outcome for mixed samples can be misleading because it implies that, if an intervention works for the mixed sample, it will also work for each of its constituent populations. Studies reporting findings from mixed samples do not answer the question of whether DIs are effective for GAD to inform disorder-specific clinical guidelines. To achieve this, and while preserving the benefits of randomisation, we need to analyse GAD outcomes reported separately for GAD populations and GAD sub-samples within mixed populations.

This paper reports a systematic review and quantitative synthesis of RCTs comparing DIs with other interventions, non-therapeutic control arms and no intervention for GAD populations with varying levels of illness severity (sub-threshold, mild, moderate, severe). The review had four objectives:

1. Categorize the DIs and comparator arms into groups that could be pooled together.

2. Compare the pooled outcomes of DIs with the pooled outcomes of non-digital interventions, medication, non-therapeutic controls, and no intervention for GAD symptoms.

3. Compare the pooled outcomes of different types of DIs.

4. Identify limitations and gaps in the existing research on DIs for GAD.

The Methods section describes the review, classification, and synthesis methodology. The Results section presents the characteristics of the evidence base and the synthesis results. The Discussion section interprets the findings, followed by concluding remarks in the Conclusions section.



The protocol for the review was registered with PROSPERO 2018 CRD42018105837 as part of a larger piece of work that investigated the costs and outcomes of digital interventions for mental health, funded by the UK's National Institute for Health Research (NIHR). The review has been conducted and reported as recommended by the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) extension statement for network meta-analysis (15).

Search Strategy

In December 2018, the following databases were searched to identify published and unpublished studies: MEDLINE, PsycINFO, Cochrane Central Register of Controlled Trials (CENTRAL), Cochrane Database of Systematic Reviews (CDSR), Cumulative Index to Nursing & Allied Health (CINAHL Plus), Database of Abstracts of Reviews of Effects (DARE), EMBASE, Web of Science Core Collection, NHS Economic Evaluation Database (NHS EED); Database of promoting health effectiveness reviews (DoPHER) and Proquest.

We also searched two clinical trial registries and other resources for ongoing studies: and the WHO International Clinical Trials Registry Platform portal, as well as the NIHR portfolio of studies. Web searches were conducted using Google and Google Scholar making use of simplified search terms. After searches were complete, supplementary searches were conducted, including reference lists of included studies and forward citation searches. Finally, we contacted the authors of included studies for information on any other work in the field they were aware of. The searches were undertaken for studies conducted since 1997 and restricted to those written in English.

In June 2019, the searches were updated and widened to include terms based on unspecified Anxiety Disorders. An additional pilot search was conducted on Cochrane Library and PsycINFO databases using terms based on “worry” and “anxiety prevention.” This ensured that no articles were missed. No new included articles emerged from the pilot search, and it was not deemed necessary to expand it to all remaining databases. The full search terms and outputs of the database searches are provided in Appendix A (December 2018 search) and Appendix B (June 2019 search).

Study Identification and Selection

Two reviewers (DM, HM) independently screened all titles and abstracts of the identified studies against our inclusion/exclusion criteria. If either reviewer indicated a study could be relevant, we retrieved the full text. The same two reviewers independently assessed the full texts against our inclusion/exclusion criteria. A third reviewer (LG) resolved any disagreements through discussion to agree a final list of included and excluded studies.

Eligible studies included: (a) study design: RCTs, to minimize risk of bias and confounding variables; (b) participants: participants with symptoms or risk of GAD within mental health populations or within the general population; we defined this as a certified diagnosis using a standardized diagnostic interview or a score above an accepted cut-off for diagnosable GAD (which may include sub-threshold scores) in standardized questionnaires; (c) interventions: software-based systems and technology platforms designed for patient-facing delivery of a mental health intervention (i.e., an intervention to improve mental health outcomes); (d) comparisons: all comparisons relevant to DIs, even when two or more DIs were compared with each other without other comparators; and (e) outcomes: GAD-specific measures of anxiety or worry (e.g., GAD-7), reported for GAD populations or GAD sub-samples within mixed populations.

We excluded: (i) mixed populations of GAD with other conditions, when the outcomes were not reported separately for GAD subgroups; (ii) technology used as a means for telecommunication (e.g., email, phone or video) without any software-based processing; (iii) software-based systems designed for training of health professionals or for administration without any patient-facing intervention components; and (iv) studies that were only identified as protocols, abstracts, or reviews; these were marked so we could check for RCTs that we may have missed in the database searches.

Data Extraction and Risk of Bias Assessment

Two researchers (DM, HM) independently extracted data from published and unpublished study reports. Data were extracted on the sample, study design, intervention, and comparator characteristics, baseline characteristics, and results. Any discrepancies were resolved by a third reviewer (LG). Risk of bias of each study was assessed using Cochrane's Risk of Bias (RoB) 2.0 (16).

Classification of DIs and Their Alternatives

In order to conduct an evidence synthesis, it was necessary to classify and group the interventions from studies. We developed a classification system for DIs and their comparator interventions and controls, in the following four steps:

a) We conducted a detailed data extraction of intervention/control arm characteristics as reported within our included RCTs for GAD and their relevant linked papers.

b) We identified common and differentiating features of intervention/control arms between RCTs, but also incremental differences between interventions/controls within the same RCT.

c) We consulted the literature and an advisory group of health services researchers and clinicians about intervention features that could be important for clinical outcomes (e.g., amount of interpersonal contact, who offers support to the DI, whether the intervention is available publicly or via referral to specialist services, types of software required or therapeutic approaches used).

d) We applied the classification criteria to each randomisation arm in the included RCTs so that each intervention/control arm was assigned to a classification group.

These 4 steps were iterative; we resolved discrepancies by refining our classification criteria until two reviewers (DM, LG) independently reached the same allocation for every intervention/control they classified. We grouped DIs and their alternatives according to the three criteria below.

Criterion 1–Intervention (I) or Control (C): An Intervention (I) was an action carried out as part of a research protocol for therapeutic purposes, i.e., it was expected to improve clinical symptoms and functioning based on psychological or behavioral theories and preliminary evidence. A Control (C) was a non-therapeutic activity that was not expected to make a clinical difference to the condition; this could be a psychological placebo, an “attention control,” or a change in usual care introduced by the research team to keep participants safe and minimize attrition.

Criterion 2–Digital (D) or Non-Digital (NoD): A Digital Intervention (DI) or a Digital Control (DC) included software programmes to guide patient-facing activities. A Non-Digital Intervention/Non-Digital Control (NoDI/NoDC) did not involve any technology and was delivered by printed materials or during face-to-face meetings, or via telecommunications technology without automated software e.g., consultations by email, skype, or phone.

Criterion 3–Supported (S) or Unsupported (U): Supported interventions/controls included scheduled or regular two-way person-to-person contact (e.g., between service user and clinician or researcher, or peer-to-peer). Unsupported interventions/controls either had no interpersonal contact or included limited ad-hoc interaction (e.g., phoning a helpline with any problems as a one-off). We also classified as unsupported those interventions/controls in which communication was one-way, such as reminders by email, post, or phone.

Based on these three criteria, we mapped DIs and their alternatives into eight groups resulting from the combinations of (I or C) x (D or NoD) x (S or U).

Group 1: Supported Digital Intervention (SDI), e.g., computerized cognitive behavior therapy with phone support; clinician-delivered therapy assisted by virtual reality.

Group 2: Unsupported Digital Intervention (UDI), e.g., internet self-help without any clinician contact, mobile app with automated reminders but without personal interaction.

Group 3: Supported Non-Digital Intervention (SNoDI), e.g., individual or group therapy in a clinic, or therapy delivered by a clinician via phone or an online platform.

Group 4: Unsupported Non-Digital Intervention (UNoDI), e.g., self-help using a treatment manual or a book or a website without clinician input.

Group 5: Supported Digital Control (SDC), e.g., access to a general health education website with weekly check-in calls from a researcher; virtual reality “placebo” environment used in a clinic with a researcher present.

Group 6: Unsupported Digital Control (UDC), e.g., access to an educational website without any support from a person or with just automated reminder emails.

Group 7: Supported Non-Digital Control (SNoDC), e.g., weekly check-in by phone or regular clinical assessment face-to-face without any specific therapy instructions.

Group 8: Unsupported Non-Digital Control (UNoDC), e.g., printed materials with general health advice without any specific therapy instructions.
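
The eight groups above are the full set of combinations of the three binary criteria. As a minimal sketch (the function name and the boolean encoding are illustrative assumptions, not part of the review's methods), the abbreviations can be generated in the same order as Groups 1 to 8:

```python
from itertools import product

def group_label(supported: bool, digital: bool, intervention: bool) -> str:
    """Build an abbreviation such as 'SDI' or 'UNoDC' from the three criteria."""
    support = "S" if supported else "U"       # Criterion 3
    mode = "D" if digital else "NoD"          # Criterion 2
    role = "I" if intervention else "C"       # Criterion 1
    return support + mode + role

# Enumerate the 2 x 2 x 2 combinations: interventions before controls,
# digital before non-digital, supported before unsupported.
GROUPS = [
    group_label(supported, digital, intervention)
    for intervention, digital, supported in product([True, False], repeat=3)
]

for number, label in enumerate(GROUPS, start=1):
    print(f"Group {number}: {label}")
```

Medication (M) and no intervention (NI) sit outside this grid and would need to be handled as separate labels.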

Separate groups were used for medication and no intervention. Medication (M) was any pharmacological agent (pills, injections, etc.) offered as part of a research protocol. No Intervention (NI) included waiting lists and usual care in which there were no additional therapeutic activities and no changes in patient routines. The NI group may still have received medication or consultations as part of routine care, but this would have been equally accessible to all participants in a trial irrespective of group allocation, so the effect would be canceled out across randomisation arms.

Data Synthesis and Statistical Analysis

Over the last two decades, network meta-analysis (NMA) methods (17), also known as mixed treatment comparisons (18, 19), have been developed to synthesize evidence from multiple studies. NMA is an extension to the standard (pairwise) meta-analysis, which pools together the results of studies for one type of intervention compared to one type of alternative (e.g., active treatment or placebo control). An NMA enables the simultaneous comparison of multiple interventions and multiple comparators within a single coherent analysis. Such an approach is routinely used in health technology assessments to inform the optimal intervention strategy for a given medical condition (20). NMAs are often used to inform estimates of clinical and cost-effectiveness and commissioning decisions.

In the NMA, an ANalysis of COVAriance (ANCOVA) modeling framework was used, where a final outcome measurement is synthesized and adjusted for baseline measurements. Compared to the “change from baseline” approach, the ANCOVA model avoids guessing the within-patient correlation across measurements, which is typically not reported in studies. Treatment effect estimates based on ANCOVA methods have been shown to be more efficient, less biased and robust to random baseline imbalance (21–26). Hence, the ANCOVA model is the preferred method for estimating treatment effects from continuous outcomes (27–30).
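
To illustrate why the “change from baseline” approach depends on the within-patient correlation, the sketch below applies the standard variance formula for a difference of two correlated measurements. The standard deviations and correlation values are hypothetical and are not taken from any included trial.

```python
import math

def change_score_sd(sd_baseline: float, sd_final: float, rho: float) -> float:
    """SD of (final - baseline) given the within-patient correlation rho."""
    variance = sd_baseline**2 + sd_final**2 - 2.0 * rho * sd_baseline * sd_final
    return math.sqrt(variance)

# Hypothetical SDs of 5 points at both time points; rho is the unreported quantity.
for rho in (0.0, 0.5, 0.8):
    print(rho, round(change_score_sd(5.0, 5.0, rho), 2))
```

The same pair of reported standard deviations gives quite different change-score precision depending on the assumed correlation, which is the guess that a baseline-adjusted model avoids.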

We adopted a modeling approach in line with the parameterisation for continuous data with normal likelihood and identity link used by Dias et al. (21, 22). Fixed-effects (FE) and random-effects (RE) models (the latter accounting for potential correlation within multi-arm trials) were fit to the data. In the model, patients who did not receive any treatment were assumed to neither improve nor worsen over the duration (i.e., null placebo effect). Furthermore, it was assumed that the effect of the baseline measurement is common across all treatments, implying that when two active treatments are compared in a trial, the baseline effects are offset.

All analyses were conducted within a Bayesian Markov chain Monte Carlo (MCMC) approach, fitted using WinBUGS software version 1.4.3 [Copyright © 2007 Medical Research Council (UK) and Imperial College (UK)] (31) and linked to the freely available software R [version 4.0.2, Copyright © 2020 (32)] through the package R2WinBUGS (33). In all models the MCMC Gibbs sampler was initially run for 10,000 iterations, which were discarded as “burn-in.” Models were then run for at least a further 5,000 iterations, on which inferences were based. Chain convergence was checked using autocorrelation and Brooks-Gelman-Rubin diagnostic plots (34–36). Goodness of fit and model complexity were assessed using the deviance information criterion (DIC) and posterior mean residual deviance (37).
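
As a rough illustration of the convergence check, the sketch below computes the basic Gelman-Rubin potential scale reduction factor for several chains after discarding a burn-in. It is an assumption-laden stand-in: independent normal draws replace real sampler output, and WinBUGS reports a related interval-based Brooks-Gelman-Rubin plot rather than this exact statistic.

```python
import random
from statistics import mean, variance

def potential_scale_reduction(chains):
    """Basic Gelman-Rubin statistic for equal-length chains of one parameter."""
    n = len(chains[0])
    within = mean(variance(chain) for chain in chains)             # W
    between_over_n = variance([mean(chain) for chain in chains])   # B / n
    pooled = (n - 1) / n * within + between_over_n
    return (pooled / within) ** 0.5

def simulated_chain(rng, center, burn_in=10_000, keep=5_000):
    """Stand-in for sampler output: normal draws with the burn-in discarded."""
    draws = [rng.gauss(center, 1.0) for _ in range(burn_in + keep)]
    return draws[burn_in:]

rng = random.Random(1)
agreeing = [simulated_chain(rng, 0.0) for _ in range(3)]
disagreeing = [simulated_chain(rng, center) for center in (0.0, 3.0)]
print(round(potential_scale_reduction(agreeing), 3))
print(round(potential_scale_reduction(disagreeing), 3))
```

Values close to 1 suggest the chains agree; values well above 1 indicate that the chains are sampling different regions and that longer runs are needed.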

We presented the estimated results as relative treatment effect scores and associated 95% credible intervals (CrIs). We estimated the probability of a treatment being the “best” (i.e., being the most clinically effective) (38), and presented rankograms for all interventions, which provide the probabilities of an intervention being ranked 1 (the most effective) to 7 (the least effective). Finally, we reported the surface under the cumulative ranking curve (SUCRA), which is a numerical presentation of the overall ranking of each intervention. SUCRA values range from 0 to 100%, with higher SUCRA values suggesting that a treatment is likely to be better overall (21, 39).
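
The ranking quantities described above can be sketched as follows. This is a minimal illustration using the usual definition of SUCRA as the average of the cumulative rank probabilities over the first a − 1 ranks; the treatment names and draws are hypothetical, and lower scores are assumed to be better, as for a symptom scale.

```python
def rank_probabilities(samples):
    """P(treatment has rank r), r = 1 (best) to a; a lower outcome score is better."""
    names = list(samples)
    n_draws = len(next(iter(samples.values())))
    counts = {name: [0] * len(names) for name in names}
    for i in range(n_draws):
        # Rank the treatments within each posterior draw.
        ordered = sorted(names, key=lambda name: samples[name][i])
        for rank, name in enumerate(ordered):
            counts[name][rank] += 1
    return {name: [c / n_draws for c in counts[name]] for name in names}

def sucra(rank_probs):
    """Mean of the cumulative rank probabilities over the first a - 1 ranks (0 to 1)."""
    a = len(rank_probs)
    cumulative, total = 0.0, 0.0
    for p in rank_probs[:-1]:
        cumulative += p
        total += cumulative
    return total / (a - 1)

# Hypothetical posterior draws of an outcome score for three treatments.
draws = {"A": [4.0, 5.0, 3.5, 6.0], "B": [5.0, 4.5, 5.5, 5.0], "C": [7.0, 6.5, 8.0, 7.5]}
probs = rank_probabilities(draws)
for name in draws:
    print(name, probs[name], round(100 * sucra(probs[name]), 1))
```

Each list of rank probabilities is one rankogram; the probability of being “best” is its first entry, and multiplying the SUCRA by 100 gives the percentage scale used in the text.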

Appendix C gives further details on the analysis (C1), including annotated synthesis WinBUGS code (C2), sample data and initial values for the main model used (C3).

Assessment of Heterogeneity and Consistency

The model was extended to include study-level covariates as potential treatment effect modifiers. This meant that we looked for factors other than the treatment itself, which could have influenced outcomes within each study and may have created differences (heterogeneity) across studies. These factors included disease severity (40), concomitant medication (41) and the presence of comorbidities (42, 43). Meta-regression is the most commonly employed method to explore the influence of particular study-level covariates on the relative effect. To preserve all studies (and treatments), when a covariate was not reported by some studies, we allowed the model to impute missing covariate information (multiple imputation procedure assuming “missing at random”).

We assessed inconsistency to check that all pieces of evidence (from direct and indirect sources) were in agreement. Following guidance by Dias and colleagues (44, 45), inconsistency was assessed by comparing the DIC of our primary analyses (based on NMA models that assume consistency between direct and indirect evidence) and the DICs yielded by inconsistency models (which provide effect estimates based on direct evidence only). Results were assessed for coherence by qualitatively comparing estimates of pairwise ANCOVA meta-analysis (direct) and ANCOVA RE NMA (direct and indirect).
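
The DIC comparison can be sketched as below. DIC is the posterior mean deviance plus the effective number of parameters; the margin used to call two DIC values “similar” is an illustrative assumption and is not a threshold stated in this paper.

```python
def dic(posterior_mean_deviance: float, effective_parameters: float) -> float:
    """DIC = posterior mean deviance + effective number of parameters (pD)."""
    return posterior_mean_deviance + effective_parameters

def favoured_model(dic_consistency: float, dic_inconsistency: float, margin: float = 3.0) -> str:
    """Name the model with the lower DIC, or 'similar' if within `margin` (an assumed cut-off)."""
    difference = dic_consistency - dic_inconsistency
    if abs(difference) < margin:
        return "similar"
    return "consistency" if difference < 0 else "inconsistency"

# Hypothetical values, for illustration only.
print(favoured_model(dic(40.0, 10.0), dic(39.0, 12.0)))
```

A markedly lower DIC for the inconsistency model would suggest that direct and indirect evidence disagree.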

Sensitivity Analysis

We conducted two types of sensitivity analysis. First, we evaluated the sensitivity of the networks to the influence of each individual trial. When network links were informed by more than one trial, we removed each trial one at a time (giving n_j − 1 trials for each analysis, where n_j is the total number of trials in contrast j) and investigated the impact on the probability of each intervention being “best.” Second, we assessed the robustness of the synthesis results by repeating the analysis while excluding all studies of <30 patients.
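
The leave-one-trial-out procedure can be sketched as a simple enumeration of scenarios. The contrast names and trial identifiers below are hypothetical, and refitting the NMA for each scenario is left out of the sketch.

```python
def leave_one_out(trials_by_contrast):
    """For each contrast informed by more than one trial, list every scenario
    in which a single trial is dropped and the others are kept."""
    scenarios = []
    for contrast, trials in trials_by_contrast.items():
        if len(trials) < 2:
            continue  # links informed by a single trial are left intact
        for dropped in trials:
            remaining = [trial for trial in trials if trial != dropped]
            scenarios.append((contrast, dropped, remaining))
    return scenarios

# Hypothetical network: one contrast with three trials, one with a single trial.
network = {"SDI vs NI": ["trial_a", "trial_b", "trial_c"], "M vs SDI": ["trial_d"]}
for contrast, dropped, remaining in leave_one_out(network):
    print(contrast, "without", dropped, "->", remaining)
```

Each scenario would then be re-analysed and the probability of each intervention being “best” compared with the full-network result.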


Included and Excluded Studies

Initial systematic searches of bibliographic databases identified 16,272 records; in addition, 32 records were identified through secondary searches (e.g., citation searching of protocols and abstracts). After duplicates were removed, a total of 8,920 records were screened by title and abstract and 8,560 records were excluded. We retrieved the full text papers for the remaining 377 records and, as a result of further screening, 352 articles were excluded. In total, 21 studies (reported in 25 papers) were included in the review. The PRISMA diagram (Figure 1) summarizes the number of records retrieved and selected at different stages of identification and screening. Appendix D gives a full reference list of the excluded studies grouped according to reasons for exclusion.


Figure 1. PRISMA diagram for the identification and selection of clinical trials relating to digital interventions for generalized anxiety disorder (GAD). Adapted from Moher (46).

Sample Characteristics in RCTs of DIs for GAD

The 21 RCTs included in the review, as detailed in Table 1, were conducted over 10 years between 2009 and 2019 in 10 countries (Sweden, Australia, USA, UK, Canada, Spain, Italy, Ireland, Taiwan, Netherlands) and involved 2,547 randomized participants. Most participants were recruited from the adult general population, except in four studies that recruited students/young adults and one study with over 60s. GAD populations were defined as either meeting the criteria of an established diagnostic tool, such as the Mini-International Neuropsychiatric Interview (MINI) (69), or a score above an accepted cut-off for diagnosable GAD in standardized questionnaires, such as the Generalized Anxiety Disorder-7 item questionnaire (GAD-7) (40).


Table 1. Sample characteristics and outcome measurement in RCTs of digital interventions for GAD.

Risk of Bias Assessment

All but one (52) of the 21 included studies were judged to have a high risk of bias in at least one domain of assessment, for at least one outcome measure. This was largely due to the choice of outcome measurement as all studies used self-reported–albeit standardized–questionnaires. Self-reported outcomes are considered to have a high risk of bias in these studies because participants can rarely be blind to their allocation group. A visual description of the results of the RoB assessment is given in Appendix E, both for each RoB domain across all studies (Appendix E1) and for each study under each RoB domain (Appendix E2).

Classification of Digital Interventions and Comparators

A classification exercise took place to enable consistency across digital interventions and comparators. We classified DIs and their alternatives according to three criteria: (a) whether they were a psychological/behavioral intervention (I) or a non-therapeutic psychological/behavioral control (C); (b) whether they were digital (D) or non-digital (NoD); (c) whether they were supported (S) or unsupported (U). Waiting lists and usual care were classified under no intervention (NI) unless an active component (e.g., monitoring, sham activity) was introduced, in which case the waiting list/usual care was classified as non-therapeutic psychological/behavioral control. An additional classification group was included for pharmacological interventions, called medication (M).

The interventions and controls of the 21 RCTs were allocated to one of the following eight classification groups: medication (M); no intervention (NI); supported digital control (SDC); supported digital intervention (SDI); supported non-digital control (SNoDC); supported non-digital intervention (SNoDI); unsupported digital control (UDC); unsupported digital intervention (UDI). There were no available clinical studies that included unsupported non-digital interventions (UNoDI) or unsupported non-digital controls (UNoDC). Table 2 describes all the interventions and controls included in each classification group for each study.


Table 2. Characteristics and classifications of interventions and controls in RCTs of digital interventions for GAD.

Based on the 8-group classification for GAD RCTs, the majority of DIs studied were supported (SDI; 18 RCTs) and were compared against no intervention (NI; 12 RCTs). Only 3 RCTs evaluated unsupported DIs (UDI): two were web-based CBT (11, 51) and one, by Pham et al. (12), was a mobile game to practice breathing retraining. The only non-digital intervention (NoDI), represented in two RCTs (57, 68), was group therapy (one CBT and one mindfulness-based intervention), and only one RCT included an antidepressant medication, sertraline (49). With regard to the non-therapeutic active controls reported in 8 RCTs, most included a digital element; only one RCT (64) had a non-digital control, in the form of a weekly face-to-face assessment with a research assistant in a lab (SNoDC).

Just over half of the included RCTs (12/21) evaluated CBT. Therapeutic approaches other than CBT included: psychodynamic therapy (47), extinction therapy (48), acceptance and commitment therapy (50), cognitive or attentional bias modification (52, 53, 64), mindfulness (50, 57), relaxation (52, 59), and diaphragmatic breathing (12). The most common technology platform used was a web interface (n = 17). Two studies used VR platforms (57, 59), and two used smartphone apps (12, 64). The study by Repetto et al. (59) also used a mobile interface to enable users to access the VR scenarios at home and included a biofeedback system in one of the study arms.

DIs differed not only in technology platform and therapy type, but also in whether additional interpersonal support was offered as an adjunct to the digital element and, if so, its type. Most DIs included some interpersonal contact by phone or face-to-face with professionals (GPs, therapists, psychologists both students and qualified), non-clinical researchers, or lay people. Only two studies included DIs that were pure self-help without any contact (11, 12). Some DIs were supplemented by standardized emails without regular communication with another person (11, 51, 54).

Selection of Studies and Outcome for the NMA

A total of 45 different outcome measures were reported in the included RCTs, as shown in Appendix F. GAD-7 was used in 14 out of the 21 RCTs to measure symptoms at baseline and outcomes at follow-up. The Penn State Worry Questionnaire (PSWQ) (70, 71) was also reported in 14 studies. Apart from GAD-7 and PSWQ, the two other most frequently reported outcomes were for depression: the Patient Health Questionnaire-9 item (PHQ-9) (72) and the Beck Depression Inventory–version II (BDI-II) (73), reported in 8 and 6 RCTs, respectively (Appendix F1). The Hamilton Anxiety Scale (HAM-A) (74), used in a recent NMA on medication for GAD (7), only appeared once in the included RCTs (Appendix F2).

We focused on GAD-7 as our outcome of choice for the NMA. GAD-7 is a 7-item anxiety scale described in the literature as a valid and efficient tool to screen for GAD and assess symptom severity in clinical practice and research (40).

Our NMA for GAD-7 included 13 studies (Table 1). One study (54) used GAD-7 but it was not included in the meta-analysis because it only reported categorical outcomes (i.e., mild, moderate, severe) rather than continuous scores. The measurement period across studies ranged from 3 to 12 weeks, with longer follow-ups only available for very few studies.

Given the high level of reporting of PSWQ, data for this outcome was also quantitatively synthesized (see Appendix J).

NMA Results for GAD-7 Scores at Follow-Up

Ten direct treatment comparisons were made in the 13 trials included in the GAD-7-based NMA; 4 of the 13 trials were multi-arm trials [three 3-arm trials (49, 59, 63) and one 5-arm trial (11)]. Five comparisons were informed by more than one trial; for these, pairwise ANCOVA meta-analysis was conducted (ANCOVA FE models, and ANCOVA RE models when n > 3).

We constructed a network plot to illustrate which interventions had been compared head-to-head (direct pairwise comparisons) for GAD-7 within the 13 included RCTs. An overview of these pairwise comparisons and synthesized data are shown in Appendix G. The structure of the network for GAD-7 is shown in Figure 2.


Figure 2. Network plot for comparisons between all interventions and controls for GAD populations in RCTs with GAD-7 score. GAD, Generalized Anxiety Disorder; M, medication; NI, No intervention; SDC, Supported Digital Control; SDI, Supported Digital Intervention; SNoDI, Supported Non-digital intervention; UDC, Unsupported Digital Control; UDI, Unsupported Digital Intervention. Line thickness around the node: proportional to the number of patients contributing to each intervention/control group. Line thickness connecting nodes: proportional to the number of patients contributing to each pairwise comparison between interventions/controls. n, number of trials informing each comparison.

Fixed- and random-effects models were employed with minimal difference in mean residual deviances and DIC identified between the models tested. However, posterior estimates of between-study heterogeneity, τ², suggested considerable variability across studies, which was in line with the narrative assessment of the studies. Hence, a random-effects approach was preferred. There was a high degree of uncertainty in the network results, especially in links not informed by direct evidence. Table 3 presents the full results of the NMA based on GAD-7 scores.


Table 3. Full meta-analysis results: network and direct pairwise comparisons between all interventions and controls for post-treatment (12 weeks) GAD-7 scores adjusted for baseline.

Medication (M) was associated with the largest decrease in GAD-7 median scores compared to the other interventions, although uncertainty was high in the NMA estimates, with all 95% credible intervals including zero. These results are driven by the outcomes of a small (n = 21), three-arm trial (49) that compared medication supplemented with scheduled face-to-face meetings with psychologists and GPs against SDC (a general health website with scheduled meetings with psychologists and GPs) and SDI (a web-based CBT self-help programme with scheduled meetings with psychologists and GPs). The adjustment for baseline scores indicated that the baseline effect on the final outcome is small, with a 95% credible interval including zero [change in GAD score: −0.14 (95% CrI −1.10 to 0.82)].

Results of independently pooling direct evidence for each contrast (but not pooling when n = 1) were found to be generally consistent with the NMA results, both in terms of direction and magnitude of the estimates (Table 3, upper-right triangle, shaded). Of note are the differences in the estimates found when applying fixed- and random-effects ANCOVA meta-analysis model on direct evidence for the comparisons of SDIs vs. SDCs (n = 4) and SDIs vs. NI (n = 8), evidencing non-negligible variability across studies and the importance of accounting for between-study heterogeneity.
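To illustrate why fixed- and random-effects pooling can diverge when studies disagree, the following is a minimal sketch using frequentist inverse-variance pooling with the DerSimonian-Laird estimator of τ2. It is not the Bayesian ANCOVA model used in this review, and the effect sizes and standard errors are hypothetical values chosen only for illustration.

```python
import math

def pool_fixed(effects, ses):
    """Inverse-variance fixed-effect pooled estimate and its standard error."""
    weights = [1.0 / se**2 for se in ses]
    total = sum(weights)
    est = sum(w * y for w, y in zip(weights, effects)) / total
    return est, math.sqrt(1.0 / total)

def tau2_dl(effects, ses):
    """DerSimonian-Laird moment estimate of the between-study variance."""
    weights = [1.0 / se**2 for se in ses]
    total = sum(weights)
    fixed, _ = pool_fixed(effects, ses)
    q = sum(w * (y - fixed) ** 2 for w, y in zip(weights, effects))
    c = total - sum(w**2 for w in weights) / total
    return max(0.0, (q - (len(effects) - 1)) / c)

def pool_random(effects, ses):
    """Random-effects pooled estimate, its standard error, and tau^2."""
    tau2 = tau2_dl(effects, ses)
    weights = [1.0 / (se**2 + tau2) for se in ses]
    total = sum(weights)
    est = sum(w * y for w, y in zip(weights, effects)) / total
    return est, math.sqrt(1.0 / total), tau2

# Hypothetical mean differences in GAD-7 (intervention minus control)
effects = [-4.0, -0.5, -3.0, -1.0]
ses = [0.5, 1.0, 1.5, 0.8]

fe_est, fe_se = pool_fixed(effects, ses)
re_est, re_se, tau2 = pool_random(effects, ses)
print(f"FE: {fe_est:.2f} (SE {fe_se:.2f})")
print(f"RE: {re_est:.2f} (SE {re_se:.2f}), tau^2 = {tau2:.2f}")
```

When the estimated τ2 is above zero, the random-effects weights are more even across studies and the pooled standard error is larger, which is the pattern of divergence between the two models described above.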

Based on SUCRA values and rankograms for each intervention, as detailed in Appendices H1, H2, respectively, SDIs were estimated to be more effective (i.e., ranked higher) than UDIs, which included unsupported web-based CBT (11, 51) and an unsupported mobile breathing retraining game (12); however SDIs were less effective than SNoDI, a weekly group mindfulness-based intervention with a therapist (57).
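As a rough illustration of how rankograms and SUCRA values are derived from posterior output, the sketch below ranks interventions within each posterior draw, tabulates rank probabilities, and averages the cumulative rank probabilities. The intervention labels follow Figure 2, but the posterior draws are simulated, hypothetical values rather than results from this review, and lower scores are assumed to be better.

```python
import random

def rank_probabilities(draws, lower_is_better=True):
    """draws maps each treatment to a list of posterior draws (equal lengths).
    Returns, per treatment, a list p where p[r] is the probability of rank r+1."""
    names = list(draws)
    n_iter = len(draws[names[0]])
    counts = {t: [0] * len(names) for t in names}
    for i in range(n_iter):
        ordered = sorted(names, key=lambda t: draws[t][i],
                         reverse=not lower_is_better)
        for rank, t in enumerate(ordered):
            counts[t][rank] += 1
    return {t: [c / n_iter for c in counts[t]] for t in names}

def sucra(rank_probs):
    """SUCRA: mean cumulative rank probability over the first a-1 ranks."""
    out = {}
    for t, probs in rank_probs.items():
        running, total = 0.0, 0.0
        for p in probs[:-1]:
            running += p
            total += running
        out[t] = total / (len(probs) - 1)
    return out

random.seed(0)
n = 5000
# Hypothetical posterior draws of the change in GAD-7 versus no intervention
draws = {
    "NI": [0.0] * n,
    "SDI": [random.gauss(-2.5, 1.0) for _ in range(n)],
    "UDI": [random.gauss(-1.5, 1.5) for _ in range(n)],
    "SNoDI": [random.gauss(-3.5, 2.0) for _ in range(n)],
}
probs = rank_probabilities(draws)   # each row is one treatment's rankogram
scores = sucra(probs)
for name in sorted(scores, key=scores.get, reverse=True):
    print(f"{name}: SUCRA = {scores[name]:.2f}")
```

A SUCRA of 1 would mean an intervention is certain to rank first and 0 that it is certain to rank last; the values always average 0.5 across the interventions in a network.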

Similar analysis was performed on the PSWQ outcome. Results are shown in Appendix J.

Results of Between-Study Heterogeneity and Inconsistency Assessments

Three sources of heterogeneity were considered relevant: disease severity, concomitant medication, and comorbidities. Using data relating to disease severity and comorbidities was not feasible–see Appendix I for further details–thus only data on concomitant medication was included as a covariate in the synthesis modeling. When this covariate is included, the between-study heterogeneity parameter, τ2, is not reduced, suggesting that heterogeneity is not explained by this covariate. Crucially, even if the proportion receiving concomitant medication was identified as an important effect modifier, the meta-regression model is not necessarily suited to detect this intervention-covariate interaction as patients were receiving medication before trial entry. Therefore, medication may have already exerted an effect on patients, being captured by the ANCOVA baseline adjustment component.

Several data loops existed in the network, where both direct and indirect data informed intervention effectiveness estimates; the possibility of inconsistencies was investigated. Table 3 showed no evidence of substantial discrepancies between the direct and the NMA results; given the uncertainty in the data, only very large differences were likely to result in statistical significance. Results of the consistency and inconsistency models indicated the existence of overall model consistency, as detailed in Appendix I.
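The idea of comparing direct and indirect evidence within a loop can be sketched with a simple adjusted indirect comparison (the Bucher method). This is a simplified stand-in for the consistency and inconsistency models reported in Appendix I, and every number below is hypothetical.

```python
import math

def indirect_estimate(d_ab, se_ab, d_cb, se_cb):
    """Indirect A-vs-C effect through common comparator B: d_AC = d_AB - d_CB."""
    return d_ab - d_cb, math.sqrt(se_ab**2 + se_cb**2)

def inconsistency(direct, se_direct, indirect, se_indirect):
    """Difference between direct and indirect estimates, its SE, and z statistic."""
    diff = direct - indirect
    se = math.sqrt(se_direct**2 + se_indirect**2)
    return diff, se, diff / se

# Hypothetical mean differences in GAD-7; A = SDI, B = NI, C = UDI
d_ind, se_ind = indirect_estimate(d_ab=-3.0, se_ab=0.8, d_cb=-1.5, se_cb=1.2)
diff, se_diff, z = inconsistency(direct=-1.0, se_direct=1.5,
                                 indirect=d_ind, se_indirect=se_ind)
print(f"indirect = {d_ind:.2f} (SE {se_ind:.2f})")
print(f"direct - indirect = {diff:.2f} (SE {se_diff:.2f}), z = {z:.2f}")
```

Because the standard errors of the direct and indirect estimates add in quadrature, imprecise inputs yield a wide interval around their difference, so only very large discrepancies would reach statistical significance.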

Similar analysis was performed on the PSWQ outcome. Results are shown in Appendix J.

Sensitivity Analysis Results

The sensitivity of the network to specific studies was investigated. In total, 10 analyses with 12 (rather than the total 13) included studies for GAD-7 were performed, and the probability of each intervention being the best was assessed. The SSRI and group CBT continued to have the highest chances of being “best,” with probabilities of around 43 and 30%, respectively.

Two studies (49, 59) had <30 patients. Excluding these studies from the network also removed medication (M) from the comparator set, altering the network structure. As expected, with the reduction in the number of studies informing the network, the uncertainty in the posterior effect distributions increased further. However, no significant changes were observed compared to the main model results.

Similar analysis was performed on the PSWQ outcome. The ranking of active interventions in terms of median PSWQ score decrease vs. no intervention (NI) was unaltered, although higher score decreases were estimated. Comparing the direction and magnitude of differences in median scores at follow-up between GAD-7 and PSWQ results (where available for both), we make three observations (Appendix K). First, the difference in GAD-7 median scores at follow-up between medication and DIs is the largest across all comparators and favors medication. Second, there were no data available for comparisons between DIs and individual therapy, either face-to-face or by telephone, or between DIs and manualized guided self-help (which is the non-digital counterpart of most DIs). Third, the direction of effect favored SDIs for GAD-7 and UDIs for PSWQ.


Summary and Interpretation

Our systematic review retrieved 21 RCTs of DIs or alternative pathways of care, including no intervention, for GAD. Comparators included in the studies varied. Specifically, interventions or controls could be digital or non-digital and supported or unsupported by clinicians or lay people. The majority of comparisons were between supported digital interventions and no intervention. Using an ANCOVA framework, our main NMA on GAD-7 pooled together post-treatment scores–adjusted for baseline. In addition, the existence of treatment effect modifiers was assessed, several sensitivity analyses were carried out and network consistency evaluated. NMA on the PSWQ outcome was also performed.

Our NMA results suggest that medication is associated with lower anxiety scores at follow-up relative to all other interventions and controls. Medication also ranks first in terms of its likelihood of being most effective, which considers the uncertainty in relative effect estimates. Medication results are based on data from one study. Antidepressant medication as a treatment for GAD is supported by clinical guidelines (75) and previous evidence syntheses. A large NMA (7) of medication against placebo for GAD found that Sertraline (the same antidepressant used in the study by Christensen et al. included in our NMA) improved HAM-A scores by a mean difference of −2.88 (CrI −4.17 to −1.59) from baseline compared to a placebo, based on six trials. Another meta-analysis involving a mixed population (13) favored a combined treatment of psychological therapy and medication for all depressive and anxiety disorders, except GAD, where the direction of effect favored antidepressant medication (Venlafaxine) alone.

Previous reviews of DIs that reported GAD-related outcomes (13, 14) used mixed samples of anxiety disorders and depression, without reporting outcomes separate for GAD subgroups. There are no RCTs with GAD populations comparing DIs with non-digital self-help interventions based on a manual rather than a web-based program. Also, no RCTs compare DIs with individual therapy for GAD, either face-to-face or by telephone; the only available comparisons in the literature are between DIs and group therapy.

Due to very wide confidence intervals, our NMA results were inconclusive as to whether DIs for GAD were better than no intervention or non-therapeutic active controls, or whether they confer an additional benefit to standard therapy. Previous meta-analyses have suggested that supported DIs could be as good as face-to-face therapy across depression, anxiety, and somatic disorders (76, 77). However, because these meta-analyses used mixed samples without separate analysis or reporting for GAD sub-samples, they do not allow any conclusions about the relative efficacy of DIs specific to the treatment and prevention of GAD.

The results for supported vs. unsupported DIs for GAD were counterintuitive, as we would expect supported DIs to rank higher in terms of the likelihood of being “best,” based on a previous meta-analysis in which supported DIs were found to be four times more likely to be effective compared to those without any therapist contact (78). We found that unsupported DIs rank higher than supported DIs in terms of the probability of being best, but vice versa when considering all rankings (SUCRAs). This is consistent with a recent review (79) that reported mixed findings regarding guided vs. unguided DIs and human vs. automated support for DIs. This suggests that the design, content, technology platform or type of reinforcement offered in lieu of personal support in unsupported DIs may be important and account for some of the variability in outcomes.
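A small worked example may help show how the probability of being best and SUCRA can point in different directions. The rank probabilities below are hypothetical and are not taken from Appendices H1 or H2; they are constructed so that one intervention is most often ranked first but also often ranked last.

```python
# Hypothetical rank probabilities: P(rank 1), P(rank 2), P(rank 3)
rank_probs = {
    "UDI": [0.45, 0.05, 0.50],
    "SDI": [0.35, 0.60, 0.05],
    "NI":  [0.20, 0.35, 0.45],
}

def sucra(probs):
    """Average cumulative probability of being among the top 1..a-1 ranks."""
    running, total = 0.0, 0.0
    for p in probs[:-1]:
        running += p
        total += running
    return total / (len(probs) - 1)

p_best = {t: p[0] for t, p in rank_probs.items()}
sucra_scores = {t: sucra(p) for t, p in rank_probs.items()}

top_by_p_best = max(p_best, key=p_best.get)
top_by_sucra = max(sucra_scores, key=sucra_scores.get)
print(top_by_p_best, top_by_sucra)  # UDI SDI
```

In this toy case the first intervention has the highest probability of being best (0.45) but a SUCRA of only 0.475, because half of its probability mass sits on the last rank, while the second has a SUCRA of 0.65. Wide, overlapping posterior distributions of the kind seen in sparse networks can produce exactly this pattern.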

Strengths and Limitations

To our knowledge this is the first ANCOVA NMA model to synthesize evidence on two widely used outcomes for GAD. Our NMA makes best use of all currently available RCT-based evidence on DIs for GAD. Despite the sparse and low-quality data, a statistical synthesis can still be useful for decision-makers in mental health (including healthcare professionals, providers and policy-makers, patients and their families, and the research community) who may be considering the use of DIs for GAD, so that they are properly informed about the current status of the evidence base, know which DIs have been shown to be more effective in reducing GAD and prioritize future research.

There was substantial uncertainty around effect estimates of DIs against alternatives for GAD-7. This is driven by the small number of studies informing most comparisons, the small sample sizes used in some of these studies and their high risk of bias across the evidence base, all limiting our confidence in any observed differences in anxiety scores between intervention, comparators, and control arms. These observed differences may simply be due to chance; but in view of the current evidence base we cannot make clear recommendations about the relative effectiveness of DIs against their comparators.

We have to use caution when interpreting the results of our NMA across all different interventions for GAD. Our review has been completed in the context of DIs; it only included RCTs in which at least one of the randomisation arms was a DI. Therefore, we cannot draw any conclusions about the comparative merit of non-digital interventions (psychological or pharmacological) for GAD when these are considered separately to DIs (for example group CBT vs. medication). To be able to do this, we would need to include RCTs in an NMA that would enable 2nd or 3rd order contrasts (e.g., RCTs comparing no intervention and medication), which was beyond the scope of this review. Also, ranking based on likelihood of being best and on SUCRAs does not reflect differences in effectiveness estimates between interventions and controls and credible intervals, that is, we cannot tell whether the differences between ranking position (e.g., between 1st, 2nd, 3rd, 4th, etc.) are clinically meaningful.

Another point of caution, as with all evidence synthesis of complex interventions, is the pooling of DIs and their alternatives into groups for analysis based on our classification criteria. Any classification implies interpretation and judgement, which is conditional upon the information available from included studies. We note the insufficient reporting of details about “non-therapeutic controls” and waiting list in some studies. Furthermore, we could have split DIs into further categories according to the technology used (e.g., VR, internet, mobile app), or the function of the technology (e.g., adjunct to clinician-delivered therapy vs. patient self-help), or the type of support (e.g., phone calls vs. meetings). This would have created more “nodes” in the NMA models, but also more uncertainty because comparisons between DIs and their alternatives within each subgroup would have been informed by fewer studies.

Many of the included RCTs recruited small samples and involved multiple arms, often comparing different versions of the same intervention, thereby reducing the power of the study. Our evidence synthesis also shows that the majority of RCTs have either a short timeframe for follow-up (up to 12 weeks), or the control group has already crossed to the intervention at the point of a longer follow-up (up to 2 years), which undermines the original randomisation. Consequently, we did not include observations for further follow-up time points, where these were available, nor did we account for time differences in the short-term outcome reporting (post-treatment assessments varied from 3 to 12 weeks). Our NMA results reflect the short-term impact of DIs over an initial treatment period, but there is scant evidence to inform randomized comparisons about effectiveness beyond 12 weeks.


As GAD is the most prevalent and least studied condition among other common mental health problems, future evidence syntheses will be helpful to focus on GAD populations and stratified GAD subgroups where these are randomized within mixed populations, as means of informing GAD-specific future research and clinical guidelines (75). Feasibility and pilot studies, as well as user involvement in the development of the intervention and delivery protocols, could ensure that the final RCT tests the best possible intervention for GAD. Adaptive designs with improved intervention features and boosted recruitment numbers to a fully powered RCT are preferable to the underpowered studies with multiple arms testing increments of the same DI that we have identified.

Our NMAs and previous literature suggest that antidepressants are an important factor to consider in future studies on DIs for GAD. Psychological interventions–whether digital or non-digital–include participants who are taking medication as part of routine care. It is difficult to disentangle the effects of medication and psychological support for GAD, and future RCTs need to report medication details (name, dose, and duration) and include it as a covariate in their analysis to establish how outcomes with DIs and controls are influenced by concurrent medication use.

The evidence base available in this setting is complex. In particular, the sheer volume of anxiety metrics (45 in total) being reported across the available studies, suggests a lack of consensus on which measures to use in evaluating GAD outcomes. Having a consensus about GAD-specific outcome measures can prevent participant fatigue from completing batteries of different questionnaires and enable comparisons across studies and data synthesis. GAD-7 is more sensitive to changes associated with treatment and therefore may be more suitable for longitudinal clinical research (80). Reporting continuous data on the GAD-7 as a common measure in RCTs with GAD populations will make more studies available for a future statistical synthesis. Including HAM-A in studies of psychological therapies will enable us to compare results with pharmacological studies. Future analyses using multivariate models may be able to make better use of the available evidence by borrowing strength across different outcomes.

Many studies that follow up participants over the longer term offer the intervention to those randomized to the control group at a crossover point, potentially biasing any long-term treatment effect (81). Participants are also likely to receive some treatment as part of usual care the longer they remain on waiting lists or non-therapeutic controls, so studies cannot withhold interventions to enable long-term follow-up. As the typical duration of DIs is between 3 and 12 weeks, the follow-up period of future RCTs needs to be longer, for example 6 months, to help us better understand the “stickiness” (longer-term effects) of DIs beyond their initial delivery period. Usual care and waiting lists are poorly reported in RCTs and do not include data on concurrent interventions accessed by participants, including openly available self-help, which can influence the observed difference in outcome between DIs and no intervention. Greater clarity and more detailed reporting about the specific elements of comparators are essential to improve our understanding of the effects of DIs.


This study is the first to evaluate the effectiveness of DIs specifically in a GAD population. It is also the first to combine all the RCT-based effectiveness evidence from DIs and key comparators in a single modeling framework, allowing the estimation of relative treatment effects for all relevant comparisons. Our results suggest that antidepressant medication is associated with lower anxiety scores at follow-up relative to all other interventions and controls. Results were inconclusive as to whether DIs are better than no intervention and non-therapeutic active controls for GAD, or whether they confer an additional benefit to standard therapy. Overall, our findings are limited in informing decision-making, highlighting how little is currently known about the comparative effectiveness of such interventions. Future primary studies and meta-analyses need to focus on GAD populations rather than mixed samples, or report outcomes specifically for GAD sub-samples if they intend to answer questions about the comparative effectiveness of DIs for GAD. Comparing DIs with manualized (non-digital) self-help and individual therapy, for which there are no current RCTs for GAD populations, will be useful in the context of stepped care. Antidepressant medication for GAD as a first-line treatment against DIs deserves further research and economic modeling. To inform commissioning and potential disinvestment from non-digital alternatives, we need to put the findings of this evidence synthesis into context together with an assessment of the costs of developing and implementing DIs in clinical practice.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.

Author Contributions

PS reviewed the clinical evidence, performed the data synthesis and statistical analysis, interpreted findings, wrote the manuscript, and led the work as a whole. LG reviewed the clinical evidence, classified the interventions, interpreted findings, supported the write up of the manuscript, and all aspects of the review. DM identified and selected the studies, extracted study data, assessed their risk of bias, interpreted findings, and reviewed the manuscript. GN supported the data synthesis and statistical analysis and reviewed the manuscript. DJ contributed to the selection of studies, interpreted findings, and reviewed the manuscript. HM identified and selected the studies, extracted study data, assessed their risk of bias, interpreted findings, and reviewed the manuscript. SD performed the searches and reviewed the manuscript. RC provided support on all aspects of the review and reviewed the manuscript. LB provided support on all aspects of the review and reviewed the manuscript. All authors contributed to the article and approved the submitted version.


Funding

This work was supported by the UK's National Institute for Health Research (NIHR), Health Technology Assessment (HTA) Programme, Grant Number 17/93/06, 2019.

Author Disclaimer

This paper presents independent research supported by the National Institute for Health Research (NIHR), Health Technology Assessment (HTA) Programme. The views expressed are those of the authors and not necessarily of the National Health Service, the NIHR, or the Department of Health. The content of this manuscript has previously appeared online as a preprint of the NIHR report:

Conflict of Interest

LG is the fund holder of the research grant that supported this work. GN is employed by IQVIA.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.


Acknowledgments

We thank Prof. Paul McNamee, Prof. Nicola Cooper, Dr. Alrinda Cerga-Pashoja, and Daniel Horne for their guidance and support as members of our advisory/steering group.

Supplementary Material

The Supplementary Material for this article can be found online at:


References

1. Mental Health Foundation. Fundamental Facts About Mental Health 2016. London: Mental Health Foundation (2016).

2. McManus S, Bebbington P, Jenkins R, Brugha T. Mental Health and Wellbeing in England: Adult Psychiatric Morbidity Survey 2014. A Survey Carried Out for NHS Digital by NatCen Social Research and the Department of Health Sciences. Leicester: University of Leicester (2016).

3. World Health Organization. International Classification of Diseases, 11th Revision (ICD-11). Geneva: World Health Organization (2018).

4. Revicki DA, Travers K, Wyrwich KW, Svedsäter H, Locklear J, Mattera MS, et al. Humanistic and economic burden of generalized anxiety disorder in North America and Europe. J Affect Disord. (2012) 140:103–12. doi: 10.1016/j.jad.2011.11.014

5. Hunot V, Churchill R, Teixeira V, de Lima MS. Psychological therapies for generalised anxiety disorder. Cochrane Datab Syst Rev. (2007) 2007:CD001848. doi: 10.1002/14651858.CD001848.pub4

6. Hayes-Skelton SA, Roemer L, Orsillo SM, Borkovec TD. A contemporary view of applied relaxation for generalized anxiety disorder. Cogn Behav Ther. (2013) 42:292–302. doi: 10.1080/16506073.2013.777106

7. Slee A, Nazareth I, Bondaronek P, Liu Y, Cheng Z, Freemantle N. Pharmacological treatments for generalised anxiety disorder: a systematic review and network meta-analysis. Lancet. (2019) 393:768–77. doi: 10.1016/S0140-6736(18)31793-8

8. Aboujaoude E, Gega L, Parish MB, Hilty DM. Editorial: Digital interventions in mental health: Current status and future directions. Front Psychiatry. (2020) 11:111. doi: 10.3389/fpsyt.2020.00111

9. Gega L, Gilbody S. Software-Based Psychotherapy: The Example of Computerized Cognitive-Behavioral Therapy. In: Mental Health in the Digital Age: Grave Dangers, Great Promise. Oxford: Oxford University Press (2015). p. 196–219. doi: 10.1093/med/9780199380183.003.0011

10. World Health Organization. WHO Guideline Recommendations on Digital Interventions for Health System Strengthening. Geneva: World Health Organization (2019).

11. Christensen H, Batterham P, Mackinnon A, Griffiths KM, Hehir KK, Kenardy J, et al. Prevention of generalized anxiety disorder using a web intervention, iChill: Randomized controlled trial. J Med Internet Res. (2014) 16:3507. doi: 10.2196/jmir.3507

12. Pham Q, Khatib Y, Stansfeld S, Fox S, Green T. Feasibility and efficacy of an mHealth game for managing anxiety: “Flowy” randomized controlled pilot trial and design evaluation. Games Health J. (2016) 5:50–67. doi: 10.1089/g4h.2015.0033

13. Cuijpers P, Sijbrandij M, Koole SL, Andersson G, Beekman AT, Reynolds CF III. Adding psychotherapy to antidepressant medication in depression and anxiety disorders: a meta-analysis. World Psychiatry. (2014) 13:56–67. doi: 10.1002/wps.20089

14. Richards D, Richardson T, Timulak L, McElvaney J. The efficacy of internet-delivered treatment for generalized anxiety disorder: a systematic review and meta-analysis. Internet Interventions. (2015) 2:272–82. doi: 10.1016/j.invent.2015.07.003

15. Moher D, Liberati A, Tetzlaff J, Altman DG. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. Ann Intern Med. (2009) 151:264–9. doi: 10.7326/0003-4819-151-4-200908180-00135

16. Sterne JAC, Savović J, Page MJ, Elbers RG, Blencowe NS, Boutron I, et al. RoB 2: a revised tool for assessing risk of bias in randomised trials. BMJ. (2019) 366:l4898. doi: 10.1136/bmj.l4898

17. Lumley T. Network meta-analysis for indirect treatment comparisons. Stat Med. (2002) 21:2313–24. doi: 10.1002/sim.1201

18. Caldwell DM, Ades AE, Higgins JP. Simultaneous comparison of multiple treatments: combining direct and indirect evidence. BMJ. (2005) 331:897–900. doi: 10.1136/bmj.331.7521.897

19. Lu G, Ades AE. Combination of direct and indirect evidence in mixed treatment comparisons. Stat Med. (2004) 23:3105–24. doi: 10.1002/sim.1875

20. Cooper NJ, Peters J, Lai MC, Juni P, Wandel S, Palmer S, et al. How valuable are multiple treatment comparison methods in evidence-based health-care evaluation? Value Health. (2011) 14:371–80. doi: 10.1016/j.jval.2010.09.001

21. Dias S, Sutton AJ, Ades AE, Welton NJ. Evidence synthesis for decision making 2: a generalized linear modeling framework for pairwise and network meta-analysis of randomized controlled trials. Med Decis Making. (2013) 33:607–17. doi: 10.1177/0272989X12458724

22. Dias S, Welton NJ, Sutton AJ, Ades AE. NICE DSU Technical Support Document 2: A Generalised Linear Modelling Framework for Pairwise and Network Meta-Analysis of Randomised Controlled Trials. London: National Institute for Health and Clinical Excellence (2014).

23. Higgins JPT, Cochrane Collaboration. Cochrane Handbook for Systematic Reviews of Interventions (Second edition). Hoboken, NJ: Wiley-Blackwell (2020).

24. Riley RD, Kauser I, Bland M, Thijs L, Staessen JA, Wang J, et al. Meta-analysis of randomised trials with a continuous outcome according to baseline imbalance and availability of individual participant data. Stat Med. (2013) 32:2747–66. doi: 10.1002/sim.5726

25. van Breukelen GJ. ANCOVA versus CHANGE from baseline in nonrandomized studies: the difference. Multivariate Behav Res. (2013) 48:895–922. doi: 10.1080/00273171.2013.831743

26. Winkens B, van Breukelen GJ, Schouten HJ, Berger MP. Randomized clinical trials with a pre- and a post-treatment measurement: repeated measures versus ANCOVA models. Contemp Clin Trials. (2007) 28:713–9. doi: 10.1016/j.cct.2007.04.002

27. Deeks JJ, Higgins JPT, Altman DG. Chapter 9: analysing data and undertaking meta-analyses. In: Higgins JPT, editor. Cochrane Handbook for Systematic Reviews of Interventions. Version 5.0.1. The Cochrane Collaboration. (2008).

28. Fu R, Vandermeer BW, Shamliyan TA, O'Neil ME, Yazdi F, Fox SH, et al. Handling continuous outcomes in quantitative synthesis. In: Methods Guide for Effectiveness and Comparative Effectiveness Reviews. Rockville, MD: Agency for Healthcare Research and Quality (2008).

29. Vickers AJ. The use of percentage change from baseline as an outcome in a controlled trial is statistically inefficient: a simulation study. BMC Med Res Methodol. (2001) 1:6. doi: 10.1186/1471-2288-1-6

30. Vickers AJ, Altman DG. Statistics notes: Analysing controlled trials with baseline and follow up measurements. BMJ. (2001) 323:1123–4. doi: 10.1136/bmj.323.7321.1123

31. Lunn DJ, Thomas A, Best N, Spiegelhalter D. WinBUGS - a Bayesian modelling framework: concepts, structure, and extensibility. Stat Comput. (2000) 10:325–37. doi: 10.1023/A:1008929526011

32. R Foundation for Statistical Computing (2014). R version 3.6.0. Vienna: R Foundation for Statistical Computing.

33. Sturtz S, Ligges U, Gelman A. R2WinBUGS: A package for running WinBUGS from R. J Stat Softw. (2005) 12:1–16. doi: 10.18637/jss.v012.i03

34. Brooks S, Gelman A. General methods for monitoring convergence of iterative simulations. J Comput Graph Stat. (1998) 7:434–55. doi: 10.1080/10618600.1998.10474787

35. Brooks S, Gelman A. Some issues in monitoring convergence of iterative simulations. Dimen Reduct Comput Compl Inf. (1998) 30:30–6.

36. Gelman A, Rubin DB. Inference from iterative simulation using multiple sequences. Stat Sci. (1992) 7:457–72. doi: 10.1214/ss/1177011136

37. Spiegelhalter DJ, Best NG, Carlin BP, van der Linde A. Bayesian measures of model complexity and fit. J R Stat Soc B. (2002) 64:583–616. doi: 10.1111/1467-9868.00353

38. Ades AE, Sculpher M, Sutton A, Abrams K, Cooper N, Welton N, et al. Bayesian methods for evidence synthesis in cost-effectiveness analysis. Pharmacoeconomics. (2006) 24:1–19. doi: 10.2165/00019053-200624010-00001

39. Dias S, Welton NJ, Sutton AJ, Ades AE. NICE DSU Technical Support Document 2: A Generalised Linear Modelling Framework for Pairwise Network Meta-Analysis of Randomised Controlled Trials [Internet]. London: National Institute for Health Care Excellence (NICE) (2014). Available online at:

40. Spitzer RL, Kroenke K, Williams JB, Lowe B. A brief measure for assessing generalized anxiety disorder: The GAD-7. Arch Intern Med. (2006) 166:1092–7. doi: 10.1001/archinte.166.10.1092

41. Berger A, Edelsberg J, Bollu V, Alvir JMJ, Dugar A, Joshi AV, et al. Healthcare utilization and costs in patients beginning pharmacotherapy for generalized anxiety disorder: a retrospective cohort study. BMC Psych. (2011) 11:193. doi: 10.1186/1471-244X-11-193

42. Smith JP, Book SW. Comorbidity of generalized anxiety disorder and alcohol use disorders among individuals seeking outpatient substance abuse treatment. Addict Behav. (2010) 35:42–5. doi: 10.1016/j.addbeh.2009.07.002

43. Pacek LR, Storr CL, Mojtabai R, Green KM, La Flair LN, Alvanzo AA, et al. Comorbid alcohol dependence and anxiety disorders: a national survey. J Dual Diag. (2013) 9:835164. doi: 10.1080/15504263.2013.835164

44. Dias S, Welton NJ, Sutton AJ, Caldwell DM, Lu G, Ades AE. Evidence synthesis for decision making 4: inconsistency in networks of evidence based on randomized controlled trials. Med Decis Making. (2013) 33:641–56. doi: 10.1177/0272989X12455847

45. Dias S, Welton NJ, Sutton AJ, Caldwell DM, Lu G, Ades AE. NICE DSU Technical Support Document 4: Inconsistency in Networks of Evidence Based on Randomised Controlled Trials. London: National Institute for Health and Clinical Excellence (2014).

46. Moher D, Liberati A, Tetzlaff J, Altman DG, The PRISMA Group. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med. (2009) 6:e1000097. doi: 10.1371/journal.pmed.1000097

47. Andersson G, Paxling B, Roch-Norlund P, Ostman G, Norgren A, Almlov J, et al. Internet-based psychodynamic versus cognitive behavioral guided self-help for generalized anxiety disorder: a randomized controlled trial. Psychother Psychosom. (2012) 81:344–55. doi: 10.1159/000339371

48. Andersson E, Hedman E, Wadstrom O, Boberg J, Andersson EY, Axelsson E, et al. Internet-based extinction therapy for worry: a randomized controlled trial. Behav Therapy. (2016) 2:3. doi: 10.1016/j.beth.2016.07.003

49. Christensen H, Mackinnon AJ, Batterham PJ, O'Dea B, Guastella AJ, Griffiths KM, et al. The effectiveness of an online e-health application compared to attention placebo or Sertraline in the treatment of Generalised Anxiety Disorder. Internet Interventions. (2014) 1:169–74. doi: 10.1016/j.invent.2014.08.002

50. Dahlin M, Andersson G, Magnusson K, Johansson T, Sjogren J, Hakansson A, et al. Internet-delivered acceptance-based behaviour therapy for generalized anxiety disorder: a randomized controlled trial. Behav Res Therapy. (2016) 77:86–95. doi: 10.1016/j.brat.2015.12.007

51. Dear BF, Staples LG, Terides MD, Karin E, Zou J, Johnston L, et al. Transdiagnostic versus disorder-specific and clinician-guided versus self-guided internet-delivered treatment for generalized anxiety disorder and comorbid disorders: a randomized controlled trial. J Anxiety Disord. (2015) 36:63–77. doi: 10.1016/j.janxdis.2015.09.003

52. Hazen RA, Vasey MW, Schmidt NB. Attentional retraining: a randomized clinical trial for pathological worry. J Psychiatr Res. (2009) 43:627–33. doi: 10.1016/j.jpsychires.2008.07.004

53. Hirsch CR, Krahe C, Whyte J, Loizou S, Bridge L, Norton S, et al. Interpretation training to target repetitive negative thinking in generalized anxiety disorder and depression. J Consult Clin Psychol. (2018) 86:1017–30. doi: 10.1037/ccp0000310

54. Howell AN, Rheingold AA, Uhde TW, Guille C. Web-based CBT for the prevention of anxiety symptoms among medical and health science graduate students. Cogn Behav Ther. (2018) 1–21. doi: 10.1080/16506073.2018.1533575

55. Johansson R, Bjorklund M, Hornborg C, Karlsson S, Hesser H, Ljotsson B, et al. Affect-focused psychodynamic psychotherapy for depression and anxiety through the Internet: a randomized controlled trial. PeerJ. (2013) 1:e102. doi: 10.7717/peerj.102

56. Jones SL, Hadjistavropoulos HD, Soucy JN. A randomized controlled trial of guided internet-delivered cognitive behaviour therapy for older adults with generalized anxiety. J Anxiety Disord. (2016) 37:1–9. doi: 10.1016/j.janxdis.2015.10.006

57. Navarro-Haro MV, Modrego-Alarcon M, Hoffman HG, Lopez-Montoyo A, Navarro-Gil M, Montero-Marin J, et al. Evaluation of a mindfulness-based intervention with and without virtual reality dialectical behavior therapy mindfulness skills training for the treatment of generalized anxiety disorder in primary care: a pilot study. Front Psychol. (2019) 10:55. doi: 10.3389/fpsyg.2019.00055

58. Paxling B, Almlov J, Dahlin M, Carlbring P, Breitholtz E, Eriksson T, et al. Guided internet-delivered cognitive behavior therapy for generalized anxiety disorder: a randomized controlled trial. Cognit Behav Ther. (2011) 40:159–73. doi: 10.1080/16506073.2011.576699

59. Repetto C, Gaggioli A, Pallavicini F, Cipresso P, Raspelli S, Riva G. Virtual reality and mobile phones in the treatment of generalized anxiety disorders: a phase-2 clinical trial. Pers Ubiquitous Comput. (2013) 17:253–60. doi: 10.1007/s00779-011-0467-0

60. Pallavicini F, Algeri D, Repetto C, Gorini A, Riva G. Biofeedback, virtual reality and mobile phones in the treatment of generalized anxiety disorder (GAD): a phase-2 controlled clinical trial. J Cyber Ther Rehabil. (2009) 2:315–27.

61. Gorini A, Pallavicini F, Algeri D, Repetto C, Gaggioli A, Riva G. Virtual reality in the treatment of generalized anxiety disorders. Annu Rev Cyber Ther Telemed. (2010) 8:31–5.

62. Richards D, Timulak L, Rashleigh C, McLoughlin O, Colla A, Joyce C, et al. Effectiveness of an internet-delivered intervention for generalized anxiety disorder in routine care: a randomised controlled trial in a student population. Internet Interv. (2016) 6:80–8. doi: 10.1016/j.invent.2016.10.003

63. Robinson E, Titov N, Andrews G, McIntyre K, Schwencke G, Solley K. Internet treatment for generalized anxiety disorder: a randomized controlled trial comparing clinician vs. technician assistance. PLoS ONE. (2010) 5:e0010942. doi: 10.1371/journal.pone.0010942

64. Teng M-H, Hou Y-M, Chang S-H, Cheng H-J. Home-delivered attention bias modification training via smartphone to improve attention control in sub-clinical generalized anxiety disorder: a randomized, controlled multi-session experiment. J Affect Disord. (2019) 246:444–51. doi: 10.1016/j.jad.2018.12.118

65. Titov N, Andrews G, Robinson E, Schwencke G, Johnston L, Solley K, et al. Clinician-assisted Internet-based treatment is effective for generalized anxiety disorder: randomized controlled trial. Austr New Zeal J Psychiatry. (2009) 43:905–12. doi: 10.1080/00048670903179269

66. Lorian CN, Titov N, Grisham JR. Changes in risk-taking over the course of an internet-delivered cognitive behavioral therapy treatment for generalized anxiety disorder. J Anxiety Disord. (2012) 26:140–9. doi: 10.1016/j.janxdis.2011.10.003

67. Titov N, Andrews G, Johnston L, Robinson E, Spence J. Transdiagnostic Internet treatment for anxiety disorders: a randomized controlled trial. Behav Res Ther. (2010) 48:890–9. doi: 10.1016/j.brat.2010.05.014

68. Topper M, Emmelkamp PM, Watkins E, Ehring T. Prevention of anxiety disorders and depression by targeting excessive worry and rumination in adolescents and young adults: a randomized controlled trial. Behav Res Ther. (2017) 90:123–36. doi: 10.1016/j.brat.2016.12.015

69. Sheehan DV, Lecrubier Y, Sheehan KH, Amorim P, Janavs J, Weiller E, et al. The Mini-International Neuropsychiatric Interview (MINI): the development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J Clin Psychiatry. (1998) 59(Suppl 2):22–33. doi: 10.1037/t18597-000

70. Meyer TJ, Miller ML, Metzger RL, Borkovec TD. Development and validation of the Penn State Worry Questionnaire. Behav Res Ther. (1990) 28:487–95. doi: 10.1016/0005-7967(90)90135-6

71. Barlow DH. Anxiety and its Disorders: The Nature and Treatment of Anxiety and Panic (2nd ed.). New York, NY: Guilford Press (2002).

72. Kroenke K, Spitzer RL. The PHQ-9: a new depression diagnostic and severity measure. Psychiatr Ann. (2002) 32:509–15. doi: 10.3928/0048-5713-20020901-06

73. Beck AT, Steer RA, Brown GK. Beck Depression Inventory-II. San Antonio. (1996) 78:490–8. doi: 10.1037/t00742-000

74. Maier W, Buller R, Philipp M, Heuser I. The Hamilton Anxiety Scale: reliability, validity and sensitivity to change in anxiety and depressive disorders. J Affect Disord. (1988) 14:61–8. doi: 10.1016/0165-0327(88)90072-9

75. National Institute for Health and Clinical Excellence. Generalised Anxiety Disorder and Panic Disorder in Adults: Management. Clinical Guideline 113. London: NICE (2011).

76. Cuijpers P, Donker T, van Straten A, Li J, Andersson G. Is guided self-help as effective as face-to-face psychotherapy for depression and anxiety disorders? A systematic review and meta-analysis of comparative outcome studies. Psychol Med. (2010) 40:1943–57. doi: 10.1017/S0033291710000772

77. Carlbring P, Andersson G, Cuijpers P, Riper H, Hedman-Lagerlöf E. Internet-based vs. face-to-face cognitive behavior therapy for psychiatric and somatic disorders: an updated systematic review and meta-analysis. Cognit Behav Therapy. (2018) 47:1–18. doi: 10.1080/16506073.2017.1401115

78. Spek V, Cuijpers P, Nyklícek I, Riper H, Keyzer J, Pop V. Internet-based cognitive behaviour therapy for symptoms of depression and anxiety: a meta-analysis. Psychol Med. (2007) 37:319–28. doi: 10.1017/S0033291706008944

79. Shim M, Mahaffey B, Bleidistel M, Gonzalez A. A scoping review of human-support factors in the context of Internet-based psychological interventions (IPIs) for depression and anxiety disorders. Clin Psychol Rev. (2017) 57:129–40. doi: 10.1016/j.cpr.2017.09.003

80. Dear BF, Titov N, Sunderland M, McMillan D, Anderson T, Lorian C, et al. Psychometric comparison of the Generalized Anxiety Disorder Scale-7 and the Penn State Worry Questionnaire for measuring response during treatment of generalised anxiety disorder. Cogn Behav Ther. (2011) 40:216–27. doi: 10.1080/16506073.2011.582138

81. Latimer NR, Abrams KR, Lambert PC, Crowther MJ, Wailoo AJ, Morden JP, et al. Adjusting survival time estimates to account for treatment switching in randomized controlled trials–an economic evaluation context: methods, limitations, and recommendations. Med Decis Making. (2014) 34:387–402. doi: 10.1177/0272989X13520192

Keywords: worry, anxiety, cognitive behavior therapy (CBT), mobile applications, digital, systematic review (sr), meta-analysis

Citation: Saramago P, Gega L, Marshall D, Nikolaidis GF, Jankovic D, Melton H, Dawson S, Churchill R and Bojke L (2021) Digital Interventions for Generalized Anxiety Disorder (GAD): Systematic Review and Network Meta-Analysis. Front. Psychiatry 12:726222. doi: 10.3389/fpsyt.2021.726222

Received: 16 June 2021; Accepted: 04 November 2021;
Published: 06 December 2021.

Edited by:

Cyrus S. H. Ho, National University of Singapore, Singapore

Reviewed by:

Soni Kewalramani, Amity University, India
Darpan Kaur, Mahatma Gandhi Missions Medical College and Hospital, India
Yong Shian Shawn Goh, National University of Singapore, Singapore

Copyright © 2021 Saramago, Gega, Marshall, Nikolaidis, Jankovic, Melton, Dawson, Churchill and Bojke. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Pedro Saramago,