- 1Multisensory Experience Lab, Department of Architecture, Design and Media Technology, The Technical Faculty of IT and Design, Aalborg University, Copenhagen, Denmark
- 2VihTek - Research and Test Center for Health Technologies in the Capital Region of Denmark, Glostrup, Denmark
- 3Caring Futures Institute, College of Nursing and Health Science, Flinders University, Adelaide, SA, Australia
Background: As the elderly population continues to grow, so does the demand for new and innovative solutions to tackle age-related chronic diseases and disabilities. Virtual Reality (VR) has been explored as a novel therapeutic tool for numerous health-related applications. Although findings frequently favors VR, methodological shortcomings prevent clinical recommendations. Moreover, the term “VR” is frequently used ambiguously to describe e.g., video games; the distinction remains vague between immersive VR (IVR) systems and non-immersive VR (NVR). With no distinct demarcation, results of outcome measures are often pooled in meta-analyses, without accounting for the immersiveness of the system.
Objective: This systematic review focused on virtual reality-based rehabilitation of older adults (+60) in motor rehabilitation programs. The review aims to retrospectively classify previous studies according to the level of immersion, in order to get an overview of the ambiguity-phenomenon, and to utilize meta-analyses and subgroup analyses to evaluate the comparative efficacy of system immersion in VR-based rehabilitation.
Methods: Following PRISMA guidelines, we conducted a systematic search for randomized controlled trials, describing virtual rehabilitation or video games interventions for older adults (+60). Main outcomes were pain, motivation, mobility, balance, and adverse events.
Results: We identified 15 studies which included 743 patients. Only three studies utilized IVR. The rest used various NVR-equipment ranging from commercial products (e.g., Nintendo Wii), to bespoke systems that combine tracking devices, software, and displays. A random effects meta-analysis of 10 studies analyzed outcome measures of mobility, balance, and pain. Protocols and dosage varied widely, but outcome results were in favor of immersive and non-immersive interventions, however, dropout rates and adverse events were mostly in favor of the control.
Conclusions: We initialize a call-for-action, to distinguish between types of VR-technology and propose a taxonomy of virtual rehabilitation systems based on our findings. Most interventions use NVR-systems, which have demonstrably lower cybersickness-symptoms than IVR-systems. Therefore, adverse events may be under-reported in RCT-studies. An increased demand for IVR-systems highlight this challenge. Care should be given, when applying the results of existing NVR tools to new IVR-technologies. Future studies should provide more detail about their interventions, and future reviews should differentiate between NVR and IVR.
1. Introduction
By 2050 the world population is projected to reach 9.7 billion people, with older adults ≥65 accounting for approximately one fifth (1.7b) (United Nations, Department of Economic and Social Affairs). Increased life-expectancy implies a higher risk of developing various chronic diseases, including cardiovascular diseases, cancer, dementia, osteoarthritis, and stroke (Christensen et al., 2009; Fontana and Hu, 2014; Kennedy et al., 2014). Consequently, the diagnosis and treatment of these chronic diseases, which often require special care or hospitalizations, leads to rising expenditures for the healthcare systems around the world (United Nations, Department of Economic and Social Affairs). As an approach to prevent these trends, it has been suggested that increased physical activity, as regular exercising provides multiple health benefits, and reduces the risk of obtaining chronic diseases (Duncan et al., 2010; Anderson and Durstine, 2019; De la Rosa et al., 2020).
Despite evidence for the health benefits of keeping active, low motivation is often a challenge when seeking to counteract physical inactivity and sedentary lifestyles, through exercise programs (Teixeira et al., 2012). In the context of rehabilitative interventions, outcomes, and recovery often depend on the patient's motivation, leading programs to suffer from low adherence as a consequence. This has been identified as a challenge within different fields of rehabilitation, including pulmonary rehabilitation (Bourbeau and Bartlett, 2008; Salinas et al., 2011), acute stroke (Maclean et al., 2000), and diabetes (Rizzo et al., 2011).
Novel technologies such as active video games and virtual reality (VR) technologies, when used appropriately, have the potential to solve some of the challenges with low motivation and adherence. However, implementation into clinical practice has not yet been fully realized. On the other hand, VR has seen a commercial breakthrough within the last 5 years, and steadily increased the technological awareness of consumers and health professionals (Keshner et al., 2019). Within this field of rehabilitation, the therapeutic effects and value of VR technologies has been evaluated and scrutinized for over two decades, often under the general term of Virtual Rehabilitation (Burdea, 2003; Tieri et al., 2018). Within this highly specialized and diverse field, VR has successfully been applied to rehabilitation for adults with simple phobias (Rothbaum et al., 2006; Parsons and Rizzo, 2008; Powers and Emmelkamp, 2008; Maples-Keller et al., 2017); Post-traumatic stress disorder (PTSD); (Rothbaum et al., 2001; Difede et al., 2007; Kothgassner et al., 2019); acute and chronic pain treatment (Gold et al., 2005; Hoffman et al., 2008; Li et al., 2011; Pourmand et al., 2018; Matamala-Gomez et al., 2019; Wittkopf et al., 2020); post-stroke treatment, brain injury, and various other forms of neurological disorders (Rizzo et al., 2004; Rose et al., 2005; Stewart et al., 2007; Laver et al., 2017; Karamians et al., 2020).
For motor rehabilitation as an example, advantages with immersive characteristics of VR include how the sense of presence can induce an illusion of virtual body ownership and agency through multisensory feedback (Kilteni et al., 2015). The sensorimotor loops needed for motor rehabilitation can be strengthened through the introduction of a virtual context, to connect relevant cognitive associations to otherwise isolated repetitive motor tasks (Tieri et al., 2018). This is highly relevant in rehabilitation to reestablish cognitive function processes as motor skills, for instance with stroke patients (de Bruin et al., 2010). For geriatric rehabilitation, virtually augmented exercise is similarly proposed to influence cognitive abilities, for instance in cases including dementia (Garcia-Betances et al., 2015).
Nevertheless, VR remains an umbrella term within the field of rehabilitation, used to describe many and vastly different technologies, from “non-immersive” single desktop displays to “immersive” high fidelity motion-sensing input devices and wearable technologies such as head-mounted displays (HMDs) (Tieri et al., 2018). Hardware aside, variations between software solutions used to study the efficacy of “VR-based” rehabilitation (Burdea, 2003) (VRBR) is equally pluralistic. Hence, attempting to define VR, entails a certain ambiguity across a large body of research. However, interventions rarely use immersive VR (IVR)-technology as a facilitator (Tieri et al., 2018).
1.1. Current Systematic Reviews
In systematic reviews exploring the efficacy of virtual systems, VR is likewise ambiguously defined, and is frequently specifically defined as the use of commercial non-immersive consoles such as Nintendo Wii (Donath et al., 2016). Systematic reviews have explored the use of VR for improving mobility and balance (Donath et al., 2016; Neri et al., 2017; Amorim et al., 2018; Porras et al., 2018), physical functioning (Molina et al., 2014) and in general, to improve health-related domains (Miller et al., 2014). However, included articles frequently only describe interventions using NVR; again, most commonly using the Nintendo Wii (Miller et al., 2014; Molina et al., 2014; Amorim et al., 2018; Reis et al., 2019). For example, of the 10 articles included in Amorim et al.'s review (Amorim et al., 2018), 6 use Nintendo Wii console, while the remaining used Playstation EyeToy (n = 1), Xavis measured step system (n = 1), or bespoke systems with pressure mats or balance boards (n = 2). Likewise, of the 13 articles included in the review by Molina et al. (2014), most used Nintendo Wii (Fit) (n = 8), Balance Rehabilitation Unit from Medicaa (n = 1), or video games or bespoke systems (n = 4). Indeed, in a recent review by Karamians et al. (2020), exploring the effectiveness of VR and gaming-based interventions for UE post-stroke rehabilitation, only three of the included 38 articles described IVR technology. And while the authors are well aware of the distinguishing features of VR-systems (Karamians et al., 2020), this crucial differentiation may easily be lost, if the review is included in future syntheses. While findings frequently demonstrate a significant improvement in favor of virtual rehabilitation (for example Neri et al., 2017, P < 0.01), the quality of the evidence is often low with a high risk-of-bias (RoB) (Laver et al., 2012; Donath et al., 2016; Neri et al., 2017; Amorim et al., 2018). Therefore, the need remains to explore the efficacy of virtual rehabilitation in larger and better controlled studies.
Previous attempts have sought to delimit VR, by simply referring to devices which utilize immersive technology (Iosa et al., 2012; Rizzo and Koenig, 2017; Tieri et al., 2018). However, “immersion” has likewise seen its share of ambiguous usage, and is often confused with related terms, such as presence (Nilsson et al., 2016). VR-systems of high fidelity (e.g., HMDs), are usually referred to as fully immersive VR, or simply immersive VR (IVR). Lower fidelity systems are in these cases mostly referred to as non-immersive VR (NVR). For clarification, we will outline these aspects, before commencing the review's methodology.
1.2. Defining Immersion
VR can be described as a computer-generated interactive virtual environment. The defining feature separating VR from traditional media, is arguably VR's ability to give users a compelling illusion of “being there” in virtual environments. This illusion is often referred to as presence or place illusion (Slater, 2009), and has been described as the subjective correlate of immersion (Slater and Sanchez-Vives, 2016). Place illusion describes the subjective experience of a user, whereas immersion relates to objective characteristics of the system used to deliver this experience. The more immersive a system is, the higher degree of presence it can elicit. Immersive systems have been characterized based on the sensorimotor contingencies (SCs) they support (Slater, 2009). SCs are the actions a person can perform in order to perceive the world (e.g., changing one's gaze direction by moving the head or eyes, or kneeling to see underneath something). The level of immersion supported by a given VR-system depends on how well it supports normal SCs. Therefore, in this review, when discussing immersion, we operate with the term “system immersion” (Nilsson et al., 2016). A number of factors related to both displays and tracking can affect system immersion, however, for the purpose of the current review, we adopt a simple dichotomous categorization with respect to immersion. Level of immersion is distinguished between two broad categories of systems: immersive systems and non-immersive systems. Immersive systems allow users to view virtual content in all directions (i.e., they have an unlimited field of regard, FOR), even though the field of view (FOV) usually is smaller than the users visual field. Contrarily, non-immersive systems only offer a limited FOR and a limited FOV (e.g., screen- and projection-based systems).
1.3. Specific and Non-specific Systems
When the Nintendo Wii launched in 2006, it quickly became an affordable closed system, that supported physical activity with games and entertainment, with researchers soon after applying it to physical therapy programs (Deutsch et al., 2008). This caused a shift from bespoke systems (i.e., software and hardware solutions created for specific users and contexts) toward commercially available solutions (Keshner et al., 2019). A recent systematic review exploring types of VR applications within rehabilitation, characterize these different systems dichotomously as either specific (systems specifically built for rehabilitation) or non-specific (i.e., computerized systems meant for recreational activities and gaming) (Maier et al., 2019). However, systems can be simultaneously commercial and specific. This is evident from the increasing amount of companies developing high-end equipment, where gamification principles are embedded into the therapy (IREX, VRRS-systems, and others; Maier et al., 2019). Systems can also be custom-built from existing hardware and software, tailored for specific needs (i.e., bespoke systems). We argue that a distinction has to be made between commercial and bespoke systems, since low availability and accessibility of certain VR-systems challenges the reproducibility of findings or clinical applications. This is most commonly a trait of bespoke systems, which are usually developed in closed ecosystems, specifically to solve contextual challenges. Conversely, commercially available "off-the-shelf" systems can more reliably reproduce results. This means that cross-study analyses would gain a homogeneous data sets, and that any heterogeneity found in e.g., meta-analyses, would more confidently be attributed to sampling error.
1.4. The Potential Challenges of Ambiguous Classifications
The caveat to IVR and a main reason why a clear distinction is important for systematic reviews, is how the technology leads to demonstrably larger levels of side-effects, when compared to conventional displays (Sharples et al., 2008; Kim et al., 2014; Dennison et al., 2016; Chang et al., 2020). These side effects are also known as VR-sickness, cybersickness, VR-induced symptoms, and effects (VRISE) (Sharples et al., 2008), or visually induced motion sickness (Rebenitsch and Owen, 2016). In a study from 2008, Sharples et al. compared side-effects between different display technologies, including HMD, desktop monitor and projection screens (Sharples et al., 2008). The results indicated a significant increase in nausea symptoms when using HMD, compared to desktop and projection screens. Technology has progressed substantially since 2008, by including improved frame rate- and refresh rates frequencies to accommodate human-eye resolution and sensorimotor contingencies (LaViola, 2000). However, cybersickness remains an unsolved problem with IVR technology. A commonly used measure of VR-sickness is the simulator sickness questionnaire (SSQ) (Kennedy et al., 1993). While ironically not developed for IVR, it is a frequently used, standardized and validated measure of the severity of symptoms related to nausea, oculomotor disturbances and disorientation, while using a VR-system. It has also previously been used to measure adverse events related to VRBR-use (Dahdah et al., 2017), although often reported incorrectly. Additionally, it has been suggested that only IVR-technology should be defined as VR (Tieri et al., 2018). More specifically, solutions utilizing non-immersive technologies to facilitate an immersive and interactive digital environment. We agree with Tieri et al. (2018), therefore, another aim our work is to propose a model to better classify the use of VR-equipment in clinical contexts.
This review seeks to distinguish between the broader uses of VR, which encompasses non-immersive VR (NVR), for example video games and consoles such as Nintendo Wii, and the more discrete use of IVR, where the “immersion” is a property of the technical system (Nilsson et al., 2016) such as with HMDs. Like Tieri et al. (2018), we believe taxonomic consistency is more pertinent now than it was previously, as the availability of commercial IVR-systems will continue to increase the demand for clinical applications. Paradoxically, the evidence in favor of the safety, affordance, feasibility, efficacy, and implementation within clinical use, is still in its infancy. Furthermore, a classification of VRBR solutions for clinical application is needed, to frame such evidence and to allow practitioners suitable awareness, before including solutions into daily practice. And since the geriatric population is the largest group with rehabilitation needs, this is a good place to commence.
1.5. Research Questions
This review focuses on VRBR of older adults (+60) in motor rehabilitation programs. The aim of this review is to:
1. Retrospectively classify previous studies according to level of immersion, in order to get an overview of the ambiguity-phenomenon.
2. Utilize meta-analyses and subgroup analysis to determine outcome effect variations between IVR and NVR
3. Evaluate the comparative effectiveness of system immersion in IVR and NVR systems
4. Analyze comparative risks and adverse events between IVR and NVR systems.
2. Methods
The systematic review protocol was registered in PROSPERO (ID: CRD42019121172), and the reporting of the review was conducted following the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines (Liberati et al., 2009).
2.1. Eligibility Criteria
The selection of studies were conducted based on prespecified PICOS (participants, interventions, comparisons, outcomes, and study design) (Liberati et al., 2009):
• Participants: older adults (≥60 years old).
• Intervention: VR-based motor rehabilitation (e.g., for non-neurological muscular dysfunction, replacement surgery, prosthetic adaptation, or traumatic injuries).
• Comparison: conventional therapy or usual care.
• Outcome: Mobility and balance, motivation, pain, and adverse events (e.g., cybersickness, fall injuries, dizziness, eye strain, or other reported adverse incidents).
• Study design: Randomized Controlled Trials (RCT). Both parallel - and crossover groups.
2.2. Information Sources
The systematic search was undertaken on the following databases: PubMed/MEDLINE, Web of Science, CINAHL, Cochrane Library, and EMBASE to find articles describing randomized controlled trials (RCT), published in English, Danish, Swedish, or Norwegian.
2.3. Search
The search strategy was developed by and approved by all authors. The searches were performed by SMF and SWH who (1) extracted studies from the databases into EndNote; (2) performed duplicate removal; (3) uploaded them into Covidence for screening. All databases were searched from inception to the 30th April 30, 2020.
Search strings were adapted to fit each database individually using boolean search operators and limited to Human studies and Randomized Controlled Trials whenever possible. Search themes included rehabilitation or physical therapy using virtual reality, "exergames" or video games. A broad search for video games as well was to not only search for interventions describing non-immersive applications. A full list of all searches performed can be found in the Supplementary Material.
2.4. Study Selection
Title and abstract screening was performed by ERH and TMP; articles were excluded based on the following criteria: no full text available, wrong language, not peer-reviewed, wrong study design, wrong study population (participants are healthy adults, under the age of 60 or principle diagnosis was neurological), wrong outcomes or wrong setting.
A blinded full-text screening was performed independently by two reviewers (ERH and TMP). Conflicts were resolved by ERH and TMP through discussion, or with arbitration by third reviewer (KBJ).
2.5. Data Collection Process
Quantitative data was extracted from the included studies by pairwise independent reviewers (ERH, TMP, KBJ, JPE, and NCN) using a standardized data extraction form, which was presented and agreed upon at a joint meeting. Inter-rater conflicts in the data extraction process was resolved by ERH and TMP in consensus.
2.6. Assessment of Risk of Bias in Included Studies
We utilized the Cochrane Collaboration's RoB Tool (Liberati et al., 2009) to evaluate the methodological quality of the included studies. RoB assessment was independently performed by two paired reviewers (ERH, TMP, NCN, JPE, and KBJ). Conflicts were resolved by ERH and TMP.
2.7. Data Analysis
Results from the different trials were pooled using RevMan 5.4 (The Nordic Cochrane Centre, The Cochrane Collaboration, 2020). The primary outcomes for balance was determined as either timed tasks such as Timed Up and Go (TUG), or composite scores such as the Berg's Balance Scale (BBS). For functional mobility, the outcomes were limited to timed tasks such as the six minute walk test (6mWT) or the 10 meter walk test (10mWT). Pain-measurements were limited to standardizable self-reported uni-dimensional measures, such as visual analog scales (VAS), numerical rating scales (NRS). To estimate effect sizes of outcomes, we used standard mean difference (SMD) for different (or variations of the same) instruments, including VAS and BBS. For studies using similar instruments the mean difference (MD) was used.
Random-effects were used for all analyses, as the included populations were likely not functionally equivalent, since the interventions described different outcomes and patient populations. Variability within studies are reported in forest plots. Subgroup analysis to determine differences between IVR and NVR studies were performed. Due to the low amount of IVR-studies, subgroup analysis was only undertaken for pain. Heterogeneity was assessed individually for each outcome and considered insignificant if the I2 value was beneath a moderate level >50% as suggested by Higgins and Thompson (2002). Effects are considered statistically significant if p ≤ 0.05. All analyses use End of Treatment (EoT) scores.
3. Results
3.1. Study Identification
Through the different databases, we identified 3,507 articles matching the search strategy. No additional records were identified. After removing duplicates, 2,202 articles were screened, and 2,039 articles were excluded based on title and abstract, because they did not match the inclusion criteria. The full search strategy is outlined in Figure 1. Many of the excluded articles described interventions that did not include virtual rehabilitation. Among the most frequent occurrences were interventions with cold-water immersion. One hundred and sixty-three articles published between 1999 (Kim et al., 1999) and April 2020 were assessed for eligibility and 15 articles (n = 743) satisfied the inclusion criteria and 10 (n = 555) articles satisfied the correct outcomes required for conducting a meta-analysis.
 
  Figure 1. The PRISMA flow diagram for the systematic review detailing the selection process of the included RCT studies.
Of the 15 included articles describing RCT interventions (see Table 1), two studies used specific commercial IVR technology (Duque et al., 2013; Gianola et al., 2020), one study used a specific commercial NVR system (Nirvana VR system) (Yeşilyaprak et al., 2016) and one intervention used a bespoke specific system (Jin et al., 2018). Six studies used NVR non-specific commercial systems, either Nintendo Wii Fit (Laver et al., 2012; Fu et al., 2015; Sobral Monteiro-Junior et al., 2015; Kwok and Pua, 2016; Morone et al., 2016; Tsang and Fu, 2016) or Sony Playstation with Eye Toy (Lee and Shin, 2013), and four studies used specific bespoke systems (Schwenk et al., 2016; Mugueta-Aguinaga and Garcia-Zapirain, 2017; Oesch et al., 2017; Anson et al., 2018). Of the included studies, 13 referred to "virtual reality" while only two did not mention virtual reality, but referred to "exergames" (Mugueta-Aguinaga and Garcia-Zapirain, 2017; Oesch et al., 2017). All IVR studies referred to the intervention as "VR" (Duque et al., 2013; Jin et al., 2018; Gianola et al., 2020), and four NVR studies referred to the intervention as "VR" (Lee and Shin, 2013; Sobral Monteiro-Junior et al., 2015; Kwok and Pua, 2016; Yeşilyaprak et al., 2016) while the remaining studies described the intervention as interactive gaming (Laver et al., 2012) or exergames (Mugueta-Aguinaga and Garcia-Zapirain, 2017; Oesch et al., 2017); visual feedback (Anson et al., 2018) or sensor-based balance training (Schwenk et al., 2016) and Wii Fit Training/Wii exercise (Fu et al., 2015; Morone et al., 2016; Tsang and Fu, 2016). Two articles used "virtual reality" in the title or abstract, but provided very limited mention of VR in the full-text (Tsang and Fu, 2016; Anson et al., 2018), and VR was not mentioned as part of the intervention. Authors were contacted for two articles to clarify system specific details. Both contacts responded within 4 months.
Participants and settings varied across the included articles. Four articles recruited older adults living in residential aged care (Fu et al., 2015; Tsang and Fu, 2016; Yeşilyaprak et al., 2016; Mugueta-Aguinaga and Garcia-Zapirain, 2017). Older adults with balance difficulties living in the community were the focus of three articles (Duque et al., 2013; Morone et al., 2016; Anson et al., 2018). Two articles recruited participants following total knee arthroplasty (Jin et al., 2018; Gianola et al., 2020). Frail older adults were recruited in one study (Kwok and Pua, 2016). A range of different diagnoses in the inpatient setting were the focus of two studies (Laver et al., 2012; Oesch et al., 2017). Chronic conditions such as diabetes mellitus (Lee and Shin, 2013), peripheral neuropathy (Schwenk et al., 2016), and chronic low back pain (Sobral Monteiro-Junior et al., 2015). Dosage of the interventions varied as well between 20 and 90 min per session (mean ± SD: 43.7 ± 19.8), between 1 and 5 weekly sessions (2.7 ± 1) for durations between 3 and 12 weeks (6.6 ± 2.7).
3.2. Mobility and Balance
A range of outcome-measures were used. The most commonly used outcomes for mobility and balance were Timed Up and Go (TUG) (Laver et al., 2012; Lee and Shin, 2013; Kwok and Pua, 2016; Tsang and Fu, 2016; Yeşilyaprak et al., 2016; Anson et al., 2018), Berg Balance Scale (BBS) (Laver et al., 2012; Lee and Shin, 2013; Morone et al., 2016; Tsang and Fu, 2016; Yeşilyaprak et al., 2016; Anson et al., 2018), and six-minute walk test (6MWT) (Kwok and Pua, 2016; Anson et al., 2018). Other measures included posturography (Duque et al., 2013; Schwenk et al., 2016; Oesch et al., 2017), short physical performance battery (SPPB) (Duque et al., 2013; Fu et al., 2015; Mugueta-Aguinaga and Garcia-Zapirain, 2017), and fall risk (Fu et al., 2015). Six studies were included in the meta-analysis for the overall effect of NVR (VFB treadmill, Nintendo Wii, Playstation 2 + EyeToy, NIRVANA VR Interactive System) on TUG scores as a measure of dynamic balance (Figure 2). The mean time to complete the TUG ranged from 9.1 (±1.1) s to 28.86 (±11.71) s across the studies. The MD between experimental and control groups ranged from 0.23 to 1.11 s. Considerable heterogeneity was found between studies (Tau2 = 0.08, Chi2 = 15.50, df = 5, I2 = 68%, df = 5). Compared to the control group, the SMD between groups on TUG scores was significantly greater for the VR group, demonstrating a significant treatment effect (Z = 1.94, p = 0.05).
 
  Figure 2. Forest plot of outcome measures for balance assessment using timed tasks, specifically timed up-and-go (TUG), for VR-based therapy vs. control (EoT).
Six studies were included in the meta analysis for the overall effect of NVR (VFB treadmill, Nintendo Wii Fit, Playstation 2 + EyeToy, NIRVANA VR Interactive System) on BBS scores as a measure of balance (Figure 3). The mean BBS scores ranged from 30.1 (±8.84) to 53.41 (±1.49) across the studies. The MD between experimental and control groups ranged from 0 to –2.49 s. Considerable heterogeneity was found between studies (Tau2 = 0.96, Chi2 = 9.03, df = 5, I2 = 45%, df = 5). Compared to the control group, the SMD between groups on BBS scores was significantly greater for the VR group, demonstrating a significant treatment effect (Z = 4.02, p ≤ 0.0001).
 
  Figure 3. Forest plot of outcome measures for balance assessment using Berg Balance Scale (BBS) for VR-based therapy vs. control (EoT).
Two studies were included in the meta-analysis for the overall effect of NVR (VFB treadmill, Nintendo Wii Fit) on 6MWT scores (Figure 4). The mean 6MWT scores ranged from 323.7 (±25.9) s to 387.8 (±70.8) s across the studies. Considerable heterogeneity was found between studies (Tau2 = 0.00, Chi2 = 0.73, df = 1, I2 = 0%). Compared to the control group, the SMD between groups on 6MWT scores was not significantly different between the VR group and control group (Z = 1.85, p = 0.06).
 
  Figure 4. Forest plot of timed tasks, specifically 6MWT, for functional mobility outcome measures for VR-based therapy vs. control (EoT).
3.3. Motivation
Adherence, enjoyment and motivation were measured in only one study (Oesch et al., 2017). The primary outcome of the study was the adherence to exercise as measured by the duration of exercise each day. Motivation and enjoyment were measured after each training using a five-point Likert scale. Each of these outcomes, adherence, motivation, and enjoyment were found to be favored in the conventional exercise group. Another study measured game satisfaction on a custom dichotomous scale with direct "yes/no" questions, but only for the experimental group (Mugueta-Aguinaga and Garcia-Zapirain, 2017).
3.4. Pain
Two studies were included in the meta-analysis for the overall effect of IVR (Khymeia VRRS, HTC Vive) on pain scores for patients following total knee arthroplasty. The SMD between groups on pain scores was significantly greater for the VR group, demonstrating a significant treatment effect (Z = 2.57, p = 0.01). Two studies were included in the meta-analysis for the overall effect of NVR (Nintendo WiiFit) on pain scores for people with chronic low back pain and balance disorders. There was no significant difference between the NVR and control groups (Z = 0.78, p = 0.44). Comparison of IVR and NVR demonstrated no significant difference in overall treatment effect between type of VR (Z = 1.03, p = 0.02) (Figure 5).
 
  Figure 5. Forest plot of studies measuring pain intensity through unidimensional measures, using a visual analog scale (VAS), for VR-based therapy vs. control (EoT).
3.5. Adverse Events and Dropouts
Two studies mentioned that adverse events were observed during the intervention (Laver et al., 2012; Oesch et al., 2017), and five articles mentioned that no adverse events were detected (Kwok and Pua, 2016; Schwenk et al., 2016; Yeşilyaprak et al., 2016; Anson et al., 2018; Gianola et al., 2020). The majority of the studies made no specific mention of adverse events (Duque et al., 2013; Lee and Shin, 2013; Fu et al., 2015; Sobral Monteiro-Junior et al., 2015; Morone et al., 2016; Tsang and Fu, 2016; Mugueta-Aguinaga and Garcia-Zapirain, 2017; Jin et al., 2018).
Dropouts varied across studies and interventions (Figure 6). The absolute risk of the experimental group was 41 vs. 20 for the control groups. The weighted average dropout rate was 10% for experimental groups and 5% for control groups. The highest number of dropouts were seen in Oesch et al. (2017) 11/28 in the experimental group, whereas the authors identify 7 dropouts related to the treatment either due to dislike (n = 5) or experience of pain (n = 2).
3.6. Risk of Bias Assessment
The risk of bias analysis was performed to assess the methodological quality of the articles included in the quantitative synthesis. The resulting summary can be seen in Figure 7. No additional sources of bias were discovered.
 
  Figure 7. Risk of bias (RoB) summary of included studies (only quantitative synthesis). Icon explanations: green "+",low risk of bias; red "-", high risk of bias; yellow "?", uncertain risk of bias.
3.6.1. Selection Bias
All studies except two (Kwok and Pua, 2016; Jin et al., 2018) described a random component when allocating participants to groups (sequence generation). However, six of the included studies did not conceal allocation when assigning participants to the intervention groups (allocation concealment) assessed as a high risk (Lee and Shin, 2013; Sobral Monteiro-Junior et al., 2015; Tsang and Fu, 2016; Yeşilyaprak et al., 2016; Anson et al., 2018; Jin et al., 2018).
3.6.2. Performance Bias
Blinding of participants and personnel is almost always impossible in physical health research (Karanicolas et al., 2010), which is also reflected in all of the study receiving a high risk assessment.
3.6.3. Detection Bias
While blinding of participants or personnel is impossible, blinding of outcome assessors is still feasible. However, half of the studies (Lee and Shin, 2013; Morone et al., 2016; Tsang and Fu, 2016; Yeşilyaprak et al., 2016; Jin et al., 2018) reported that outcomes were assessed by the same people who performed the experiment which we estimate as a high risk. One study (Kwok and Pua, 2016) did not specify details (unknown risk) and four studies (Laver et al., 2012; Sobral Monteiro-Junior et al., 2015; Anson et al., 2018; Gianola et al., 2020) took steps to blind outcome assessors (low risk).
3.6.4. Attrition Bias
Concerning incomplete outcome data, three studies had no missing data (Morone et al., 2016; Tsang and Fu, 2016; Jin et al., 2018), (low risk, see also Figure 6). Two studies reported dropouts, but outcomes were calculated based the number of participants, and intention-to-treat was used to account for missing data (Laver et al., 2012; Morone et al., 2016) (low risk). Two studies (Lee and Shin, 2013; Anson et al., 2018) had a low and slightly disproportionate dropout-rate in favor of the control group, however performed a per-protocol analysis (uncertain risk). Three studies (Sobral Monteiro-Junior et al., 2015; Yeşilyaprak et al., 2016; Gianola et al., 2020) had moderate dropouts, disproportionately in favor of the control group, and conducted a per-protocol analysis with no attempts at an intention-to-treat analysis (high risk).
3.6.5. Reporting Bias
We did not compare trial protocols with published outcomes, therefore intervention effects could be overestimated. Selective outcome reporting was assessed based on whether or not the articles made a reference to an existing protocol. Only three studies referenced a prospectively registered trial protocol (Sobral Monteiro-Junior et al., 2015; Anson et al., 2018; Gianola et al., 2020), two studies were retrospectively registered (Laver et al., 2012; Kwok and Pua, 2016), and five studies made no reference to a protocol receiving a high risk assessment (Lee and Shin, 2013; Morone et al., 2016; Tsang and Fu, 2016; Yeşilyaprak et al., 2016; Jin et al., 2018). We justify an unknown risk for retrospectively registered trials because reported outcomes in the registry technically could reflect findings of the study, which could also indicate overestimated intervention effects.
3.7. Frequency and Classification of Publications With IVR Interventions
To further gain an understanding of the inconsistent use of “VR” as a term to describe technological systems, articles excluded in the full text screening (n = 163), were reviewed. After excluding protocols, wrong study design, and “no full text available” articles, the following was extracted from the articles (n = 108): information such as intervention description, immersion type, specificity, and availability of software/hardware (commercial or bespoke), the equipment used, and how the authors describe the intervention. Over half om the articles (n = 60) used VR to describe the intervention, and three articles used the descriptor “VR” only as a keyword, with no further mention in the paper. Of the 60 articles, 49 (82%) used non-immersive equipment, 10 (17%) used high-immersive equipment (HMD or CAVE systems) and one article did not describe the equipment used in detail, but just referred to “VR-technology” (Cacau et al., 2013), which made classification impossible.
4. Discussion
Virtual rehabilitation continues to evolve as an independent field of study (Keshner et al., 2019). However, despite spanning over two decades, the effectiveness of VR-systems continues to elude, whether specifically made for rehabilitation purposes or adapting recreational non-specific games (Maier et al., 2019). Furthermore, the use of the technology for older adults in non-neurological disorders is still scarce. Even more surprisingly, VR remains a "buzzword" used to describe interventions that do not use IVR-equipment. Both the Oculus Rift CV1 and the HTC Vive were released commercially in early 2016, and the Oculus Rift DK1 was available as early as 2012. Yet, even though VR-systems are now of higher quality and lower prices than previously, IVR-systems appear to be still under-represented in virtual rehabilitation (Figure 8). We argue that an increasing public awareness of what could constitute a VR-system, paired with a general lack of research consensus on how it should be specifically interpreted and understood, poses a potential health-risk. An example is how the assessment of adverse events are generally under-prioritized in RTCs (Bonell et al., 2015). Our findings affirmed this, as we found adverse events to be generally poorly reported. Although the reporting of no events may be due to a lack of occurrence, it may also be due to only serious events being considered and negligible effects. An example could be how a slight dizziness could easily go unreported. Meanwhile, there is a complexity to adverse effects evaluation, as negligible symptoms may be ignored for (or by) some patient populations, while the same symptoms could be considered severe for (or by) others.
 
  Figure 8. Frequency plot of the articles (n = 60), published between 1999 and 2020, which were excluded in the eligibility assessment, who used "Virtual Reality" to refer to the intervention. The graph shows interventions post-classified as immersive or non-immersive, by the authors, according to the taxonomy of virtual rehabilitation systems (see Figure 9).
Nevertheless, measuring nausea or other VR-related side-effects using standardized tools, is seldom an independent outcome prioritized in randomized trials. However, users experiencing VR-sickness, remains an unsolved challenge which is more frequently observed in IVR-systems (Sharples et al., 2008; Kim et al., 2014; Dennison et al., 2016; Chang et al., 2020). Therefore, if clinical trials are included in syntheses, without accounting for the degree of system immersion, prevalent adverse events may go unnoticed. This has potential harmful human consequences, as national- or international health authorities base their clinical guidelines on these RCT-studies, reviews and meta-analyses, that may not differentiate systems or adverse effects correctly.
The subgroup-analysis between IVR and NVR revealed that the dropout rate for IVR-studies were higher than for NVR-studies. While both tended to have a higher retention for the control group, the dropout rate for IVR experimental groups were significant (p = 0.02) while the NVR experimental groups was not (p = 0.10). Adverse events were often not properly addressed, except for two studies (Laver et al., 2012; Gianola et al., 2020), who both included a detailed description and discussion. Due to poor reporting we cannot infer causality between dropouts and adverse events. However, description from Gianola et al. (2020) does highlight that dropouts might also be connected to the participants feeling “uncomfortable” wearing the HMD, or lacking face-to-face contact with the therapist. A recent study exploring the acceptance of HMDs among older adults, concluded that attitude changed to positive after experiencing the technology with minimal symptoms. However, there are some caveats related to the authors' conclusion, that negative attitudes or VR-sickness is negligible. Firstly, the results relate to healthy older adults, thus not synonymous and possibly not applicable to more vulnerable users. Secondly, the VR-application used in the experiment (Perfect by nDreams) has the lowest rating on the Oculus comfort spectrum (nDreams, 2016), which implies that related symptoms will be very low. VR content has a significant impact on the amount of symptoms experienced (Saredakis et al., 2020), and symptoms should therefore be evaluated across different content characteristics, before validating a generalized use.
At least one article describes preliminary steps to delimit adverse events (Sobral Monteiro-Junior et al., 2015), but it would be beneficial if adverse events, related to IVR-systems, are measured more consistently with standardized instruments (e.g., the SSQ) in future studies. This would allow to gain a more systematic understanding of the potential challenges with VR as a therapeutic tool across different patient populations, age-groups, and systems.
4.1. Summary of Main Findings
The studies included in this review varied widely across the intervention type and dosage, outcome measures participant characteristics and setting. Participants in the included studies ranged from hospital inpatients, to residential aged care, to people living in the community. This range of settings and focus on different conditions or diagnoses, suggests that participants may be different at baseline, making it difficult to compare. Additionally, all analyses had high heterogeneity, demonstrating large variation across the included studies. While motivation, engagement and adherence are commonly cited as benefits of the use of VR in the therapy setting, only one study evaluated this outcome (Oesch et al., 2017). This seems paradoxical, since motivation is often a central principle in the reasoning for using the technology in the first place.
4.1.1. Taxonomy of Virtual Rehabilitation Systems
Although the intention to classify VR-systems based on level of immersion was pre-specified in the protocol, a Taxonomy of Virtual Rehabilitation Systems was developed a posteriori to the findings in this review, expand upon the different types of VR-systems, both in terms of immersion [non-immersive (NVR) vs. immersive (IVR)] and specificity [specific (S) vs. non-specific (NS)] (see Figure 9). The latter is describing systems developed exclusively for rehabilitation purposes (specific), as opposed to recreational and/or off-the-shelf video games, which have simply been applied to rehabilitation interventions (non-specific) (Maier et al., 2019). Furthermore, to account for implications for practical applications and availability of the systems, the taxonomy sub-classifies each type of system. Specifically, this depends on whether or not the systems are commercially (C) available as a "closed system," or have been developed as bespoke (B) technology, which presumably makes it less accessible as an off-the-shelf product.
Non-immersive VR - Non-specific (NVR-NS)
This sub-category most notably entails commercial NVR-NS(C) systems, such as the Nintendo Wii, with studies that are more easily reproducible, due to the consistency and availability of the systems and software. Likewise, the studies are frequently larger, and span a wide spectrum of patient populations. The caveat is that the systems are not developed for the target population, i.e., people with disabilities. Therefore, studies will encounter users who are not able to operate the system, which may introduce frustration and lack of motivation. Bespoke NVR-NS(B) systems within this sub-category will likely be underrepresented. We have not identified any studies using NVR-NS(B) systems.
Non-immersive VR - Specific (NVR-S)
Acknowledging the issues with NS systems, many studies have also utilized specifically designed systems, to tackle some of these problems. Issues with commercial NVR-S(C) systems include that they are often expensive purchases, or requiring renewable licenses. Bespoke NVR-S(B) systems are also frequently represented in the literature, however, are often designed specifically for the study and often not publicly available. Functionalities are sometimes described in great detail, but we argue, mostly not sufficiently, to reproduce and replicate findings.
Immersive VR - Non-specific (IVR-NS)
Similar to what the Nintendo Wii achieved in 2006, VR-headsets are now an affordable and commercial off-the-shelf solution. We therefore anticipate an increase of studies evaluating IVR-NS(C) applications within rehabilitation contexts in the near future. For example, we identified one recent publication with preliminary results (Erhardsson et al., 2020) using the IVR-NS(C) application Beat Saber (2018). Other potential IVR-NS(C) applications currently available, could include Job Simulator (2016) or OhShape (2019). The primary challenge, similar to NVR-NS(C) systems, is how such systems are developed for users with normal function and abilities. Most likely, there will be no specific settings constructed to allow inclusivity toward “extreme users.” Bespoke IVR-NS(B) systems for rehabilitation, while unlikely, could in practicality exist.
Immersive VR - Specific (IVR-S)
Commercial IVR-S(C) systems have been available since at least 2010 (Medicaa's Balance Rehabilitation Unit™(BRU), but as with NVR-S(C) systems, IVR-S(C) systems are often expensive and are likely to require renewable license. More of these will appear, as companies with an already established brand in NVR-S(C) systems, apply "immersive modules" to their existing hardware. We expect them to acknowledge the increasing demand for such systems, for example Khymeia VRRS®. Bespoke IVR-S(B) systems will also likely start to appear more frequently, both clinically and within research, which will likely create a more balanced representation between NVR and IVR systems. However, we argue that a problem with IVR-S(B), similar to NVR-S(B), is how bespoke systems are rarely commercially available, but developed and maintained in closed research ecosystems. This makes research reproducibility very challenging.
Looking at the applicability of this taxonomy, we see how a bulk of research included in this review, is the adaptation of the NVR-NS(C) classified Nintendo Wii. While disheartening from an IVR review-based needs-perspective, the advantages of the Nintendo Wii's (C) classification are clear as they include availability, technological reliability, and production value. This infers that working within the (C) classifications, can provide preconditions for studies, resulting in fundamental advantages. These include how studies can prepare quickly, do not face technological inconsistencies, and can be easily and globally reproduced. Whether “NS” is ultimately a serious disadvantage, depends on how well the contextual rehabilitation needs, converge with the demands and effects of the non-specific solution. Noticeably remaining in this case example, is the role of the NVR nature of the Wii.
While IVR is a technology with high potential benefits - typically amplified from the sensation of presence), it also entails an increase in risks such as falls or injuries (e.g., from not being aware of ones surroundings, while wearing the headset), to nausea, ocular disturbances, and disorientation, which may be negligible or severe depending on the individual participants. While (B) applications may be aimed to fit contextual needs more precisely, they may also lack the refinement and additional benefits of some (C) grade products.
IVR-NS(C) products are currently undergoing rapidly increasing development, both in terms of quantity and quality. With products such as Beat Saber and Half-Life Alyx breaking records for IVR software sales, IVR-NS(C) titles are gradually demonstrating potential for usage, across entertainment- and clinical settings. These represent a point for IVR, where their success is likely gaining more from effectively utilizing the defining features of IVR to their advantage, than they are losing from any adverse effects. Researchers and practitioners should definitely consider any apprehension, on utilizing (C) products as their vehicle to explore the viability of IVR-based rehabilitation.
This does require researchers to find proper interventions for the IVR-NS(C) applications, and to design their studies around those spaces, where their applications are therapeutically and methodologically useful. Meanwhile, as this was possible for the Nintendo Wii, it should be considered within a range of possibility for current or future IVR applications. We are still to see the IVR-NS(C) application, which achieves weight and role within IVR-based rehabilitation, as the Wii did in the past for NVR-NS(C) based rehabilitation.
Meanwhile, developing this taxonomic classification for VR systems is a starting part of this. If a non-discreet distinction between the non-immersive and immersive exists (on a continuum), the current state of RCT-descriptions of technologies pinpoint a demarcation problem of immersion. The taxonomy proposed in this review, is a layer to this. Acknowledging the placement of an intervention is important, especially based on the findings of this review, to make initial judgement on the research field it should be place.
Despite the taxonomy proposed in this review, however, more detailed classification methods remain needed to further distinguish IVR-based interventions. For example, from the usage of FOV and FOR. Currently, information about interventions are seldom sufficient enough, to use those measurements as variables.
4.2. Overall Completeness and Applicability of Evidence
Limited detail about the intervention was provided in the included studies, which limits the ability to replicate the research. This is especially true of specific bespoke systems. When not commercially available, and when details about hardware, software and interactions are not described in detail, bespoke systems become exceedingly difficult to include in cross-study evaluations or comparisons. Future research endeavors should carefully consider and attend to this inclusion.
4.3. Potential Biases in the Review Process
This systematic review verifies and supports previous suggestions in narrative reviews, where the term VR has been used inconsistently, when describing interventions. Furthermore, given that a majority of the articles are published after 2016, which correlates with the availability of high-immersive commercial and affordable VR-equipment, this review supports the need for development and evaluation of more high-quality interventions. Partly to better understand the effectiveness and adverse events of IVR-equipment in motor rehabilitation of older adults, but also in other domains where better evidence exists, such as stroke therapy (Laver et al., 2017).
Many factors contribute to the sense of the immersion (see section 1.2), thus, the dichotomous classification applied in this review is quite reductionistic. Although it can be argued that immersion exists on a continuum, the extent of interacting elements that nurtures it, curtails clear demarcations between the different features. Furthermore, classifying virtual rehabilitation systems a posteriori on a continuum, would require detailed technical descriptions (e.g., FOV, FOR, and frame-rate), which RCT-studies do not traditionally supply.
4.4. Limitations of This Review
To our knowledge, this is the first systematic review attempting to evaluate differences in treatment effects, by differences in the properties of the system, though subgroup analyses. However, the authors acknowledge that there are limitations to this approach. Firstly, the scope of the review spanned a variety of different outcomes, and potentially heterogeneous populations, as long as it was non-neurological rehabilitation. This raises the question about whether the data from independent studies can be validly pooled. One of the criteria for pooling data in meta-analyses, is that treatment effects are investigated for the same fundamental impairments, using similar or identical systems and comparators. While this review does include a very specific population (i.e., older adults), it differs in the purpose of the interventions, as well as the potential functional capacity of the included participants. Likewise, the comparator offered in the control groups differed from being no activity at all, to leaflet and the same exact intervention minus the digital augmentation. Therefore, the substantial heterogeneity observed, for example in TUG (I2 = 68%), can be due to differences across participants, study design and outcomes, rather than sampling errors. Furthermore, the small number of included studies describing non-neurological IVR-interventions for older adults (60+), the insufficient reported reasons for dropping out, as well as a generally poor description of adverse events, do pose severe limitations. As 56% of studies did not report adverse events, however, we cannot assume there were no adverse events, simply because none were reported. Moreover, since IVR and NVR is usually pooled, safety, and feasibility of the technology may be inflated.
4.5. Future Directions
As VR-systems improve (e.g., wider FOV, higher pixel density, frame-rate, and resolution), the adverse symptoms experienced by many users, will likely be mitigated. However, other challenges may also be relevant to consider, when implementing IVR in rehabilitation programs. Technological innovations will need to be continuously monitored and deemed appropriate for clinical use, as new barriers may arise when new interfaces are inevitably added, as new design standards. For example, mass-market brain-computer interfaces are likely to become embedded in wearable computing devices, within a foreseeable future. Although it definitely will be a game-changer for patient monitorization during therapy, it is not unlikely that such interfaces can be considered in violation with personal data protection regulations, when placed in off-the-shelf commercial products. For researchers seeking to implement clinical VR, it may be valuable to theorize on the potential harms of the technology, and evaluate it continuously during the process. One approach to evaluating the potential harmful consequences, could be through the development of “dark logic models” (Bonell et al., 2015).
In this review we have proposed a taxonomy expanding the previous distinction between specific and non-specific VR (Maier et al., 2019) to include the distinction between immersive and non-immersive VR, as well as differentiation between commercial and bespoke systems. Admittedly, the field of Virtual Rehabilitation has so far used VR as an umbrella-term. However, to avoid confusing consumers, researchers, and healthcare professionals alike, who are leading the change, the recent commercialization of VR should re-establish discussions - and reach a taxonomic consensus on whether (or not) the term of “VR” should be reserved exclusively for IVR-systems, as a subcategory of Virtual Rehabilitation.
Finally, the authors encourage that similar methods are taken, to distinguish between NVR and IVR interventions in more focused reviews, to better understand the differences in treatment effects and related adverse events. Potentially, this task can be undertaken through umbrella reviews, to synthesize results from systematic reviews, while accounting for a posteriori classifications.
5. Conclusion
The majority of studies included in this review evaluated the use of non-specific, commercially available NVR systems. Three of the 15 studies included in this review evaluated IVR interventions. Two of these studies met the criteria for meta-analysis. Six studies included in the meta-analysis indicated a significant treatment effect of NVR on TUG scores and BBS scores compared to the control intervention. No significant difference in 6MWT scores were found in the meta-analysis of the two studies using NVR interventions. Pain scores were significantly different for the two IVR interventions compared to control for patients, following total knee arthroplasty. Yet, no significant difference was found in pain scores between the NVR interventions and control, for people with chronic back pain or balance disorders.
We initialize a call-for-action, to distinguish between types of VR-technology, and propose a taxonomy of virtual rehabilitation systems, based on our findings. Most interventions uses NVR systems, which has demonstrably lower VR-sickness than IVR-systems. Therefore, RCT adverse events may be under-reported. An increased demand for IVR-systems highlight this challenge. Care should be taken when applying the results of existing NVR tools to new IVR technologies. NVR could improve functional outcomes, and should not be underestimated, simply by to the contemporary existence of IVR. Future studies should provide more detail about their interventions, and future reviews should differentiate between NVR and IVR.
5.1. Implications for Practice
The heterogeneity in VR intervention, participant type, study setting and outcome measures across the included studies, along with small sample sizes, provide limited ability to draw strong conclusions to support the use of VR in practice. Stakeholders and clinicians should be careful when applying the results of existing NVR interventions to new IVR technologies. While both NVR and IVR can effectively improve functional outcomes, IVR generally causes more adverse events, such as VR-sickness which can lead to higher dropout-rates, or even worse pose health-risks if patients are not properly monitored.
5.2. Implications for Research
Future studies should provide more detail about the equipment used in the interventions, and also better monitor, measure and report system-specific side effects through standardized tools. Future reviews should differentiate between NVR and IVR.
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author/s.
Author Contributions
EH, JB-P, TP, and SF conceived and formulated the research questions for the study. EH drafted the a priori protocol, analyzed the data, and drafted the manuscript. SF and SH performed the literature search, identified and removed duplicate records, and prepared files for Covidence. EH and TP performed title and abstract screening as well as full-text screening, and resolved conflicts between raters. EH, TP, JB-P, KH, and NN performed independent data extraction and Risk of Bias assessment. EH, BL, and JB-P interpreted results of the data synthesis. EH, BL, JB-P, NN, SS, TP, SF, and CK edited and contributed to the manuscript. All authors reviewed and accepted the final version.
Funding
This systematic review was funded as a joined effort between Aalborg University and VihTek Research and Test Center for Health Technologies. The systematic review was written as part of a Ph.D. study undertaken by Emil Rosenlund Høeg, funded by the municipality of Frederiksberg.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
Many thanks to Professor Hanne Tønnesen and colleagues from WHO Collaborating Centre for Evidence-based Health promotion in Hospitals and Health Services, Frederiksberg for letting me attend the already full Ph.D.-Course on Systematic Review Techniques, 2018. Without that course, this review would not have happened.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/frvir.2021.647993/full#supplementary-material
References
Amorim, J. S. C. D., Leite, R. C., Brizola, R., and Yonamine, C. Y. (2018). Virtual reality therapy for rehabilitation of balance in the elderly: a systematic review and meta-analysis. Adv. Rheumatol. 58:18. doi: 10.1186/s42358-018-0013-0
Anderson, E., and Durstine, J. L. (2019). Physical activity, exercise, and chronic diseases: a brief review. Sports Med. Health Sci. 1, 3–10. doi: 10.1016/j.smhs.2019.08.006
Anson, E., Ma, L., Meetam, T., Thompson, E., Rathore, R., Dean, V., et al. (2018). Trunk motion visual feedback during walking improves dynamic balance in older adults: assessor blinded randomized controlled trial. Gait Posture 62, 342–348. doi: 10.1016/j.gaitpost.2018.03.044
Bonell, C., Jamal, F., Melendez-Torres, G., and Cummins, S. (2015). dark logic: theorising the harmful consequences of public health interventions. J. Epidemiol. Commun. Health 69, 95–98. doi: 10.1136/jech-2014-204671
Bourbeau, J., and Bartlett, S. J. (2008). Patient adherence in copd. Thorax 63, 831–838. doi: 10.1136/thx.2007.086041
Burdea, G. (2003). Virtual rehabilitation-benefits and challenges. Yearb. Med. Inform. 12, 170–176. doi: 10.1055/s-0038-1638156
Cacau, L. D. A. P., Oliveira, G. U., Maynard, L. G., Araújo Filho, A. A. D., Silva Junior, W. M. D., Cerqueria Neto, M. L., et al. (2013). The use of the virtual reality as intervention tool in the postoperative of cardiac surgery. Braz. J. Cardiovasc. Surg. 28, 281–289. doi: 10.5935/1678-9741.20130039
Chang, E., Kim, H. T., and Yoo, B. (2020). Virtual reality sickness: a review of causes and measurements. Int. J. Hum. Comput. Interact. 36, 1658–1682. doi: 10.1080/10447318.2020.1778351
Christensen, K., Doblhammer, G., Rau, R., and Vaupel, J. W. (2009). Ageing populations: the challenges ahead. Lancet 374, 1196–1208. doi: 10.1016/S0140-6736(09)61460-4
Dahdah, M. N., Bennett, M., Prajapati, P., Parsons, T. D., Sullivan, E., and Driver, S. (2017). Application of virtual environments in a multi-disciplinary day neurorehabilitation program to improve executive functioning using the Stroop task. Neurorehabilitation 41, 721–734. doi: 10.3233/NRE-172183
de Bruin, E. D., Schoene, D., Pichierri, G., and Smith, S. T. (2010). Use of virtual reality technique for the training of motor control in the elderly. Z. Gerontol. Geriatr. 43, 229–234. doi: 10.1007/s00391-010-0124-7
De la Rosa, A., Olaso-Gonzalez, G., Arc-Chagnaud, C., Millan, F., Salvador-Pascual, A., Garc-a-Lucerga, C., et al. (2020). Physical exercise in the prevention and treatment of alzheimer's disease. J. Sport Health Sci. 9, 394–404. doi: 10.1016/j.jshs.2020.01.004
Dennison, M. S., Wisti, A. Z., and D'Zmura, M. (2016). Use of physiological signals to predict cybersickness. Displays 44, 42–52. doi: 10.1016/j.displa.2016.07.002
Deutsch, J. E., Borbely, M., Filler, J., Huhn, K., and Guarrera-Bowlby, P. (2008). Use of a low-cost, commercially available gaming console (wii) for rehabilitation of an adolescent with cerebral palsy. Phys. Ther. 88, 1196–1207. doi: 10.2522/ptj.20080062
Difede, J., Cukor, J., Jayasinghe, N., Patt, I., Jedel, S., Spielman, L., et al. (2007). Virtual reality exposure therapy for the treatment of posttraumatic stress disorder following september 11, 2001. J. Clin. Psychiatry 68:1639. doi: 10.4088/JCP.v68n1102
Donath, L., Rössler, R., and Faude, O. (2016). Effects of virtual reality training (exergaming) compared to alternative exercise training and passive control on standing balance and functional mobility in healthy community-dwelling seniors: a meta-analytical review. Sports Med. 46, 1293–1309. doi: 10.1007/s40279-016-0485-1
Duncan, L. R., Hall, C. R., Wilson, P. M., and Jenny, O. (2010). Exercise motivation: a cross-sectional analysis examining its relationships with frequency, intensity, and duration of exercise. Int. J. Behav. Nutr. Phys. Act. 7, 1–9. doi: 10.1186/1479-5868-7-7
Duque, G., Boersma, D., Loza-Diaz, G., Hassan, S., Suarez, H., Geisinger, D., et al. (2013). Effects of balance training using a virtual-reality system in older fallers. Clin. Interv. Aging 8:257. doi: 10.2147/CIA.S41453
Erhardsson, M., Murphy, M. A., and Sunnerhagen, K. S. (2020). Commercial head-mounted display virtual reality for upper extremity rehabilitation in chronic stroke: a single-case design study. J. Neuroeng. Rehabil. 17, 1–14. doi: 10.1186/s12984-020-00788-x
Fontana, L., and Hu, F. B. (2014). Optimal body weight for health and longevity: bridging basic, clinical, and population research. Aging Cell 13, 391–400. doi: 10.1111/acel.12207
Fu, A. S., Gao, K. L., Tung, A. K., Tsang, W. W., and Kwan, M. M. (2015). Effectiveness of exergaming training in reducing risk and incidence of falls in frail older adults with a history of falls. Arch. Phys. Med. Rehabil. 96, 2096–2102. doi: 10.1016/j.apmr.2015.08.427
Garcia-Betances, R. I., Jiménez-Mixco, V., Arredondo, M. T., and Cabrera-Umpiérrez, M. F. (2015). Using virtual reality for cognitive training of the elderly. Am. J. Alzheimers Dis. Other Dement® 30, 49–54. doi: 10.1177/1533317514545866
Gianola, S., Stucovitz, E., Castellini, G., Mascali, M., Vanni, F., Tramacere, I., et al. (2020). Effects of early virtual reality-based rehabilitation in patients with total knee arthroplasty: a randomized controlled trial. Medicine 99:e19136. doi: 10.1097/MD.0000000000019136
Gold, J. I., Kant, A. J., Kim, S. H., et al. (2005). “Virtual anesthesia: the use of virtual reality for pain distraction during acute medical interventions,” in Seminars in Anesthesia, Perioperative Medicine and Pain, Vol. 24, eds P. D. Lumb and D. F. McIntee (Elsevier), 203–210.
Higgins, J. P., and Thompson, S. G. (2002). Quantifying heterogeneity in a meta-analysis. Stat. Med. 21, 1539–1558. doi: 10.1002/sim.1186
Hoffman, H. G., Patterson, D. R., Seibel, E., Soltani, M., Jewett-Leahy, L., and Sharar, S. R. (2008). Virtual reality pain control during burn wound debridement in the hydrotank. Clin. J. Pain 24, 299–304. doi: 10.1097/AJP.0b013e318164d2cc
Iosa, M., Morone, G., Fusco, A., Bragoni, M., Coiro, P., Multari, M., et al. (2012). Seven capital devices for the future of stroke rehabilitation. Stroke Res. Treat. 2012:187965. doi: 10.1155/2012/187965
Jin, C., Feng, Y., Ni, Y., and Shan, Z. (2018). Virtual reality intervention in postoperative rehabilitation after total knee arthroplasty: a prospective and randomized controlled clinical trial. Int. J. Clin. Exp. Med. 11, 6119–6124.
Karamians, R., Proffitt, R., Kline, D., and Gauthier, L. V. (2020). Effectiveness of virtual reality-and gaming-based interventions for upper extremity rehabilitation poststroke: a meta-analysis. Arch. Phys. Med. Rehabil. 101, 885–896. doi: 10.1016/j.apmr.2019.10.195
Karanicolas, P. J., Farrokhyar, F., and Bhandari, M. (2010). Blinding: who, what, when, why, how? Can. J. Surg. 53:345.
Kennedy, B. K., Berger, S. L., Brunet, A., Campisi, J., Cuervo, A. M., Epel, E. S., et al. (2014). Geroscience: linking aging to chronic disease. Cell 159, 709–713. doi: 10.1016/j.cell.2014.10.039
Kennedy, R. S., Lane, N. E., Berbaum, K. S., and Lilienthal, M. G. (1993). Simulator sickness questionnaire: an enhanced method for quantifying simulator sickness. Int. J. Aviat. Psychol. 3, 203–220. doi: 10.1207/s15327108ijap0303_3
Keshner, E. A., Weiss, P. T., Geifman, D., and Raban, D. (2019). Tracking the evolution of virtual reality applications to rehabilitation as a field of study. J. Neuroeng. Rehabil. 16:76. doi: 10.1186/s12984-019-0552-6
Khymeia Group (). Virtual Reality Rehabilitation System (VRRS)®. Available online at: http://khymeia.com/en/products/vrrs/ (accessed November 14, 2020).
Kilteni, K., Maselli, A., Kording, K. P., and Slater, M. (2015). Over my fake body: body ownership illusions for studying the multisensory basis of own-body perception. Front. Hum. Neurosci. 9:141. doi: 10.3389/fnhum.2015.00141
Kim, K., Rosenthal, M. Z., Zielinski, D. J., and Brady, R. (2014). Effects of virtual environment platforms on emotional responses. Comput. Methods Programs Biomed. 113, 882–893. doi: 10.1016/j.cmpb.2013.12.024
Kim, N. G., Yoo, C. K., and Im, J. J. (1999). A new rehabilitation training system for postural balance control using virtual reality technology. IEEE Trans. Rehabil. Eng. 7, 482–485. doi: 10.1109/86.808952
Kothgassner, O. D., Goreis, A., Kafka, J. X., Van Eickels, R. L., Plener, P. L., and Felnhofer, A. (2019). Virtual reality exposure therapy for posttraumatic stress disorder (ptsd): a meta-analysis. Eur. J. Psychotraumatol. 10:1654782. doi: 10.1080/20008198.2019.1654782
Kwok, B. C., and Pua, Y. H. (2016). Effects of wiiactive exercises on fear of falling and functional outcomes in community-dwelling older adults: a randomised control trial. Age Ageing 45, 621–627. doi: 10.1093/ageing/afw108
Laver, K., George, S., Ratcliffe, J., Quinn, S., Whitehead, C., Davies, O., et al. (2012). Use of an interactive video gaming program compared with conventional physiotherapy for hospitalised older adults: a feasibility trial. Disabil. Rehabil. 34, 1802–1808. doi: 10.3109/09638288.2012.662570
Laver, K. E., Lange, B., George, S., Deutsch, J. E., Saposnik, G., and Crotty, M. (2017). Virtual reality for stroke rehabilitation. Cochrane Database Syst. Rev. 11:CD008349. doi: 10.1002/14651858.CD008349.pub4
LaViola, J. J. (2000). A discussion of cybersickness in virtual environments. ACM SIGCHI Bull. 32, 47–56. doi: 10.1145/333329.333344
Lee, S., and Shin, S. (2013). Effectiveness of virtual reality using video gaming technology in elderly adults with diabetes mellitus. Diabet. Technol. Ther. 15, 489–496. doi: 10.1089/dia.2013.0050
Li, A., Montaño, Z., Chen, V. J., and Gold, J. I. (2011). Virtual reality and pain management: current trends and future directions. Pain Manage. 1, 147–157. doi: 10.2217/pmt.10.15
Liberati, A., Altman, D. G., Tetzlaff, J., Mulrow, C., Gøtzsche, P. C., Ioannidis, J. P., et al. (2009). The prisma statement for reporting systematic reviews and meta-analyses of studies that evaluate health care interventions: explanation and elaboration. J. Clin. Epidemiol. 62, e1–e34. doi: 10.1016/j.jclinepi.2009.06.006
Maclean, N., Pound, P., Wolfe, C., and Rudd, A. (2000). Qualitative analysis of stroke patients' motivation for rehabilitation. BMJ 321, 1051–1054. doi: 10.1136/bmj.321.7268.1051
Maier, M., Rubio Ballester, B., Duff, A., Duarte Oller, E., and Verschure, P. F. (2019). Effect of specific over nonspecific vr-based rehabilitation on poststroke motor recovery: a systematic meta-analysis. Neurorehabil. Neural Repair 33, 112–129. doi: 10.1177/1545968318820169
Maples-Keller, J. L., Yasinski, C., Manjin, N., and Rothbaum, B. O. (2017). Virtual reality-enhanced extinction of phobias and post-traumatic stress. Neurotherapeutics 14, 554–563. doi: 10.1007/s13311-017-0534-y
Matamala-Gomez, M., Donegan, T., Bottiroli, S., Sandrini, G., Sanchez-Vives, M. V., and Tassorelli, C. (2019). Immersive virtual reality and virtual embodiment for pain relief. Front. Hum. Neurosci. 13:279. doi: 10.3389/fnhum.2019.00279
Miller, K. J., Adair, B. S., Pearce, A. J., Said, C. M., Ozanne, E., and Morris, M. M. (2014). Effectiveness and feasibility of virtual reality and gaming system use at home by older adults for enabling physical activity to improve health-related domains: a systematic review. Age Ageing 43, 188–195. doi: 10.1093/ageing/aft194
Molina, K. I., Ricci, N. A., de Moraes, S. A., and Perracini, M. R. (2014). Virtual reality using games for improving physical functioning in older adults: a systematic review. J. Neuroeng. Rehabil. 11:156. doi: 10.1186/1743-0003-11-156
Morone, G., Paolucci, T., Luziatelli, S., Iosa, M., Piermattei, C., Zangrando, F., et al. (2016). Wii fit is effective in women with bone loss condition associated with balance disorders: a randomized controlled trial. Aging Clin. Exp. Res. 28, 1187–1193. doi: 10.1007/s40520-016-0578-6
Mugueta-Aguinaga, I., and Garcia-Zapirain, B. (2017). Fred: exergame to prevent dependence and functional deterioration associated with ageing. a pilot three-week randomized controlled clinical trial. Int. J. Environ. Res. Public Health 14:1439. doi: 10.3390/ijerph14121439
nDreams (2016). Perfect. Available online at: https://www.oculus.com/experiences/rift/387422848047994/ (accessed December 2, 2020).
Neri, S. G., Cardoso, J. R., Cruz, L., Lima, R. M., De Oliveira, R. J., Iversen, M. D., et al. (2017). Do virtual reality games improve mobility skills and balance measurements in community-dwelling older adults? systematic review and meta-analysis. Clin. Rehabil. 31, 1292–1304. doi: 10.1177/0269215517694677
Nilsson, N. C., Nordahl, R., and Serafin, S. (2016). Immersion revisited: a review of existing definitions of immersion and their relation to different theories of presence. Hum. Technol. 12, 108–134. doi: 10.17011/ht/urn.201611174652
Oesch, P., Kool, J., Fernandez-Luque, L., Brox, E., Evertsen, G., Civit, A., et al. (2017). Exergames versus self-regulated exercises with instruction leaflets to improve adherence during geriatric rehabilitation: a randomized controlled trial. BMC Geriatr. 17:77. doi: 10.1186/s12877-017-0467-7
Parsons, T. D., and Rizzo, A. A. (2008). Affective outcomes of virtual reality exposure therapy for anxiety and specific phobias: a meta-analysis. J. Behav. Ther. Exp. Psychiatry 39, 250–261. doi: 10.1016/j.jbtep.2007.07.007
Porras, D. C., Siemonsma, P., Inzelberg, R., Zeilig, G., and Plotnik, M. (2018). Advantages of virtual reality in the rehabilitation of balance and gait: systematic review. Neurology 90, 1017–1025. doi: 10.1212/WNL.0000000000005603
Pourmand, A., Davis, S., Marchak, A., Whiteside, T., and Sikka, N. (2018). Virtual reality as a clinical tool for pain management. Curr. Pain Headache Rep. 22, 1–6. doi: 10.1007/s11916-018-0708-2
Powers, M. B., and Emmelkamp, P. M. (2008). Virtual reality exposure therapy for anxiety disorders: a meta-analysis. J. Anxiety Disord. 22, 561–569. doi: 10.1016/j.janxdis.2007.04.006
Rebenitsch, L., and Owen, C. (2016). Review on cybersickness in applications and visual displays. Virtual Real. 20, 101–125. doi: 10.1007/s10055-016-0285-9
Reis, E., Postolache, G., Teixeira, L., Arriaga, P., Lima, M. L., and Postolache, O. (2019). Exergames for motor rehabilitation in older adults: an umbrella review. Phys. Ther. Rev. 24, 84–99. doi: 10.1080/10833196.2019.1639012
Rizzo, A. A., Schultheis, M., Kerns, K. A., and Mateer, C. (2004). Analysis of assets for virtual reality applications in neuropsychology. Neuropsychol. Rehabil. 14, 207–239. doi: 10.1080/09602010343000183
Rizzo, A. S., and Koenig, S. T. (2017). Is clinical virtual reality ready for primetime? Neuropsychology 31, 877–899. doi: 10.1037/neu0000405
Rizzo, A. S., Lange, B., Suma, E. A., and Bolas, M. (2011). Virtual reality and interactive digital game technology: new tools to address obesity and diabetes. J. Diabetes Sci. Technol. 5, 256–264. doi: 10.1177/193229681100500209
Rose, F. D., Brooks, B. M., and Rizzo, A. A. (2005). Virtual reality in brain damage rehabilitation. Cyberpsychol. Behav. 8, 241–262. doi: 10.1089/cpb.2005.8.241
Rothbaum, B. O., Anderson, P., Zimand, E., Hodges, L., Lang, D., and Wilson, J. (2006). Virtual reality exposure therapy and standard (in vivo) exposure therapy in the treatment of fear of flying. Behav. Ther. 37, 80–90. doi: 10.1016/j.beth.2005.04.004
Rothbaum, B. O., Hodges, L. F., Ready, D., and Alarcon, R. D. (2001). Virtual reality exposure therapy for vietnam veterans with posttraumatic stress disorder. J. Clin. Psychiatry 62, 617–622. doi: 10.4088/JCP.v62n0808
Salinas, G. D., Williamson, J. C., Kalhan, R., Thomashow, B., Scheckermann, J. L., Walsh, J., et al. (2011). Barriers to adherence to chronic obstructive pulmonary disease guidelines by primary care physicians. Int. J. Chronic Obstruct. Pulmon. Dis. 6, 171–179. doi: 10.2147/COPD.S16396
Saredakis, D., Szpak, A., Birckhead, B., Keage, H. A., Rizzo, A., and Loetscher, T. (2020). Factors associated with virtual reality sickness in head-mounted displays: a systematic review and meta-analysis. Front. Hum. Neurosci. 14:96. doi: 10.3389/fnhum.2020.00096
Schwenk, M., Grewal, G. S., Holloway, D., Muchna, A., Garland, L., and Najafi, B. (2016). Interactive sensor-based balance training in older cancer patients with chemotherapy-induced peripheral neuropathy: a randomized controlled trial. Gerontology 62, 553–563. doi: 10.1159/000442253
Sharples, S., Cobb, S., Moody, A., and Wilson, J. R. (2008). Virtual reality induced symptoms and effects (vrise): comparison of head mounted display (hmd), desktop and projection display systems. Displays 29, 58–69. doi: 10.1016/j.displa.2007.09.005
Slater, M. (2009). Place illusion and plausibility can lead to realistic behaviour in immersive virtual environments. Philos. Trans. R. Soc. B Biol. Sci. 364, 3549–3557. doi: 10.1098/rstb.2009.0138
Slater, M., and Sanchez-Vives, M. V. (2016). Enhancing our lives with immersive virtual reality. Front. Robot. AI 3:74. doi: 10.3389/frobt.2016.00074
Sobral Monteiro-Junior, R., Pereira de Souza, C., Lattari, E., Rocha, B. F., Mura, G., Machado, S., et al. (2015). Wii-workouts on chronic pain, physical capabilities and mood of older women: a randomized controlled double blind trial. CNS Neurol. Disord. Drug Targets 14, 1157–1164. doi: 10.2174/1871527315666151111120131
Stewart, J. C., Yeh, S.-C., Jung, Y., Yoon, H., Whitford, M., Chen, S.-Y., et al. (2007). Intervention to enhance skilled arm and hand movements after stroke: a feasibility study using a new virtual reality system. J. Neuroeng. Rehabil. 4, 1–6. doi: 10.1186/1743-0003-4-21
Teixeira, P. J., Carraça, E. V., Markland, D., Silva, M. N., and Ryan, R. M. (2012). Exercise, physical activity, and self-determination theory: a systematic review. Int. J. Behav. Nutr. Phys. Act. 9:1. doi: 10.1186/1479-5868-9-78
The Nordic Cochrane Centre The Cochrane Collaboration. (2020). Review Manager (RevMan) [Computer program]. Version 5.4.
Tieri, G., Morone, G., Paolucci, S., and Iosa, M. (2018). Virtual reality in cognitive and motor rehabilitation: facts, fiction and fallacies. Expert Rev. Med. Devices 15, 107–117. doi: 10.1080/17434440.2018.1425613
Tsang, W. W., and Fu, A. S. (2016). Virtual reality exercise to improve balance control in older adults at risk of falling. Hong Kong Med. J. 22(Suppl. 2), S19–S22.
United Nations Department of Economic Social Affairs Population Division. (2019). World Population Prospects 2019: Highlights (st/esa/ser.a/423). Available online at: https://population.un.org/wpp/Publications/Files/WPP2019_Highlights.pdf (accessed October 10, 2020).
Wittkopf, P. G., Lloyd, D. M., Coe, O., Yacoobali, S., and Billington, J. (2020). The effect of interactive virtual reality on pain perception: a systematic review of clinical studies. Disabil. Rehabil. 42, 3722–3733. doi: 10.1080/09638288.2019.1610803
Yeşilyaprak, S. S., Yıldırım, M. Ş., Tomruk, M., Ertekin, Ö., and Algun, Z. C. (2016). Comparison of the effects of virtual reality-based balance exercises and conventional exercises on balance and fall risk in older adults living in nursing homes in turkey. Physiother. Theor. Pract. 32, 191–201. doi: 10.3109/09593985.2015.1138009
Keywords: virtual reality, rehabilitation, immersive displays, older adults, balance, functional mobility, pain, systematic review
Citation: Høeg ER, Povlsen TM, Bruun-Pedersen JR, Lange B, Nilsson NC, Haugaard KB, Faber SM, Hansen SW, Kimby CK and Serafin S (2021) System Immersion in Virtual Reality-Based Rehabilitation of Motor Function in Older Adults: A Systematic Review and Meta-Analysis. Front. Virtual Real. 2:647993. doi: 10.3389/frvir.2021.647993
Received: 30 December 2020; Accepted: 12 March 2021;
 Published: 12 April 2021.
Edited by:
Georgina Cardenas-Lopez, National Autonomous University of Mexico, MexicoReviewed by:
Daniel Simon Harvie, Griffith University, AustraliaBenjamin J. Li, Nanyang Technological University, Singapore
Copyright © 2021 Høeg, Povlsen, Bruun-Pedersen, Lange, Nilsson, Haugaard, Faber, Hansen, Kimby and Serafin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Emil Rosenlund Høeg, ZXJoQGNyZWF0ZS5hYXUuZGs=
 Tina Myung Povlsen2
Tina Myung Povlsen2 
   
  