The Effectiveness of Post-exercise Stretching in Short-Term and Delayed Recovery of Strength, Range of Motion and Delayed Onset Muscle Soreness: A Systematic Review and Meta-Analysis of Randomized Controlled Trials

Afonso, José; Clemente, Filipe Manuel; Nakamura, Fábio Yuzo; Morouço, Pedro; Sarmento, Hugo; Inman, Richard A.; Ramirez-Campillo, Rodrigo

doi:10.3389/fphys.2021.677581

SYSTEMATIC REVIEW article

Front. Physiol., 05 May 2021

Sec. Exercise Physiology

Volume 12 - 2021 | https://doi.org/10.3389/fphys.2021.677581

This article is part of the Research Topic: Musculoskeletal Adaptations to Training and Sports Performance: Connecting Theory and Practice

José Afonso¹

Filipe Manuel Clemente²,³

Fábio Yuzo Nakamura⁴,⁵

Pedro Morouço⁶

Hugo Sarmento⁷

Richard A. Inman⁸

Rodrigo Ramirez-Campillo⁹,¹⁰*

¹Centre for Research, Education, Innovation and Intervention in Sport, Faculty of Sport of the University of Porto, Porto, Portugal
²Escola Superior Desporto e Lazer, Instituto Politécnico de Viana do Castelo, Rua Escola Industrial e Comercial de Nun'Álvares, Viana do Castelo, Portugal
³Instituto de Telecomunicações, Delegação da Covilhã, Covilhã, Portugal
⁴Research Center in Sports Sciences, Health Sciences and Human Development (CIDESD), University Institute of Maia (ISMAI), Maia, Portugal
⁵Associate Graduate Program in Physical Education Universidade de Pernambuco (UPE)/Universidade Federal da Paraíba (UFPB), João Pessoa, Brazil
⁶Superior School of Education and Social Sciences, Polytechnic of Leiria, Leiria, Portugal
⁷Research Unit for Sport and Physical Activity (CIDAF), Faculty of Sport Sciences and Physical Education, University of Coimbra, Coimbra, Portugal
⁸The Psychology for Positive Development Research Center (CIPD), Universidade Lusíada, Porto, Portugal
⁹Human Performance Laboratory, Department of Physical Activity Sciences, Universidad de Los Lagos, Osorno, Chile
¹⁰Centro de Investigación en Fisiología del Ejercicio, Facultad de Ciencias, Universidad Mayor, Santiago, Chile

Background: Post-exercise (i.e., cool-down) stretching is commonly prescribed for improving recovery of strength and range of motion (ROM) and diminishing delayed onset muscle soreness (DOMS) after physical exertion. However, the question remains whether post-exercise stretching is better for recovery than other post-exercise modalities.

Objective: To provide a systematic review and meta-analysis of supervised randomized-controlled trials (RCTs) on the effects of post-exercise stretching on short-term (≤1 h after exercise) and delayed (e.g., ≥24 h) recovery markers (i.e., DOMS, strength, ROM) in comparison with passive recovery or alternative recovery methods (e.g., low-intensity cycling).

Methods: This systematic review followed PRISMA guidelines (PROSPERO CRD42020222091). RCTs published in any language or date were eligible, according to P.I.C.O.S. criteria. Searches were performed in eight databases. Risk of bias was assessed using Cochrane RoB 2. Meta-analyses used the inverse variance random-effects model. GRADE was used to assess the methodological quality of the studies.

Results: From 17,050 records retrieved, 11 RCTs were included for qualitative analyses and 10 for meta-analysis (n = 229 participants; 17–38 years, mostly males). The exercise protocols varied between studies (e.g., cycling, strength training). Post-exercise stretching included static stretching, passive stretching, and proprioceptive neuromuscular facilitation. Passive recovery (i.e., rest) was used as comparator in eight studies, with additional recovery protocols including low-intensity cycling or running, massage, and cold-water immersion. Risk of bias was high in ~70% of the studies. Between-group comparisons showed no effect of post-exercise stretching on strength recovery (ES = −0.08; 95% CI = −0.54–0.39; p = 0.750; I² = 0.0%; Egger's test p = 0.531) when compared to passive recovery. In addition, no effect of post-exercise stretching on 24, 48, or 72-h post-exercise DOMS was noted when compared to passive recovery (ES = −0.09 to −0.24; 95% CI = −0.70–0.28; p = 0.187–0.629; I² = 0.0%; Egger's test p = 0.165–0.880).

Conclusion: There was insufficient statistical evidence to reject the null hypothesis that stretching and passive recovery have equivalent influence on recovery. Data are scarce and heterogeneous, and confidence in cumulative evidence is very low. Future research should address the limitations highlighted in our review, to allow for more informed recommendations. For now, evidence-based recommendations on whether post-exercise stretching should be applied for the purposes of recovery should be avoided, as the (insufficient) data that are available do not support related claims.

Systematic Review Registration: PROSPERO, identifier: CRD42020222091.

Introduction

Exercise sessions typically begin with a warm-up period, followed by the main workout, and end with a cool-down phase, including a progressive reduction of effort and intensity (ACSM, 2018). Stretching is prescribed as an essential component of the cool-down phase by the guidelines of ACSM (2018) and the American Heart Association (2020). The main goals of stretching exercises applied during the cool-down phase (i.e., post-exercise stretching) are to enhance range of motion (ROM) and to reduce stiffness and delayed onset muscle soreness (DOMS) (Sands et al., 2013). There are different post-exercise stretching methods, such as passive static, active static, dynamic, proprioceptive neuromuscular facilitation (PNF), among others (Lima et al., 2019). Despite its wide adoption in exercise protocols, its effectiveness is not well-understood (Van Hooren and Peake, 2018).

Past research has produced mixed and often contradictory results, with numerous studies indicating that post-exercise stretching is not effective for improving recovery. In one study with 10 healthy men (Mika et al., 2007), the participants performed three sets of leg extension and flexion at 50% of maximum voluntary contraction (MVC). The post-exercise recovery protocols included light-intensity cycle ergometer exercise and PNF stretching for 5 min. Light-intensity cycle ergometer exercise (10 W at 60 rpm) induced greater short-term recovery (i.e., immediately after the post-exercise protocol) than stretching as measured by MVC, total effort time, motor unit activation and EMG frequency (p < 0.05). In another study (Robey et al., 2009), club (8 men, 6 women; age: 20.2 ± 2.2 years) and elite level rowers (4 men, 2 women; age: 18.6 ± 0.8 years) performed a strenuous stair-climb running protocol. Post-exercise recovery protocols were applied at 15 min, 24 h, and 48 h, including stretching, hot/cold water immersion and passive recovery (i.e., rest). Compared to passive recovery, stretching and hot/cold water immersion induced no recovery effect on leg extension concentric peak torque, 2 km rowing ergometer times, creatine kinase levels, or DOMS, at any time-point. Further, nine physically active men (age, 23 ± 1 years) performed a fatiguing exercise protocol (i.e., 8 min of cycle ergometer exercise at 90% of maximum oxygen uptake), followed by a post-exercise stretching protocol (i.e., 10 min) (Cè et al., 2013). One hour after the stretching protocol, mechanical and physiological assessments (e.g., MVC, EMG amplitude, and lactate kinetics) were similar between the stretching group and the passive recovery group.

Moreover, stretching may be ineffective in relieving perceived muscle pain or in reducing DOMS (Wessel and Wan, 1994; Cheung et al., 2003; Xie et al., 2018). Also, recovery may not simply mean a return to basal values. In other words, to be effective, post-exercise stretching should restore participants' function and improve it beyond the basal condition (Sands et al., 2013; Van Hooren and Peake, 2018).

Furthermore, potential short-term positive effects of post-exercise stretching on recovery should be balanced with long-term adaptations. For example, Fuchs et al. (2020) recently demonstrated that post-exercise cooling (i.e., cold-water immersion) accelerated acute recovery after training sessions; however, it impaired myofibrillar protein synthesis rates after 2 weeks of training compared to not performing cold-water immersion. In this sense, to comprehensively assess the effectiveness of post-exercise stretching, both short-term and delayed recovery should probably be considered.

To bring clarity to conflicting results, systematic reviews and meta-analyses (SRMA) are usually performed as a cornerstone for evidence-based practices (Higgins et al., 2019). Indeed, studies in the field tend to use small samples with reduced statistical power (Abt et al., 2020). In contrast, SRMA provide greater statistical power. In fact, some attempts have been made to synthesize current literature related to post-exercise stretching and recovery. An SRMA of randomized and quasi-randomized studies showed that stretching before or after exercise did not protect from DOMS (Herbert and Gabriel, 2002), and two independent updates reinforced the same conclusions (Henschke and Lin, 2011; Herbert et al., 2011). However, relevant databases such as PubMed and Web of Science were not included in the searches of the aforementioned SRMAs, and potentially relevant search terms such as “mobility” and “post-exercise” or “post-training” were not applied. Likewise, external experts were not consulted after automated searches, as suggested in high-standard protocols (Moher et al., 2009, 2015; Shea et al., 2017). Moreover, nearly a decade has passed since the publication of the aforementioned SRMAs, and a cursory search of articles in Google Scholar from 2011 to the present date suggests that several new studies have been conducted on the topic. An updated SRMA focused solely on post-exercise stretching and limited to randomized controlled trials (RCTs) may provide a more homogeneous and high-quality data set (Hariton and Locascio, 2018), while an expanded set of relevant databases and search terms may provide a more representative sample of existing studies.

Therefore, our goal was to review supervised RCTs on the effects of post-exercise stretching on recovery markers (i.e., DOMS, strength, ROM), in comparison with passive recovery or alternative recovery methods (e.g., low-intensity cycling). Short-term (≤1 h after exercise) and delayed recovery (24, 48, and 72 h) markers were considered.

Methods

Protocol and Registration

This systematic review followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines (Moher et al., 2009, 2015), the Cochrane Collaboration guidelines for evaluation of risk of bias (RoB) in randomized studies (Sterne et al., 2019), and the AMSTAR 2 recommendations (Shea et al., 2017). Quality of studies was assessed using the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) (Guyatt et al., 2011). The review methods were established before initiating the research, and protocol registration preceded the search. Protocol was published in PROSPERO with the reference CRD42020222091.

Eligibility Criteria

Studies were eligible if they consisted of original research or replication studies published in peer-reviewed journals, with full text not limited to any particular language or publication date. Beyond English, the authors also have a deep understanding of Portuguese and Spanish, as well as a good understanding of French and Italian. If studies were written in other languages, professional translators were hired. Based on scope, P.I.C.O.S. and timeframe for follow-up, Table 1 presents the inclusion and exclusion criteria. The limitation to RCTs was decided because randomization reduces the RoB and balances participants' distribution between groups (Hariton and Locascio, 2018). Indeed, RCTs are the gold standard for evidence-based practices (Spieth et al., 2016). Supervision was considered present if explicit information was available stating that at least one qualified professional oriented the post-exercise protocol. No studies were excluded on the basis of RoB as assessed through RoB 2 (Sterne et al., 2019).

TABLE 1

Table 1. Inclusion and exclusion criteria based on scope, PICOS and timeframe for follow-up.

Information Sources

The search was scheduled to start on January 1st, 2021, but since protocol approval occurred earlier (December 2nd, 2020), we conducted the automated searches on December 23 and 24, 2020, with search results being exported to EndNote X9 for Mac (v.9.3.3, Clarivate Analytics). The following electronic databases were searched: Cochrane Library (including CENTRAL), EBSCO (all available databases), PEDro, PubMed, Scielo, Scopus, SPORTDiscus (all databases), and Web of Science (all databases/collections). The search protocol used Boolean operators and required that the title, abstract, or keywords include (“stretch*” OR “flex*” OR “mobility” OR “range of motion”) AND (“post-exerci*” OR “post-workout” OR “post-exertion” OR “post-train*” OR “after exerci*” OR “after workout” OR “after exertion” OR “after training” OR “recover*” OR “warm-down” OR “cool-down”) AND “random*”. Similar terms or synonyms were used to guarantee a more inclusive initial search and avoid an excessively narrow scope of analyzed studies. Searches were updated on February 16, 2021, for inclusion of records with date of entry from December 25, 2020, onwards. Where date of entry was not a feature (e.g., EBSCO, Scielo, Scopus, SPORTDiscus, Web of Science), publication date was limited to 2021, since the year 2020 would have been practically all covered by the time the initial search was completed.
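As an illustration of how the three term groups combine, the following minimal Python sketch assembles the Boolean string described above, with `*` as the truncation wildcard. The function names are hypothetical, and the sketch does not add the field tags (title, abstract, keywords) that each database expresses in its own syntax; it is not the authors' actual search script.

```python
# Term groups copied from the search description; '*' is the truncation wildcard.
STRETCH_TERMS = ["stretch*", "flex*", "mobility", "range of motion"]
TIMING_TERMS = [
    "post-exerci*", "post-workout", "post-exertion", "post-train*",
    "after exerci*", "after workout", "after exertion", "after training",
    "recover*", "warm-down", "cool-down",
]
DESIGN_TERMS = ["random*"]


def or_group(terms):
    """Quote each term and join the group with OR inside parentheses."""
    return "(" + " OR ".join(f'"{term}"' for term in terms) + ")"


def build_query(*groups):
    """Join the OR-groups with AND (hypothetical helper, no field tags added)."""
    return " AND ".join(or_group(group) for group in groups)


query = build_query(STRETCH_TERMS, TIMING_TERMS, DESIGN_TERMS)
print(query)
```

Keeping the term groups as plain lists makes it straightforward to adapt the same query to the syntax of each database without retyping the terms.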

A manual search was conducted within the reference list of the records included in the sample after full text analysis, to retrieve potentially relevant studies that had not emerged in the initial search. After completion of this stage, the list of studies, as well as inclusion and exclusion criteria were sent to eight independent experts in the field, to check if they were aware of additional papers. The experts were university professors with a Ph.D. and with peer-reviewed publications within the scope of our SRMA. Search strategy was not provided, to avoid biasing the experts' search. After the final list of studies was completed, all the databases were again consulted to retrieve errata, corrigenda/corrections, or retractions of the included studies, as some may have been found to be fraudulent or retracted (Higgins et al., 2019).

Study Selection

The screening process started on January 4, 2021 for the first wave of searches. The screening process for the updated searches started on February 17, 2021. JA and FMC independently conducted the initial search, the screening of titles and abstracts, and the analysis of full texts. HS and PM later reviewed the entire process. Next, a step-by-step comparison of the whole process was conducted, and any disagreements motivated a new analysis of the records in question. The suitability of manuscripts was discussed among all the authors involved in the study selection process until consensus was achieved. The same process was then used to analyze the reference lists of the included studies to verify if additional relevant studies were available. External experts were contacted to provide additional suggestions of relevant studies based on inclusion criteria and on our preliminary list. JA and FMC independently verified the list to decide on inclusion of the suggested studies. HS and PM then reviewed this process. The same process was applied to search for errata of the included studies.

Data Extraction

All extracted data were defined a priori, to avoid biased analyses (Spieth et al., 2016). Study characteristics: (i) sample size and features (e.g., age, sex, health, training status, country, continent; single or multicenter study); (ii) length and characteristics of the interventions and comparators (e.g., weekly frequency, type/modality of stretching and comparators, volume, intensity, duration, supervision ratio, qualification of supervisors, description of co-interventions); (iii) adherence rates to training (i.e., attendance percentage); (iv) funding sources and potential conflicts of interest. Data specific to cross-over studies (Elbourne et al., 2002; Spieth et al., 2016): (i) length of wash-in and wash-out periods; (ii) carryover effects, if there were any.

Primary outcomes for short-term recovery (≤1-h post-intervention): strength levels (e.g., maximum voluntary contraction) and joint ROM immediately or until 1 h after exertion. Primary outcomes for delayed recovery: DOMS, strength levels, and joint ROM at 24, 48, and 72 h, which are considered theoretically relevant (Van Hooren and Peake, 2018) and are commonly assessed periods on studies investigating this subject matter (Bonfim et al., 2010; Torres et al., 2013).

Secondary outcomes: Biochemical markers (e.g., plasma creatine kinase; blood lactate concentration); muscle and tendon stiffness; adverse effects during the post-exercise interventions (type, intensity or severity, time points). The timings described in the previous paragraph were considered for secondary outcomes as well.

Outcomes were only considered for analysis if there was no additional exercise bout between the initial session and the delayed recovery timeframe. For all primary and secondary outcomes, description of measurement tools and metrics was included (Higgins et al., 2019) and both significant and non-significant results were considered (Spieth et al., 2016). Furthermore, parallel and cross-over trials were combined as long as the latter did not have significant carryover effects (Elbourne et al., 2002). JA and FMC completed initial data extraction independently. HS and PM later reviewed the entire process and consensus had to be achieved. The data required for meta-analysis were compiled by JA and FMC and then reviewed by HS and PM. RRC provided a final verification of the quality of data inserted into the table.

Risk of Bias in Individual Studies

Bias refers to systematic errors that can threaten the internal validity of an RCT (Spieth et al., 2016). RoB was assessed using the revised Cochrane risk-of-bias tool for randomized trials (RoB 2) (Sterne et al., 2019), which consists of five dimensions, i.e., bias arising due to: (i) the randomization process; (ii) deviations from intended interventions; (iii) missing outcome data; (iv) measurement of the outcome; and (v) selection of the reported result. JA and FMC independently assessed RoB for all studies. After the first assessment, tables were compared and disagreements were discussed, with a subsequent re-analysis of the situation. Finally, HS and PM reviewed the assessments to ensure the quality of the evaluations. For assessing RoB in parallel trials, the Excel tool ROB2_IRPG_beta_v7 (Cochrane) was used. For crossover trials, the Excel tool ROB2.0_IRCX_beta (MRC | Hubs for Trials Methodology Research) was planned to be used. However, this tool is outdated. Following the most up-to-date Cochrane guidelines for applying RoB 2 to individual cross-over trials (Higgins et al., 2020), the five domains can be assessed following the structure of parallel trials. However, an extra dimension (Domain S) is added. Therefore, we used the ROB2_IRPG_beta_v7, with manual addition of Domain S.

Summary Measures

It is possible to use two studies in a meta-analysis (Valentine et al., 2010), but we chose to establish a minimum of three studies (Moran et al., 2018; García-Hermoso et al., 2019; Skrede et al., 2019) to avoid small sample sizes (Abt et al., 2020; Lohse et al., 2020). Pre- and post-intervention means and standard deviations (SDs) were converted to Hedges' g effect size (ES) (García-Hermoso et al., 2019; Skrede et al., 2019). When a study instead provided 95% confidence intervals (CIs) or standard errors of the mean (SEM), means and SDs were obtained from the 95% CI or SEM, using Cochrane's RevMan Calculator for Microsoft Excel (Drahota and Beller, 2020). When data for primary outcomes were presented only in graphical form, validated software (r = 0.99, p < 0.001), WebPlotDigitizer, version 4.4 (Rohatgi, 2020), was used to extract data, with all values rounded to two decimal places. In these cases, the main author extracted data from the graphs, and an outside researcher, not involved in this work (see section Acknowledgments), performed an independent data extraction. Reliability was calculated through Cronbach's Alpha, using SPSS Statistics version 27 for Mac (IBM).
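To make these conversions concrete, the following Python sketch shows one common way of recovering an SD from an SEM or a 95% CI and of computing a bias-corrected standardized mean difference between two independent groups. It is an illustration under stated assumptions, not the RevMan Calculator or the authors' exact procedure: the CI conversion uses the large-sample normal divisor 3.92 (for small samples a t-based divisor is generally preferable), the function names are hypothetical, and the numbers in the example are invented.

```python
import math


def sd_from_sem(sem, n):
    """Recover the SD from a standard error of the mean: SD = SEM * sqrt(n)."""
    return sem * math.sqrt(n)


def sd_from_ci95(lower, upper, n):
    """Recover the SD from a 95% CI of a mean (normal approximation, 2 * 1.96)."""
    return math.sqrt(n) * (upper - lower) / 3.92


def hedges_g(m1, sd1, n1, m2, sd2, n2):
    """Return (g, variance of g) for two independent groups."""
    df = n1 + n2 - 2
    pooled_sd = math.sqrt(((n1 - 1) * sd1**2 + (n2 - 1) * sd2**2) / df)
    d = (m1 - m2) / pooled_sd                   # Cohen's d
    j = 1 - 3 / (4 * df - 1)                    # small-sample correction factor
    var_d = (n1 + n2) / (n1 * n2) + d**2 / (2 * (n1 + n2))
    return j * d, j**2 * var_d


# Invented example values, for illustration only.
g, var_g = hedges_g(m1=5.0, sd1=2.0, n1=10, m2=4.0, sd2=2.0, n2=10)
print(round(g, 3), round(var_g, 3))
```

The variance returned alongside g is what an inverse-variance meta-analysis would use to weight each study.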

The inverse variance random-effects model for meta-analyses was used because it allocates a proportionate weight to trials based on the size of their individual standard errors (Deeks et al., 2008) and enables analysis while accounting for heterogeneity across studies (Kontopantelis et al., 2013). The ESs were presented alongside 95% CIs and interpreted using the following thresholds (Hopkins et al., 2009): <0.2, trivial; 0.2–0.6, small; >0.6–1.2, moderate; >1.2–2.0, large; >2.0–4.0, very large; >4.0, extremely large. Heterogeneity was assessed using the I² statistic, with values of <25, 25–75, and >75% considered to represent low, moderate, and high levels of heterogeneity, respectively (Higgins and Thompson, 2002). Publication bias was explored using the extended Egger's test (Egger et al., 1997). To adjust for publication bias, a sensitivity analysis was conducted using the trim and fill (Duval and Tweedie, 2000), with L0 as the default estimator for the number of missing studies (Shi and Lin, 2019). Analyses were performed in the Comprehensive Meta-Analysis program (version 2; Biostat, Englewood, NJ, USA). Statistical significance was set at p ≤ 0.05.
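The pooling and interpretation steps described above can be sketched in Python as follows. This is a minimal illustration, not the Comprehensive Meta-Analysis implementation: it assumes the DerSimonian-Laird estimator for the between-study variance (the text does not name the estimator), uses a normal-approximation 95% CI, and all function names and the example inputs are hypothetical.

```python
import math


def random_effects(effects, variances):
    """Inverse-variance random-effects pooling (DerSimonian-Laird, assumed)."""
    if len(effects) != len(variances) or len(effects) < 2:
        raise ValueError("need matching lists with at least two studies")
    w = [1 / v for v in variances]                       # fixed-effect weights
    fixed = sum(wi * y for wi, y in zip(w, effects)) / sum(w)
    q = sum(wi * (y - fixed) ** 2 for wi, y in zip(w, effects))
    df = len(effects) - 1
    c = sum(w) - sum(wi**2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - df) / c)                        # between-study variance
    w_star = [1 / (v + tau2) for v in variances]         # random-effects weights
    pooled = sum(wi * y for wi, y in zip(w_star, effects)) / sum(w_star)
    se = math.sqrt(1 / sum(w_star))
    i2 = max(0.0, (q - df) / q) * 100 if q > 0 else 0.0  # I-squared, in percent
    return {"es": pooled, "ci": (pooled - 1.96 * se, pooled + 1.96 * se),
            "tau2": tau2, "i2": i2}


def interpret_es(es):
    """Label an effect size with the thresholds quoted from Hopkins et al. (2009)."""
    size = abs(es)
    for limit, label in [(0.6, "small"), (1.2, "moderate"),
                         (2.0, "large"), (4.0, "very large")]:
        if size < 0.2:
            return "trivial"
        if size <= limit:
            return label
    return "extremely large"


def interpret_i2(i2):
    """Label heterogeneity: <25% low, 25-75% moderate, >75% high."""
    return "low" if i2 < 25 else "moderate" if i2 <= 75 else "high"


# Invented study-level inputs, for illustration only.
result = random_effects([0.10, -0.20, 0.05], [0.20, 0.20, 0.20])
print(result, interpret_es(result["es"]), interpret_i2(result["i2"]))
```

Under these thresholds, a pooled estimate such as the ES of −0.08 reported in the abstract for strength recovery would be labeled trivial.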

Moderator Analyses

These analyses were planned but could not be performed. Details on planned moderator analysis can be found in the Supplementary Materials.

Confidence in Cumulative Evidence

For RCTs, GRADE starts assuming high quality, which can be downgraded according to five dimensions (Zhang et al., 2019). In addition to RoB, inconsistency (heterogeneity) and publication bias, which have already been addressed, indirectness and imprecision (using 95% CIs) were assessed independently by JA and FMC and verified by HS. These authors also estimated the overall quality and confidence in cumulative evidence.

Results

Study Selection

The initial search retrieved 16,851 results [Cochrane Library: 13 reviews and 621 trials; EBSCO: 1,704; PEDro: 21; PubMed: 2,421; Scielo: 12; Scopus: 5,253; SPORTDiscus: 734; Web of Science (all collections): 6,072]. Automated removal (EndNote function) of 6,635 duplicates resulted in 10,216 records. Manual removal of an additional 2,333 duplicates resulted in 7,882 records to be screened. The first stage of screening titles and abstracts was based on study type (first inclusion criterion) and resulted in the exclusion of 2,101 records. The second stage of screening started with 5,781 records, from which 5,481 studies that were clearly out of scope (e.g., exercise-related studies not addressing the theme of our work, non-exercise related studies) were removed. Finally, starting with 300 records, the third stage of screening applied the PICOS criteria, and further excluded 278 studies. In this three-stage screening process, exclusion criteria were defined hierarchically, i.e., if a paper had several reasons for exclusion, its exclusion would be based on the first criterion it failed to fit. Finally, two records had untraceable full texts, with discontinued links, disappearance from the databases from which they were retrieved, and even not emerging in searches within the journals where they were supposedly published.

The updated searches retrieved 199 new records [Cochrane Library: 1 review and 8 trials; EBSCO: 49; PEDro: 0; PubMed: 53; Scielo: 3; Scopus: 25; SPORTDiscus: 7; Web of Science (all collections): 53]. Removal of duplicates resulted in 121 records, of which 14 were excluded for not fitting the study type, 60 for being unrelated to exercise, 40 for being related to exercise but out of scope, and six for not complying with the PICOS criteria. More in-depth information concerning the screening can be found in Supplementary Table 1. Therefore, 21 records were considered eligible for analysis of the full text (20 in the initial searches and one in the updated searches). While most were written in English, one was in Portuguese (Bonfim et al., 2010), one in Greek (Kokkinidis et al., 1998), and three in Korean (yes et al., 2010; Oh, 2013; Kang and Park, 2018). A translator was hired for the Korean studies, and another for the Greek study.
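The record counts reported in this section can be retraced with simple arithmetic. The Python sketch below does so using only the figures stated in the text; the variable names are hypothetical. One step does not reconcile exactly: 10,216 minus 2,333 gives 7,883, one more than the 7,882 records reported as screened. The sketch carries the reported figure forward and records the difference rather than guessing its cause.

```python
# Database hits as reported for the initial and the updated searches.
initial_hits = {
    "Cochrane reviews": 13, "Cochrane trials": 621, "EBSCO": 1704,
    "PEDro": 21, "PubMed": 2421, "Scielo": 12, "Scopus": 5253,
    "SPORTDiscus": 734, "Web of Science": 6072,
}
updated_hits = {
    "Cochrane reviews": 1, "Cochrane trials": 8, "EBSCO": 49,
    "PEDro": 0, "PubMed": 53, "Scielo": 3, "Scopus": 25,
    "SPORTDiscus": 7, "Web of Science": 53,
}

total_initial = sum(initial_hits.values())
after_auto_dedup = total_initial - 6635            # EndNote duplicate removal
computed_after_manual = after_auto_dedup - 2333    # manual duplicate removal
REPORTED_SCREENED = 7882                           # figure stated in the text
discrepancy = computed_after_manual - REPORTED_SCREENED

# Three screening stages, then two records with untraceable full texts.
remaining = REPORTED_SCREENED
for excluded in (2101, 5481, 278, 2):
    remaining -= excluded
eligible_initial = remaining

total_updated = sum(updated_hits.values())
eligible_updated = 121 - (14 + 60 + 40 + 6)        # 121 records after de-duplication
eligible_total = eligible_initial + eligible_updated

print(total_initial, total_updated, eligible_total, discrepancy)
```

The initial and updated totals together also match the 17,050 records mentioned in the abstract.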

At this stage, 12 records were excluded, with reasons. The study by Apostolopoulos et al. (2018) was excluded because the interventions were not supervised. However, it reported interesting results that we explore briefly here. Since the authors applied stretching for three consecutive days after the eccentric exercise protocol, only results at 24 h were considered. The authors used 90% CIs (and not the more common 95% CIs) to compare low-intensity and high-intensity stretching to a control group using passive rest. Despite the authors' claims, all 90% CIs passed through zero, and no differences were observed at 24 h between the stretching groups and the controls for DOMS, eccentric and isometric peak torques of knee extensors, creatine kinase (U/L), and high-sensitivity C-reactive protein. The study of Boobphachart et al. (2017) was excluded because the stretching intervention was performed three times per day and, furthermore, was unsupervised.
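The point about 90% CIs can be illustrated numerically: a 95% CI is wider than the corresponding 90% CI, so an interval that already includes zero at the 90% level necessarily includes zero at the 95% level. The Python sketch below rescales an interval under a normal approximation; the function names are hypothetical and the input interval is invented, not taken from Apostolopoulos et al. (2018).

```python
from statistics import NormalDist


def rescale_ci(lower, upper, from_level=0.90, to_level=0.95):
    """Convert a symmetric CI to another confidence level (normal approximation)."""
    z_from = NormalDist().inv_cdf(0.5 + from_level / 2)
    z_to = NormalDist().inv_cdf(0.5 + to_level / 2)
    estimate = (lower + upper) / 2
    se = (upper - lower) / (2 * z_from)     # standard error implied by the CI
    half_width = z_to * se
    return estimate - half_width, estimate + half_width


def includes_zero(lower, upper):
    """True when the interval contains zero, i.e., no difference is detected."""
    return lower <= 0 <= upper


# Invented 90% CI for a between-group difference, for illustration only.
ci90 = (-0.10, 0.90)
ci95 = rescale_ci(*ci90)
print(ci95, includes_zero(*ci90), includes_zero(*ci95))
```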

The study of Cha and Kim (2015) was excluded because both groups included some form of stretching, therefore preventing the comparison of stretching with alternative protocols and failing our PICOS criteria. The study of Duffield et al. (2014) was excluded because both the training interventions and the protocols were applied twice a day. Furthermore, one of the protocols included not only immediate measures (15-min cold-water immersion), but also ongoing measures such as 3 h of wearing full-body compression garments, plus abiding by sleep-hygiene recommendations that night. The study of Gulick et al. (1996) was excluded because randomization was compromised. The authors created seven groups with 10 participants each. When a participant quit, the authors simply recruited a new participant to the group, therefore compromising both randomization and baseline values for each group. In addition, no details were provided concerning how these new subjects changed the values for each variable.

The study of Kang and Park (2018) was excluded because the exercise intervention lasted 20 min, while the post-exercise stretching protocol consisted of 5 min of so-called preparation exercises, followed by 30 min of stretching, followed by 5 min of so-called clean-up exercises. Therefore, not only did post-exercise recovery last twice as long as the exercise intervention (40 vs. 20 min; thereby being akin to a stretching intervention per se and failing our inclusion criteria), but also the recovery intervention was not exclusively reliant on stretching (again, failing our inclusion criteria). The study of McGlynn et al. (1979) was excluded because stretching was applied immediately post-exercise, but also repeated at 6, 25, 30, 49, and 54 h post-exercise. Therefore, even the 24 h assessments could not be attributed to stretching performed immediately following an exercise bout. Incidentally, the authors reported that both the stretching and biofeedback groups showed a reduction in EMG activity of the biceps brachii in comparison with a passive control group, but neither intervention had an effect on perceived pain.

The study of Oh (2013) was excluded because the cool-down protocols were not stretching-based. The study of Pooley et al. (2020) was a cross-over study that was excluded because randomization was compromised: while after “home” fixtures the participants were randomized to cold-water immersion or cycle ergometer, in “away” fixtures stretching was always prescribed. The study of Robey et al. (2009) was excluded because the authors detail, in the manuscript, that the crossover was only semi-randomized; it therefore does not meet our inclusion criteria. In any case, the main characteristics and results from this study have been addressed in the introduction, which was written prior to our searches. The study of Xanthos et al. (2013) was excluded because the so-called traditional recovery group was multimodal. The study of yes et al. (2010) was excluded because the cool-down protocol was multimodal.

Therefore, nine studies fulfilled all inclusion criteria (Kokkinidis et al., 1998; Mika et al., 2007; Bonfim et al., 2010; Cè et al., 2013; Torres et al., 2013; McGrath et al., 2014; Muanjai and Namsawang, 2015; Cooke et al., 2018; César et al., 2021). As per protocol, in studies where the recovery methods were applied in multiple sessions (e.g., stretching after exercise and repeated at 24 and 48 h), only data before the second application were considered. To illustrate, in the studies of Cooke et al. (2018) and Kokkinidis et al. (1998), only the results at 24 h post-exercise were considered. Because a new recovery session was applied at 24 h, the results at 48 h and longer were not considered in the meta-analysis, as they might not be attributable to the immediate post-exercise stretching protocol. In addition, and following protocol, multimodal recovery groups also including stretching were excluded from analysis (e.g., the group combining stretching followed by cold water immersion in the study of Muanjai and Namsawang, 2015). In the study of Torres et al. (2013), two groups were considered: the group performing eccentric exercise, and the group performing eccentric exercise followed by a single bout of stretching. The group that only performed stretching and the group that performed eccentric exercise followed by repeated bouts of stretching in the following days were excluded as they did not conform to our inclusion criteria.

A manual search within the reference lists of included studies revealed 26 potentially fitting titles (including updated searches). Of these, two had already been included in our final sample, and five had been excluded during the process. Nineteen studies had not appeared in our searches; screening of their abstracts resulted in the exclusion of five based on study type (e.g., abstract, review), and 10 based on failure to fulfill PICOS criteria. Of the four studies that required full text analysis, two fulfilled all PICOS criteria and were therefore added to our sample (Torres et al., 2005; West et al., 2014). In relation to Torres et al. (2005), and following the rules applied to Torres et al. (2013), only the two groups meeting the criteria were considered for analysis. Subsequently, eight experts were invited to contribute with additional relevant studies. Two experts declined the invitation due to lack of time, while five experts did not respond. One expert responded that our list was thorough and did not make any additional recommendation. Finally, errata, corrigenda, corrections, and retractions were searched for the included studies, but none was found. Therefore, 11 studies were included for qualitative analysis (n = 289), of which 10 could integrate quantitative analysis (n = 280, n = 229 after exclusion of groups that did not fulfill PICOS criteria). The process is summarized in the PRISMA flow diagram (Figure 1).

FIGURE 1

Figure 1. Flowchart describing the study selection process.

Study Characteristics

Study characteristics are provided in Table 2. Three studies used a cross-over design (Mika et al., 2007; Cè et al., 2013; West et al., 2014), while the remaining used a parallel design. Sample size ranged from 9 (Cè et al., 2013) to 57 (McGrath et al., 2014), with ages ranging from 17 to 38 years, i.e., all studies were performed with adults or participants close to adulthood (i.e., the usual legal age of 18 years). The studies of Bonfim et al. (2010) and McGrath et al. (2014) had a mixed sample of men and women. The remaining studies only used male participants. All participants were healthy, but varied considerably in terms of training status: they were described as sedentary or untrained in four studies (Kokkinidis et al., 1998; Torres et al., 2005, 2013; Bonfim et al., 2010) and as “physically active,” “recreationally active,” or “not involved in intense physical conditioning” in five studies (Mika et al., 2007; Cè et al., 2013; McGrath et al., 2014; Muanjai and Namsawang, 2015; Cooke et al., 2018); one study assessed the effects in aerobically trained, recreational cyclists (West et al., 2014), and only one study assessed athletes (César et al., 2021). Geographically, five studies were performed in Europe (Kokkinidis et al., 1998; Torres et al., 2005, 2013; Mika et al., 2007; Cè et al., 2013), three in North America (McGrath et al., 2014; West et al., 2014; Cooke et al., 2018), two in South America (Bonfim et al., 2010; César et al., 2021) and one in Asia (Muanjai and Namsawang, 2015).

Table 2. Study characteristics.

The studies purposefully applied soreness-inducing exercise protocols for the upper limbs (César et al., 2021) or lower limbs (all other articles), using diverse means such as cycling (Cè et al., 2013; West et al., 2014), running-based activities (Cooke et al., 2018), plyometrics (Muanjai and Namsawang, 2015), simulated jiu-jitsu fights (César et al., 2021), and, more commonly, some form of strength training, usually with an emphasis on the eccentric component (Kokkinidis et al., 1998; Torres et al., 2005, 2013; Mika et al., 2007; Bonfim et al., 2010; McGrath et al., 2014). Familiarization with the soreness-inducing protocols was described in four studies (Mika et al., 2007; Cè et al., 2013; Cooke et al., 2018; César et al., 2021), stated but not described in one (Muanjai and Namsawang, 2015), and not performed or unreported in six (Kokkinidis et al., 1998; Torres et al., 2005, 2013; Bonfim et al., 2010; McGrath et al., 2014; West et al., 2014). In most studies, the duration of the soreness-inducing protocol was unclear (Kokkinidis et al., 1998; Torres et al., 2005, 2013; Mika et al., 2007; Bonfim et al., 2010; McGrath et al., 2014; West et al., 2014; Muanjai and Namsawang, 2015), but unlikely to have surpassed 30 min, considering the descriptions provided. In the remaining studies, soreness-inducing protocols lasted between 10 min (César et al., 2021) and 55 min (Cooke et al., 2018), including warm-up when applied. The only study to report a co-intervention stated that a nutritional bar was provided before the fatiguing exercise (West et al., 2014).

All studies had at least one group performing post-exercise stretching as an attempt to mitigate the negative effects of the soreness-inducing protocols. Active static stretching was used in four studies (Kokkinidis et al., 1998; Bonfim et al., 2010; West et al., 2014; Cooke et al., 2018), passive stretching in six (Torres et al., 2005, 2013; Cè et al., 2013; McGrath et al., 2014; Muanjai and Namsawang, 2015; César et al., 2021), and PNF in two (Mika et al., 2007; McGrath et al., 2014). McGrath et al. (2014) used both passive static stretching and PNF. No study used dynamic stretching. Almost all the post-exercise stretching protocols targeted the lower limbs, with one study targeting the upper limbs (César et al., 2021), and lasted between ~1 min (McGrath et al., 2014) and 30 min (West et al., 2014; Cooke et al., 2018). Intensity of stretching was measured using only subjective feelings during the exercise, ranging from “subjects perceiving a slight feeling of stretching (…), without generating discomfort” (Bonfim et al., 2010) to “until subjects felt a maximal stretch of the hamstrings” (McGrath et al., 2014) or “until the greatest discomfort was reported by the participants” (César et al., 2021).

The comparator post-exercise interventions also varied across studies, with some studies having more than one comparator group. Passive recovery (i.e., rest) was used as a comparator in eight studies (Kokkinidis et al., 1998; Torres et al., 2005, 2013; Mika et al., 2007; Bonfim et al., 2010; Cè et al., 2013; McGrath et al., 2014; César et al., 2021). Additional recovery protocols included low-intensity cycling (Mika et al., 2007; Cè et al., 2013; West et al., 2014) or running/jogging (West et al., 2014; Cooke et al., 2018), superficial and deep massage (Cè et al., 2013), and cryotherapy and/or cold-water immersion (Kokkinidis et al., 1998; Muanjai and Namsawang, 2015; César et al., 2021).

One study explicitly stated that there were no adverse effects to report (Muanjai and Namsawang, 2015), while the other studies made no mention of adverse effects. We further highlight that two studies had potentially relevant conflicts of interest, as the company manufacturing the anti-gravity treadmill provided funding for the research (West et al., 2014; Cooke et al., 2018).

Risk of Bias Within Studies

Cochrane's RoB 2 tool evaluates RoB in five different domains (Sterne et al., 2019), the second of which is subdivided into two parts. Here, an intention-to-treat analysis was considered. In terms of outcomes, RoB was only assessed for the primary outcomes (i.e., strength, ROM, and DOMS). None of the included studies had a pre-registered protocol. However, one made specific reference to a grant (Torres et al., 2013), and another to an approval number issued by an Ethics Committee (Bonfim et al., 2010). In both cases, a pre-study protocol had to exist, so we contacted the authors. The corresponding author of Bonfim et al. (2010) provided the trial protocol, which also contained a statistical analysis plan. The main author of Torres et al. (2013), who was the recipient of the grant, was contacted but unfortunately no longer had the original project, which is understandable given the time elapsed. Since some studies had more than one outcome, domains 4 and 5 could have multiple assessments for each study. The complete assessments (i.e., one assessment per outcome per study) can be found in Supplementary Table 2. Table 3 presents the worst-case scenario for each study, i.e., considering the outcome for which the risk of bias was higher.

Table 3. Risk of bias in individual studies (worst-case scenario).

These results can be visualized in Figure 2, which exhibits the percentage distribution of RoB for domains 1–5 and overall bias considering the worst assessment for each study. Overall RoB was high in 72.7% of the studies and presented some concerns in 27.3%. All studies presented problems with the randomization process: no description of how randomization was achieved and whether allocation sequence was properly concealed and, in 27.3% of the studies, baseline values suggested problems with the randomization process. Moreover, 72.7% of studies had high RoB in measurement of the outcome, mostly because testers were usually not blinded, and some outcomes were particularly prone to being influenced by knowledge of the intervention received.

Figure 2. Percentage distribution of risk of bias in individual studies (RoB 2).

There was low RoB arising from deviations from intended interventions and from missing outcome data in 90.9% of the papers. Finally, although 90.9% of papers presented some concerns for RoB arising from selection of the reported result, this resulted mostly from the lack of pre-registered protocols, and our impression upon reading the studies is that the authors provided honest and complete reporting. Of the cross-over studies, one had high RoB for carry-over effects (Cè et al., 2013) and, following protocol, was excluded from the meta-analysis. However, it was retained in the qualitative review.
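The percentages reported for the RoB assessments correspond to simple fractions of the 11 included studies. The sketch below assumes the underlying counts (8, 3, and 10 studies); these counts are inferred from the reported percentages and are not stated explicitly in the text.

```python
N_STUDIES = 11  # studies included in the qualitative analysis

# Assumed counts, inferred from the percentages reported in the text
assumed_counts = {
    "overall RoB: high": 8,
    "overall RoB: some concerns": 3,
    "measurement of the outcome: high": 8,
    "deviations / missing data: low": 10,
    "selection of reported result: some concerns": 10,
}

# Convert each count to a percentage of the 11 studies, rounded to one decimal
percentages = {
    label: round(100 * count / N_STUDIES, 1)
    for label, count in assumed_counts.items()
}

for label, pct in percentages.items():
    print(f"{label}: {pct}%")
```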

Results of Individual Studies

Primary outcomes were recorded in the form of means ± SDs, except for Cè et al. (2013), which used means ± SEM. This study, in particular, presented a graph from which we felt we could not extract reliable data. Given also that this study could not enter the meta-analytical calculations, we chose not to extract the data from the graph and present only the qualitative results provided by the authors. For values extracted from graphs (Mika et al., 2007; Bonfim et al., 2010; Muanjai and Namsawang, 2015; Cooke et al., 2018; César et al., 2021), Cronbach's alpha values were 0.991 (means) and 0.981 (SDs). The results of individual studies are compiled in Table 4.
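As an illustration of how agreement between independent graph extractions can be summarized with Cronbach's alpha, the following minimal sketch treats each extractor as an "item" and each extracted value as an observation. The number of extractors (two) and all data values are hypothetical; they are not the values extracted in this review.

```python
from statistics import variance


def cronbach_alpha(ratings):
    """Cronbach's alpha for k raters.

    ratings: list of k lists, each holding one rater's value for every data point.
    """
    k = len(ratings)
    # Sum of the sample variances of each rater's values
    item_variances = sum(variance(r) for r in ratings)
    # Sample variance of the per-data-point totals across raters
    totals = [sum(values) for values in zip(*ratings)]
    return k / (k - 1) * (1 - item_variances / variance(totals))


# Hypothetical means read off the same graph by two extractors
extractor_a = [10.2, 15.1, 20.3, 25.0, 30.4]
extractor_b = [10.0, 15.4, 20.1, 25.3, 30.1]

alpha = cronbach_alpha([extractor_a, extractor_b])
print(f"alpha = {alpha:.3f}")
```

Values close to 1 indicate that the extractors read nearly identical values from the graphs.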

Table 4. Results of individual studies.

Primary outcomes were any assessments related to strength, ROM and/or soreness, both short-term (i.e., ≤ 1 h post-recovery) and delayed (24, 48, and 72 h post-recovery). These outcomes were useful only if there were pre-exercise and post-recovery assessments. Short-term effects were reported for strength-related measures in six studies (Torres et al., 2005, 2013; Mika et al., 2007; Cè et al., 2013; Muanjai and Namsawang, 2015; César et al., 2021), ROM in two studies (McGrath et al., 2014; Muanjai and Namsawang, 2015), and DOMS in three studies (Torres et al., 2005, 2013; Muanjai and Namsawang, 2015). Three studies had no short-term assessments (Kokkinidis et al., 1998; Bonfim et al., 2010; West et al., 2014). One study mentioned having data at 15 and 30 min after recovery, but those data applied only to secondary outcomes (Cooke et al., 2018). With the exception of César et al. (2021), all strength-related assessments were performed for the lower limbs, and this also held for delayed assessments.

Delayed assessments were performed for strength-related variables in five studies (Torres et al., 2005, 2013; West et al., 2014; Muanjai and Namsawang, 2015; Cooke et al., 2018), ROM in one (Muanjai and Namsawang, 2015), and DOMS in seven (Kokkinidis et al., 1998; Torres et al., 2005, 2013; Bonfim et al., 2010; McGrath et al., 2014; Muanjai and Namsawang, 2015; Cooke et al., 2018). Three studies did not have delayed outcomes (Mika et al., 2007; Cè et al., 2013; César et al., 2021). Although Kokkinidis et al. (1998) assessed delayed effects on strength and ROM, they presented only means, without any measure of variation that could help to better interpret the results. As previously explained, if the delayed assessments were conducted after a new bout of the recovery protocol, they would be discarded, as the effects of the first bout could no longer be assessed. Of the studies including delayed assessments, four had data for the three timepoints defined in our protocol (i.e., 24, 48, and 72 h) (Torres et al., 2005, 2013; Bonfim et al., 2010; Muanjai and Namsawang, 2015), one study had data for 24 and 48 h post-recovery protocol (McGrath et al., 2014), and three had data for 24 h post-recovery only (Kokkinidis et al., 1998; West et al., 2014; Cooke et al., 2018).

Based on their data, some studies concluded that post-exercise stretching was not an effective recovery strategy, and was not superior to comparator interventions (West et al., 2014; Cooke et al., 2018), including passive recovery, i.e., rest (Bonfim et al., 2010; Cè et al., 2013; César et al., 2021). In the study of Kokkinidis et al. (1998), the authors stated that stretching and cryotherapy were superior to passive rest, but these effects were not observed at 24 h, only at 48 h; moreover, after 24 h, the experimental groups had an additional recovery bout applied, but without the soreness-inducing exercise. In the study of McGrath et al. (2014), PNF was not superior to passive recovery, and the static stretching group was the only one not showing significant decreases in DOMS at 24 or 48 h.

In the study of Mika et al. (2007), short-term strength levels recovered faster in the low-intensity cycling group than in the stretching or passive rest groups. In two studies (Torres et al., 2005, 2013), the authors stated that post-exercise stretching did not impair recovery in terms of strength and DOMS when compared to a passive rest group, but it did not improve recovery either. Finally, Muanjai and Namsawang (2015) concluded that both stretching and cold-water immersion could be used to improve post-exercise recovery. However, this conclusion is not supported by their data, as DOMS only returned to baseline at 96 h after the recovery protocol, strength levels and ROM after 48 h, and vertical jump was still not back to baseline even after 96 h. Moreover, without a passive recovery group for comparison, no statement can be made regarding acceleration of recovery.

Synthesis of Results

As stipulated in the protocol, cross-over trials would only be combined with parallel trials if there were no significant carry-over effects (Elbourne et al., 2002). This was not guaranteed in the study of Cè et al. (2013), which was therefore excluded from meta-analysis. Across the remaining 10 studies, as previously presented, there was considerable variation concerning the soreness-inducing protocols, the comparators to stretching, the outcome domains, the measurements within those outcome domains, and the timepoints of assessing the outcomes. Our protocol had stipulated three primary outcomes (strength, ROM, and DOMS) across four different timepoints (short-term, i.e., maximum 1 h after the recovery intervention; and 24, 48, and 72 h after the recovery intervention). After analyzing the outcomes and timepoints in each study, and also considering the comparator protocols, we found that only a few meta-analytical comparisons were feasible.

Short-Term Effects on Strength, Stretching vs. Passive Recovery (Rest)

Three studies had comparable data (i.e., strength measures of the knee extensors) to afford this meta-analysis (Torres et al., 2005, 2013; Mika et al., 2007). One study used PNF stretching (Mika et al., 2007) and the others used passive static stretching and compared this intervention to passive rest. Although the study of César et al. (2021) had strength assessments, they were for the upper limbs, more specifically grip strength, and so we decided not to compare it with the remaining studies. In RoB assessments considering this outcome, these studies had an overall classification of “some concerns,” meaning none of the domains presented high RoB. In domain 4 (measurement of the outcome), they had low RoB.

For within-group effects, three studies provided data for short-term strength recovery, involving three stretching groups (pooled n = 33). Results showed that post-exercise stretching protocols did not allow participants to recover their basal strength level (ES = −0.85; 95% CI = −1.53 to −0.17; p = 0.015; I² = 80.4%; Egger's test p = 0.396; Figure 3).

Figure 3. Forest plot denoting short-term strength recovery level in participants that completed post-exercise stretching protocols. Values shown are effect sizes (Hedges's g) with 95% confidence intervals (CI). The size of the plotted squares reflects the statistical weight of each study. The black diamond reflects the overall result. Note: negative values denote that post-exercise stretching protocols did not allow participants to recover their basal strength level (i.e., 0.00 in the figure).

In addition, three studies provided data for short-term strength recovery, involving three passive recovery groups (pooled n = 32). Results showed that post-exercise passive recovery protocols did not allow participants to recover their basal strength level (ES = −0.81; 95% CI = −1.46 to −0.15; p = 0.016; I² = 78.7%; Egger's test p = 0.435; Figure 4).

Figure 4. Forest plot denoting short-term strength recovery level in participants that completed post-exercise passive recovery protocols. Values shown are effect sizes (Hedges's g) with 95% confidence intervals (CI). The size of the plotted squares reflects the statistical weight of each study. The black diamond reflects the overall result. Note: negative values denote that post-exercise passive recovery protocols did not allow participants to recover their basal strength level (i.e., 0.00 in the figure).

Between-group comparisons (pooled n = 65) showed no effect of post-exercise stretching protocols on strength recovery (ES = −0.08; 95% CI = −0.54 to 0.39; p = 0.750; I² = 0.0%; Egger's test p = 0.531; Figure 5) when compared to control condition (i.e., passive recovery).

Figure 5. Forest plot of changes in short-term strength recovery after participating in post-exercise stretching protocols compared to control conditions (i.e., passive recovery). Values shown are effect sizes (Hedges's g) with 95% confidence intervals (CI). The size of the plotted squares reflects the statistical weight of each study. The black diamond reflects the overall result.
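The pooled effect sizes, confidence intervals, and p-values reported for short-term strength can be checked for internal consistency. The sketch below assumes a normal approximation (standard error recovered from the width of the 95% CI, two-sided z-test); the software and exact test used for the original calculations are not described in this section, so small differences due to rounding or to a different reference distribution are expected.

```python
import math


def p_from_ci(es, lower, upper, z_crit=1.959964):
    """Approximate SE, z, and two-sided p implied by an effect size and its 95% CI."""
    se = (upper - lower) / (2 * z_crit)  # CI half-width divided by the critical z
    z = es / se
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided tail area of the standard normal
    return se, z, p


# (ES, CI lower, CI upper, reported p) as given in the text for short-term strength
reported = {
    "stretching, within-group": (-0.85, -1.53, -0.17, 0.015),
    "passive recovery, within-group": (-0.81, -1.46, -0.15, 0.016),
    "stretching vs. passive recovery": (-0.08, -0.54, 0.39, 0.750),
}

for label, (es, lower, upper, p_reported) in reported.items():
    se, z, p = p_from_ci(es, lower, upper)
    print(f"{label}: SE={se:.3f}, z={z:.2f}, p={p:.3f} (reported {p_reported:.3f})")
```

Under these assumptions, the recomputed p-values fall close to the reported ones, which suggests the reported estimates, intervals, and p-values are mutually consistent.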

Delayed Effects (24 h) on Delayed Onset Muscle Soreness, Stretching vs. Passive Recovery (Rest)

Five studies had comparable data to assess DOMS at 24 h (Kokkinidis et al., 1998; Torres et al., 2005, 2013; Bonfim et al., 2010; McGrath et al., 2014). Two used active static stretching (Kokkinidis et al., 1998; Bonfim et al., 2010) and three passive static stretching (Torres et al., 2005, 2013; McGrath et al., 2014). All had at least one comparator that passively recovered (i.e., rest). The study of Bonfim et al. (2010) had two assessments of DOMS; here, we used the assessment through the visual analog scale, as the other studies also used similar scales. The four studies had high RoB in measurement of this outcome, so all results should be considered with caution.

For within-group comparisons, five studies provided data for 24-h post-exercise DOMS, involving five experimental groups (pooled n = 57). Results showed that post-exercise DOMS remained significantly above basal levels after post-exercise stretching protocols (ES = 1.55; 95% CI = 1.12–1.97; p < 0.001; I² = 48.3%; Egger's test p = 0.231; Figure 6).

Figure 6. Forest plot denoting 24-h post-exercise delayed onset of muscle soreness (DOMS) in participants that completed post-exercise stretching protocols. Values shown are effect sizes (Hedges's g) with 95% confidence intervals (CI). The size of the plotted squares reflects the statistical weight of each study. The black diamond reflects the overall result. Note: positive values denote that post-exercise stretching protocols did not allow participants to recover their basal DOMS level (i.e., 0.00 in the figure).

In addition, five studies provided data for 24-h post-exercise DOMS, involving five control groups (pooled n = 54). Results showed that passive recovery protocols did not allow participants to recover their basal DOMS level (ES = 1.87; 95% CI = 1.28–2.46; p < 0.001; I² = 64.6%; Egger's test p = 0.119; Figure 7).

Figure 7. Forest plot denoting 24-h post-exercise delayed onset of muscle soreness (DOMS) in participants that completed passive recovery (control conditions) protocols. Values shown are effect sizes (Hedges's g) with 95% confidence intervals (CI). The size of the plotted squares reflects the statistical weight of each study. The black diamond reflects the overall result. Note: positive values denote that passive recovery protocols did not allow participants to recover their basal DOMS level (i.e., 0.00 in the figure).

Between-group comparisons involved five experimental and five control groups (pooled n = 111). Results showed no effect of post-exercise stretching protocols on 24-h post-exercise DOMS (ES = −0.24; 95% CI = −0.60 to 0.12; p = 0.187; I² = 0.0%; Egger's test p = 0.880; Figure 8) when compared to control conditions (i.e., passive recovery).

Figure 8. Forest plot of changes in 24-h post-exercise delayed onset of muscle soreness (DOMS) after participating in post-exercise stretching protocols compared to control conditions (i.e., passive recovery). Values shown are effect sizes (Hedges's g) with 95% confidence intervals (CI). The size of the plotted squares reflects the statistical weight of each study. The black diamond reflects the overall result.
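For readers unfamiliar with the effect size shown in the forest plots, the following minimal sketch computes Hedges's g for two independent groups, i.e., a standardized mean difference with a small-sample correction, together with an approximate 95% CI. It uses the standard textbook formulas and hypothetical soreness scores; it is not necessarily the exact procedure applied in this review, and the handling of within-group (pre-post) comparisons is not covered.

```python
import math


def hedges_g(m1, sd1, n1, m2, sd2, n2):
    """Hedges's g (group 1 minus group 2) with an approximate 95% CI."""
    df = n1 + n2 - 2
    # Pooled standard deviation of the two groups
    sd_pooled = math.sqrt(((n1 - 1) * sd1**2 + (n2 - 1) * sd2**2) / df)
    d = (m1 - m2) / sd_pooled  # Cohen's d
    j = 1 - 3 / (4 * df - 1)  # small-sample correction factor
    g = j * d
    # Approximate variance of d, scaled by j to give the standard error of g
    var_d = (n1 + n2) / (n1 * n2) + d**2 / (2 * (n1 + n2))
    se_g = j * math.sqrt(var_d)
    return g, (g - 1.96 * se_g, g + 1.96 * se_g)


# Hypothetical 24-h soreness scores: stretching group vs. passive recovery group
g, ci = hedges_g(m1=4.0, sd1=2.0, n1=10, m2=5.0, sd2=2.0, n2=10)
print(f"g = {g:.2f}, 95% CI = {ci[0]:.2f} to {ci[1]:.2f}")
```

With this sign convention, a negative g for soreness means lower soreness in the stretching group; a CI that includes zero indicates no detectable difference between groups.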

Delayed Effects (48 h) on Delayed Onset Muscle Soreness, Stretching vs. Passive Recovery (Rest)

Four studies had comparable data (Torres et al., 2005, 2013; Bonfim et al., 2010; McGrath et al., 2014). One used active static stretching (Bonfim et al., 2010) and three passive static stretching (Torres et al., 2005, 2013; McGrath et al., 2014). All had at least one comparator that passively recovered (i.e., rest). With regard to RoB, four studies had high RoB in measurement of this outcome.

Four studies provided data for within-group comparisons on 48-h post-exercise DOMS, involving four experimental groups (pooled n = 53). Results showed that post-exercise DOMS remained significantly above basal levels after post-exercise stretching protocols (ES = 1.50; 95% CI = 1.02–1.98; p < 0.001; I² = 59.8%; Egger's test p = 0.257; Figure 9).

Figure 9. Forest plot denoting 48-h post-exercise delayed onset of muscle soreness (DOMS) in participants that completed post-exercise stretching protocols. Values shown are effect sizes (Hedges's g) with 95% confidence intervals (CI). The size of the plotted squares reflects the statistical weight of each study. The black diamond reflects the overall result. Note: positive values denote that post-exercise stretching protocols did not allow participants to recover their basal DOMS level (i.e., 0.00 in the figure).

Four studies provided data for 48-h post-exercise DOMS, involving four control groups (i.e., passive recovery) (pooled n = 50). Results showed that post-exercise passive recovery protocols did not allow participants to recover their basal DOMS level (ES = 1.52; 95% CI = 1.17–1.87; p < 0.001; I² = 18.3%; Egger's test p = 0.120; Figure 10).

Figure 10. Forest plot denoting 48-h post-exercise delayed onset of muscle soreness (DOMS) in participants that completed post-exercise passive recovery. Values shown are effect sizes (Hedges's g) with 95% confidence intervals (CI). The size of the plotted squares reflects the statistical weight of each study. The black diamond reflects the overall result. Note: positive values denote that post-exercise passive recovery did not allow participants to recover their basal DOMS level (i.e., 0.00 in the figure).

For between-group comparisons, four studies provided data for 48-h post-exercise DOMS, involving four experimental and four control groups (pooled n = 103). Results showed no effect of post-exercise stretching protocols on 48-h post-exercise DOMS (ES = −0.09; 95% CI = −0.47 to 0.28; p = 0.629; I² = 0.0%; Egger's test p = 0.777; Figure 11) when compared to control conditions (i.e., passive recovery).

Figure 11. Forest plot of changes in 48-h post-exercise delayed onset of muscle soreness (DOMS) after participating in post-exercise stretching protocols compared to control conditions (i.e., passive recovery). Values shown are effect sizes (Hedges's g) with 95% confidence intervals (CI). The size of the plotted squares reflects the statistical weight of each study. The black diamond reflects the overall result.

Delayed Effects (72 h) on Delayed Onset Muscle Soreness, Stretching vs. Passive Recovery (Rest)

Three studies had comparable data for DOMS at 72 h (Torres et al., 2005, 2013; Bonfim et al., 2010). One used active static stretching (Bonfim et al., 2010) and two passive static stretching (Torres et al., 2005, 2013). With regard to RoB, the three studies had high RoB in measurement of this outcome.

For within-group analysis, three studies provided data for 72-h post-exercise DOMS, involving three experimental groups (pooled n = 33). Results showed that post-exercise DOMS remained significantly above basal levels after post-exercise stretching protocols (ES = 0.98; 95% CI = 0.67–1.28; p < 0.001; I² = 0.0%; Egger's test p = 0.525; Figure 12).

Figure 12. Forest plot denoting 72-h post-exercise delayed onset of muscle soreness (DOMS) in participants that completed post-exercise stretching protocols. Values shown are effect sizes (Hedges's g) with 95% confidence intervals (CI). The size of the plotted squares reflects the statistical weight of each study. The black diamond reflects the overall result. Note: positive values denote that post-exercise stretching protocols did not allow participants to recover their basal DOMS level (i.e., 0.00 in the figure).

Three studies provided data for 72-h post-exercise DOMS, involving three passive recovery groups (pooled n = 32). Results showed that post-exercise passive recovery protocols did not allow participants to recover their basal DOMS level (ES = 0.99; 95% CI = 0.68–1.30; p < 0.001; I² = 0.0%; Egger's test p = 0.641; Figure 13).

Figure 13. Forest plot denoting 72-h post-exercise delayed onset of muscle soreness (DOMS) in participants that completed post-exercise passive recovery protocols. Values shown are effect sizes (Hedges's g) with 95% confidence intervals (CI). The size of the plotted squares reflects the statistical weight of each study. The black diamond reflects the overall result. Note: positive values denote that post-exercise passive recovery protocols did not allow participants to recover their basal DOMS level (i.e., 0.00 in the figure).

For between-group comparisons, three studies provided data for 72-h post-exercise DOMS, involving three experimental and three control groups (pooled n = 65). Results showed no effect of post-exercise stretching protocols on 72-h post-exercise DOMS (ES = −0.23; 95% CI = −0.70 to 0.24; p = 0.337; I² = 0.0%; Egger's test p = 0.165; Figure 14) when compared to control conditions (i.e., passive recovery).

Figure 14. Forest plot of changes in 72-h post-exercise delayed onset of muscle soreness (DOMS) after participating in post-exercise stretching protocols compared to control conditions (i.e., passive recovery). Values shown are effect sizes (Hedges's g) with 95% confidence intervals (CI). The size of the plotted squares reflects the statistical weight of each study. The black diamond reflects the overall result.
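Across the four meta-analytical comparisons, the pooled sample of each between-group analysis should equal the sum of the corresponding stretching and passive recovery groups. The short check below restates the pooled n values reported above; the dictionary layout is only illustrative.

```python
# (stretching n, passive recovery n, between-group pooled n) as reported in the text
pooled_n = {
    "strength, <=1 h": (33, 32, 65),
    "DOMS, 24 h": (57, 54, 111),
    "DOMS, 48 h": (53, 50, 103),
    "DOMS, 72 h": (33, 32, 65),
}

# Collect any comparison whose group sizes do not add up to the pooled total
mismatches = [
    label
    for label, (n_stretch, n_rest, n_between) in pooled_n.items()
    if n_stretch + n_rest != n_between
]

print("all pooled samples consistent" if not mismatches else mismatches)
```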

Additional Analysis

Due to the small number of studies included in each meta-analysis, additional analyses and sensitivity analyses were not performed. In each analysis, RoB was similar in all studies, and so we decided not to assess the effects of RoB on the results. Meta-regression was not performed because fewer than 10 studies had sufficient commonalities.

Confidence in Cumulative Evidence

Confidence in cumulative evidence is equivalent to quality of the evidence (Higgins et al., 2019). GRADE assessments are presented in Table 5. Overall, we have very little confidence in the effect estimate, and the true effect is likely to be substantially different from the estimate of effect.

Table 5. GRADE assessment for the certainty of evidence.

Discussion

Summary of Evidence

Stretching has traditionally been prescribed for the cool-down phase of training sessions, under the premise that it enhances recovery (ACSM, 2018; American Heart Association, 2020). However, this premise has been questioned by previous assessments of the literature (Herbert and Gabriel, 2002; Henschke and Lin, 2011; Herbert et al., 2011). Therefore, we conducted a systematic review with meta-analysis of supervised RCTs on the effects of post-exercise stretching on short-term (i.e., ≤ 1 h) and delayed (24, 48, and 72 h) recovery of strength levels, ROM, and DOMS. Searches were conducted in eight electronic databases after protocol approval, on December 23 and 24, 2020, and updated on February 16, 2021. Of the 17,050 records emerging from the searches and 25 additional records emerging from manual searches within reference lists, 11 RCTs were eligible for qualitative analysis (n = 289), and 10 for quantitative analyses (n = 280, with n = 229 after excluding groups not fulfilling PICOS criteria). Due to the overall small sample size, generalization to a broader population is not advised.

Active static stretching, passive stretching and PNF were used for post-exercise recovery, but no protocol adopted dynamic stretching. Overall, analysis of individual studies showed no evidence that stretching enhanced recovery in comparison to passive recovery (i.e., rest) or to alternative recovery modalities, such as cycling and cold-water immersion. Nor was there evidence to the contrary, i.e., that stretching impaired recovery. Even for secondary outcomes, such as blood lactate and serum creatine kinase, no strong case can be made for stretching accelerating or improving recovery. Furthermore, overall RoB was high, meaning that this field of research is lacking in terms of methodological design. Especially problematic was the wide use of unblinded testers, even for outcomes with a greater degree of subjectivity.

Due to the diversity of outcomes and timepoints of assessments, only four meta-analytical comparisons were possible, all between stretching and passive recovery (i.e., rest): strength levels at ≤ 1 h, and DOMS at 24, 48, and 72 h. Overall, stretching was no more effective than passive recovery in returning strength levels and DOMS to baseline values. Heterogeneity of the meta-analysis (I²) was high for within-group (pre-post) comparisons and low for between-group comparisons for strength outcomes at ≤ 1 h of recovery, moderate (within) and low (between) for DOMS at 24 h, low to moderate (within) and low (between) for DOMS at 48 h, and low (within and between) for DOMS at 72 h. Information in terms of recovery of ROM after different recovery protocols was insufficient to run a meta-analysis. There was no evidence of publication bias.
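The qualitative heterogeneity labels used above can be reproduced from the reported I² values with a simple classifier. The cut-offs in this sketch (below 25% low, 25 to 75% moderate, above 75% high) are an assumption chosen because they reproduce the labels in the text; the thresholds actually applied are not stated in this section.

```python
def classify_i2(i2_percent):
    """Label heterogeneity using assumed cut-offs: <25 low, 25-75 moderate, >75 high."""
    if i2_percent < 25:
        return "low"
    if i2_percent <= 75:
        return "moderate"
    return "high"


# I² values (%) reported in the text: within-group (stretching, passive) and between-group
i2_values = {
    "strength, <=1 h": {"within": [80.4, 78.7], "between": [0.0]},
    "DOMS, 24 h": {"within": [48.3, 64.6], "between": [0.0]},
    "DOMS, 48 h": {"within": [59.8, 18.3], "between": [0.0]},
    "DOMS, 72 h": {"within": [0.0, 0.0], "between": [0.0]},
}

labels = {
    outcome: {kind: [classify_i2(v) for v in values] for kind, values in groups.items()}
    for outcome, groups in i2_values.items()
}

for outcome, groups in labels.items():
    print(outcome, groups)
```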

Poor External Validity

Overall, the studies included in our analysis may be considered to have poor external validity. In terms of population, they only apply to adults under 40 years old, with no studies being performed in children, teenagers or adults aged ≥ 40 years. Moreover, only two of the 11 studies included women in their sample: 50% of the sample in one study (McGrath et al., 2014) and an unclear proportion in another (Bonfim et al., 2010). As such, current results derive mainly from studies with men. As all subjects were healthy, it is unclear how subjects with injuries and/or pathologies would respond. Furthermore, only two studies included recreationally trained subjects (West et al., 2014) or athletes (César et al., 2021).

The nature of the exercise protocols (pre-recovery) presents a number of problems that limit their external validity as well. While most studies used protocols that were likely to induce DOMS, in real-life settings coaches are unlikely to regularly try to elicit DOMS in their athletes or patients. And since most studies did not assess athletes, it is possible that results from the fatigue-inducing protocols were somewhat artificial, as most were conducted with populations not engaged in regular, structured physical activity, and thereby less well-adapted to the acute effects of fatiguing exercise. Lack of familiarity with the protocols may have exacerbated this effect. Moreover, the protocols were single-component or even single-exercise, while real-life exercise sessions will more likely involve multiple components and/or multiple exercises. Also, most of the knowledge derives from studies focusing on the lower limbs, with only one study having assessed the effects on the upper limbs (César et al., 2021).

With one exception (Cooke et al., 2018), the fatigue-inducing protocols had very short durations, usually well below 30 min. A real-life exercise session will rarely last ≤ 30 min, especially with athletic populations. Conversely, the duration of recovery protocols was excessive in many cases, even reaching 30 min (West et al., 2014; Cooke et al., 2018). The combination of very short exercise sessions with long recovery sessions does not seem practical. Also, six studies (~55%) used individualized passive stretching. This means that one supervisor is required for every practitioner, something that will hardly be possible to implement in physical education classes, sports training, or even for the general gym-going population (exceptions would be those with access to a personal trainer).

Data Is Scarce, Heterogeneous, and Does Not Support Existing Guidelines

Considering that stretching is so often prescribed as a valid protocol for enhancing post-exercise recovery (ACSM, 2018), the reduced number of studies (n = 11) and small overall sample (n = 289) emerging from our searches, allied with a considerable diversity of exercise and post-exercise recovery protocols, demonstrate that data is too scarce and heterogeneous to support existing guidelines. Although absence of evidence is not evidence of absence, world-leading organizations should encourage further research in this field before promoting more definitive recommendations. Recommendations should not be provided in the absence of empirical support. At a minimum, guidelines should acknowledge that prescribing post-exercise stretching as a means of improving recovery is based on belief and not on data. In fact, enhancing recovery implies that recovery is faster and/or better when post-exercise stretching is applied than when passive recovery (i.e., rest) is used. Our data does not sustain this belief. Indeed, >70% of the analyzed studies had one group performing passive recovery (i.e., rest), and stretching did not improve recovery when compared to those controls. Perhaps the eventual benefits of post-exercise stretching are balanced by the extra fatigue that it adds, although further research is required to better explore the mechanisms underlying these effects.

We strongly suggest that science should abide by the burden of proof. Until more (and better) data is collected, no case should be built for (or against) post-exercise stretching with the goal of improving recovery. Admittedly, post-exercise stretching may have other goals than improving recovery, but these were not addressed in our analysis.

What's Different in Relation to Previous Systematic Reviews on the Topic?

As mentioned in the introduction, previous SRMA addressed the topic of post-exercise stretching (Herbert and Gabriel, 2002; Henschke and Lin, 2011; Herbert et al., 2011). However, important differences in design exist in comparison with our review, beyond the natural update: (i) these reviews assessed the effects of both post- and pre-exercise stretching, while we focused solely on post-exercise stretching; (ii) they assessed the effects of stretching on DOMS and risk of injury, while we focused on DOMS, strength levels, and ROM; (iii) they accepted non-randomized studies, while our review was limited to randomized studies; and (iv) we consulted more databases than those reviews. Therefore, it is not surprising that the list of included articles is largely different. Still, our review reinforces previous conclusions that post-exercise stretching does not confer protection from DOMS, while also showing that it does not accelerate (nor impair) recovery of strength levels or ROM.

Limitations

The limited number of studies, the high RoB and high heterogeneity, the diversity of designs, and the poor external validity advise against more definitive conclusions. Moreover, the included studies used extremely varied stretching intensities, and all relied on vague wording to convey to the subjects the intended degree of stretching. If stretching intensity is not properly described, any comparisons are limited (Sands et al., 2013). We believe that stretching intensity could be more rigorously assessed with instruments such as the Stretching Intensity Scale (Freitas et al., 2015).

Conclusions

Overall, our data neither supports nor contradicts the use of post-exercise stretching. Nevertheless, if post-exercise stretching does not seem to enhance recovery in relation to passive recovery (i.e., rest), its implementation among participants or athletes is, at the least, questionable. Still, data is scarce and heterogeneous, and overall confidence in cumulative evidence is very low. For now, recommendations on whether post-exercise stretching should be applied for the purposes of recovery are misleading, as the (insufficient) data that is available does not support those claims.

We suggest that future research on post-exercise recovery always pre-register the protocol and adopt a randomized design, with a proper description of how randomization was performed and whether the allocation sequence was concealed. A passive recovery (i.e., rest) control group should always be included. Multi-component exercise sessions lasting ≥60 min, with recovery protocols lasting ≤15 min, would provide greater external validity to the findings. Studies with women and athletes should be reinforced, and studies with children, teenagers, adults ≥40 years, and populations with pathologies and/or injuries are lacking and should be prioritized.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Materials; further inquiries can be directed to the corresponding author.

Author Contributions

All authors had substantial contributions to the conception or design of the work, acquisition, analysis, or interpretation of data for the work, drafting the work or revising it critically for important intellectual content, final approval of the version to be published, and agreement to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We thank Professor Pantelis Nikolaidis and Nefeli Papanikolaki for their help with obtaining and translating the Greek study for which we had to analyze the full text. We thank Lee Saong Min for the translation of the Korean studies to both English and Portuguese. We thank Ana Gracinda Ramos for independent extraction of data from graphs. We thank Professor José Alberto Duarte for providing expert input in suggesting potentially relevant studies to be included in the final sample, after having analyzed the list of studies we had selected, as well as the inclusion and exclusion criteria.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphys.2021.677581/full#supplementary-material

References

Abt, G., Boreham, C., Davison, G., Jackson, R., Nevill, A., Wallace, E., et al. (2020). Power, precision, and sample size estimation in sport and exercise science research. J. Sports Sci. 38, 1933–1935. doi: 10.1080/02640414.2020.1776002

ACSM (2018). ACSM's Guidelines for Exercise Testing and Prescription, 10th Edn., eds M. P. Bayles and A. M. Swank. Philadelphia, PA: Wolters Kluwer.

American Heart Association (2020). Guidelines and Statements. Available online at: https://professional.heart.org/en/guidelines-and-statements/guidelines-and-statements-search

Apostolopoulos, N. C., Lahart, I. M., Plyley, M. J., Taunton, J., Nevill, A. M., Koutedakis, Y., et al. (2018). The effects of different passive static stretching intensities on recovery from unaccustomed eccentric exercise – a randomized controlled trial. Appl. Physiol. Nutr. Metab. 43, 806–815. doi: 10.1139/apnm-2017-0841

Bonfim, A. E. D., De Re, D., Gaffuri, J., Costa, M. M. D., Portolez, J. L. M., and Bertolini, G. R. F. (2010). Use of static stretching as an intervenient factor in delayed onset muscle soreness. Rev. Brasil. De Med. Do Esporte 16, 349–352. doi: 10.1590/S1517-86922010000500006

Boobphachart, D., Manimmanakorn, N., Manimmanakorn, A., Thuwakum, W., and Hamlin, M. J. (2017). Effects of elastic taping, non-elastic taping and static stretching on recovery after intensive eccentric exercise. Res. Sports Med. 25, 181–190. doi: 10.1080/15438627.2017.1282360

Cè, E., Limonta, E., Maggioni, M. A., Rampichini, S., Veicsteinas, A., and Esposito, F. (2013). Stretching and deep and superficial massage do not influence blood lactate levels after heavy-intensity cycle exercise. J. Sports Sci. 31, 856–866. doi: 10.1080/02640414.2012.753158

César, E. P., Júnior, C. S. R., and Francisco, R. N. (2021). Effects of 2 intersession strategies for physical recovery in Jiu-Jitsu athletes. Int. J. Sports Physiol. Perform. 16, 1–6. doi: 10.1123/ijspp.2019-0701

Cha, H. G., and Kim, M. K. (2015). Effects of the hold and relax-agonist contraction technique on recovery from delayed onset muscle soreness after exercise in healthy adults. J. Phys. Ther. Sci. 27, 3275–3277. doi: 10.1589/jpts.27.3275

Cheung, K., Hume, P., and Maxwell, L. (2003). Delayed onset muscle soreness: treatment strategies and performance factors. Sports Med. 33, 145–164. doi: 10.2165/00007256-200333020-00005

Cooke, M. B., Nix, C. M., Greenwood, L. D., and Greenwood, M. C. (2018). No differences between alter G-trainer and active and passive recovery strategies on isokinetic strength, systemic oxidative stress and perceived muscle soreness after exercise-induced muscle damage. J. Strength Condition. Res. 32, 736–747. doi: 10.1519/JSC.0000000000001750

Deeks, J. J., Higgins, J. P., and Altman, D. G. (2008). “Analysing data and undertaking meta-analyses,” in Cochrane Handbook for Systematic Reviews of Interventions: The Cochrane Collaboration, eds J. P. Higgins and S. Green (New Jersey, NY: The Cochrane Collaboration), 243–296. doi: 10.1002/9780470712184.ch9

Drahota, A., and Beller, E. (2020). RevMan Calculator for Microsoft Excel [Computer Software]. Cochrane.

Duffield, R., Murphy, A., Kellett, A., and Reid, M. (2014). Recovery from repeated on-court tennis sessions: combining cold-water immersion, compression, and sleep interventions. Int. J. Sports Physiol. Perform. 9, 273–282. doi: 10.1123/ijspp.2012-0359

Duval, S., and Tweedie, R. (2000). Trim and fill: a simple funnel-plot-based method of testing and adjusting for publication bias in meta-analysis. Biometrics 56, 455–463. doi: 10.1111/j.0006-341X.2000.00455.x

Egger, M., Davey Smith, G., Schneider, M., and Minder, C. (1997). Bias in meta-analysis detected by a simple, graphical test. BMJ 315, 629–634. doi: 10.1136/bmj.315.7109.629

Elbourne, D. R., Altman, D. G., Higgins, J. P. T., Curtin, F., Worthington, H. V., and Vail, A. (2002). Meta-analyses involving cross-over trials: methodological issues. Int. J. Epidemiol. 31, 140–149. doi: 10.1093/ije/31.1.140

Freitas, S. R., Vaz, J. R., Gomes, L., Silvestre, R., Hilário, E., Cordeiro, N., et al. (2015). A new tool to assess the perception of stretching intensity. J. Strength Condition. Res. 29, 2666–2678. doi: 10.1519/JSC.0000000000000926

Fuchs, C. J., Kouw, I. W. K., Churchward-Venne, T. A., Smeets, J. S. J., Senden, J. M., Lichtenbelt, W., et al. (2020). Postexercise cooling impairs muscle protein synthesis rates in recreational athletes. J. Physiol. 598, 755–772. doi: 10.1113/JP278996

García-Hermoso, A., Ramírez-Campillo, R., and Izquierdo, M. (2019). Is muscular fitness associated with future health benefits in children and adolescents? A systematic review and meta-analysis of longitudinal studies. Sports Med. 49, 1079–1094. doi: 10.1007/s40279-019-01098-6

Gulick, D. T., Kimura, I. F., Sitler, M., Paolone, A., and Kelly, J. D. (1996). Various treatment techniques on signs and symptoms of delayed onset muscle soreness. J. Athl. Train. 31, 145–152.

Guyatt, G. H., Oxman, A. D., Akl, E. A., Kunz, R., Vist, G., Brozek, J., et al. (2011). GRADE guidelines: 1. Introduction-GRADE evidence profiles and summary of findings tables. J. Clin. Epidemiol. 64, 383–394. doi: 10.1016/j.jclinepi.2010.04.026

Hariton, E., and Locascio, J. J. (2018). Randomised controlled trials - the gold standard for effectiveness research: study design: randomised controlled trials. BJOG 125, 1716–1716. doi: 10.1111/1471-0528.15199

Henschke, N., and Lin, C. C. (2011). Stretching before or after exercise does not reduce delayed-onset muscle soreness. Br. J. Sports Med. 45:1249. doi: 10.1136/bjsports-2011-090599

Herbert, R., and Gabriel, M. (2002). Effects of stretching before and after exercising on muscle soreness and risk of injury: systematic review. BMJ 325:468. doi: 10.1136/bmj.325.7362.468

Herbert, R., Noronha, M., and Kamper, S. (2011). Stretching to Prevent or Reduce Muscle Soreness After Exercise (Review). Cochrane Library, CD004577.

Higgins, J. P., Li, T., and Sterne, J. A. C. (2020). Revised Cochrane Risk of Bias Tool for Randomized Trials (RoB 2). Additional Considerations for Crossover Trials. Preliminary Tool Version, 8 December 2020. Cochrane. Available online at: https://sites.google.com/site/riskofbiastool/welcome/rob-2-0-tool/rob-2-for-crossover-trials.

Higgins, J. P., Thomas, J., Chandler, J., Cumpston, M., Li, T., Page, M. J., et al. (2019). Cochrane Handbook for Systematic Reviews of Interventions, 2nd Edn. Chichester: John Wiley & Sons.

Higgins, J. P., and Thompson, S. G. (2002). Quantifying heterogeneity in a meta-analysis. Stat. Med. 21, 1539–1558. doi: 10.1002/sim.1186

Hopkins, W. G., Marshall, S. W., Batterham, A. M., and Hanin, J. (2009). Progressive statistics for studies in sports medicine and exercise science. Med. Sci. Sports Exerc. 41, 3–13. doi: 10.1249/MSS.0b013e31818cb278

Kang, T.-W., and Park, J.-B. (2018). The effect of stretching in cold immersion after artificial delayed onset of muscle soreness (DOMS) on muscle pain and muscular function for life care. J. Korea Entertain. Indus. Assoc. 12, 317–326. doi: 10.21184/jkeia.2018.12.12.8.317

Kokkinidis, E., Tsamourtas, A., Buckenmeyer, P., and Machairidou, M. (1998). The effect of static stretching and cryotherapy on the recovery of delayed muscle soreness. Exerc. Soc. J. Sport Sci. 19, 45–53.

Kontopantelis, E., Springate, D. A., and Reeves, D. (2013). A re-analysis of the Cochrane Library data: the dangers of unobserved heterogeneity in meta-analyses. PLoS ONE 8:e69930. doi: 10.1371/journal.pone.0069930

Lima, C. D., Ruas, C. V., Behm, D. G., and Brown, L. E. (2019). Acute effects of stretching on flexibility and performance: a narrative review. J. Sci. Sport Exerc. 1, 29–37. doi: 10.1007/s42978-019-0011-x

Lohse, K. R., Sainani, K. L., Taylor, J. A., Butson, M. L., Knight, E. J., and Vickers, A. J. (2020). Systematic review of the use of “magnitude-based inference” in sports science and medicine. PLoS ONE 15:e0235318. doi: 10.1371/journal.pone.0235318

McGlynn, G. H., Laughlin, N. T., and Rowe, V. (1979). Effect of electromyographic feedback and static stretching on artificially induced muscle soreness. Am. J. Phys. Med. 58, 139–148.

McGrath, R. P., Whitehead, J. R., and Caine, D. J. (2014). The effects of proprioceptive neuromuscular facilitation stretching on post-exercise delayed onset muscle soreness in young adults. Int. J. Exerc. Sci. 7, 14–21.

Mika, A., Mika, P., Fernhall, B., and Unnithan, V. B. (2007). Comparison of recovery strategies on muscle performance after fatiguing exercise. Am. J. Phys. Med. Rehabil. 86, 474–481. doi: 10.1097/PHM.0b013e31805b7c79

Moher, D., Liberati, A., Tetzlaff, J., and Altman, D. G. (2009). Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. BMJ 339:b2535. doi: 10.1136/bmj.b2535

Moher, D., Shamseer, L., Clarke, M., Ghersi, D., Liberati, A., Petticrew, M., et al. (2015). Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015 statement. Syst. Rev. 4:1. doi: 10.1186/2046-4053-4-1

Moran, J., Ramirez-Campillo, R., and Granacher, U. (2018). Effects of jumping exercise on muscular power in older adults: a meta-analysis. Sports Med. 48, 2843–2857. doi: 10.1007/s40279-018-1002-5

Muanjai, P., and Namsawang, J. (2015). Effects of stretching and cold-water immersion on functional signs of muscle soreness following plyometric training. J. Phys. Educ. Sport 15, 128–135. doi: 10.7752/jpes.2015.01021

Oh, D.-w. (2013). Effects of warm-up and cool-down exercises for preventing delayed onset muscle soreness on pain and muscle activation. Phys. Ther. Korea 20, 28–35. doi: 10.12674/ptk.2013.20.1.028

Pooley, S., Spendiff, O., Allen, M., and Moir, H. J. (2020). Comparative efficacy of active recovery and cold water immersion as post-match recovery interventions in elite youth soccer. J. Sports Sci. 38, 1423–1431. doi: 10.1080/02640414.2019.1660448

Robey, E., Dawson, B., Goodman, C., and Beilby, J. (2009). Effect of postexercise recovery procedures following strenuous stair-climb running. Res. Sports Med. 17, 245–259. doi: 10.1080/15438620902901276

Rohatgi, A. (2020). WebPlotDigitizer, Version 4.4. Pacifica, CA. Available online at: https://automeris.io/WebPlotDigitizer

Sands, W. A., McNeal, J. R., Murray, S. R., Ramsey, M. W., Sato, K., Mizuguchi, S., et al. (2013). Stretching and its effects on recovery. Strength Cond. J. 35, 30–36. doi: 10.1519/SSC.0000000000000004

Shea, B. J., Reeves, B. C., Wells, G., Thuku, M., Hamel, C., Moran, J., et al. (2017). AMSTAR 2: a critical appraisal tool for systematic reviews that include randomised or non-randomised studies of healthcare interventions, or both. BMJ 358:j4008. doi: 10.1136/bmj.j4008

Shi, L., and Lin, L. (2019). The trim-and-fill method for publication bias: practical guidelines and recommendations based on a large database of meta-analyses. Medicine 98:e15987. doi: 10.1097/MD.0000000000015987

Skrede, T., Steene-Johannessen, J., Anderssen, S. A., Resaland, G. K., and Ekelund, U. (2019). The prospective association between objectively measured sedentary time, moderate-to-vigorous physical activity and cardiometabolic risk factors in youth: a systematic review and meta-analysis. Obes. Rev. 20, 55–74. doi: 10.1111/obr.12758

Spieth, P. M., Kubasch, A. S., Penzlin, A. I., Illigens, B. M.-W., Barlinn, K., and Siepmann, T. (2016). Randomized controlled trials - a matter of design. Neuropsychiatr. Dis. Treat. 12, 1341–1349. doi: 10.2147/NDT.S101938

Sterne, J. A. C., Savović, J., Page, M. J., Elbers, R. G., Blencowe, N. S., Boutron, I., et al. (2019). RoB 2: a revised tool for assessing risk of bias in randomised trials. BMJ 366:l4898. doi: 10.1136/bmj.l4898

Torres, R., Carvalho, P., and Duarte, J. A. (2005). Effects of a static stretching program on clinical and biochemical markers of muscle damage induced by eccentric exercise. Rev. Portuguesa Ciências Desporto 5, 274–287. doi: 10.5628/rpcd.05.03.274

Torres, R., Pinho, F., Duarte, J. A., and Cabri, J. M. H. (2013). Effect of single bout versus repeated bouts of stretching on muscle recovery following eccentric exercise. J. Sci. Med. Sport 16, 583–588. doi: 10.1016/j.jsams.2013.01.002

Valentine, J. C., Pigott, T. D., and Rothstein, H. R. (2010). How many studies do you need?: a primer on statistical power for meta-analysis. J. Educ. Behav. Stat. 35, 215–247. doi: 10.3102/1076998609346961

Van Hooren, B., and Peake, J. M. (2018). Do we need a cool-down after exercise? A narrative review of the psychophysiological effects and the effects on performance, injuries and the long-term adaptive response. Sports Med. 48, 1575–1595. doi: 10.1007/s40279-018-0916-2

Wessel, J., and Wan, A. (1994). Effect of stretching on the intensity of delayed-onset muscle soreness. Clin. J. Sport Med. 4, 83–87. doi: 10.1097/00042752-199404000-00003

West, A. D., Cooke, M. B., LaBounty, P. M., Byars, A. G., and Greenwood, M. (2014). Effects of G-trainer, cycle ergometry, and stretching on physiological and psychological recovery from endurance exercise. J. Strength Cond. Res. 28, 3453–3461. doi: 10.1519/JSC.0000000000000577

Xanthos, P. D., Lythgo, N., Gordon, B. A., and Benson, A. C. (2013). The effect of whole-body vibration as a recovery technique on running kinematics and jumping performance following eccentric exercise to induce delayed-onset muscle soreness. Sports Technol. 6, 112–121. doi: 10.1080/19346182.2013.819359

Xie, Y., Feng, B., Chen, K., Andersen, L. L., Page, P., and Wang, Y. (2018). The efficacy of dynamic contract-relax stretching on delayed-onset muscle soreness among healthy individuals: a randomized clinical trial. Clin. J. Sport Med. 28, 28–36. doi: 10.1097/JSM.0000000000000442

Zhang, Y., Alonso-Coello, P., Guyatt, G. H., Yepes-Nuñez, J. J., Akl, E. A., Hazlewood, G., et al. (2019). GRADE Guidelines: 19. Assessing the certainty of evidence in the importance of outcomes or values and preferences - risk of bias and indirectness. J. Clin. Epidemiol. 111, 94–104. doi: 10.1016/j.jclinepi.2018.01.013

yes (2010). The effect of cool-down and warm-up on delayed onset muscle soreness. Arch. Orthopedic Sports Phys. Ther. 6:1. Available online at: http://journal.sportspt.co.kr/journal/article.php?code=52642

Keywords: flexibility, post exercise recovery, myalgia, cool-down, delayed onset muscular soreness, stretching, muscle stretching exercises, articular range of motion

Citation: Afonso J, Clemente FM, Nakamura FY, Morouço P, Sarmento H, Inman RA and Ramirez-Campillo R (2021) The Effectiveness of Post-exercise Stretching in Short-Term and Delayed Recovery of Strength, Range of Motion and Delayed Onset Muscle Soreness: A Systematic Review and Meta-Analysis of Randomized Controlled Trials. Front. Physiol. 12:677581. doi: 10.3389/fphys.2021.677581

Received: 08 March 2021; Accepted: 06 April 2021;
Published: 05 May 2021.

Edited by:

Argyris G. Toubekis, National and Kapodistrian University of Athens, Greece

Reviewed by:

Olyvia Donti, National and Kapodistrian University of Athens, Greece
James Robert Broatch, Victoria University, Australia

Copyright © 2021 Afonso, Clemente, Nakamura, Morouço, Sarmento, Inman and Ramirez-Campillo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Rodrigo Ramirez-Campillo, r.ramirez@ulagos.cl

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.