ORIGINAL RESEARCH article
The Performance-Result Gap in Mixed-Reality Cycling – Evidence From the Virtual Tour de France 2020 on Zwift
- Center for Management, Chair of Business Administration - Organization, Personnel and Innovation, University of Münster, Münster, Germany
Background: Mixed-reality sports are increasingly reaching the highest level of sport, exemplified by the first Virtual Tour de France, held in 2020. In road races, power output data are only sporadically available, which is why the effect of power output on race results is largely unknown. However, in mixed-reality competitions, measuring and comparing the power output data of all participants is a fundamental prerequisite for evaluating the athlete’s performance.
Objective: This study investigates the influence of different power output parameters (absolute and relative peak power output) as well as body mass and height on the results in mixed-reality competitions.
Methods: We scrape data from all six stages of the 2020 Virtual Tour de France of women and men and analyze it using regression analysis. Third-order polynomial regressions are performed as a cubic relationship between power output and competition result can be assumed.
Results: Across all stages, relative power output over the entire distance explains most of the variance in the results, with maximum explanatory power between 77% and 98% for women and between 84% and 99% for men. Thus, power output is the most powerful predictor of success in mixed-reality sports. However, the identified performance-result gap reveals that other determinants have a subordinate role in success. Body mass and height can explain the results only in a few stages. The explanatory power of the determinants considered depends in particular on the stage profile and the progression of the race.
Conclusion: By identifying this performance-result gap that needs to be addressed by considering additional factors like competition strategy or the specific use of equipment, important implications for the future of sports science and mixed-reality sports emerge.
Mixed-reality sports platforms such as Zwift allow the athletic performance provided in the real world to be transferred into a virtual space so that the participating athletes can also physically interact and even compete with each other (Speicher et al., 2019; Westmattelmann et al., 2021a). Driven by the Covid-19 pandemic, mixed-reality sports have reached the highest level of sports in a very short time, with the first-ever Virtual Tour de France held in July 2020 (Westmattelmann et al., 2021a). Both the women’s and men’s races, which were hosted on the indoor cycling platform Zwift, featured professional road cyclists (Westmattelmann et al., 2021b). In the men’s race, several “real-world” Tour de France overall winners and stage winners participated (Cyclingnews, 2020). The races gained a lot of public attention and were broadcast in over 130 countries (Cyclingnews, 2020). The fact that in December 2020, the world cycling federation (UCI) for the first time hosted the “UCI Cycling Esports World Championships” reflects the rising interest from sports organizations (UCI, 2020). In addition to participating in competitions, professional cyclists use mixed-reality sports platforms for training or social interaction (Westmattelmann et al., 2021c).
In mixed-reality cycling races, the speed of the avatar within the virtual world depends on the physical power generated (in watts), in addition to the athlete’s body mass and height (Delaney and Bromley, 2020). Cycling power is measured through a so-called smart trainer on which the athlete’s road bike is mounted. Besides physical performance, the avatar’s speed may also depend on additional factors like slipstream effects, simulated course profiles or virtual equipment. Additionally, so-called power-ups are available, whose strategic use may also affect the race results. Mixed-reality sports platforms use a number of implemented algorithms to define how athletic performance on the smart trainer is transferred to the virtual world and can be analogously compared to the laws of nature in the real world. Similarly, rolling resistance, aerodynamic drag, and the movement on mixed-reality sports platforms are also simulated according to defined rules (Kyle, 2003). In both the real world and the virtual world, knowledge of all relevant parameters, such as power output or body mass and height of the rider and course characteristics, could be used to accurately determine the speed or riding time over a defined course in individual time trials. In mass-start races, such as the (Virtual) Tour de France, the behavior of the riders and the associated interaction (e.g., slipstream) lead to an emergent system behavior that is not represented by the laws of nature (real world) or is not included in the algorithm on mixed-reality sports platforms (virtual world).
Given the complex interplay of various factors that may be critical to success in cycling races, exercise physiology research can be divided into two domains. The first domain focuses on factors that influence the physical performance of athletes in endurance sports, which in cycling is measured as absolute power in watts or relative power in watts per kilogram body weight. In their review, Jeukendrup and Martin, (2001) summarized factors that determine performance in cycling and ranked them in order of importance. According to their study, the greatest impact on cycling performance originates from the internal factor training, particularly a training program. They also valued altitude training as having a positive impact on performance. Particularly in the context of indoor cycling, the starting strategy (Mattern et al., 2001), the time of day (Atkinson et al., 2005), warm-up (Munro et al., 2017), recovery duration (Glaister et al., 2005), and precooling strategy (Quod et al., 2008) could also be considered as factors influencing performance. In cycling, the seating position (standing or seated) also influences performance (Hansen and Waldeland, 2008). Furthermore, gear ratio and pedaling cadence have a direct impact on cycling economy/efficiency (Faria et al., 2005). Finally, Atkinson et al. (2003) identified the athletes’ nutritional strategy before, during, and after a race as a determinant of performance in addition to the factors mentioned so far.
In the second domain, sports physiology literature discusses numerous factors - like physical performance delivered by the athlete - that can predict the outcome of competitions or the sporting success of an athlete (Alvero-Cruz et al., 2019; Sousa et al., 2021). Race success correlates with the peak power an athlete is able to perform in a laboratory test (Faria et al., 2005). Thus, it can be noted that the outcome of a cycling race depends on power-to-weight characteristics (Gallo et al., 2021; Lee et al., 2002). Van Erp and Sanders, (2020) show that in professional cycling races maximum mean power over shorter durations (<5 min) are higher for riders who are placed in the top-10 of a race than for riders who are not in the top-10. In addition to physiological parameters, factors such as the bike mass and body mass (both are particularly important in the mountains; Jeukendrup and Martin, 2001) as well as aerodynamic components such as body position, bike frame, and wheels can affect the competition result (Faria et al., 2005; Malizia et al., 2021). Based on their machine learning approach, Kholkine et al. (2020) assume that factors such as weather, team strategy, road conditions, or mechanical failure must be included when predicting a competition result. Another study by Van Erp et al. (2021b) showed that a cyclists’ position in the peloton could be an indicator for the result of a stage at the Tour de France. In terms of para-cyclists’ performance, Wright, (2016) also found that pacing strategy improves athletes’ performance in short time trials on cycling track.
To summarize the current state of research in the two domains, we can state that the factors determining an athlete’s physical performance are widely known, particularly in cycling. In contrast, the effect of the physical performance delivered in competition on the competition result requires further research. Studies show that several factors, such as high peak power or high power-to-weight ratio, can influence the result in cycling races. However, the physical performance was mostly determined in laboratory tests with a time lag before or after the competition (e.g., Lee et al., 2002; Babault et al., 2018; Gallo et al., 2021). Thus, it is still unclear what influence the actual power output in the competition has on the competition result. Although performance in-competition data from professional athletes are increasingly available via platforms such as Strava, the quantity of available data is still insufficient to calculate the effect on the competition result (Sanders and Heijboer, 2019; Van Erp and Sanders, 2020; Westlake, 2020; Van Erp et al., 2021a).
In mixed-reality competitions the performance data, body mass and body height of all participants are recorded and published. This happens for two reasons. First, without the disclosure of the recorded data, the athlete’s avatar would not move in the virtual world, and thus the competition would not be possible. Second, the performance data of all participants are publicly available during and after the race for all participants and spectators and are even integrated into the broadcasts to increase transparency about the race result. It was shown that the relative power output in mixed-reality races in which professional athletes participated is comparable to the relative performances in professional road races (Westmattelmann et al., 2021b). Since mixed-reality races are generally shorter than road races, the performance delivered is more similar to real-world time trials or mountain top finishes (Westmattelmann et al., 2021b). Although mixed-reality competitions publish the in-competition performance data from all professional road cyclists participating in the same competition publicly, it is unclear whether critical performances over specific time periods are particularly more relevant to race results than other performance parameters.
Regarding this research gap, the aim of this study is to analyze the effect of different performance parameters and the rider’s body mass and height on the results of mixed-reality competitions. In doing so, the following research question is addressed by analyzing in-competition data and results from the six stages of the women’s and men’s Virtual Tour de France: What influence do 1) absolute power over the entire distance, 2) relative power over the entire distance, 3) maximum relative power over 20 min, 4) maximum relative power over 5 min, 5) maximum relative power over 1 min, 6) maximum relative power over 30 s, 7) maximum relative power over 15 s, 8) body mass, and 9) body height of a rider have on the competition result (measured in riding time)?
The following section describes how the performance data for the different Virtual Tour de France stages 2020 are collected and analyzed. Subsequently, the influence of the in-competition performance parameters and the rider’s body mass and height on the result is determined via polynomial regression analyses. The findings are discussed against the background of insights from sports physiology, and practical implications for the future of (mixed-reality) sports are presented. Finally, the main insights are summarized in the conclusion.
Materials and Methods
This study was approved by the Ethics Committee of the School of Business & Economics of the University of Münster, Germany. Since only publicly available data from the Zwift Power platform was used, no informed consent was required from the athletes.
Given the novelty of mixed-reality sport for exercise physiology science, we first provide an overview of the four phases of the experimental design (Figure 1).
Phase 1 marks the three weekends in July 2020 where the Virtual Tour de France took place. The performance data (absolute and relative power output) as well as body mass and height were integrated into the extensive broadcasting of the competitions and uploaded to the Zwift Power platform and are therefore publicly available, even after the end of the competitions. In phase 2, the athletes’ performance data, body mass and height, and the corresponding results of the six stages of the Virtual Tour de France for women and men were obtained using the web scraping technique from Zwift Power. In phase 3, the different performance data, body mass and height were plotted with the competition result (measured in riding time) to visualize the relationship between the determinants (power output, body mass and height) and the competition result. In Phase 4, a cubic relationship was assumed based on the visualizations in phase 3. The influence of the determinants on the competition outcome was calculated using third order polynomial regressions to account for the cubic relationship. The following subsections are organized according to each of the four phases of the experimental design.
Virtual Tour de France
In July 2020, the cycling platform Zwift and the Amaury Sport Organisation (ASO) cooperatively hosted the first “Virtual Tour de France” (Tour de France, 2020b). In this event female and male professional cyclists competed for individual stage victories and overall team classifications in a total of six stages (Tour de France, 2020a). Unlike the real Tour de France, an annual event which is also organized by the ASO, the classifications (overall, youth, mountain, and sprint) were not awarded to individual riders but to the best teams. To have the highest chance of winning, the professional road cycling teams nominated four riders for each stage who were best suited to the different stage profiles (Schlange, 2020). Thereby, no male rider was allowed to race more than three and no female rider more than four stages in total. The different stages were won by the riders who completed the route in the shortest riding time. However, the avatar’s speed and thus the competition result also depended on the use of so-called power-ups (e.g., the aero power-up makes a rider more aerodynamic for 15 s) or riding in the slipstream. The six stages of the Virtual Tour de France are partly modeled after the real Tour de France (for example, by recreating the Mont Ventoux climb or the circuit on the Champs-Élysées in Paris, Tour de France, 2020a). The route profiles of the stages of the Virtual Tour de France are illustrated in Figure 2.
FIGURE 2. Stage profiles Virtual Tour de France. Source: www.letour.fr.
Data Collection and Sample
To be able to analyze the influence of different performance indicators as well as body mass and height on the race results of the stages of the Virtual Tour de France, we scraped the performance data, body mass, body height, and the corresponding riding times from the platform Zwift Power (Ostanek, 2020). To do so, we used the web scraping tool Parsehub and scraped all six stages individually. Zwift Power collects different racing data for public races scheduled and performed on Zwift. After identifying the type of data relevant for our study, we used the .csv output file generated by Parsehub that contains the data to perform our analysis. For this study, the following data for all stages of the women’s and men’s races were retrieved: absolute power (watts) and relative power (w/kg) over the entire distance, as well as relative peak power over 20 min (w/kg_20m), 5 min (w/kg_5m), 1 min (w/kg_1m), 30 s (w/kg_30s), 15 s (w/kg_15s), body mass and height, average heart rate (HR_avg), and maximum heart rate (HR_max) over the entire race duration. Finally, riding times were retrieved, which are used as the dependent variable (competition success). Compared to the pure competition rankings, the metric scaling of the riding times accounts for differences between the ranks. The descriptive statistics of all athletes who finished the respective stage appear in the competition results and are shown in Tables 1 and 2. For each stage, the length, the elevation meters, and the winner’s time are also presented.
The winning time in the different stages of the men’s race ranged between 41 and 59 min. Accordingly, the length of the virtual stages corresponds more to that of a time trial, which in the Virtual Tour de France was 36.8 km long, and the winning time was 55 min (Sanders and Heijboer, 2019; Procyclingstats, 2020). Women raced the same distances as men in the Virtual Tour de France, with winning times ranging from 47 to 66 min.
In road cycling, riders must overcome the forces of gravity, rolling resistance, and aerodynamic drag to move forward (Kyle, 2003). In particular, at a speed above 40 kph - which is exceeded in most professional races - the interaction of these three forces causes a cubic relationship between the power to be produced by the rider and the resulting speed, respectively riding time over a defined distance (Kyle, 2003). As mixed-reality platforms like Zwift seek to simulate speed as realistically as possible, the relationship between the performance parameters considered and the race outcome was visualized to gain initial insights on whether the cubic relationship also applies to mixed-reality competitions. In Figures 3 and 4, the relationship between the relative power output (in watts per kg body weight) and the competition result (riding time in milliseconds) is shown in scatterplots for the six stages of the women and men.
FIGURE 3. Scatterplot and fitted linear, quadratic and cubic relationship of relative power output (W/kg) and result (riding time in milliseconds) for women’s stages.
FIGURE 4. Scatterplot and fitted linear, quadratic and cubic relationship of relative power output (W/kg) and result (riding time in milliseconds) for men’s stages.
Using the example of the relationship between the relative power output and the competition result, the scatterplots of the respective stages show that the cubic graph fits better to the observed data points than the linear or the quadratic graph. Furthermore, comparable cubic patterns can be seen in particular for the other power output parameters (absolute and relative) and in an attenuated form for body mass and height.
In order to address the research question raised, the influence of the relative and absolute power over the complete duration, the relative power in different time intervals, and the rider’s body mass and height on the competition result were analyzed by performing regression analysis. Initially, we selected the appropriate type of regression analysis. Performing multiple regressions based on multiple parameters is not feasible because the variables depend on each other (absolute vs. relative power output) or represent overlapping time intervals. We tested considering multiple parameters. Thereby, the variance inflation factor was above 10 for almost all variables for all stages for both women and men, which is strong evidence for multicollinearity and should therefore be avoided (Hair et al., 2016). For this reason, the regression analyses are performed separately for all parameters that are specified in the research question.
Since the scatterplots in Figures 3 and 4 indicate a cubic relationship between the parameters and the competition results, third-order polynomial regressions are performed (see Eq. 1; Stimson et al., 1978). Polynomial regressions are a type of multiple regression, and therefore the assumptions of 1) no multicollinearity, 2) independence of residuals, 3) homoscedasticity, and 4) normal distribution of residuals must be considered. Multicollinearity often occurs when polynomial regressions are performed, because the first-, second-, and third-order polynomials of the independent variable are usually highly correlated. Therefore, it is not uncommon to have standardized regression coefficients of < −1 or > +1. To attenuate the effects of multicollinearity and improve interpretability of regression results, the included variables were mean-centered before running the polynomial regressions (Dalal and Zickar, 2012). Accordingly, all regressions are based on Eq. 1:
In Eq. 1, Y is the outcome variable (here: riding time in milliseconds), x is the mean-centered independent variable (predictor; 1) absolute power over the entire distance, 2) relative power over the entire distance, 3) maximum relative power over 20 min, 4) maximum relative power over 5 min, 5) maximum relative power over 30 s, 6) maximum relative power over 15 s, 7) body mass and 8) body height of a rider). b0 is the Y-intercept of the regression surface, while b1, b2, and b3 are the estimated coefficients of the predictor’s respective polynomial (first-, second-, third-order), and e is the random error component.
While multicollinearity is addressed via the mean centering function prior to running the regression analyses, the other assumptions are tested during the regression analysis to ensure the validity of the results. With regard to the second assumption (the independence of the residuals), the autocorrelation of the residuals is determined by performing the Durbin-Watson test (Allen, 1997). The test statistic values can thereby range from 0 to 4, with 0 indicating a perfectly positive autocorrelation, 4 indicating a perfectly negative autocorrelation, and 2 indicating no autocorrelation. Since the residuals should ideally be independent in polynomial regression, a value of 2 is desirable, while values between 1 and 3 are acceptable (Field, 2013). The third assumption, homoscedasticity of the residuals, is used to verify that the regression model makes good predictions across all values. Therefore, it is checked whether the dispersion of the residuals is constant. Finally, the assumption of normal distribution of the residuals is assessed by histograms. However, potential violations of this assumption have little to no impact on the execution and interpretation of the polynomial regressions (Stimson et al., 1978; Lumley et al., 2002).
All regression analyses were performed on the basis of the data sets described in Tables 1 and 2 with IBM SPSS Statistics software version 26. The data sets include the performance data of all participants of the respective stage, since the measurement and disclosure of the performance data is a prerequisite for the execution of mixed-reality competitions. To finally obtain the best model fit, the explanatory power of the cubic polynomial regression was compared with the explanatory power of the linear and quadratic functions (Stimson et al., 1978). Compared to the cubic function, neither the linear nor the quadratic regression function achieved a higher overall explanatory power for the relative and absolute performance parameters and body mass and height. Therefore, only the results of the third-order polynomial regression analyses are reported in the following results section.
The results of the third-order polynomial regression analyses for the women’s and men’s Virtual Tour de France stages are summarized in Table 3. Compared to (multiple) linear regressions, polynomial regression analyses do not allow meaningful interpretation of the strength of individual regression coefficients since the polynomials depend on each other and can only be varied simultaneously (Stimson et al., 1978). Accordingly, polynomial regressions are to be evaluated based on global fit indices. The R2 indicates to what extent the variance of the dependent variable (competition result as riding time on the stage) can be explained by the independent variable (regressor). The p-value of the F-statistic indicates the extent to which the calculated R2 differs significantly from 0. In Table 3, regression results are highlighted in italics if they do not meet one or more of the validity criteria. This can happen if 1) the 95% significance level (p > 0.05) is not reached, 2) the Durbin-Watson test shows values <1 or >3, which would indicate autocorrelation, or on the basis of the histograms respectively the scatterplots 3) heteroscedasticity or 4) no normal distribution of the residuals is observed. Furthermore, the strongest predictors (largest R2) for each women’s and men’s stage are highlighted in bold. All other regression results (non-italics and non-bold) also meet the validity criteria. Due to limited interpretability, standardized regression coefficients, associated p-values, regression functions, and confidence intervals are presented in Supplementary Material SA, SB. The effects of regressors 1) to 9) on the competition outcome are listed per stage in the same order in Table 3 as in the research question raised.
The stages of the Virtual Tour de France have varying characteristics. Stage one was held on a hilly course. 1.5 km before the finish, the last mountain classification had to be mastered, followed by a descent to the finish. In the women’s race, the results show that relative power output over the entire stage duration (W/kg) has the greatest influence on the race result (R2 = 0.765; p < 0.001). Relative power is less important the shorter the time interval considered. In men’s race, relative power output over the entire stage (W/kg; R2 = 0.945; p < 0.001) is also the strongest predictor of success. The relative performances over time intervals of 15 s (W/kg 15s; R2 = 0.700; p < 0.001) and 30 s (W/kg 30s; R2 = 0.654; p < 0.001) also still show a high explanatory power.
Stage two led over one high mountain (>500 m of elevation gain) in the middle of the stage and ended with a flat section. In the women’s race, the relative performances over the entire distance (W/kg; R2 = 0.887; p < 0.001) and over 20 (W/kg 20m; R2 = 0.780; p < 0.001) and 5 min (W/kg 5m; R2 = 0.686; p < 0.001) are particularly decisive for the race outcome. This result indicates that it was important to get over the mountain in the first group, while in the final part of the race, sprinting skills were less critical as the peloton was split into small groups. The men’s race developed in a similar way. Here the relative performance over the entire distance (W/kg; R2 = 0.872; p < 0.001) also explains most of the success.
Stage three was an undulating course with a slightly hilly finish. In the women’s race, the strongest predictor is relative power over the entire distance (W/kg; R2 = 0.861; p < 0.001). In the men’s race, relative power over the entire duration (W/kg; R2 = 0.846; p < 0.001) is also the strongest predictor.
Stage four led over two smaller and two medium-high mountains (>100 m of elevation gain), with the last medium-high mountain about 3 km from the finish. In the women’s race, relative performance over the entire distance (W/kg; R2 = 0.882; p < 0.001) is the strongest predictor, while shorter time intervals of the relative peak power (<5 min) do not significantly affect the competition result. However, the progression in the men’s race was different. Here, a comparatively large group sprinted for the win, which is indicated by the fact that relative peak power over 30 s (W/kg 30s; R2 = 0.806; p < 0.001) is almost as critical for success as the strongest predictor relative peak power over 20 min (W/kg 20m; R2 = 0.843; p < 0.001). Notably, body mass (R2 = 0.340; p = 0.008) significantly affects competition results.
The fifth stage, after a short flat section, led up to a mountain top finish, where a difference in altitude of over 1,000 m had to be overcome. For both women (R2 = 0.983; p < 0.001) and men (R2 = 0.990; p < 0.001), relative power output over the entire distance is the strongest predictor of success due to the height of the mountain. For both genders, relative peak power over 1 minute still has a relatively high explanatory power (women: R2 = 0.618; p < 0.001; men: R2 = 0.702; p = 0.001). Overall, this is also reflected in the race progressions, which included a few accelerations on the uphill, but it was particularly important to keep relative peak power high to defend the advantage gained all the way to the finish. Remarkably, despite the mountainous course profile, body mass alone has no effect on race performance, neither in women (p = 0.512), nor in men (presence of autocorrelation due to Durbin-Watson statistic <1).
The sixth and final stage followed a slightly hilly route, with the last kilometers being flat. In the women’s race, relative peak power over the entire stage (W/kg; R2 = 0.862; p < 0.001) is the strongest predictor of success and the explanatory power decrease constantly for shorter peak power intervals. In the men’s race, a bunch sprint occurred for the win. Here, relative power over the entire stage (W/kg; R2 = 0.968; p < 0.001) is the strongest predictor for the competition results, while relative peak power interval of 1 minute (W/kg 1m; R2 = 0.528; p = 0.022) also has a notable impact.
Overall, it appears that relative power output over the entire stage can explain the highest proportion of the variance in the riding time, and thus success in the race, for both women and men. However, the explanatory power (R2) ranges between 77% and 98% for women and between 84% and 99% for men depending on the stage. Accordingly, for some stages, other performance indicators (regressors) explain success almost as well as - or in individual cases even better than (see men’s stage 4) - relative power output over the entire stage, which will be addressed in the following discussion.
Previous studies have investigated the relevance of absolute and relative power output (Gallo et al., 2021; Lee et al., 2002) and peak power (Faria et al., 2005) for success by relating performance data measured in a laboratory setting to success in outdoor competitions that are not directly related to that data. This study advances such work by focusing on mixed-reality sports platforms to analyze performance data from competition in combination with the directly corresponding competition outcome. Following the relationship between the power output generated by a rider and the resulting speed in the real world, a cubic relationship was assumed for the regression analyses performed here to model the relationship between the performance data and the riding time of the avatar over a defined distance in the virtual world. The third-order polynomial regression analyses reveal that the assumed cubic relationship also applies to mixed-reality sports.
We contribute to sports physiology science by indicating that the (relative) in-competition power output is the strongest predictor of success in mixed-reality competitions. The explanatory power of the examined power output, body mass, and body height depends on the course characteristics and the progression of the race. The role of course characteristics is indicated by the fact that for flat stages, the explanatory power of absolute and relative power output over the entire duration are comparable (e.g., stage 1 women: R2watts = 0.744 vs. R2w/kg = 0.765; stage 1 men: R2watts = 0.897 vs. R2w/kg = 0.945; similar effects are observed for stages 3 and 6). For mountain stages, the explanatory power of relative power output over the entire distance is considerably higher than for absolute power output (e.g., stage 5 women: R2watts = 0.714 vs. R2w/kg = 0.983; stage 5 men: R2watts = 0.560 vs. R2w/kg = 0.990; similarly, for stage 4). Body mass alone is a significant predictor of athletic success only on the mountain stages (e.g., stage 4 men: R2body mass = 0.340). The effect of the progression of the race on the relevance of different performance parameters for the competition’s result can be seen in the fact that on identical courses in the women’s and men’s races, different performance parameters explained success substantially differently. For example, stage 1 of the women’s race was ridden offensively, and the winner was decided in the sprint of a leading group. This shows that the relative performance over the entire distance has the strongest explanatory power (R2w/kg = 0.765) and decreases with the observation of shorter time intervals (e.g., R2w/kg 5m = 0.500), since it is initially critical for success to be in the leading group. Stage 1 of the men was decided in a mass sprint. Here, the relative power over the entire distance (R2w/kg = 0.945) also has the highest explanatory power, which initially also decreases with shorter time intervals of the relative peak power (e.g., R2w/kg 5m = 0.397), but then increases again substantially for the shortest time intervals considered (R2w/kg 30s = 0.654; R2w/kg 15s = 0.700), which might represent the critical performance in the mass sprint. Body height cannot explain the success in any stage. This result is notable because participants are required to report their height and current body mass prior to the competition. Furthermore, the scatterplots and the polynomial regressions showed that in mixed-reality competitions, there is a cubic relationship between power output and speed (riding time over a defined distance). The cubic relationship is also present in the real world, where aerodynamic drag is especially important at high speeds, in addition to gravity and rolling resistance (Kyle, 2003). Although the body height of the athletes is considered in mixed-reality competitions, we could not observe a significant influence on the competition result. The frontal area of the bike on which the power output is performed on the smart trainer in the real world has no effect on the competition result. The reason for this is that the athletes cannot provide any information on this before the competition, and therefore it cannot be taken into account by the algorithms of the mixed-reality sports platform. In summary, on the one hand, relative power over a long period of time is especially critical for success when long mountains have to be overcome on the course or when the race is ridden very offensively, and the peloton is divided into many small groups of riders. On the other hand, relative peak power over a shorter period of time is increasingly relevant with shorter climbs, especially towards the end of the race or when the race is decided in a bunch sprint.
We can conclude that the power profile of a mixed-reality cycling race is more complex compared to the rather constant power profiles that are delivered during ergometer tests in a laboratory setting over a specific period of time (e.g., 30 min; Lee et al., 2002) or in a time trial (e.g., Padilla et al., 2000). Understanding the power profile and its effect on competition outcome allows the transfer of insights from studies. Internal factors such as specific training programs or altitude training can be used to improve athletes’ critical performance parameters (e.g., Jeukendrup and Martin, 2001). In addition, factors such as seating position (Hansen and Waldeland, 2008), gear ratio and pedaling cadence that affect the cycling economy (Faria et al., 2005), or nutritional strategy (Atkinson et al., 2003) may increase relevant performance in mixed-reality competition. The findings from the area of indoor cycling are also relevant, where in particular an effect of starting strategy (Mattern et al., 2001), time of day (Atkinson et al., 2005), warm-up (Munro et al., 2017), recovery duration (Glaister et al., 2005), and precooling strategy (Quod et al., 2008) on athletic performance has been proven. It has already been demonstrated that the starting strategy in mixed-reality races is considerably different from that in road races, as the effort at the beginning of the race is significantly higher in mixed-reality races (Westmattelmann et al., 2021a).
This study revealed that specific performance parameters (power output) are the main predictors of success in mixed-reality competitions. Nevertheless, success in any stage of the Virtual Tour de France could not be explained entirely by only one of the performance parameters, resulting in a performance-result gap, which varies in size between 1% and 23% (Figure 5), depending on the characteristics of the course and the progression of the race. At first, it might be relevant at which point in time during the race the athletes produce certain power outputs. For example, a higher power output towards the end of the race or on a mountain could be more critical for success than if the power is produced at the beginning of the race or on a downhill segment. Furthermore, different external parameters could also explain the performance-result gap. Weight and aerodynamics of the real-world bike and the rider’s position are not directly relevant because the rider produces the performance while stationary on a smart trainer. In contrast, as aerodynamics, including slipstream effects, are simulated, and different virtual bikes and corresponding equipment have different weight and aerodynamic characteristics that resemble the real-world equipment, these can affect the competition result. Simulated slipstream effects, team strategy, or positioning in the peloton might also explain the performance-result gap (Van Erp et al., 2021b). It has to be mentioned that the rider’s positioning in the virtual race depends mostly on the rider’s power output because steering skills are not considered. Still, due to the absence of consideration of steering skills in mixed-reality competitions, such skills are likely to be less relevant for the competition result than in road races. While mechanical failures can be critical in road races, the risk of mechanical failures is much lower in mixed-reality competitions. For example, while the chain can still break, a punctured tire has no impact on the race. However, technical failures in the form of an unstable internet connection or crashed software are potential race-deciding aspects in mixed-reality competitions (Lazzari et al., 2020). Finally, the performance-result gap can also be explained by specific knowledge about the mixed-reality sports platform. A corresponding example from the Zwift platform represents the effective use of power-ups, which make the athlete lighter, more aerodynamic, or invisible to competitors for a certain period of time and can thus be crucial for the race result (Westmattelmann et al., 2021a).
Based on our findings, recommendations for action can be offered to different stakeholder groups in sports. First, our results provide athletes and coaches insights into which performance parameters are critical for success in mixed-reality sports. Accordingly, training programs can be tailored to prepare athletes to deliver the relevant critical performances. Since we provide extensive knowledge on the performance profile of mixed-reality competitions, they can also be systematically integrated into the preparation for real-world competitions to train in a competition-specific manner. This is especially beneficial when no competitions occur in the off-season, when factors such as unfavorable weather at outdoor competitions pose an increased risk of injury or infection.
Our analysis offers coaches, sports directors, or team managers a solid basis for decision-making to deploy athletes in mixed-reality competitions according to the respective course characteristics. For example, performance data from training or performance diagnostics can be compared with the required performance profiles of mixed-reality competitions. This allows the selection of athletes whose performance best matches the required performance profile. Furthermore, due to the wide availability of performance data, mixed-reality competitions offer scouts the opportunity to track and recruit athletes based on their performance and body composition.
For mixed-reality sports platform operators, it is valuable to show that physical performance is the most critical factor for success in the competitions on their platform. Accordingly, mixed-reality sports competitions host relevant and authentic athletic performances, which underlines the relevance of mixed-reality events. It is also notable that success is not solely determined by physical performance in mixed-reality competitions. This uncertainty about the outcome of the competition makes mixed-reality competitions interesting for spectators.
Finally, we provide key insights for sports federations on how to deal with mixed-reality sports platforms. Mixed-reality-specific factors might explain the performance-result gap, which raises the question for sports federations as to the extent to which they want to establish mixed-reality competitions as an independent discipline. In this regard, federations need to decide whether the mixed-reality events should be as realistic as possible or whether they deliberately enhance the presence of gamified elements and mechanisms. While discipline-specific skills and strategies are relevant in traditional cycling disciplines such as road cycling, mountain-biking, cyclo-cross, or track racing, the same is true for mixed-reality cycling. Mixed-reality cycling has the potential to introduce people with an affinity for esports to traditional sports and thus represents a promising extension to appeal to a younger target group. At the same time, it will be interesting to observe whether professionals from road cycling will also participate in mixed-reality cycling competitions in the future, as in the Virtual Tour de France, 2020b, or whether a separate professional scene will emerge.
Limitations and Further Research
There are potential limitations to this study. First, the stages of the Virtual Tour de France were conducted in a decentralized manner due to the Covid-19 pandemic, so the organizers could not monitor the calibration of the smart trainer or the body mass and height of the riders to the same extent as that would have been the case if the event had been conducted in a centralized manner. In their analysis of the Virtual Tour de France, Westmattelmann et al. (2021b) showed that the performance data correspond to the performances of real-world professional races but also observes suspicious weight data. However, since minor differences in performance can be decisive for victory and defeat in competitive sports, the reliability of the measured data needs to be considered further so that, for example, incorrect calibrations of the smart trainer or power meter can be identified.
Second, the ability to apply the insights gained here from mixed-reality cycling races to road races might be limited. First, the competition mode differs, as the Virtual Tour de France, 2020b featured a team competition, which could affect the team strategy. In contrast, the Tour de France is decided based on the total riding time in an individual classification. In addition, in the Virtual Tour de France, 2020b, riders were swapped between stages, and the starting fields were also much smaller than in the real-world Tour de France. Nevertheless, the athlete population considered in this study is representative of the real-world Tour de France as the men’s Virtual Tour de France included almost exclusively professional teams and riders who also compete in the real Tour de France, while the women’s Virtual Tour de France included professionals.
Although essential aspects from the real world were simulated in the Virtual Tour de France such as elevation profiles and slipstream effect, there are still differences, such as power-ups or the non-inclusion of steering skills, as the avatar steers autonomously. Furthermore, some environmental factors, such as rain or crosswinds, which can affect the race result in road races, are not considered in mixed-reality races. Follow-up studies should aim to investigate the extent to which gamification elements such as power-ups are used strategically by riders and what effect the power-up use has on competition outcomes and thus explain the performance-result gap. In view of the increasing importance of mixed-reality sports, the results presented here nevertheless show a high relevance for sports physiology science (see Crivoi do Carmo et al., 2022). The wide-ranging data transparency required to conduct mixed-reality sport goes far beyond the data available from traditional sport and thus offers plenty of new research opportunities. Since the power output data of all participants of the Virtual Tour de France are available and were considered in this study, the effect of race performance data on the competition result can be quantified for the first time.
This study shows that success in mixed-reality sports particularly depends on athletes’ physical performance. Exemplified by the performance data, rider’s body mass and height as well as results of the Virtual Tour de France, we demonstrate that the relative power output over the entire stage distance explains most of the variance in the riding time. However, depending on the course characteristics and the race progression, the explanatory power of the considered performance indicators differs in some cases considerably. While relative peak performances over a long period are most critical for success in races with longer climbs, relative peak performances of around 1 minute are critical for success in flatter races ending in a bunch sprint. The relevance of the relative performance indicates that the body mass of the riders has some importance. However, body mass alone can predict race success only occasionally when a mountain lies just before the finish. There was no significant effect for body height on the race result observed.
Overall, insights from sports science can be applied to train specific critical performances (e.g., Jeukendrup and Martin, 2001). Although physical performance is the main predictor of success in mixed-reality sport, a performance-result gap is identified. Depending on the stage, a maximum of 77%–98% of success in women and 84%–99% in men could be explained by performance indicators. Future studies will have to quantify which additional factors can explain this gap. Due to the complete availability of in-competition performance data, mixed-reality offers new research opportunities for sports science. Furthermore, mixed-reality sport has the potential to attract new audiences as an individual discipline and to complement traditional sport.
Data Availability Statement
The datasets analyzed for this study will be made available by the authors upon request.
The studies involving human participants were reviewed and approved by Ethics Committee of the School of Business and Economics of the University of Münster, Germany. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.
DW: Conceptualization, methodology, analysis, writing, project administration. BS: Conceptualization, writing, visualization. MS: Methodology, writing. J-GG: Conceptualization, writing. GS: Supervision, writing.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphys.2022.868902/full#supplementary-material
Alvero-Cruz J. R., Parent Mathias V., Garcia Romero J., Carrillo de Albornoz-Gil M., Benítez-Porres J., Ordoñez F. J., et al. (2019). Prediction of Performance in a Short Trail Running Race: The Role of Body Composition. Front. Physiol. 10, 1306. doi:10.3389/fphys.2019.01306
Babault N., Poisson M., Cimadoro G., Cometti C., Païzis C. (2018). Performance Determinants of Fixed Gear Cycling during Criteriums. Eur. J. Sport Sci. 18 (9), 1199–1207. doi:10.1080/17461391.2018.1484177
Crivoi do Carmo E., Renfree A., Vieira C. Y. N., Ferreira D. D. S., Truffi G. A., Barroso R. (2022). Effects of Different Goal Orientations and Virtual Opponents Performance Level on Pacing Strategy and Performance in Cycling Time Trials. Eur. J. Sport Sci. 22 (4), 491–498. doi:10.1080/17461391.2021.1880645
Cyclingnews (2020). Virtual Tour de France 2020. Available at: https://www.cyclingnews.com/races/virtual-tour-de-france-2020/ (Accessed February 28, 2022).
Dalal D. K., Zickar M. J. (2012). Some Common Myths about Centering Predictor Variables in Moderated Multiple Regression and Polynomial Regression. Organizational Res. Methods. 15 (3), 339–362. doi:10.1177/1094428111430540
Delaney B., Bromley S. (2020). Zwift: Your Complete Guide: What Zwift Is, How it Works and what Equipment You Need. Available at: https://www.bikeradar.com/features/zwift-your-complete-guide/ (Accessed February 01, 2022).
Gallo G., Filipas L., Tornaghi M., Garbin M., Codella R., Lovecchio N., et al. (2021). Thresholds Power Profiles and Performance in Youth Road Cycling. Int. J. Sports Physiol. Perform. 16 (7), 1049–1051. doi:10.1123/ijspp.2020-0579
Glaister M., Stone M. H., Stewart A. M., Hughes M., Moir G. L. (2005). The Influence of Recovery Duration on Multiple Sprint Cycling Performance. J. Strength Conditioning Res. 19 (4), 831–837. doi:10.1519/00124278-200511000-00018
Kholkine L., Schepper T., Verdonck T., Latré S. (2020). “A Machine Learning Approach for Road Cycling Race Performance Prediction,” in Machine Learning and Data Mining for Sports Analytics. MLSA 2020. Communications in Computer and Information Science. Editors U. Brefeld, J. Davis, J. Van Haaren, and A. Zimmermann (Heidelberg: Springer), 103–112. doi:10.1007/978-3-030-64912-8_9
Lee H., Martin D. T., Anson J. M., Grundy D., Hahn A. G. (2002). Physiological Characteristics of Successful Mountain Bikers and Professional Road Cyclists. J. Sports Sci. 20, 121001–121008. doi:10.1080/026404102321011760
Lumley T., Diehr P., Emerson S., Chen L. (2002). The Importance of the Normality assumption in Large Public Health Data Sets. Annu. Rev. Public Health. 23 (1), 151–169. doi:10.1146/annurev.publhealth.23.100901.140546
Munro L. A., Stannard S. R., Fink P. W., Foskett A. (2017). Potentiation of Sprint Cycling Performance: The Effects of a High-Inertia Ergometer Warm-Up. J. Sports Sci. 35 (14), 1442–1450. doi:10.1080/02640414.2016.1215492
Ostanek D. (2020). ZwiftPower: Your Ultimate Guide. Available at: https://www.cyclingnews.com/features/zwiftpower-your-ultimate-guide/ (Accessed February 01, 2022).
Padilla S., Mujika I. I., Orba?Anos J., Angulo F. (2000). Exercise Intensity during Competition Time Trials in Professional Road Cycling. Med. Sci. Sports Exerc. 32 (4), 850–856. doi:10.1097/00005768-200004000-00019
Procyclingstats (2020). 107th Tour de France. Available at: https://www.procyclingstats.com/race.php/tour-de-france/2020/stage-1?id1=tour-de-france&id2=2020&id3=stage-1 (Accessed February 01, 2022).
Quod M. J., Martin D. T., Laursen P. B., Gardner A. S., Halson S. L., Marino F. E., et al. (2008). Practical Precooling: Effect on Cycling Time Trial Performance in Warm Conditions. J. Sports Sci. 26 (14), 1477–1487. doi:10.1080/02640410802298268
Schlange E. (2020). All about the Virtual Tour de France on Zwift. Available at: https://zwiftinsider.com/vtdf/ (Accessed February 01, 2022).
Sousa C. V., Aguiar S., Olher R. R., Cunha R., Nikolaidis P. T., Villiger E., et al. (2021). What Is the Best Discipline to Predict Overall Triathlon Performance? an Analysis of Sprint, Olympic, Ironman 70.3, and Ironman 140.6. Front. Physiol. 12, 654552. doi:10.3389/fphys.2021.654552
Tour de France (2020b). Virtual Tour de France on Zwift. Available at: https://www.letour.fr/en/virtual-tour-de-france (Accessed February 01, 2022).
Tour de France (2020a). Stage Profiles. Available at: https://www.letour.fr/en/virtual-tour-de-france/stages-profiles (Accessed February 01, 2022).
UCI (2020). Zwift UCI Cycling Esports World Championships: Moolman-Pasio and Osborne Make History in Watopia. Available at: https://www.uci.org/cycling-esports/overview/2020/moolman-pasio-and-osborne-make-history-in-watopia (Accessed September 20, 2021).
Van Erp T., Kittel M., Lamberts R. P. (2021a). Demands of the Tour de France: A Case Study of a World-Class Sprinter (Part I). Int. J. Sports Physiol. Perform. 16, 1363–1370. doi:10.1123/ijspp.2020-0700
Van Erp T., Kittel M., Lamberts R. P. (2021b). Sprint Tactics in the Tour de France: A Case Study of a World-Class Sprinter (Part II). Int. J. Sports Physiol. Perform. 16, 1371–1377. doi:10.1123/ijspp.2020-0701
Westmattelmann D., Grotenhermen J.-G., Sprenger M., Rand W., Schewe G. (2021a). Apart We Ride Together: The Motivations behind Users of Mixed-Reality Sports. J. Business Res. 134, 316–328. doi:10.1016/j.jbusres.2021.05.044
Westmattelmann D., Grotenhermen J.-G., Sprenger M., Schewe G. (2021b). The Show Must Go on - Virtualisation of Sport Events during the COVID-19 Pandemic. Eur. J. Inf. Syst. 30 (2), 119–136. doi:10.1080/0960085x.2020.1850186
Westmattelmann D., Grotenhermen J.-G., Stoffers B., Schewe G. (2021c). “Exploring the Adoption of Mixed-Reality Sport Platforms: A Qualitative Study on ZWIFT,” in Proceedings of the 29th European Conference on Information Systems (Marrakech, Virtual), 1–18.
Keywords: mixed-reality, cycling, result prediction, power output, elite athletes, competition
Citation: Westmattelmann D, Stoffers B, Sprenger M, Grotenhermen J-G and Schewe G (2022) The Performance-Result Gap in Mixed-Reality Cycling – Evidence From the Virtual Tour de France 2020 on Zwift. Front. Physiol. 13:868902. doi: 10.3389/fphys.2022.868902
Received: 03 February 2022; Accepted: 11 April 2022;
Published: 13 May 2022.
Edited by:Leonardo Alexandre Peyré-Tartaruga, Federal University of Rio Grande do Sul, Brazil
Reviewed by:Luca Paolo Ardigò, University of Verona, Italy
Fábio Juner Lanferdini, Federal University of Santa Catarina, Brazil
Copyright © 2022 Westmattelmann, Stoffers, Sprenger, Grotenhermen and Schewe. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Daniel Westmattelmann, firstname.lastname@example.org