The Critical Power Model as a Potential Tool for Anti-doping

Puchowicz, Michael J.; Mizelman, Eliran; Yogev, Assaf; Koehle, Michael S.; Townsend, Nathan E.; Clarke, David C.

doi:10.3389/fphys.2018.00643

REVIEW article

Front. Physiol., 06 June 2018

Sec. Integrative Physiology

Volume 9 - 2018 | https://doi.org/10.3389/fphys.2018.00643

The Critical Power Model as a Potential Tool for Anti-doping

1. Department of Health Services, Arizona State University, Tempe, AZ, United States
2. Department of Biomedical Physiology and Kinesiology and Sports Analytics Group, Simon Fraser University, Burnaby, BC, Canada
3. School of Kinesiology, The University of British Columbia, Vancouver, BC, Canada
4. Division of Sport and Exercise Medicine, The University of British Columbia, Vancouver, BC, Canada
5. Athlete Health and Performance Research Centre, Aspetar Orthopaedic and Sports Medicine Hospital, Doha, Qatar
6. Canadian Sport Institute Pacific, Victoria, BC, Canada

Abstract

Existing doping detection strategies rely on direct and indirect biochemical measurement methods focused on detecting banned substances, their metabolites, or biomarkers related to their use. However, the goal of doping is to improve performance, and yet evidence from performance data is not considered by these strategies. The emergence of portable sensors for measuring exercise intensities and of player tracking technologies may enable the widespread collection of performance data. How these data should be used for doping detection is an open question. Herein, we review the basis by which performance models could be used for doping detection, followed by critically reviewing the potential of the critical power (CP) model as a prototypical performance model that could be used in this regard. Performance models are mathematical representations of performance data specific to the athlete. Some models feature parameters with physiological interpretations, changes to which may provide clues regarding the specific doping method. The CP model is a simple model of the power-duration curve and features two physiologically interpretable parameters, CP and W′. We argue that the CP model could be useful for doping detection mainly based on the predictable sensitivities of its parameters to ergogenic aids and other performance-enhancing interventions. However, our argument is counterbalanced by the existence of important limitations and unresolved questions that need to be addressed before the model is used for doping detection. We conclude by providing a simple worked example showing how it could be used and propose recommendations for its implementation.

Introduction

Athletes have long used exogenous substances to enhance performance for personal gain (McHugh et al., 2005). In the past half-century, sporting federations created regulations and developed testing programs to detect and discourage this doping to create a level playing field. However, inconsistencies existed in how doping was addressed across sporting disciplines and regions; the World Anti-Doping Agency (WADA) was created in 1999 to harmonize these processes (Ljungqvist, 2014). The original strategy of doping detection was to detect evidence of banned substances by assaying biological fluids for illicit substances or their metabolites. While such direct detection methods have advantages, they are limited in important ways, especially for substances that are synthetic versions of naturally occurring endogenous hormones such as growth hormone (McHugh et al., 2005) and erythropoietin (EPO) (Pascual et al., 2004). Recently, indirect detection methods, which test for the biological effects of the substance rather than the substance itself, have shown both promise and limitations for detecting blood doping and exogenous EPO use. Accordingly, new approaches to anti-doping are needed. One strategy is to infer doping based on performance per se, which is sensible given that the within-subject coefficient of variation in performance is relatively low for elite athletes (Malcata and Hopkins, 2014), and ultimately, performance is the outcome that athletes are attempting to manipulate.

Herein, we review the potential for, and challenges of, applying the critical power (CP) model to anti-doping. The review is divided into five main sections. First, we discuss current biomarker-based doping control practices and their limitations. We then discuss in general terms the potential of performance-based markers as an additional class of evidence within indirect detection methods. We then narrow our focus to the CP model and describe in detail its basis and potential for doping control, followed by a detailed critical appraisal of its properties, which collectively show the model’s promise, limitations, and unanswered questions for this application. We conclude by offering guidelines for its implementation, recommending future research, and providing a simple worked example of its implementation. We also briefly review an extension of the CP model, the W′_bal model, which offers unique insight into performance during intermittent tasks. Because the CP model has yet to be scientifically evaluated in the context of doping control, our arguments are theoretical and supported by indirect evidence. We thus intend for the review to serve as a catalyst for discussion and to guide future studies in this area. Overall, we posit that the model holds promise for anti-doping, but gaps in knowledge and issues with the model must first be resolved.

Existing Doping Control Practices

The WADA Code: A Brief Overview

The first comprehensive list of prohibited substances (the World Anti-Doping Code) was released in 2004. Broadly, violations are grouped into substances and methods (World Anti-Doping Agency, 2017b): Substances are classified as (i) anabolic agents, (ii) peptide hormones, growth factors and related substances, (iii) β₂ adrenergic receptor agonists, (iv) hormone and metabolic modulators, and (v) diuretics and masking agents. Additional substances that have not been approved for human therapeutic use are also prohibited, even if they are not listed. Three classes of prohibited methods exist: manipulation of blood and blood components, chemical and physical manipulation, and gene doping. According to WADA, a substance or method is prohibited if it meets two of the following three criteria: (1) it has the potential to enhance sport performance, (2) it represents a health risk to the athletes, or (3) it violates the spirit of sport (World Anti-Doping Agency, 2015). To enforce the Code, WADA conducts testing to detect doping, and the testing is either direct or indirect. Direct testing refers to the detection of a prohibited substance in a biological matrix such as blood or urine (Vernec, 2014). Indirect methods seek to detect the biological effects of doping rather than the substance or method itself; indirect methods have also demonstrated success by leading to sanctions in the absence of an adverse analytical finding (Vernec, 2014).

Challenges With Enforcing the Code: Direct Detection

Directly detecting prohibited substances in athletes is challenging as demonstrated by the many athletes who passed doping tests throughout their careers only to belatedly confess to doping once retired (Vernec, 2014). Doping prevalence is estimated to be 14–39% of athletes, which far exceeds the 1–2% of annual sanctions for doping (de Hon et al., 2014). These estimates support the possibility that athletes may be successfully exploiting the time lag between the act of doping and its resultant detection window and the delayed but persistent performance benefit.

One example of a substance that is challenging to detect using direct methods is erythropoietin (EPO), a naturally occurring hormone that stimulates production of red blood cells by the bone marrow. Recombinant human EPO (rhEPO) was developed to treat anemia in clinical populations but has subsequently been used as an ergogenic aid due to its ability to increase hemoglobin mass, and hence oxygen carrying capacity of the blood (Clark et al., 2017). Despite its widespread use in the past (Thevis et al., 2017), its direct detection remains challenging because it is an analog of a naturally occurring substance in the body, it features high interindividual variability in athletes, and its levels change in response to various natural factors, including health, training load, altitude and even sleep apnea (Pascual et al., 2004).

For doping purposes, athletes may administer short-half-life rhEPO intravenously in more frequent smaller (‘micro’) doses than are used clinically. When taken in this manner, rhEPO is rapidly eliminated (Martin et al., 2016), such that a dose taken at night may be eliminated before the athlete is tested the following morning. Furthermore, rhEPO is typically used in the period prior to the competition because it has cumulative effects on hemoglobin mass that persist over the course of weeks (Clark et al., 2017), with the performance benefits possibly lasting longer. Therefore, the majority of EPO detection must occur through out-of-competition (OOC) testing, which consists of tests administered throughout the year, and which is logistically cumbersome in low-resource regions. Athletes have also employed strategies to mask the doping agent and its detectable metabolites, such as hyperhydration (Martin et al., 2016) or treatment of urine with proteases (Lamon et al., 2007), to dilute or degrade the compound prior to testing. Athletes are also permitted to miss two doping tests per year without triggering a sanction, which creates an additional loophole to circumvent OOC testing.

Challenges With Enforcing the Code: Indirect Detection

The limitations of direct testing motivated the development of alternative indirect testing methods. The Athlete Biological Passport (ABP) is an example of an indirect testing method that has been effective in doping detection. It consists of two modules: hematological and steroidal. In the case of rhEPO detection and blood doping, the hematological profile is used to monitor several blood biomarkers known to be sensitive to blood manipulation (Sottas et al., 2006). By shifting from the detection of the stimulus to its hematological effects, the detection window is broadened and thus likely to better cover the period of performance enhancement. Furthermore, the ABP features a Bayesian model to determine an individualized expected range of normal values, which is updated over time based on the trends observed from longitudinal testing. Subsequent tests are compared to these ranges, and significant intra-individual deviations outside the individual’s normal range are flagged. This method improves the sensitivity and specificity of detection compared to using population norms. With this strategy, the ABP can be used to prompt direct targeted testing of athletes and to serve as evidence for establishing “Use” in pursuing a doping violation without having directly detected a prohibited substance or method (World Anti-Doping Agency, 2015).

The ABP features several limitations. First, it exhibits a lack of sensitivity to micro-dose rhEPO regimens that can raise hemoglobin mass by as much as 10% (Ashenden et al., 2011). Second, hemoglobin is measured as a concentration, such that the ABP can be subverted by hyperhydration (Bejder et al., 2016) and is compromised by natural plasma volume expansion during periods of heavy exercise load such as cycling grand tours (Corsetti et al., 2012). Third, concerns have been raised about the sensitivity, validity, and fairness of sanctions resulting from the ABP. Specifically, perturbations other than doping, such as altitude training, medications, bleeding ulcers, and bleeding hemorrhoids, can each cause blood parameter irregularities that could confound the ABP (Hailey, 2011). Lastly, the process by which the expert panel reviews suspect ABP results has been claimed to lack objectivity and transparency (Hailey, 2011). Additional strategies for doping detection are therefore sought.

Performance as a Marker for Doping Detection

Since the primary goal of doping is to enhance performance, raw performance data, profiles, or derived metrics could serve as indirect markers of doping (Schumacher and Pottgiesser, 2009; Hopker et al., 2016). Indeed, the effectiveness of doping for enhancing performance has been shown by retrospective studies of professional cycling, which reported a period of rapid improvement in individual and group race speeds among top 10 finishers following the introduction of rHuEPO in the late 1980s (El Helou et al., 2010; Perneger, 2010; Lodewijkx and Brouwer, 2011) and a subsequent decline after 2004 as anti-doping efforts intensified (Perneger, 2010). Blood data from 2001 to 2009 corroborates suspected changes in doping behavior as elevated rates of abnormally high reticulocyte counts dropped after the introduction of the rHuEPO test in 2002, and the subsequent elevation of rates of abnormally low reticulocyte counts fell with the implementation of the ABP in 2008 (Zorzoli and Rossi, 2010). Similarly, improvements in group mean 5 and 10K running race speeds for the top 10, 20, and 40 performers, and the prevalence of “elite” and world-record individual performances have stagnated since 2005, coinciding with improved rHuEPO detection (Kruse et al., 2014). Hence, changes in performance coincided with trends in doping practices during these periods, such that performance may therefore serve as a marker for detecting doping.

Performance markers of doping offer several complementary advantages to biomarkers. First, performance enhancement manifests at the time of competition, whereas biomarkers may only be detectable in the weeks and months prior to competition when doping agents and methods tend to be used (USADA, 2012). Second, performance markers should be insensitive to practices used to subvert biologic detection protocols such as micro-dosing (Ashenden et al., 2011) and hyper-hydration masking (Russell et al., 2002), thus improving the sensitivity of testing. Third, statistical techniques for assessing time series data are well established (Shumway and Stoffer, 2017) and could be used along with data regarding typical errors of elite athlete performance, which tend to be relatively low compared to those of biomarkers (Hopkins et al., 2001; Bagger et al., 2003; Malcata and Hopkins, 2014). Hence, the underlying framework for an anti-doping performance test already exists, such that future developments in analytical approaches should be reasonably straightforward.

The feasibility of using performance markers for doping detection is clearest for sports such as track and field, weight lifting, and swimming in which the competition settings are relatively standardized, the outcome is a discrete, objective measurement of distance covered, mass lifted, or time achieved, and the athlete’s proficiency is highly correlated with specific physiological characteristics modifiable by doping agents. The relative standardization of the competition settings help minimize within-athlete variability (Malcata and Hopkins, 2014), such that results across competitions are directly comparable, and observed improvements in performance are likely due to improved physical capacity.

It is less evident how performance markers could be established for most other sports because the competition settings are less standardized and athlete physical capacity may not be the primary determinant of performance. For example, it would be less straightforward to detect suspicious performance of a soccer player. This gap may be addressable owing to the advent of player tracking technologies in which video systems or portable sensors are used to quantify player movements (Barris and Button, 2008; Aughey, 2011). From the changes in a player’s position over time, velocities and accelerations can be calculated (Aughey, 2011). In cycling, bicycle-mounted power meters enable the direct measurement of rider work intensity. The power or velocity data for each athlete can be summarized as a “mean maximal power (MMP) profile” or “record power profile” (Quod et al., 2010; Pinot and Grappe, 2011) or, equivalently, a mean maximal velocity profile (Delaney et al., 2015; Roecker et al., 2017). These profiles are predictive of future performances (Quod et al., 2010) and evolve as the athlete develops over time (Pinot and Grappe, 2015), such that unrealistic increases in the powers sustainable for the indicated durations could serve as evidence for doping.

Performance data nevertheless feature important limitations. The primary limitation is that performance data indicate what the athlete did rather than what they were capable of doing. Factors such as pacing, tactics, periodization, health, and environmental conditions will inevitably confound performance data. Another disadvantage is access to performance data. At the present time, athletes are not required to share their physiological or performance data, such that these data must be extracted from publicly available sources, which may be insufficient in terms of quality and quantity. The demand for data is particularly burdensome for generating an athlete’s MMP profile. Raw power data are needed for all workouts and competitions within the time frame of interest to ensure that the relevant best performances are captured (Quod et al., 2010; Pinot and Grappe, 2011, 2015). In addition, individual MMP data points do not predict performance at other durations. As a result, MMP profiles must feature sufficient sampling across all durations that may be of interest. Otherwise, comparisons cannot be made if future performances happen to occur for durations not already captured in the profile. Likewise, the MMP profile neither leverages neighboring MMP data points to reduce prediction errors nor features prediction intervals. Basing doping detection thresholds on MMP data alone would thus require population averages of performance variability, which would be wide compared to individualized prediction intervals. MMP data should therefore be supplemented with methods to interpolate performance at durations not included in the profile itself and to individualize the uncertainty estimates to the athlete being tested.

Performance Models in Doping Detection

Performance models are mathematical representations of performance data and are useful for integrating data, inferring mechanistic parameters, and for predicting future performance. Examples of performance models include the CP model, which models the power-duration relationship, and the impulse-response model, which models the time course of performance as a function of daily training (Clarke and Skiba, 2013). The use of models may help to overcome the limitations of performance data, profiles, and simple metrics. In particular, performance models enable one to interpolate performances for values of the independent variable that were not originally tested. For example, the CP model reduces MMP data points to two parameters, CP and W′, which can then be used to predict performance for any duration within its domain of validity (Morton, 2006).

Importantly, metrics derived from performance models should in principle conform to the WADA code. According to WADA, “the ABP can be used to establish ‘Use’ per Code article 2.2 without necessarily relying on the detection of a particular Prohibited Substance or Prohibited Method” (World Anti-Doping Agency, 2017a). Additionally, the ABP is not specific to particular markers because both hematological (Sottas et al., 2009) and steroid profiles (Sottas et al., 2010) are now in routine use. Therefore, it is reasonable to suggest that indirect detection by athlete profiling is a general method and that performance-based markers should be acceptable under the WADA code. As such, performance metrics could form the basis of an athlete performance profile. The performance profile could then be used in a manner similar to the biological profiles in order to identify and target athletes for specific analytical testing, to pursue anti-doping rule violations in accordance with Article 2.2, to corroborate other analytical or non-analytical evidence (Saugy et al., 2014), or to monitor group prevalence (Sottas et al., 2011).

Like the WADA code for indirect testing, the Bayesian model underpinning the ABP is also general (Sottas et al., 2009, 2010). Detrended performance metrics could therefore be used as inputs to the Bayesian model and updated longitudinally at regular intervals to generate prediction intervals for the model parameters and its outputs. As one potential scenario, performance metrics could be combined with biological parameters similar to the OFF-hr score, which combines the concentrations of hemoglobin and % reticulocytes into a single score (Gore and Parisotto, 2003), or the abnormal blood profile (ABPS) score, which consists of seven hematologic parameters [red blood cell count, hemoglobin, hematocrit, mean corpuscular (MC) volume, MC hemoglobin, MC hemoglobin concentration, % reticulocytes] (Sottas et al., 2006). Alternatively, the Bayesian model of the ABP could be expanded for multiple lines of evidence. The current form of the Bayesian model features two variables, D and M, in which D is a binary variable that represents the state (doped or not doped) and M is a continuous variable that represents the biomarker. The causal relationship is specified as follows (Sottas et al., 2009):

According to Bayes’ theorem, the model could be expanded as follows for multiple lines of evidence to find the probability of doping given both a biomarker (M_B) and a performance marker (M_P)

Numerous models of the power (or velocity)-duration profile (“PD models”) have been proposed and are reviewed in detail elsewhere (e.g., Billat et al., 1999). Compared to other PD models, the CP model features several advantages for anti-doping applications. First, the CP model is the most extensively studied PD model (Morton, 2006; Poole et al., 2016) and has been validated for use with individual athlete data collected both in the lab and field (Skiba et al., 2014a; Karsten et al., 2015). Most other PD models have just been applied to world-record data (Billat et al., 1999). The CP model is also among the most parsimonious of the PD models, featuring just two adjustable parameters. Models with fewer parameters require less data for fitting. In the case of the CP model, only two performances at different durations are minimally required to estimate the model parameters. Finally, the CP model parameters are physiologically interpretable, and they change in predictable manners in response to physiological, nutritional, and ergogenic interventions. This feature is useful for doping detection because the parameters will change in a manner consistent with the mechanism of the doping method, which may thus provide insight into which doping method was used. In the remainder of the review, we discuss the suitability of the CP model for use in doping control.

The CP Model: Basis, Physiology, and Current Applications

Definition of the Model

A conserved hyperbolic relationship exists between maximally sustainable power output and duration (Figure 1). This relationship was first observed by Hill (1925) for world-record performances, followed by Monod and Scherrer (1965) for performance of isolated muscle groups (Monod and Scherrer, 1965), and then by Moritani et al. (1981) for whole-body exercise. Monod and Scherrer codified the CP concept into a two-parameter mathematical model:

FIGURE 1

in which t_lim is the time to exhaustion, P is the power output during task performance, b is the asymptote of the curve, and a is curvature constant of the curve (Monod and Scherrer, 1965). Both a and b have physiological interpretations: b was called the “CP” and represents the power output that is sustainable for a very long time without fatigue (theoretically infinite time), and a is the total work that can be performed at intensities above CP.

The equation was subsequently restated by Moritani et al. (1981) in a linearized form:

in which P = power output, t = time to exhaustion, CP is critical power (same as b in equation 1), and W′ is the work that can be performed above CP (same as a in equation 1) (Figure 1). Another common approach to expressing the CP model is to relate the total mechanical work done to CP and W′. This equation is also linear:

Due to the difficulty of directly measuring mechanical power output for many exercise modalities, velocity is often substituted for power. The resulting critical velocity model features analogous parameters to those of the CP model: critical velocity (CV, units of distance over time) is used in place of CP and D′ in place of W′. D′ represents the distance that can be covered at intensities above CV.

The CP model permits clear physiological interpretations of the parameters but also requires several simplifying assumptions. Originally, CP was interpreted as the maximum power sustainable by steady-state aerobic energy provision whereas W′ was considered to represent the “anaerobic work capacity” (Moritani et al., 1981), which is defined as the mechanical work performed during exhausting exercise of sufficient duration to elicit near-maximal anaerobic ATP yield (Green, 1994). The assumptions are as follows: First, power output is assumed to be a function of energy generated from both aerobic and anaerobic pathways. Aerobic energy supply is not limited in capacity but rather by rate, and work done at or below CP is thus limited by the maximum rate of aerobic energy supply. Anaerobic energy supply cannot be sustained indefinitely and therefore W′ is assumed to be limited by capacity but not by rate (i.e., no limit to peak power or speed). W′ is defined as work done above CP to the limit of tolerance (Poole et al., 2016). When this limit is achieved, the sustainable power is markedly reduced (typically below CP), such that no more work above CP accumulates and a maximum value of W′ is thus achieved. The physiological interpretations of the CP model parameters enable the CP model to be used for assessing task-specific aerobic and anaerobic fitness.

Early investigations of the CP model in whole-body exercise suggested its benefits for athletic performance, based on its ability to define specific pacing strategies for continuous efforts (Moritani et al., 1981; Gaesser and Wilson, 1988). The model has since been extended to model intermittent performance (Morton and Billat, 2004; Skiba et al., 2012). Given that many sports are intermittent in nature, we discuss the suitability of the W′_bal model for doping detection in the final section of the review.

Procedures for Estimating the Model Parameters for an Athlete

The most commonly applied CP test protocol requires the athlete to perform two or more time-to-exhaustion (TTE) tests. These tests consist of predetermined constant-work rates (CWR) that ensure the athlete will achieve exhaustion at particular durations. The athlete is typically granted 24 h or more of recovery between each test. The power (y-axis) vs. duration (x-axis) data from all trials is then fitted to the two-parameter CP model using ordinary least-squares regression. Early studies featured protocols consisting of two to seven trials to generate data for fitting the model (Hill, 1993). In addition, determining the powers for TTE/CWR tests is typically done using data from a graded exercise test, which requires an additional testing session. Therefore, CP testing using the TTE/CWR tests is time consuming, which limits the practical application of the method. Furthermore, these tests require control of the work rate, which is typically achieved using ergometers in laboratory-based settings.

To improve the time efficiency of CP estimation, Vanhatalo et al. (2007) proposed a new 3-min all-out test (3AOT) protocol conducted in a single testing session. This protocol is based on the assumption that a sufficiently long unpaced maximal effort (∼3 min) should fully deplete W′, such that the sustainable power beyond this time should be, by definition, equivalent to CP. Vanhatalo et al. (2007) validated the 3AOT against a traditional protocol consisting of TTE/CWR-based tests by showing that the end-test power from the 3AOT correlated with CP estimated from the traditional protocol [r = 0.99; standard error of the estimate (SEE) = 6 W] and the work completed above the end-test power correlated to W′ (r = 0.84; SEE = 2.8 kJ). The 3AOT has since been increasingly applied in research studies and in sport science practice as a time-efficient method to estimate CP and W′. However, its demanding nature is a disadvantage such that pacing is likely inevitable (Tsai, 2015) and the test also requires expensive laboratory-based cycle ergometers. Furthermore, several studies have reported that the end-test powers from the 3AOT likely overestimate the “true” CP (McClave et al., 2011; Bergstrom et al., 2013a,b; Nicolò et al., 2017).

Recently, time trials (TTs) and constant-duration tests have been increasingly used to estimate the CP model. In TT, the target distance or energy expenditure is determined and the athlete attempts to minimize the time to completion. In constant-duration tests, the trial duration is specified and the athlete attempts to maximize the average power or velocity over that time. The advantages of TT and constant-duration tests include lacking the need for a prior graded exercise test (which are used to determine the powers for TTE trials) and self-pacing may foster enhanced performance (Black et al., 2015) by enabling a fast-start strategy that results in faster O₂ kinetics (Black et al., 2015; Fullagar et al., 2016). Furthermore, TT and constant-duration tests can be conducted in the field using portable bicycle-mounted power meters, which enhances the feasibility and ecological validity of the CP model. Indeed, the need for an ecologically valid time-efficient protocol led Karsten et al. (2015) to evaluate a single-day field-based protocol for estimating the CP model. Their constant-duration protocol involved three trials of 12, 7, and 3 min in duration presented in this order and separated by 30 min of recovery. They compared this protocol to the conventional method of three TTE tests conducted in the laboratory. CP estimated from the two-parameter linear model was not statistically different between the two methods (mean difference = -2 ± 14 W; limits of agreement = -26 to 29 W) (Karsten et al., 2015). Similarly, W′ was not significantly different (mean difference = -0.14 ± 3.36 kJ; limits of agreement = -6 to 7 kJ) (Karsten et al., 2015). Similar results were obtained for CV modeling in running, as CV estimates from single-day field-based protocols featuring 30- and 60-min recoveries between the TT were not statistically different from those estimated using constant-velocity TTE tests, whereas the estimates for D′ were different (Galbraith et al., 2014). Hence, field-based, single-day protocols based on constant-duration tests can provide valid estimates for CP (or CV) but possibly not for W′ (D′). Indeed, estimates of W′ tend to be highly variable compared to those from TTE-based protocols (Table 1). The practical applicability of constant-duration trials would be further enhanced by minimizing the number of trials. A recent study compared CP and W′ estimates from protocols featuring either two or three constant-duration trials and found no difference in CP estimates (Parker Simpson and Kordi, 2016). These results corroborate those from earlier studies using TTE-based protocols, which showed that as few as two trials could be used to obtain accurate CP and W′ estimates (Hill, 1993). Therefore, two maximal-effort tests separated by as little as 30 min of recovery may represent an acceptable method for accurately modeling CP in the field. However, since the CP model has two adjustable parameters, a downside to protocols consisting of only two tests is that goodness-of-fit metrics and residuals cannot be computed.

Table 1

Property	CP test protocol

	Constant work rate (power)/time to exhaustion (CWR/TTE)	3-min all-out	Constant-duration	Time trial (constant work or distance)	Field data (akin to constant-duration)
Independent variable	Power (ergometer) or velocity (treadmill)	Time (3 min)	Time (e.g., 3, 7, 12 min)	Measured times to completion for set work or distance trials	Time (e.g., 3, 7, 12 min)
Errors in IV^a	Precision of power for cycle ergometers: 0.6–3.2% (Woods et al., 1994)	Negligible	Negligible	Ergometer – total work: precision should be similar to that of power	Negligible
	Accuracy of cycle ergometers: variable, often large systematic errors (∼10%) due to calibration, drift during the trial (Maxwell et al., 1998; Paton and Hopkins, 2001)			Distance: e.g., of a road course as measured by Jones counter or method of similar precision: ∼0.1%
	Accuracy of treadmill velocity: one group observed accuracy within 0.02 m s^-1 of desired speed (Galbraith et al., 2014)			(International Association of Athletics Federations [IAAF] and Association of International Marathons and Distance Races [AIMS], 2008; Georgopoulos et al., 2012)
Dependent variable	TTE	Power vs. time curve	Mean power	Power (constant-work) or velocity (constant distance) calculated from the times to completion for the set work or distance trials	Highest average power^e
Typical errors of the dependent variables of the test^b	Cycling TTE (durations 2–20 min):	See typical errors of end-test powers (ETP) below	Mean power cycling, trial durations 2–60 min:	Mean power cycling, trial distances 5–20 km (∼6-25 min):	Typical errors for these types of data have yet to be published
	CV(%)^d = 10–19 (Hopkins et al., 2001; Currell and Jeukendrup, 2008)		CV(%) = 1.5-3.5 (Currell and Jeukendrup, 2008)	CV(%) = ∼1–2% (Hopkins et al., 2001; Currell and Jeukendrup, 2008)	Accuracy of on-board power meters, which can vary based on conditions (e.g., temperature): ∼ ± 2.5% (Paton and Hopkins, 2001; Gardner et al., 2004; Abbiss et al., 2009)
	TTE converted to power:		Mean velocity running, trial durations 2–60 min:	Mean velocity running, trial distances 1,500–5,000 m (∼6–20 min)
	CV(%) = 1.5–2.7% (Hopkins et al., 2001)		CV(%) = 2.7 (Schabort et al., 1998)	CV(%) = 1–3% (Driller et al., 2017; Currell and Jeukendrup, 2008)
	Running TTE: (durations 2–20 min)
	CV(%) = 10 (Billat et al., 1994)
	CV(%) = 13–15 (Laursen et al., 2007)
Appropriate mathematical expression (based on the assigned independent and dependent variables)	Non-linear (equation 3)	CP = end-test power	Linear (equation 4)	Linear (equation 5, solved for t)	Linear (equation 4)
		(ETP) = mean power from final 30 s of the test
		W′ = numerically integrated AUC of power vs. time curve bounded by ETP at bottom
Typical errors in CP model parameter estimates^c	Cycling:	Cycling:	Cycling:	Running:	Cycling:
	CP – CV(%) = 2–8%	CP – CV(%) = 1–7	CP – CV(%) = 2–3	Critical velocity – CV(%) = < 1–4%	CP – CV(%) = 3–4
	W′ – CV(%) = 7–14 (Hopkins et al., 2001)	W′ – CV(%) = 28 (Johnson et al., 2011; Wright et al., 2017)	W′ – CV(%) = 46 (Experiment 1, Karsten et al., 2015)	D′ – CV(%) = 9–18% (Galbraith et al., 2011, 2014; Nimmerichter et al., 2015)	W′ – CV(%) = 15–18 (Experiment 3, Karsten et al., 2015)
Pacing/variable power	No – constant, enforced by ergometer or treadmill	Theoretically no – maximum effort throughout; however, some pacing is likely	Yes	Yes	Yes
Time to complete test protocol	Hours (if trials on same day) to days	3 min for the test itself	Hours (if trials on same day) to days	Hours (if trials on same day) to days	Data collected over days-weeks

Critical power (CP) test protocol properties.

^aErrors in the independent variable reflect systematic and random error of the instrument and operator error. ^bErrors in the dependent variable reflect the biological variability of performance, systematic and random errors of the instruments used to measure both the independent and dependent variables, and operator error. ^cErrors in CP and W′ reflect the integration of errors propagated from the independent and dependent variables. ^dCV(%) = coefficient of variation = standard error of the measurement/mean × 100. ^eHighest average power outputs from field training, and racing/time trial data recorded by an onboard power meter.

A final strategy for estimating the CP model is to extract mean-maximal power (MMP) profiles from power-meter data collected during all training and racing. MMP profiles are generated by extracting the highest average powers across a range of durations (Pinot and Grappe, 2011). Portions of these data can then be used to fit the CP model. CP models fit this way using MMP for 3, 7, and 12 min did not differ from models fit using laboratory-based constant-duration trials of the same durations (Fullagar et al., 2016). Similarly, CV models for athletes in timed sports such as swimming and running can be fit from race results over different distances (e.g., Dekerle et al., 2006; Jones and Vanhatalo, 2017). While convenient, CP models from race results can be confounded by issues such as the time between the sessions that led to the maximum powers for each duration, pacing and tactics, uncertainty as to whether maximal effort was applied, and environmental conditions.

Physiological Interpretations

Monod and Scherrer (1965) originally described the CP of a muscle as corresponding to “the maximum rate it can keep up for a very long time without fatigue.” Thus, the physiological interpretation of both CP and W′ can be framed with reference to the mechanisms of fatigue. Accordingly, Poole et al. (2016) stated that “CP may be regarded as a ‘fatigue threshold’ in the sense that it separates exercise intensity domains within which the physiological responses to exercise can (<CP) or cannot (>CP) be stabilized.” Therefore, CP represents the highest intensity of exercise for which muscle metabolic homeostasis can be sustained. Since steady-state energy metabolism reflects matching between “wholly aerobic” energy supply and total energy demand, exercise performed at or below CP is not associated with rapid accumulation of fatigue inducing metabolites and is therefore sustainable for long duration. In contrast, exercise performed above CP requires a greater contribution of substrate-level phosphorylation to meet energy demand, which leads to a progressive depletion of PCr, increased [Pi] and [H⁺], decreasing metabolic efficiency, and continuously increasing O₂, until O_2max is attained (Grassi et al., 2015). Consequently, CP represents the boundary between achievable steady-state and non-steady-state aerobic metabolism, which corresponds to the heavy- and severe-intensity domains, respectively (Figure 1; Burnley and Jones, 2007). Many studies have sought to validate CP using physiological data. CP correlates with the power output at maximal lactate steady state (Pringle and Jones, 2002) and respiratory compensation point (Keir et al., 2015), both of which are classified as “second” or “anaerobic” thresholds (Binder et al., 2008). Furthermore, O₂ achieves steady state for exercise at or below CP, but inexorably increases to O_2max during exercise slightly above CP (Poole et al., 1988; De Lucas et al., 2013; Murgatroyd et al., 2014; Vanhatalo et al., 2016). In each study, participants achieved task failure markedly sooner for exercise slightly above CP.

W′ was originally considered to represent an energy reserve for mechanical work for power above CP (Monod and Scherrer, 1965). This energy reserve was thought to be from anaerobic sources (Moritani et al., 1981), such that W′ was subsequently conceptualized as a metric of anaerobic work capacity (Bulbulian et al., 1986; Nebelsick-Gullett et al., 1988; Housh et al., 1990). However, this terminology was deemed inappropriate for several reasons. First, the inexorable increase in O₂ until task failure means that oxidative phosphorylation contributes to the total energy supply for power above CP, such that W′ cannot be fully anaerobic in origin. Second, estimates of W′ were lower when modeled from trials performed in hyperoxia compared to normoxia (Vanhatalo et al., 2010a), suggesting that it is sensitive to oxygen availability and thus has an aerobic component. Third, after exhaustive exercise, the reconstitution of W′ is slower than the recovery of O₂ but faster than lactate (Ferguson et al., 2010). This result implies that the kinetics of W′ reconstitution are not a unique function of phosphocreatine concentration, lactate concentration, or anaerobic energy per se. Lastly, it was found that skeletal muscle blood flow increases disproportionately during exercise above CP (Sarelius and Pohl, 2010). These authors concluded that increased muscle blood flow implies higher rates of oxidative metabolism, which is a characteristic of type-I muscle fibers. Hence, increased recruitment of type-I muscle fibers may help to protect against a progressive reduction in efficiency at or above CP (Murgatroyd and Wylde, 2011). Therefore, the three main mechanisms of energy production (PCr, glycolysis, oxidative) increase their energy output during exercise above CP and hence contribute to the energy store known as W′ (Grassi et al., 2015).

Although W′ is not uniquely determined by anaerobic capacity, it nevertheless correlates to various indices thereof, including to biochemical estimates from muscle biopsies (r = 0.73; Green et al., 1994), the mean power from the Wingate test (r = 0.74; Nebelsick-Gullett et al., 1988), accumulated work in high-intensity intervals (r = 0.74; Jenkins and Quigley, 1991), and maximal accumulated oxygen deficit (MAOD; W′ and MAOD were not different, Hill and Smith, 1993; r = 0.65, Muniz-Pumares et al., 2016). As discussed below, W′ is also sensitive to manipulations expected to change anaerobic capacity. Accordingly, anaerobic capacity is an important but not sole determinant of W′, such that W′ is potentially useful for detecting doping methods that seek to manipulate this capacity.

Applications in Sport

The CP model has long been applied to analyzing and optimizing athletic performance. The model enables performance prediction, informs pacing tactics, and helps with the design of interval-training workouts (Pettitt, 2016). Furthermore, CP represents the boundary between heavy and severe-intensity exercise, such that it informs the training zones used by coaches in prescribing training intensity (Clarke and Skiba, 2013). The related W′_bal model enables the real-time monitoring of energy available for severe-intensity exercise, which could inform tactical decisions during competitions. The CP model has been used to derive insights into world-record performances (Dekerle et al., 2006). The model is applicable to diverse sports; it has previously been applied to individual sports such as cycling (Moritani et al., 1981; McClave et al., 2011; Karsten et al., 2015), running (Hughson et al., 1984; Hill et al., 2011), swimming (Wakayoshi et al., 1992; Toubekis and Tokmakidis, 2013), and rowing (Kennedy and Bell, 2000; Morton, 2009; Kendall et al., 2011), team sports such as rugby sevens (Clarke et al., 2014) and soccer (Clark et al., 2013) and racquet sports such as table tennis (Zagatto et al., 2008). The model has yet to be applied for doping detection, and this application would represent the most stringent test of its properties.

Evaluation of the CP Model for Doping Detection: Promise and Challenges

The CP model could be used in three ways to suspect doping: (1) unrealistically high CP or W′ values compared to population norms, (2) unrealistic increase in one or both of the model parameters, CP or W′, within a given time frame, or (3) unrealistic performance compared to the prediction of an existing CP model within a given time frame. In each case, thresholds of suspicion must be established. These thresholds in turn would need to be based on scientifically justified abnormal values or rates of change that exceed the typical error of the measurement with high probability.

The severe consequences of doping sanctions on athletes, which include bans up to 4 years for first offenses and up to lifetime for second offenses, necessitates that any classification method used as evidence for sanctions must be highly specific for doping. The method must also be sufficiently sensitive to serve as a significant deterrent. Sensitivity and specificity are properties that express the ability of a continuous measurement to appropriately classify a subject in terms of a discrete feature or property; these properties are often visualized as receiver–operator characteristic (ROC) curves. Sensitivity is the true positive rate (dopers correctly classified as dopers) while specificity is the true negative rate (non-dopers correctly classified as non-dopers). In the case of CP-model-based doping detection, the continuous measurement would be the athlete’s CP, W′, or observed performance, which if outside a threshold value would classify the athlete as “suspected to be doping.” To be acceptable as a method for doping detection, a classifier based on the CP model would have to feature specificity greater than 99%, as required by WADA (World Anti-Doping Agency, 2014), and a sensitivity greater than the 10–20% estimated for existing detection methods (de Hon et al., 2014). The sensitivity and specificity of the CP model to classify dopers have yet to be scientifically studied.

Although no direct evidence yet exists pertaining to its properties as a classifier for doping detection, at least two indirect lines of evidence enable the evaluation of its potential for use in doping detection and to identify challenges to be resolved. These lines of evidence include (1) the sensitivity of the CP model parameters to performance-modifying manipulations and (2) the accuracies of the model parameter value estimates and the accuracy of the model predictions. In the discussion that follows, we employ the following definitions. Accuracy refers to the degree to which the estimate is different from the “true” value. It is analogous to criterion validity; however, we prefer “accuracy” rather than “validity” because of difficulties with interpreting the latter (Sechrest, 2005; cf. Newton and Shaw, 2013).

Additional concepts important to this discussion are reliability, minimally detectable change, and precision. Reliability is the reproducibility of the values measured in repeated trials conducted under the same conditions (Hopkins, 2000; Weir, 2005). Reliability is assessed through repeated measurements on the same subjects and is typically expressed in either relative or absolute terms. Relative reliability is expressed as the intraclass correlation coefficient and absolute reliability is expressed as the standard error of the measurement (SEM) (Weir, 2005). Absolute reliability is also commonly expressed as a coefficient of variation or typical error, which is the ratio of the SEM and the mean value of the repeated measures (Hopkins, 2000; Weir, 2005). Furthermore, the SEM determines the minimally detectable change, which is the smallest difference between measurements that can be considered real and not due to random error (Weir, 2005). Precision refers to the goodness-of-fit of a model to data, and is expressed as the R² or model standard error of the estimate, and is reflected by the confidence intervals of the parameter estimates. While precision and reliability are not synonymous, reliability is intertwined with the precision of single measurements (Hopkins, 2000; Weir, 2005). Good precision and reliability are necessary for model accuracy.

Another important property is the typical variation in performance. While variation in athletic performance depends on the nature of the sport, the within-season coefficients of variation in race times across several sports are typically less than 2.5% (Table 2). Furthermore, within-athlete performance variabilities are similar across seasons; for example, skeleton, rowing, and cross-country skiing performance variations in race times were 0.5, 1, and 1.3%, respectively (Malcata and Hopkins, 2014). The potential usefulness of the CP model as a doping detection tool depends on its ability to detect performance gains beyond these predictable seasonal performance gains. Since the typical variations in performance tend to be small, and that these performance data are used to estimate the CP model, the typical errors of the CP model parameter estimates are likely to be small as well, as will their subsequent minimally detectable changes. The discussion that follows corroborates this expectation: the CP model is sensitive to the administration of performance-modifying substances and strategies.

Table 2

Activity type	Distance/event	Season variation	Reference
Running	<3 km	Men: 0.8%	Hopkins, 2005
		Women: 1%
Running	3–10 km	Men: 1.1%	Hopkins, 2005
		Women: 1.1%
Track cycling individual pursuit	4 km	Men: 1%	Flyger, 2009
		Women: 1.2%
Cycling road racing	Tour de France and World Cup (top eighth)	Men: 0.4–0.7%	Paton and Hopkins, 2006
Cycling time trials	Tour de France (top eighth) and International (top half)	Men: 1.3–1.7%	Paton and Hopkins, 2006
Triathlon	Olympic distance, total time for top-10% of finishers	Men: 1.1%	Paton and Hopkins, 2005
Mountain biking	World cups (top quarter)	Men: 2.4%	Paton and Hopkins, 2006
		Women: 2.5%

Examples of typical variation in race times for elite athletes.

Sensitivity of CP and W′ to Performance-Modifying Manipulations

The CP model parameters are sensitive to performance-modifying manipulations, such as training, different environments, and ergogenic manipulations (Table 3). Importantly, CP and W′ tend to be sensitive to manipulations that are consistent with their physiological interpretations, which can provide clues as to the nature of the doping substance or method. Specifically, CP tends to be sensitive to substances and methods that improve oxygen transport whereas W′ tends to be sensitive to substances and methods that improve strength and power.

Table 3

Intervention	Dosage/exposure	Duration	Participants	Effect size^a	Reference
Hypoxia	FiO₂: 20% (∼250 m) vs. 12% (∼4,250 m)	Single exposure	9 trained male cyclists	CP: 2.98 ↓	Townsend et al., 2017
				W′: 1.19 ↓
	FiO₂: 21% (sea level) vs. 15.5% (∼2,500 m)	Single exposure	11 well-trained male cyclists	CP: 0.68↓	Shearman et al., 2016
				W′: 0.068 ↓
Hyperoxia	FiO₂: 70% vs. 21% (sea level)	Single exposure	7 habitually active males	CP: 0.77↑	Vanhatalo et al., 2010a
				W′: 0.81↓
Caffeine	5 mg ⋅ kg^-1 body mass	2 non-consecutive days	9 males	CP: 1.05↑	Silveira et al., 2017
				W′: 1.3↑
	6 mg ⋅ kg^-1 body mass	4 non-consecutive days	8 males	CP: 0.16↓	Moreira Gonalves et al., 2010
				W′: 0.8↑
Creatine	20 g ⋅ day	5 consecutive days	8 healthy males	CP: 0.32 ↓	Miura et al., 1999
				W′: 0.98↑
	20 g ⋅ day	5 consecutive days	10 physically active women	W′: 0.77↑	Eckerson et al., 2004
	20 g ⋅ day	5 consecutive days	19 participants	CP: 0.81↑	Jacobs et al., 1997
	20 g ⋅ day	5 consecutive days	15 untrained university students	CP: 0	Smith et al., 1998
				W′: 0.4↑
	10 g ⋅ day	4 weeks	42 recreationally active men	CP: 0.26↑	Kendall et al., 2009
				W′: 0
Bicarbonate	0.3 g⋅kg^-1 body mass	5 consecutive days	8 trained male cyclists and triathletes	CP: 0.9↑	Mueller et al., 2013
	0.3 g ⋅ kg^-1 body mass	Single trial	8 habitually active participants	CP: 0.06↑	Vanhatalo et al., 2010b
				W′: 0.11↓
	0.3 g ⋅ kg^-1 body mass	2 trials	11 trained cyclists	Normoxia	Deb et al., 2017
				W′: 0.4↑
				Hypoxia
				W′: 0.53↑
“Pre-workout” supplement	10 g ⋅ day	3/per week /3 weeks	24 moderately trained recreational athletes	CV: 0.5↑	Smith et al., 2010
				W′: 0
Erythropoietin	Meta–analysis of 17 laboratory studies	n/a		Aerobic performance:	Lodewijkx and Brouwer, 2011
				0.41–0.49 ↑
Human Growth Hormone and Testosterone:	HGH: daily doses up to 30 μg ⋅ kg^-1 body mass	12 weeks	14 middle-aged men	O_2max: 0.76 ↑	Zając et al., 2014
	Testosterone: 100 mg; once a week			Anaerobic threshold: 0.68 ↑
				Work rate max: 0.6 ↑
				Total work: 0.29 ↑
				Maximum power output: 0.27 ↑
Ephedrine	0.8 mg ⋅ kg^-1 body mass	Single day	10 males, 2 women	Time to completion: 0.43 ↓	Bell et al., 2002
	1 mg ⋅ kg^-1 body mass	Single day	16 males	Power output 5 s Wingate test: 0.18↑	Bell et al., 2001
				Time to Exhaustion- MAOD: 0.35 ↑
Training	Low intensity continuous exercise training/ high intensity interval training	6 weeks	14 males	CP: low intensity 1.8 ↑	Gaesser and Wilson, 1988
				High intensity: 2.5 ↑
				W′: low intensity: 0.56↓
				High intensity: 0.58↓
	High intensity interval training	7 weeks	8 males	CP: 1.67 ?	Poole et al., 1990
				W′: 0.13 ?
	High intensity interval training	8 weeks	19 males	CP: 0.56 ?	Jenkins and Quigley, 1993
				W′: 2.43 ?
	Resistance training	6 weeks	16 males	CP: 0.87 ↓	Bishop and Jenkins, 1996
	High intensity interval training (with/without creatine supplementation)	6 weeks	42 active men	CP (Cr): 0.26 ↑	Kendall et al., 2009
				CP (Placebo): 0.165↑
				W′ (Cr): 0.17 ↓
				W′ (Placebo): 0.49 ↑
	Resistance training	8 weeks	14 males	CP: 0.05 ↓	Sawyer et al., 2014
				W′: 1.02 ↑

Effects of performance-modifying interventions on CP model estimates.

^aEffect size was calculated as Cohen’s d, whereby where is the mean for the ith group and where S_i is the standard deviation for the ith group.

Training

Critical power increases in response to both low-intensity continuous training (Gaesser and Wilson, 1988) and high-intensity interval training (Gaesser and Wilson, 1988; Poole et al., 1990; Jenkins and Quigley, 1993). Low-intensity, continuous training decreases W′ while the effects of high-intensity interval training on W′ remain controversial (Table 3). Resistance training reduces CP (Bishop and Jenkins, 1996; Sawyer et al., 2014) and improves W′ (Jenkins and Quigley, 1993; Sawyer et al., 2014).

Environmental Variables

Critical power increases with exposure to acute hyperoxia (70% O₂, 30% N₂) compared to normoxia, whereas W′ decreases (Vanhatalo et al., 2010a). The opposing responses of CP and W′ in this experiment may have been artifactually caused by the hyperbolic form of the model (see section below on “Model Bias and Artifacts”). In contrast, acute hypoxia treatment to simulate various altitudes decreases CP (Parker Simpson et al., 2014; Townsend et al., 2017) in a dose-response manner consistent with observed decrements in O_2max (Townsend et al., 2017). Specifically, CP decreased in proportion to simulated altitude, with significant reduction evident at 1,250 m. W′ was less sensitive to altitude change than CP as it was significantly reduced only at a simulated altitude of 4,250 m.

Ergogenic Aids

Ergogenic aids are substances or methods used to improve athletic performance. The effects of several ergogenic aids including caffeine, ephedrine, creatine, and bicarbonate have been tested for their effects on the CP model. An acute ingestion (60 min pre-workout) of caffeine (6 mg kg^-1) significantly increased W′ (∼23%, effect size = 0.8) while CP was unchanged (Moreira Gonalves et al., 2010). However, a recent study found that a similar caffeine supplementation (5 mg kg^-1, 60 min prior to the workout) significantly improved both W′ (effect size = 1.3) and CP (effect size = 1.5) (Silveira et al., 2017). Increases in both CP and W′ in response to experimental treatments are uncommon.

By comparison, acute ingestion of ephedrine (0.8 mg/kg) significantly decreased 10-km run times by approximately 48 s (Bell et al., 2002). Furthermore, ephedrine ingestion increased power output during the early phase of the Wingate test (effect size = 0.18), increased TTE (effect size = 0.35), and blood lactate, glucose, and catecholamine levels (Bell et al., 2001). Similar effect sizes were therefore observed for ephedrine intake and caffeine on performance measures reflecting aerobic and anaerobic fitness.

Creatine supplementation enhances the resynthesis of phosphocreatine (Williams and Branch, 1998), hence it is reasonable to expect that it might affect W′. Indeed, creatine supplementation (20 g for 5 days) significantly improved W′ [effect sizes = 0.98 and 0.74] (Miura et al., 1999; Eckerson et al., 2004). In contrast, the effect of creatine on CP is uncertain. Some studies have revealed small effects of creatine supplementation on CP (Jacobs et al., 1997; Smith et al., 1998), while creatine supplementation combined with high-intensity interval training was reported to significantly improve CP (Kendall et al., 2009). In the latter study, the duration of creatine supplementation exceeded those of previous studies by more than five fold (28 days vs. 5 days), which may explain the difference in the results.

Three types of bicarbonate supplementation protocols are typically employed: acute (single dose of ∼0.3 g⋅ kg^-1 60–90 min before competition), chronic (∼0.5 g⋅ kg^-1 per day divided into 2–3 portions), and multi-day acute supplementation (one dose per day before competition for all days of the competition). A multi-day (5 days) acute bicarbonate supplementation in well-trained endurance athletes significantly increased W′ (effect size = 0.9) compared to placebo (Mueller et al., 2013). Acute bicarbonate supplementation (0.3 g⋅kg pre-exercise) did not affect W′ and CP in one study (Vanhatalo et al., 2010a) but significantly improved W′ in both hypoxic and normoxic environments (effect sizes = 0.4 and 0.53, respectively) in another study (Deb et al., 2017). This improvement was possibly due to enhanced buffering capacity that delays exercise-induced acidosis and enhances anaerobic energy supply (Deb et al., 2017).

Taken together, the effect sizes of performance-modifying treatments on CP and W′ are similar to those observed for doping agents (Table 3). These results therefore support the potential utility of the CP model for detecting doping in individuals. However, three caveats limit this claim. First, the study volunteers were not elite athletes and in some cases were untrained, such that the potencies of the ergogenic aids may be different than those observed in elite athletes. Second, the reported effect sizes of prohibited methods and substances are similar to those caused by legal performance-enhancing methods and substances, such that doping thresholds should exceed these effects to enhance detection specificity (i.e., avoid false positives). We note that anecdotally reported effects of doping typically exceed those reported in studies. Third, doping is always done in conjunction with other strategies to optimize performance, such that the observed changes to the model parameters in response to doping per se may be substantially less than those of isolated factors. At least one study examined the effects on CV of a supplement that contained several of the ergogenic aids listed above. Specifically, supplementation of participants with Game Time^® (Corr-Jensen Laboratories Inc., Aurora, CO, United States), which contains whey protein, cordyceps sinensis, creatine, citrulline, ginseng, and caffeine, was found to increase CV relative to placebo (+2.9%, effect size = 0.5) when combined with high-intensity interval training (Smith et al., 2010). Similarly, caffeine and ephedrine offered no additional benefit over ephedrine alone (Bell et al., 2001, 2002). The apparent lack of additive effects of performance-enhancing supplements reported in these studies suggest that higher sensitivity may be necessary to detect small additive or synergistic changes of prohibited agents on top of training effects.

Accuracies of Model Parameter Estimates and Predictions of Performance

The accuracies of CP model parameter estimates and predictions of performance using the model represent a second line of indirect evidence for evaluating the potential of the model to detect dopers. Inaccurate models would be difficult to justify for use in anti-doping.

Accuracy of the Parameter Estimates

The accuracy of CP model parameter estimates is challenging to directly assess because there is no gold-standard measurement against which to compare them. In the past, the accuracy of the CP model was assessed according to its definition as the “maximal power that can be sustained without fatigue for a very long time” (theoretically infinite time). The accuracy of CP was accordingly assessed using TTE tests completed at CP, and CP was found to be sustainable for 20–60 min depending on the study (Hill, 1993; Vandewalle et al., 1997). The accuracy of CP was best when it was estimated from protocols featuring test durations that were well spaced in the domain of durations and that included a longer-duration test (e.g., >20 min; Vandewalle et al., 1997). Nevertheless, CP is inevitably inaccurate based on its original mathematical definition because the definition reflects the simplifying assumption that fatigue is solely caused by W′ depletion, which is physiologically untrue. Instead, assessing the accuracy of the CP estimates should be in light of its physiological definition, i.e., the maximum power at which muscle metabolic variables achieve steady state (Poole et al., 2016). To fulfill this criterion, participants should exercise at various powers near CP, during which measurements of physiological and metabolic variables are collected. Such studies (Poole et al., 1988; Jones et al., 2008; De Lucas et al., 2013; Murgatroyd et al., 2014) feature protocols in which exercise was performed at an intensity 5–10% above CP, the responses to which were compared to those of exercise at or slightly below CP. Steady states in physiological variables were achieved for exercise at or below CP but not for exercise above CP. The estimates of CP are therefore accurate at least to within 5–10% of the “true” physiological CP. These studies have typically featured specialized equipment that is inaccessible to most athletes; instead, emerging techniques such as portable near-infrared spectroscopy to measure muscle oxygenation may prove useful as a criterion measure.

As with CP, there is no gold-standard physiological measure of W′ that can be used to assess its accuracy. In the past, when W′ was conceptualized as the anaerobic work capacity, several groups tested the relationship between W′ and commonly used indirect measures of anaerobic capacity, such as Wingate tests and MAOD (discussed in the Section “Physiological Interpretations”). Subsequent studies showed higher correlations between W′ and MAOD when the W′ estimates had lower standard errors or when the estimates of W′ from the three common mathematical expressions of the two-parameter CP model (see equations 3, 4, and 5) were more similar (Hill and Smith, 1994). These precision criteria were then proposed as means to assure the accuracy of W′ estimates (Hill and Smith, 1994). However, the validity of this approach is limited because indirect measures of anaerobic capacity are themselves inaccurate. All indirect measures of anaerobic capacity are confounded by the contributions of aerobically produced energy and compromised by assumptions regarding efficiency of energy conversion (Green, 1994). The current definition of W′ is the mechanical work completed above CP until the limit of tolerance (Poole et al., 2016), and any future attempts to establish its accuracy must be in accordance with this definition.

Accuracy of the Model Predictions

While the accuracies of the parameter estimates are difficult to evaluate, the accuracy of performance prediction is more straightforward to evaluate because predicted performances can be compared to observed performances. For example, the CP model accurately predicted 2,000-m rowing-ergometer performance (Kennedy and Bell, 2000), and predicted marathon running performance better than O_2max and ventilatory threshold (Florence and Weir, 1997). In general terms, the CP model is accurate for predicting performances when interpolated from within its domain of validity and is less accurate outside of that domain (Vandewalle et al., 1997), the reasons for which are described in more detail below.

Precision and Reliability

The precision of CP model fits to power-duration data tends to be excellent, with values of R² typically well above 0.9. The typical errors of CP and W′ are respectively low and high (Table 1). An explanation for these observed typical errors is the hyperbolic relationship between power and duration: small increases in sustainable power at a given duration lead to large changes in TTE at the prior sustainable power. CP is relatively insensitive to errors in TTE, whereas W′ is highly sensitive to such errors (Vandewalle et al., 1997; see Figure 5 in that paper).

Model Bias and Artifacts

The assumed hyperbolic form of the power-duration curve introduces artifacts that bias estimates and predictions (Vandewalle et al., 1997). The departure of power-duration data from the hyperbolic curve is easily visualized (Figure 2) and demonstrates that the CP model will overpredict performance for trials whose durations are outside the range of those used in estimating the model (Pepper et al., 1992; Vandewalle et al., 1997). The physiological basis for this lack of fit is that numerous fatigue mechanisms operate to decrease sustainable power as duration increases (Burnley and Jones, 2007), whereas the CP model assumes that fatigue occurs solely during exercise above CP due to W′ depletion. The tendency for the CP model to overpredict performance represents a clear limitation of the CP model for doping detection. In addition, model lack-of-fit may manifest even within its valid domain, as non-uniformity of model residuals has been observed (Hinckson and Hopkins, 2005). The extent and implications of this lack-of-fit should encourage more authors to report residual diagnostics when using the CP model, which is standard procedure in statistical modeling for assessing model goodness-of-fit and validating the model assumptions (Morton and Hodgson, 1996).

FIGURE 2

Another possible artifact of the CP model is the anti-correlation of changes in CP and W′ in response to experimental treatments (Gaesser and Wilson, 1988; Jenkins and Quigley, 1993; Vanhatalo et al., 2010a; Poole et al., 2016). While the decrease in W′ might be real in some circumstances, at least two plausible explanations for this observation exist. First, the artifact might arise from the assumption of no rate limitation in W′ expenditure, which ignores the physiological reality that peak power is finite. This finite peak power may constrain improvements to short-duration performance in response to increased CP. In modeling improved CP, the hyperbolic function may compensate for these constraints by rotating counterclockwise, which results in reduced W′. Second, the artifact may result from learning effects affecting longer-duration TTE tests disproportionately compared to short-duration TTE tests (Hill, 1993). That is, learning effects may cause the study participants to improve more in the longer-duration tests than in the shorter duration ones over the course of repeated administrations of the tests. Improvements in the long-duration trials but not in the short-duration trials would artifactually increase CP and decrease W′. The impact of this potential anti-correlation artifact is unclear: on the one hand it points to a limitation of the model; on the other hand, simultaneous increases of CP and W′ may represent a potential standalone criterion for doping suspicion given that such changes are rarely observed in response to legal performance-enhancing strategies.

Finally, the precision and accuracy of CP model parameters and predictions are sensitive to the methods used for estimating the model. Several options are available for estimating the CP model, including the test protocol type (e.g., TTE, 3AOT, etc.; Table 1), the specific intensities or durations of the trials, and the mathematical expression used to fit the data (non-linear, linear power vs. inverse duration, linear work vs. time) (Table 1). The choice of the mathematical model depends in part on how the test was conducted and which variables are considered independent and dependent. For example, if TTE tests are used, the independent variable is the power and the dependent variable is duration. Conversely, if a TT is employed, then the distance is the independent variable and duration is the dependent variable. These assignments matter because the statistical procedures used to regress the variables feature assumptions about the errors of the variables. For linear regression, the independent variable is assumed to have no error; if it does, errors-in-variables models should be used because the estimates may be biased otherwise (Raboud, 2005). Hence, the common procedure of using the linearized form of the CP model for fitting TTE tests may lack statistical rigor, although the consequences of its use will depend on the magnitude of the errors in the TTE data.

Summary: Potential for and Limitations of the CP Model for Use in Doping Detection

Power-duration models are useful in anti-doping because of their ability to describe MMP data. Of existing power-duration models, the CP model holds particular promise for use in doping detection because its properties have been well studied, the model is simple (i.e., it features just two parameters) and thus requires relatively few data to estimate, and the physiological interpretations of the parameters mean that doping strategies will specifically enhance either CP or W′ depending on their mechanisms of action. From a statistical standpoint, the model can be used to detect doping if the doping effects cause changes to the CP model parameters or its performance predictions that exceed their typical errors and seasonal fluctuations due to legal performance enhancement strategies (e.g., training, ergogenic aids). The typical errors for CP are low, especially for constant-duration tests, TTs, and field data, while those for W′ are high (Table 1). The accuracy of CP estimates is also unknown but is at least within 5–10% of the “true” physiological CP. The accuracy of W′ estimates is doubtful given its large typical error. Hence, thresholds for detection for CP and predicted performance would be relatively narrow while the threshold for detection based on W′ would be relatively wide. Furthermore, the CP model is sufficiently sensitive to detect average changes in performance in response to treatments applied to groups of people and competitive performances of highly trained elite athletes are relatively invariant within and across seasons (Table 2). Accordingly, large increases in performance due to doping should be detectable using the CP model.

The promise of the CP model for anti-doping is counterbalanced by several limitations. A main limitation arises from the simplifying assumption that power-duration data are well described by a hyperbolic curve. Indeed, such data are well approximated by the curve within the domain of durations of the trials used to generate the data to estimate the model. Outside of that domain, the model will overpredict performance. A second limitation is the high typical error of W′ estimates. The importance of this limitation depends on the duration of the predicted performance because the relative influence of W′ decreases with duration as its contribution to total energy supply relative to CP diminishes. A third limitation is that the parameter estimates are sensitive to how the data were collected. The degree to which these limitations affect the ability of the model to detect doping is currently unknown.

Implementing the CP Model in Doping Detection: Recommendations Regarding Methodology and Future Research

The preceding discussion motivates three methodological recommendations regarding implementing the CP model for doping detection. First, data of the highest quality should be used. Data for fitting CP models could come from several types of sources, such as from power or velocity data curated from athlete-monitoring devices or video tracking, or from publicly available databases of race results. It is also conceivable that the CP model estimates could come from laboratory-based testing. Regardless of the source, data from the same source should be used for longitudinal comparisons because of the sensitivity of the CP model to the test protocol and statistical procedure used to fit it. Furthermore, the limitations of a given data source must be acknowledged and explicitly accounted for. For example, field data from training and competitions represents what the person did and not necessarily what they were capable of doing, which could lead to artifactually large differences in CP and W′ estimates at different points in time. Second, the statistical procedure used to fit the model should suit the data source due to the potential bias that could be introduced if the independent variable has errors and the statistical procedure does not account for them. In addition, rigorous statistical procedure demands that the model residual diagnostic tests be performed for all model fits and confidence or prediction intervals be calculated for the model parameters and predicted performance. Finally, detection decisions must be insensitive to the consequences of the model’s simplifying assumptions. Power-duration data are well approximated by the hyperbolic function but lack-of-fit is to be expected. Detection thresholds must be sufficiently wide such that the lack-of-fit does not lead to false positives; however, wider detection thresholds reduce the sensitivity of the method.

The preceding discussion also revealed several open questions that must be resolved through new research. Most critically, the classification properties of the CP model (sensitivity, specificity) with regards to discriminating dopers and non-dopers must be characterized. The data showing the sensitivity of the CP model to individual treatments are insufficient in isolation because the model will have to detect performance enhancements due to doping that are inevitably confounded with those caused by legal performance enhancements due to training and use of ergogenic aids. A first study could involve applying the model to retrospective longitudinal velocity/power-duration data from a group of athletes that includes convicted dopers. A successful model-based doping detection approach should identify suspicious performances in the known doper cases.

Before such a study is possible, the method used to set the detection thresholds must be established. The simplest approach is to leverage existing statistical approaches for detecting changes in athletic performance (Hopkins, 2000, 2004; Bagger et al., 2003; Weir, 2005). These approaches have yet to be evaluated in their abilities to detect non-random changes in CP or W′ estimates. Even if these methods were found to be suitable, it would be best to integrate CP-model-based detection into the existing ABP framework because it is unclear whether doping sanctions could be assigned based on performance data alone. Instead, performance data should be included in the ABP as an additional independent source of evidence. How the CP model should be integrated into the ABP is currently unclear. The ABP works by monitoring biomarkers over time and comparing their values to thresholds that are set according to previously observed variation. The expected mean and variance of the biomarker is determined using a Bayesian approach in which an athlete’s values are first compared to a distribution generated from population norms and subsequently updated by the integration of the athlete’s own values. However, normative cross-sectional data are not yet available for the CP model, such that future studies employing meta-analyses of published data and cross-sectional studies of athlete populations are needed to estimate the population distributions of the CP model parameter values. Such studies should be supplemented with longitudinal monitoring studies of individuals to estimate the expected seasonal variations in CP and W′ (e.g., Passfield et al., 2017), which could further inform the prior distributions. Longer term studies of the trajectories of CP and W′ throughout athlete careers would provide information on the usual rates of improvement in these parameters as a function of athlete age and training experience. Suspicious rates of improvement could then be used as an additional variable for doping detection.

Finally, we recommend a third line of research in which the CP model statistical properties are studied. It would be beneficial to express the CP model as a formal statistical model that includes all the important sources of variance. Such a model would enable the study of how errors in the data collection propagate to the model parameter estimates, which in turn may enable the optimization of data collection protocols to minimize the uncertainties of the CP and W′ estimates. The data presented in Table 1 could be useful for parameterizing such models.

Implementing the CP Model in Doping Detection: an Illustration

We offer a simple example of how a CP-model-based detection method could be implemented. We extracted the data of Pinot and Grappe (2015), which features longitudinal MMP data from a professional grand tour cyclist, and added simulated doping effects for selected years. The dataset contains MMPs for each of the years from 2003 to 2008, of which the data corresponding to the 300, 600, 1,200, and 1,800 s durations were used to fit CP models, because these durations are within the valid domain of the CP model. In addition, the MMP data exhibited a log-linear increasing trend that is typical for a cyclist developing from age 18 to 23. For simplicity of this illustration, we removed this trend from the MMP values using log-linear regression so that we could fit the CP model to data from across multiple years in a manner similar to how the ABP is applied. In years 2007 and 2008, we increased by 5% all the MMP values to simulate “doped” performances while the MMP curves from 2003 to 2006 were left unchanged to represent baseline “clean” performances. In the absence of population norms for the CP model, we assumed that utilizing three or more years of baseline data (12 or more MMP data points) would generate individualized CP model thresholds comparable to a fully Bayesian implementation. This assumption is supported by previous work showing that z-score thresholds generated from an individual athlete’s data alone converge with the ABP model thresholds and demonstrate comparable classification performance once both models are trained on sufficient baseline data (Sottas et al., 2007). We thus chose to condition the CP model on years 2003–2005 to generate 99% prediction intervals that were used as the basis of comparison for the MMP data from 2006. Similarly, the data from years 2003–2006 were used to compare data from 2007, and data from years 2003–2007 were used to compare data from 2008.

We observed that the “clean” 2006 performances all fell within the prediction interval (Figure 3A), such that the performance would not evoke suspicion. In contrast, two out of four “doped” 2007 performances fell outside the intervals (Figure 3B), which would be classified as suspicious. Inspection of the parameter estimates revealed that the W′ and CP estimates for “clean” year 2006 both fell within the 99% interval, while the CP but not W′ fell outside the interval in the 2007 “doped” year (Figures 3D,E). The significant change in CP but not W′ can help predict the nature of the doping agent because changes to CP are consistent with the use of doping substances that increase oxygen transport but not muscular strength. Interestingly, neither the performances (Figure 3C) nor the parameter estimates (Figures 3D,E) for “doped” 2008 fell outside the prediction intervals. This result highlights a limitation in “passport-type” detection methods in which the “doped” 2007 data were included in the model training and biased the means and increased the variance such that the “doped” 2008 performances and parameter estimates were not statistically detected.

FIGURE 3

Critical Review of the W′_bal Model

This review has focussed on utility of the CP model to doping detection; however, a limitation in the model’s utility is that performance predictions are for continuous exercise, whereas many sports are intermittent in nature. An important extension of the CP model is the “W′ balance” or “work-balance” (W′_bal) model, which predicts W′ levels over time during intermittent exercise featuring bouts of severe-intensity exercise alternated with bouts of recovery (Skiba et al., 2012). This model offers unique information for doping detection that complements that of the CP model, such that it deserves discussion.

The empirically derived W′_bal model stipulates that the remaining amount of W′, or “balance” of W′ (W′_bal), is the total W′ (W′_o) subtracted by the product of W′ expended in a prior bout of severe-intensity exercise (W′_exp) and a decreasing exponential function of time, with time constant, τ_w′:

where D_CP is the difference between CP and the power during the recovery. The decreasing exponential diminishes to zero in time, causing its product with W′_exp to decline as well, such that this function models the recovery of W′_bal to W′_o. To determine the nature of τ_w′, the model was used to simulate W′_bal in response to four protocols involving intermittent interval exercise to exhaustion. The four protocols featured 60-s severe-intensity exercise alternating with 30-s of recovery exercise, the intensity of which was different for each protocol (20 W, moderate, heavy, and severe intensities). τ_w′ was then fit to cause the modeled W′ to equal zero when the subject was exhausted. τ_w′ was observed to increase (i.e., recovery took longer) as a function of the recovery intensity. Subsequent studies further revealed the sensitivity of τ_w′ to work and recovery bout durations (Skiba et al., 2014a) and to environmental conditions (Townsend et al., 2017).

The W′_bal model adds unique information that could be used as evidence for doping detection. First, the model is useful for describing stochastic exercise featuring bouts of intermittent high- and low-intensity exercise, such as could be expected in tactical races and field-based sports. Since the point of exhaustion during a maximal bout of intermittent exercise should theoretically coincide with complete expenditure of W′ (i.e., W′_bal has reached 0 kJ), suspicion of doping might arise from W′_bal repeatedly declining well below zero and the lower bound of the model’s prediction interval, which would be physiologically implausible. Alternatively, it is possible that unrealistically high values for τ_w′ are observed, which would signify implausibly fast recovery kinetics from high-intensity efforts.

The promise is counterbalanced by issues regarding the accuracy of the model and its sensitivity to different conditions. First, the model takes as input W′, which is difficult to accurately estimate as discussed above. Furthermore, τ_w′ is determined in part by CP, and both CP and W′ are sensitive to environmental conditions such as altitude (Shearman et al., 2016; Townsend et al., 2017), which serve to reduce the accuracy of the W′_bal model if such factors are left unaccounted for. In addition, the depletion and recovery kinetics are dependent on task features such as work and rest durations (Skiba et al., 2012, 2014b) and pacing strategy. While the W′_bal model might be suitable for predicting when a rider is approaching exhaustion (Skiba et al., 2014a), it is questionable as to whether it can provide quantitative predictions of athlete performance capabilities with the stringency required for doping detection. Future research is recommended that focuses on modifying the W′_bal model in a manner that improves its accuracy for predicting performances across a broad range of intermittent exercise protocols.

Conclusion

Herein we reviewed the potential of the CP model for use in anti-doping. We conclude that the model, or improved versions thereof, hold promise for doping detection because of its sensitivity to performance-enhancing manipulations, its physiological interpretability, the low data burden, and its suitability for use in established statistical frameworks for monitoring of individuals and/or the ABP. We caution, however, that important limitations exist in applying CP-model-based doping detection. First, the discriminative abilities of the CP model for classifying dopers and non-dopers has yet to be directly studied and only indirect evidence is available to support its use. Second, the simplifying assumption of the hyperbolic function for approximating the power-duration relationship introduces biases and artifacts that reduce the model’s accuracy. Third, the model parameter estimates are sensitive to the source of the data and the data quality; W′ estimates are especially imprecise. Given the severe consequences of a positive doping test, these limitations ought to be addressed prior to the adoption of the CP model as an anti-doping tool.

Statements

Author contributions

NT and DC conceived the work. MP, EM, AY, MK, NT, and DC each contributed sections of the manuscript and contributed to editing drafts of the full manuscript. DC oversaw the project. All authors approved the final submission.

Funding

This work was supported by a Simon Fraser University President’s Research Start-Up Grant to DC.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

1
AbbissC. R.QuodM. J.LevinG.MartinD. T.LaursenP. B. (2009). Accuracy of the Velotron ergometer and SRM power meter.Int. J. Sport Med.30107–112. 10.1055/s-0028-1103285
2
AshendenM.GoughC. E.GarnhamA.GoreC. J.SharpeK. (2011). Current markers of the Athlete Blood Passport do not flag microdose EPO doping.Eur. J. Appl. Physiol.1112307–2314. 10.1007/s00421-011-1867-6
3
AugheyR. J. (2011). Applications of GPS technologies to field sports.Int. J. Sports Physiol. Perform.6295–310. 10.1123/ijspp.6.3.295
- CrossRef
- Google Scholar
4
BaggerM.PetersenP. H.PedersenP. K. (2003). Biological variation in variables associated with exercise training.Int. J. Sports Med.24433–440. 10.1055/s-2003-41180
5
BarrisS.ButtonC. (2008). A review of vision-based motion analysis in sport.Sport Med.381025–1043. 10.2165/00007256-200838120-00006
6
BejderJ.HoffmannM. F.AshendenM.NordsborgN. B.KarstoftK.MørkebergJ. (2016). Acute hyperhydration reduces athlete biological passport OFF-hr score.Scand. J. Med. Sci. Sports26338–347. 10.1111/sms.12438
7
BellD. G.JacobsI.ElleringtonK. (2001). Effect of caffeine and ephedrine ingestion on anaerobic exercise performance.Med. Sci. Sport Exerc.331399–1403. 10.1097/00005768-200108000-00024
- CrossRef
- Google Scholar
8
BellD. G.McLellanT. M.SabistonC. M. (2002). Effect of ingesting caffeine and ephedrine on 10-km run performance.Med. Sci. Sports Exerc.34344–349. 10.1097/00005768-200202000-00024
9
BergstromH. C.HoushT. J.ZunigaJ. M.TraylorD. A.LewisR. W.CamicC. L.et al (2013a). Metabolic and neuromuscular responses at critical power from the 3-min all-out test.Appl. Physiol. Nutr. Metab.387–13. 10.1139/apnm-2012-0216
10
BergstromH. C.HoushT. J.ZunigaJ. M.TraylorD. A.LewisR. W.CamicC. L.et al (2013b). Responses during exhaustive exercise at critical power determined from the 3-min all-out test.J. Sports Sci.31537–545. 10.1080/02640414.2012.738925
11
BillatL. V.KoralszteinJ. P.MortonR. H. (1999). Time in human endurance models.Sport Med.27359–379. 10.2165/00007256-199927060-00002
- CrossRef
- Google Scholar
12
BillatV.RenouxJ. C.PinoteauJ.PetitB.KoralszteinJ. P. (1994). Reproducibility of running time to exhaustion at VO2max in subelite runners.Med. Sci. Sports Exerc.26254–257. 10.1249/00005768-199402000-00018
- CrossRef
- Google Scholar
13
BinderR. K.WonischM.CorraU.Cohen-SolalA.VanheesL.SanerH.et al (2008). Methodological approach to the first and second lactate threshold in incremental cardiopulmonary exercise testing.Eur. J. Cardiovasc. Prev. Rehabil.15726–734. 10.1097/HJR.0b013e328304fed4
14
BishopD.JenkinsD. G. (1996). The influence of resistance training on the critical power function & time to fatigue at critical power.Aust. J. Sci. Med. Sport28101–105.
- Google Scholar
15
BlackM. I.JonesA. M.BaileyS. J.VanhataloA. (2015). Self-pacing increases critical power and improves performance during severe-intensity exercise.Appl. Physiol. Nutr. Metab.40662–670. 10.1139/apnm-2014-0442
16
BulbulianR.WilcoxA. R.DarabosB. L. (1986). Anaerobic contribution to distance running performance of trained cross-country athletes.Med. Sci. Sports Exerc.18107–113. 10.1249/00005768-198602000-00018
17
BurnleyM.JonesA. M. (2007). Oxygen uptake kinetics as a determinant of sports performance.Eur. J. Sport Sci.763–79. 10.1080/17461390701456148
- CrossRef
- Google Scholar
18
ClarkB.WoolfordS. M.EastwoodA.SharpeK.BarnesP. G.GoreC. J. (2017). Temporal changes in physiology and haematology in response to high- and micro-doses of recombinant human erythropoietin.Drug Test. Anal.91561–1571. 10.1002/dta.2176
19
ClarkI. E.WestB. M.ReynoldsS. K.MurrayS. R.PettittR. W. (2013). Applying the critical velocity model for an off-season interval training program.J. Strength Cond. Res.273335–3341. 10.1519/JSC.0b013e31828f9d87
20
ClarkeA. C.PreslandJ.RattrayB.PyneD. B. (2014). Critical velocity as a measure of aerobic fitness in women’s rugby sevens.J. Sci. Med. Sport17144–148. 10.1016/j.jsams.2013.03.008
21
ClarkeD. C.SkibaP. F. (2013). Rationale and resources for teaching the mathematical modeling of athletic training and performance.Adv. Physiol. Educ.37134–152. 10.1152/advan.00078.2011
22
CorsettiR.LombardiG.LanteriP.ColombiniA.GrazianiR.BanfiG. (2012). Haematological and iron metabolism parameters in professional cyclists during the Giro d’Italia 3-weeks stage race.Clin. Chem. Lab. Med.50949–956. 10.1515/cclm-2011-0857
23
CurrellK.JeukendrupA. E. (2008). Validity, reliability and sensitivity of measures of sporting performance.Sport Med.38297–316. 10.2165/00007256-200838040-00003
- CrossRef
- Google Scholar
24
de HonO.KuipersH.van BottenburgM. (2014). Prevalence of doping use in elite sports: a review of numbers and methods.Sport Med.4557–69. 10.1007/s40279-014-0247-x
25
De LucasR. D.De SouzaK. M.CostaV. P.GrosslT.GuglielmoL. G. A. (2013). Time to exhaustion at and above critical power in trained cyclists: the relationship between heavy and severe intensity domains.Sci. Sports28e9–e14. 10.1016/j.scispo.2012.04.004
- CrossRef
- Google Scholar
26
DebS. K.GoughL. A.SparksS. A.McNaughtonL. R. (2017). Determinants of curvature constant (W’) of the power duration relationship under normoxia and hypoxia: the effect of pre-exercise alkalosis.Eur. J. Appl. Physiol.117901–912. 10.1007/s00421-017-3574-4
27
DekerleJ.NesiX.CarterH. (2006). The distance-time relationship over a century of running Olympic performances: a limit on the critical speed concept.J. Sports Sci.241213–1221. 10.1080/02640410500497642
28
DelaneyJ. A.ScottT. J.ThorntonH. R.BennettK. J. M.GayD.DuthieG. M.et al (2015). Establishing duration-specific running intensities from match-play analysis in rugby league.Int. J. Sports Physiol. Perform.10725–731. 10.1123/ijspp.2015-0092
29
DrillerM.Brophy-WilliamsN.WalkerA. (2017). The Reliability of a 5km run test on a motorized treadmill.Measur. Phys. Educ. Exerc. Sci.21121–126. 10.1080/1091367X.2016.1264402
- CrossRef
- Google Scholar
30
EckersonJ. M.StoutJ. R.MooreG. A.StoneN. J.NishimuraK.TamuraK. (2004). Effect of two and five days of creatine loading on anaerobic working capacity in women.J. Strength Cond. Res.18168–173. 10.1519/R-19655.1
31
El HelouN.BerthelotG.ThibaultV.TaffletM.NassifH.CampionF.et al (2010). Tour de France, Giro, Vuelta, and classic European races show a unique progression of road cycling speed in the last 20 years.J. Sports Sci.28789–796. 10.1080/02640411003739654
32
FergusonC.RossiterH. B.WhippB. J.CathcartA. J.MurgatroydS. R.WardS. A. (2010). Effect of recovery duration from prior exhaustive exercise on the parameters of the power-duration relationship.J. Appl. Physiol.108866–874. 10.1152/japplphysiol.91425.2008
33
FlorenceS. L.WeirJ. P. (1997). Relationship of critical velocity to marathon running performance. Relation entre la velocite critique et la performance au marathon.Eur. J. Appl. Physiol. Occup. Physiol.75274–278. 10.1007/s004210050160
- CrossRef
- Google Scholar
34
FlygerN. (2009). “Variability in competitive performance of elite track cyclists,” inProceedings of the ISBS-Conference,Konstanz.
- Google Scholar
35
FullagarH. H. K.DuffieldR.SkorskiS.WhiteD.BloomfieldJ.KöllingS.et al (2016). Sleep, travel, and recovery responses of national footballers during and after long-haul international air travel.Int. J. Sports Physiol. Perform.1186–95. 10.1123/ijspp.2015-0012
36
GaesserG. A.WilsonL. A. (1988). Effects of continuous and interval training on the parameters of the power-endurance time relationship for high-intensity exercise.Int. J. Sports Med.9417–421. 10.1055/s-2007-1025043
37
GalbraithA.HopkerJ.LelliottS.DiddamsL.PassfieldL. (2014). A single-visit field test of critical speed.Int. J. Sports Physiol. Perform.9931–935. 10.1123/ijspp.2013-0507
38
GalbraithA.HopkerJ. G.JobsonS. A.PassfieldL. (2011). A novel field test to determine critical speed.J. Sports Med. Doping Stud.11–4.
- Google Scholar
39
GardnerA. S.StephensS.MartinD. T.LawtonE.LeeH.JenkinsD. (2004). Accuracy of SRM and power tap power monitoring systems for bicycling.Med. Sci. Sport Exerc.361252–1258. 10.1249/01.MSS.0000132380.21785.03
40
GeorgopoulosA.PapageorgakiI.TapinakiS.IoannidisC. (2012). Measuring the classic marathon course.Int. J. Herit. Digit. Era1125–144. 10.1260/2047-4970.1.1.125
- CrossRef
- Google Scholar
41
GoreC.ParisottoR. (2003). Second-generation blood tests to detect erythropoietin abuse by athletes.Haematologica88333–344.
- Google Scholar
42
GrassiB.RossiterH. B.ZoladzJ. A. (2015). Skeletal muscle fatigue and decreased efficiency.Exerc. Sport Sci. Rev.4375–83. 10.1249/JES.0000000000000043
43
GreenS. (1994). A definition and systems view of anaerobic capacity.Eur. J. Appl. Physiol. Occup. Physiol.69168–173. 10.1007/BF00609411
44
GreenS.DawsonB. T.GoodmanC.CareyM. F. (1994). Y-intercept of the maximal work-duration relationship and anaerobic capacity in cyclists.Eur. J. Appl. Physiol. Occup. Physiol.69550–556. 10.1007/BF00239874
45
HaileyN. (2011). A false start in the race against doping in sport: concerns with cycling’s biological passport.Source Duke Law J.61393–432.
- Pubmed Abstract
- Google Scholar
46
HillA. V. (1925). The physiological basis of athletic records.Nature116544–548. 10.1038/116544a0
- CrossRef
- Google Scholar
47
HillD. W. (1993). The critical power concept - a review.Sport Med.16237–254. 10.2165/00007256-199316040-00003
- CrossRef
- Google Scholar
48
HillD. W.SmithJ. C. (1993). A comparison of methods of estimating anaerobic work capacity.Ergonomics361495–1500. 10.1080/00140139308968017
49
HillD. W.SmithJ. C. (1994). A method to ensure the accuracy of estimates of anaerobic capacity derived using the critical power concept.J. Sport Med. Phys. Fitness3423–37.
- Pubmed Abstract
- Google Scholar
50
HillD. W.VingrenJ. L.NakamuraF. Y.KokobunE. (2011). Relationship between speed and time in running.Int. J. Sports Med.32519–522. 10.1055/s-0031-1275298
51
HincksonE. A.HopkinsW. G. (2005). Reliability of time to exhaustion analyzed with critical-power and log-log modeling.Med. Sci. Sports Exerc.37696–701. 10.1249/01.MSS.0000159023.06934.53
52
HopkerJ.HopkerJ.PassfieldL.FaissR.SaugyM. (2016). Modelling of cycling power data and its application for anti-doping.J. Sci. Cycl.5:2.
- Google Scholar
53
HopkinsW. G. (2000). Measures of reliability in sports medicine and science.Sport Med.301–15. 10.2165/00007256-200030010-00001
- CrossRef
- Google Scholar
54
HopkinsW. G. (2004). How to interpret changes in an athletic performance test.Sportscience81–7.
- Google Scholar
55
HopkinsW. G. (2005). Competative performance of elite track and field athletes. variability and smallest worthwhile enhancements.Sportscience917–20.
- Google Scholar
56
HopkinsW. G.SchabortE. J.HawleyJ. A. (2001). Reliability of power in physical performance tests.Sport Med.31211–234. 10.2165/00007256-200131030-00005
- CrossRef
- Google Scholar
57
HoushD. J.HoushT. J.BaugeS. M. (1990). A methodological consideration for the determination of critical power and anaerobic work capacity.Res. Q. Exerc. Sport61406–409. 10.1080/02701367.1990.10607506
58
HughsonR. L.OrokC. J.StaudtL. E. (1984). A high velocity treadmill running test to assess endurance running potential.Int. J. Sport Med.523–25. 10.1055/s-2008-1025875
59
International Association of Athletics Federations [IAAF] and Association of International Marathons and Distance Races [AIMS] (2008). The Measurement of Road Race Courses, 2nd Edn. Stockholm: IAAF. 10.1139/h97-015
- CrossRef
- Google Scholar
60
JacobsI.BleueS.GoodmanJ. (1997). Creatine ingestion increases anaerobic capacity and maximum accumulated oxygen deficit.Can. J. Appl. Physiol.22231–243. 10.1080/00140139108967284
61
JenkinsD. G.QuigleyB. M. (1991). The y-intercept of the critical power function as a measure of anaerobic work capacity.Ergonomics3413–22. 10.1249/00005768-199302000-00019
- CrossRef
- Google Scholar
62
JenkinsD. G.QuigleyB. M. (1993). The influence of high-intensity exercise training on the Wlim-Tlim relationship.Med. Sci. Sports Exerc.25275–282. 10.1249/MSS.0b013e318224cb0f
63
JohnsonT. M.SextonP. J.PlacekA. M.MurrayS. R.PettittR. W. (2011). Reliability analysis of the 3-min all-out exercise test for cycle ergometry.Med. Sci. Sports Exerc.432375–2380. 10.1249/MSS.0b013e318224cb0f
64
JonesA. M.VanhataloA. (2017). The “critical power” concept: applications to sports performance with a focus on intermittent high-intensity.Exerc. Sport Med.4765–78. 10.1007/s40279-017-0688-0
65
JonesA. M.WilkersonD. P.DiMennaF.FulfordJ.PooleD. C. (2008). Muscle metabolic responses to exercise above and below the “critical power” assessed using 31P-MRS.Am. J. Physiol. Regul. Integr. Comp. Physiol.294R585–R593. 10.1152/ajpregu.00731.2007
66
KarstenB.JobsonS. A.HopkerJ.StevensL.BeedieC. (2015). Validity and reliability of critical power field testing.Eur. J. Appl. Physiol.115197–204. 10.1007/s00421-014-3001-z
67
KeirD. A.FontanaF. Y.RobertsonT. C.MuriasJ. M.PatersonD. H.KowalchukJ. M.et al (2015). Exercise intensity thresholds.Med. Sci. Sport Exerc.471932–1940. 10.1249/MSS.0000000000000613
68
KendallK. L.SmithA. E.FukudaD. H.DwyerT. R.StoutJ. R. (2011). Critical velocity: a predictor of 2000-m rowing ergometer performance in NCAA D1 female collegiate rowers.J. Sports Sci.29945–950. 10.1080/02640414.2011.571274
69
KendallK. L.SmithA. E.GraefJ. L.FukudaD. H.MoonJ. R.BeckT. W.et al (2009). Effects of four weeks of high-intensity interval training and creatine supplementation on critical power and anaerobic working capacity in college-aged men.J. Strength Cond. Res.231663–1669. 10.1519/JSC.0b013e3181b1fd1f
70
KennedyM. D. J.BellG. J. (2000). A comparison of critical velocity estimates to actual velocities in predicting simulated rowing performance.Can. J. Appl. Physiol.25223–235. 10.1139/h00-017
71
KruseT. N.CarterR. E.RosedahlJ. K.JoynerM. J. (2014). Speed trends in male distance running.PLoS One9:e112978. 10.1371/journal.pone.0112978
72
LamonS.RobinsonN.SottasP.-E.HenryH.KamberM.ManginP.et al (2007). Possible origins of undetectable EPO in urine samples.Clin. Chim. Acta38561–66. 10.1016/j.cca.2007.06.018
73
LaursenP. B.FrancisG. T.AbbissC. R.NewtonM. J.NosakaK. (2007). Reliability of time-to-exhaustion versus time-trial running tests in runners.Med. Sci. Sport Exerc.391374–1379. 10.1249/mss.0b013e31806010f500005768-200708000-00021
74
LjungqvistA. (2014). The fight against doping is a fight for the protection of the clean athlete, the health of the athlete and the integrity of sport.Br. J. Sport Med.48:799. 10.1136/bjsports-2014-093654
75
LodewijkxH. F. M.BrouwerB. (2011). Some empirical notes on the epo epidemic in professional cycling.Res. Q. Exerc. Sport82740–754. 10.1080/02701367.2011.10599811
76
MalcataR. M.HopkinsW. G. (2014). Variability of competitive performance of elite athletes: a systematic review.Sport Med.441763–1774. 10.1007/s40279-014-0239-x
77
MartinL.AshendenM.BejderJ.HoffmannM.NordsborgN.KarstoftK.et al (2016). New insights for identification of doping with recombinant human erythropoietin micro-doses after high hydration.Drug Test. Anal.81119–1130. 10.1002/dta.2004
78
MaxwellB. F.WithersR. T.IlsleyA. H.WakimM. J.WoodsG. F.DayL. (1998). Dynamic calibration of mechanically, air-and electromagnetically braked cycle ergometers.Eur. J. Appl. Physiol. Occup. Physiol.78346–352. 10.1007/s004210050430
79
McClaveS. A.LeBlancM.HawkinsS. A. (2011). Sustainability of critical power determined by a 3-minute all-out test in elite cyclists.J. Strength Cond. Res.253093–3098. 10.1519/JSC.0b013e318212dafc
80
McHughC. M.ParkR. T.SonksenP. H.HoltR. I. (2005). Challenges in detecting the abuse of growth hormone in sport.Clin. Chem.511587–1593. 10.1373/clinchem.2005.047845
- CrossRef
- Google Scholar
81
MiuraA.KinoF.KajitaniS.SatoH.FukubaY. (1999). The effect of oral creatine supplementation on the curvature constant parameter of the power-duration curve for cycle ergometry in humans.Jpn. J. Physiol.49169–174. 10.2170/jjphysiol.49.169
82
MonodH.ScherrerJ. (1965). The work capacity of a synergic muscular group.Ergonomics8329–338. 10.1080/00140136508930810
- CrossRef
- Google Scholar
83
Moreira GonalvesE.Bodnariuc FontesE.de Paula Caraa SmirmaulB.Okada TrianaR.Hideki OkanoA.et al (2010). Neuromuscular fatigue threshold, critical power and anaerobic work capacity under caffeine ingestion.Int. Sport J.11380–388.
- Google Scholar
84
MoritaniT.NagataA.deVriesH. A.MuroM. (1981). Critical power as a measure of physical work capacity and anaerobic threshold.Ergonomics24339–350. 10.1080/00140138108924856
85
MortonR. H. (2006). The critical power and related whole-body bionergetic models (Morton, H., 2006).pdf.Eur. J. Appl. Physiol.96339–354. 10.1007/s00421-005-0088-2
86
MortonR. H. (2009). Isoperformance curves: an application in team selection.J. Sports Sci.271601–1605. 10.1080/02640410903352915
87
MortonR. H.BillatL. V. (2004). The critical power model for intermittent exercise.Eur. J. Appl. Physiol.91303–307. 10.1007/s00421-003-0987-z
88
MortonR. H.HodgsonD. J. (1996). The relationship between power output and endurance: a brief review.Eur. J. Appl. Physiol. Occup. Physiol.73491–502. 10.1007/BF00357670
- CrossRef
- Google Scholar
89
MuellerS. M.GehrigS. M.FreseS.WagnerC. A.BoutellierU.ToigoM. (2013). Multiday acute sodium bicarbonate intake improves endurance capacity and reduces acidosis in men.J. Int. Soc. Sports Nutr.10:16. 10.1186/1550-2783-10-16
90
Muniz-PumaresD.PedlarC.GodfreyR.GlaisterM. (2016). A comparison of methods to estimate anaerobic capacity: accumulated oxygen deficit and W’during constant and all-out work-rate profiles.J. Sports Sci.352357–2364. 10.1080/02640414.2016.1267386
91
MurgatroydS. R.WyldeL. A. (2011). The power–duration relationship of high-intensity exercise: from mathematical parameters to physiological mechanisms.J. Physiol.5892443–2445. 10.1113/jphysiol.2011.209346
- CrossRef
- Google Scholar
92
MurgatroydS. R.WyldeL. A.CannonD. T.WardS. A.RossiterH. B. (2014). A “ramp-sprint” protocol to characterise indices of aerobic function and exercise intensity domains in a single laboratory test.Eur. J. Appl. Physiol.352191–2197. 10.1007/s00421-014-2908-8
93
Nebelsick-GullettL. J.HoushT. J.JohnsonG. O.BaugeS. M. (1988). A comparison between methods of measuring anaerobic work capacity.Ergonomics311413–1419. 10.1080/00140138808966785
94
NewtonP. E.ShawS. D. (2013). Standards for talking and thinking about validity.Psychol. Methods18301–319. 10.1037/a0032969
95
NicolòA.BazzucchiI.SacchettiM. (2017). Parameters of the 3-Minute All-Out test: overestimation of competitive-cyclist time-trial performance in the severe-intensity domain.Int. J. Sports Physiol. Perform.12655–661. 10.1123/ijspp.2016-0111
96
NimmerichterA.SteindlM.WilliamsC. A. (2015). Reliability of the single-visit field test of critical speed in trained and untrained adolescents.Sports3358–368. 10.3390/sports3040358
- CrossRef
- Google Scholar
97
Parker SimpsonL.JonesA. M.SkibaP. F.VanhataloA.WilkersonD. (2014). Influence of hypoxia on the power-duration relationship during high-intensity exercise.Int. J. Sports Med.36113–119. 10.1055/s-0034-1389943
98
Parker SimpsonL.KordiM. (2016). Comparison of critical power and W’ derived from two or three maximal tests.Int. J. Sports Physiol. Perform.12825–830. 10.1123/ijspp.2016-0371
- CrossRef
- Google Scholar
99
PascualJ. A.BelalcazarV.de BolosC.GutiérrezR.LlopE.SeguraJ. (2004). Recombinant erythropoietin and analogues: a challenge for doping control.Ther. Drug Monit.26175–179. 10.1097/00007691-200404000-00016
100
PassfieldL.HopkerJ.JobsonS.FrielD.ZabalaM. (2017). Knowledge is power: issues of measuring training and performance in cycling.J. Sports Sci.351426–1434. 10.1080/02640414.2016.1215504
101
PatonC. D.HopkinsW. G. (2001). Tests of cycling performance.Sport Med.31489–496. 10.2165/00007256-200131070-00004
- CrossRef
- Google Scholar
102
PatonC. D.HopkinsW. G. (2005). Competitive performance of elite Olympic-distance triathletes: reliability and smallest worthwhile enhancement.Sportscience91–5.
- Google Scholar
103
PatonC. D.HopkinsW. G. (2006). Variation in performance of elite cyclists from race to race.Eur. J. Sport Sci.625–31. 10.1080/17461390500422796
- CrossRef
- Google Scholar
104
PepperM. L.HoushT. J.JohnsonG. O. (1992). The accuracy of the critical velocity test for predicting time to exhaustion during treadmill running.Int. J. Sports Med.13121–124. 10.1055/s-2007-1021242
105
PernegerT. V. (2010). Speed trends of major cycling races: does slower mean cleaner?Int. J. Sports Med.31261–264. 10.1055/s-0030-1247593
106
PettittR. W. (2016). Applying the critical speed concept to racing strategy and interval training prescription.Int. J. Sports Physiol. Perform.11842–847. 10.1123/ijspp.2016-0001
107
PinotJ.GrappeF. (2011). The record power profi le to assess performance in elite cyclists.Sport Med.32839–844. 10.1055/s-0031-1279773
108
PinotJ.GrappeF. (2015). A six-year monitoring case study of a top-10 cycling Grand Tour finisher.J. Sport Sci.33907–914. 10.1080/02640414.2014.969296
109
PooleD. C.BurnleyM.VanhataloA.RossiterH. B.JonesA. M. (2016). Critical power: an important fatigue threshold in exercise physiology.Med. Sci. Sports Exerc.482320–2334. 10.1249/MSS.0000000000000939
110
PooleD. C.WardS. A.GardnerG. W.WhippB. J. (1988). Metabolic and respiratory profile of the upper limit for prolonged exercise in man.Ergonomics311265–1279. 10.1080/00140138808966766
111
PooleD. C.WardS. A.WhippB. J. (1990). The effects of training on the metabolic and respiratory profile of high-intensity cycle ergometer exercise.Eur. J. Appl. Physiol. Occup. Physiol.59421–429. 10.1007/BF02388623
112
PringleJ. S.JonesA. M. (2002). Maximal lactate steady state, critical power and EMG during cycling.Eur. J. Appl. Physiol.88214–226. 10.1007/s00421-002-0703-4
113
QuodM. J.MartinD. T.MartinJ. C.LaursenP. B. (2010). The power profile predicts road cycling mmp-Quod. Laursen-.pdf.Int. J. Sport Med.31397–401. 10.1055/s-0030-1247528
114
RaboudJ. M. (2005). Errors in Variables.Vancouver, BC: Canadian HIV Trials Network.
- Google Scholar
115
RoeckerK.MahlerH.HeydeC.RöllM.GollhoferA. (2017). The relationship between movement speed and duration during soccer matches.PLoS One12:e0181781. 10.1371/journal.pone.0181781
116
RussellG.GoreC. J.AshendenM. J.ParisottoR.HahnA. G. (2002). Effects of prolonged low doses of recombinant human erythropoietin during submaximal and maximal exercise 1.Eur. J. Appl. Physiol.86442–449. 10.1007/s00421-001-0560-6
117
SareliusI.PohlU. (2010). Control of muscle blood flow during exercise: local factors and integrative mechanisms.Acta Physiol.199349–365. 10.1111/j.1748-1716.2010.02129.x
118
SaugyM.LundbyC.RobinsonN. (2014). Monitoring of biological markers indicative of doping: the athlete biological passport.Br. J. Sports Med.48827–832. 10.1136/bjsports-2014-093512
119
SawyerB. J.StokesD. G.WomackC. J.MortonR. H.WeltmanA.GaesserG. A. (2014). Strength training increases endurance time to exhaustion during high-intensity exercise despite no change in critical power.J. Strength Cond. Res.28601–609. 10.1519/JSC.0b013e31829e113b
120
SchabortE. J.HopkinsW. G.HawleyJ. A. (1998). Reproducibility of self-paced treadmill performance of trained endurance runners.Int. J. Sports Med.1948–51. 10.1055/s-2007-971879
121
SchumacherY. O.PottgiesserT. (2009). Performance profiling: a role for sport science in the fight against doping?Int. J. Sports Physiol. Perform.4129–133. 10.1123/ijspp.4.1.129
122
SechrestL. (2005). Validity of measures is no simple matter.Health Serv. Res.401584–1604. 10.1111/j.1475-6773.2005.00443.x
123
ShearmanS.DwyerD.SkibaP.TownsendN. (2016). Modeling intermittent cycling performance in hypoxia using the critical power concept.Med. Sci. Sports Exerc.48527–535. 10.1249/MSS.0000000000000794
124
ShumwayR. H.StofferD. S. (2017). Time Series Analysis and Its Applications.Berlin: Springer. 10.1007/978-3-319-52452-8
- CrossRef
- Google Scholar
125
SilveiraR.Andrade-SouzaV. A.ArcoverdeL.TomaziniF.SansonioA.BishopD. J.et al (2017). Caffeine increases work done above critical power, but not anaerobic work.Med. Sci. Sport Exerc.50131–140. 10.1249/MSS.0000000000001408
126
SkibaP. F.ChidnokW.VanhataloA.JonesA. M. (2012). Modeling the expenditure and reconstitution of work capacity above critical power.Med. Sci. Sports Exerc.441526–1532. 10.1249/MSS.0b013e3182517a80
127
SkibaP. F.ClarkeD.VanhataloA.JonesA. M. (2014a). Validation of a novel intermittent w’model for cycling using field data.Int. J. Sports Physiol. Perform.9900–904. 10.1123/ijspp.2013-0471
128
SkibaP. F.JackmanS.ClarkeD.VanhataloA.JonesA. M. (2014b). Effect of work and recovery durations on W’ reconstitution during intermittent exercise.Med. Sci. Sports Exerc.461433–1470. 10.1249/MSS.0000000000000226
129
SmithA. E.FukudaD. H.KendallK. L.StoutJ. R. (2010). The effects of a pre-workout supplement containing caffeine, creatine, and amino acids during three weeks of high-intensity exercise on aerobic and anaerobic performance.J. Int. Soc. Sport Nutr.7:10. 10.1186/1550-2783-7-10
130
SmithJ. C.StephensD. P.HallE. L.JacksonA. W.EarnestC. P. (1998). Effect of oral creatine ingestion on parameters of the work rate-time relationship and time to exhaustion in high-intensity cycling.Eur. J. Appl. Physiol. Occup. Physiol.77360–365. 10.1007/s004210050345
131
SottasP.-E.BaumeN.SaudanC.SchweizerC.KamberM.SaugyM. (2007). Bayesian detection of abnormal values in longitudinal biomarkers with an application to T/E ratio.Biostatistics8285–296. 10.1093/biostatistics/kxl009
132
SottasP.-E.RobinsonN.FischettoG.DolleG.AlonsoJ. M.SaugyM. (2011). Prevalence of blood doping in samples collected from elite track and field athletes.Clin. Chem.57762–769. 10.1373/clinchem.2010.156067
133
SottasP.-E.RobinsonN.GiraudS.TaroniF.KamberM.ManginP.et al (2006). Statistical classification of abnormal blood profiles in athletes.Int. J. Biostat.2:1. 10.2202/1557-4679.1011
- CrossRef
- Google Scholar
134
SottasP.-E.RobinsonN.SaugyM. (2009). “The athlete’s biological passport and indirect markers of blood doping,” inHandbook of Experimental Pharmacology, ed.Barrett JamesE. (Berlin: Springer), 305–326. 10.1007/978-3-540-79088-4_14
135
SottasP.-E.SaugyM.SaudanC. (2010). Endogenous steroid profiling in the athlete biological passport.Endocrinol. Metab. Clin. North Am.3959–73. 10.1016/j.ecl.2009.11.003
136
ThevisM.KuuranneT.GeyerH.SchänzerW. (2017). Annual banned-substance review: analytical approaches in human sports drug testing.Drug Test. Anal.96–29. 10.1002/dta.2139
137
ToubekisA. G.TokmakidisS. P. (2013). Metabolic responses at various intensities relative to critical swimming velocity.J. Strength Cond. Res.271731–1741. 10.1519/JSC.0b013e31828dde1e
138
TownsendN. E.NicholsD. S.SkibaP. F.RacinaisS.PériardJ. D. (2017). Prediction of critical power and W’ in hypoxia: application to work-balance modelling.Front. Physiol.8:180. 10.3389/fphys.2017.00180
- CrossRef
- Google Scholar
139
TsaiM.-C. (2015). Revisiting the Power-Duration Relationship and Developing Alternative Protocols to Estimate Critical Power Parameters.Doctoral dissertation, University of Toronto, Toronto.
- Google Scholar
140
USADA (2012). Report on Proceedings under the World Anti-Doping Code and the USADA Protocol.Colorado Springs, CO: USADA.
- Google Scholar
141
VandewalleH.VautierJ. F.KachouriM.LechevalierJ. M.MonodH. (1997). Work-exhaustion time relationships and the critical power concept – A critical review.J. Sports Med. Phys. Fitness3789–102.
- Pubmed Abstract
- Google Scholar
142
VanhataloA.BlackM. I.DiMennaF. J.BlackwellJ. R.SchmidtJ. F.ThompsonC.et al (2016). The mechanistic bases of the power–time relationship: muscle metabolic responses and relationships to muscle fibre type.J. Physiol.5944407–4423. 10.1113/JP271879
143
VanhataloA.DoustJ. H.BurnleyM. (2007). Determination of critical power using a 3-min all-out cycling test.Med. Sci. Sports Exerc.39548–555. 10.1249/mss.0b013e31802dd3e6
144
VanhataloA.FulfordJ.DiMennaF. J.JonesA. M. (2010a). Influence of hyperoxia on muscle metabolic responses and the power-duration relationship during severe-intensity exercise in humans: a 31P magnetic resonance spectroscopy study.Exp. Physiol.95528–540. 10.1113/expphysiol.2009.050500
145
VanhataloA.McNaughtonL. R.SieglerJ.JonesA. M. (2010b). Effect of induced alkalosis on the power-duration relationship of “all-out” exercise.Med. Sci. Sports Exerc.42563–570. 10.1249/MSS.0b013e3181b71a4a
146
VernecA. R. (2014). The Athlete Biological Passport: an integral element of innovative strategies in antidoping.Br. J. Sports Med.48817–819. 10.1136/bjsports-2014-093560
147
WakayoshiK.IkutaK.YoshidaT.UdoM.MoritaniT.MutohY.et al (1992). Determination and validity of critical velocity as an index of swimming performance in the competitive swimmer.Eur. J. Appl. Physiol. Occup. Physiol.64153–157. 10.1007/BF00717953
148
WeirJ. P. (2005). Quantifying test-retest reliability using the intraclass correlation coefficient and the SEM.J. Strength Cond. Res.19231–240.
- Google Scholar
149
WilliamsM. H.BranchJ. D. (1998). Creatine supplementation and exercise performance: an update.J. Am. Coll. Nutr.17216–234. 10.1080/07315724.1998.10718751
- CrossRef
- Google Scholar
150
WoodsG. F.DayL.WithersR. T.IlsleyA. H.MaxwellB. F. (1994). The dynamic calibration of cycle ergometers.Int. J. Sports Med.15168–171. 10.1055/s-2007-1021041
151
World Anti-Doping Agency (2014). Athlete Biological Passport: Operating Guidelines and Compilation of Required Elements.Montreal: World Anti-Doping Agency.
- Google Scholar
152
World Anti-Doping Agency (2017a). Athlete Biological Passport (ABP) Operating Guidelines. Available at: https://www.wada-ama.org/sites/default/files/resources/files/guidelines_abp_v6_2017_jan_en_final.pdf[accessed November 23, 2017].
- Google Scholar
153
World Anti-Doping Agency (2015). World Anti-Doping Code. Available at: https://www.wada-ama.org/sites/default/files/resources/files/wada-2015-world-anti-doping-code.pdf[accessed November 23, 2017].
- Google Scholar
154
World Anti-Doping Agency (2017b). What is Prohibited: World anti-doping agency. Available at: https://www.wada-ama.org/sites/default/files/resources/files/2016-09-29_-_wada_prohibited_list_2017_eng_final.pdf[accessed November 23, 2017].
- Google Scholar
155
WrightJ.Bruce-LowS.JobsonS. A. (2017). The reliability and validity of the 3-min all-out cycling critical power test.Int. J. Sports Med.38462–467. 10.1055/s-0043-102944
156
ZagattoA. M.PapotiM.GobattoC. A. (2008). Validity of critical frequency test for measuring table tennis aerobic endurance through specific protocol.J. Sports Sci. Med.7461–466.
- Pubmed Abstract
- Google Scholar
157
ZającA.WilkM.SochaT.MaszczykA.ChyckiJ. (2014). Effects of growth hormone and testosterone therapy on aerobic and anaerobic fitness, body composition and lipoprotein profile in middle-aged men.Ann. Agric. Environ. Med.21156–160.
- Pubmed Abstract
- Google Scholar
158
ZorzoliM.RossiF. (2010). Implementation of the biological passport: the experience of the international cycling union.Drug Test. Anal.2542–547. 10.1002/dta.173

Summary

Keywords

critical power model, W′ balance model, performance models, athletic performance, doping in sports, performance-enhancing substances, biomarkers, critical velocity

Citation

Puchowicz MJ, Mizelman E, Yogev A, Koehle MS, Townsend NE and Clarke DC (2018) The Critical Power Model as a Potential Tool for Anti-doping. Front. Physiol. 9:643. doi: 10.3389/fphys.2018.00643

Received

30 November 2017

Accepted

11 May 2018

Published

06 June 2018

Volume

9 - 2018

Edited by

Raphael Faiss, Université de Lausanne, Switzerland

Reviewed by

Emilia Ambrosini, Politecnico di Milano, Italy; Chris R. Abbiss, Edith Cowan University, Australia

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: David C. Clarke, dcclarke@sfu.ca

This article was submitted to Integrative Physiology, a section of the journal Frontiers in Physiology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

REVIEW article

The Critical Power Model as a Potential Tool for Anti-doping

Abstract

Introduction

Existing Doping Control Practices

The WADA Code: A Brief Overview

Challenges With Enforcing the Code: Direct Detection

Challenges With Enforcing the Code: Indirect Detection

Performance as a Marker for Doping Detection

Performance Models in Doping Detection

The CP Model: Basis, Physiology, and Current Applications

Definition of the Model

Procedures for Estimating the Model Parameters for an Athlete

Physiological Interpretations

Applications in Sport

Evaluation of the CP Model for Doping Detection: Promise and Challenges

Sensitivity of CP and W′ to Performance-Modifying Manipulations

Training

Environmental Variables

Ergogenic Aids

Accuracies of Model Parameter Estimates and Predictions of Performance

Accuracy of the Parameter Estimates

Accuracy of the Model Predictions

Precision and Reliability

Model Bias and Artifacts

Summary: Potential for and Limitations of the CP Model for Use in Doping Detection

Implementing the CP Model in Doping Detection: Recommendations Regarding Methodology and Future Research

Implementing the CP Model in Doping Detection: an Illustration

Critical Review of the W′bal Model

Conclusion

Statements

Author contributions

Funding

Conflict of interest

References

Summary

Outline

Figures

Cite article

Share article

Article metrics

Critical Review of the W′_bal Model