Evaluating Individual Level Responses to Exercise for Health Outcomes in Overweight or Obese Adults

Background Understanding group responses to a given exercise exposure is becoming better developed; however, understanding of individual responses to specific exercise exposures is significantly underdeveloped and must advance before personalized exercise medicine can become a functional reality. Herein, utilizing data from the STRRIDE studies, we address some of the key issues surrounding our efforts to develop better understanding of individual exercise responsiveness. Methods We assessed individual cardiometabolic and cardiorespiratory fitness responses in subjects successfully completing STRRIDE I (n = 227) and STRRIDE II (n = 155). Subjects were previously sedentary, overweight or obese men and women with mild-to-moderate dyslipidemia. Subjects were randomized to either an inactive control group or to an exercise training program. Training groups varied to test the differential effects of exercise amount, intensity, and mode on cardiometabolic health outcomes. Measures included fasting plasma glucose, insulin, and lipids; blood pressure, minimal waist circumference, visceral adipose tissue, and peak VO2. Absolute change scores were calculated for each subject as post-intervention minus pre-intervention values in order to evaluate the heterogeneity of health factor responsiveness to exercise training. Results For subjects completing one of the aerobic training programs, change in peak VO2 ranged from a loss of 37% to a gain of 77%. When ranked by magnitude of change, we observed discordant responses among changes in peak VO2 with changes in visceral adipose tissue, HDL-C, triglycerides, and fasting plasma insulin. There was also not a clear, direct relationship observed between magnitudes of individual response in the aforementioned variables with aerobic training adherence levels. This same pattern of highly variable and discordant responses was displayed even when considering subjects with adherence levels greater than 70%. Conclusion Our findings illustrate the unclear relationship between magnitude of individual response for a given outcome with training adherence and specific exercise exposure. These discordant and heterogeneous responses highlight the difficult nature of developing understanding for how individuals will respond to any given exposure. Further investigation into the biological, physiological, and genetics factors affecting individual responsiveness is vital to making personalized exercise medicine a reality.

Background: Understanding group responses to a given exercise exposure is becoming better developed; however, understanding of individual responses to specific exercise exposures is significantly underdeveloped and must advance before personalized exercise medicine can become a functional reality. Herein, utilizing data from the STRRIDE studies, we address some of the key issues surrounding our efforts to develop better understanding of individual exercise responsiveness.

Methods:
We assessed individual cardiometabolic and cardiorespiratory fitness responses in subjects successfully completing STRRIDE I (n = 227) and STRRIDE II (n = 155). Subjects were previously sedentary, overweight or obese men and women with mild-to-moderate dyslipidemia. Subjects were randomized to either an inactive control group or to an exercise training program. Training groups varied to test the differential effects of exercise amount, intensity, and mode on cardiometabolic health outcomes. Measures included fasting plasma glucose, insulin, and lipids; blood pressure, minimal waist circumference, visceral adipose tissue, and peak VO 2 . Absolute change scores were calculated for each subject as post-intervention minus preintervention values in order to evaluate the heterogeneity of health factor responsiveness to exercise training.
Results: For subjects completing one of the aerobic training programs, change in peak VO 2 ranged from a loss of 37% to a gain of 77%. When ranked by magnitude of change, we observed discordant responses among changes in peak VO 2 with changes in visceral adipose tissue, HDL-C, triglycerides, and fasting plasma insulin. There was also not a clear, direct relationship observed between magnitudes of individual response in the aforementioned variables with aerobic training adherence levels. This same pattern of highly variable and discordant responses was displayed even when considering subjects with adherence levels greater than 70%.
Conclusion: Our findings illustrate the unclear relationship between magnitude of individual response for a given outcome with training adherence and specific exercise

INTRODUCTION
Understanding the benefits of exercise at a group or population level has several purposes. At a population level -for the purposes of public health -understanding group responses and the variability around these responses provides information about cost-benefit and what health messages and initiatives should be provided to a population so as to be able to obtain the greatest benefit for the population as a whole. This is the goal of the recently released 2018 Physical Activity Guidelines for Americans (U.S Department of Health and Human Services, 2018) and the science that supports these guidelines (Physical Activity Guidelines Advisory and Committee, 2018).
Understanding the benefits of exercise at the individual level has other purposes. Assessing the heterogeneity of exercise training effects on health benefits at an individual level and being able to predict what benefits will accrue to an individual in response to a given exercise regimen may allow one to prescribe a specific program for a specific health benefit to accomplish one of the main goals of personalized exercise medicine -the right exercise regimen, for the right condition, for the right person, at the right time (Kraus, 2017). However, to use physical activity or exercise prescriptions to reduce risk for disease or treat a developing health problem, one needs to be able to predict with some precision what magnitude of response in health parameters one might expect to see at an individual level from any one of a number of different exercise regimens. We are now just beginning to develop these understandings to be able to approach this question with some precision; nonetheless, there is still some way to go (Buford et al., 2013;Ozemek and Arena, 2019).
Both intensity and amount effects of aerobic exercise contribute to improvements in cardiorespiratory fitness and in the various components of the metabolic syndrome (Duscha et al., 2005;Johnson et al., 2007;Bateman et al., 2011). At the group level, greater amounts of aerobic exercise drive improvements in HDL-cholesterol (Kraus et al., 2002), waist circumference  and visceral adiposity ; and intensity of aerobic exercise drives reductions in fasting insulin and triglycerides in a paradoxical fashion, whereby moderate intensity has a more beneficial effect than vigorous intensity when the same amount (energy expenditure) of exercise is performed (Kraus et al., 2002;Houmard et al., 2004). As the relationship of various characteristics of exercise to health improvements is multifaceted at the group level, the heterogeneity of response at the individual level introduces even greater complexity. Within an individual, the magnitude of favorable or adverse change in any given health parameter may bear an unpredictable relationship to favorable or adverse responses to the same exercise regimen for other equally important health parameters. The purpose of this paper is to discuss these concepts using a secondary analysis of data from the STRRIDE studies (Kraus et al., 2001;Slentz et al., 2011), including our understanding of the challenges and limitations of a personalized approach to lifestyle medicine as they currently exist.

Laboratory Measurements
Subjects underwent multiple laboratory measures at baseline and post-intervention. Body mass and height were assessed while subjects were wearing light clothing and no shoes. Blood pressure was measured at rest in a seated position. Waist circumference was measured at the minimal waist, which is the least circumference measurement obtained above the umbilicus and below the xiphoid (Willis et al., 2007). Computed tomography scans were performed by a radiological technologist and the images were analyzed using Slice-o-matic imaging software from Tomo Vision to determine the area of visceral adipose tissue .
Fasting blood samples were obtained from the beginning of an intravenous glucose tolerance test at baseline and 16-24 h after the final exercise bout. Fasting plasma glucose was determined via a YSI analyzer (Yellow Springs, OH, United States), and HDL-C as well as triglycerides were determined via nuclear magnetic resonance spectroscopy (LipoScience, Raleigh, NC, United States). Fasting plasma insulin was determined via immunoassay (Access Immunoassay System, Beckman Coulter, Fullerton, CA, United States).
Subjects completed graded maximal cardiopulmonary exercise tests on a treadmill with 12-lead electrocardiography and expired gas analysis (TrueMax Parvomedics; Provo, UT, United States). Within STRRIDE I and STRRIDE II, the same study staff members performed all tests. The graded treadmill test protocol consisted of 2-min stages, starting at 3 mph and 0% grade, then increased speed and/or grade by approximately one metabolic equivalent per stage until the subject reached volitional exhaustion. The two greatest, consecutive 15-s readings were averaged to determine peak VO 2 .

Exercise Training
For the aerobic training groups, subjects underwent an initial ramp period of 2-3 months in order to allow subjects to gradually adapt to their exercise prescription. The ramp period was followed by six additional months of training at the appropriate exercise prescription. The intensity of prescribed exercise was based on the individual baseline cardiopulmonary exercise test results. Aerobic exercise modes included treadmills, elliptical trainers, cycle ergometers, or any combination of these.
For the resistance training groups, the ramp period started with one set during weeks 1-2, two sets during weeks 3-4, and built up to the three set prescription on week 5. The prescription included three sessions per week (non-consecutive days), of three sets of 8-12 repetitions on eight Cybex weight-lifting machines. The resistance exercises were designed to target all major muscle groups. To ensure a progressive resistance training stimulus throughout the intervention, the amount of weight lifted was increased by 2.75 kg each time the subject performed 12 repetitions properly on all three sets during two consecutive workout sessions.
For the aerobic training groups, exercise intensity and duration for all exercise sessions were verified by direct supervision and/or with the use of downloadable heart rate monitors (Polar Electro, Woodbury, NY, United States). Aerobic training adherence was calculated for each subject as the number of minutes completed within the prescribed heart rate range, divided by the number of total weekly minutes prescribed. Resistance training sessions were verified by direct supervision and/or the FitLinxx Strength Training Partner (FitLinxx, Norwalk, CT, United States). The "training partner" automatically sent data from each session to the FitLinxx server computer. The computers recorded total weight lifted via laser weight plate detection, and total number of repetitions (which were only counted when subjects lifted through the full range of motion), and finally speed of weight lifting motion was monitored and alerts were given when subjects lifted too quickly.

Statistical Analyses
To further illustrate the heterogeneity of cardiometabolic response to exercise intervention, we assessed both group and individual level changes in the metabolic syndrome z-score in the STRRIDE I cohort. As described in a previous STRRIDE study , the metabolic syndrome z-score was used as a continuous score of the five metabolic syndrome variables. A modified z-score was calculated for each metabolic syndrome variable using individual subject data, the Adult Treatment Panel (ATP) III criteria (Grundy et al., 2005), and baseline standard deviations for the entire STRRIDE I cohort. To account for variations in ATP III criteria between men and women, we used sex-specific metabolic syndrome z-score equations. For all outcome variables, absolute change scores were calculated for each subject as the difference between the postminus pre-intervention values. We also calculated percent change for peak VO 2 as the absolute change score divided by the pre-intervention value. At the group level, paired t tests were performed to determine significant mean changes within groups; a p-value of 0.05 was used to indicate statistical significance (STATVIEW, SAS, Cary, NC, United States). To illustrate the individual variability of responses to exercise training, waterfall plots were created for peak VO 2 and metabolic syndrome z-score, in which each bar represents an individual subject's change score.
When assessing mean change at the group level in STRRIDE I , both the low amount/moderate intensity and high amount/vigorous intensity groups significantly decreased their metabolic syndrome z-scores following 8 months of exercise training, while the inactive control and low amount/vigorous intensity groups did not significantly change compared to baseline (Figure 1). However, when plotting individual change scores ordered by magnitude of change, all four groups display a broad range of responses, from negative to positive (Figure 2). As shown in Figure 2D, even the group with the greatest average improvement in metabolic syndrome z-score (−1.5 ± 1.6 points for the high amount/vigorous intensity group) included individuals who either had no significant improvement or even an increase in their metabolic syndrome z-score.
We also evaluated the heterogeneity of responsiveness of peak VO 2 in both STRRIDE I and STRRIDE II. By order of magnitude, we ranked all graduates who participated in an aerobic training group by percent change peak VO 2 . Across both cohorts, change in peak VO 2 ranged from a loss of 37.0% to a gain of 76.8%. We plotted these individual change scores for those with the greatest positive change in peak VO 2 (ranging from 25.2 to 76.8%; n = 29) with the corresponding training adherence in Figures 3A,B. We also plotted the individual change scores for the 29 subjects with the greatest negative to no response in peak VO 2 (ranging from −37.0 to −0.2%) and their corresponding training adherence (Figures 3C,D). To further explore the substantial variability in peak VO 2 response and confirm that the subjects performed ideal maximal exercise tests, we evaluated the peak respiratory exchange ratios (RER) achieved during the exercise tests to serve as an objective measure of maximal effort. For the 58 subjects listed in Figure 3, we found that pre-intervention peak RER values averaged 1.16 and the post-intervention peak RER values FIGURE 1 | Effects of amount and intensity of exercise on changes in metabolic syndrome z-score at the group level in STRRIDE I. Amt, amount; * represents p < 0.05 for within-group change with exercise training. Reprinted from Johnson et al. (2007) with permission from Elsevier, Copyright 2007. averaged 1.13. In addition, only five individual tests out of 116 tests had RER's below 1.0.
In addition, we assessed these 58 subjects with the most negative and positive peak VO 2 responses by their corresponding changes in the following cardiometabolic risk factors: visceral adipose tissue, HDL-C, triglycerides, and fasting plasma insulin. In both tables, change scores that were greater than the technical variation in the test in the favorable direction are shaded green. Change scores greater than the technical variation in an unfavorable direction are shaded in red. As illustrated in Tables 1, 2, there was discordance in response across all variables within individual subjects. For any given group of outcome measures, subjects having a negative response in one outcome variable did not necessarily have a negative response in the other risk factors. Taking together, the illustrations in Figure 3 and Tables 1, 2, there does not appear to be a clear, direct relationship between magnitude of individual response for a given outcome variable with aerobic training adherence nor specific training stimulus group. Even when considering only the subjects who had aerobic training adherence levels greater than 70%, the same pattern is observed: large heterogeneity and discordance of responses across a variety of important cardiometabolic health risk factors.

DISCUSSION
From our work in STRRIDE, we have several key observations bearing on the issues surrounding our efforts to develop better understanding of individual exercise responsiveness. First, hidden beneath the well-defined health benefits of exercise at the group level reported in exercise training studies -where groups may be defined as different modes, intensities, or amounts of exercise exposure -there is significant heterogeneity of response at an individual level.
Second, this heterogeneity of response is defined by the error bars routinely reported in our study figures (e.g., Figure 1). The heterogeneity of the response clearly can be appreciated in a series of waterfall plots of response by group (see Figure 2 for metabolic syndrome z-score). The fact that this heterogeneity is intrinsic to the individual biology or physiology of an individual, and not a reflection of exposure, can be seen when the individual responses are plotted over the individual intervention adherence metrics (Figure 3 for change in cardiorespiratory fitness, peak VO 2 ). Interestingly, even when we assessed the subjects with aerobic training adherence levels greater than 70%, we observed large variability and discordant responses across peak VO 2 , visceral adipose tissue, HDL-C, triglycerides, and fasting insulin (Tables 1, 2). At least part of the individual biology responsible for the individual heterogeneity of response is genetic (Sarzynski et al., 2016(Sarzynski et al., , 2017Ross et al., 2019). In the case of change in cardiorespiratory fitness in response to aerobic exercise, approximately 50% is genetic (Timmons et al., 2010;Bouchard, 2012). Work is just now beginning to define the genetic predictors of response to other elements of the metabolic syndrome (Sarzynski et al., 2016). That said, because they do not yet replicate well across study populations,  polygenic predictors of response for even change in peak VO 2 to exercise are not well developed enough to be useful clinically (Timmons et al., 2010). In fact, this is true for even the strongest polygenic risk predictors for obesity and cardiovascular disease, where the predictive capacity for the best polygenic risk for obesity predicts a mere 29.2% of cases (Emdin et al., 2017;Khera et al., 2019).
Third, this heterogeneity is expected in continuous variables and is characteristic of a Gaussian distribution of responses across a population. The tails of this distribution are defined as being outside two standard deviations of the mean response (positive or negative), where 5% of the population will be at the tails of the response. When this extreme distribution is wide, and the lower tail crosses the technical variation of the Each row represents an individual subject's data; Vig/Low, vigorous intensity and low amount of aerobic exercise training; Mod/Low, moderate intensity and low amount; Vig/Low/Resistance, vigorous intensity and low amount aerobic plus resistance exercise training; Vig/High, vigorous intensity and high amount; Delta VAT, change in visceral adipose tissue; Delta HDL-C, change in high-density lipoprotein-cholesterol; Delta TG, change in triglycerides; Delta FI, change in fasting insulin.
health outcome variable, it can be termed an adverse or negative response . When the intervention exposure is modest or weak, even a greater number may be characterized as non-responders; however, this is more a characteristic of the exposure, not the individual. As an example, a recent study evaluated individual patterns of fitness response in a randomized, cross-over study in 21 healthy young adults comparing two different exercise exposures: 3 weeks of endurance cycling versus 3 weeks of sprint interval cycling (Bonafiglia et al., 2016). Both training protocols elicited similar main group effects, while the individual responses varied widely and the patterns of individual response differed by protocol. Although the study assessed the initial response to short-term training, the results support the hypothesis that individuals who do not favorably respond to a given exercise exposure may undergo more beneficial fitness responses to a different exercise exposure. However, we do not know whether these results would extend to longer duration training interventions and populations with additional risk factors, like the present study includes. Fourth, the response of a given health parameter to a given exercise regimen for any given individual does not predict the response to a related health variable to the same intervention. As illustrated in Tables 1, 2, there is little correlation between cardiorespiratory fitness responses and some elements of the metabolic syndrome. For example, a high responder in cardiorespiratory fitness does not necessarily mean any given individual will be a high responder in metabolic syndrome or the constituent elements thereof (HDL-C, triglycerides, fasting insulin or visceral adiposity; Table 2). Similarly, a high responder in one element of the metabolic syndrome (e.g., HDL-C) is not necessarily a high responder, or responder at all, in the other elements of the metabolic syndrome. Fifth, there is heterogeneity in the health outcome responses to the components of exercise exposure (intensity, amount, frequency, and mode). At the group level, both intensity and amount influence improvements in cardiorespiratory fitness in response to aerobic exercise training, while intensity influences responses in fasting insulin, triglycerides, and metabolic syndrome score in an opposite direction to that of how it influences cardiorespiratory fitness: greater intensity when controlling for amount leads to greater improvements in cardiorespiratory fitness; whereas, the converse is true (less intense exercise training controlling for amount) for fasting insulin, triglycerides, and metabolic syndrome score at a group level (Figure 1).
Sixth, in addition to the biological and physical contributors to the heterogeneity of the response of health outcomes to exercise training, there are biological -in fact, genetic -predictors of exercise and physical activity, even in humans (de Geus et al., 2014). The identification of such genetic variants may promote better targeted and individualized messaging for individuals less likely from a constitutional standpoint to engage in favorable physical activity behaviors. For instance, using a candidate gene approach, we identified a genetic variant in the acid ceramidase gene predicting non-completion -or drop-out -from our STRRIDE studies (Lewis et al., 2018). As has been demonstrated for other lifestyle behaviors such as smoking and nutrition (e.g., coffee consumption) (Cornelis et al., 2011;Gustavsson et al., 2012;Shenker et al., 2013;Matoba et al., 2019), this result provides evidence for the presence of biological predictors of exercise adherence. We are now underway in attempts to identify other genetic variants predicting exercise behavior in human STRRIDE subjects using a genome-wide approach.

Limitations
We recognize some limitations of our study. In STRRIDE I and II, subjects were allowed to perform aerobic training on a combination of aerobic equipment, while exercise testing occurred solely on a treadmill. We recognize that cross-training effects may have occurred, which can lead to bias in the exercise testing results; however, we believe the aerobic training stimuli from treadmill and elliptical trainer use, which were the primary modes of aerobic training, should yield similar peak VO 2 responses. We also recognize that the STRRIDE studies were not designed to specifically investigate individual-level exercise training responsiveness. Thus, the current paper is a secondary analysis highlighting the observed heterogeneity of responses.

CONCLUSION
There are inherent individual biological, physiological, and genetics factors involved in determining the responses in a myriad of health parameters to exercise training of a given mode, intensity and amount. Indeed, not only is there heterogeneity at the individual level to exercise-induced health responses and behaviors, there is also heterogeneity at a group level for specific health parameters to exercise programs that differ in mode, intensity and amount. All of this heterogeneity makes predicting, at an individual level, favorable responses to exercise training inherently difficult, but not impossible. Developing more information in this arena may eventually make personalized exercise medicine a reality, permitting us not only to suggest exercise programs of a given mode, intensity or amount for a particular health condition, but also to target intensive behavioral reinforcement strategies for exercise to those most likely to be poorly adherent, or who avoid exercise altogether.

DATA AVAILABILITY STATEMENT
The datasets analyzed in this manuscript are not publicly available. Requests to access the datasets should be directed to william.kraus@duke.edu.

ETHICS STATEMENT
The studies involving human participants were reviewed and approved by Duke University and ECU Institutional Review Boards. The patients/participants provided their written informed consent to participate in this study.

AUTHOR CONTRIBUTIONS
LR conceived the project, developed the data, and assisted in writing the manuscript. CS collected the data, synthesized the data, and assisted in editing the manuscript. WK conceived the project, obtained funding for the project, collected the data, and wrote the manuscript. All authors contributed to data analysis, interpretation of results, figure preparation, drafting and editing the manuscript, and approved the final version of the manuscript. FUNDING STRRIDE I and II were funded by NHLBI grant HL-057354. LR was supported by NHLBI fellowship T32HL007101.