Abstract
The search for the genetic foundation of multiple sclerosis (MS) severity remains elusive. It is, in fact, controversial whether MS severity is a stable feature that predicts future disability progression. If MS severity is not stable, it is unlikely that genotype decisively determines disability progression. An alternative explanation tested here is that the apparent instability of MS severity is caused by inaccuracies of its current measurement. We applied statistical learning techniques to a 902 patient-years longitudinal cohort of MS patients, divided into training (n = 133) and validation (n = 68) sub-cohorts, to test four hypotheses: (1) there is intra-individual stability in the rate of accumulation of MS-related disability, which is also influenced by extrinsic factors. (2) Previous results from observational studies are negatively affected by the insensitive nature of the Expanded Disability Status Scale (EDSS). The EDSS-based MS Severity Score (MSSS) is further disadvantaged by the inability to reliably measure MS onset and, consequently, disease duration (DD). (3) Replacing EDSS with a sensitive scale, i.e., Combinatorial Weight-Adjusted Disability Score (CombiWISE), and substituting age for DD will significantly improve predictions of future accumulation of disability. (4) Adjusting measured disability for the efficacy of administered therapies and other relevant external features will further strengthen predictions of future MS course. The result is a MS disease severity scale (MS-DSS) derived by conceptual advancements of MSSS and a statistical learning method called gradient boosting machines (GBM). MS-DSS greatly outperforms MSSS and the recently developed Age Related MS Severity Score in predicting future disability progression. In an independent validation cohort, MS-DSS measured at the first clinic visit correlated significantly with subsequent therapy-adjusted progression slopes (r = 0.5448, p = 1.56e−06) measured by CombiWISE. To facilitate widespread use of MS-DSS, we developed a free, interactive web application that calculates all aspects of MS-DSS and its contributing scales from user-provided raw data. MS-DSS represents a much-needed tool for genotype-phenotype correlations, for identifying biological processes that underlie MS progression, and for aiding therapeutic decisions.
Introduction
Multi-centric genetic consortia identified more than 200 multiple sclerosis (MS) susceptibility genes, but validated no gene variants associated with MS severity (, ). The aggressiveness of a patient’s MS course can be measured by the MS Severity Score (MSSS) (), which relates Expanded Disability Status Scale (EDSS) () to MS disease duration (DD). A newer version of the EDSS-based MSSS, called the Age Related MS Severity Score (ARMSS), substitutes age for DD (). However, it is unclear if MS severity represents a stable disease phenotype (). Consequently, our ability to provide reliable prognostic information to patients is limited.
Large natural history studies have shown that past MS course does not predict future rate of disability accumulation once a patient reaches an EDSS of 6 (, ). This suggests that MS severity is either not a stable feature or represents a stable phenotype only for low-to-moderate disability levels. This understanding is essential because only stable features can be successfully used as outcomes for statistical learning techniques to identify genes, biomarkers, and/or biological processes that hasten MS progression.
Consequently, the purpose of this longitudinal study was to determine if MSSS (or ARMSS) can reliably predict future rates of disability accumulation measured by EDSS, and, if they cannot, to develop and validate a model that more reliably predicts MS severity for both clinical and research applications.
Materials and Methods
Patients
The study was approved by the Institutional Review Board and all patients signed an informed consent. Participants were selected from the prospectively acquired natural history protocol 09-N-0032 (Clinicaltrials.gov Identifier NCT00794352): “Comprehensive multimodal analysis of patients with neuroimmunological diseases of the CNS” based on the following criteria: (1) confirmed diagnosis of MS and (2) longitudinal follow-up consisting of three or more clinic visits with recorded CombiWISE values spanning ≥1 year or two or more clinic visits with recorded CombiWISE values spanning ≥3 years. We excluded clinical visits associated with MS exacerbation because these would overestimate the amount of sustained disability progression, as a large proportion of exacerbation-related disability is reversible. The mean length of follow-up was 4.49 ± 2.90 years. All MS subjects fulfilling these criteria as of 5/26/2017 were included in this analysis. Pertinent inclusion criteria for 09-N-0032 protocol are (1) presentation with a clinical syndrome consistent with an immune-mediated central nervous system (CNS) disorder, (2) neuroimaging evidence of inflammatory and/or demyelinating CNS disease, and (3) 12–75 years of age at the time of enrollment. The final diagnosis was assigned after thorough diagnostic work-up as described (). Patient demographics and other clinical characteristics are provided in Table 1.
Table 1
| Training (N = 133) | Validation (N = 68) | Total (N = 201) | |
|---|---|---|---|
| Demographics | |||
| Age (years) | |||
| Mean ± SD | 48.5 ± 12.0 | 49.0 ± 10.9 | 48.6 ± 11.6 |
| Median | 49.2 | 49.9 | 49.5 |
| Range | 24.0–67.8 | 23.2–70.3 | 23.2–70.3 |
| Gender, no. of patients (%) | |||
| Female | 66 (49.6) | 34 (50.0) | 100 (49.8) |
| Male | 67 (50.4) | 34 (50.0) | 101 (50.2) |
| Race, no. of patients (%) | |||
| White | 116 (87.2) | 60 (88.2) | 176 (87.6) |
| Other | 17 (12.8) | 8 (11.8) | 25 (12.4) |
| Smoking history, no. of patients (%) | |||
| Yes | 80 (60.2) | 41 (60.3) | 121 (60.2) |
| No/unknown | 53 (39.8) | 27 (39.7) | 80 (39.8) |
| Family history of multiple sclerosis (MS), no. of patients (%) | |||
| No | 87 (65.4) | 48 (70.6) | 135 (67.2) |
| Mild (1 distant relative with MS) | 22 (16.5) | 6 (8.8) | 28 (13.9) |
| Moderate (>1 distant relative with MS) | 7 (5.3) | 3 (4.4) | 10 (5.0) |
| Strong (first degree relative with definite MS) | 10 (7.5) | 6 (8.8) | 16 (8.0) |
| Unknown | 7 (5.3) | 5 (7.4) | 12 (6.0) |
| Other clinical characteristics | |||
| Age at disease onset (years) | |||
| Mean ± SD | 36.5 ± 11.6 | 38.5 ± 11.1 | 37.1 ± 11.4 |
| Median | 35.9 | 39.4 | 37.2 |
| Range | 7.9–63.7 | 16.5–60.1 | 7.9–63.7 |
| Disease duration (years) | |||
| Mean ± SD | 12.0 ± 10.2 | 10.5 ± 9.2 | 11.5 ± 9.9 |
| Median | 9.4 | 7.7 | 8.7 |
| Range | 0.1–39.5 | 0.1–42.3 | 0.1–42.3 |
| EDSS | |||
| Mean ± SD | 4.0 ± 2.3 | 3.9 ± 2.2 | 3.9 ± 2.2 |
| Median | 4 | 3 | 4 |
| Range | 0.0–7.5 | 0–6.5 | 0.0–7.5 |
| MSSS | |||
| Mean ± SD | 5.1 ± 2.6 | 5.2 ± 2.4 | 5.1 ± 2.5 |
| Median | 5.6 | 5.0 | 5.4 |
| Range | 0.1–9.8 | 0.8–9.7 | 0.1–9.8 |
| CombiWISE | |||
| Mean ± SD | 31.0 ± 18.3 | 29.2 ± 16.0 | 30.4 ± 17.5 |
| Median | 32.7 | 25.9 | 30.2 |
| Range | 1.6–79.5 | 3.9–53.1 | 1.6–79.5 |
| COMRIS-CTD* | |||
| Mean ± SD | 12.9 ± 6.5 | 12.7 ± 6.1 | 12.8 ± 6.3 |
| Median | 12.8 | 12.9 | 12.9 |
| Range | 0.0–29.9 | 2.1–28.5 | 0.0–29.9 |
Patient demographics and other clinical characteristics.
Data are mean ± SD, median, and range or n (%). Characteristics labeled with an (*) were calculated by excluding missing data. Percentages may not sum to 100 due to rounding.
EDSS, Expanded Disability Status Scale; MSSS, MS Severity Score; CombiWISE, Combinatorial Weight-Adjusted Disability Score; COMRIS-CTD, Combinatorial MRI Score of CNS tissue destruction.
Data Collection
All data were prospectively acquired and entered into the research database. Clinicians acquired the EDSS () and Scripps Neurological Rating Scale (SNRS) (), while a different set of investigators acquired the Timed 25-Foot Walk (T25FW) and 9-Hole Peg Test (9HPT) scores. From these data, the research database automatically computed the CombiWISE score based on the published formula (): where log2(T25FW) is the logarithm (base 2) of the T25FW, T25FWFAIL is binary (0 for completed test, 1 for failed test), log2(NDH-9HPT) is the logarithm (base 2) of the non-dominant hand 9HPT, and NDHFAIL is binary (0 for completed test, 1 for failed test). Prospectively acquired MRI scans were semi-quantitatively rated according to the published protocol () and grades were entered into the research database, which automatically calculates Combinatorial MRI Scale of CNS Tissue Destruction (COMRIS-CTD) as described ().
While current disease modifying therapies (DMTs) were entered into the database prospectively at the time of each clinical visit, detailed history of past DMTs, smoking, and family history were gathered from prospectively acquired patient questionnaires and clinic notes. When such detailed history data were missing, an investigator obtained them retrospectively over the phone or during a follow-up clinic visit. All data underwent weekly quality control (QC), after which the data were locked in the database.
For analysis, QC’d data were exported into Excel and then imported into the statistical software R v3.3.1 (RStudio v1.0) (, ). To ensure that both training and validation cohorts were representative of the underlying population, we used the following methods to construct training and validation datasets: patients were first separated based on their race (Caucasian and non-Caucasian) and gender (male and female). Within each of these four groups, patients were binned according to whether they were above or below the median measurements for therapy-adjusted CombiWISE/age (derived from data collected at the first clinic visit; see definition later in Section “Materials and Methods”) and age at first clinic visit. Within each of these 12 groupings, approximately two-thirds of patients were randomly selected as a training dataset, leaving the remaining one-third of patients as a validation dataset. This procedure balanced the distribution of the main demographic features and disability between the training and validation cohorts.
Adjusting for Therapeutic Efficacy
In a previous study (), we performed a meta-analysis of 38 randomized, blinded clinical trials of immunomodulatory therapies in MS to test the hypothesis that efficacy of current immunomodulatory DMTs on disability progression decreases with age. After fitting a weighted regression with an interaction term, treatment efficacy was found to be highly associated with age, with differences in outcome (inhibition of disability progression) dependent on drug classification into high- or low-efficacy categories [R2 (coefficient of determination) = 0.6757, p-value for interaction <0.0001].
Therefore, to adjust measured disability values (i.e., CombiWISE) and their longitudinal change for the average age-adjusted therapeutic efficacy of administered DMTs, we used the weighted linear regression from the previously described meta-analysis. We used the following strategies for adjustment: 1. cross-sectional disability data collected at first clinic visit and utilized for predicting future disability course were adjusted for all past treatments received before the first National Institutes of Health (NIH) clinic visit. We applied an identical strategy for adjusting cross-sectional disability data measured at the last clinic visit to determine whether MS disease severity scale (MS-DSS) at the last clinic visit also correlated with measured longitudinal data. 2. We then measured progression slopes derived from disability measurements obtained at all visits—this represented the “future” disability progression we wanted to predict from single-visit data. Because most patients were treated during this longitudinal follow-up, we also had to adjust these “future” progression slopes for de facto “future” treatments. Mathematically, the process of adjustment for therapeutic efficacy can be described as follows.
First, we calculated the cumulative efficacy across the entire treatment period of duration tn − t0, using the following formula: where t0 is the age at the first visit, tn is the age at the last visit, and each term Effi represents therapeutic efficacy generated from the following linear models: where the term (ti + ti-1)/2 is an average of two time points. The dichotomization of drugs into low- versus high-efficacy was described in the meta-analysis () and will be also mentioned in the results. All negative values for efficacy were assumed to be 0.
Measured progression slopes may be positive (i.e., patient accumulates disability) or negative (i.e., patient is repairing previously accumulated disability). If a patient remained untreated, then the measured progression slope quantifies the true progression slope. However, if a treated patient continued to accumulate clinical disability, then the true progression without therapy would be equivalent to shifting the measured slope upward by a magnitude that corresponds to the efficacy of the administered DMT [i.e., Eff(t0, tn)]. This, in effect, increases the rate of progression to provide an estimate of expected disease accumulation without therapy. Therefore, adjusted CombiWISE slopes were calculated by adding the measured CombiWISE slopes (fit by simple linear regression) to the percent adjustment due to efficacy, as follows:
Solving for the adjusted CombiWISE slope gives
For patients who improved while taking DMT, the true progression without treatment would be equivalent to shifting the negative measured slope upward by a magnitude, Eff(t0, tn), of the absolute value of the measured slope. Since the slope is negative, this is equivalent to subtracting the product of the measured slope and efficacy and, in effect, decreases the expected rate of improvement to model disease accumulation without therapy. Therefore, the adjusted CombiWISE slopes were similarly calculated by subtracting the percent adjustment due to efficacy from the measured CombiWISE slopes, as follows:
Factoring the right-hand side gives
We, then, adjusted the first measured CombiWISE, CWmeasured (t0), and last measured CombiWISE, CWmeasured (tn), upward, as follows: and where Eff(0, t0) represents the total cumulative efficacy from birth to the first clinic visit, and Eff(0, tn) represents the total cumulative efficacy from birth to the last clinic visit. Functions within the R script were used to generate all therapy adjustments. The Supplementary Material contains the utilized R code in its entirety.
Optimization of MS-DSS by Statistical Learning
A Gradient Boosting Machine [GBM (, )] was used to develop a model predicting therapy-adjusted CombiWISE Slopes from baseline covariates collected during a patient’s initial visit. GBM’s are sequentially built using regression trees. The current tree is built using residuals from the previous tree’s predictions, and the predictions are iteratively updated by adding the current tree’s prediction (times a shrinkage parameter between 0 and 1) to the previous tree’s prediction. For each tree constructed, a random sample containing half of the observations is withheld from the training cohort to introduce randomness into the modeling process (called the out-of-bag or OOB sample). The main tuning parameters for a GBM are the depth of the individual trees (sometimes called the interaction depth, typically between 1 and 4), the shrinkage parameter (typically small, less than 0.1), and the number of trees. Using the gbm R package (), we selected an interaction depth of 2, a conservative shrinkage parameter of 0.001, and used a five-fold cross validation to select the number of trees. The relative influence of each of the variables was computed by examining the improvement in squared-error from splits within each individual tree and averaging these improvements across all trees in the ensemble.
Results
MSSS and ARMSS Do Not Predict Development of Future Disability Measured with EDSS
We calculated patient-specific linear regression slopes (“progression slopes”) from clinical scales (EDSS and CombiWISE) collected at each clinic visit. EDSS measured during the first clinic visit was then used to calculate MSSS () and ARMSS () as described.
To determine whether MSSS predicts future disease severity measured by EDSS progression slopes, we assessed the correlation between patient-specific MSSS derived from first visit data and EDSS progression slopes derived from subsequent longitudinal follow-up. The resulting correlation in the full cohort (n = 201) was poor and not statistically significant (r = 0.0301, p = 0.672; Figure 1A). A poor predictive result was also observed for ARMSS, which ranks EDSS based on age (r = 0.0504, p = 0.478; Figure 1B).
Figure 1
We hypothesized that these null results were due to the insensitivity of EDSS to correctly measure MS progression in intervals shorter than 10 years. To test this hypothesis, we investigated whether MSSS and ARMSS predict future progression slopes using a more sensitive and continuous scale, CombiWISE (
CombiWISE Progression Slopes Explain a Larger Proportion of Variance than EDSS Progression Slopes
While EDSS and CombiWISE progression slopes are strongly correlated (r = 0.7768, p < 2.2e−16; Figure 2A), most patients with EDSS progression slopes of 0 (i.e., EDSS “stable” patients highlighted in blue) had non-zero CombiWISE progression slopes (Figure 2B). Of these patients, two-thirds progressed in CombiWISE, as demonstrated by the positively skewed distribution of CombiWISE progression slopes for patients with unchanged EDSS (Figure 2B).
Figure 2

CombiWISE measures multiple sclerosis (MS) progression more accurately than Expanded Disability Status Scale (EDSS). (A) Patient-specific progression rates measured by EDSS and CombiWISE regression slopes strongly correlate. (B) Frequency distribution of measured CombiWISE slopes for patients with no reported change in EDSS [highlighted blue points in (A)]. The histogram demonstrates the insensitivity of EDSS; patients can progress as much as 4 CombiWISE units/year (or improve by up to 2 CombiWISE units/year) with no change in EDSS. (C) Boxplots depict the proportion of variance explained for CombiWISE versus EDSS regressions. The median is indicated with a solid black line, and the mean is indicated with a dashed blue line. The bottom and top of the box denotes the first and third quartiles, respectively. The whiskers indicate the 10th and 90th percentiles. A two-tailed paired t-test indicated strong evidence of a mean difference of 0.1357 (95% CI, 0.089–0.182) in R2 values between the CombiWISE and EDSS regressions. (D) Histogram of difference in R2 values between CombiWISE and EDSS regressions suggests that the values are relatively normally distributed. (E) Representative patient examples illustrate higher accuracy of linear regressions (solid blue lines with dashed 95% CI; upper panels) derived from longitudinal CombiWISE measurements (no fill circles; upper panels) in comparison to linear regressions (solid black lines; lower panels) derived from longitudinal EDSS measurements (black filled circles; lower panels) from the same three MS patients.
The proportion of variance explained (R2) reflects how well the regression lines, as a function of time, explain the measured responses for each patient. The full cohort data (Figure 2C) demonstrated unequivocal superiority of CombiWISE over EDSS in its ability to reliably measure MS disability progression slopes. A higher proportion of variance was explained on average for CombiWISE than for EDSS (Figure 2C; mean of differences = 0.1357, 95% CI = [0.089, 0.182], p = 2.84e−08). Figure 2D shows the distribution of the pairwise differences, reinforcing the improved representation of the changes over time possible when using CombiWISE as a response over EDSS. Representative examples of individual patients (Figure 2E) provide visual clues as to why CombiWISE is a better scale. In the second example, a representative patient with stable disease based on EDSS (i.e., EDSS slope of 0) had a clear linear progression of disability measured by CombiWISE (Figure 2E, middle panels).
Substituting Measured CombiWISE/Age for MSSS Improves Predictive Power
It is impossible to measure MS onset and, therefore, disease duration (DD), with high precision. Since, in natural history cohorts, MS patients reached major disability milestones around the same age (and not around the same DD) (
Indeed, measured CombiWISE/age outperformed both MSSS and ARMSS in predicting future CombiWISE progression [MSSS: r = 0.2821, p = 4.97e−05 (Figure 1C); ARMSS: r = 0.2948, p = 2.15e−05 (Figure 1D); measured CombiWISE/age: r = 0.3093, p = 7.90e−06 (Figure 3A)].
Figure 3

Therapy-adjusted CombiWISE/age is a better predictor of future disability than measured CombiWISE/age. (A) Patient-specific CombiWISE progression slopes moderately correlate with cross-sectional CombiWISE/age measured at the first clinic visit. CombiWISE values were calculated based on a published formula (
Adjusting Disability Measures for the Effects of Administered Treatments
Although using cross-sectional CombiWISE/age values strengthened correlations with subsequent CombiWISE slopes, this model could still explain only a small proportion of the variance in these slopes. Therefore, we sought other sources of error that could potentially be eliminated. One apparent source of error involves disregarding the effects of DMTs, as clinical trials convincingly demonstrated that (at least some) DMTs inhibit rates of disability progression.
We recently performed a meta-analysis of all randomized, blinded clinical trials (38 trials, >28,000 MS patients) of immunomodulatory MS drugs that reported efficacy on disability progression (
Therapy-Adjusted CombiWISE/Age Better Predicts Therapy-Adjusted CombiWISE Slopes
Conceptually, to adjust measured CombiWISE progression slopes, we divided each treatment period into therapeutically homogeneous segments. For example, if a patient received two treatments separated by a period of no therapy, the longitudinal follow-up would be divided into three segments (Figure 3B). We computed the patient’s mean age as an average of the initiation and termination ages within each treatment period and then used this age as input for Eqs 3–5. For untreated patients, the therapeutic efficacy was assumed to be 0 (Eq. 3). By averaging the therapeutic efficacies within each treatment period proportional to the duration of each treatment period (Figure 3B), we derived a cumulative efficacy for the entire longitudinal follow-up. The measured progression slope was then adjusted by this patient-specific cumulative efficacy to derive a “therapy-adjusted CombiWISE Slope,” which represents a predicted progression slope had the patient remained untreated.
Similarly, to adjust cross-sectional CombiWISE scores obtained at the first or last clinic visit for all preceding treatments, we calculated the cumulative efficacy of past treatments. Because exact MS onset cannot be determined for most patients, we used time from birth to the time of cross-sectional visit for these adjustments.
Using therapy-adjusted data improved the correlation between cross-sectional CombiWISE/age and subsequent longitudinal progression slopes from measured values (r = 0.3093, p = 7.90e−06; Figure 3A) to therapy-adjusted values (r = 0.3468, p = 4.55e−07; Figure 3C).
Using Statistical Learning with Known and Newly Developed Variables to Derive Optimized MS-DSS Model
Finally, we asked whether we could further enhance the predictive power of future disability progression from a single cross-sectional measurement by using unbiased statistical learning. Because statistical learning uses measured data to obtain new knowledge, the validity of a model derived from such learning is best assessed on a new dataset not utilized for the modeling. This ensures that the model is generalizable to an entire population and not limited to the sample on which the model was trained. Thus, we split the cohort of 201 MS patients into training (n = 133) and independent validation (n = 68) cohorts using stratification based on therapy-adjusted CombiWISE/age (at first visit), age (at first visit), race, and gender to ensure that both training and validation cohorts were representative of the underlying population.
In addition to variables that were previously hypothesized or shown to affect disability progression such as gender, smoking, and race, we selected new features for statistical learning based on clinical knowledge. We hypothesized that the following features may negatively influence recovery from CNS lesions and, therefore, speed up accumulation of disability: age and the amount of CNS tissue destruction reflected by baseline disability (measured by CombiWISE) and/or quantified by MRI using published Combinatorial MRI Scale of CNS Tissue Destruction [COMRIS-CTD (
We selected GBM for statistical learning, which can handle observations with missing predictors, and are known for their excellent predictive ability (
The GBM model greatly outperformed simple linear regression based on therapy-adjusted CombiWISE/age, achieving a correlation of r = 0.6589, p = 6.62e−18 in the training cohort (Figure 4A).
Figure 4

MS disease severity scale (MS-DSS) (at first clinic visit) is a strong predictor of future disability measured with therapy-adjusted CombiWISE slopes. (A) Cross-sectional MS-DSS measured at the first clinic visit strongly correlates with longitudinal therapy-adjusted CombiWISE regression slopes in the training cohort. (B) Relative influence of individual features in MS-DSS model. The Gradient Boosting Machine assigned importance to the following variables measured at the first clinic visit: therapy-adjusted CombiWISE/age, measured CombiWISE, COMRIS-CTD, time from disease onset to first therapy, difference between adjusted CombiWISE and unadjusted CombiWISE, age, and family history of MS. Known dichotomous classifiers of MS such as gender, race (Caucasian or non-Caucasian), and smoking history were determined to have little if any influence on model results. (C) Cross-sectional MS-DSS measured at the first clinic visit also strongly correlates with longitudinal therapy-adjusted CombiWISE regression slopes in the validation cohort. CombiWISE values were calculated based on a published formula (
As demonstrated in Figure 4B, the GBM trained on a training cohort (n = 133) assigned higher influence to new variables, such as measured CombiWISE, COMRIS-CTD, and difference between adjusted CombiWISE and unadjusted CombiWISE, than to known dichotomous modifiers of MS disease course, such as gender, smoking, or race. Family history of MS contributed only marginally to the MS-DSS model. Although age also had only marginal importance in the model, one should not forget that age is also present in the most influential variable in the model, therapy-adjusted CombiWISE/age, and in the mathematical adjustments of administered treatments. To simplify the MS-DSS model, we retrained the GBM model on a reduced set of covariates, including all variables determined to have relative influence above 4.0 and excluding all those with little to no relative influence. The final reduced model included the following list of variables, all measured at the first clinic visit: therapy-adjusted CombiWISE/age, measured CombiWISE, COMRIS-CTD, time from disease onset to first therapy, difference between adjusted CombiWISE and unadjusted CombiWISE, age, and family history of MS. This final model achieved a correlation of r = 0.5448, p = 1.56e−06 in an independent validation cohort (Figure 4C).
Data Required to Compute MS-DSS Can Be Obtained Post Hoc and Used to Accurately Predict MS Severity
While predicting future MS course is important in clinical applications, reliable measurement of MS severity is absolutely required in scientific efforts to identify biological processes and genetic modifiers that determine the speed of CNS tissue destruction. Our primary goal in MS-DSS development was to facilitate such studies. However, most historical cohorts with stored biological samples lack the data necessary for calculating CombiWISE or MS-DSS. It is likely that a significant portion of patients from these historical cohorts can be called back for a comprehensive follow-up visit that would allow collection of all data necessary for computing therapy-adjusted CombiWISE/age. Therefore, we asked how well both therapy-adjusted CombiWISE/age and MS-DSS computed at the last clinic visit correlated with prospectively acquired clinical data and whether both measures outperformed MSSS in this regard (Figure 5).
Figure 5

Therapy-adjusted CombiWISE/age and MS disease severity scale (MS-DSS) (at last clinic visit) are strong predictors of prior disability measured with therapy-adjusted CombiWISE slopes. (A) Cross-sectional MS Severity Score (MSSS) measured at the last clinic visit moderately correlates with longitudinal Expanded Disability Status Scale (EDSS) regression slopes. (B,C) The correlation markedly improves when using both cross-sectional therapy-adjusted CombiWISE/age (B) and subsequent MS-DSS (C) to predict therapy-adjusted CombiWISE regression slopes. Results are shown separately for the training cohort (Left panels; n = 133) and validation cohort (Right panels; n = 68). MSSS and CombiWISE values were calculated using published formulas (
In both the training cohort and the validation cohort, MSSS measured at the last clinic visit had a moderate and significant correlation with previous disease progression measured by EDSS (Training: r = 0.3205, p = 1.70e−04; Validation: r = 0.3595, p = 0.003; Figure 5A). However, the correlation between therapy-adjusted CombiWISE/age and therapy-adjusted CombiWISE slopes markedly improved (Training: r = 0.5556, p = 3.85e−12; Validation: r = 0.6651, p = 6.14e−10; Figure 5B). MS-DSS also correlated strongly with therapy-adjusted CombiWISE slopes (Training: r = 0.6101, p = 6.35e−15; Validation: r = 0.6825, p = 1.45e−10; Figure 5C).
Interactive Web Application for Computing MS Scales
In previous manuscripts (
Figure 6

Dashboard (home page) within Shiny web application. Dashboard page introduces users to the application and allows them to monitor the status of each menu by dynamically updating the number of patient entries. Users navigate between four submenus (COMRIS (
Figure 7

COMRIS submenu within the Shiny web application. This submenu allows the user to calculate four COMRIS values (COMRIS-CTD, COMRIS-PDO-EDSS, COMRIS-PDO-SNRS, and COMRIS-CDO-SDMT) by semi-quantitative grading of 9 MRI categories from 1.5T or 3T MRI. Each drop-down menu allows the user to select the lesion load or level of atrophy necessary to grade the MRI. Users who are unfamiliar with COMRIS can click the help menu beside each drop down for a brief description of the grading procedure and a link to the published article (
Figure 8

CombiWISE submenu within the Shiny web application. This submenu allows the user to input values for four scales [Expanded Disability Status Scale (EDSS), SNRS, Timed 25-Foot Walk (T25FW), and 9-Hole Peg Test (9HPT)] to compute CombiWISE (
Figure 9

Therapy-adjusted CombiWISE/age submenu within the Shiny web application. This submenu allows the user to enter the patient’s medication history as needed for calculation of MS disease severity scale (MS-DSS) in the final tab. The medication history can be edited by adding or removing medications or amending the dates of administration. The list of medications only includes those that fulfill criteria for low- or high-efficacy as detailed in a recent meta-analysis of randomized, blinded clinical trials of immunomodulatory therapies in MS (
Figure 10

MS disease severity scale (MS-DSS) submenu within Shiny web application. This submenu allows the user to compute MS-DSS by entering information regarding family history and age at disease onset. All other features required for MS-DSS are computed on the server side of the application using the previously entered demographic information, medication history, auxiliary and neurological scales, and MRI grades. If the user does not have MRI grades within the past year, the application will treat this information as missing and use other available features to compute MS-DSS.
Figure 11

Quality control, help menus, and export within the Shiny web application. (A) If the user enters data that are non-numeric (e.g., ten instead of 10) or outside of the scope of the calculation (e.g., 0 second trial), the application will automatically generate a warning message with specifics regarding the error and directions on how to proceed. (B) The user must complete the COMRIS (
Discussion
Statistical learning is an indispensable technique for analyzing clinical data if the collected data has sufficient power to reliably measure existing relationships. Statistical power depends on the effect sizes of the processes we desire to investigate, the quantity of available measurements, and the precision of these measurements. Effect size is biologically determined, and, thus, cannot be modified by experimental design. Effect sizes tend to be small when considering individual elements, such as genes, proteins, or metabolites, in human polygenic diseases (
Our finding, that EDSS lacks the sensitivity to reliably measure disease progression in intervals shorter than 10 years, agrees with a recent observational study of 17,365 MS patients that measured a median annualized EDSS change of 0.1 (
Another source of imprecision comes from assessing DD. There is a wealth of knowledge regarding subclinical formation of MS lesions (e.g., in patients with radiologically isolated syndrome) and equally subclinical loss of neurological functions that a patient may not perceive as disability, depending on his/her physical and cognitive activity levels. This makes exact determination of MS onset impossible. On the other hand, large observational studies demonstrated that EDSS disability milestones are accumulated generally as a function of age rather than DD (
Another apparent source of error stems from the disregard of administered therapies in MSSS computation. We utilized a regression model from a recent meta-analysis of randomized, blinded clinical trials of MS DMTs (
The final step in the refinement of MS-DSS was the application of unbiased statistical learning. We used features that were previously shown to influence MS severity, such as gender and smoking. We also tested new features that, based on clinical knowledge, we considered likely to influence the rate of MS progression. Our hypothesis, that an increase in CNS tissue destruction (reflected by COMRIS-CTD and CombiWISE) will hasten disease progression because it limits functional recovery (i.e., formation of new synapses/circuits), was strongly supported by the GBM relative influence metrics, which selected these features above currently known modifiers of MS course. Similarly, delayed initiation of treatments was the fourth most important feature in GBM model, validating our hypothesis that such delay promotes establishment of compartmentalized inflammation and enabling us to identify patients with inadequate efficacy of administered DMTs. Analogously, the difference between therapy-adjusted CombiWISE and measured CombiWISE may reflect the beneficial effect of past treatments and, therefore, identify types of MS that are amenable to immunomodulation. It is important to recognize that none of the new features had better power in predicting future disability progression than therapy-adjusted CombiWISE/age. Thus, the new features are important modifiers that strengthen the predictive power of therapy-adjusted CombiWISE/age.
We acknowledge the following limitations: (1) our cohort was small in comparison to previous studies of MS severity models (
We also acknowledge that the current MS-DSS model might be further improved by adding biological data. As of this study, all models of MS severity, including MSSS, ARMSS, and MS-DSS, are based on the behavior of an average MS patient. However, as depicted in the Figure 12, behind every statistical average lies a distribution (e.g., Gaussian) from which this average is derived. The exact location a patient falls on this distribution is determined by patient-specific biological or environmental factors. The wider the spread in this distribution, the more imprecise the patient-specific predictions are when derived from a model that captures only group behaviors. Therefore, it is not surprising that, without inputting patient-specific data (e.g., genotypes, protein, and metabolic biomarkers), the prognostic power of MS-DSS can be only moderate. Based on the results of this study, we expect that inclusion of patient-specific biological features will serve to further strengthen the predictive power of MS-DSS. However, identification and validation of biological features that predict response to treatment or progression rates is missing precisely because we have, thus far, lacked a sensitive MS severity scale necessary for generating this knowledge in reasonably sized cohorts.
Figure 12

Explanation of additional sources of variation in the MS disease severity scale (MS-DSS) model. Every adjustment we made to the MS-DSS model is based on average values from large populations. Underlying each of these statistical averages lies a distribution (e.g., Gaussian) represented by the bell-shaped curves. The precise location of a patient within these distributions depends on patient-specific genetic, biological, and environmental factors. We will describe three types of theoretical patients that may be highly informative, even though each represents a rare variant. Assume that a patient P1 has precisely average behavior on all distributions. S/he will have an average age of multiple sclerosis (MS) onset (tonset), average response to high-efficacy therapy (HE-Th1), and average response to low-efficacy therapy (LE-Th3). Furthermore, assume that patient P2 has unusually severe MS behavior, and patient P3 has unusually mild MS behavior. When compared to the average patient (P1), P2 experienced MS onset at an early age and did not respond well to high- or low-efficacy therapy; whereas, P3 experienced MS onset at a later age and responded well to high- or low-efficacy therapy. Finally, all patients will end within the blue distribution, which comprises all variation demonstrated by the green distributions and is, therefore, the broadest. Within this distribution, P1 will fall precisely at the MS-DSS predicted average (peak); whereas, P2 will have higher than average MS-DSS predicted disability, and P3 will have lower than average MS-DSS predicted disability. The resulting behavior of P2 and P3 within each of these distributions can be attributed to a variety of patient-specific factors that are not included in MS-DSS model, such as genetic predisposition to more aggressive (P2) or milder (P3) MS, and environmental factors such as EBV infection, smoke exposure, low vitamin D, obesity, lack of exercise, etc. The MS-DSS model cannot make more accurate individual predictions until we understand all genetic, biological, and environmental modifiers and include them in the model.
We have also demonstrated that therapy-adjusted CombiWISE/age can be applied “backwards” without any loss of sensitivity by collecting CombiWISE measurements at the post-treatment stage and adjusting these measurements for previous therapeutic history. This allows utilization of existing MS sample repositories given that the patients can be called back and re-examined. For the patients who cannot be re-examined, there are two additional solutions if only SNRS is missing: (1) SNRS can be generated retrospectively from a documented neurological exam, or (2) if a detailed neurological exam is not available, SNRS can be successfully approximated from any 1.5T or 3T brain and upper spinal cord MRI using COMRIS-based models (
The final limitation of MS-DSS, which may hinder its widespread acceptance, is its complexity. The user-friendly web application that we have developed allows any user, devoid of knowledge in mathematics, statistics or programming, to derive all scales that we have developed [COMRIS-CTD (
In conclusion, MS-DSS has the potential to correctly identify relationships between MS severity and genes, proteins, or metabolites in significantly smaller cohorts than required for MSSS or its ARMSS analog. While MS-DSS can aid decisions about personalized treatments in its current form, future enhancements of the model with patient-specific biological data will unquestionably strengthen its individualized predictive power.
Statements
Ethics statement
This study was carried out in accordance with the recommendations of Declaration of Helsinki with written informed consent from all subjects. The protocol was approved by the NIH CNS institutional Review Board (IRB).
Author contributions
BB was responsible for original study concept, design, and supervision. BB and AW were responsible for acquisition of clinical data, and PK was responsible for data export. PK, MK, and BB were responsible for design of COMRIS-CTD (MRI scale) utilized as a feature in the model. AMW, CB, MT-M, MG, KJO, and BB were responsible for model design and statistical analyses. AMW, KJA, and TT developed and tested the web application. AMW, CB, MG, and BB were responsible for drafting the text. All authors contributed to revision and approval of the final manuscript.
Funding
The study has been supported by the Intramural Research Program of the National Institute of Neurological Disorders and Stroke (NINDS) of the National Institutes of Health (NIH).
Acknowledgments
The authors would like to thank the many clinicians (Irene Cortese, Joan Ohayon, Kaylan Fenton, Jamie Cherup), nurses (Helen Griffith, Jenifer Dwyer, Laura Kannaian), and schedulers (Anne Mayfield, Kewounie Pumphrey, and Shania Dowd) who participated in the collection of clinical data. The authors are especially grateful to patients and their caregivers, who are our lifelong teachers about their disease. MK contributed to this work as a former employee of NINDS, and the opinions expressed in this work do not represent her current affiliation.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Supplementary material
The Supplementary Material for this article can be found online at http://www.frontiersin.org/article/10.3389/fneur.2017.00598/full#supplementary-material.
Abbreviations
ARMSS, age related multiple sclerosis severity score; CDP, confirmed disability progression; CNS, central nervous system; CombiWISE, combinatorial weight-adjusted disability score; COMRIS-CTD, combinatorial MRI score of CNS tissue destruction; DD, disease duration; DMT, disease modifying therapy; EDSS, expanded disability status scale; GBM, gradient boosting machine; MRI, magnetic resonance imaging; MS-DSS, MS disease severity scale; MSSS, MS severity score; NIH, National Institutes of Health; 9HPT, 9-hole peg test; PPMS, primary progressive multiple sclerosis; QC, quality control; r, Pearson correlation coefficient; R2, coefficient of determination; RRMS, relapsing-remitting multiple sclerosis; SNRS, Scripps neurological rating scale; T25FW, timed 25-foot walk.
References
1
GeorgeMFBriggsFBShaoXGianfrancescoMAKockumIHarboHFet alMultiple sclerosis risk loci and disease severity in 7,125 individuals from 10 studies. Neurol Genet (2016) 2:e87.10.1212/NXG.0000000000000087
2
SawcerSHellenthalGPirinenMSpencerCCPatsopoulosNAMoutsianasLet alGenetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis. Nature (2011) 476:214–9.10.1038/nature10251
3
RoxburghRHSeamanSRMastermanTHensiekAESawcerSJVukusicSet alMultiple sclerosis severity score: using disability and disease duration to rate disease severity. Neurology (2005) 64:1144–51.10.1212/01.WNL.0000156155.19270.F8
4
KurtzkeJF. Rating neurologic impairment in multiple sclerosis: an expanded disability status scale (EDSS). Neurology (1983) 33:1444–52.10.1212/WNL.33.11.1444
5
ManouchehriniaAWesterlindHKingwellEZhuFCarruthersRRamanujamRet alAge related multiple sclerosis severity score: disability ranked by age. Mult Scler (2017):1352458517690618.10.1177/1352458517690618
6
ConfavreuxCVukusicS. Age at disability milestones in multiple sclerosis. Brain (2006) 129:595–605.10.1093/brain/awl313
7
ConfavreuxCVukusicSAdeleineP. Early clinical predictors and progression of irreversible disability in multiple sclerosis: an amnesic process. Brain (2003) 126:770–82.10.1093/brain/awg081
8
HanSLinYCWuTSalgadoADMexhitajIWuestSCet alComprehensive immunophenotyping of cerebrospinal fluid cells in patients with neuroimmunological diseases. J Immunol (2014) 192:2551–63.10.4049/jimmunol.1302884
9
SipeJCKnoblerRLBrahenySLRiceGPPanitchHSOldstoneMB. A neurologic rating scale (NRS) for use in multiple sclerosis. Neurology (1984) 34:1368–72.10.1212/WNL.34.10.1368
10
KosaPGhazaliDTanigawaMBarbourCCorteseIKelleyWet alDevelopment of a sensitive outcome for economical drug screening for progressive multiple sclerosis treatment. Front Neurol (2016) 7:131.10.3389/fneur.2016.00131
11
KosaPKomoriMWatersRWuTCorteseIOhayonJet alNovel composite MRI scale correlates highly with disability in multiple sclerosis patients. Mult Scler Relat Disord (2015) 4:526–35.10.1016/j.msard.2015.08.009
12
R Development Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing (2016).
13
RStudio Team. RStudio: Integrated Development for R. Boston, MA: RStudio, Inc (2015).
14
WeidemanAMTapia-MaltosMAJohnsonKGreenwoodMBielekovaB. Meta-analysis of the age-dependent efficacy of multiple sclerosis treatments. Front Neurol (2017).10.3389/fneur.2017.00577
15
HastieTTibshiraniRFriedmanJ. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2nd ed. New York: Springer (2009).
16
FriedmanJH. Greedy function approximation: a gradient boosting machine. Ann Stat (2001) 29:1189–232.10.1214/aos/1013203451
17
Ridgeway G with contributions from othersgbm: Generalized Boosted Regression Models. R package version 2.1.3. https://CRAN.R-project.org/package=gbm (2015).
18
KomoriMBlakeAGreenwoodMLinYCKosaPGhazaliDet alCSF markers reveal intrathecal inflammation in progressive multiple sclerosis. Ann Neurol (2015) 78:3–20.10.1002/ana.24408
19
KomoriMLinYCCorteseIBlakeAOhayonJCherupJet alBielekova, insufficient disease inhibition by intrathecal rituximab in progressive multiple sclerosis. Ann Clin Transl Neurol (2016) 3:166–79.10.1002/acn3.293
21
BielekovaBVodovotzYAnGHallenbeckJ. How implementation of systems biology into clinical trials accelerates understanding of diseases. Front Neurol (2014) 5:102.10.3389/fneur.2014.00102
22
LorscheiderJBuzzardKJokubaitisVSpelmanTHavrdovaEHorakovaDet alDefining secondary progressive multiple sclerosis. Brain (2016) 139:2395–405.10.1093/brain/aww173
23
ConfavreuxCVukusicS. Natural history of multiple sclerosis: a unifying concept. Brain (2006) 129:606–16.10.1093/brain/awl313
20
ChengWChengJAllaireJXieYMcPhersonJ. Shiny: Web Application Framework for R. R package version 1.0.0. https://CRAN.R project.org/package=shiny (2017).
24
KurtzkeJF. Epidemiology of multiple sclerosis. In: VinkenPJBruynGWKlawansHLKoetsierJC, editors. Handbook of Clinical Neurology, Demyelinating Diseases. Amsterdam/New York: Elsevier (1985). p. 259–87.
Summary
Keywords
multiple sclerosis, clinical trials, neuroimmunology, neuroinflammation, clinical practice, education
Citation
Weideman AM, Barbour C, Tapia-Maltos MA, Tran T, Jackson K, Kosa P, Komori M, Wichman A, Johnson K, Greenwood M and Bielekova B (2017) New Multiple Sclerosis Disease Severity Scale Predicts Future Accumulation of Disability. Front. Neurol. 8:598. doi: 10.3389/fneur.2017.00598
Received
09 August 2017
Accepted
24 October 2017
Published
10 November 2017
Volume
8 - 2017
Edited by
Björn Tackenberg, Philipps University of Marburg, Germany
Reviewed by
Ryan Ramanujam, Karolinska Institute (KI), Sweden; Trygve Holmøy, Akershus University Hospital, Norway
Updates

Check for updates
Copyright
© 2017 Weideman, Barbour, Tapia-Maltos, Tran, Jackson, Kosa, Komori, Wichman, Johnson, Greenwood and Bielekova.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Bibiana Bielekova, bibi.bielekova@nih.gov
†These authors have contributed equally to this work.
Specialty section: This article was submitted to Multiple Sclerosis and Neuroimmunology, a section of the journal Frontiers in Neurology
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.