Skip to main content

ORIGINAL RESEARCH article

Front. Neurol., 13 May 2022
Sec. Stroke

Improving Visualization Methods of Utility-Weighted Disability Outcomes for Stroke Trials

\nIvie Tokunboh&#x;Ivie Tokunboh1Eleanor Mina Sung*&#x;Eleanor Mina Sung2*Fiona ChatfieldFiona Chatfield1Nathan GainesNathan Gaines1May NourMay Nour3Sidney StarkmanSidney Starkman4Jeffrey L. SaverJeffrey L. Saver1
  • 1Department of Neurology and Comprehensive Stroke Center, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, United States
  • 2Viterbi School of Engineering, University of Southern California, Los Angeles, CA, United States
  • 3Department of Neurology, Division of Interventional Neuroradiology, and Comprehensive Stroke Center, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, United States
  • 4Departments of Emergency Medicine and Neurology and Comprehensive Stroke Center, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, United States

Background: The modified Rankin Scale (mRS) is the most common endpoint in acute stroke trials, but its power is limited when analyzed dichotomously and its indication of effect size is challenging to interpret when analyzed ordinally. To address these issues, the utility-weighted-mRS (UW-mRS) has been developed as a patient-centered, linear scale. However, appropriate data visualizations of UW-mRS results are needed, as current stacked bar chart displays do not convey crucial utility-weighting information.

Design/Methods: Two UW-mRS display formats were devised: (1) Utility Staircase charts, and (2) choropleth-stacked-bar-charts (CSBCs). In Utility Staircase displays, mRS segment height reflects the utility value of each mRS level. In CSBCs, mRS segment color intensity reflects the utility of each mRS level. Utility Staircase and CSBC figures were generated for 15 randomized comparisons of acute ischemic/hemorrhagic stroke therapies, including fibrinolysis, endovascular reperfusion, blood pressure moderation, and hemicraniectomy. Display accuracy in showing utility outcomes was assessed with the Tufte-lie-factor and ease-of-use assessed by formal ratings completed by a panel of 4 neurologists and emergency physicians and one nurse-coordinator.

Results: The Utility Staircase and CSBC displays rapidly conveyed patient-centered valuation of trial outcome distributions not available in conventional ordinal stacked bar charts. Tufte-lie-factor (LF) scores indicated “substantial distortion” of utility-valued outcomes for 93% (14/15) of conventional stacked bar charts, vs. “no distortion” for all Utility Staircase and CSBC displays. Clinician ratings on the Figural Display Questionnaire indicated that utility information encoded in row height (Utility Staircase display) was more readily assimilated than that conveyed in segment hue intensity (CSBC), both superior to conventional stacked bar charts.

Conclusions: Utility Staircase displays are an efficient graphical format for conveying utility weighted–modified Rankin Scale primary endpoint results of acute stroke trials, and choropleth-stacked-bar-charts a good alternative. Both are more accurate in depicting quantitative, health-related quality of life results and preferred by clinician users for utility results visualization, compared with conventional stacked bar charts.

Introduction

Visual display of quantitative information can greatly aid understanding of research findings and appropriate action by both experts and laypeople. A general principle of quantitative data display is that particular types of data are most accurately and accessibly depicted using particular types of figural formats (13). Accordingly, when a new type of numeric data analysis is developed, it is desirable to also identify or develop preferred formats for figurally displaying the data results.

Recently, a new numeric approach to analyzing primary outcomes in acute stroke trials was advanced-the utility-weighted modified Rankin Scale (UW-mRS) (4). The UW-mRS is a refinement of the modified Rankin Scale (mRS), the most common primary endpoint in acute stroke trials. The modified Rankin Scale is a seven-level scale of global impairment, disability, and handicap; however, its power is limited when analyzed dichotomously and its indication of effect size is challenging to interpret when analyzed ordinally. To address these issues, the UW-mRS has been developed as a patient-centered, linear scale that is both powerful and readily interpretable. Utility weights convert the spacing between the seven ranks on the mRS from arbitrarily uniform intervals to distances that directly reflect patient and societal valuation of each outcome state.

The UW-mRS provides evidence-based weights for different disability health states post-stroke for use in acute treatment trials, as has been recommended for assessing multi-state outcomes in clinical trials for all medical conditions (58). In assigning weights to the seven levels of the modified Rankin Scale, UW-mRS provides a simple and efficient technique to rate disability outcomes with greater accuracy than older dichotomous or unweighted polychotomous approaches to mRS analysis. The resulting assessment of disability-related quality of life is not as granular as that afforded by more complex instruments (8, 9), but is often more easily implementable and feasible in large trials (10). The UW-mRS has been supported by leading experts, supported by population-based epidemiologic studies, and endorsed by the Stroke Therapy Academic Industry Roundtable (STAIR) consensus group (5, 1114). Moreover, it has already been incorporated into acute stroke clinical trial designs, including serving as the primary endpoint in industry, FDA-regulated, and National Institute of Health multicenter pivotal trials (15, 16).

However, appropriate visual displays of UW-mRS results have not yet been explored. Display techniques developed for the ordinal mRS, including generic horizontal stacked bar charts, do not convey utility-weighting information. New or modified figure types are needed to communicate trial UW-mRS results visually in a way that is rapidly interpretable.

We therefore developed and tested two new visual data visualization approaches appropriate for patient-centered weighting of outcomes on ordinal scales across all diseases, including for UW-mRS findings in acute stroke: (1) the choropleth stacked bar chart (a modification of existing stack bar chart data visualizations), and (2) the Utility Staircase.

Methods

Visual Display Design

Based on patient population-based and international health-care provider studies, the UW-mRS assigns the each mRS level the following utility values: mRS 0–1.00; mRS 1–0.91; mRS 2–0.76; mRS 3–0.65; mRS 4–0.33; mRS 5–0.00; mRS 6–0.00. (4) The new visual displays appropriate to the UW-mRS were required to convey, in a single image, the following information: (1) frequency of each ordinal outcome in the active treatment group; (2) frequency of each ordinal outcome in the control group; (3) frequency of all possible dichotomized ordinal outcomes in the active treatment group; (4) frequency of all possible dichotomized ordinal outcomes in the control treatment group; (5) value (utility) of each ordinal outcome in the active treatment group; (6) value (utility) of each ordinal outcome in the control group; (7) total value (utility) of all outcomes in the active treatment group; (8) total value (utility) of all outcomes in the control group.

Two types of display meeting these criteria were developed: (1) the choropleth stacked bar chart (CSBC), and (2) the Utility Staircase.

Construction of Choropleth Stacked Bar Charts

Choropleth stacked bar charts are stacked bar charts that have the added condition that the degree of color intensity assigned to each bar segment is directly proportional to the quantified utility value of the segment's outcome level. As in standard stacked bar charts, each mRS segment length reflects the frequency of patients with the associated mRS value. However, unlike standard bar charts, each mRS segment's color intensity also communicates quantitative information–the utility value of that mRS score. Choropleth stacked bar charts can be made using single colors, with variations in intensity of, for example, greenness, redness, blueness, etc., or two color gradients, with quantified hue transitions from one color at one pole to a second color at the other pole. For this study, we created green CSBCs. Using RGB color model settings (which maps to the red, green, and blue optimum wavelengths of the three types of human retinal cone cells), red and blue inputs were set to zero, and the green input was varied proportionately to the utility value of each mRS segment. Accordingly, mRS 0 was assigned a green value of 255 (=255 × 1.0), mRS 1 a green value of 232 (=255 × 0.91), mRS 2 a green value of 194 (=255 × 0.76), mRS 3 a green value of 166 (=255 × 0.65), mRS 4 a green value of 84 (=255 × 0.33), and both mRS 5 and 6 a green value of 0 (=255 × 0.00, resulting in black coloration for segments five and six). Use of color to convey mRS level severity is already often employed in stacked bar charts of the mRS, but by employing arbitrary degrees of variation in hue rather than exactly measured RGB/brightness values corresponding to the utility weight.

Construction of Utility Staircase

Utility Staircase charts are derived from the starting point of standard stacked bar charts by adding the condition that the height of assigned to each bar segment is directly proportional to the quantified utility value of the segment's outcome level. As in standard stacked bar charts, each mRS segment's horizontal width reflects the frequency of patients with the associated mRS value. However, unlike standard bar charts, each mRS segment's height also communicates quantitative information–the utility value of that mRS score.

Along the x-axis of the Utility Staircase figure, the chart shows the proportion of patients with outcomes occurring in each mRS level. Along the y-axis, the chart shows the utility of each mRS segment, from zero to one. The total utility across the study population yielded by a treatment arm is visually conveyed by the total area of that treatment group's color field on the graph. The Utility Staircase plots have greater heights toward the left of the figures, reflecting the higher utility values of lower mRS scores, and lower heights toward the right of the figures, reflecting the lower utility of higher mRS scores. When more good outcomes occur, the higher utility value segments expand in width rightward, to take up more and more of the available figural space (the “utility space”). As a result, the area of rightward and upward shift in color field associated with better outcomes in a treatment group exactly quantifies the total utility value gained at the treatment group level. At the extremes, a treatment that resulted in fatal outcomes for all patients would yield an empty figural space, reflecting zero utility at the group level, while a treatment that resulted in fully normal mRS 0 values for all patients would yield a color rectangle occupying the entire available figural space, reflecting perfect utility at the group level. The utility value for the control group is conveyed by a gray color field. When there is unidirectional benefit, the utility value for the experimental group is conveyed by the additive combination of the gray and green color fields. When there is unidirectional harm, the utility value for the experimental group is conveyed by the subtraction of the red color field from the gray color field. When there are bidirectional effects, the utility value for the experimental group is conveyed by the additions of the green color field and subtractions of the red color field.

In the Utility Staircase generated for this study, the control group outcome distribution is assigned a gray color. The active treatment group is assigned a green color in regions where outcomes are more favorable than control and a red color in regions where outcomes are less favorable than control. If there were no difference whatsoever between the treatment groups in frequency of mRS outcomes at every level, the two treatments would overlap exactly and only the gray color would be visible. The total area of the green regions minus the red regions in a figure visually and quantitatively conveys the total, population-level, increase in utility conferred by the active treatment.

Analysis of Figural Accuracy

Formal analysis was performed of the degree to which the standard stacked bar charts, the choropleth stacked bar charts, and the Utility Staircase charts quantitatively captured the health-utility value of clinical trial outcomes. When display elements were disproportionate to the numeric value of utility, the degree of distortion was quantified using the Utility Distortion Factor (UDF), a variation of the Tufte lie factor (LF):(1) UDF = (size of cumulative group utility treatment effect shown in graphic) ÷ (size of cumulative group utility treatment effect shown in data). As recommended by Tufte, values >1.05 or <0.95 were considered to indicate substantial distortion. Additional details of the figural accuracy analysis methods are provided in Supplementary Methods Text 1.

Testing User Preferences

We constructed a formal rating scale to assess clinician preference among the three figural display options: the Showing Utility trial Result Findings - Clinician Assessment Scale (SURF-CAS). SURF-CAS scale items were adapted from the Practitioner Opinion (Acceptance) Survey and the Preparation for Decision Making Scale (17, 18). The resulting SURF-CAS instrument has five Likert scale items, asking clinicians to indicate which of two figural displays of trial utility results is: (1) overall easier to understand; (2) better visualizes the value a treatment group outcomes: (3) better conveys the difference in value of treatment outcomes; (4) more closely accords with how the clinician thinks about the value of study outcomes, and (5) better help the clinician recommend a treatment based on what matters to patients. (Supplementary Figure 1) for each scale item, the score ranges from +2 (Strongly Agree), +1 (Agree), 0 (Neutral), −1 (Disagree), and −2 (Strongly Disagree). The total SURF-CAS Scale score accordingly ranges from +10 (strong preference for first figure in pair) through 0 (neutrality between figures) to −10 (strong preference for second figure in pair). The SURF-CAS was administered to a purposive sample of five clinicians selected to represent different types and levels of training, including a senior faculty Emergency Medicine and Neurology physician, a mid-career faculty noninvasive Vascular Neurologist, a junior faculty Interventional Neurologist, a Vascular Neurology fellow, and a stroke center Nurse-Coordinator. After a brief orientation to the task and figure types, all raters assessed how well utility outcome information was conveyed by standard stacked bar charts, CSBCs, and Utility Staircase displays showing the results of the same four randomized trials, encompassing a positive endovascular thrombectomy trial (DAWN), a neutral endovascular thrombectomy trial (IMS 3), a positive intravenous thrombolysis study (NINDS-tPA Study), and a positive hemicraniectomy trial (DESTINY 2).

Results

The three visual display types, stacked bar charts, choropleth stacked bar charts, and Utility Staircase charts, are shown side-by-side in Figure 1, with each chart type depicting results from the same trial, DAWN, a study in which the UW-mRS was the lead primary endpoint. (15).

FIGURE 1
www.frontiersin.org

Figure 1. Comparative Standard Stacked Bar Chart, Choropleth Stacked Bar Chart, and Utility Staircase Chart Displays. Outcomes at 3 months on the ordinal modified Rankin Scale from the same trial, DAWN, are shown using each figural approach: (A) Standard stacked bar chart–display visually conveys mRS rank frequencies (bar segment widths), but does not explicitly visually communicate utility values; (B) Choropleth stacked bar chart–display visually conveys both mRS rank frequencies (bar segment widths) and mRS rank utility values (degree of green intensity); and (C) Utility Staircase chart–display visually communicates both mRS rank frequencies (row widths) and mRS rank utility values (column heights). In addition, display directly communicates utility value of the two treatment groups: utility value of control treatment–area size of gray coloration; utility value of active treatment–area size of combined gray and green coloration; utility gain with active treatment–area size of green coloration.

Figural Accuracy in Displaying Health-Utility Findings

For the standard stacked bar charts, among the 15 RCTs analyzed, the UDF values ranged from 0.53 to 1.38. The absolute error ranged from 0.02–0.47, with mean absolute error 0.21 (±0.15). Substantial distortion in the depiction of health-utility outcomes was present in 14/15 (93%) displays. In contrast, for both CSBC and Utility Staircase Displays, UDF values were uniformly 1.0 for all charts, without any occurrence of substantial distortion in the depiction of health-utility outcomes.

Clinician Ratings of Figural Effectiveness

The results of expert physician and nurse clinician ratings comparing the three display types are shown in Table 1. Responses had no missing data. Ratings showed trends to indicating that utility information encoded in row height (Utility Staircase Display) and conveyed in segment hue intensity (CSBC) were superior to conventional stacked bar charts.

TABLE 1
www.frontiersin.org

Table 1. Clinician preferences among visual displays of trial utility results.

Utility Staircase Trial Displays for Diverse Trials

The Utility Staircase Displays for the analyzed trials demonstrate the applicability of this figural format to diverse treatment modalities, patient groups, and outcome scales. For acute ischemic stroke with the UW-mRS as the primary outcome, displays readily visually convey key utility findings related to effects of later onset to treatment time for time-urgent therapies (Figure 2); different types of pharmacologic and endovascular reperfusion therapy (Figure 3); and effects of hemicraniectomy in instances of malignant acute ischemic stroke for patients both under and over 60 years old (Figure 4).

FIGURE 2
www.frontiersin.org

Figure 2. Utility Staircase charts demonstrating effects of different onset to treatment times for endovascular thrombectomy. Outcomes at 3 months on the UW-mRS are shown for: (A) last known well to puncture of 2 h, and (B) last known well to puncture of 5 h. Medical therapy outcomes (gray) are similar in both time periods, while endovascular thrombectomy outcomes have greater differential additional value (green) with earlier treatment start.

FIGURE 3
www.frontiersin.org

Figure 3. Utility Staircase charts showing nine trials and meta-analyses of reperfusion therapy for acute ischemic stroke. Top row: intravenous fibrinolysis trials, including: (A) IV tPA under 3 h; (B) IV tPA up to 6 h; (C) IV tPA in 3–4.5 h;(D) IV tenecteplase vs IV tPA in large vessel occlusion patients up to 4.5 h, and (E) IV alteplase in imaging selected patients after 4.5 h. Bottom row: endovascular reperfusion trials, including: (F) early generation techniques;(G) second generation mechanical thrombectomy largely in early time windows (H) second generation mechanical thrombectomy with imaging selection 6–24 h after onset, and (I) second generation mechanical thrombectomy with broader imaging selection 6–16 h after onset.

FIGURE 4
www.frontiersin.org

Figure 4. Utility Staircase charts showing trials data for hemicraniectomy for malignant acute ischemic stroke. (A) Among patients up to age 60 years old; (B) among patients over 60 years old. The greater utility benefit among younger patients is readily apparent, as is the overall poor health-related quality of life outcomes in both control and treated groups in both age ranges (both gray and green color fields hew toward the lower left portion of the rectangular achieved utility space).

Several aspects of these Figures exemplify the insights conveyed by the Utility Staircase Displays. For example, in Figure 3, in the top row, comparison of the first three figures showing different intravenous alteplase trials shows at a glance the greater overall utility benefit provided by earlier treatment (A vs. B and C) and the better control group utility outcome starting point resulting from milder deficits at entry (C vs. A and B). In the bottom row, comparison of figures F through I shows at a glance the great utility benefit from second generation compared with early endovascular reperfusion techniques (G-I vs. F) and the substantially equivalent degree of utility benefit with treatment in imaging selected, late patients vs. time-selected early patients (H, and I vs. F). In addition, comparison of the panels in top vs. bottom rows shows at a glance the overall better control group starting point of health-related quality of life outcome, resulting from milder deficits at entry in the broader population of IV fibrinolysis patients (A–E) compared with the narrower population of large vessel occlusion patients (F–I).

Supplementary Figure 2 demonstrates two additional aspects of the Utility Staircase method. First, it shows applicability to a different disease subtype, acute intracerebral hemorrhage rather than ischemic stroke. Second, it exemplifies how the Utility Staircase approach displays results when an experimental treatment tends to have an adverse, rather than favorable, effect on health-related quality of life. The plot of INTERACT two shows a favorable point estimate for moderate vs. mild blood pressure lowering, with the magnitude of enhanced utility outcomes displayed in green. Conversely, the plot of ATACH 2 shows an unfavorable point estimate for intensive versus moderate blood pressure lowering, with the magnitude of reduced utility outcomes displayed in red.

In Supplementary Figure 3, the plot demonstrates that the Utility Staircase figural methodology is generalizable across all scales to which utility values can be assigned to measure scores, not just the seven level mRS scale. This figure shows health-related quality of life outcomes arising from the four level 3SQ ordinal scale, the primary outcome measure in the IST1, and CAST mega-trials of aspirin in early ischemic stroke. The graphic also readily demonstrates the small absolute magnitude of utility benefit conferred by early aspirin vs. no aspirin, compared with the larger treatment effects in other figures. Supplementary Figure 4 shows further elements that can optionally be added to Utility Staircase Displays: adding the actual n values or percent values for each mRS in each treatment arm as embedded data labels. This approach provides more granular information regarding the numeric strength of the underlying evidence. However, it draws reader attention back to the ordinal aspects of the data and away from the utility-weighted aspects of the data that are indicated by the color field areas in their whole, not in part.

Figure 5 shows a general comparison of 13 RCTs of diverse therapies for varied subtypes of acute ischemic and hemorrhagic stroke, ordered by the point estimate of magnitude of utility benefit on the UW-mRS of the active intervention. The greatest utility gains are conferred by endovascular thrombectomy with modern devices and by hemicraniectomy for malignant infarction in young patients, the latter accruing benefit in the lower quadrant of utility space vs. thrombectomy accruing benefit in the mid-range of utility space. The least utility gains are conferred by older endovascular thrombectomy techniques and late intravenous fibrinolysis for ischemic stroke, and blood pressure lowering for intracerebral hemorrhage, the latter three associated with fairly good utility outcomes in control groups, tending toward the upper quadrant of the utility space.

FIGURE 5
www.frontiersin.org

Figure 5. (A–M) General comparative display of Utility Staircase plots for 13 RCTs and meta-analyses of treatments for acute ischemic and hemorrhagic stroke that used the mRS as a primary endpoint. Charts are ordered, left to right and then top to bottom, by each study's point estimate of the active intervention's overall magnitude of effect in increasing health-related quality of life.

Discussion

The Utility Staircase Chart and Choropleth Stacked Bar Charts are novel approaches to visualizing health-related quality of life outcomes in randomized clinical trials. These displays are appropriate to figurally convey results of trials using as a primary or auxiliary endpoint any ordinal or continuous scale having levels or scores convertible to health-related utility values, including the utility-weighted modified Rankin Scale recently developed and deployed for acute stroke trials. In the current study, both the Utility Staircase and CSBC plots outperformed conventional stacked bar chart displays, objectively in depicting health-related quality of life results without quantitative distortion and subjectively in clinician ratings of clarity and usefulness for decision-making. Moreover, among the two developed display types, clinicians nominally rate the Utility Staircase Display superior to the CSBC in clarity and decision-making value.

The optimal display approaches for different types of medical data has been the subject of extensive investigation over the past two decades, including development of displays appropriate for binary, ordinal, linear, categorical, and multidimensional data, including bar charts, line plots, scatter plots, trellis plots, spie charts, radar charts, and person-icon arrays (13, 1922) As far as we aware, prior studies have not addressed developing and assessing data visualizations that are appropriate for conveying health-related quality of life outcomes and outcome differences over the full outcome spectrum among intervention and control groups in randomized clinical trials. In the last two decades, clinical trial designs have increasingly incorporated patient-reported outcome measures and health-related quality of life endpoints, so that trial findings can be assimilated into practice based not solely upon disease-specific measures of variable clinical import, but also upon effects on life value that are clearly meaningful to patients, families, and society (23, 24). Accordingly, figural display formats appropriate for health utility outcome distribution data, like the Utility Staircase and CSBC, could be of wide applicability.

While quantitatively accurate Choropleth Stacked Bar Charts were not developed prior to the recent DAWN trial (15), a variety of approaches have previously been taken by researchers to assigning color values to modified Rankin Scale outcomes in stacked bar charts. In some study reports, the mRS stacked bar chart is shown without any color or shade variation (25), so that the color/shading channel is not used to convey value information. In other trial publications, the mRS stacked bar chart is shown with color or shading variations that perform various roles. These approaches vary widely. In some reports, each mRS cell is given a graded variations in shading or color intensity that conveys value information qualitatively, but not quantitatively, reinforcing the message, already communicated by cell position in the stacked bar chart, that the outcomes are ordered, but not providing added information. This qualitative reinforcement is most often monochromatic but with variation in shading/intensity of the single color (26, 27). Different groups have chosen directionally opposite patterns of shade variation, some depicting better outcomes as lighter (26, 27), and others depicting better outcomes as darker (28, 29). Other clinical trial reports have used cell coloring to embed a dichotomous analysis of the mRS in the figure, coloring good outcome cells (e.g., mRS 0,1,2) with one color and bad outcome cells with another color (e.g., mRS 3,4,5,6) (30). In addition, color choices by different group have reflected disparate implicit color metaphors. such as using funereal black or gray for the mRS 6 fatal outcome level, or traffic light-based green spectrum colors for better outcomes (e.g., mRS 0, 1, 2), yellow spectrum colors for intermediate outcomes (e.g., mRS 3, 4), and red spectrum colors for poor outcomes (e.g., mRS 5, 6). (31) At the extreme, some clinical trial reports have used alternating colors (32), or apparently random unique colors, (33) for the different mRS ranks, making the mRS levels more visually distinct, but doing so in a manner that somewhat undermines even the simple information about their relative value conveyed by their order in the stacked bar. In all of these previous approaches, color and color intensity selections were made qualitatively, and did not align with the actual health utility value of each of the mRS levels.

While the CSBC displays tested in this study overcame this limitation by having each mRS level's numeric utility value precisely determine its color intensity, the use of color to convey utility information still had limitations. First, readers with color blindness will not be able to appreciate all information conveyed in the color channel. This drawback can be mitigated by preparing alternate figures in which color intensities are varied within the spectrum of multiple colors (34). For this study, we prepared figures with variations in the color green, but the same approach could be used employing variations in blue, yellow, red, or other colors, or simply in gray-black, so that readers with different forms of color blindness would still have access to figures with variations visible to them. But another limitation is that human subjective responses to color intensity variation do not exactly track objective changes in red-green-blue color scale variation (35), and are culturally-influenced so non-universal (36). As a result, even color values exactly matched to mRS utility values may not evoke subjective responses in readers matched to the utility values. These limitations of color signaling may be one reason that physician and nurse clinicians did not nominally value the CSBC figures as highly as the Utility Staircase figures.

In contrast, the Utility Staircase figures' use of cell height to convey value is a psychophysically more sound approach to conveying quantitative information. Degree of linear extension vertically (height) is readily discriminated by humans in a manner that is highly accurate and relatively culturally invariant (37). Moreover, it accords with the manner in which mRS stacked bar charts are already conveying information about the frequency of the different mRS outcomes–degree of linear extension horizontally (width). For each mRS level, cell width (outcome frequency) and cell height (outcome value) combine to yield cell area as direct, readily appreciable, quantitative indicator of the contribution of that outcome to the treatment group's total achieved utility. The combination of frequency along the x axis and value along the y axis yields a figure in which a perfect outcome would be a rectangular color field occupying the entire graphical space (all patients achieving mRS 0). This approach is similar that widely used in figural displays of receiver operating curves evaluating diagnostic test performance. There too the directly measured variable is conveyed along the x axis (biomarker amount) and its clinical value is conveyed along the y axis (degree of combined sensitivity and sensitivity), yielding a display in which a perfect outcome would be rectilinear lines going straight up to the top of the chart and then straight across.

Both the Utility Staircase figure and the Choropleth Stacked Bar Chart are applicable to all ordinal scales with outcome levels that can be mapped to health-related quality of life. This generalizability was shown in the current study by creating Utility Staircase and CSBC figures not only for trials reporting results on the seven-level modified Rankin Scale, but also to a trial reporting results on the four-level 3SQ scale. The same approach can be used for the diverse ordinal scales used in clinical trials of varied disease states with that have discrete, ordered outcome states, including multiple sclerosis (e.g., expanded Kurtzke disability scale), traumatic brain injury (e.g., expanded Glasgow outcome scale), cardiac arrest (e.g., expanded Glasgow outcome scale), cancer (e.g., Karnofsy performance scale), and congestive heart failure (e.g., New York Heart Association Functional Classification). In principle, the Utility Staircase display can be similarly deployed to depict outcome values for disease-specific outcome scales that have continuous, rather than ordinal, outcome distributions, as long as a valid method is available to convert the disease-specific outcome values to health-related quality of life values.

This study has limitations. First, though the clinician raters assessing the different figural displays were purposively sampled to include diverse stakeholders, including nurses, physician-trainees, junior, mid-career, and senior faculty, and different specialties, all were from a single institution and certain specialties salient to acute stroke care, such as neurosurgeons, were not represented. Assessment is desirable of display type preference in numerically more individuals, additional stakeholder groups and geographic regions, potentially by online administration. Similarly, although Utility Staircase and CSBC displays were assessed for 14 different trials evaluating varied types of interventions (medical, endovascular, and open surgical), for different disease stages (acute and early secondary prevention), and different stroke subtypes (ischemic and hemorrhagic), further evaluation would be helpful in trials assessing other treatment modalities (e.g., rehabilitation) and additional stroke subtypes (e.g., subarachnoid hemorrhage). Second, the raters could not be objectively tested to evaluate improvements in understanding before and after figure review, as they were being asked to compare three figures precluding assessment of the independent knowledge contribution of any one figure type. Instead, they provided subjective ratings of the comparative degree to which they found the figures helpful. Future studies of knowledge acquisition with the highest-rated figure type, the Utility Staircase are desirable. Third, the standard stacked bar chart visualization assessed used only right-to-left placement to convey outcome levels visually. While this approach is used in the literature, it is also common to add use of color but employing arbitrary degrees of variation in hue rather than exactly measured RGB/brightness values corresponding to the utility weight. Rater preferences for the chloropleth stacked bar chart and Utility Staircase over the standard stacked bar chart likely would have been less if an arbitrarily color-graded standard bar chart was the comparator. Fourth, the displayed figures employ green color for positive and red color for negative outcomes. This strategy has the advantage of being symbolically-based upon the worldwide foundation of the use of green for go and red for stop at traffic lights (38). However, this approach may cause perceptual difficulty for individuals with red-green color blindness, for whom other color options may be employed.

Conclusions

Utility Staircase displays are an efficient graphical format for conveying utility weighted–modified Rankin Scale primary endpoint results of acute stroke trials, and choropleth stacked bar charts a good alternative. Both are more accurate in depicting quantitative, health-related quality of life results, and preferred by clinician users for utility results visualization, compared with conventional stacked bar charts.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author Contributions

IT: data analysis and writing of first draft. ES: data analysis and writing of second draft incorporating additional data. FC, NG, MN, and SS: served as expert raters and provided critical review and comments on manuscript. JS: study conception, data analysis and provided critical review and comments to manuscript. All authors contributed to the article and approved the submitted version.

Funding

This research and open access publication fees were supported by the UCLA Health System. The funder had no role in the design or interpretation of study results.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2022.875350/full#supplementary-material

References

1. Tufte E. The Visual Display of Quantitative Information. 2nd ed. Cheshire, Connecticut: Graphics Press LLC (2001). 197 p.

Google Scholar

2. Ware C. Visual Thinking for Design. Amsterdam: Elsevier Morgan Kaufmann Publishers. (2011).

Google Scholar

3. JAMA Network. Instructions for Authors - Table of Figure Requirements. Available online at: https://jamanetwork.com/journals/jama/pages/instructions-for-authors#SecFigureFileRequirementsforPublication

Google Scholar

4. Chaisinanunkul N, Adeoye O, Lewis RJ, Grotta JC, Broderick J, Jovin TG, et al. Adopting a patient-centered approach to primary outcome analysis of acute stroke trials using a utility-weighted modified rankin scale. Stroke. (2015) 46:2238–43. doi: 10.1161/STROKEAHA.114.008547

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Lees KR, Khatri P, Collaborators SI. Stroke treatment academic industry roundtable recommendations for individual data pooling analyses in stroke. Stroke. (2016) 47:2154–9. doi: 10.1093/med/9780199687039.003.0067_update_001

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Duc AN, Wolbers M. Weighted analysis of composite endpoints with simultaneous inference for flexible weight constraints. Stat Med. (2017) 36:442–54. doi: 10.1002/sim.7147

PubMed Abstract | CrossRef Full Text | Google Scholar

7. McLeod C, Norman R, Litton E, Saville BR, Webb S, Snelling TL. Choosing primary endpoints for clinical trials of health care interventions. Contemp Clin Trials Commun. (2019) 16:100486. doi: 10.1016/j.conctc.2019.100486

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Vaanholt MCW, Kok MM, von Birgelen C, Weernink MGM, van Til JA. Are component endpoints equal? A preference study into the practice of composite endpoints in clinical trials. Health Expect. (2018) 21:1046–55. doi: 10.1111/hex.12798

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Rethnam V, Bernhardt J, Dewey H, Moodie M, Johns H, Gao L, et al. Utility-weighted modified rankin scale: still too crude to be a truly patient-centric primary outcome measure? Int J Stroke. (2019) 15:268–77. doi: 10.1177/1747493019830583

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Berry S, Nogueira R, Saver JL. Letter by Berry et al. Regarding article, “utility-weighted modified rankin scale as primary outcome in stroke trials: a simulation study”. Stroke. (2018) 49:e337. doi: 10.1161/STROKEAHA.118.022198

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Taylor-Rowan M, Wilson A, Dawson J, Quinn TJ. Functional assessment for acute stroke trials: properties, analysis, and application. Front Neurol. (2018) 9:191. doi: 10.3389/fneur.2018.00191

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Rangaraju S, Haussen D, Nogueira RG, Nahab F, Frankel M. Comparison of 3-Month Stroke Disability and Quality of Life across Modified Rankin Scale Categories. Interv Neurol. (2017) 6:36–41. doi: 10.1159/000452634

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Ganesh A, Luengo-Fernandez R, Wharton RM, Rothwell PM. Ordinal vs dichotomous analyses of modified Rankin Scale, 5-year outcome, and cost of stroke. Neurology. (2018) 91:e1951–e60. doi: 10.1212/WNL.0000000000006554

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Jovin TG, Albers GW, Liebeskind DS, Consortium SI. Stroke Treatment Academic Industry Roundtable: The Next Generation of Endovascular Trials. Stroke. (2016) 47:2656–65. doi: 10.1161/STROKEAHA.116.013578

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Nogueira RG, Jadhav AP, Haussen DC, Bonafe A, Budzik RF, Bhuva P, et al. Thrombectomy 6 to 24 hours after stroke with a mismatch between deficit and infarct. N Engl J Med. (2018) 378:11–21. doi: 10.1056/NEJMoa1706442

PubMed Abstract | CrossRef Full Text | Google Scholar

16. MOST: Multi-arm Optimation of Stroke Thrombolysis trial. [Available online at: https://www.nihstrokenet.org/clinical-trials/acute-interventional-trials.

Google Scholar

17. Graham ID. Practitioner Opinion (Acceptability) Survey. (1996). Available online at: https://decisionaid.ohri.ca/docs/develop/user_manuals/um_acceptability.pdf

Google Scholar

18. Graham ID, O'Connor AM. Preparation for Decision Making Scale. (2005). Available online at: http://decisionaid.ohri.ca/eval.html

Google Scholar

19. Stafoggia M, Lallo A, Fusco D, Barone AP, D'Ovidio M, Sorge C, et al. Spie charts, target plots, and radar plots for displaying comparative outcomes of health care. J Clin Epidemiol. (2011) 64:770–8. doi: 10.1016/j.jclinepi.2010.10.009

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Saver JL, Gornbein J, Starkman S. Graphic reanalysis of the two NINDS-tPA trials confirms substantial treatment benefit. Stroke. (2010) 41:2381–90. doi: 10.1161/STROKEAHA.110.583807

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Tokunboh I, Vales Montero M, Zopelaro Almeida MF, Sharma L, Starkman S, Szeder V, et al. Visual aids for patient, family, and physician decision making about endovascular thrombectomy for acute ischemic stroke. Stroke. (2018) 49:90–7. doi: 10.1161/STROKEAHA.117.018715

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Stephan P, Schmid C, Freckmann G, Pleus S, Haug C, Muller P. The rectangle target plot: a new approach to the graphical presentation of accuracy of systems for self-monitoring of blood glucose. J Diabetes Sci Technol. (2015) 10:343–9. doi: 10.1177/1932296815612490

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Ahmed S, Berzon RA, Revicki DA, Lenderking WR, Moinpour CM, Basch E, et al. The use of patient-reported outcomes (PRO) within comparative effectiveness research: implications for clinical practice and health care policy. Med Care. (2012) 50:1060–70. doi: 10.1097/MLR.0b013e318268aaff

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Cella D, Lai JS, Nowinski CJ, Victorson D, Peterman A, Miller D, et al. Neuro-QOL: brief measures of health-related quality of life for clinical research in neurology. Neurology. (2012) 78:1860–7. doi: 10.1212/WNL.0b013e318258f744

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Dharmasaroja PA, Muengtaweepongsa S, Kommarkg U. Implementation of Telemedicine and Stroke Network in thrombolytic administration: comparison between walk-in and referred patients. Neurocrit Care. (2010) 13:62–6. doi: 10.1007/s12028-010-9360-3

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Goyal M, Menon BK, van Zwam WH, Dippel DW, Mitchell PJ, Demchuk AM, et al. Endovascular thrombectomy after large-vessel ischaemic stroke: a meta-analysis of individual patient data from five randomised trials. Lancet. (2016) 387:1723–31. doi: 10.1016/S0140-6736(16)00163-X

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Saver JL, Starkman S, Eckstein M, Stratton SJ, Pratt FD, Hamilton S, et al. Prehospital use of magnesium sulfate as neuroprotection in acute stroke. N Engl J Med. (2015) 372:528–36. doi: 10.1056/NEJMoa1408827

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Campbell BC, Mitchell PJ, Kleinig TJ, Dewey HM, Churilov L, Yassi N, et al. Endovascular therapy for ischemic stroke with perfusion-imaging selection. N Engl J Med. (2015) 372:1009–18. doi: 10.1056/NEJMoa1414792

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Choi JC, Lee JS, Park TH, Park SS, Cho YJ, Park JM, et al. Intravenous Tissue Plasminogen Activator Improves the Outcome in Very Elderly Korean Patients with Acute Ischemic Stroke. J Stroke. (2015) 17:327–35. doi: 10.5853/jos.2015.17.3.327

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Furlan A, Higashida R, Wechsler L, Gent M, Rowley H, Kase C, et al. Intra-arterial prourokinase for acute ischemic stroke. The PROACT II study: a randomized controlled trial Prolyse in Acute Cerebral Thromboembolism. JAMA. (1999) 282:2003–11. doi: 10.1001/jama.282.21.2003

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Saver JL. Novel end point analytic techniques and interpreting shifts across the entire range of outcome scales in acute stroke trials. Stroke. (2007) 38:3055–62. doi: 10.1161/STROKEAHA.107.488536

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Broderick JP, Palesch YY, Demchuk AM, Yeatts SD, Khatri P, Hill MD, et al. Endovascular therapy after intravenous t-PA versus t-PA alone for stroke. N Engl J Med. (2013) 368:893–903. doi: 10.1056/NEJMoa1214300

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Chamorro A, Amaro S, Castellanos M, Segura T, Arenillas J, Marti-Fabregas J, et al. Safety and efficacy of uric acid in patients with acute stroke (URICO-ICTUS): a randomised, double-blind phase 2b/3 trial. Lancet Neurol. (2014) 13:453–60. doi: 10.1016/S1474-4422(14)70054-7

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Roskoski R Jr.. Guidelines for preparing color figures for everyone including the colorblind. Pharmacol Res. (2017) 119:240–1. doi: 10.1016/j.phrs.2017.02.005

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Zeki S. In praise of subjective truths. J Physiol. (2009) 587(Pt 12):2825–35. doi: 10.1113/jphysiol.2009.170415

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Regier T, Kay P. Language, thought, and color: Whorf was half right. Trends Cogn Sci. (2009) 13:439–46. doi: 10.1016/j.tics.2009.07.001

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Ono H. Difference threshold for stimulus length under simultaneous and nonsimultaneous viewing conditions. Percept Psychophys. (1967) 2:201–7. doi: 10.3758/BF03213050

CrossRef Full Text | Google Scholar

38. International, Driving Authority,. London, Great Britain. [Updated March 2017; cited April 2022]. Available online at: https://idaoffice.org/posts/traffic-lights-in-different-countries-en/

Google Scholar

Keywords: acute stroke, clinical trials, visual display, disability, utility

Citation: Tokunboh I, Sung EM, Chatfield F, Gaines N, Nour M, Starkman S and Saver JL (2022) Improving Visualization Methods of Utility-Weighted Disability Outcomes for Stroke Trials. Front. Neurol. 13:875350. doi: 10.3389/fneur.2022.875350

Received: 14 February 2022; Accepted: 25 April 2022;
Published: 13 May 2022.

Edited by:

Bruce Campbell, The University of Melbourne, Australia

Reviewed by:

Vincent Thijs, University of Melbourne, Australia
Manon Kappelhof, Amsterdam University Medical Center, Netherlands

Copyright © 2022 Tokunboh, Sung, Chatfield, Gaines, Nour, Starkman and Saver. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Eleanor Mina Sung, minasung7@gmail.com

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.