SYSTEMATIC REVIEW article

Front. Immunol., 25 September 2023

Sec. Cancer Immunity and Immunotherapy

Volume 14 - 2023 | https://doi.org/10.3389/fimmu.2023.1228812

Common methodological pitfalls in ICI pneumonitis risk prediction studies

  • 1. Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom

  • 2. Department of Surgery, Cambridge University Hospitals, Cambridge, United Kingdom

  • 3. School of Clinical Medicine, University of Cambridge, Cambridge, United Kingdom

  • 4. Digital Health, Oncology R&D, AstraZeneca, Gaithersburg, MD, United States

  • 5. Department of Pure Mathematics and Mathematical Statistics, University of Cambridge, Cambridge, United Kingdom

  • 6. Department of Medicine, University of Cambridge, Cambridge, United Kingdom

  • 7. Department of Oncology, Cambridge University Hospitals, Cambridge, United Kingdom

Article metrics

View details

2

Citations

2,8k

Views

1,5k

Downloads

Abstract

Background:

Pneumonitis is one of the most common adverse events induced by the use of immune checkpoint inhibitors (ICI), accounting for a 20% of all ICI-associated deaths. Despite numerous efforts to identify risk factors and develop predictive models, there is no clinically deployed risk prediction model for patient risk stratification or for guiding subsequent monitoring. We believe this is due to systemic suboptimal approaches in study designs and methodologies in the literature. The nature and prevalence of different methodological approaches has not been thoroughly examined in prior systematic reviews.

Methods:

The PubMed, medRxiv and bioRxiv databases were used to identify studies that aimed at risk factor discovery and/or risk prediction model development for ICI-induced pneumonitis (ICI pneumonitis). Studies were then analysed to identify common methodological pitfalls and their contribution to the risk of bias, assessed using the QUIPS and PROBAST tools.

Results:

There were 51 manuscripts eligible for the review, with Japan-based studies over-represented, being nearly half (24/51) of all papers considered. Only 2/51 studies had a low risk of bias overall. Common bias-inducing practices included unclear diagnostic method or potential misdiagnosis, lack of multiple testing correction, the use of univariate analysis for selecting features for multivariable analysis, discretization of continuous variables, and inappropriate handling of missing values. Results from the risk model development studies were also likely to have been overoptimistic due to lack of holdout sets.

Conclusions:

Studies with low risk of bias in their methodology are lacking in the existing literature. High-quality risk factor identification and risk model development studies are urgently required by the community to give the best chance of them progressing into a clinically deployable risk prediction model. Recommendations and alternative approaches for reducing the risk of bias were also discussed to guide future studies.

1 Introduction

Immune checkpoint inhibitors (ICIs) have dramatically improved outcomes in cancer treatment in the past decade. Success has been seen in melanoma, lung and kidney cancer, although their use is rapidly expanding to other cancer types (1). In addition to their use in advanced cancer, they are also used in the perioperative settings to reduce risk of cancer recurrence (2). ICIs most commonly block the programmed cell death protein 1 (PD-1) and cytotoxic T-lymphocyte–associated antigen 4 (CTLA-4) pathways, key negative regulators of the anti-tumour immune response. Despite the success of ICIs, their mechanism of action means that they can trigger immune reactions against non-tumour, healthy tissues. These ‘immune-related adverse events’ (irAEs) may necessitate stopping treatment, or adding other drugs such as steroids to dampen the immune reaction; in rare cases, irAEs are severe enough to require hospital treatment or lead to patient death (3). Early recognition and ideally prevention of irAEs are therefore key challenges in oncology practice.

One of the more common irAEs that lead to drug discontinuation is pneumonitis, also known as interstitial lung disease (ILD), an inflammation of the lung tissue. It accounts for 20% of all ICI-associated deaths (4). Patients with ICI induced pneumonitis (ICI pneumonitis) most commonly present with symptoms of dyspnoea and cough (53 and 35 percent, respectively), while approximately one-third of patients are asymptomatic (5). More than half of patients with ICI pneumonitis may also present with another immune-related adverse event, such as colitis, dermatitis, or thyroiditis (5).

ICI pneumonitis can be challenging to distinguish from other pathologies such as pulmonary embolus, infection, heart failure or underlying cancer progression (6). Strategies to identify patients at risk of pneumonitis, and to recognize it early are a clinical priority (6).

Since the introduction of ICIs, many studies have been conducted to discover risk factors and to build risk prediction models. Both systematic and non-systematic reviews have been written to identify possible mechanisms for ICI pneumonitis (7), to summarize risk factors (8) and to recommend management strategies (9). A meta-analysis published in 2022 summarized the odds ratios from 35 studies between 2000 and 2022 and identified several risk factors that have significant pooled effect (Table 1) (8). Three studies aimed at building risk prediction models for ICI pneumonitis in human were also published in the same year (10–12). Chao et al. developed a nomogram from a 164-subject dataset; chronic obstructive pulmonary disease (COPD) diagnosis, PD-L1 expression and interleukin 8 (IL-8) levels were included as final predictors for incidence of ICI pneumonitis in non-small cell lung cancer (NSCLC) patients (10). Jia et al. took the nomogram approach as well, with a 209-subject training set, they identified hypertension, ILD emphysema and platelet/lymphocyte ratio (PLR) as model predictors (11). In contrast, Tan et al. trained a deep neural network on 48 subjects to combine pre-ICI imaging and clinical data, which represents the first application of modern machine learning techniques for ICI pneumonitis risk prediction (12).

Table 1

Covariate-adjusted or notOR95% CI
Squamous cell carcinomay1.131.18-1.45
Previous thoracic radiotherapyy2.071.34-3.19
Pre-existing radiation-induced pneumonitisy3.621.53-8.58
Pre-existing respiratory diseasey2.431.45-4.07
Pre-existing interstitial lung diseasey5.783.08-10.85
Pre-existing ground glass attenuationy11.481.13-116.74
Pre-existing honeycombingy6.112.37-15.79
Pre-existing pulmonary emphysemay2.721.00-7.36
Use of pembrolizumaby2.891.56-5.35
High PD-L1 expressiony3.591.23-10.50
Hypoalbuminemiay0.30.14-0.64
Smoking historyn1.391.14-1.71
Neutrophil-lymphocyte ration1.041.01-1.08
C-reactive proteinn1.081.01-1.16

Risk factors found to have significant pooled odds ratio (OR) in a meta-analysis conducted by Zhou et al. (8).

PD-L1, Programmed death-ligand 1.

Despite these efforts, none of the risk factors and risk prediction models have yet translated through to clinical deployment of risk prediction tools. Due to the lack of large-scale high-quality validation studies for the commonly investigated risk factors, the community does not have good evidence to rely on to form a consensual set of risk factors for risk modelling. Findings on individual risk factors are also inconsistent between studies. This inconsistency may be a result of suboptimal methodology, examples include 1) bias in the study populations; 2) difficulty in ICI pneumonitis diagnosis; 3) increased risk of chance findings in small datasets; 4) bias in statistical analysis.

Risk of bias analysis has been reported only in one previous review (8), which, together with other existing reviews, did not provide any detailed assessment on the methodology or the prevalence of bias-prone practices (7, 9, 13–16). Therefore, in the current systematic review, we present a thorough critical appraisal of the methodology of ICI pneumonitis risk factor identification and risk model development studies. Prevalence of bias-prone approaches is quantified as well.

2 Materials and methods

2.1 Search strategy and selection criteria

Published works and preprints were identified using a Python interface for arXiv (arxiv=1.4.2) and R interfaces for PubMed (RISmed=2.3.0), medRxiv (medrxivr=0.0.5) and bioRxiv (also medrxivr=0.0.5). The databases were searched from 1 January 2000 to 30 September 2022. An initial high-level search was performed for risk factor and risk prediction studies on anti-cancer drug-related pneumonitis. The subsequent search had a narrower scope, focusing on risk factor and risk prediction studies for ICI-related pneumonitis and specifically included individual ICIs in the search terms. The only preprint was removed after confirming that it did not investigate ICI pneumonitis.

A two-stage process was then adopted to identify papers that reported risk factors or risk models for ICI pneumonitis; first, the title and abstract were screened followed by a second screen of the full article text.

2.1.1 Stage I: title and abstract screening

Three reviewers (Y.C., S.W., M.R.) determined the relevance of the studies based on titles and abstracts. Each paper was assessed by two reviewers independently. Conflicts were resolved by consensus between the three reviewers.

The inclusion criteria were: 1. Indication of risk factors, biomarkers and/or predictive models for ICI pneumonitis, including comparison of ICI pneumonitis incidence between different patient subgroups. 2. Reporting of imaging characteristics for ICI pneumonitis

2.1.2 Stage II: full-text screening

Four reviewers (Y.C., S.W., M.R., J.J.) determined the relevance of the studies based on the full text. Each paper was assessed by two reviewers independently, conflicts were resolved by consensus between the four reviewers.

In this review, we included any original study that reported: risk factors, predictive biomarkers and/or models for ICI pneumonitis (including a comparison of ICI pneumonitis incidence between different patient subgroups). The analysis must use statistical tests or predictive modelling for inclusion in our review.

2.2 Risk of bias in individual studies

The QUIPS tool (17) was adopted to assess the risk of bias in the risk factor studies. For studies that aim to develop a risk prediction model, the PROBAST (18) method was used to assess the risk of bias. Each study was independently evaluated by two reviewers (Y.C., A.P.), disagreements were resolved by consensus.

2.3 Data analysis

The following information was extracted from the papers (Supplementary Table 1) (1): outcome of interest; (2) country in which the data were collected; (3) type of cancer treated by ICI; (4) whether the study reported risk factors or a risk prediction model; (5) sample size; (6) statistical tests or models used; (7) data pre-processing, including discretization and handling of missing-values (8) method for validation if a study describes a risk prediction model; (9) whether the code for training the model and the trained model was publicly available (only for studies reporting risk prediction models); and (10) whether imaging features were involved and how they were extracted.

The extracted information was then profiled to evaluate the prevalence of each suboptimal practice.

2.4 Keywords in search strategy

2.4.1 Initial search

The initial search looked for the presence of a combination of keywords in the title and abstract of each study. A study was retained if either the title or the abstract contains at least one of: chemotherapy, TKI, tyrosine kinase inhibitor, immune checkpoint inhibitor, immune checkpoint blockade, ICPI, ICI, mTOR, targeted, immune-related; and at least one of: pneumonitis, interstitial lung disease, ILD; as well as one of: biomarker, biomarkers, predictor, predictors, predictive, predict, predicts, prediction, risk factor, risk factors.

2.4.2 ICI-specific search

The secondary search was conducted in the same way but using different search terms. The title or the abstract must contain at least one of: immune checkpoint inhibitor, immune checkpoint blockade, ICPI, ICI, pembrolizumab, nivolumab, cemiplimab, durvalumab, avelumab, atezolizumab, ipilimumab, tremelimumab, immune-related; and at least one of: pneumonitis, interstitial lung disease, ILD; as well as one of: biomarker, biomarkers, predictor, predictors, predictive, predict, predicts, prediction, risk factor, risk factors.

3 Results

3.1 Study selection

711 distinct studies were found in the initial search, and 199 of these were deemed relevant during abstract screening. Of the 199 that were eligible for full-text screening, 51 were retained for discussion in this analysis (Figure 1, selection criteria are detailed in the Methods section). ICI pneumonitis of different grades were investigated in the studies, including any-grade (48/51), grade 2 or above (3/51), grade 3 or above (3/51), and grade 5 (1/51).

Figure 1

50/51 studies aimed to identify baseline or pre-treatment risk factors for developing ICI pneumonitis from clinical data (46/50), imaging data (27/50, all but 2 included clinical data as well) or specialized laboratory tests such as genetics (2/50) and antibody abundance (2/50). 2/50 studies additionally developed a risk prediction model after risk factor identification from clinical data (10, 11). The remaining study focused on risk prediction model development (with clinical and imaging data) and did not investigate the significance of individual risk factors (12).

3.1.1 Studies investigating risk factors for developing ICI pneumonitis (n=50)

Most (47/50) studies used private data collected from their authors’ affiliated institutions. The remaining 3/51 studies used public databases (Shah 2020 used VigiBase, Asada 2021 and Bai 2021 used data from FDA Adverse Event Reporting System, FAERS), which have limited availability for clinical variables and diagnostic method. Study sample size ranged between 17 and 40826 (19) (median=169, IQR:93-248), with a mean case:control ratio of 21:100 for any-grade (N=47), 16:100 for grade 2 or above (N=3), 11:100 for grade 3 or above (N=3), 14:100 for grade-5 (N=1).

Close to 50% (24/50) of the studies used data from Japan, followed by USA (10/50), China (10/50), Australia (1/50), South Korea (1/50), Mexico (1/50) and Spain (1/50). 3/51 studies were analysed data from the FAERS database (2/3) and the VigiBase database (1/3) (Figure 2A), both contain drug adverse event reports from multiple countries. In addition, 78% (21/27) of studies that examined imaging-based risk factors for ICI pneumonitis development used only Japanese population (Figure 2B).

Figure 2

74% (20/27) of imaging-based risk factor studies used computerized tomography (CT) as the sole imaging technique. The remaining 7 did not specify the imaging modality. All the studies relied on manual interpretation to determine radiographic features. 11/27 studies had 2-3 radiologists or pulmonologists reviewing the CT scans, 2/27 studies used a central review committee, 14/27 did not report the number of investigators involved in extracting the imaging findings. Acquisition parameters for the images were described in only 7/27 studies.

A majority of risk factor studies (39/50) conducted analyses on lung cancer, of which 33/38 recruited only NSCLC patients (Figures 3A, B). Melanoma was investigated in 3/50 studies. Acute myeloid leukaemia (AML) was considered in one study (Figure 3A). Cancers in different organs were combined in the analyses from 10/50 studies, 3/10 excluded subjects with lung cancer (Figures 3A, C). One study (19) used data from the FAERS database which contains all the adverse event reports submitted to FDA regardless of cancer type; but the authors did not report the proportion of different cancer types in the data analysed.

Figure 3

19/50 studies summarized follow-up duration. The median time to onset is shorter than the median follow-up duration in all studies that reported both follow-up time and time to onset.

3.1.2 Studies developing ICI pneumonitis risk prediction models (n=3)

A total of 3 studies attempted to develop risk prediction models (10–12). All of them focused on any-grade ICI pneumonitis and used data collected from the their authors’ affiliated institutions, which were all in China (Figure 2A). Sample sizes for model development were 48 (12), 164 (10) and 209 (11). Only one study had an internal validation (holdout) set (i.e. was not used in cross-validation or bootstrapping), the same study is also the only one that utilized an external validation set (11). The internal and external validation sets consisted of 209 and 172 subjects, respectively.

Imaging features were investigated in 2/3 studies. The modality of imaging was described in one study (12) that used a deep neural network to implicitly and automatically extract relevant CT imaging features. The other study used imaging-based diagnoses (pre-existing ILD and emphysema status) as candidate predictors but did not explicitly state the imaging domain (11).

In terms of cancer type, all models were built on data from lung cancer patients (Figures 3A, B): NSCLC in Chao 2022 and Jia 2022, lung cancer with different histology in Tan 2022.

3.2 Risk of bias

Following the recommendations from the Cochrane Prognosis Methods Group, the Quality In Prognosis Studies (QUIPS) tool (17) was adopted to assess the risk of bias in the risk factor studies. A total of six domains are considered by the QUIPS tool: study participation, study attrition, prognostic factor measurement, outcome measurement, study confounding, and statistical analysis and reporting. In the current analysis, the study attrition domain was considered irrelevant since only five studies were prospective and their primary endpoints were either safety or efficacy rather than ICI pneumonitis development. This means the concept of study completion is ill-defined with respect to ICI pneumonitis onset. In addition, due to the lack of understanding and consensus on potential confounding factors for ICI pneumonitis, the risk of bias due to study confounding was not assessed.

For studies that aim to develop a risk prediction model, the Risk Of Bias Assessment Tool (PROBAST) was used (18). The four domains of focus are: participants, predictors, outcomes and analysis.

3.2.1 Studies investigating risk factors for developing ICI pneumonitis (n=50)

3.2.1.1 Study participation

3/50 studies were determined to have high risk of bias (Table 2). The three studies with high risk of bias (19, 20, 36) did not provide any summary statistics on the baseline demographic and clinical characteristics of the participants. The remaining 47/50 studies provided a sufficient description of the source of subjects involved and on the distribution of clinical and demographic characteristics in the population, so were considered to have low risk of bias.

Table 2

StudyReferenceUses pre-ICI imaging featureStudy participationPrognostic factor measurementOutcome measurementStatistical analysis and reportingHighest RoB grading
Fujimoto 2018(21)nlowhighhighhighhigh
Cui 2018(22)nlowlowmoderatemoderatemoderate
Nakahama 2018(23)ylowlowhighmoderatehigh
Owen 2018(24)nlowlowhighmoderatehigh
Suresh 2018(25)nlowlowmoderatemoderatemoderate
Cho 2018(26)ylowlowlowmoderatemoderate
Yamaguchi 2018(27)ylowlowlowmoderatemoderate
Sukari 2019(28)ylowlowhighhighhigh
Dávila-Dupont 2019(29)nlowlowlowhighhigh
Shibaki 2019(30)ylowlowhighlowhigh
Duma 2019(31)nlowlowhighlowhigh
Nakanishi 2019(32)ylowhighlowhighhigh
Komiya 2019(33)ylowhighhighhighhigh
Fukihara 2019(34)ylowlowhighmoderatehigh
Tone 2019(35)nlowlowlowmoderatemoderate
Tahir 2019(36)nhighlowhighlowhigh
Nishiyama 2019(37)ylowlowlowmoderatemoderate
Sugano 2020(38)ylowlowhighmoderatehigh
Suzuki 2020(39)nlowhighlowhighhigh
Shah 2020(20)nhighlowhighhighhigh
Li 2020(40)nlowhighmoderatemoderatehigh
Okada 2020(41)ylowlowmoderatehighhigh
Ikeda 2020(42)ylowlowlowlowlow
Moda 2020(43)ylowlowlowhighhigh
Chu 2020(44)nlowhighhighmoderatehigh
Shimoji 2020(45)ylowlowlowmoderatemoderate
Isono 2020(46)ylowlowhighhighhigh
Zhang 2020(47)ylowlowmoderatehighhigh
Asada 2021(19)nhighhighhighmoderatehigh
Atchley 2021(48)ylowlowhighhighhigh
Zhou 2021(49)nlowlowhighmoderatehigh
Bai 2021(50)nlowlowhighhighhigh
Lin 2021(51)nlowlowlowmoderatemoderate
Yamaguchi 2021(52)ylowlowlowhighhigh
Yamamoto 2021(53)ylowhighhighhighhigh
Sierra-Rodero 2021(54)nlowhighmoderatehighhigh
Ichimura 2021(55)ylowhighhighmoderatehigh
Reuss 2021(56)nlowlowmoderatelowmoderate
Yamaguchi 2022(57)ylowlowlowhighhigh
Wang 2022(58)nlowhighhighhighhigh
Chao 2022(10)nlowlowmoderatemoderatemoderate
Ohe 2022(59)ylowlowhighlowhigh
Xu 2022(60)nlowlowhighhighhigh
Sheshadri 2022(61)nlowlowmoderatehighhigh
Uchida 2022(62)ylowlowlowmoderatemoderate
Uhara 2022(63)ylowlowhighmoderatehigh
Jia 2022(11)ylowhighhighhighhigh
Ikeda 2022(64)ylowhighlowmoderatehigh
Abed 2022(65)nlowlowhighlowhigh
Lu 2022(66)ylowlowlowlowlow

Risk of bias assessment based on QUIPS for the 51 studies that reported risk factors.

Cells are highlighted in red, amber and green to indicate high, moderate and low risk. RoB: risk of bias.

3.2.1.2 Prognostic factor measurement

37/50 studies were considered to have low risk of bias for prognostic factor measurement, with clear information on data source and data handling (Table 2). 13/50 studies were evaluated to have high risk of bias due to unclear description on the source of the prognostic factors (2/13, Table 3) (33, 55), use of data-driven discretization that was optimized to maximise discrimination (5/13, Table 3) (11, 39, 44, 53, 58), and lack of information on data processing such as the handling of missing data or discretization (5/13, Table 3) (11, 19, 21, 39, 64). Additionally, 5/13 studies simply excluded subjects with missing data from the analysis when the proportion of missing values were high (> 10% missing, Table 3) (32, 40, 53–55).

Table 3

Prognostic factor measurement RoBData-driven discretizationExcluded missing values when abundantUnclear pre-processingUnclear definition or source of risk factorNumber of studies
(DDD)(EXCL)(PREP)(RFDEF)
High (DDD)20002
High (DDD;EXCL)11001
High (DDD;PREP)10101
High (DDD;PREP)10101
High (EXCL)03003
High (PREP)00303
High (RFDEF)00011
High (RFDEF;EXCL)01011
LowN/AN/AN/AN/A37
Total555250

Number of risk factor studies judged to have high, moderate and low risk of bias for prognostic factor (risk factor) measurement for different reasons.

Reasons for high and moderate risk are listed in the bracket. If multiple reasons are applicable to the same study, the reasons are separated by semicolon. Cells are highlighted in red, amber and green to indicate high, moderate and low risk. RoB: risk of bias.

3.2.1.3 Outcome measurement

All studies explicitly stated ICI pneumonitis as the outcome of interest, alternative terms used in the studies include: pneumonitis (as a type of immune-related adverse event), immune-related pneumonitis/ILD and exacerbation of interstitial pneumonia after ICI administration. 16/50 studies had sufficient description on ICI pneumonitis diagnosis (although no gold-standard diagnostic test exists) or excluded other possible cause of lung inflammation by design, so were considered to have low risk for bias for outcome measurement (Table 2). 9/50 had moderate risk, where the authors attempted to distinguish ICI pneumonitis from alternative diagnoses such as infection, tumour progression and pre-existing lesions, but there remains a risk of mistaking radiation pneumonitis (RP) for ICI pneumonitis due to prior thoracic radiotherapy or a lack of information on thoracic radiotherapy (Table 4). The remaining 25/50 studies had high risk of bias, 21 of which did not mention isolating ICI pneumonitis from other causes of lung inflammation, 4 had explicitly included RP and other ILD in the definition of ICI pneumonitis (Table 4) (19, 34, 48, 55).

Table 4

Outcome measurement RoBNumber of studies
High (explicit inclusion of RP and/or other diagnoses)4
High (lacks diagnostic detail)21
Moderate (attempted differential diagnosis, risk of RP)9
Low16
Total50

Number of risk factor studies judged to have high, moderate and low risk of bias for outcome measurement for different reasons.

Reasons for high and moderate risk are listed in the bracket. Cells are highlighted in red, amber and green to indicate high, moderate and low risk. RoB: risk of bias, RP: radiation pneumonitis.

3.2.1.4 Statistical analysis and reporting

8/50 studies were considered to have low risk of bias (Table 2). 21/50 had moderate risk due to lack of clarity in some but not all analytical steps (4/21, Table 5), the use of univariate analysis to select variables for multivariable analysis (11/21, Table 5), and discretization of continuous variables (16/21, Table 5). The remaining 21/50 were high risk studies (Table 5): 11 applied significance test on over 20 predictors in at least one of the analytical steps but used uncorrected p-value < 0.05 as significance threshold; 6 tested the same factor more than once using different discretization thresholds without appropriate multiple testing adjustment; 7 did not provide enough detail to indicate if there was selective reporting of results.

Table 5

Statistical analysis and reporting RoB> 20 features/factors but use uncorrected p-value < 0.05 as significance threshold
(MULTI)
Tested multiple dichotomization thresholds for the same variable (THRESH)Possible selective reporting
(SELECT)
Univariate-based feature selection before multivariable analysis (UNIMULT)Discretization before modelling/statistical test
(DISC)
Unclear description
(UNCLR)
Number of studies
High (MULTI)1000001
High (MULTI;THRESH)1100001
High (MULTI;DISC)1000101
High (MULTI; DISC;THRESH;UNIMULT)1101101
High (MULTI; UNIMULT; DISC)2002202
High (MULTI; UNIMULT; DISC;THRESH)1101101
High (SELECT)0070007
High (DISC;THRESH)0100101
High (UNIMULT;MULTI; DISC)4004404
High (UNIMULT; DISC;THRESH)0202202
Moderate (DISC)N/AN/AN/A0909
Moderate (UNCLR)N/AN/AN/A0011
Moderate (UNIMULT;DISC;UNCLR)N/AN/AN/A1111
Moderate (UNIMULT)N/AN/AN/A2002
Moderate (UNIMULT; DISC)N/AN/AN/A6606
Moderate (UNIMULT;UNCLR)N/AN/AN/A2022
LowN/AN/AN/AN/AN/AN/A8
Total with high risk11671012021
Total with moderate riskN/AN/AN/A1116421
Total in all11672128450

Number of risk factor studies judged to have high, moderate and low risk of bias for statistical analysis and reporting for different reasons.

If multiple reasons are applicable to the same study, the reasons are separated by semicolon. Cells are highlighted in red, amber and green to indicate high, moderate and low risk. RoB: risk of bias. Reasons for high and moderate risk are listed in the brackets.

3.2.2 Studies developing ICI pneumonitis risk prediction models (n=3)

3.2.2.1 Participants

According to PROBAST, all three studies reporting risk prediction models had low risk of bias in terms of participant or study sample selection. They retrospectively included data from cancer patients who were given ICI treatments in hospitals, no bias was identified from the inclusion/exclusion criteria.

3.2.2.2 Predictors

All three studies reporting risk prediction models had low risk of bias introduced by predictors or their assessment. Candidate predictors were all extracted pre-treatment or at baseline without knowledge of outcome data. All the predictors were available in training, internal validation, and the external validation sets when used.

3.2.2.3 Outcome

All three studies had unclear risk of bias for outcome determination due to lack of description on how ICI pneumonitis were distinguished from other alternative diagnoses such as infection and tumour progression.

3.2.2.4 Analysis

All three studies had high risk of bias for data analysis due to low sample size:feature ratio (24 cases and 24 controls for deep learning in Tan 2022) (12), discretization of continuous variables (10, 11), exclusion of missing values when a large proportion of a variable is missing (17% missing in Chao 2022) (10), use of univariate analysis to select predictors (10) and uncorrected optimism in the reported model performance (hyperparameter selection with cross-validation and absence of a holdout set in Tan 2022, univariate feature selection based on full population and lack of a holdout set in Chao 2022) (10, 12).

3.3 Data analysis

3.3.1 Missing data and imputation

3.3.1.1 Studies investigating risk factors for developing ICI pneumonitis (n=50)

Missing values were reported in 22/50 studies, the maximum proportion of missing values in a single-variable ranges between 0.2% and 64.7% across the studies (median:17.3%, IQR: 7.5%-36.1%). 4/22 of the studies included the missing values as a separate level in regression analysis (11, 35, 41, 46), 5/22 were unclear about the imputation method used (19, 21, 39, 61, 64), the rest simply excluded subjects with missing values. None of the studies investigated whether the missing values were independent from values in other variables.

3.3.1.2 Studies developing ICI pneumonitis risk prediction models (n=3)

2/3 studies reported missing values and also investigated risk factors. One had a maximum of 17% of values missing in a single variable and excluded samples with missing values from analysis (10); the other included missing values as a separate category during model development but did not mention the frequency of missingness (11). Similar to the risk factors studies, the relationship between the missing values and other variables (except for ICI pneumonitis status) were not examined.

3.3.2 Univariate and multivariable analysis

3.3.2.1 Studies investigating risk factors for developing ICI pneumonitis (n=50)

Univariate analysis was performed in 47/50 studies, most of them (26/47) used logistic regression in combination with other tests or by itself: 6/26 studies used contingency and/or two-sample tests to determine variables that should be further assessed by univariate logistic regression (26, 36, 41, 44, 47, 58). One study used contingency test for categorial factors and logistic regression for continuous factors (50). The remaining 19/26 studies used only logistic regression to identify risk factors.

Survival analysis was the second most popular method for univariate analysis, and was performed in 8/47 studies that conducted univariate analysis: 5/8 studies used survival analysis alone to identify risk factors (2 applied the Fine-Gray test, 3 applied Cox proportional hazard model) (40, 46, 53, 61, 63). One study conducted a contingency test and used a Cox proportional hazard model on the same factor (49), the other two used contingency and/or two-sample tests to select variables that should be further assessed in univariate survival modelling (39, 58).

Amongst the remaining studies that conducted univariate analysis, 12 used nothing but contingency and/or two-sample tests, one applied significance test on area under the receiver operating characteristics curve (ROC) (11), and one used general estimating equation (GEE) (56).

46/47 studies that performed univariate analysis used p=0.05 as significance threshold, one implied a threshold of p=0.001 (59). None, except one study, applied multiple testing correction (24).

Multivariable analysis was performed in 31/51 studies, most of them (23/31) used only logistic regression, another 6/31 used only survival modelling (3 with the Fine-Gray test, 3 with the Cox proportional hazard model) (39, 40, 46, 53, 61, 63). One study implied the existence of multivariable analysis for ICI pneumonitis but did not report corresponding results (50), the other applied both logistic regression and Cox proportional hazard model (58). Overall, in either univariate or multivariable analyses, 31/51 studies used logistic regression, 8/51 studies applied survival modelling.

A total of 21/31 studies performing multivariable analysis relied solely (14/21) or partially (7/21) on univariate analysis for feature preselection, so only variables that were below a small p-value (the p-value thresholds ranged between 0.05 and 0.2) or had sufficiently large effect size were passed to the multivariable analysis. 6/31 studies did not conduct any data-driven pre-selection at all (11, 23, 28, 38, 56, 61). 3/31 were unclear on the method of pre-selection (21, 29, 34). All the studies that were partially dependent on univariate analysis for feature selection had additional pre-specified factors. Feature selection during multivariable modelling was conducted in 4/31 studies via stepwise selection (34, 61), shrinkage (43) and p-value cut-off (48).

3.3.2.2 Studies developing ICI pneumonitis risk prediction models (n=3)

Each of the three studies used different tests for univariate analysis. Chao et al. used logistic regression to select predictors for model building (10); Jia et al. used ROC analysis to determine the best dichotomization threshold for continuous variables before multivariable modelling (11); Tan et al. only used univariate contingency and two-sample tests to compare the baseline characteristics between ICI pneumonitis and control subject (12).

The risk prediction models were all multivariable. 2/3 were based on logistic regression (10, 11), they all investigated risk factors as well. The remaining paper experimented different deep learning approaches (unimodal with either clinical factors or CT images alone, multimodal combining the two, and each of these approaches enhanced by contrastive learning) (12).

4 Discussion

4.1 Study aims

In this analysis, we found that almost all of papers on ICI pneumonitis prediction focused on statistical significance and effect sizes of risk factors rather than risk model development. While statistical significance and the effect sizes (e.g. odds ratio and hazard ratio) may inform clinicians about the relative risk of ICI pneumonitis development for a patient, a clinical decision made without an estimate of the absolute risk can be suboptimal: a 2-fold increase in relative risk could represent a change of absolute risk from 40% to 80%, or from 5% to 10%. We suggest that future studies should particularly focus on validation of existing risk factors and the development of risk prediction models. Both types of study require a much larger sample size than the existing studies (median = 169) for the findings to be considered reliable enough for clinical application. For example, the Liverpool Lung Project (LLP) lung cancer risk prediction model was developed on 1736 subjects and validated on two large cohorts with sample sizes of 2922 and 7652, respectively (67, 68). Furthermore, since distinguishing low grade from high grade ICI pneumonitis is critical for ICI management decisions and the urgency of treatment, risk factors and models that predict high-grade pneumonitis and their time of onset would be highly valuable. However, due to the rarity of high-grade ICI pneumonitis, single-centre studies may find the sample sizes required to be impractical. Multi-national and multi-centre collaboration may be an attractive option in this case.

4.2 Dataset considered

In terms of the datasets considered, 47% of studies (24/51, Figure 2) were conducted using data from Japan. The proportion increases to 71% (21/28, Figure 2) if just the imaging-based studies are considered. This could be due to historically high incidence rates of drug-related ILD in Japan (69). The over-representation of Japanese patient data may lead to inaccurate risk prediction in other countries and ethnicities as models fail to generalize (70). Studies including non-Japanese populations should therefore be encouraged. Most (39/51) studies had specific focus on lung cancer patients, and we suggest that future studies may also concentrate on discovering and validating risk factors and models in non-lung cancer populations to identify whether any generalise to other cancers.

In studies that investigated imaging-based risk factors, all but one study used imaging features that were derived from manually identified abnormal radiological patterns in the lungs. Although the studies sought agreement between multiple radiologists or pulmonologists, subjective bias may still exist and lead to inaccuracy in the labels due to varying levels of experience and training. Automated tools should be developed to identify the radiological patterns to limit such bias.

4.3 Outcome definition

One of the main observations from the risk of bias analysis is the between-study inconsistency in the definition of the ICI pneumonitis population and a lack of description on how ICI pneumonitis was distinguished from other diagnoses. This reflects the lack of gold-standard diagnostic criteria for ICI pneumonitis.

Regardless of the cause, patients experiencing pneumonitis will undergo the same or similar clinical management and differentiating the cause of pneumonitis may not necessarily add value to clinical decision-making during its direct management. However, for cancer patients, accurate identification of the cause of pneumonitis in similar subjects is crucial, as if severe ICI induced pneumonitis is suspected this may lead to discontinuation of an effective treatment.

Further studies should endeavour to accurately establish the cause and type of pneumonitis, and document the associated risk factors, so that the community can build a better understanding of managing this clinical situation

4.4 Statistical analysis

The risk of bias analysis also revealed some suboptimal techniques in the statistical analysis of the reviewed studies. One prevalent method was the use of univariate analysis to select variables to include in the multivariable analysis, this was observed in 21/31 studies reporting risk factors from multivariable analysis and 1/3 studies reporting risk model development. With this approach, variables that are only informative after controlling for other variables will be dropped out from the final model (71). This phenomenon was observed in one of the studies in our review: Uchida et al. yielded insignificant univariate result for association between pre-existing ILD and risk of symptomatic ICI pneumonitis, but when adjusted for lung metastasis, the association became statistically significant (62). A better alternative would be a step-wise regression or a sparse regularized model (e.g. Least Absolute Shrinkage and Selection Operator (72)).

The widespread lack of multiple testing correction (in all but one study) was another common source of bias when many potential risk factors were simultaneously tested. The bias was exaggerated when the same factors were tested multiple times with different discretization thresholds. For instance, under the assumption of independence between the risk factors, a study should expect at least one false discovery when more than 20 factors are tested at a significance threshold of p = 0.05. This can lead to optimistic and non-reproducible results. To reduce the Type I error, i.e. when a null hypothesis is rejected when it is actually true, methods such as Benjamini-Hochberg correction and Bonferroni correction could be employed (72).

In studies that report risk model development, the most concerning source of optimism came from ill-defined cross-validation where univariate feature selection and data-driven feature transformation (e.g. feature dichotomization based on univariate ROC analysis) was conducted on the entire population rather than the training set in each round of cross-validation. Similarly, reporting only the cross-validation performance after it has been used in hyperparameter optimisation can also lead to an overestimated model performance. To obtain the least biased estimate on model performance, investigators should ideally use internal and (whenever possible) external holdout sets that have not been exposed to any part of model optimisation.

Many (13/22) of the reviewed studies that contained missing values simply discarded the observations from corresponding analyses. This approach assumes that the missing values are randomly distributed and are not related to the outcome or other potential risk factors; this assumption was not verified in any of the studies. As a result, the studies that had sizable numbers of missing values were likely to contain biased results due to failure to account for informative missing values. Statistical power can also be jeopardized due to the reduction in sample size. Future studies should explain and check assumptions on the distribution of missing values, reasons for missingness, and use appropriate imputation methods before considering excluding observations (73). One way to check the informativeness of missing values is to include them as a separate category in the regression analysis, as done by four studies reviewed in this analysis. However, this approach requires discretizing continuous variables, which itself introduces bias (74), so should be discouraged for continuous variables. In places where missing values have been imputed or excluded, non-linear trends involving continuous variables should be investigated using techniques such as spline regression rather than with discretization (74).

We noted the overwhelming popularity of logistic regression in the risk factor analysis studies (55%, either univariate or multivariable) and in risk model development studies (75%). The results from these studies could be interpreted as identifying the odds ratios for (or predicting the risk of) developing ICI pneumonitis before the last follow-up or death regardless of the timing of the events. This limits clinical utility, as the user would not know whether the intervention should be urgent. Recorded follow-up time ranges from days to months, making survival analysis a better and clinically-actionable alternative.

5 Conclusion

Overall, our work highlighted several common methodological pitfalls in ICI pneumonitis risk factor identification and risk model development studies, covering areas from diagnosis of ICI pneumonitis to statistical analysis and risk modelling. The majority of the studies considered here are likely to have reported biased results due to those pitfalls. We also provided recommendations and alternative approaches for reducing the risk of bias. Studies with low risk of bias in all domains are lacking in the existing literature (only 2/51, Table 2), high-quality risk factor identification and especially risk model development studies (i.e. predicting absolute rather than relative risk) are urgently required by the community in order to progress into formation of a clinical deployable risk prediction model.

Statements

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Author contributions

Original idea (SW/MR), paper searching (YC), title/abstract/full-text screen (YC/SW/MR/JJ), data extraction and review of bias (YC/AP), draft manuscript (YC/JR/JJ/MR). All authors contributed to the article and approved the submitted version.

Acknowledgments

The authors are grateful for the following funding: AstraZeneca (YC), the EU/EFPIA Innovative Medicines Initiative project DRAGON (101005122) (C-BS, MR), the Trinity Challenge (C-BS, MR), the EPSRC Cambridge Mathematics of Information in Healthcare Hub EP/T017961/1 (JA, C-BS, JR, MR), the Cantab Capital Institute for the Mathematics of Information (C-BS), the European Research Council under the European Union’s Horizon 2020 research and innovation programme grant agreement no. 777826 (C-BS), the Alan Turing Institute (C-BS), Wellcome Trust (JR), Cancer Research UK Cambridge Centre (C9685/A25177) (C-BS), British Heart Foundation (JR), the NIHR Cambridge Biomedical Research Centre (JR), HEFCE (JR). In addition, C-BS acknowledges support from the Leverhulme Trust project on ‘Breaking the non-convexity barrier’, the Philip Leverhulme Prize, the EPSRC grants EP/S026045/1 and EP/T003553/1 and the Wellcome Innovator Award RG98755.

Conflict of interest

Authors BT, KB and EH were employed by the company AstraZeneca.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

This study received funding from AstraZeneca. The funder was not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2023.1228812/full#supplementary-material

References

  • 1

    HargadonKMJohnsonCEWilliamsCJ. Immune checkpoint blockade therapy for cancer: An overview of FDA-approved immune checkpoint inhibitors. Int Immunopharmacology (2018) 62:29–39. doi: 10.1016/j.intimp.2018.06.001

  • 2

    ChoueiriTKTomczakPParkSHVenugopalBFergusonTChangY-Het al. Adjuvant pembrolizumab after nephrectomy in renal-cell carcinoma. New Engl J Med (2021) 385(8):683–94. doi: 10.1056/NEJMoa2106391

  • 3

    HaanenJObeidMSpainLCarbonnelFWangYRobertCet al. Management of toxicities from immunotherapy: ESMO Clinical Practice Guideline for diagnosis, treatment and follow-up☆. Ann Oncol (2022) 33(12):1217–38. doi: 10.1016/j.annonc.2022.10.001

  • 4

    WangDYSalemJECohenJVChandraSMenzerCYeFet al. Fatal toxic effects associated with immune checkpoint inhibitors: A systematic review and meta-analysis. JAMA Oncol (2018) 4(12):1721–8. doi: 10.1001/jamaoncol.2018.3923

  • 5

    NaidooJWangXWooKMIyribozTHalpennyDCunninghamJet al. Pneumonitis in patients treated with anti–programmed death-1/programmed death ligand 1 therapy. J Clin Oncol (2016) 35(7):709–17. doi: 10.1200/JCO.2016.68.2005

  • 6

    ReussJESureshKNaidooJ. Checkpoint inhibitor pneumonitis: mechanisms, characteristics, management strategies, and beyond. Curr Oncol Rep (2020) 22(6):56. doi: 10.1007/s11912-020-00920-z

  • 7

    ZhaiXZhangJTianYLiJJingWGuoHet al. The mechanism and risk factors for immune checkpoint inhibitor pneumonitis in non-small cell lung cancer patients. Cancer Biol Med (2020) 17(3):599–611. doi: 10.20892/j.issn.2095-3941.2020.0102

  • 8

    ZhouPZhaoXWangG. Risk factors for immune checkpoint inhibitor-related pneumonitis in cancer patients: A systemic review and meta-analysis. Respiration (2022) 101(11):1–16. doi: 10.1159/000526141

  • 9

    KimSLimJU. Immune checkpoint inhibitor-related interstitial lung disease in patients with advanced non-small cell lung cancer: systematic review of characteristics, incidence, risk factors, and management. J Thorac Disease (2022) 14(5):1684–95. doi: 10.21037/jtd-22-93

  • 10

    ChaoYZhouJHsuSDingNLiJZhangYet al. Risk factors for immune checkpoint inhibitor-related pneumonitis in non-small cell lung cancer. Transl Lung Cancer Res (2022) 11(2):295–306. doi: 10.21037/tlcr-22-72

  • 11

    JiaXChuXJiangLLiYZhangYMaoZet al. Predicting checkpoint inhibitors pneumonitis in non-small cell lung cancer using a dynamic online hypertension nomogram. Lung Cancer (2022) 170:74–84. doi: 10.1016/j.lungcan.2022.06.001

  • 12

    TanPHuangWWangLDengGYuanYQiuSet al. Deep learning predicts immune checkpoint inhibitor-related pneumonitis from pretreatment computed tomography images. Front Physiol (2022) 13:978222–. doi: 10.3389/fphys.2022.978222

  • 13

    von ItzsteinMSKhanSGerberDE. Investigational biomarkers for checkpoint inhibitor immune-related adverse event prediction and diagnosis. Clin Chem (2020) 66(6):779–93. doi: 10.1093/clinchem/hvaa081

  • 14

    XuYFuYZhuBWangJZhangB. Predictive biomarkers of immune checkpoint inhibitors-related toxicities. Front Immunol (2020) 11:. doi: 10.3389/fimmu.2020.02023

  • 15

    ChennamadhavuniAAbushahinLJinNPresleyCJManneA. Risk factors and biomarkers for immune-related adverse events: A practical guide to identifying high-risk patients and rechallenging immune checkpoint inhibitors. Front Immunol (2022) 13:779691. doi: 10.3389/fimmu.2022.779691

  • 16

    ZhangYZhangXLiWDuYHuWZhaoJ. Biomarkers and risk factors for the early prediction of immune-related adverse events: a review. Hum Vaccin Immunother (2022) 18(1):2018894. doi: 10.1080/21645515.2021.2018894

  • 17

    HaydenJAvan der WindtDACartwrightJLCôtéPBombardierC. Assessing bias in studies of prognostic factors. Ann Intern Med (2013) 158(4):280–6. doi: 10.7326/0003-4819-158-4-201302190-00009

  • 18

    WolffRFMoonsKGMRileyRDWhitingPFWestwoodMCollinsGSet al. PROBAST: A tool to assess the risk of bias and applicability of prediction model studies. Ann Internal Med (2019) 170(1):51. doi: 10.7326/M18-1376

  • 19

    AsadaMMikamiTNiimuraTZamamiYUesawaYChumaMet al. The risk factors associated with immune checkpoint inhibitor-related pneumonitis. Oncology (2021) 99(4):256–9. doi: 10.1159/000512633

  • 20

    ShahKPSongHYeFMoslehiJJBalkoJMSalemJEet al. Demographic factors associated with toxicity in patients treated with anti-programmed cell death-1 therapy. Cancer Immunol Res (2020) 8(7):851–5. doi: 10.1158/2326-6066.CIR-19-0986

  • 21

    FujimotoDYoshiokaHKataokaYMorimotoTKimYHTomiiKet al. Efficacy and safety of nivolumab in previously treated patients with non-small cell lung cancer: A multicenter retrospective cohort study. Lung Cancer (2018) 119:14–20. doi: 10.1016/j.lungcan.2018.02.017

  • 22

    CuiPLiuZWangGMaJQianYZhangFet al. Risk factors for pneumonitis in patients treated with anti-programmed death-1 therapy: A case-control study. Cancer Med (2018) 7(8):4115–20. doi: 10.1002/cam4.1579

  • 23

    NakahamaKTamiyaAIsaSITaniguchiYShiroyamaTSuzukiHet al. Association between imaging findings of airway obstruction adjacent to lung tumors and the onset of interstitial lung disease after nivolumab. In Vivo (2018) 32(4):887–91. doi: 10.21873/invivo.11324

  • 24

    OwenDHWeiLBertinoEMEddTVillalona-CaleroMAHeKet al. Incidence, risk factors, and effect on survival of immune-related adverse events in patients with non-small-cell lung cancer. Clin Lung Cancer (2018) 19(6):e893–900. doi: 10.1016/j.cllc.2018.08.008

  • 25

    SureshKVoongKRShankarBFordePMEttingerDSMarroneKAet al. Pneumonitis in non-small cell lung cancer patients receiving immune checkpoint immunotherapy: incidence and risk factors. J Thorac Oncol (2018) 13(12):1930–9. doi: 10.1016/j.jtho.2018.08.2035

  • 26

    ChoJYKimJLeeJSKimYJKimSHLeeYJet al. Characteristics, incidence, and risk factors of immune checkpoint inhibitor-related pneumonitis in patients with non-small cell lung cancer. Lung Cancer (2018) 125:150–6. doi: 10.1016/j.lungcan.2018.09.015

  • 27

    YamaguchiTShimizuJHasegawaTHorioYInabaYYatabeYet al. Pre-existing pulmonary fibrosis is a risk factor for anti-PD-1-related pneumonitis in patients with non-small cell lung cancer: A retrospective analysis. Lung Cancer (2018) 125:212–7. doi: 10.1016/j.lungcan.2018.10.001

  • 28

    SukariANagasakaMAlhasanRPatelDWozniakARamchandrenRet al. Cancer site and adverse events induced by immune checkpoint inhibitors: A retrospective analysis of real-life experience at a single institution. Anticancer Res (2019) 39(2):781–90. doi: 10.21873/anticanres.13175

  • 29

    Davila-DupontDMotola-KubaDDorantes-HerediaRGonzalez-AlonsoBKAlcantara-VelardeTGarcia-SantistebanRet al. Risk of pneumonitis with the use of different immune checkpoint inhibitors in a mexican population. Oncology (2019) 96(5):268–72. doi: 10.1159/000497405

  • 30

    ShibakiRMurakamiSMatsumotoYGotoYKandaSHorinouchiHet al. Tumor expression and usefulness as a biomarker of programmed death ligand 1 in advanced non-small cell lung cancer patients with preexisting interstitial lung disease. Med Oncol. (2019) 36(6):49–. doi: 10.1007/s12032-019-1274-0

  • 31

    DumaNAbdel-GhaniAYadavSHoverstenKPReedCTSitekANet al. Sex Differences in Tolerability to Anti-Programmed Cell Death Protein 1 Therapy in Patients with Metastatic Melanoma and Non-Small Cell Lung Cancer: Are We All Equal? Oncologist. (2019) 24(11):e1148–e55. doi: 10.1634/theoncologist.2019-0094

  • 32

    NakanishiYMasudaTYamaguchiKSakamotoSHorimasuYNakashimaTet al. Pre-existing interstitial lung abnormalities are risk factors for immune checkpoint inhibitor-induced interstitial lung disease in non-small cell lung cancer. Respir Investig (2019) 57(5):451–9. doi: 10.1016/j.resinv.2019.05.002

  • 33

    KomiyaKNakamuraTAbeTOgusuSNakashimaCTakahashiKet al. Discontinuation due to immune-related adverse events is a possible predictive factor for immune checkpoint inhibitors in patients with non-small cell lung cancer. Thorac Cancer (2019) 10(9):1798–804. doi: 10.1111/1759-7714.13149

  • 34

    FukiharaJSakamotoKKoyamaJItoTIwanoSMoriseMet al. Prognostic impact and risk factors of immune-related pneumonitis in patients with non-small-cell lung cancer who received programmed death 1 inhibitors. Clin Lung Cancer (2019) 20(6):442–50.e4. doi: 10.1016/j.cllc.2019.07.006

  • 35

    ToneMIzumoTAwanoNKuseNInomataMJoTet al. High mortality and poor treatment efficacy of immune checkpoint inhibitors in patients with severe grade checkpoint inhibitor pneumonitis in non-small cell lung cancer. Thorac Cancer (2019) 10(10):2006–12. doi: 10.1111/1759-7714.13187

  • 36

    TahirSAGaoJMiuraYBlandoJTidwellRSSZhaoHet al. Autoimmune antibodies correlate with immune checkpoint therapy-induced toxicities. Proc Natl Acad Sci U S A (2019) 116(44):22246–51. doi: 10.1073/pnas.1908079116

  • 37

    NishiyamaNHondaTSemaMKawaharaTJinYNatsumeIet al. The utility of ground-glass attenuation score for anticancer treatment-related acute exacerbation of interstitial lung disease among lung cancer patients with interstitial lung disease. Int J Clin Oncol (2020) 25(2):282–91. doi: 10.1007/s10147-019-01576-x

  • 38

    SuganoTSeikeMSaitoYKashiwadaTTerasakiYTakanoNet al. Immune checkpoint inhibitor-associated interstitial lung diseases correlate with better prognosis in patients with advanced non-small-cell lung cancer. Thorac Cancer (2020) 11(4):1052–60. doi: 10.1111/1759-7714.13364

  • 39

    SuzukiYKarayamaMUtoTFujiiMMatsuiTAsadaKet al. Assessment of immune-related interstitial lung disease in patients with NSCLC treated with immune checkpoint inhibitors: A multicenter prospective study. J Thorac Oncol (2020) 15(8):1317–27. doi: 10.1016/j.jtho.2020.04.002

  • 40

    LiMSpakowiczDZhaoSPatelSHJohnsAGroganMet al. Brief report: inhaled corticosteroid use and the risk of checkpoint inhibitor pneumonitis in patients with advanced cancer. Cancer Immunol Immunother (2020) 69(11):2403–8. doi: 10.1007/s00262-020-02674-w

  • 41

    OkadaNMatsuokaRSakuradaTGodaMChumaMYagiKet al. Risk factors of immune checkpoint inhibitor-related interstitial lung disease in patients with lung cancer: a single-institution retrospective study. Sci Rep (2020) 10(1):13773–. doi: 10.1038/s41598-020-70743-2

  • 42

    IkedaSKatoTKenmotsuHOguraTIwasawaSSatoYet al. A phase 2 study of atezolizumab for pretreated NSCLC with idiopathic interstitial pneumonitis. J Thorac Oncol (2020) 15(12):1935–42. doi: 10.1016/j.jtho.2020.08.018

  • 43

    ModaMSaitoHKatoTUsuiRKondoTNakaharaYet al. Tumor invasion in the central airway is a risk factor for early-onset checkpoint inhibitor pneumonitis in patients with non-small cell lung cancer. Thorac Cancer (2020) 11(12):3576–84. doi: 10.1111/1759-7714.13703

  • 44

    ChuXZhaoJZhouJZhouFJiangTJiangSet al. Association of baseline peripheral-blood eosinophil count with immune checkpoint inhibitor-related pneumonitis and clinical outcomes in patients with non-small cell lung cancer receiving immune checkpoint inhibitors. Lung Cancer (2020) 150:76–82. doi: 10.1016/j.lungcan.2020.08.015

  • 45

    ShimojiKMasudaTYamaguchiKSakamotoSHorimasuYNakashimaTet al. Association of preexisting interstitial lung abnormalities with immune checkpoint inhibitor–induced interstitial lung disease among patients with nonlung cancers. JAMA Netw Open (2020) 3(11):e2022906–e. doi: 10.1001/jamanetworkopen.2020.22906

  • 46

    IsonoTKagiyamaNTakanoKHosodaCNishidaTKawateEet al. Outcome and risk factor of immune-related adverse events and pneumonitis in patients with advanced or postoperative recurrent non-small cell lung cancer treated with immune checkpoint inhibitors. Thorac Cancer (2020) 12(2):153–64. doi: 10.1111/1759-7714.13736

  • 47

    ZhangCGaoFJinSGaoWChenSGuoR. Checkpoint inhibitor pneumonitis in Chinese lung cancer patients: clinical characteristics and risk factors. Ann Palliat Med (2020) 9(6):3957–65. doi: 10.21037/apm-20-1823

  • 48

    AtchleyWTAlvarezCSaxena-BeemSSchwartzTAIshizawarRCPatelKPet al. Immune checkpoint inhibitor-related pneumonitis in lung cancer: real-world incidence, risk factors, and management practices across six health care centers in north carolina. Chest (2021) 160(2):731–42. doi: 10.1016/j.chest.2021.02.032

  • 49

    ZhouJZhaoJJiaQChuQZhouFChuXet al. Peripheral blood autoantibodies against to tumor-associated antigen predict clinical outcome to immune checkpoint inhibitor-based treatment in advanced non-small cell lung cancer. Front Oncol (2021) 11:625578–. doi: 10.3389/fonc.2021.625578

  • 50

    BaiSTianTPachecoJMTachiharaMHuPZhangJ. Immune-related adverse event profile of combination treatment of PD-(L)1 checkpoint inhibitors and bevacizumab in non-small cell lung cancer patients: data from the FDA adverse event reporting system. Transl Lung Cancer Res (2021) 10(6):2614–24. doi: 10.21037/tlcr-21-464

  • 51

    LinXDengHYangYWuJQiuGLiSet al. Peripheral blood biomarkers for early diagnosis, severity, and prognosis of checkpoint inhibitor-related pneumonitis in patients with lung cancer. Front Oncol (2021) 11:698832–. doi: 10.3389/fonc.2021.698832

  • 52

    YamaguchiTShimizuJHasegawaTHorioYInabaYHanaiNet al. Pre-existing interstitial lung disease is associated with onset of nivolumab-induced pneumonitis in patients with solid tumors: a retrospective analysis. BMC Cancer (2021) 21(1):924. doi: 10.1186/s12885-021-08661-3

  • 53

    YamamotoNNakanishiYGemmaANakagawaKSakamotoTAkamatsuAet al. Real-world safety of nivolumab in patients with non-small-cell lung cancer in Japan: Postmarketing surveillance. Cancer Sci (2021) 112(11):4692–701. doi: 10.1111/cas.15117

  • 54

    Sierra-RoderoBCruz-BermúdezANadalEGaritaonaindíaYInsaAMosqueraJet al. Clinical and molecular parameters associated to pneumonitis development in non-small-cell lung cancer patients receiving chemoimmunotherapy from NADIM trial. J Immunother Cancer (2021) 9(8). doi: 10.1136/jitc-2021-002804

  • 55

    IchimuraTHinataMIchikuraDSuzukiS. Safety of immune checkpoint inhibitors in non-small-cell lung cancer patients with idiopathic interstitial pneumonia: a matched case-control study. Cancer Chemother Pharmacol (2021) 89(1):21–30. doi: 10.1007/s00280-021-04362-7

  • 56

    ReussJEBrighamEPsoterKJVoongKRShankarBEttingerDSet al. Pretreatment lung function and checkpoint inhibitor pneumonitis in NSCLC. JTO Clin Res Rep (2021) 2(10):100220–. doi: 10.1016/j.jtocrr.2021.100220

  • 57

    YamaguchiTShimizuJOyaYWatanabeNHasegawaTHorioYet al. Risk factors for pneumonitis in patients with non-small cell lung cancer treated with immune checkpoint inhibitors plus chemotherapy: A retrospective analysis. Thorac Cancer (2022) 13(5):724–31. doi: 10.1111/1759-7714.14308

  • 58

    WangHZhouFZhaoCChengLZhouCQiaoMet al. Interleukin-10 is a promising marker for immune-related adverse events in patients with non-small cell lung cancer receiving immunotherapy. Front Immunol (2022) 13:840313. doi: 10.3389/fimmu.2022.840313

  • 59

    OheYYamazakiNYamamotoNMurakamiHYohKKitanoSet al. The real-world safety of atezolizumab as second-line or later treatment in Japanese patients with non-small-cell lung cancer: a post-marketing surveillance study. Jpn J Clin Oncol (2022) 52(6):623–32. doi: 10.1093/jjco/hyac024

  • 60

    XuHFengHZhangWWeiFZhouLLiuLet al. Prediction of immune-related adverse events in non-small cell lung cancer patients treated with immune checkpoint inhibitors based on clinical and hematological markers: Real-world evidence. Exp Cell Res (2022) 416(1):113157–. doi: 10.1016/j.yexcr.2022.113157

  • 61

    SheshadriAGoizuetaAAShannonVRLondonDGarcia-ManeroGKantarjianHMet al. Pneumonitis after immune checkpoint inhibitor therapies in patients with acute myeloid leukemia: A retrospective cohort study. Cancer (2022) 128(14):2736–45. doi: 10.1002/cncr.34229

  • 62

    UchidaYKinoseDNagataniYTanaka-MizunoSNakagawaHFukunagaKet al. Risk factors for pneumonitis in advanced extrapulmonary cancer patients treated with immune checkpoint inhibitors. BMC Cancer (2022) 22(1):551–. doi: 10.1186/s12885-022-09642-w

  • 63

    UharaHTsuchidaTKiyoharaYAkamatsuASakamotoTYamazakiN. Safety and effectiveness of nivolumab in Japanese patients with Malignant melanoma: Final analysis of a post-marketing surveillance. J Dermatol (2022) 49(9):862–71. doi: 10.1111/1346-8138.16432

  • 64

    IkedaSKatoTKenmotsuHOguraTSatoYHinoAet al. Atezolizumab for pretreated non-small cell lung cancer with idiopathic interstitial pneumonia: final analysis of phase II AMBITIOUS study. Oncologist (2022) 27(9):720–e02. doi: 10.1093/oncolo/oyac118

  • 65

    AbedALawNCalapreLLoJBhatVBowyerSet al. Human leucocyte antigen genotype association with the development of immune-related adverse events in patients with non-small cell lung cancer treated with single agent immunotherapy. Eur J Cancer (2022) 172:98–106. doi: 10.1016/j.ejca.2022.05.021

  • 66

    LuXWangJZhangTZhouZDengLWangXet al. Comprehensive pneumonitis profile of thoracic radiotherapy followed by immune checkpoint inhibitor and risk factors for radiation recall pneumonitis in lung cancer. Front Immunol (2022) 13:918787–. doi: 10.3389/fimmu.2022.918787

  • 67

    RajiOYDuffySWAgbajeOFBakerSGChristianiDCCassidyAet al. Predictive accuracy of the Liverpool Lung Project risk model for stratifying patients for computed tomography screening for lung cancer: a case-control and cohort validation study. Ann Intern Med (2012) 157(4):242–50. doi: 10.7326/0003-4819-157-4-201208210-00004

  • 68

    CassidyAMylesJPvan TongerenMPageRDLiloglouTDuffySWet al. The LLP risk model: an individual risk prediction model for lung cancer. Br J Cancer (2008) 98(2):270–6. doi: 10.1038/sj.bjc.6604158

  • 69

    AzumaA. High prevalence of drug-induced pneumonia in Japan. JMA Journal. (Tokyo, Japan: Japan Medical Association and the Japanese Association of Medical Sciences). (2007) 50(5):405–411.

  • 70

    TollDBJanssenKJMVergouweYMoonsKGM. Validation, updating and impact of clinical prediction rules: A review. J Clin Epidemiol (2008) 61(11):1085–94. doi: 10.1016/j.jclinepi.2008.04.008

  • 71

    SunG-WShookTLKayGL. Inappropriate use of bivariable analysis to screen risk factors for use in multivariable analysis. J Clin Epidemiol (1996) 49(8):907–16. doi: 10.1016/0895-4356(96)00025-X

  • 72

    JamesGWittenDHastieTTibshiraniR. An introduction to statistical learning. New York, NY: Springer (2013).

  • 73

    van GinkelJRLintingMRippeRCAvan der VoortA. Rebutting existing misconceptions about multiple imputation as a method for handling missing data. J Pers Assessment (2020) 102(3):297–308. doi: 10.1080/00223891.2018.1530680

  • 74

    RoystonPAltmanDGSauerbreiW. Dichotomizing continuous predictors in multiple regression: a bad idea. Stat Med (2006) 25(1):127–41. doi: 10.1002/sim.2331

  • 75

    Dávila-DupontDMotola-KubaDDorantes-HerediaRGonzález-AlonsoBKAlcántara-VelardeTGarcía-SantistebanRMartínez-SámanoJEGrimaldo-RoqueHJRuiz-MoralesJMet al. Risk of Pneumonitis with the Use of Different Immune Checkpoint Inhibitors in a Mexican Population. Oncology (2019) 96(5):268–272. doi: 10.1159/000497405

Summary

Keywords

immune checkpoint inhibitor induced pneumonitis, review of methodology, risk of bias assessment, study design, statistical analysis

Citation

Chen YK, Welsh S, Pillay AM, Tannenwald B, Bliznashki K, Hutchison E, Aston JAD, Schönlieb C-B, Rudd JHF, Jones J and Roberts M (2023) Common methodological pitfalls in ICI pneumonitis risk prediction studies. Front. Immunol. 14:1228812. doi: 10.3389/fimmu.2023.1228812

Received

25 May 2023

Accepted

04 September 2023

Published

25 September 2023

Volume

14 - 2023

Edited by

Vassiliki A. Boussiotis, Harvard Medical School, United States

Reviewed by

Ajay Sheshadri, University of Texas MD Anderson Cancer Center, United States; Jennifer Possick, Yale University, United States; Stefania Canova, San Gerardo Hospital, Italy

Updates

Copyright

*Correspondence: Michael Roberts,

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics