Serum Human Epididymis Protein 4 as a Novel Biomarker in Identifying Patients With Interstitial Lung Disease in Rheumatoid Arthritis

Objective: Human epididymis protein 4 (HE4) have been implicated in the pulmonary involvements. We aimed to investigate the clinical utility of HE4 in clinical stratification in patients with rheumatoid arthritis (RA). Methods: This study included a discovery cohort comprising 70 RA patients and 64 healthy controls (HCs), and a validation cohort comprising 98 RA patients and 75 HCs. Human epididymis protein 4 were determined by electrochemical luminescence analyzer. Results: The levels of HE4 were significantly elevated in patients with RA compared to HCs. The positive rates of HE4 in patients with RA and HCs were 50.0% and 0, respectively, in the discovery cohort and 53.1 and 1.3%, respectively, in the validation cohort. When RA patients were subgrouped according to HE4 status, HE4-positive group displayed higher prevalence of interstitial lung disease (ILD) compared to HE4-negative group (28.6 vs. 11.4% in discovery cohort and 57.7 vs. 8.7% in the validation cohort). A positive correlation between the levels of HE4 with the degree of lung impairment was identified. Receiver operating curve (ROC) analysis revealed an optimal cut-off value of 104.3 pmol/L in HE4 for distinguishing RA-ILD from RA-non ILD with the areas under the curve (AUC) of 0.790. Multivariate logistic regression analysis illustrated that high levels of HE4 independently identified patients with RA-ILD (OR, 9.080, p < 0.001). Conclusion: Our findings showed a novel role of HE4 in RA risk stratification, suggest that introducing HE4 to the current RA test panel may serve as an indicator in identifying RA patients for further RA-ILD workups, such as high-resolution computed tomography (HRCT).


INTRODUCTION
Rheumatoid arthritis (RA) is a chronic and progressive autoimmune joint disorder characterized by synovial inflammation, joint destruction, and various extra-articular manifestations (EAMs) (1)(2)(3). Pulmonary complications, particularly interstitial lung disease (ILD), represent an important extra-articular feature of various EAMs and a major cause of mortality in patients with RA (2)(3)(4)(5). A recent study showed that 72% patients with RA-ILD had an inpatient admission and 76% had an emergency room visit (6). Further, clinically significant RA-ILD accounts for 10% of the RA population, and is associated with shortened survival and more severe underlying disease (7). To date, there are few treatments that are proven to be effective in the treatment of RA-ILD, and biomarkers that can predict RA patients at risk for ILD are in great need.
Human epididymis protein 4 (HE4), a human epididymisspecific protein, has been widely utilized in clinical practice as a tumor marker in identifying patients at risk of ovarian cancer (8). Human epididymis protein 4 is expressed in multiple tissues in the oral cavity, the respiratory tracts as well as in renal tubular epithelial cells (9). Previous studies have shown that serum HE4 levels were significantly elevated in patients with systemic sclerosis (SSc)-ILD (10). Further, high levels of HE4 can stratify patients into SSc-ILD subsets (10). We also showed that HE4 can identify primary Sjögren's syndrome (pSS) patients at risk of pulmonary/renal involvements (11). Taken together, these Data are presented as median (interquartile range) and n (%). Anti-CCP, anti-cyclic citrullinated peptide antibody; bDMARDs, biological disease modifying anti-rheumatic drugs; cDMARDs, conventional disease modifying anti-rheumatic drugs; CRP, C-reactive protein; DAS28-ESR, disease activity score-ESR; ESR, erythrocyte sedimentation rate; ILD, interstitial lung disease; NSAIDs, non-steroid anti-inflammatory drugs; RF, rheumatoid factor; SJC28, swollen joint counts in 28 joints; TJC28, tender joint counts in 28 joints.
findings suggest that HE4 may have a diagnostic potential in disease stratification in patients with autoimmune diseases. In this study, we aimed to investigate the clinical utility of HE4 in the diagnosis and disease stratification in RA.

Subjects
To evaluate the clinical utility of HE4 in RA patients, we recruited two cohorts of RA patients, including the discovery cohort and the validation cohort ( Table 1)   diseases, neoplastic, and autoimmune/autoinflammatory diseases were excluded. The study protocol was reviewed and approved by the Ethical Committee of PKUPH (Protocol number: 2019PHB244). Informed consents of discovery cohort were waived for those patients in the discovery cohort. However, all participants in the validation cohort gave written informed consent.

Data Collection
Demographic features and clinical and laboratory findings, including age, disease duration, swollen joint count in 28 joints (SJC28), tender joint count in 28 joints (TJC28), erythrocyte sedimentation rate (ESR), C-reactive protein (CRP), rheumatoid factor (RF), anti-CCP antibodies, EAMs, treatments and medical history, were collected from the medical database of PKUPH. Disease Activity Score (DAS28) was assessed as previous described (13,14).

Assessment of Interstitial Lung Disease
The assessment of ILD was performed by two pulmonologists and two radiologists with more than 10 years of thoracic imaging experience, mainly based on the symptoms and respective abnormalities suggestive of ILD in high-resolution computed tomography (HRCT) and pulmonary function tests (PFTs). The RA-ILD was assessed semi-quantitatively based on the HRCT scans, and any indeterminate ILD were excluded from the analyses. Pulmonary function tests were determined as forced vital capacity (FVC), the median forced expiratory volume in 1 second (FEV1), and diffusing capacity of the lung for carbon monoxide (DLco). Forced vital capacity, FEV1, and DLco were presented as a percentage of the predicted values for the patient's age, sex, and height, as previously described (15). Abnormalities of PFTs were defined as predicted values of FVC < 80% and DLco < 70% (16).

Statistical Analysis
Continuous variables were presented as mean ± standard deviation (SD) for normal distribution or median (interquartile range, IQR) for abnormal distribution. Categorical variables were shown as numbers (percentages) of the total samples. The statistical significance between groups was assessed using the Mann-Whitney U-test, Student t-test, Chi-square (χ 2 )test, where it was applicable. Spearman's correlation test was used to determine the relationships between HE4 and clinical parameters. The receiver operating characteristic (ROC) curve was generated to evaluate the sensitivity, specificity and areas under the ROC curve (AUC) with the 95% confidence interval (95% CI). The optimal cutoff value for predicting the incidence of ILD was identified by calculating the Youden index. Multivariate Logistic regression analyses were used to determine the risk factors for ILD. Once a univariate statistic was generated, the multivariate model was then built using a forward selection procedure. Variables with a p-value of <0.1 in the univariate analysis were first considered as candidates for the multivariate model, then variables with a p-value of <0.05 were used in the final model, and odds ratios (ORs) were calculated with 95% CI. Data analyses were calculated using SPSS 20.0 statistical software

Characteristics of Patients With RA in the Two Cohort
Detailed clinical and laboratory characteristics of the two cohort are presented in Table 1. The median duration of RA patients was 8.5 years for the discovery cohort and 9.5 years for the validation cohort. The median DAS28 was 5.4 (IQR 3.9, 7.2) for RA patients in the discovery cohort and 5.2 (IQR 3.7, 6.8) for RA patients in the validation cohort. Anti-CCP antibodies were positive in 85.7% of RA patients in the discovery cohort and 88.8% of RA patients in the validation cohort. Rheumatoid factor was positive in 74.3% of RA patients in the discovery cohort and 80.6% of RA patients in the validation cohort. The prevalence of RA-ILD was 20.0% in the discovery cohort and 34.7% in the validation cohort, respectively.

The Levels of HE4 Were Significantly Elevated in Patients With RA
The cut-off values for positivity of all tumor markers in the female-tumor biomarker screening panel are listed in Table 2.
Compared to other tumor markers, HE4 displayed the highest positive rates in RA patients (50.0%), followed by CYFRA21-1 (21.4%) and CA125 (20.0%) (  Since HE4 was the most significant marker among all the tumor markers in female-tumor biomarker screening panel, we next focus the clinical performance of HE4 in RA. We first verified these results in the validation cohort. The positive rate of HE4 was 53.1% in this cohort, which was similar to the discovery cohort ( Table 2). A similar trend was also observed in RA patients in the validation cohort.

Characteristics of Patients With RA According to HE4 Levels
Clinical and laboratory characteristics were compared between HE4-positive RA patients and HE4-negative RA patients. Patients in the HE4-positive group displayed similar disease duration and DAS28-ESR (Table 3). Further, HE4-positive patients showed similar laboratory characteristics, including RF positivity, anti-CCP positivity, levels of C3, levels of C4, and levels of IgG ( Table 3). Of interest, the prevalence of RA-ILD in patients positive for HE4 was twice as much as that in patients negative for HE4 (28.6 vs. 11.4%, p = 0.073) ( Table 3). The higher incidence of RA-ILD in HE4-positive RA was also confirmed in the validation cohort (57.7 vs. 8.7%, p < 0.001). Consistent with higher levels of HE4 in patients with RA-ILD, the levels of HE4 displayed a significantly negative correlation with DLco% (r = −0.452, p = 0.006) (Figure 3A). In addition, a negative correlation between the levels of HE4 and FVC% (r = −0.319, p = 0.062) was noticed (Figure 3B). In contrast, no significant correlations between RF or anti-CCP and DLco% or FVC% were identified ( Table 4).

Diagnostic Potential of HE4 in RA-ILD
Receiver operating curve (ROC) analysis was utilized to characterize the clinical performance of HE4 in identifying patients with RA-ILD (Figure 4). Based on Youden index, the optimal cut-off value of HE4 for distinguishing RA-ILD from RAnon ILD was 104.3 pmol/L with an area under the curve (AUC) of 0.790 and a sensitivity and a specificity of 70.8 and 77.5%, respectively. In addition, ROC analysis was also performed to calculate the cut-off values of high-CCP and high-RF, and the optimal levels were 133.14 U/ml and 170.5 IU/ml, respectively. These cut-off values were further used to identify which RA patients are at risk of having ILD ( Table 5). Univariable analysis showed that male gender (p = 0.002), old age (p = 0.012), and patients positive for HE4 (p < 0.001) were risk factor in predicting RA-ILD. In contrast, RA-non ILD patients tended to show higher SJC28 (p = 0.024) and TJC28 (p = 0.002) compare to patients with RA-ILD. Besides, patients with ILD had higher percentages of high-RF and high-anti-CCP than those without,  Table 5).

DISCUSSION
Human epididymis protein 4 has been widely utilized as an effective biomarker in the diagnosis and follow-up of patients with ovarian cancer. Of interest, accumulating evidence suggest that HE4 may also has diagnostic potential in other clinical settings, including lung adenocarcinomas (17), renal fibrosis (18), cystic fibrosis (19), as well as autoimmunerelated organ involvements, such as SSc-ILD (10) and pSSrelated pulmonary/renal involvements (11). In this study, we extended the clinical application of HE4 into the diagnosis of RA. We found that the levels of HE4 were significantly elevated in patients with RA, particularly in patients with RA-ILD. Further, we showed that high levels of HE4 were an independent factor for identifying patients with RA-ILD. Given that biomarkers predicting RA patients at risk for ILD are currently lacking, our findings thus represent an important endeavor in risk stratification and clinical subset identification in patients with RA.
In this study, we found that the levels of HE4 were elevated in more than half of RA patients. When we further subgrouped RA patients according to HE4 status, we found that HE4-positive RA group had a higher percentage of RA-ILD compared to HE4negative RA group. This phenotype was also verified in a separate RA cohort. Recently, Zhang et al. reported that serum HE4 levels were significantly increased in patients with SSc-ILD compared to SSc-non-ILD, which was consistent with our findings (10). Further, we found that high levels of HE4 were independently associated with the presence of RA-ILD in a multivariate logistic regression analysis. Consistent with our results, Nishiyama et al. also showed that HE4 was a new biomarker to predict the prognosis of progressive fibrosing ILD (20). Taken together, these findings support a role of HE4 in risk stratification of ILD.
In this study, we showed a positive correlation between the levels of HE4 with the degree of lung impairment. Human epididymis protein 4 has been shown to be expressed in respiratory epithelium (9) and increased expression of HE4 was identified in lung biopsy from patients with cystic fibrosis (CF) (17). A recent study found that elevated levels of HE4 was positively associated the degree of pulmonary dysfunction in patients with CF (19), which was consistent with our study. Further, the levels of HE4 inversely correlated with lung function improvement in CF patients after treatment, suggesting a diagnostic potential of HE4 in routine clinical and laboratory follow-up of CF treatment (21). Although it remains unclear whether HE4 is implicated in the pathogenesis of RA-ILD, LeBleu et al. have showed that HE4 can suppress the activity of multiple proteases, including serine proteases and matrix metalloproteinases, and specifically inhibits their capacity to degrade type I collagen, thereby promoting the development of kidney fibrosis (18). Further studies are needed to define the functional relevance of HE4 in the pathogenesis of RA-ILD. Of interest, 25 (72%) RA patients in the discovery cohort and 22 (42%) RA patient in the validation cohort were positive for HE4, but did not have ILD, suggesting other RA-related factors may influenced the levels of HE4. Further studies investigating how the levels of HE4 were modulated in the context of RA will be of great importance. In addition, Krebs von den Lungen-6 (KL-6) has been proposed as a potential biomarker in the diagnosis of ILD (22). It will be of great interest to assess whether combination of KL-6 with HE4 can improve the diagnostic value of each single biomarker.
Our study has a number of notable strengths. To the best of our knowledge, our study represents the first study investigating the clinical performance of HE4 in risk stratification of RA. Our findings thus expand our understanding of the clinical utility of HE4 in clinical practice, especially in rheumatoid diseases, such as RA. It should be noted, however, that our study has several limitations. First, it was a single-center study with a small RA cohort. Second, most participants of our study were female, which may result in analysis bias, as RA-ILD are more frequent in male patients. Third, patients of this study had long disease durations, and the levels of HE4 can't represent the onset status. Fourth, longitudinal examination of HE4 in patients with RA was missing. Further multi-center studies with a larger cohort comprising more male patients will be needed to corroborate our findings.
In summary, our findings showed a novel role of HE4 in RA risk stratification, suggesting that introducing HE4 to the current RA test panel (i.e., anti-CCP and RF) may provide additional diagnostic value to the current clinically available assays, especially in identifying RA patients for further RA-ILD workups, such as HRCT. Since RA-ILD represents a major complication responsible for morbidity and mortality in RA, this simple and highly reproducible biomarker, which has been already available in routine clinical practice in some countries, would be of great importance to risk-stratify RA patients for the performance of HRCT.