Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Oncol., 22 January 2026

Sec. Gastrointestinal Cancers: Hepato Pancreatic Biliary Cancers

Volume 16 - 2026 | https://doi.org/10.3389/fonc.2026.1776032

This article is part of the Research TopicImmunotherapy Optimization in Hepatocellular CarcinomaView all articles

Comparative effectiveness of two first-line, ICI-based regimens for advanced HCC: a target trial emulation using an electronic medical record network

Chihiro Shiraishi*&#x;Chihiro Shiraishi1*†Miho ShigyouMiho Shigyou1Ryuichi InoueRyuichi Inoue2Toru Ogura&#x;Toru Ogura3†Susumu KaneshigeSusumu Kaneshige2Toshinobu Hayashi&#x;Toshinobu Hayashi1†
  • 1Department of Pharmaceutical Sciences for Health Crisis Management, Faculty of Pharmaceutical Sciences, Fukuoka University, Fukuoka, Japan
  • 2Department of Pharmacy, Fukuoka University Hospital, Fukuoka, Japan
  • 3Clinical Research Support Center, Mie University Hospital, Tsu, Mie, Japan

Introduction: Head-to-head comparative evidence of the relative efficacies of atezolizumab plus bevacizumab (Atezo+Bev) and tremelimumab plus durvalumab (Treme+Dur) as first-line therapies for advanced hepatocellular carcinoma (HCC) remains limited. Thus, in the present study, we compared the real-world efficacy and safety of these two modalities using a target trial emulation approach.

Methods: Using the TriNetX Research Network, we identified adults (≥20 years) with HCC (ICD-10 C22.0) who initiated first-line Atezo+Bev or Treme+Dur (November 2022– November 2024). Propensity score matching (1:1) was used to balance the baseline characteristics. The primary outcome measure was overall survival (OS). Secondary outcomes included 1- or 2-year OS and organ-specific immune-related adverse events (irAEs) within 12 months, based on pre-specified ICD-10 definitions.

Results: After matching, 640 patients were included in each group. Residual imbalance persisted in hepatic reserve markers (albumin, international normalized ratio, and platelet count; standardized mean differences >0.1). One-year OS was numerically higher in the Atezo+Bev group than in the Treme+Dur group (61% vs 55%; hazard ratio [HR], 0.806; 95% confidence interval [CI], 0.665–0.976; p = 0.027). Two-year OS (HR, 0.864; 95% CI, 0.723–1.031; p = 0.105) and overall OS showed no significant differences (median: 19.4 vs 19.0 months [591 vs 578 days]; HR, 0.886; 95% CI, 0.743–1.057; p = 0.179). Most irAEs were similar; however, the time-to-first hepatic irAEs favored Atezo+Bev (HR, 0.678; 95% CI, 0.487–0.943; p = 0.020). Notably, respiratory irAEs occurred significantly earlier in the Treme+Dur group than in the Atezo+Bev group (mean onset: 2.4 vs 3.2 days; p = 0.031).

Conclusions: In this real-world target trial emulation, Atezo+Bev and Treme+Dur demonstrated broadly comparable long-term OS rates when used as first-line therapies for HCC. While baseline hepatic reserve plays an important role, the treatment regimen itself may contribute to earlier onset of hepatic and respiratory irAE. Careful monitoring for early-onset hepatic dysfunction and respiratory irAEs may be warranted in patients treated with Treme+Dur combination therapy.

1 Introduction

Hepatocellular carcinoma (HCC) remains a leading cause of cancer-related mortality worldwide, and a substantial proportion of patients present with or eventually develop advanced-stage disease requiring systemic therapy (1). In patients with advanced HCC, outcomes are determined not only by tumor burden but also by the severity of the underlying liver disease, which can limit treatment eligibility, affect tolerability, and strongly influence prognosis (2). Consequently, in patients with heterogeneous hepatic reserves, routine treatment selection requires balancing antitumor efficacy with safety and feasibility.

In recent years, immune checkpoint inhibitor (ICI)-based combination regimens have reshaped first-line treatment for advanced HCC. Atezolizumab plus bevacizumab (Atezo+Bev) showed improvements in overall and progression-free survival compared to sorafenib in a pivotal randomized trial and has since become a widely used first-line treatment option (3). Recently, tremelimumab plus durvalumab (Treme+Dur) demonstrated a survival benefit over sorafenib and is now incorporated into the guideline-recommended first-line therapy (4). Although both regimens have been established as standards of care, direct head-to-head comparative evidence is lacking, and cross-trial comparisons are inherently limited by differences in eligibility criteria, baseline liver function, subsequent therapies, and outcome ascertainment.

Real-world comparative studies can help address this evidence gap by evaluating the effectiveness and safety of this treatment in broader patient populations encountered in clinical practice. In practice, regimen choice is commonly shaped by hepatic reserve, endoscopic variceal ligation (EVL) history (a proxy for clinically significant portal hypertension)-related bleeding risk and anticipated immune-mediated toxicities (5). However, prior observational comparisons have often been constrained by heterogeneous comparator definitions (e.g., the inclusion of durvalumab monotherapy) and incomplete or non-organ-specific assessments of immune-related adverse events (irAEs) (6).

Target trial emulation provides a principled framework to strengthen causal interpretation using observational data by explicitly specifying the components of a hypothetical randomized trial—eligibility criteria, treatment strategies, time zero (index date), follow-up, outcomes, and estimands—and by operationalizing them in real-world datasets (7, 8). In the present study, we emulated a target trial using the TriNetX Research Network, a large multi-institutional electronic medical record (EMR) network, to compare the efficacy of Atezo+Bev with that of Treme+Dur as a first-line systemic therapy for patients with advanced HCC. The primary objective was to evaluate the overall survival (OS). The secondary objectives included assessing the 1- or 2-year OS and characterizing organ-specific irAEs, with particular attention paid to hepatic events, given the central role of baseline liver disease in this population.

2 Materials and methods

2.1 Target trial emulation framework

We conducted a targeted trial using observational data to evaluate the efficacy of Atezo+Bev vs Treme+Dur in patients with advanced HCC (International Classification of Diseases, Tenth Revision [ICD-10] code C22.0). This approach defines the key elements of a hypothetical RCT and replicates them in a retrospective cohort setting, with particular emphasis placed on the eligibility criteria, treatment strategies, follow-up, and analytical methods.

2.2 Study design and data source

We conducted a retrospective cohort study using the TriNetX Research Network, a large multi-institutional EMR network (9, 10). TriNetX provides access to de-identified EMR from approximately 250 million patients across academic medical centers, specialty practices, and community hospitals. The available data included demographics, diagnoses, procedures, medications, laboratory results, and clinical documentation. Diagnoses were identified using ICD-10-CM codes, and medications were identified using RxNorm terminology (Supplementary Table S1). A natural language processing network within the TriNetX Analytics Platform was also used. The database reflects racial and ethnic diversity, including White (Demographics: 2106-3), Black or African American (2054-5), Hispanic or Latino (2135-2), Asian (2028-9), and other ethnic groups.

2.3 Eligibility criteria

Patients aged ≥20 years with a diagnosis of HCC (ICD-10 code C22.0) were eligible. Two cohorts were defined: (1) patients who received Atezo+Bev and (2) patients who received Treme+Dur as first-line systemic treatment. All patients were treated between November 2022 and November 2024, with follow-up censored administratively on November 30, 2025. Thus, the maximum available follow-up varied according to the index date. To define first-line therapy status, we excluded patients with prior sorafenib or lenvatinib exposure before the index date. Additionally, patients in the Atezo+Bev cohort who had received prior Treme+Dur treatment and those in the Treme+Dur cohort who had received prior Atezo+Bev treatment were excluded. Patients who received subsequent systemic treatments after the index date were retained in the cohort, consistent with an observational analogue of an intention-to-treat framework. The definitions of the conditions and disease classifications are provided in Supplementary Table S1. The index date (day 0) was defined as the date of the initiation of Atezo+Bev or Treme+Dur administration. These criteria are summarized in Table 1.

Table 1
www.frontiersin.org

Table 1. Target trial emulation of Atezo+Bev and Treme+Dur.

2.4 Outcomes and follow-up

The primary outcome was total OS, defined as the time from the initiation of systemic therapy to death from any cause. Secondary analyses evaluated the 1- or 2-year OS and irAEs identified using ICD-10 codes categorized by organ system based on previously published literature and clinical relevance (Supplementary Table S1) (11). Although organ-specific irAEs were identified using ICD-10 codes, prior validation studies have demonstrated high concordance between TriNetX-coded data and clinical documentation from multi-institutional medical records (11). IrAEs were defined as newly recorded ICD-10 diagnoses after the index date that were not adjudicated for attribution or grade. The time to risk was measured from the index date until death or loss to follow-up. For irAEs, follow-up was limited to 12 months after treatment initiation to capture the early onset toxicities typically associated with ICIs. To reduce misclassification, patients with any relevant irAEs-related ICD-10 codes within 12 months before ICI initiation were excluded from the organ−specific irAE analyses. Organ system categories included cardiac, endocrine, musculoskeletal, rheumatological, hematological, hepatic, gastrointestinal, respiratory, renal, ocular, neurological, and cutaneous.

2.5 Covariates

Baseline covariates were grouped into the demographic, liver-related, comorbidity, metastatic, laboratory, and medication domains. Demographic variables included age, sex, body mass index (BMI, kg/m²), tobacco use, nicotine dependence, and race/ethnicity (White, Black or African American, Hispanic or Latino, and Asian). Liver-related factors included secondary esophageal varices, portal vein thrombosis, chronic viral hepatitis, alcoholic liver disease, non-alcoholic steatohepatitis (NASH), other liver diseases, other venous embolism/thrombosis, and alpha-fetoprotein (AFP) levels. Comorbidities included hypertension, dyslipidaemia, ischaemic heart disease, cerebrovascular disease, and proteinuria. The metastatic disease burden was characterized using indicators for secondary malignant neoplasms of the respiratory and digestive organs, bone and bone marrow, brain and cerebral meninges, lymph nodes, and other non-infective lymphatic disorders. Laboratory data included alanine aminotransferase (ALT, U/L), aspartate aminotransferase (AST, U/L), platelet count (×109/L), total bilirubin (mg/dL), albumin (ALB, g/dL), international normalized ratio (INR), and lactate dehydrogenase (U/L). Medication-related variables included anticoagulant, platelet aggregation inhibitor, and systemic glucocorticoid use (Supplementary Table 1).

2.6 Statistical analysis

Baseline characteristics are summarized as means with standard deviations or frequencies with percentages for continuous and categorical variables, respectively. Continuous covariates (e.g. age and BMI) were entered into the models as continuous terms, whereas all other covariates were treated as categorical variables.

Propensity scores were estimated using logistic regression models that included all the baseline covariates. A 1:1 greedy nearest-neighbor matching algorithm without replacement, with a calliper width of 0.2 standard deviations of the logit of the propensity score, was applied to balance the baseline characteristics between the treatment groups. Covariate balance was assessed using absolute standardized mean differences (SMDs), with SMD < 0.1, considered indicative of acceptable balance, set as the target.

OS was analyzed in the propensity score–matched cohort using the Kaplan–Meier method. Survival probabilities at 1 or 2 years, as well as total survival, were estimated from Kaplan–Meier curves for each treatment group, while hazard ratios (HRs) with 95% confidence intervals (CIs) were obtained from Cox proportional hazards models with the treatment group (Atezo+Bev vs Treme+Dur) set as the main exposure. TriNetX reports Kaplan–Meier estimates among patients with follow-up information available at each horizon; denominators vary by time point. In addition, to identify independent predictors of hepatic irAEs and 1-year mortality, exploratory multivariable Cox models were fitted. Owing to platform limitations, this analysis was conducted in the overall (pre-matched) cohort, including the treatment group and other clinically relevant covariates. A two-tailed p-value of <0.05 was considered statistically significant. Cohort identification and statistical analyses were conducted on 4 December 2025 using the TriNetX platform query builder and analytical functions.

2.7 Negative control outcome

In target trial emulation, negative controls are used to detect and quantify residual bias, such as unmeasured confounding or systematic error, in observational analyses aimed at estimating causal effects. A negative control refers to an exposure, outcome, or association that is known or strongly believed not to be causally related to the exposure of interest. Therefore, any observed association indicates the presence of potential bias in the study design or analysis. Incorporating negative controls helps assess the validity of the emulation and the robustness of causal inference by determining whether the analytic methods produce spurious associations where none are expected (12). To examine residual confounding, knee osteoarthritis was selected as a negative control outcome, as it is not expected to be influenced by ICI therapy. The incidence rates and comparative risk estimates for this outcome were calculated using the same analytical methods applied to the primary outcomes (13, 14).

3 Results

3.1 Baseline characteristics

Table 2 summarizes the baseline characteristics of the patients who received Atezo+Bev or Treme+Dur before and after propensity score matching (PSM). In the unmatched cohort, several clinically relevant imbalances were observed between the two groups. Patients receiving Treme+Dur had a higher prevalence of liver-related complications and comorbidities, including secondary esophageal varices, portal vein thrombosis, alcoholic liver disease, NASH, other liver diseases, hypertension, dyslipidaemia, ischaemic heart diseases, and proteinuria. The Treme+Dur group also had more frequent cardiovascular risk factors and showed a greater use of anticoagulant, antiplatelet agent, and glucocorticoid. In contrast, chronic viral hepatitis and Asian ethnicity were more common in the Atezo+Bev group. Laboratory data further indicated worse liver function in the Treme+Dur group, with lower ALB levels and a higher INR at baseline.

Table 2
www.frontiersin.org

Table 2. Baseline characteristics.

Before matching, 1,394 patients in the Atezo+Bev group and 698 in the Treme+Dur group met the eligibility criteria. After 1:1 propensity score matching, 640 patients were successfully matched in each treatment group. This corresponded to retention rates of 46.0% for Atezo+Bev and 91.7% for Treme+Dur, with unmatched patients excluded because suitable matches within the predefined caliper were unavailable. Most baseline characteristics were well-balanced between the groups; however, residual imbalance persisted in several laboratory surrogates of hepatic reserve, including ALB, INR, and platelet count (SMDs >0.1).

3.2 Overall survival in the PSM cohorts

Throughout the entire follow-up period, there was no statistically significant difference in OS between the Atezo+Bev and Treme+Dur groups (median OS, 19.4 vs 19.0 months [591 vs 578 days]; HR, 0.886; 95% CI, 0.743–1.057; p = 0.179) (Figure 1). In the propensity score–matched cohort, the Kaplan–Meier–estimated 1-year OS was 61.1% in the Atezo+Bev group and 54.9% in the Treme+Dur group (HR, 0.806; 95% CI, 0.665–0.976; p = 0.027). At 2 years, the corresponding OS estimates were 43.9% and 42.9%, respectively (HR, 0.864; 95% CI, 0.723–1.031; p = 0.105). The survival estimates are summarized in Table 3.

Figure 1
Kaplan–Meier survival curves over days after treatment initiation, comparing Atezo+Bev (purple line) with Treme+Dur (green line). Survival probability on the y-axis decreases stepwise over follow-up in both groups, with the two curves running close together across most of the period. A table beneath the plot lists, at each time point, the number at risk, the number censored, and the number of events for each treatment group.

Figure 1. Overall survival in patients treated with Atezo+Bev or Treme+Dur. Atezo, atezolizumab; Bev, bevacizumab; Dur, durvalumab; OS, overall survival; Treme, tremelimumab. Kaplan-Meier survival curves for Atezo+Bev versus Treme+Dur in advanced HCC. Kaplan-Meier curves showing overall survival probability for patients with advanced HCC treated with Atezo+Bev (purple line) versus Treme+Dur (green line). The x-axis represents the days after treatment initiation, and the y-axis shows the survival probability. Shaded areas represent 95% CIs.

Table 3
www.frontiersin.org

Table 3. Overall survival in the PSM cohorts.

3.3 Immune-related adverse events with time to first onset in the PSM cohorts

Within the first year after treatment initiation, organ-specific irAEs were recorded and categorized in the PSM cohort (Table 4). The incidence of most irAEs was similar between the Atezo+Bev and Treme+Dur groups. For cardiac, endocrine, musculoskeletal, hematologic, gastrointestinal, respiratory, neurological, and cutaneous events, the risk ratios were close to 1.0, and the risk differences were small, with a 95% CI crossing the null. Rheumatological, renal, and ocular irAEs were not observed in either group; therefore, effect estimates could not be calculated. Hepatic irAEs showed the largest numerical difference, occurring in 28% of patients in the Atezo+Bev group and 35% of those in the Treme+Dur group; however, this difference was not statistically significant (risk ratio, 0.798 [95% CI, 0.607–1.049]; risk difference, -7.117% [95% CI, -15.678–1.444]).

www.frontiersin.org

. Table 4A Immune-related adverse events in the PSM cohorts.

www.frontiersin.org

. Table 4B Time to first onset of immune-related adverse events in the PSM cohorts.

In addition to incidence, the mean time to the first onset of irAEs was also evaluated. For most organ systems, the onset of irAEs occurred within the first few days after treatment initiation in both groups. Notably, respiratory irAEs occurred significantly earlier in the Treme+Dur group than in the Atezo+Bev group (mean onset: 2.4 vs 3.2 days; p = 0.031), whereas the timing of onset for other irAEs did not differ significantly.

Supplementary Table 3 shows a Kaplan–Meier estimate of the time to first irAEs in the PSM cohort. In a complementary time-to-event analysis focusing on hepatic irAE, patients treated with Atezo+Bev exhibited a significantly lower hazard of hepatic irAE than those treated with Treme+Dur (HR, 0.678; 95% CI, 0.487–0.943; p = 0.020). The other irAEs categories did not demonstrate any significant between-group differences in the time-to-event analyses.

3.4 Negative control outcomes in the PSM cohort

To assess the potential unmeasured confounding factors, the incidence of knee osteoarthritis was compared between the two treatment groups. There was no significant difference in the incidence (1.8% vs 1.8%; HR, 0.922; 95% CI, 0.384–2.217; p = 0.856), providing some reassurance against large residual confounding in the primary analysis (data not shown).

3.5 Exploratory cox analyses in the pre-PSM cohort of hepatic events

To further explore factors associated with hepatic irAEs, we fitted a multivariate Cox proportional hazards model in the overall (pre-matched) cohort. As this analysis was conducted prior to propensity score matching, it was considered exploratory and supportive rather than a primary analysis (Supplementary Table 4; Supplementary Figure 1). In this model, treatment with Atezo+Bev was associated with a significantly lower hazard of hepatic irAEs than Treme+Dur (HR, 0.830; 95% CI, 0.715–0.963; p = 0.014).

Among baseline characteristics, alcoholic liver disease (HR, 1.573; 95% CI, 1.320–1.876; p < 0.001), fatty liver (HR, 1.373; 95% CI, 1.107–1.702; p = 0.004), NASH (HR, 1.887; 95% CI, 1.521–2.342; p < 0.001), and elevated AST levels (HR per unit increase, 1.002; 95% CI, 1.001–1.002; p < 0.001) were independently associated with an increased hazard of hepatic irAEs, whereas platelet count showed a modest inverse association (HR, 0.999; 95% CI, 0.998–1.000; p = 0.028). Other covariates, including age, sex, glucocorticoid use, chronic viral hepatitis, and ALT levels, were not significantly associated with hepatic events in this study.

3.6 Exploratory cox analyses in the pre-PSM cohort of 1-year mortality

In an exploratory multivariable Cox regression analysis of 1-year mortality conducted in the overall (pre-matched) cohort (Supplementary Table 5), treatment with Atezo+Bev was associated with a numerically lower hazard of death compared with Treme+Dur (HR, 0.865; 95% CI, 0.733–1.021; p = 0.087); however, this difference did not reach statistical significance. In contrast, several baseline patient characteristics were strongly associated with 1-year mortality. Markers of portal hypertension and advanced liver disease severity emerged as the most influential predictors: portal vein thrombosis (HR, 1.447; 95% CI, 1.199–1.746; p < 0.001) and a history of EVL (HR, 1.639; 95% CI, 1.203–2.234; p = 0.002) were each associated with a substantially increased risk of death.

Elevated INR was associated with more than a two-fold increase in 1-year mortality risk (HR, 2.071; 95% CI, 1.641–2.615; p < 0.001), whereas higher serum albumin levels were strongly protective (HR, 0.486; 95% CI, 0.424–0.556; p < 0.001).

Additional factors associated with increased mortality included older age (HR per year, 1.012; 95% CI, 1.004–1.021; p = 0.004) and the presence of secondary malignant neoplasms at other or unspecified sites (HR, 1.346; 95% CI, 1.051–1.725; p = 0.019). Platelet count showed a small but statistically significant positive association with mortality (HR, 1.001; 95% CI, 1.000–1.002; p = 0.001).

4 Discussion

In this large, real-world target trial using a multi-institutional EMR network, Atezo+Bev and Treme+Dur showed broadly comparable long-term OS rates when used as first-line systemic therapies for HCC. From a clinical perspective, our findings indicate that the overall incidence of irAE was broadly comparable between the two regimens, except for hepatic irAE and early-onset of respiratory irAE, which were more frequent in the Treme+Dur group. These findings suggest that treatment selection can be guided by individual patient characteristics rather than safety concerns alone.

Although direct comparisons across studies should be interpreted with caution, the survival outcomes observed in this real-world analysis appear broadly similar to those reported in the IMbrave150 and HIMALAYA randomized trials (3, 4), suggesting that comparable long-term benefits can be achieved with either regimen in clinical practice. While a numerical difference in 1-year OS was observed, this early separation likely reflects differences in baseline disease severity rather than a sustained causal effect of treatment. Although proxies of portal hypertension (prior EVL and portal vein thrombosis) were well balanced after PSM (15), modest residual imbalances persisted in markers of hepatic reserve (albumin, INR, and platelet count), which are indicative of hepatic synthetic function and associated with short-term mortality risk (Supplementary Table 5) (1620), highlighting the influence of baseline clinical characteristics on early outcomes and the potential for residual confounding.

Among organ-specific irAEs, hepatic events represented the most notable safety signal in this study. Although the 12-month cumulative incidence of hepatic irAEs was numerically higher in the Treme+Dur group than in the Atezo+Bev group (35% vs 28%), this difference did not reach statistical significance. In contrast, the time-to-first-event analysis demonstrated a significantly lower hazard of the first hepatic irAE with Atezo+Bev, suggesting a higher cumulative incidence of early hepatic events in the Treme+Dur group rather than a difference in mean time to onset.

Immune-mediated hepatitis, which can be a fatal, manifests in several pathological patterns, including cholestatic, hepatocellular, and mixed injury (21). From a mechanistic perspective, CTLA-4 inhibitors primarily exert their immunomodulatory effects during the early phases of T-cell activation within lymphoid organs, whereas PD-1/PD-L1 inhibitors act later in peripheral tissues (22, 23). A previous study showed that combination therapies involving PD-1/PD-L1 inhibitors plus CTLA-4 inhibitors are associated with a higher risk of hepatic laboratory abnormalities, including elevation in ALT, AST, GGT, and bilirubin, compared with PD-1/PD-L1 inhibitor monotherapy, whereas no significant differences have been observed in the risk of grade 3–5 adverse events. These findings suggest that while immune-related liver injury may occur more frequently or earlier with combination therapy, severe hepatotoxicity does not necessarily increase, and treatment rechallenge may remain feasible in selected patients. Consistent with these observations, our study did not identify a higher overall burden of hepatotoxicity with Treme+Dur compared with Atezo+Bev therapy. These findings reinforce a central clinical principle in advanced HCC: patient outcomes are shaped not only by antitumor efficacy, but also—often predominantly—by the severity of the underlying liver disease and portal hypertension (2426).

The incidence of respiratory irAEs has been reported to be <1% with Atezo+Bev and approximately 3% with Treme+Dur in patients with HCC, with most events being mild to moderate and severe cases remaining rare (27). Respiratory irAEs have been observed to occur across a broad time range, from as early as 9 days to as late as 19 months after initiation of ICI therapy. Notably, combination regimens involving CTLA-4 and PD-1/PD-L1 inhibitors are more frequently associated with earlier onset, often within the first 1–2 months, compared to monotherapy (27, 28). In our study, although the overall incidence of respiratory irAEs was comparable between the groups, patients receiving the Treme+Dur regimen experienced significantly earlier onset than those in the Atezo+Bev group (mean onset: 2.4 vs. 3.2 days; p = 0.031). Importantly, known risk factors such as smoking history and the presence of lung metastases were well balanced between the groups (29), suggesting that the difference in timing may be attributable to the treatment regimen itself. These findings have important implications for clinical practice. In particular, the earlier onset of respiratory irAEs with the STRIDE regimen underscores the need for enhanced monitoring during the first 1–3 months of therapy. Recommended strategies include baseline pulmonary assessment, frequent follow-up visits, and prompt imaging in response to new respiratory symptoms. Furthermore, previous studies have consistently shown a higher frequency of irAEs with CTLA-4–containing regimens (30), supporting the possibility that early toxicity may contribute to treatment discontinuation and potentially influence early survival differences. Prospective studies and the identification of predictive biomarkers remain critical to improving early detection and management of irAEs. Until such tools become available, clinicians should maintain a high index of suspicion and implement robust monitoring protocols during the initial phase of ICI therapy.

This study has several strengths, including an explicit target trial emulation design, a large and diverse real-world dataset encompassing both academic and community practice settings, comprehensive baseline covariate adjustment using PSM, and a prespecified organ system–based framework for irAE ascertainment. In addition, the negative control outcome analysis using knee osteoarthritis did not demonstrate any detectable between-group differences, providing reassurance against the presence of substantial residual confounding.

This study had several limitations. Firstly, residual confounding factors cannot be fully eliminated, particularly for hepatic reserve and tumor burden, due to the incomplete availability of granular staging information (e.g. BCLC), liver function scores (e.g. Child–Pugh/ALBI), radiographic response, and subsequent therapies. Second, mortality ascertainment in the EMR networks may be incomplete owing to possible out-of-network deaths or delayed reporting (31). Third, irAEs identified via ICD-10 codes may be misclassified, lack grading and attribution, and are subject to potential under-coding of mild events and documentation heterogeneity across institutions (32). Fourth, multiple outcomes were examined without any formal multiplicity adjustment, which may increase the risk of type I error, and follow-up was limited by administrative censoring, restricting evaluation of longer-term survival “tails” and late-onset toxicities. Fifth, the laboratory variables also showed substantial missingness, which may have affected matching and comparability. Sixth, owing to platform constraints, inverse probability weighting and flexible covariate adjustment were not feasible. Therefore, we employed propensity score matching, which is among the validated causal inference tools supported within the TriNetX environment. The pre-matched cohort may be subject to residual confounding and should not be interpreted as confirmatory.

5 Conclusion

In this real-world target trial emulation, the overall antitumor effectiveness and incidence of irAEs were broadly comparable between Atezo+Bev and Treme+Dur as first-line therapies for HCC. Although no major differences in overall toxicity burden were observed, Treme+Dur was associated with an earlier onset of hepatic adverse events, indicating that careful monitoring for early hepatic dysfunction is warranted with this regimen. Importantly, baseline liver disease severity and portal hypertension appeared to exert a greater influence on prognosis than treatment regimen itself. Future studies with longer follow-up and more granular clinical staging, liver function assessments, and treatment sequencing data are needed to clarify whether early between-regimen differences reflect causal treatment effects.

Data availability statement

The datasets presented in this article are not readily available because the dataset used in this study was obtained from the TriNetX research network and is subject to data use agreements and privacy regulations. The data are de-identified and cannot be shared publicly. Access to the dataset is restricted to licensed users of the TriNetX platform and cannot be made available through a public repository or upon request. Requests to access the datasets should be directed to https://trinetx.com/products/real-world-datasets/.

Ethics statement

The studies involving humans were approved by The Ethics Committee of Fukuoka University (approval number: F25-12-001). The studies were conducted in accordance with the local legislation and institutional requirements. The human samples used in this study were acquired from This point-and-click platform enables observational analyses using curated variable definitions and a limited set of analytical tools. Informed consent from participants or their legal guardians was waived because the study used de-identified data and complied with national legislation and institutional policies. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

CS: Writing – review & editing, Investigation, Data curation, Writing – original draft, Conceptualization, Visualization, Formal analysis, Methodology. MS: Writing – review & editing. RI: Writing – review & editing. TO: Writing – review & editing, Formal analysis. SK: Writing – review & editing. TH: Supervision, Writing – review & editing.

Funding

The author(s) declared that financial support was not received for this work and/or its publication.

Acknowledgments

We are grateful to the physicians, surgeons, nurses, and pharmacists for assisting and caring for the patients. We would like to thank Editage (www.editage.com) for English language editing.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2026.1776032/full#supplementary-material

References

1. El Hajra I, Sanduzzi-Zamparelli M, Sapena V, Muñoz-Martínez S, Mauro E, Llarch N, et al. Outcome of patients with HCC and liver dysfunction under immunotherapy: a systematic review and meta-analysis. Hepatology. (2023) 77:1139–49. doi: 10.1097/HEP.0000000000000030

PubMed Abstract | Crossref Full Text | Google Scholar

2. Gordan JD, Kennedy EB, Abou-Alfa GK, Beal E, Finn RS, Gade TP, et al. Systemic therapy for advanced hepatocellular carcinoma: ASCO guideline update. J Clin Oncol. (2024) 42:1830–50. doi: 10.1200/JCO.23.02745

PubMed Abstract | Crossref Full Text | Google Scholar

3. Finn RS, Qin S, Ikeda M, Galle PR, Ducreux M, Kim TY, et al. Atezolizumab plus bevacizumab in unresectable hepatocellular carcinoma. N Engl J Med. (2020) 382:1894–905. doi: 10.1056/NEJMoa1915745

PubMed Abstract | Crossref Full Text | Google Scholar

4. Lau G, Sangro B, Cheng AL, Kudo M, Kelley RK, Tak WY, et al. Immune-mediated adverse events and overall survival with tremelimumab plus durvalumab and durvalumab monotherapy in unresectable HCC: HIMALAYA phase III randomized clinical trial. Hepatology (2025). doi: 10.1097/HEP.0000000000001385

PubMed Abstract | Crossref Full Text | Google Scholar

5. Kaplan DE, Ripoll C, Thiele M, Fortune BE, Simonetto DA, Garcia-Tsao G, et al. AASLD Practice Guidance on risk stratification and management of portal hypertension and varices in cirrhosis. Hepatology. (2024) 79:1180–211. doi: 10.1097/HEP.0000000000000647

PubMed Abstract | Crossref Full Text | Google Scholar

6. Kournoutas I, Marell P, Gile J, Peersen A, Shah P, VanDommelen K, et al. First-line atezolizumab/bevacizumab or durvalumab/tremelimumab in advanced hepatocellular carcinoma: a real world, multicenter retrospective study. Oncologist. (2025) 30:oyaf286. doi: 10.1093/oncolo/oyaf286

PubMed Abstract | Crossref Full Text | Google Scholar

7. Hernán MA, Wang W, and Leaf DE. Target trial emulation: A framework for causal inference from observational data. JAMA. (2022) 328:2446–47. doi: 10.1001/jama.2022.21383

PubMed Abstract | Crossref Full Text | Google Scholar

8. Matthews AA, Danaei G, Islam N, and Kurth T. Target trial emulation: applying principles of randomized trials to observational studies. BMJ. (2022) 378:e071108. doi: 10.1136/bmj-2022-071108

PubMed Abstract | Crossref Full Text | Google Scholar

9. Data Sets. TriNetX. Available online at: https://trinetx.com/products/real-world-datasets/ (Accessed December 4, 2025).

Google Scholar

10. Palchuk MB, London JW, Perez-Rey D, Drebert ZJ, Winer-Jones JP, Thompson CN, et al. A global federated real-world data and analytics platform for research. JAMIA Open. (2023) 6:ooad035. doi: 10.1093/jamiaopen/ooad035

PubMed Abstract | Crossref Full Text | Google Scholar

11. Wan G, Chen W, Khattab S, Roster K, Nguyen N, Yan B, et al. Multi-organ immune-related adverse events from immune checkpoint inhibitors and their downstream implications: a retrospective multicohort study. Lancet Oncol. (2024) 25:1053–69. doi: 10.1016/S1470-2045(24)00278-X

PubMed Abstract | Crossref Full Text | Google Scholar

12. Lipsitch M, Tchetgen Tchetgen E, and Cohen T. Negative controls: a tool for detecting confounding and bias in observational studies. Epidemiology. (2010) 21:383–8. doi: 10.1097/EDE.0b013e3181d61eeb

PubMed Abstract | Crossref Full Text | Google Scholar

13. Robinson WH, Lepus CM, Wang Q, Raghu H, Mao R, Lindstrom TM, et al. Low-grade inflammation as a key mediator of the pathogenesis of osteoarthritis. Nat Rev Rheumatol. (2016) 12:580–92. doi: 10.1038/nrrheum.2016.136

PubMed Abstract | Crossref Full Text | Google Scholar

14. Mora JC, Przkora R, and Cruz-Almeida Y. Knee osteoarthritis: pathophysiology and current treatment modalities. J Pain Res. (2018) 11:2189–96. doi: 10.2147/JPR.S154002

PubMed Abstract | Crossref Full Text | Google Scholar

15. Tapper EB and Parikh ND. Diagnosis and management of cirrhosis and its complications: A review. JAMA. (2023) 329:1589–602. doi: 10.1001/jama.2023.5997

PubMed Abstract | Crossref Full Text | Google Scholar

16. Sun L, Yin H, Liu M, Xu G, Zhou X, Ge P, et al. Impaired albumin function: a novel potential indicator for liver function damage? Ann Med. (2019) 51:333–44. doi: 10.1080/07853890.2019.1693056

PubMed Abstract | Crossref Full Text | Google Scholar

17. Maiwall R, Kulkarni AV, Arab JP, and Piano S. Acute liver failure. Lancet. (2024) 404:789–802. doi: 10.1016/S0140-6736(24)00693-7

PubMed Abstract | Crossref Full Text | Google Scholar

18. Field KM and Dow C. Michael M. Part I: Liver function in oncology: biochemistry and beyond. Lancet Oncol. (2008) 9:1092–101. doi: 10.1016/S1470-2045(08)70279-1

PubMed Abstract | Crossref Full Text | Google Scholar

19. Fujita K, Yamasaki K, Morishita A, Shi T, Tani J, Nishiyama N, et al. Albumin platelet product as a novel score for liver fibrosis stage and prognosis. Sci Rep. (2021) 11:5345. doi: 10.1038/s41598-021-84719-3

PubMed Abstract | Crossref Full Text | Google Scholar

20. Iida H, Kaibori M, Matsui K, Ishizaki M, and Kon M. Ratio of mean platelet volume to platelet count is a potential surrogate marker predicting liver cirrhosis. World J Hepatol. (2018) 10:82–7. doi: 10.4254/wjh.v10.i1.82

PubMed Abstract | Crossref Full Text | Google Scholar

21. Losser MR and Payen D. Mechanisms of liver damage. Semin Liver Dis. (1996) 16:357–67. doi: 10.1055/s-2007-1007249

PubMed Abstract | Crossref Full Text | Google Scholar

22. Fife BT and Bluestone JA. Control of peripheral T-cell tolerance and autoimmunity via the CTLA-4 and PD-1 pathways. Immunol Rev. (2008) 224:166–82. doi: 10.1111/j.1600-065X.2008.00662.x

PubMed Abstract | Crossref Full Text | Google Scholar

23. Zheng C, Huang S, Lin M, Hong B, Ni R, Dai H, et al. Hepatotoxicity of immune checkpoint inhibitors: What is Currently Known. Hepatol Commun. (2023) 7:e0063. doi: 10.1097/HC9.0000000000000063

PubMed Abstract | Crossref Full Text | Google Scholar

24. Ripoll C, Groszmann R, Garcia-Tsao G, Grace N, Burroughs A, Planas R, et al. Hepatic venous pressure gradient predicts clinical decompensation in patients with compensated cirrhosis. Gastroenterology. (2007) 133:4818. doi: 10.1053/j.gastro.2007.05.024

PubMed Abstract | Crossref Full Text | Google Scholar

25. Reig M, Forner A, Rimola J, Ferrer-Fàbrega J, Burrel M, Garcia-Criado Á, et al. BCLC strategy for prognosis prediction and treatment recommendation: The 2022 update. J Hepatol. (2022) 76:68193. doi: 10.1016/j.jhep.2021.11.018

PubMed Abstract | Crossref Full Text | Google Scholar

26. D’Amico G, GarciaTsao G, and Pagliaro L. Natural history and prognostic indicators of survival in cirrhosis. J Hepatol. (2006) 44:21731. doi: 10.1016/j.jhep.2005.10.013

PubMed Abstract | Crossref Full Text | Google Scholar

27. Ito T, Shimose S, Tani J, Tomonari T, Saeki I, Takeuchi Y, et al. Antitumor effects and immune-mediated adverse events of durvalumab plus tremelimumab treatment for unresectable hepatocellular carcinoma. Hepatol Res. (2025) 55:577–87. doi: 10.1111/hepr.14151

PubMed Abstract | Crossref Full Text | Google Scholar

28. Spagnolo P, Chaudhuri N, Bernardinello N, Karampitsakos T, Sampsonas F, and Tzouvelekis A. Pulmonary adverse events following immune checkpoint inhibitors. Curr Opin Pulm Med. (2022) 28:391–8. doi: 10.1097/MCP.0000000000000895

PubMed Abstract | Crossref Full Text | Google Scholar

29. Zhou X, Xu Y, Ying Y, Chen R, Wang Z, and Lv X. Risk factors for checkpoint inhibitor pneumonitis in lung cancer patients treated with immune checkpoint inhibitors: a systematic review and meta-analysis. Front Immunol. (2025) 16:1607170. doi: 10.3389/fimmu.2025.1607170

PubMed Abstract | Crossref Full Text | Google Scholar

30. Sogbe M, Ruiz-Margáin A, and Duarte-Rojo A. Response: advancing the prognostic utility of free testosterone in advanced chronic liver disease-key methodological considerations. Liver Int. (2026) 46:e70495. doi: 10.1111/liv.70495

PubMed Abstract | Crossref Full Text | Google Scholar

31. Cowper DC, Kubal JD, Maynard C, and Hynes DM. A primer and comparative review of major US mortality databases. Ann Epidemiol. (2002) 12:4628. doi: 10.1016/s1047-2797(01)00285-x

PubMed Abstract | Crossref Full Text | Google Scholar

32. Brown JS, Kahn M, and Toh S. Data quality assessment for comparative effectiveness research in distributed data networks. Med Care. (2013) 51:S229. doi: 10.1097/MLR.0b013e31829b1e2c

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: atezolizumab, bevacizumab, durvalumab, hepatocellular carcinoma, immune-related adverse events, real-world evidence, target trial emulation, tremelimumab

Citation: Shiraishi C, Shigyou M, Inoue R, Ogura T, Kaneshige S and Hayashi T (2026) Comparative effectiveness of two first-line, ICI-based regimens for advanced HCC: a target trial emulation using an electronic medical record network. Front. Oncol. 16:1776032. doi: 10.3389/fonc.2026.1776032

Received: 26 December 2025; Accepted: 06 January 2026; Revised: 05 January 2026;
Published: 22 January 2026.

Edited by:

Si-Yuan Song, Baylor College of Medicine, United States

Reviewed by:

Xiaowei Wei, International Peace Maternity and Child Health Hospital, China
Yingzhe Yan, Sun Yat-sen University, China

Copyright © 2026 Shiraishi, Shigyou, Inoue, Ogura, Kaneshige and Hayashi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Chihiro Shiraishi, Yy5zaGlyYWlzaGkuZ2dAZnVrdW9rYS11LmFjLmpw

ORCID: Chihiro Shiraishi, orcid.org/0000-0002-1190-6540
Toru Ogura, orcid.org/0000-0002-6343-5349
Toshinobu Hayashi, orcid.org/0000-0001-6183-4939

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.