Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Nutr., 26 January 2026

Sec. Nutritional Epidemiology

Volume 12 - 2025 | https://doi.org/10.3389/fnut.2025.1748611

Plant-based dietary patterns, micronutrient status and breast cancer outcomes: a joint analysis of UK Biobank and Chinese longitudinal healthy longevity survey

  • 1Department of Breast and Thyroid Surgery, The Second Affiliated Hospital of Soochow University, Suzhou, China
  • 2Department of Clinical Laboratory, Zhongshan Second People's Hospital, Zhongshan, China
  • 3Department of Infectious Diseases and Public Health City University of Hong Kong, Hong Kong, Hong Kong SAR, China

Background: Plant-based diets may lower breast cancer risk, but their impact on breast cancer-related mortality is unclear. We explored associations of plant-based dietary patterns (Healthful Plant-Based Diet Index [HPDI/PDI]) and micronutrient intake with breast cancer incidence and all-cause mortality in patients.

Methods: Using data of UK Biobank (UKB; 67,045 cancer-free participants; 3,397 breast cancer patients) and Chinese Longitudinal Healthy Longevity Survey (CLHLS), we analyzed dietary scores and micronutrient intake via multivariate Cox regression, restricted cubic splines, and predictive models (concordance index, Random Forest, and time-dependent ROC).

Results: Among 67,045 breast cancer-free participants, the highest HPDI tertile was associated with 11% lower breast cancer risk (HR = 0.89, 95%CI: 0.82–0.98) vs. lowest tertile (4% reduction per SD increase, HR = 0.96, 95%CI: 0.93–1.00). Among 3,397 breast cancer patients, the highest HPDI tertile showed 28% lower mortality (HR = 0.72, 95%CI: 0.55–0.95) vs. lowest (11% reduction per SD, HR = 0.89, 95%CI: 0.79–1.00). Individuals with high PDI scores exhibited a 39% lower risk of cancer compared to those with low scores in CLHLS (HR = 0.61, 95%CI: 0.41–0.92). Higher intakes of vitamins B2 and C, calcium, and magnesium were inversely associated with risk and mortality, while each SD increase in sodium raised mortality risk by 15% (HR = 1.15, 95%CI: 1.01–1.32). Predictive models showed optimal 5-year performance overall; micronutrients alone best predicted breast cancer risk across timepoints, while HPDI peaked for 5-year mortality prediction (AUC = 0.625). The combined model achieved superior 10-year prognosis.

Conclusions: High adherence to a healthful plant-based diet, together with sufficient intake of key micronutrients and reduced sodium consumption, may contribute to breast cancer prevention and improved survival outcomes.

1 Introduction

Breast cancer remains one of the most common malignancies globally and a leading cause of cancer-related death among women. Despite advances in detection and treatment, its persistent incidence and mortalitypose a significant challenge to public health worldwide (1). Consequently, the exploration and identification of modifiable lifestyle factors, particularly dietary patterns, are crucial for both the primary prevention and secondary prognosis of breast cancer.

In recent years, Plant-Based Dietary Patterns have garnered increasing attention. These eating styles, such as the Healthful Plant-Based Diet Index (HPDI) and the Alternate Mediterranean Diet (AMED), emphasize higher consumption of fruits, vegetables, whole grains, and legumes, while limiting red and processed meats. These diets are believed to exert anti-inflammatory and anti-carcinogenic potential due to their richness in antioxidants and bioactive compounds. Numerous prospective cohort studies and meta-analyses have confirmed an inverse association between higher adherence to healthful plant-based diets and the risk of developing various cancers, including breast cancer (2, 3). Notably, a Chinese cohort study has reported a potential protective role of calcium intake against breast cancer, likely mediated by calcium's inherent antiproliferative effects on cancer cells (4, 5). Huang et al. (20) found that increased magnesium intake may reduce breast cancer risk through downregulating C-reactive protein, an established marker of systemic inflammation linked to cancer progression (6). However, evidence gaps remain. First, most research has focused on breast cancer incidence, while studies on the impact of plant-based dietary patterns on all-cause mortality in patients already diagnosed with breast cancer are limited and often inconsistent. Second, prior work largely evaluates the holistic effect of dietary patterns, lacking a systematic and comprehensive assessment of the independent contribution of individual key micronutrients (e.g., specific vitamins and minerals) to breast cancer risk and patient prognosis. Third, the predictive capacity of dietary factors for long-term outcomes remains underexplored; conventional regression models may not capture complex non-linear relationships or variable interactions that modern machine learning approaches can address (7).

Using data from two large prospective cohorts of UK Biobank (UKB) and Chinese Longitudinal Healthy Longevity Survey (CLHLS), we aimed to: (1) evaluate the associations of HPDI and AMED with breast cancer incidence and all-cause mortality in the UK and China; (2) examine the independent associations of key micronutrient intake (including specific vitamins and minerals) with these outcomes; (3) compare the long-term predictive performance of dietary patterns and key micronutrients, individually and jointly, utilizing multidimensional statistical approaches including Cox regression, Restricted Cubic Splines (RCS), and multiple machine learning models (such as Random Forest and time-dependent ROC analysis). These findings may inform precision nutrition strategies for breast cancer prevention and survivorship in the UK and China.

2 Methods

2.1 Study population and design

This study utilized data from the UKB, a large-scale biomedical resource encompassing the entire United Kingdom. The UKB collected genetic data, lifestyle information, biological samples, and health records from 500,000 participants. After excluding individuals who were male, had incomplete dietary information, missing outcome indicators, or incomplete covariate data, the analytical cohort comprised 67,045 participants without breast cancer and 3,397 patients with breast cancer (Supplementary Figure S1). All participants provided written informed consent. CLHLS is a community-based prospective cohort study conducted among elderly people in China. It collected various data related to population characteristics, lifestyle, health outcomes, and more. This study used food frequency information data measured in the 2008 wave as the baseline and followed up on disease outcomes in the 2011, 2014, and 2018 waves. Participants who meet the following criteria were included in the analysis: (1) complete data of cancer definition and without cancer in 2008, (2) complete data from simplified FFQ in 2008, (3) completion of at least one follow-up after the 2008 wave, and (4) complete data of covariates. Finally, a total of 7,431 eligible participants were included in the analysis, of which 114 participants developed cancer during follow-up (Supplementary Figure S2).

2.2 Definition of plant-based diets and micronutrients

Diet was assessed using the validated web-based Oxford WebQ, a 24-h dietary recall tool. A UKB sub-cohort completed this assessment on ≥1 of five occasions between April 2009 and June 2012. The Oxford WebQ has been validated against interviewer-administered 24-h recalls, yielding a mean Spearman correlation coefficient of 0.62 (range: 0.54–0.69) for macronutrients (8, 9). CLHLS used a simplified FFQ of 22 food group items to collect participants' dietary intake information. In the current study, a total of 16 food groups were used to evaluate plant-based dietary patterns.

Food consumption amounts were calculated by multiplying the reported quantity consumed by its assigned portion size. Nutrient intakes were derived by multiplying each food's consumption quantity by its nutrient content per portion (using McCance and Widdowson's The Composition of Foods and Supplements), then aggregating values across all food groups. For participants with multiple assessments, usual intake was estimated using average food and nutrient values.

2.2.1 AMED score

This study assessed adherence to the Mediterranean diet using the AMED score, which quantifies intake across nine key components based on sex-specific median cutoffs. Beneficial components (e.g., vegetables, fruits, whole grains) scored 1 point for intake above the sex-specific median and 0 for below. Conversely, potentially harmful components (e.g., monounsaturated-to-saturated fatty acid ratio, red/processed meats, poultry) used reverse scoring (below median = 1; above = 0). Total scores ranged from 0 to 9, with higher values indicating stronger adherence to Mediterranean diet principles (10).

2.2.2 HPDI

The HPDI score was calculated from 17 food groups (excluding vegetable oil, unavailable in UKB), with each group scored 1–5 based on intake quintiles. For plant-based foods (whole grains, fruits, vegetables, nuts, legumes, and tea/coffee), the highest quintile scored 5 and the lowest 1. Conversely, animal-based foods (animal fat, dairy, eggs, fish/seafood, meat, and other animal foods) followed reverse scoring (highest quintile = 1; lowest = 5). Other groups (refined grains, potatoes, sugary drinks, fruit juices, and sweets/desserts) scored 1 for the highest quintile. Total scores ranged from 17 to 85, with higher values indicating a healthier diet (11). In CLHLS, we used 16 food groups. For fruits and fresh vegetables, the intake frequency of “almost every day,” “quite often,” “occasionally,” or “rarely or never” corresponds to 5, 4, 2, and 1 points, respectively. For whole grains, refined grains, vegetable oils, and animal fats, the answer is recorded as a binary method (“whether as a staple food” and “whether as the main cooking oil”), corresponding to 5 or 1 point. Positive scores indicate that the higher the score, the higher the frequency of consumption. The plant-based food group of PDI received positive scores, while the animal based food group received reverse scores. Compared to PDI, HPDI received reverse scores for the unhealthy plant-based food group (refined grains, pickled vegetables, and sugar) (12). Total scores ranged from 16 to 80. In our analysis, we consider these two indices as categorical variables (measured in minimum 40% and maximum 60% of the population).

2.3 Definition of breast cancer and mortality

Cancer diagnoses were captured through linkage to national cancer and death registries. All outcomes were defined according to the World Health Organization's International Statistical Classification of Diseases (ICD-10), with breast cancer including C50 and D05. Person-years of follow-up were calculated from baseline assessment at recruitment to first registration of cancer, death, loss, or end of follow-up, whichever came first. Patients with breast cancer at baseline were either diagnosed with breast cancer prior to enrollment or self-reported. Person-years of follow-up for mortality outcome were calculated from baseline assessment at recruitment to death, loss, or end of follow-up, whichever came first (13). In CLHLS, this study used self-reported data from participants to obtain follow-up results for cancer.

2.4 Covariates

The study adjusted for the following covariates in the UKB cohorts: age (continuous), race (White, Asian/Asian British, Black/Black British, Chinese, Mixed, and Other), educational level (less than high school/high school or above), body mass index (BMI, continuous), total energy intake (continuous), Townsend deprivation index (tertiles:T1–T3), and lifestyle factors (smoking status: yes/no; drinking frequency: unknown/never/ < 1 time/week/1–7 times/week). For CLHLS, this study adjusted age (< 90, ≥90), gender (male, female), province (23 provinces and cities including Beijing, Tianjing, Hebei, and others), occupation (10 occupational statuses including professional and technical personnel and others), financial support (10 financial supports including retirement wages and others), educational level (classified by years of schooling: < 6, 7–8, 9–11, ≥12), drinking status (yes, no), physical activity (yes, no), history of CVD (yes, no), and SBP (continuous).

2.5 Statistical analysis

In the baseline characteristics, continuous variables were presented as mean ± standard deviation (SD), while categorical variables were described using counts (percentages). Group comparisons were performed using t-tests for continuous variables and χ2 tests for categorical variables. The study employed multidimensional statistical models to examine the associations between dietary factors and outcomes. Cox proportional hazards models were constructed with the first tertile of each dietary indicator as the reference. Two adjusted models were implemented: Model 1 adjusted for age, race and total energy intake; Model 2 additionally adjusted for BMI, Townsend deprivation index, education level, smoking status, drinking frequency. Results were reported as hazard ratios (HRs) with 95% confidence intervals (CIs). This study employed the RCS model to thoroughly investigate the dose-response relationship between dietary patterns and breast cancer and all-cause mortality. As a flexible non-parametric regression method, RCS excel in analyzing non-linear dose-response relationship between continuous exposure variables and outcomes, while effectively avoiding overfitting and multicollinearity (14). In this study, we used three knots at the 10th, 50th, and 90th percentile of the dietary patterns, with median dietary scores selected as reference values (15). To assess the statistical significance of non-linear association, likelihood ratio tests (LRT) were performed to calculate both non-linear P values and overall P values. RCS analyses were performed using the rms package in R. We compared the full RCS model (allowing non-linearity) with a linear nested model via likelihood ratio test (LRT), and the non-linear P value was derived from the LRT output of the anova() function in the rms package. A non-linear P value less than 0.05 was considered indicative of the non-linear relationship between exposure and outcome.

Based on the results from the Cox proportional hazards model, we focused on evaluating the predictive value of the HPDI and micronutrients (including Vitamin C, Calcium, Magnesium, and Copper for breast cancer; Vitamin B2, Calcium, Magnesium, Phosphorus, and Sodium for all-cause mortality). We constructed three distinct models: (1) the dietary model including HPDI; (2) the micronutrient model including micronutrients above; and (3) the combined model incorporating both HPDI and micronutrients. Similarly, to minimize potential confounding effects, all models were adjusted for age, ethnicity/race, total energy intake, Townsend deprivation index, drinking frequency, and educational level. The C-index of the corresponding Cox proportional hazards model were calculated to evaluate model fit (16). Additionally, we employed the Random Forest to predict the outcomes. Specifically, the dataset was randomly divided into training (70%) and testing (30%) sets at a 7:3 ratio, with 500 decision trees constructed. The predictive performance of the models was assessed by calculating the Area Under the ROC Curve (AUC). Finally, to evaluate the predictive capacity of the HPDI and micronutrients at different time points (3, 5, and 10 years) for breast cancer and all-cause mortality, time-dependent ROC analysis was conducted (17). ROC curves were plotted to visually compare the predictive efficacy across different models. Higher C-index and AUC values indicate superior predictive performance of the models.

All statistical analyses were conducted using R statistical software (version 4.5.0). The R packages utilized in the analyses included “plotRCS,” “randomForest,” “ggplot2,” “survival” and “pROC.” A two-sided P value < 0.05 was defined as statistically significant.

3 Results

3.1 Baseline characteristics of participants

Our analysis for UKB included 67,045 participants without breast cancer and 3,397 patients with breast cancer at baseline (Table 1). Compared to participants without breast cancer, patients with breast cancer were older (mean 58.5 vs. 55.4 years), higher proportion of White individuals (97.8 vs. 96.4%) and low-income (16.1 vs. 13.4%) and lower proportion of non-smoking (57.9 vs. 61.2%) and non-drinking (5.8 vs. 6.7%). Patients with breast cancer had higher HDPI (58.7 vs. 58.1) and AMED scores (4.4 vs. 4.3). Intake of vitamin A, B1, B9, B12, C, D, E, magnesium, iron, and copper were significantly different between the two groups (P < 0.05). For CLHLS, lower index groups were older (mean 83.9 vs. 81.0, 84.0 vs. 80.9; Supplementary Table S1). And there was significant difference between participants with higher index and participants with lower index in terms of province, financial support and years of schooling (P < 0.01).

Table 1
www.frontiersin.org

Table 1. Baseline characteristics of participants in the UK Biobank.

3.2 Associations of plant-based dietary patterns with incidence breast cancer and mortality

As illustrated in Figure 1, HPDI exhibited significant associations with incidence breast cancer and all-cause mortality. Compared to the lowest tertile, the highest tertile of HPDI was associated with an 11% lower risk of new breast cancer [HR (95%CI): 0.89 (0.82, 0.98)]. Additionally, each SD increase in HPDI was associated with a 4% lower risk of new breast cancer [0.96 (0.93, 1.00)]. Similarly, among breast cancer patients, each SD increase in HPDI was associated with an 11% lower risk of all-cause mortality [0.89 (0.79, 1.00)], and the highest tertile of HPDI was associated with a 28% lower risk of all-cause mortality compared to the lowest tertile [0.72 (0.55, 0.95)]. The results of Model 1 and Model 2 were generally consistent. Higher AMED scores were associated with a decreased risk of new breast cancer and all-cause mortality, but were not statistically significant. In CLHLS, high PDI and HPDI are associated with a lower incidence rate of cancer (Figure 2). RCS analysis revealed no significant non-linear relationship between continuous HPDI and breast cancer or all-cause mortality (P for non-linear = 0.993 and 0.242, Figure 3). Compared with the low score group, the high score group of HPDI was associated with a 32% reduction in the risk of cancer in Model 2 [HR (95%CI): 0.68 (0.45, 1.01)], but the results were not statistically significant (P = 0.056). However, in Model 2 of the PDI index, the high score group was associated with a 39% reduction in the risk of cancer, and the results were statistically significant (P = 0.017).

Figure 1
Two line graphs showing hazard ratios (HR) with 95% confidence intervals (CI) for breast cancer and all-cause mortality against HPDI values. The breast cancer graph displays a slight downward trend with overall p-value 0.048 and nonlinear p-value 0.993. The all-cause mortality graph shows a curve with overall p-value 0.033 and nonlinear p-value 0.242.

Figure 1. Association of plant-based diets and minerals with incidence breast cancer and mortality. Model 1: age (continuous), ethnicity/race (White, Asian or Asian British, Black or Black British, Chinese, Mixed, Other ethnic group), total energy intake (continuous). Model 2: Model 1+ BMI (continuous), educational level (less than high school, high school and above), Townsend deprivation index (T1, T2, T3), smoking status (Yes or No), drinking frequency (unknown, never, <1 time/week, 1–7 times/week). P-values less than 0.05 (P < 0.05) were considered significant. UKB, UK Biobank; BMI, body mass index; AMED, Alternate Mediterranean Diet; HPDI, Healthful Plant-Based Diet Index; HR, hazard ratio; CI, confidence interval; N, number.

Figure 2
Two ROC curve graphs for Random Forest models. The left graph shows breast cancer data with AUCs: HPDI 0.514, Micronutrients 0.512, Combined 0.507. The right graph shows all-cause mortality data with AUCs: HPDI 0.562, Micronutrients 0.552, Combined 0.581. Both graphs compare sensitivity versus 1-specificity.

Figure 2. Association of plant-based diets with incidence cancer. Model 1: age (<90, ≥90), gender (male, female). Model 2: Model 1+ province (23 provinces and cities including Beijing, Tianjing, Hebei, and others), occupation (10 occupational statuses including professional and technical personnel and others), financial support (10 financial supports including retirement wages and others), educational level (classified by years of schooling: <6, 7–8, 9–11, ≥12), drinking status (yes, no), physical activity (yes, no), history of CVD (yes, no) and SBP (continuous). P-values less than 0.05 (P < 0.05) were considered significant. PDI, Plant-Based Diet Index; HPDI, Healthful Plant-Based Diet Index; HR, hazard ratio; CI, confidence interval; N, number.

Figure 3
Forest plot showing hazard ratios (HR) and 95% confidence intervals (CI) for CLHLS cancer data. The analysis compares two indices, PDI and HPDI, using two models. Model 1 and Model 2 include data for groups 1 and 2 with event numbers. HR values with CIs are shown with reference and P values. Notably, in PDI model 2, group 2 displays an HR of 0.61 (0.41 to 0.92) with a significant P value of 0.017. Plotted points and error bars visually represent the data across the studied factors.

Figure 3. Restricted cubic spline plots of the association of HPDI with incidence breast cancer and mortality. Model 1: age (continuous), ethnicity/race (White, Asian or Asian British, Black or Black British, Chinese, Mixed, Other ethnic group), total energy intake (continuous). Model 2: Model 1+ BMI (continuous), educational level (less than high school, high school and above), Townsend deprivation index (T1, T2, T3), smoking status (Yes or No), drinking frequency (unknown, never, <1 time/week, 1–7 times/week). P-values less than 0.05 (P < 0.05) were considered significant. UKB, UK Biobank; BMI, body mass index; HPDI, Healthful Plant-Based Diet Index; HR, hazard ratio; CI, confidence interval; N, number.

3.3 Associations of micronutrients with incidence breast cancer and mortality

Figures 1, 4 illustrate the associations of mineral and vitamins intake with outcome. Compared to the lowest tertile, the highest tertile of calcium intake was associated with an 12% lower risk of new breast cancer [HR (95%CI): 0.88 (0.79, 0.98)]. Additionally, each SD increase in calcium intake was associated with a 5% lower risk of new breast cancer [0.95 (0.91, 1.00)]. Compared to the lowest tertile, the highest tertile of magnesium and copper intake was associated with an 11% [0.89 (0.79, 1.00)] and 12% [0.88 (0.79, 0.99)] lower risk of new breast cancer, respectively. Among breast cancer patients, each SD increase in calcium and phosphorus intake was associated with a 15% [0.85 (0.74, 0.98)] and 18% [0.82 (0.68, 0.98)] lower risk of all-cause mortality, respectively. In contrast, each SD increase in sodium intake was associated with a 15% increased risk of mortality [1.15 (1.01, 1.32)]. Furthermore, magnesium intake in the second tertile was associated with a 26% lower risk of all-cause mortality compared to the lowest tertile [0.74 (0.55, 0.99)].

Figure 4
Six ROC curves compare breast cancer and all-cause mortality at 3, 5, and 10 years. Each curve includes Combined, HPDI, and Micronutrients groups with corresponding AUC values. Breast cancer curves show AUCs around 0.549-0.650, while all-cause mortality curves show AUCs around 0.503-0.617. The sensitivity vs. 1-specificity plots assess predictive performance with lines close to the diagonal.

Figure 4. Association of vitamins with incidence breast cancer and mortality. Model 1: age (continuous), ethnicity/race (White, Asian or Asian British, Black or Black British, Chinese, Mixed, Other ethnic group), total energy intake (continuous). Model 2: Model 1+ BMI (continuous), educational level (less than high school, high school and above), Townsend deprivation index (T1, T2, T3), smoking status (Yes or No), drinking frequency (unknown, never, <1 time/week, 1–7 times/week). P-values less than 0.05 (P < 0.05) were considered significant. UKB, UK Biobank; BMI, body mass index; HR, hazard ratio; CI, confidence interval; N, number.

Compared to the lowest tertile, the highest tertile of vitamin C intake was associated with a significant 9% lower risk of new breast cancer [0.91 (0.83, 0.99)]. Among breast cancer patients, the highest tertile of vitamin B2 intake, compared to the lowest tertile, was associated with a 27% lower risk of all-cause mortality [0.73 (0.53, 0.99)].

3.4 Random forest and time-dependent ROC curves and C-index of HPDI and micronutrients for predicting incidence breast cancer and all-cause mortality

Based on the results of the previous COX regression modeling, we selected micronutrients that were significantly associated with outcomes separately. In multivariate models adjusted for other clinically relevant variables, the C-index for new breast cancer was 0.5408 for HPDI and 0.5405 for micronutrients, respectively; however, the C-index increased to 0.5416 when HPDI and micronutrients were included jointly. For all-cause mortality among breast cancer patients, the C-index was 0.6041 for HPDI and 0.6104 for micronutrients, respectively, increasing to 0.6123 upon their combined inclusion (Supplementary Table S1).

Figure 5 demonstrates the performance of HPDI, micronutrients, and their combination in predicting new breast cancer and all-cause mortality among breast cancer patients using the Random Forest model. For predicting new breast cancer, the AUC values were 0.514 for HPDI, 0.512 for micronutrients, and 0.507 for their combination. Among breast cancer patients, the AUC values for predicting all-cause mortality were 0.562 for HPDI, 0.552 for micronutrients, and 0.581 for the combined model.

Figure 5
Forest plots compare dietary patterns and mineral intake with breast cancer risk and all-cause mortality. Categories include HPDI, AMED, and various minerals. Hazard ratios, confidence intervals, and p-values are displayed for different models. Each section presents data for tertiles (T1, T2, T3) and per standard deviation (Per+SD). The plots illustrate the association strength for both conditions, stratified by dietary factors.

Figure 5. Random forest curves and random forest AUC values of HPDI and micronutrients for predicting incidence breast cancer and all-cause mortality. Micronutrients in breast cancer including vitamin C, calcium, copper and magnesium; Combined in breast cancer including HPDI, vitamin C, calcium, copper and magnesium; Micronutrients in all-cause mortality including vitamin B2, calcium, phosphorus, Sodium and magnesium; Combined in all-cause mortality including HPDI, vitamin B2, calcium, phosphorus, Sodium and magnesium; Model: age (continuous), ethnicity/race (UKB: White, Asian or Asian British, Black or Black British, Chinese, Mixed, Other ethnic group), total energy intake (continuous), educational level (less than high school, high school and above), Townsend deprivation index (T1, T2, T3), smoking status (Yes or No), drinking frequency (unknown, never, <1 time/week, 1–7 times/week), BMI (continuous). BMI, body mass index; HPDI, Healthful Plant-Based Diet Index; ROC, receiver operating characteristic; AUC, area under the ROC curve.

We developed time-dependent ROC modeling to evaluate the performance of HPDI, micronutrients individually, and their combination in predicting new breast cancer and mortality at 3, 5, and 10 years. The results indicated that predictive performance for both outcomes was generally optimal at the 5-year mark for all predictors (HPDI alone, micronutrients alone, and their combination). Specifically, micronutrient intake alone demonstrated the highest AUC for predicting new breast cancer across all three time points (3, 5, and 10 years). In contrast, HPDI alone achieved its best performance for predicting mortality among breast cancer patients at 5 years, with an AUC of 0.625 (Figure 6). Notably, the combination of HPDI and micronutrients yielded the highest AUC for predicting both new breast cancer and mortality at the 10-year time point.

Figure 6
Forest plot comparing the impact of various vitamins on breast cancer and all-cause mortality. Each vitamin is analyzed across multiple models and tiers with hazard ratios, confidence intervals, and P values. The plot includes metrics for Retinol, Alpha-carotene, Beta-carotene, Thiamin, Riboflavin, Niacin, Vitamin B6, Folate, Vitamin B12, and Vitamin C. Results show different associations for each vitamin, with references and significance levels noted. The analysis is divided into two main categories: breast cancer and all-cause mortality, facilitating comparison between the two health outcomes.

Figure 6. Time-dependent ROC curves and time-dependent AUC values of HPDI and micronutrients for predicting incidence breast cancer and all-cause mortality. Micronutrients in breast cancer including vitamin C, calcium, copper and magnesium; Combined in breast cancer including HPDI, vitamin C, calcium, copper and magnesium; Micronutrients in all-cause mortality including vitamin B2, calcium, phosphorus, Sodium and magnesium; Combined in all-cause mortality including HPDI, vitamin B2, calcium, phosphorus, Sodium and magnesium; Model: age (continuous), ethnicity/race (UKB: White, Asian or Asian British, Black or Black British, Chinese, Mixed, Other ethnic group), total energy intake (continuous), educational level (less than high school, high school and above), Townsend deprivation index (T1, T2, T3), smoking status (Yes or No), drinking frequency (unknown, never, <1 time/week, 1–7 times/week), BMI (continuous). BMI, body mass index; HPDI, Healthful Plant-Based Diet Index; ROC, receiver operating characteristic; AUC, area under the ROC curve.

4 Discussion

In this large prospective cohort of UKB participants, greater adherence to a HPDI was significantly associated with reduced breast cancer incident and improved overall survival among breast cancer patients. Specifically, women in the highest HPDI tertile experienced an 11% lower risk of developing breast cancer and a 28% lower risk of all-cause mortality after diagnosis, while selected micronutrients (including calcium, magnesium, copper, phosphorus, vitamin C, and vitamin B2) showed independent inverse associations with these outcomes, whereas sodium intake was positively associated with mortality. Although the predictive performance of HPDI, micronutrients, and their combination was modest, the incremental improvement observed when combining dietary pattern and nutrient data suggests that comprehensive dietary profiling may offer additional, albeit limited, predictive value for long-term breast cancer outcomes.

Our findings are consistent with prior cohort studies reporting plant-forward diets or dietary quality indices (e.g., HPDI, PDI) to reduced breast cancer risk. Alignment with limited evidence linking post-diagnosis dietary quality to survival benefits. Meta-analyses have shown 10%−15% reductions in incidence among women adhering to healthful plant-forward diets or vegetable-fruit-soybean patterns (18). Our observed 11% lower risk in the highest HPDI tertile aligns closely with these estimates. Similarly, our finding of a 28% lower mortality risk aligns with prior evidence suggesting improved survival among patients with higher dietary quality after diagnosis (19). We observed that the non-linear association between the HPDI and breast cancer incidence as well as all-cause mortality was not statistically significant, with a tendency toward a negative correlation. The non-significant trend observed for AMED may reflect the index's lower sensitivity in distinguishing plant- vs. animal-derived food quality within the UK dietary context. As prior work noted, dietary variables alone provide modest discriminatory power compared with molecular or genetic models (e.g., Molecular Classification of Breast Cancer), our predictive performance metrics (C-index and AUC) remained limited even when combining HPDI and micronutrients, reinforcing the need to integrate diet with other established risk factors for meaningful clinical prediction. In the analysis of individual nutrient effects, we found that vitamin C intake in the highest tertile was associated with a 9% lower risk of incident breast cancer compared with the lowest tertile, a relationship plausibly attributable to vitamin C's robust antioxidant properties. Meanwhile, vitamin B2 was shown to exert a significant protective effect on mortality risk among breast cancer patients, thereby providing a theoretical rationale for nutritional intervention strategies targeting this population.

A plant-based dietary pattern demonstrates favorable preventive effects and improved prognosis in female breast cancer. Consistent with this trend, our CLHLS study similarly found that higher PDI and HPDI levels were associated with a 39 and 32% reduction in overall cancer risk, respectively. Although data limitations prevented separate analysis of breast cancer cases, the findings still indicate that HPDI and PDI are linked to reduced overall cancer risk, providing supplementary evidence for the cancer-preventive potential of plant-based diets.

Our study benefits from the exceptionally large sample size and rich phenotypic data of the UKB, enabling robust estimates and simultaneous evaluation of overall dietary patterns and individual micronutrients. The prospective design, use of multivariable Cox models, restricted cubic splines, and machine learning approaches further strengthen the validity of our observations. However, the observational nature of the analysis precludes causal inference and residual confounding cannot be fully excluded. Due to inherent limitations in the availability of UK Biobank data, the database lacks systematic questionnaire data on Hormone replacement therapy, family history of breast cancer, and parity. Menopausal status could only be inferred from estradiol levels, which were available for only a subset of participants. Consequently, these breast cancer risk factors were not included in the model. Dietary intake was assessed only once at baseline, potentially underestimating temporal changes. Predictive performance remained modest despite combining HPDI and micronutrients; and the predominance of White participants limits generalizability to more diverse populations. Additionally, the constraints of self-reported data in the CLHLS preclude the specific identification of breast cancer, limiting our capacity to evaluate the distinct impacts of HPDI and PDI on breast cancer-related outcomes in the Chinese cohort.

In summary, greater adherence to a healthful plant-based diet and optimal intake of selected micronutrients were associated with lower breast cancer incidence and improved survival, although predictive performance was limited. These findings underscore the potential public health importance of promoting plant-forward dietary patterns and adequate micronutrient intake as part of comprehensive breast cancer prevention and survivorship strategies. Future prospective and interventional studies across diverse populations are warranted to clarify causality and refine dietary risk prediction models.

5 Conclusions

High adherence to healthful plant-based diets (e.g., HPDI/PDI) combined with adequate intake of key micronutrients like calcium and magnesium improves breast health, reduces breast cancer incidence, and enhances long-term survival in patients.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The UK Biobank received ethical approval from the North West Multicenter Research Ethics Committee (Approved Research ID: 89871; approval date: 20 September 2022). All participants gave written informed consent before enrolment in the study, which was conducted in accordance with the principles of the Declaration of Helsinki. We utilized publicly available longitudinal data from the Chinese Longitudinal Healthy Longevity Study (CLHLS). Initiated in 1998, the study received approval from the Research Ethics Committees at Duke and Peking Universities (IRB00001052-13074). As this study exclusively utilized publicly available, de-identified data from CLHLS and does not involve human subjects research, Institutional Review Board (IRB) approval was not required.

Author contributions

WX: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Software, Supervision, Validation, Writing – original draft, Writing – review & editing. WG: Investigation, Software, Writing – original draft. YH: Conceptualization, Data curation, Investigation, Methodology, Writing – original draft, Writing – review & editing. SL: Formal analysis, Investigation, Resources, Visualization, Writing – original draft. HL: Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. XZ: Conceptualization, Data curation, Funding acquisition, Resources, Supervision, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declared that financial support was not received for this work and/or its publication.

Acknowledgments

The authors thank the CLHLS and UK Biobank for contributing data and all participants involved in this study.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnut.2025.1748611/full#supplementary-material

References

1. Bray F, Laversanne M, Sung H, Ferlay J, Siegel RL, Soerjomataram I, et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2024) 74:229–63. doi: 10.3322/caac.21834

PubMed Abstract | Crossref Full Text | Google Scholar

2. Couser WG, Remuzzi G, Mendis S, Tonelli M. The contribution of chronic kidney disease to the global burden of major noncommunicable diseases. Kidney Int. (2011) 80:1258–70. doi: 10.1038/ki.2011.368

PubMed Abstract | Crossref Full Text | Google Scholar

3. Shah S, Laouali N, Mahamat-Saleh Y, Biessy C, Nicolas G, Rinaldi S, et al. Plant-based dietary patterns and breast cancer risk in the European prospective investigation into cancer and nutrition (EPIC) study. Eur J Epidemiol. (2025) 40:947–58. doi: 10.1007/s10654-025-01277-y

PubMed Abstract | Crossref Full Text | Google Scholar

4. Zhang CX, Ho SC, Fu JH, Cheng SZ, Chen YM, Lin FY. Dairy products, calcium intake, and breast cancer risk: a case-control study in China. Nutr Cancer. (2011) 63:12–20. doi: 10.1097/CEJ.0b013e32834572bb

PubMed Abstract | Crossref Full Text | Google Scholar

5. Ratajczak-Pawłowska AE, Jezierska K, Szymczak-Tomczak A, Zawada A, Rychter AM, Skoracka K, et al. Lifestyle and breast cancer: prevention and treatment support. Cancers. (2025) 17:2830. doi: 10.3390/cancers17172830

PubMed Abstract | Crossref Full Text | Google Scholar

6. Bezerra DLC, Mendes PMV, Melo SRS, Dos Santos LR, Santos RO, Vieira SC, et al. Hypomagnesemia and its relationship with oxidative stress markers in women with breast cancer. Biol Trace Elem Res. (2021) 199:4466–74. doi: 10.1007/s12011-021-02579-4

PubMed Abstract | Crossref Full Text | Google Scholar

7. Khong TMT, Bui TT, Kang H-Y, Park E, Ki M, Choi Y-J, et al. Cancer risk according to lifestyle risk score trajectories: a population-based cohort study. BJC Rep. 3:28. doi: 10.1038/s44276-025-00141-6

PubMed Abstract | Crossref Full Text | Google Scholar

8. Li K, Huang Y, Wang L, Yuan Y, Jiang X, Yang Y, et al. Association of four dietary patterns and stair climbing with major adverse cardiovascular events: a large population-based prospective cohort study. Nutrients. (2024) 16:3576. doi: 10.3390/nu16213576

PubMed Abstract | Crossref Full Text | Google Scholar

9. Liu B, Young H, Crowe FL, Benson VS, Spencer EA, Key TJ, et al. Development and evaluation of the Oxford WebQ, a low-cost, web-based method for assessment of previous 24 h dietary intakes in large-scale prospective studies. Public Health Nutr. (2011) 14:1998–2005. doi: 10.1017/S1368980011000942

PubMed Abstract | Crossref Full Text | Google Scholar

10. Fung TT, McCullough ML, Newby PK, Manson JE, Meigs JB, Rifai N, et al. Diet-quality scores and plasma concentrations of markers of inflammation and endothelial dysfunction. Am J Clin Nutr. (2005) 82:163–73. doi: 10.1093/ajcn/82.1.163

PubMed Abstract | Crossref Full Text | Google Scholar

11. Heianza Y, Zhou T, Sun D, Hu FB Qi L. Healthful plant-based dietary patterns, genetic risk of obesity, and cardiovascular risk in the UK biobank study. Clin Nutr. (2021) 40:4694–701. doi: 10.1016/j.clnu.2021.06.018

PubMed Abstract | Crossref Full Text | Google Scholar

12. Chen H, Shen J, Xuan J, Zhu A, Ji JS, Liu X, et al. Plant-based dietary patterns in relation to mortality among older adults in China. Nat Aging. (2022) 2:224–30. doi: 10.1038/s43587-022-00180-5

PubMed Abstract | Crossref Full Text | Google Scholar

13. Khan M, Papier K, Pirie KL, Key TJ, Atkins J, Travis RC. Sex differences in cancer incidence: prospective analyses in the UK Biobank. Br J Cancer. (2025) 133:216–26. doi: 10.1038/s41416-025-03028-y

PubMed Abstract | Crossref Full Text | Google Scholar

14. Xu Y, Han D, Xu F, Shen S, Zheng X, Wang H, et al. Using restricted cubic splines to study the duration of antibiotic use in the prognosis of ventilator-associated pneumonia. Front Pharmacol. (2022) 13:898630. doi: 10.3389/fphar.2022.898630

PubMed Abstract | Crossref Full Text | Google Scholar

15. Cui C, Qi Y, Song J, Shang X, Han T, Han N, et al. Comparison of triglyceride glucose index and modified triglyceride glucose indices in prediction of cardiovascular diseases in middle aged and older Chinese adults. Cardiovasc Diabetol. (2024) 23:185. doi: 10.1186/s12933-024-02278-z

PubMed Abstract | Crossref Full Text | Google Scholar

16. Tao L, Wu T, Du X, Li Q, Hao Y, Zhou T, Yi Y. Association of dietary inflammatory index on all-cause and cardiovascular mortality in US adults with metabolic dysfunction associated steatotic liver disease. Front Nutr. (2025) 12:1478165. doi: 10.3389/fnut.2025.1478165

Crossref Full Text | Google Scholar

17. Chu Q, Wu B, Zhang Z. Association of neutrophil to lymphocyte ratio with all-cause and cardiovascular mortality among individuals with kidney stone disease: result from NHANES, 2007-2018. Front Endocrinol. (2025) 16:1537403. doi: 10.3389/fendo.2025.1537403

PubMed Abstract | Crossref Full Text | Google Scholar

18. Zhang L, Huang S, Cao L, Ge M, Li Y, Shao J. Vegetable-fruit-soybean dietary pattern and breast cancer: a meta-analysis of observational studies. J Nutr Sci Vitaminol. (2019) 65:375–82. doi: 10.3177/jnsv.65.375

PubMed Abstract | Crossref Full Text | Google Scholar

19. Rigi S, Mousavi SM, Benisi-Kohansal S, Azadbakht L, Esmaillzadeh A. The association between plant-based dietary patterns and risk of breast cancer: a case-control study. Sci Rep. (2021) 11:3391. doi: 10.1038/s41598-021-82659-6

PubMed Abstract | Crossref Full Text | Google Scholar

20. Huang WQ, Long WQ, Mo XF, Zhang NQ, Luo H, Lin FY, et al. Direct and indirect associations between dietary magnesium intake and breast cancer risk. Sci Rep. (2019) 9:5764. doi: 10.1038/s41598-019-42282-y

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: breast cancer, CLHLS, machine learning, micronutrients, mortality, plant-based diets, UK Biobank

Citation: Xu W, Gu W, Huang Y, Li S, Liu H and Zhu X (2026) Plant-based dietary patterns, micronutrient status and breast cancer outcomes: a joint analysis of UK Biobank and Chinese longitudinal healthy longevity survey. Front. Nutr. 12:1748611. doi: 10.3389/fnut.2025.1748611

Received: 18 November 2025; Revised: 19 December 2025; Accepted: 24 December 2025;
Published: 26 January 2026.

Edited by:

Rosa Casas Rodriguez, August Pi i Sunyer Biomedical Research Institute (IDIBAPS), Spain

Reviewed by:

Wensheng Zhang, Xavier University of Louisiana, United States
Morvarid Noormohammadi, Iran University of Medical Sciences, Iran

Copyright © 2026 Xu, Gu, Huang, Li, Liu and Zhu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Honglin Liu, MTU2MjYwMTMwNTNAMTYzLmNvbQ==; Xun Zhu, emh1eHVuMTAyM0AxMjYuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.