ORIGINAL RESEARCH article

Front. Nutr., 16 July 2025

Sec. Nutritional Epidemiology

Volume 12 - 2025 | https://doi.org/10.3389/fnut.2025.1582674

Comparison between compositional data analysis and principal component analysis for identifying dietary patterns associated with hyperuricemia

  • 1. Shanxi Bethune Hospital, Shanxi Academy of Medical Sciences, Third Hospital of Shanxi Medical University, Shanxi Province Clinical Research Center for Dermatologic and Immunologic Diseases (Rheumatic Diseases), Shanxi Province Clinical Theranostics Technology Innovation Center for Immunologic and Rheumatic Diseases, Taiyuan, China

  • 2. Shanxi Academy of Advanced Research and Innovation (SAARI), Taiyuan, China

  • 3. Laboratory of International Agro-Informatics, Department of Global Agricultural Sciences, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan

  • 4. Third Hospital of Shanxi Medical University, Shanxi Bethune Hospital, Shanxi Academy of Medical Sciences, Taiyuan, China

  • 5. School of Public Health, Shanxi Medical University, Taiyuan, China

  • 6. Second Clinical College, Shanxi University of Chinese Medicine, Jinzhong, China

  • 7. National Institute for Nutrition and Health, Chinese Center for Disease Control and Prevention, Beijing, China

Article metrics

View details

2k

Views

571

Downloads

Abstract

Background/objectives:

Dietary patterns play an important role in regulating serum uric acid (SUA) levels in the body. Recently, compositional data analysis (CoDA) has been proposed as an alternative technique in identifying dietary patterns. However, the relative advantages of CoDA, particularly in identifying dietary patterns associated with hyperuricemia have not been investigated. We evaluated and compared CoDA, including compositional principal component analysis (CPCA) and principal balances analysis (PBA), with the most commonly used principal component analysis (PCA) in determining dietary patterns associated with hyperuricemia.

Methods:

The 3 day 24-h dietary recall method was used to estimate dietary data from 3,954 study participants of the China Health and Nutrition Survey (CHNS). Dietary patterns were constructed using PCA, CPCA and PBA. These methods were compared based on the performance to identify plausible patterns associated with hyperuricemia.

Results:

PCA, CPCA and PBA all identified three dietary patterns, with a common “traditional southern Chinese” pattern high in rice and animal-based foods and low in wheat products and dairy. Only this pattern was positively associated with risk of hyperuricemia [PCA: OR (95%CI) = 1.29 (1.15–1.46); CPCA: OR (95%CI) = 1.25 (1.10–1.40); PBA: OR (95%CI) = 1.23 (1.09–1.38)].

Conclusion:

All three dietary patterns methods in our study identified that a “traditional southern Chinese” dietary pattern was associated with increased risk of hyperuricemia, suggesting a robust and consistent finding.

1 Introduction

Hyperuricemia, a metabolic condition characterized by excessive concentration of serum uric acid (SUA), poses a significant risk factor of the development of chronic diseases including gout, cardiovascular disease, and diabetes, etc. (1). Latest data shows that the prevalence of hyperuricemia in Chinese population has increased from 8.4% in 2010 to 13.6% in 2021 (2, 3), signaling the need for effective interventions to mitigate the rising rate of hyperuricemia. A growing body of research indicates that hyperuricemia can be effectively managed non-pharmacologically through dietary modifications such as avoiding alcohol consumption and adhering to a diet low in purine-rich foods (4–7). Additionally, plant-based foods, including vegetable, fruits, cereals, and spices, are rich in dietary polyphenols, which are shown to prevent hyperuricemia through inhibiting uric acid (UA) synthesis and enhancing UA secretion (8). Despite the potential benefits, studying human diets is largely complicated due to the impracticality of isolating diets into single foods or nutrients. Therefore, it is increasingly common in the field of nutrition epidemiology to investigate dietary patterns as a more holistic approach to capture day-to-day eating habits (9).

In most of the literature, statistical methods of deriving dietary patterns fall into three classic categories: a priori, a posteriori, and hybrid methods (9). The a priori approach, based on established knowledge and evidence, focuses on diet quality using various dietary indices, including Mediterranean scores and Healthy Eating Index, but tends to overlook the contributions of specific nutrients and the interactions between nutrients (10). In contrast, the a posteriori method is data-driven, extracting dietary patterns through statistical dimension-reduction techniques. The most widely used data-driven method is principal component analysis (PCA), which analyzes the correlation matrix of food variables and derives principal components (PC) or dietary patterns characterized by different foods (11). Hybrid methods combine the first two classes of methods to identify dietary patterns (9).

Over the past few years, a growing number of studies have emerged to use PCA as the primary method to investigate the impact of dietary patterns on hyperuricemia among Chinese adult populations, but the results have been inconsistent (12, 13). For example, while one cohort study found no association between a dietary pattern high in fruits, deep-fried foods, and sweets with hyperuricemia (14), another cohort study identified a similar pattern rich in sweet foods that was positively associated with hyperuricemia (1). Furthermore, when a cross-sectional study identified a plant-based dietary pattern inversely associated with hyperuricemia (13), another found no such association (15). A possible explanation for the discrepancies in these results is that PCA is not completely data-driven. When intending to qualitatively interpret the PC factors, the threshold value of foods’ factor loadings, coupled with the labeling of each dietary pattern, remains somewhat arbitrary (16). The inconsistencies in dietary patterns identified by PCA across studies suggest a potential weakness in using a single approach to derive patterns related to hyperuricemia.

Given that the amount of food intakes for an individual is relatively constant, the increase in the intake of some foods will lead to the decrease in the intake of other foods, vice versa. This implies the compositional nature of dietary intake, which can be adequately addressed by Compositional Data Analysis (CoDA) (9). CoDA is a novel class of dimension reduction methods that has not been widely utilized in health and dietary research (9). It encompasses a standard family of statistical methods computing log-ratio transformation of dietary data. These methods include compositional PC analysis (CPCA) and principal balances analysis (PBA), which serve as viable tools in estimating the relative importance of food variables within dietary patterns and facilitating the interpretation of the results (17).

Despite their promising application in dietary pattern research, limited studies have utilized these statistical methods (17). Moreover, the majority of studies have only chosen PCA as the single a posteriori approach to extract dietary patterns as hyperuricemia predictors, resulting in a lower reproducibility of the results (12). Therefore, the aim of this study is to (1) simultaneously use three statistical approaches – one traditional approach (i.e., PCA) and two novel approaches (i.e., PBA and CPCA) – to extract dietary patterns and explore shared food groups across these patterns; and (2) to investigate their associations with hyperuricemia in a cohort of Chinese population.

2 Materials and methods

2.1 Study design and population

This retrospective cohort study utilized data from the China Health and Nutrition Survey (CHNS), an ongoing large-scale, prospective cohort survey initiated in 1989 and continued in 1991, 1993, 1997, 2000, 2004, 2006, 2009, 2011, and 2015. Multi-stage random cluster sampling was employed to select the study participants from nine provinces and three autonomous cities with diverse demographic, geographical, economic development, and public resource characteristics. A comprehensive description of the CHNS is provided elsewhere (18). Given that the only hyperuricemia data available was in 2009, and the revised version of the China Food Composition Table (FCT) was initially used in 2004 to obtain nutrient values (19), we used data from the CHNS conducted in 2004, 2006, and 2009.

We included 14,086 participants who entered the study in either 2004 or 2006 and were followed up until 2009. We excluded 2,495 participants aged under 18 years and 4,853 who had missing hyperuricemia data. From the remaining participants, we further excluded pregnant women, those with diabetes, myocardial infarction, stroke, hypertension, and extreme total energy intake (≥ 8,000 and ≤ 800 kcal/day for men; ≥ 6,000 and ≤ 600 kcal/day for women) (16), and those who self-identified as vegetarians. Additionally, we excluded 912 participants with missing data on marital status, education, income, smoking, drinking, Body Mass Index (BMI), sleep, physical activity, and sedentary behavior (Supplementary Figure S1). Finally, a total sample of 3,954 participants were selected for the analysis.

2.2 Dietary assessment

To collect dietary data and assess individual diet, the CHNS employed a consecutive 3-day 24-h diet recall method. Literature has validated the use of the 24-h dietary recall method (20). Details of the dietary assessment has been described elsewhere (21). In summary, qualified interviewers requested that participants report the quantities and types of foods consumed on the preceding day for three consecutive days, which were randomly assigned from Monday to Sunday (21). We calculated the three-day average intake (grams per day) of foods for each participant in each survey round (2004, 2006, 2009) based on the FCT. We categorized 20 food groups according to their nutrient and culinary characteristics (see Supplementary Table S1).

2.3 Outcome measurement: SUA and hyperuricemia

The primary outcome of the study was hyperuricemia. In 2009, fasting blood samples (12 mL) were collected from participants. The SUA was measured using the enzymatic colorimetric method with a Hitachi 7,600 automated analyzer (Tokyo, Japan) and Randox reagents (Randox Laboratories Ltd., Crumlin, UK). Hyperuricemia was defined as SUA levels ≥ 416 μmol/L in men and ≥ 357 μmol/L in women (22).

2.4 Covariates

In line with the literature (22), the following confounders were included in the analysis due to their established association with hyperuricemia: age, sex, residence (urban and rural), region (southern and northern), marital status (married and others), education level (primary school and below, middle school, high school, college degree and above), income, hours of physical activity (PA), drinking, smoking, daily energy intake, BMI, sleep duration, and sedentary time. Smoking status was categorized as former smoker, current smoker, or non-smoker. Drinking status was classified into two categories: those who consumed beer/liquor in the previous year and those who did not. Anthropometric measurements including height and weight were taken using standard procedures, with participants wearing light clothing and no shoes. Body Mass Index (BMI) was calculated as weight in kilograms divided by height in meters squared (kg/m2), and was classified according to recommended cutoff points: underweight (<18.5), normal (18.5–23.9), overweight (24.0–27.9), and obese (≥28.0) (21). The total hours of PA and sedentary behavior were calculated based on the reported weekly hours of each activity type.

2.5 Statistical analysis

We obtained dietary patterns using three dimension reduction methods: PCA, PBA, and CPCA. Many of the 20 food groups included in the dietary analysis were consumed by only a small proportion of participants. For PCA, the food groups were classified into two categories: those consumed by <25% of participants were converted to binary variables (non-consumers and consumers), while those consumed by ≥25% of participants were categorized into three levels (non-consumers, consumers with intake ≤ median, and consumers with intake > median) (16). We used varimax orthogonal rotation to enhance the interpretability of factor correlations with food groups and retained factors with eigenvalues > 1.0 to maximize variance retention while reducing dimensionality. Each retained factor was interpreted based on factor loadings, reflecting correlations between food groups and the pattern. Patterns were labeled using food groups with absolute factor loadings≥0.3, indicating strong contributions.

CoDA uses isometric log-ratio transformations to generate new variables (dietary patterns) that represent log-ratios between compositional parts (17). To handle zero values from rarely consumed foods in the three-day 24-h dietary recalls, we applied zero imputation using a modified expectation–maximization algorithm with a lower detection limit (16). Consistent with CoDA principles, individual food group intakes were expressed as proportions of the total intake across all 20 groups. We applied PBA to generate principal balances (PBs) that quantify the relative contribution of specific food group subsets against another subset (16). Given the absence of a standardized criterion for PBs retention, we retained the same number of dietary patterns as derived from PCA. For the alternative CoDA approach, CPCA, we computed the log-ratio of each food group relative to the geometric mean of all 20 groups (17). Similar to PCA, each CPCA-derived PC involves all food groups, with absolute factor loadings≥0.3 indicating significant contributions to the pattern. The criterion for retaining PCs in CPCA mirrored that used in PCA.

Baseline characteristics were compared between participants with and without hyperuricemia, using chi-square tests for categorical variables and either Student’s t-tests or analysis of variance (ANOVA) for continuous variables, as appropriate. Tertiles for each dietary pattern were constructed to assess the associations between dietary patterns and the risk of hyperuricemia in three multivariate logistic regression models: Model 1 adjusted for age and sex; Model 2 additionally adjusted for marital status, residence, region, education, and income; Model 3 further adjusted for smoking, drinking, BMI, physical activity time, sedentary time, sleep duration, and daily energy intake.

All statistical analyses were performed using R version 4.0.3. The two-sided p < 0.05 was considered as statistical significance.

3 Results

3.1 Characteristics of study participants

After applying a series of exclusion criteria (see Supplementary Figure S1), a total of 3,954 participants were included in the cohort. Baseline characteristics of participants with and without hyperuricemia are summarized in Table 1. We documented 534 participants with hyperuricemia, with the overall prevalence of 13.5%. Compared to participants without hyperuricemia, those with hyperuricemia were more likely to be men, older, urban residents, southerners, highly educated, current smokers, current drinkers, and to have higher BMI, higher income, and shorter sleep duration.

Table 1

CharacteristicAll participants (n 3,954)Hyperuricemiap value
No (n 3,420)Yes (n 534)
Sex, n (%)
Men1827 (46.2)1,487 (43.5)340 (63.7)<0.001
Women2,127 (53.8)1933 (56.5)194 (36.3)
Age, mean (SD), yr45.7 (12.2)45.5 (12.1)47.2 (12.5)0.003
Residence, n (%)
Urban1,124 (28.4)930 (27.2)194 (36.3)<0.001
Rural2,830 (71.6)2,490 (72.8)340 (63.7)
Region, n (%)
Northern1,541 (39.0)1,380 (40.4)161 (30.2)<0.001
Southern2,413 (61.0)2040 (59.7)373 (69.9)
Marriage status, n (%)
Married3,590 (90.8)3,111 (91.0)479 (89.7)0.35
Unmarried364 (9.2)309 (9.0)55 (10.3)
Education, n (%)
Primary school and below1,665 (42.1)1,457 (42.6)208 (39.0)0.001
Middle school1,353 (34.2)1,190 (34.8)163 (30.5)
High school529 (13.4)437 (12.8)92 (17.2)
College degree and above407 (10.3)336 (9.8)71 (13.3)
Smoking status, n (%)
Non-smoker2,676 (67.7)2,379 (69.6)297 (55.6)<0.001
Former smoker93 (2.4)71 (2.1)22 (4.1)
Current smoker1,185 (30.0)970 (28.4)215 (40.3)
Drinking status, n (%)
Never2,642 (66.8)2,363 (69.1)279 (52.3)<0.001
Current1,312 (33.2)1,057 (30.9)255 (47.8)
BMI, n (%)
Underweight238 (6.0)223 (0.03)15 (2.8)<0.001
Normal2,448 (61.9)2,188 (64.0)260 (48.7)
Overweight1,051 (26.6)848 (24.8)203 (38.0)
Obese217 (5.5)161 (4.7)56 (10.5)
Income, mean (SD), CNY/yr10289.3 (13675.3)9958.5 (13332.7)12407.7 (15541.0)<0.001
Sleep duration, mean (SD), h/d8.1 (1.2)8.1 (1.2)8.0 (1.2)0.04
Physical activity time, mean (SD), h/d0.4 (1.8)0.4 (1.9)0.41 (1.3)0.97
Sedentary time, mean (SD), h/d4.9 (3.94)4.9 (4.0)4.91 (3.6)0.76
Energy, mean (SD), kcal/d2282.4 (660.0)2275.8 (657.5)2324.40 (674.9)0.11
Serum uric acid, mean (SD), μmol/L301.1 (102.2)273.8 (66.0)476.42 (118.3)<0.001

Baseline characteristics of the 3,954 study participants.

BMI, Body Mass Index; CNY/yr, Chinese yuan/year; SD, standard deviation.

3.2 Dietary patterns based on PCA, PBA, and CPCA

Factor loadings of the 20 food groups for the dietary patterns derived from PCA, PBA, and CPCA are shown in Table 2. PCA identified three distinct dietary patterns, which were determined by three PC factors with eigenvalues greater than 1.0. The first PC factor (“traditional southern Chinese”) had low intakes of wheat, milk, and other cereals, with high intakes of fresh vegetables, pork, aquatic products, and rice. The second PC factor (“western”) had high intakes of fast foods, sugary food, fruits, beverages, eggs, processed meat, and fungi and algae. The third PC factor (“plant-based”) was characterized by high intakes of tubers, legumes, and fresh vegetables, and low intakes of other livestock and organs. The three PCs explained 28.07% of the total variation.

Table 2

Food groupsPCAPBACPCA
Factor 1Factor 2Factor 3Factor 1Factor 2Factor 3Factor 1Factor 2Factor3
Wheat products−0.78−0.020.25−0.44−0.41−0.83−0.12−0.14
Dairy0.760.030.22−0.44−0.41−0.83−0.11−0.10
Tubers−0.080.000.53−0.440.82−0.22−0.09−0.56
Other cereals0.56−0.02−0.13−0.82−0.080.09−0.10
Fungi and algae0.070.45−0.19−0.04−0.020.51
Eggs0.040.330.240.41−0.02−0.29−0.14
Beverages0.020.31−0.04−0.020.920.05
Fruits0.020.66−0.010.41−0.00−0.240.21
Vegetables0.31−0.140.310.010.91−0.06
Fast Foods−0.040.560.060.070.08−0.12
Sweets−0.040.460.090.080.11−0.05
Other livestock meat0.060.14−0.430.100.02−0.08
Organs0.060.05−0.500.100.300.11
Processed meat0.040.32−0.170.120.39−0.11
Poultry0.160.290.420.330.15−0.010.62
Nuts−0.000.25−0.110.200.240.07
Rice0.63−0.270.060.330.330.320.04
Aquatic products0.350.110.010.330.36−0.130.41
Legumes0.100.170.480.37−0.30−0.40
Pork0.340.17−0.090.330.45−0.150.14
Variance explained (%)12.109.466.5116.849.148.8513.5111.516.98

Factor loadings of for patterns derived using principal component analysis (PCA), principal balances analysis (PBA), and compositional principal component analysis (CPCA).

Bold values indicate food groups that were significant contributors to the patterns (with factor loadings ≥ |0.30|).

PBA analysis also identified three dietary patterns. The first PB (“traditional southern Chinese”) showed high intakes of rice, poultry, aquatic products, and pork, with low intakes of wheat, tubers, and milk. The second PB (“Tuber-Based”) had high intakes of tubers and low intakes of milk and wheat. The third PB (“low cereal”) was characterized by high intakes of fruits and eggs and a low intake of other cereals. The three PBs accounted for 34.83% of the food intake variations.

Similar to the first PC and the first PB, the CPCA presents that the first PC (“traditional southern Chinese”) was characterized by high intakes of rice, aquatic product, legumes, and pork and low intakes of wheat and milk. The second PC (“mixed”) had high intakes of beverages, vegetables, organ, processed meat, and rice. The third PC (“seafood and poultry”) showed high intakes of fungi and algae, poultry, aquatic products, with low intakes of tubers and legumes. The total variance explained by the three PCs was 32%.

3.3 Dietary patterns and hyperuricemia

Table 3 displays the association between the risk of hyperuricemia with dietary patterns obtained from PCA, PBA and CPCA. For PCA, after adjusting for all covariates (in Model 3), the “traditional southern Chinese” pattern was positively associated with hyperuricemia (adjusted OR for T3 vs. T1, 1.68; 95% CI, 1.31–2.14; P trend < 0.001), while the “plant-based” pattern was inversely associated with hyperuricemia (adjusted OR for T3 vs. T1, 0.76; 95% CI, 0.60–0.96; P trend = 0.02). Similarly, the “traditional southern Chinese” pattern derived from PBA was positively associated with hyperuricemia (adjusted OR for T3 vs. T1, 1.43; 95% CI, 1.12–1.84; P trend = 0.001), while the other two patterns of PBA showed no association with hyperuricemia. The “traditional southern Chinese” pattern derived from CPCA was also positively associated with hyperuricemia (adjusted OR for T3 vs. T1, 1.57; 95% CI, 1.23–2.01; P trend < 0.001).

Table 3

ModelsT1T2T3P-trend
OR (95%CI)OR (95%CI)
Principal component analysis
PC1: “traditional southern Chinese” pattern
Model 1Ref.1.41 (1.11, 1.78)1.43 (1.13, 1.81)0.002
Model 2Ref.1.37 (1.08, 1.74)1.42 (1.12, 1.79)0.003
Model 3Ref.1.54 (1.21, 1.97)1.68 (1.31, 2.14)<0.001
PC2: “western” pattern
Model 1Ref.1.20 (0.95, 1.51)1.15 (0.91, 1.45)0.26
Model 2Ref.1.12 (0.89, 1.41)0.94 (0.73, 1.21)0.61
Model 3Ref.1.06 (0.83, 1.34)0.90 (0.69, 1.16)0.40
PC3: “plant-based” pattern
Model 1Ref.0.77 (0.62, 0.97)0.76 (0.60, 0.95)0.01
Model 2Ref.0.84 (0.67, 1.05)0.77 (0.61, 0.97)0.02
Model 3Ref.0.80 (0.63, 1.02)0.76 (0.60, 0.96)0.02
Principal balances
PB1: “traditional southern Chinese” pattern
Model 1Ref.1.11 (0.88, 1.40)1.37 (1.08, 1.74)0.005
Model 2Ref.1.17 (0.93, 1.48)1.29 (1.01, 1.64)0.02
Model 3Ref.1.30 (1.02, 1.65)1.43 (1.12, 1.84)0.001
PB2: “tuber-based” pattern
Model 1Ref.1.01 (0.81, 1.27)1.02 (0.81, 1.28)0.93
Model 2Ref.1.07 (0.85, 1.34)1.08 (0.86, 1.36)0.54
Model 3Ref.1.06 (0.84, 1.33)1.08 (0.85, 1.36)0.55
PB3: “low cereal” pattern
Model 1Ref.1.13 (0.89, 1.44)1.06 (0.84, 1.35)0.46
Model 2Ref.1.19 (0.93, 1.51)1.05 (0.83, 1.34)0.68
Model 3Ref.1.21 (0.94, 1.55)1.03 (0.81, 1.31)0.91
Compositional PCA
PC1: “traditional southern Chinese” pattern
Model 1Ref.0.97 (0.78, 1.20)1.48 (1.17, 1.88)0.002
Model 2Ref.0.99 (0.79, 1.23)1.40 (1.10, 1.78)0.01
Model 3Ref.1.09 (0.87, 1.36)1.57 (1.23, 2.01)< 0.001
PC2: “mixed” pattern
Model 1Ref.1.12 (0.89, 1.40)1.11 (0.88, 1.39)0.40
Model 2Ref.1.12 (0.89, 1.40)1.08 (0.86, 1.36)0.49
Model 3Ref.1.14 (0.90, 1.44)1.12 (0.89, 1.42)0.32
PC3: “seafood and poultry” pattern
Model 1Ref.0.88 (0.70, 1.10)1.13 (0.89, 1.43)0.49
Model 2Ref.0.93 (0.74, 1.17)1.24 (0.97, 1.57)0.12
Model 3Ref.0.94 (0.75, 1.18)1.26 (0.98, 1.60)0.10

Odds ratios (95% CI) for risk of hyperuricemia by tertile of each dietary pattern scores derived from principal component analysis (PCA), principal balances analysis (PBA), and compositional principal component analysis (CPCA).

Results were obtained from multivariate logistic regression models; Model 1: adjusted for age and sex; Model 2: additionally adjusted for marital status, region, residence, education, and income; Model 3, further adjusted for smoking, drinking, BMI, physical activity time, sedentary time, sleep duration, and daily energy intake. PC, principal component; PB, principal balance; OR, odds ratios; CI, confidence interval; T, tertile.

4 Discussion

This study applied three analytical methods—PCA and two data-driven CoDA methods (PBA and CPCA)—each identified three dietary patterns in a cohort of 3,954 Chinese adult participants. The three methods harmonized in identifying a “traditional southern Chinese” dietary pattern with shared attributes: lower wheat and dairy intake, higher rice, aquatic, and pork intake, and a positive association with hyperuricemia risk. The high comparability suggests that methods focusing on food composition can effectively capture dietary behavior. Consistent findings across methods strengthen our understanding that a high-animal-product, low-dairy-and-cereal-product dietary pattern may increase hyperuricemia risk among Chinese.

Evidence shows that traditional agricultural practices in southern China have led to a dominant rice-based dietary pattern (23). With the rapid aquaculture development (24) and economic growth especially in the southern coastal region in China, the traditional Chinese diet has transitioned towards a western-like diet with more animal-based foods including meats and seafood products (25). These products tend to have a high purine content, which elevates blood levels of uric acid, or urate (the end product of purine), making it one of the major risk factors for hyperuricemia (26). The adverse effect of animal meats, organs, and seafoods on the outcome of hyperuricemia as well as disorders like gout has been confirmed in various studies (27–30). Research supports our observation that hyperuricemia was prevalent among male participants with higher incomes living in economically developed urban southern Chinese regions, likely due to frequent purine-rich diet consumption (30, 31).

Literature suggests that plant-based foods like wheat and cereals are beneficial for lowering SUA and the risk of hyperuricemia because they contain minimal purines (26, 32). Not all animal-based foods increase the risk of hyperuricemia. Dairy products, like milk and cheese, are also linked to reduced uric acid levels (26, 28). A cohort study found a strong inverse relationship between dairy products and gout, suggesting dairy proteins and other nutrients in dairy products cause urate lowering (28). Increased consumption of urate-elevating foods (like meat and seafood) and decreased consumption of urate-lowering foods (like wheat and dairy) may explain the positive association between the “traditional southern Chinese” pattern and hyperuricemia risk in this study.

Another cohort study of Chinese adults identified a “traditional southern” dietary pattern using PCA, similar to ours. This pattern is characterized by low whole grain and cereal intake and high pork, fish, offal, and poultry intake. It was also positively associated with the risk of hyperuricemia in a full adjusted model (13). In contrast, we found a significant inverse association between the “plant-based” pattern and hyperuricemia, similar to a China-based study by Zhou et al., though the food group compositions differed (13). Our “plant-based” pattern emphasized legumes, tubers, and vegetables, while theirs included legumes, vegetables, and non-plant-based foods like eggs and dairy (13). Similarly, using PCA, Mao et al. identified a plant-based dietary pattern high in tuber and vegetables, which had a reverse association with the SUA levels (33). Moreover, a comprehensive meta-analysis study supported our findings, showing a positive association between animal-based diets and hyperuricemia, while plant-based diets showed an inverse relationship (34).

Previous studies defining dietary patterns using multiple methods yielded inconclusive results regarding their correlation with hyperuricemia risk (12). For instance, in the cohort study by Zhou et al., different methods like PCA derived incomparable dietary patterns among northern Chinese adults, with varying associations with hyperuricemia (13). Due to inconsistent findings in the literature, it’s challenging to definitively establish dietary patterns associated with hyperuricemia. This study identified a dietary pattern significantly linked to hyperuricemia, replicated across three methods: conventional PCA and two novel CoDA methods. It is important to note that the “traditional southern Chinese” dietary pattern identified in our study differs in composition from the commonly referenced “traditional Chinese” or “eastern” dietary pattern (12). While the traditional Chinese diet is predominantly plant-based and characterized by lower meat consumption (33), the “traditional southern Chinese” pattern observed in our analysis is distinguished by having higher intakes of animal-based foods and lower intakes of dairy and cereal products. Therefore, in daily life, one should consciously increase the intake of plant-based foods and dairy products and reduce the intake of animal-based foods.

From a statistical perspective, these three methods differ in their methodological approaches, each with its own strengths and limitations. PCA, a commonly used technique, constructs factors based on variance in food group intake to capture dietary patterns (35). However, it lacks a standard threshold value of factor loadings, introduces subjectivity in pattern naming, and cannot model food substitution effects (16). CoDA methods (PBA and CPCA), in contrast, model substitution by representing dietary patterns as balances between food groups (17). PBA simplifies pattern naming by focusing on a smaller number of food groups but also lacks standard retention rules, whereas CPCA, similar to PCA, presents all food groups but retains subjectivity in labeling. Despite these methodological differences, the results from all three approaches converged, consistently identifying an association between the “traditional southern Chinese” dietary pattern and hyperuricemia in the Chinese population. This convergence strengthens the robustness of our conclusion.

This study has several strengths. To the best of our knowledge, it is the first to use the recently developed CoDA methods to find dietary patterns linked to hyperuricemia in a Chinese population over time. The large sample size and careful control of covariates enhanced the findings’ reliability. Consistent identification of a similar dietary pattern associated with hyperuricemia across three methods demonstrated reproducibility. However, this study has several limitations. First, we were unable to account for unobserved potential confounders, including possible impacts of drug use (such as diuretics) or genetic factors (such as the SLC2A9 genotype) on UA (36), thereby establishing causality remains challenging, as is the case with all observational studies. Additionally, although dietary data were collected repeatedly using an extensive database, the three-day dietary records may not fully capture individuals’ habitual eating patterns. Third, the generalizability of the results to other study settings, including different population groups and health outcomes, is questionable due to the current study population’s limited scope to general Chinese adults. Finally, the findings may not capture shifts in dietary patterns since the data used in this analysis were limited to 2009, the most recent year for which biomarker data were available. Future studies with more up-to-date data are warranted to validate and extend these findings.

5 Conclusion

In conclusion, we extract dietary patterns through three data-driven methods and explored the relationships between the dietary patterns and the risk of hyperuricemia. All three methods – PCA, PBA, and CPCA – consistently identified one common dietary pattern that was positively associated with the risk of hyperuricemia: the “traditional southern Chinese” pattern, characterized by a high intake of rice and animal-based foods and a low intake of wheat products and dairy. On the other hand, the “plant-based” pattern may also be inversely associated with hyperuricemia, but further verification is still needed, while the other dietary patterns showed no association with hyperuricemia. The findings indicate that reducing animal-based food consumption and increasing the intake of wheat products and dairy, as “eastern” dietary pattern advocated, may be advantageous in preventing hyperuricemia. Future longitudinal studies conducted in other settings are necessary to validate our results and explore causal mechanisms.

Statements

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found at: http://www.cpc.unc.edu/projects/china.

Ethics statement

All procedures involving research participants were approved by the Institutional Review Committee of the University of North Carolina at Chapel Hill, the Chinese Institute of Nutrition and Food Safety, and the China Centers for Disease Control and Prevention (Approval Number: 201524). Informed consent was obtained from all subjects involved in the study.

Author contributions

JZ: Funding acquisition, Resources, Writing – original draft, Writing – review & editing. YajZ: Data curation, Resources, Writing – original draft. JH: Methodology, Visualization, Writing – review & editing. YiZ: Software, Writing – review & editing. SL: Conceptualization, Investigation, Writing – review & editing. ZL: Conceptualization, Supervision, Visualization, Writing – original draft. LZ: Funding acquisition, Project administration, Supervision, Writing – original draft, Writing – review & editing. YanZ: Investigation, Supervision, Validation, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This research was funded by the Science Foundation of Shanxi Health Commission (grant number: 2023127), the Scientific Research Foundation for Talent Introduction of Shanxi Bethune Hospital (grant number: 2023RC27), the Science and Technology Innovation Project of Higher Education of Shanxi Province (grant number: 2024 L100) and Research and Innovation Team Project for Scientific Breakthroughs at Shanxi Bethune Hospital (2024AOXIANG02).

Acknowledgments

We are grateful for research grant funding from the National Institute for Health (NIH), the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) for R01 HD30880, National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) for R01DK104371 and R01HL108427, the NIH Fogarty grant D43 TW009077 for financial support for the CHNS data collection and analysis files since 1989, and the China Japan Friendship Hospital, Ministry of Health for support for CHNS 2009, Chinese National Human Genome Center at Shanghai since 2009, and Beijing Municipal Center for Disease Prevention and Control since 2011. We thank the National Institute for Nutrition and Health, the China Center for Disease Control and Prevention, the Beijing Municipal Center for Disease Control and Prevention, and the Chinese National Human Genome Center in Shanghai.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnut.2025.1582674/full#supplementary-material

References

  • 1.

    ZhangTRayamajhiSMengGZhangQLiuLWuHet al. Dietary patterns and risk for hyperuricemia in the general population: Results from the TCLSIH cohort study. Nutrition. (2022) 93:111501. doi: 10.1016/j.nut.2021.111501

  • 2.

    LiuHZhangXMWangYLLiuBC. Prevalence of hyperuricemia among Chinese adults: a national cross-sectional survey using multistage, stratified sampling. J Nephrol. (2014) 27:6538. doi: 10.1007/s40620-014-0082-z

  • 3.

    FengTLiCZhengJXuYWangXLiYet al. Prevalence of and Risk Factors for Hyperuricemia in Urban Chinese Check-Up Population. Int J Endocrinol. (2024) 2024:8815603. doi: 10.1155/2024/8815603

  • 4.

    DuLZongYLiHWangQXieLYangBet al. Hyperuricemia and its related diseases: mechanisms and advances in therapy. Signal Transduct Target Ther. (2024) 9:212. doi: 10.1038/s41392-024-01916-y

  • 5.

    ZhangYNeogiTChenCChaissonCHunterDJChoiHK. Cherry consumption and decreased risk of recurrent gout attacks. Arthritis Rheum. (2012) 64:400411. doi: 10.1002/art.34677

  • 6.

    ParkKYKimHJAhnHSKimSHParkEJYimSYet al. Effects of coffee consumption on serum uric acid: systematic review and meta-analysis. Semin Arthritis Rheum. (2016) 45:5806. doi: 10.1016/j.semarthrit.2016.01.003

  • 7.

    YuKHSeeLCHuangYCYangCHSunJH. Dietary factors associated with hyperuricemia in adults. Semin Arthritis Rheum. (2008) 37:24350. doi: 10.1016/j.semarthrit.2007.04.007

  • 8.

    MehmoodAZhaoLWangCNadeemMRazaAAliNet al. Management of hyperuricemia through dietary polyphenols as a natural medicament: a comprehensive review. Crit Rev Food Sci Nutr. (2019) 59:143355. doi: 10.1080/10408398.2017.1412939

  • 9.

    ZhaoJLiZGaoQZhaoHChenSHuangLet al. A review of statistical methods for dietary pattern analysis. Nutr J. (2021) 20:37. doi: 10.1186/s12937-021-00692-7

  • 10.

    ArvanitiFPanagiotakosDB. Healthy indexes in public health practice and research: a review. Crit Rev Food Sci Nutr. (2008) 48:31727. doi: 10.1080/10408390701326268

  • 11.

    DiBelloJRKraftPMcGarveySTGoldbergRCamposHBaylinA. Comparison of 3 methods for identifying dietary patterns associated with risk of disease. Am J Epidemiol. (2008) 168:143343. doi: 10.1093/aje/kwn274

  • 12.

    WenZYWeiYFSunYHJiWP. Dietary pattern and risk of hyperuricemia: an updated systematic review and meta-analysis of observational studies. Front Nutr. (2024) 11:1218912. doi: 10.3389/fnut.2024.1218912

  • 13.

    ZhouMHuangXLiRZhangZZhangLGaoXet al. Association of dietary patterns with blood uric acid concentration and hyperuricemia in northern Chinese adults. Nutr J. (2022) 21:42. doi: 10.1186/s12937-022-00789-7

  • 14.

    ShiZ. Cadmium Intake, Dietary Patterns and Hyperuricemia Among Adults in China. Expo Health. (2020) 13:21927. doi: 10.1007/s12403-020-00375-4

  • 15.

    LiTLiSTianTNieZXuWLiuLet al. Association and interaction between dietary patterns and gene polymorphisms in Liangshan residents with hyperuricemia. Sci Rep. (2022) 12:1356. doi: 10.1038/s41598-021-04568-y

  • 16.

    ZhaoJGuoWWangJWangT. Exploring the association of dietary patterns with the risk of hypertension using principal balances analysis and principal component analysis. Public Health Nutr. (2023) 26:16070. doi: 10.1017/s136898002200091x

  • 17.

    SolansMCoendersGMarcos-GrageraRCastellóAGràcia-LavedanEBenaventeYet al. Compositional analysis of dietary patterns. Stat Methods Med Res. (2019) 28:283447. doi: 10.1177/0962280218790110

  • 18.

    ZhangBZhaiFYDuSFPopkinBM. The China Health and Nutrition Survey, 1989-2011. Obes Rev. (2014) 15:27. doi: 10.1111/obr.12119

  • 19.

    Institute of Nutrition and Food Safety. China Food Composition 2004. Beijing, China: Peking University Medical Press (2009).

  • 20.

    ZhaiFGuoXPopkinBMMaLKJG. Evaluation of the 24-hour individual recall method in China. Food Nutr Bull. (1996) 17:459. doi: 10.1177/156482659601700209

  • 21.

    ZhaoYGaoQZhangJWangJArakiTZhaoJ. Trajectories of plant-based diet indices and the associated risk of hypertension among Chinese adults: a cohort study based on the China Health and Nutrition Survey 2004-2015. Nutr J. (2024) 23:155. doi: 10.1186/s12937-024-01053-w

  • 22.

    YangYPiaoWHuangKFangHJuLZhaoLet al. Dietary pattern associated with the risk of hyperuricemia in Chinese elderly: result from China nutrition and health surveillance 2015-2017. Nutrients. (2022) 14:14(4). doi: 10.3390/nu14040844

  • 23.

    DengZHuangBZhangQZhangM. First farmers in the South China Coast: new evidence from the Gancaoling site of Guangdong Province. Front Earth Sci. (2022) 10:858492. doi: 10.3389/feart.2022.858492

  • 24.

    WangPJiJZhangY. Aquaculture extension system in China: development, challenges, and prospects. Aquac Rep. (2020) 17:100339. doi: 10.1016/j.aqrep.2020.100339

  • 25.

    HuangYTianX. Food accessibility, diversity of agricultural production and dietary pattern in rural China. Food Policy. (2019) 84:92102. doi: 10.1016/j.foodpol.2019.03.002

  • 26.

    KanekoKAoyagiYFukuuchiTInazawaKYamaokaN. Total purine and purine base content of common foodstuffs for facilitating nutritional therapy for gout and hyperuricemia. Biol Pharm Bull. (2014) 37:70921. doi: 10.1248/bpb.b13-00967

  • 27.

    VillegasRXiangYBElasyTXuWHCaiHCaiQet al. Purine-rich foods, protein intake, and the prevalence of hyperuricemia: the Shanghai Men's Health Study. Nutr Metab Cardiovasc Dis. (2012) 22:40916. doi: 10.1016/j.numecd.2010.07.012

  • 28.

    ChoiHKAtkinsonKKarlsonEWWillettWCurhanG. Purine-rich foods, dairy and protein intake, and the risk of gout in men. N Engl J Med. (2004) 350:1093103. doi: 10.1056/NEJMoa035700

  • 29.

    WuXTangWTangDHuYZhangNDaiSet al. Two a posteriori dietary patterns are associated with risks of hyperuricemia among adults in less-developed multiethnic regions in Southwest China. Nutr Res. (2023) 110:96107. doi: 10.1016/j.nutres.2022.12.012

  • 30.

    LiYShenZZhuBZhangHZhangXDingX. Demographic, regional and temporal trends of hyperuricemia epidemics in mainland China from 2000 to 2019: a systematic review and meta-analysis. Glob Health Action. (2021) 14:1874652. doi: 10.1080/16549716.2021.1874652

  • 31.

    PiaoWZhaoLYangYFangHJuLCaiSet al. The prevalence of hyperuricemia and its correlates among adults in China: results from CNHS 2015-2017. Nutrients. (2022) 14:4095. doi: 10.3390/nu14194095

  • 32.

    ZykovaSNStorhaugHMToftIChadbanSJJenssenTGWhiteSL. Cross-sectional analysis of nutrition and serum uric acid in two Caucasian cohorts: the AusDiab study and the Tromsø study. Nutr J. (2015) 14:49. doi: 10.1186/s12937-015-0032-1

  • 33.

    MaoDFengJZhouYLiH. Analysis of different plant- and animal-based dietary patterns and their relationship with serum uric acid levels in Chinese adults. Nutr J. (2023) 22:53. doi: 10.1186/s12937-023-00885-2

  • 34.

    ChengSShanLYouZXiaYZhaoYZhangHet al. Dietary patterns, uric acid levels, and hyperuricemia: a systematic review and meta-analysis. Food Funct. (2023) 14:785368. doi: 10.1039/d3fo02004e

  • 35.

    CunhaDBAlmeidaRMPereiraRA. A comparison of three statistical methods applied in the identification of eating patterns. Cad Saude Publica. (2010) 26:213848. doi: 10.1590/s0102-311x2010001100015

  • 36.

    VitartVRudanIHaywardCGrayNKFloydJPalmerCNAet al. SLC2A9 is a newly identified urate transporter influencing serum urate concentration, urate excretion and gout. Nat Genet. (2008) 40:43742. doi: 10.1038/ng.106

Summary

Keywords

dietary patterns, compositional data, hyperuricemia, principal component analysis, China health and nutrition survey

Citation

Zhao J, Zhao Y, Han J, Zhao Y, Liu S, Liu Z, Zhang L and Zhang Y (2025) Comparison between compositional data analysis and principal component analysis for identifying dietary patterns associated with hyperuricemia. Front. Nutr. 12:1582674. doi: 10.3389/fnut.2025.1582674

Received

26 February 2025

Accepted

02 July 2025

Published

16 July 2025

Volume

12 - 2025

Edited by

Xinping Li, Capital Medical University, China

Reviewed by

Elizabeth Nascimento, Federal University of Pernambuco, Brazil

Bin Yang, Shanxi Provincial Cancer Hospital, China

Dong Liang, China National Center for Food Safety Risk Assessment, China

Updates

Copyright

*Correspondence: Yan Zhang, Liyun Zhang, Zhida Liu,

†These authors have contributed equally to this work

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics