- 1College of Animal Science and Veterinary Medicine, Shenyang Agricultural University, Shenyang, China
- 2College of Animal Science and Technology, China Agricultural University, Beijing, China
- 3Hebei Livestock Breeding Station, Shijiazhuang, China
- 4Ministry of Education Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing, China
- 5Guangdong Provincial Key Laboratory of Animal Molecular Design and Precise Breeding, School of Animal Science and Technology, Foshan University, Foshan, China
This study aimed to assess the impact of cows' age (in months) on production performance and evaluate the feasibility of using on-site data to inform precision agricultural practices on farms. Cows were randomly categorized by age in months, and the influence of lactation days was controlled using analysis of covariance (ANCOVA). Weighted linear models (WLM) was employed to control the effect of aggregating data. Principal component analysis (PCA) was applied to explore the relationships among production traits and to elucidate their underlying principles, providing insights for precision management in large-scale farms. The findings were as follows: (1) the age in months had no significant effect on dry matter intake (DMI), residual feed intake (RFI), or daily milk yield (MY; p > 0.05); (2) PCA scatter plots revealed no distinct separation between groups, supporting the ANCOVA result that production traits did not differ significantly across the groups. However, the centroid of Group A showed separation from other groups along the second principal component, which was predominantly influenced by age in months, indicating that group distribution in the loading diagram corresponded to the fundamental grouping principle. This study concluded that, at least within the age range of 38.78–53.83 months, cows' age in months did not significantly affect performance. The alignment with ANCOVA results suggests that on-site field data, when analyzed appropriately, can serve as a valuable reference for precision farm management practices.
1 Introduction
Milk contains various essential nutrients, including fat, protein, phosphopeptides, and vitamins (1), which play critical roles in human cells and enzyme functions (2). In recent years, many developed nations have shifted their focus from solely increasing cow numbers to enhancing the production performance and efficiency of dairy cows (3). Numerous factors influence dairy cow performance, such as genetic, environmental, and physiological variables (4, 5). Holstein cows are known for their high milk yield and extended peak lactation periods, making them a globally prevalent breed. As cows' physiological states change, significant variations in metabolic levels and production performance occur (6). For instance, first-parity cows, still in a developmental stage, gradually increase milk production, which then declines with age as the body weakens.
To the best of our knowledge, most of the nutritional literature have studied and reported based on data from small-scale scientific trials, and only a small part of the literature is based on large-scale production data from commercial farm and mostly focus on health issue (7). Proper analysis of these production data can offer valuable insights, enabling farm managers to advance precision farming practices and foster the growth of the dairy industry. While numerous studies have investigated the effects of cow parity on production performance (8–10), research on the impact of cows' age in months remains scarce, leaving a gap in specific management guidelines for dairy farms. Most published studies focus on the effect of calving month, such as its influence on lactation curve parameters (11), rather than on the age in months of the animals.
Furthermore, lactating cows typically reach their peak milk production around 50–60 days postpartum, after which production gradually declines, forming the lactation curve (12). In studies, failure to account for the effect of days in milk (DIM) can compromise the accuracy of milk production results, especially in nutritional research (13, 14).
However, limited studies have thoroughly examined the effect of DIM on production performance. Most experiments have controlled for this by selecting animals with similar lactation days, which becomes challenging when there is considerable variation in DIM across field groups. This issue can be addressed by incorporating lactation days as a covariate in the model (15). Therefore, it is crucial to refine the parity effect at the age in months level to better understand how age in months influences dairy cow production performance. This approach, utilizing DIM as a covariate, allows for a more precise analysis based on the farm's specific conditions, offering a theoretical foundation for farmers to understand both population-level and individual-level performance dynamics.
Principal component analysis (PCA) is a dimensionality reduction technique that generates independent composite indicators capable of explaining related variables. The method enables the identification of patterns in the data, which can be visualized in loading diagrams to highlight the relationships among observations and variables (16). PCA, in this context, can be used to examine the distribution and associations of individual cows and herds. Additionally, PCA results can complement the findings from analysis of covariance (ANCOVA). In this study, cows were randomly grouped according to their average age in months, and the effect of lactation days was controlled through ANCOVA to assess the impact of age in months on dry matter intake (DMI), residual feed intake (RFI), and daily milk yield (MY). Furthermore, PCA was employed to explore the relationships between production traits across different age in months groups based on field data from the farms.
The primary aim of this study was to investigate the influence of age in months on the production performance of lactating cows, particularly excluding the effect of DIM. Additionally, the study explored the interrelationships among productive traits while considering age in months, offering insights for enhancing dairy cow feeding management and supporting the feasibility of implementing precision farming practices in large-scale dairy operations.
2 Materials and methods
2.1 Animals and management
This experiment was conducted in accordance with the Guide for the Care and Use of Laboratory Animals, and received approval from the Animal Ethics and Welfare Committee at Shenyang Agricultural University (22030604). The data were collected from a dairy farm (123°E, 43°N) near Shenyang City in December 2021 to February 2022. The animals used were Chinese Holstein cows, with a total of 2,247 cows included in the data collection, encompassing cows with days in milk ranging from 1 to 305, as well as dry cows. Based on the farm management guideline, the cows were categorized into four groups: “High-yield,” “Mid-yield,” “Low-yield,” and “Dry-cows” and reared in the farm. These groups were housed in 14 zones, each with an average of 161 cows (with a standard deviation of 30 per zone). As the cows were reared in a zone unit, some of the data were observed at multiple animals level, such as the parity and the performance traits, thus some of the statistical data were presented at an average of a sub-group level but not in specific to individual level (e.g., Table 1). The cows were fed a same total mixed ration (TMR), with feeding occurring three times daily and free access to water provided. Milking took place three times a day: morning, midday, and evening. To ensure a relatively balanced sample (i.e., sub-groups) size across groups while capturing potential effects of age on the outcomes, the entire population data was assigned into three groups based on their age in months: 25–44 months (Group A), 45–51 months (Group B), and 52 months and older (Group C). In summary, in the absence of established cut-off points in the literature, the grouping was guided by the distribution of age values as well as the sub-group number in the dataset.
2.2 Data collection
During the research period, data were collected regularly according to the farm's routine procedures, including daily recording of feed intake and residual feed, as well as TMR sample collection for dry matter intake calculation. Due to farm conditions, individual cow feed intake could not be precisely measured, so the average feed intake per zone within each experimental group was used for analysis. Milk production and lactation days were recorded daily using the farm management software associated with the milking equipment. Due to farm constraints, the average weekly milk yield per zone was used for analyzing milk yield performance in this study.
2.3 Data analysis
After the experimental data were recorded in Excel (2010 version), data cleaning was performed using a threshold (μ ± 3*SD) to exclude outliers from a statistical perspective for further analysis (17). Kurtosis and skewness were used to assess the normality of the data. A greater kurtosis value indicates steeper data distribution, while a lower value suggests a flatter distribution. Similarly, a higher absolute value of skewness signifies greater data skewness. Data were considered normally distributed if the absolute values of both kurtosis and skewness were less than 2 (18). According to the kurtosis and skewness results presented in Table 1, all data were found to follow a normal distribution.
Due to the large diversity of days in milk in each treatment group and the large gap between groups, when analyzing the effect of the age in months on feed intake (DMI), residual feed intake (RFI) and daily milk yield (MY), it might be affected by DIM. To address this, the analysis of covariance (ANCOVA) method was employed to eliminate the influence of DIM, allowing for a more accurate evaluation of the effects of age in months on DMI, RFI, and MY, thereby improving the quality of the analysis. Since age in months provides a more specific measure of parity in production, and a positive correlation between parity and age in months was observed in this study (r = 0.71, p < 0.05), the average parity in each group was approximately 2. Therefore, the effect of parity was not considered. However, since the lactation and feed intake curves are non-linear from 50 to 60 days postpartum, which violated the model's assumption of linearity, records with a DIM of less than 50 days were excluded to improve the accuracy of the analysis. Additionally, one record from each group was excluded from the MY analysis due to its occurrence during the dry milk period.
Descriptive statistics and correlation calculations were performed using the base package of R 4.1.3 software. ANCOVA was conducted using the rstatix package, with DIM as a covariate, age in months as the independent variable, and DMI, RFI, or MY as dependent variables. Factors such as management levels were included as random errors. Traits exhibiting significant differences were reported as least squares means after accounting for the effects of covariates. Prior to conducting the ANCOVA, tests for homogeneity of variance, linearity between covariates and dependent variables, and a slope consistency hypothesis test among groups were performed. A significance level of 0.05 was set for all statistical tests. The model used in the ANCOVA analyses was as follows:
where yij represents the jth observation of the dependent variable for the ith group; μ is the overall mean; Ti denotes the treatment effect for the ith group; Xij is the corresponding covariate; β is the slope of the linear regression of the covariate corresponding to the dependent variable; and εij is the random error. To further validate the results from the ANCOVA analysis, the individual production indicators discussed earlier were used as input variables. After standardization, PCA was conducted. A scree plot was employed to determine the number of principal components to retain, using a threshold value of 1 (19). Graphs depicting the distribution of variables, groups, and individuals were created to observe how individuals were distributed across different month-age groups. Specifically, individual samples were projected onto the principal component space, with colors indicating group membership. For each group, a centroid was calculated as the arithmetic mean of the individuals' coordinates on the first two principal components, representing the group's central tendency in the first two dimensions. These centroids were plotted as group-level mean points. To visualize within-group variability and uncertainty, 95% confidence ellipses were drawn around each group, assuming multivariate normality. In brief, each ellipse was constructed using the empirical covariance matrix of the group's individuals and the chi-square distribution, forming a boundary expected to enclose approximately 95% of future observations. The FactoMineR and factoextra packages of R 4.1.3 software were utilized for conducting the PCA and visualizing the result.
In order to control the risk of ecological fallacy induced by data aggregation, the weighted linear model (WLM) was employed (20–22). In brief, the WLM could account for unequal group sizes, using group-level means as outcomes and group-specific sample sizes as weights in the model. This way of doing could improve estimation precision and is suitable when individual-level data is unavailable. Moreover, by incorporating covariates and weighting by group sizes, WLM helps mitigate ecological bias arising from the use of aggregated data.
3 Results
After cleaning the data using the criterion μ ± 3 * SD, no records were excluded. Therefore, the final analysis was based on 13 average data points, representing the average data for each of the 14 rearing zones (excluding one data point with DIM of less than 50 days). The results of the skewness and kurtosis analyses, presented in Table 1, indicated that the data followed a normal distribution. As presented in Table 1, the mean number of cows in each group were 143 (averaged from five sub-groups), 177 (averaged from five sub-groups), and 163 (averaged from four sub-groups), with standard deviations ranging from 10 to 46, respectively. The mean DIM for the groups were 124.00, 195.80, and 180.50 days, with standard deviations ranging between 84.29 and 137.94. The mean daily milk yield ranged between 19.75 and 24.38 kg, with standard deviations between 7.96 and 16.29 kg. Mean DMI was 17.42, 17.48, and 16.05 kg, with standard deviations ranging from 3.81 to 7.30 kg. Mean age in months was 38.78, 48.50, and 53.83 months for the groups, with standard deviations ranging from 0.90 to 8.83 months. Mean RFI was −2.35, −1.79, and −1.49 kg for the groups, with standard deviations of 0.44, 0.56, and 1.37 kg. The kurtosis and skewness values for each trait ranged from −1.41 to 0.54 and −1.50 to 1.84, respectively. The Pearson correlation information is given in Figure 1. As observed, all the p values of the correlation were higher than 0.05, except for the correlation coefficient between milk yield and DMI. Age in months had positive correlation coefficient with other productive traits but not significant (Figure 1). In specific, the correlation coefficient value ranged from 0.15 to 0.46.
Figure 1. The Pearson correlation among milk yield, dry matter intake (DMI), age in months (age), days in milk (DIM), and residual feed intake (RFI). (Color in red represents positive correlation, blue indicates negative correlation; the number in ellipses means p value (ellipses without number inside denotes p value less than 0.05).
3.1 Effect of age in months of cows on feed intake
The variance homogeneity test yielded a p-value of 0.52, and the regression residual normality test yielded a p-value of 0.09. The parallelism of the regression slopes across groups was nearly identical, with the slopes of DMI and DIM ranging from −0.08 to −0.01 and an R2 value of up to 0.59 (Figure 2a), meeting the fundamental requirements for the analysis of covariance.
Figure 2. The linear relationship between the covariate days in milk (DIM) and each dependent variable: (a) dry matter intake (DMI), (b) residual feed intake (RFI), (c) milk yield (MY), and (d) the principal component analysis scree plot. (Color in red represents group A, green indicates group B, and blue means group C; the dash red line in plot d means the threshold at 1.0 for the scree plot).
ANCOVA was used to assess the effect of age in months on DMI, yielding a p-value of 0.36 (Table 2). This indicated no significant difference in DMI between the different age in months groups after correcting for the covariate DIM. The least squares means of DMI for each group are presented in Table 2. From these results, it can be concluded that the difference in DMI between groups A, B, and C ranged from 1.30 to 6.30 kg, but the differences were not statistically significant (p > 0.05), suggesting that age in months did not have a significant effect on the DMI of lactating cows.
Table 2. Least squares means of production performance of each age in months group estimated by different models.
3.2 Effect of cows' age in months on RFI
Consistent with the results on the effect of age in months on feed intake, data among groups supported the hypothesis of variance homogeneity (p > 0.05), and the residuals from the regression model followed a normal distribution (p > 0.05). Additionally, the slopes of the regression lines between groups were parallel, with RFI and DIM slopes ranging from −0.02 to 0.004, close to zero, and the highest R2 value reaching 0.93 (Figure 2b), meeting the fundamental assumptions for covariance analysis.
As shown in Table 2, the least squares means of RFI for the groups were −2.23, −1.84, and −1.54 kg, with a maximum inter-group difference of 0.69 kg. ANCOVA yielded a p-value of 0.87 for the effect of age in months on RFI (Table 2), indicating no significant difference in RFI between the age in months groups.
3.3 Effect of age in months of cows on daily milk yield
The effect of age in months on daily milk yield was also assessed. The p-values from the variance homogeneity test and the regression residual normality test were 0.14 and 0.66, respectively, both greater than 0.05. Similar to the previous two traits, the parallelism of the slopes of the regression linearity groups of the DIM on daily milk yield was basically the same (the slopes of the milk yield vs. the DIM were spread out between −0.06 and 0.02, which was close to 0, The highest R2 was 0.83, as shown in Figure 2c), which also met the basic requirements of the analysis of covariance.
As presented in Table 2, the least squares means of daily milk yield across the groups ranged from 21.20 to 29.00 kg, with a maximum difference of 7.80 kg between groups. However, after performing ANCOVA, no significant effect of age in months on milk yield was observed (p > 0.05), once the influence of DIM was excluded.
3.4 Effect size and 95% confidence intervals of ANCOVA and WLM results
The pairwise effect sizes of each trait as well the corresponding 95% confidence intervals are shown in Figure 3 (p > 0.05). As observed, the effect size ranged between −0.01 for DMI and 1.36 for RFI in the ANCOVA (Figures 3A–C). However, the results were not significant due to the 95% confidence intervals covered 0. Similarly, the effect size (Figures 3D–F) of all traits resulted from WLM analysis were between 0.02 for DMI and 0.08 for milk yield. The corresponding 95% confidence intervals also included 0.
Figure 3. Effect size and 95% confidence intervals of analysis of covariance and weighted linear model results (Color in blue represents analysis of covariance, red indicates weighted linear model; (A) and (D) denotes dry matter intake; (B) and (E) indicating residual feed intake; (C) and (F) means milk yield).
3.5 Results of principal component analysis
The principal component extraction details from the PCA for each variable are presented in Table 3. The scree plot (Figure 2d) was used to determine the number of principal components to retain, selecting those with eigenvalues greater than 1. Based on the information in Table 3 and Figure 2d, two principal components were extracted, explaining 55.03 and 30.40% of the total variance, respectively. Thus, the cumulative variance explained by these two components was 85.43%.
Figure 4 illustrates the relationships between the variables, with Dim1 and Dim2 representing the first and second principal components, accounting for 55.03 and 30.40% of the variance, respectively. Dim1 primarily captures the effects of daily milk yield (Milk yield), DMI, days in lactation (DIM), and partial RFI, while Dim2 primarily represents age in months (Age), Milk yield, DMI, and RFI.
Figure 4. Loading plot of principal component analysis (PCA). (The length and color of each arrow denotes the contribution of the trait to each dimension; color in red means higher contribution and purple denote lower contribution).
The angles between Milk yield and DMI, as well as between RFI and DIM, are both less than 90°. In contrast, the angles between milk yield or DMI with RFI or DIM are greater than 90°, and the angles between these four variables and age in months are less than 90°. Smaller angles between variable vectors indicate stronger correlations between the variables, while angles approaching 90° suggest weaker correlations. The longer the vector, the greater the loading of that variable in the corresponding principal component direction. The individual scatter plot in two-dimensional space (Figure 5) further demonstrates the ability of PCA to distinguish and display the characteristics of different groups. The distribution of groups across different age in months shows notable separation, with clusters forming within the data. Specifically, the larger dots in each group represent the group centroids, and it is evident that the centroid of group B is closer to that of group C, while being slightly farther from the centroid of group A.
Figure 5. Individual and group distribution plot based on the principal component analysis (PCA) results (the ellipses around each group represent the 95% confidence).
4 Discussion
To further validate the findings derived from ANCOVA, we applied a weighted linear model (WLM) to account for potential ecological bias induced by aggregated data and group-level sample size variability. The estimated least squares means obtained from the WLM were numerically consistent with those from the ANCOVA, and the between-group differences remained statistically non-significant. These results suggest that the absence of significant group effects observed in the initial ANCOVA analysis, despite not explicitly addressing ecological fallacy, was not substantially biased. Therefore, the agreement between the two modeling approaches supports the robustness of the current findings and provides a sound basis for subsequent interpretation and discussion.
4.1 Effect of age in months on cow feed intake
After data cleaning, no records were excluded, indicating that the data quality from this farm was acceptable and aligned with the management standards typically observed in dairy cow farms in China. The daily milk yield for each group, with DIM ranging from 124.00 to 195.80 days, varied between 19.75 and 24.38 kg, falling within the reasonable performance range for Holstein cows as recommended by the international ICAR standards (23). The DMI ranged from 16.05 to 17.42 kg/d, which was slightly lower than the 20.87 kg/d observed in Holstein cows with an average parity of 2.3, as reported by De Boever et al. (24). This difference may be attributed to the lower productivity and higher intra-group variability at the farm used in this study. However, despite the considerable variability in daily milk production and DMI, the feeding efficiency (i.e., the ratio of milk production to DMI) of the animals in this study ranged from 1.23 to 1.40, which aligns with the recommended range of 1.20–1.80 (25), indicating that the animal population was within a positive production level.
As presented in Table 2, after correcting for the effect of DIM, the DMI values for each group changed. The DMI of Group A decreased from 17.42 kg/d before correction to 13.10 kg/d, while the DMI for the other two groups increased slightly by approximately 2 kg. This suggests that DIM had a more pronounced effect on DMI. After adjusting for DIM, although trends in DMI between groups were observed, such as 6.30 kg higher intake in Group B compared to Group A and 5.00 kg higher intake in Group C compared to Group A, there was no statistically significant effect of month age on the DMI of dairy cows (p > 0.05). The corrected feed intake showed a trend of increasing and then decreasing, although not statistically significant. This trend is consistent with that described by Reshalaitihan et al. (26), where changes in feed intake were related to parity, even though the specific focus on age in months was not addressed. This provides a reference for the present study. After 16 months of age, Holstein cows reach body maturity, which may explain the lack of statistical difference in DMI after correcting for DIM. This is supported by the mean ages of 38.78, 48.50, and 53.83 months for the respective groups, suggesting that animals in each group had already reached maturity. Furthermore, the lack of significant differences in performance (e.g., daily milk yield) further explains the absence of a statistically significant difference in DMI.
4.2 Effect of age in months of cows on residual feed intake
As shown in Table 2, no significant differences (p > 0.05) in RFI were observed between the different age groups, indicating that age in months had no significant impact on RFI after accounting for the effect of DIM. RFI, defined as the difference between actual feed intake and predicted feed intake based on expected production performance, is widely regarded as an indicator of feed efficiency in animals (27). A lower (typically negative) RFI value indicates greater productivity (28). After correcting for DIM, the RFI values remained relatively unchanged, with the smallest value in Group A and the largest in Group C, but no statistical differences were found between groups (p > 0.05).
However, there was a tendency, albeit not significant, for RFI to increase with age in months, which aligns with the trend observed by Grandl (29) in their study involving groups with considerable age variation (6–120 months). One potential reason for the lack of statistical significance in this study could be the limited sample size and the diversity of age in months within groups. Increasing the data size in future studies would enhance statistical power, providing results that more closely reflect real-world conditions.
4.3 Effect of age in months of cows on daily milk production
Table 2 also reveals that, after correcting for the effect of the covariate DIM, the daily milk yield of the different age in months groups was 29.00, 25.30, and 21.20 kg, respectively. These values changed by 4.62, 1.53, and 1.45 kg, compared to pre-correction values. Numerically, the daily milk yields of Groups A and B, corrected for DIM, were higher than that of Group C, with differences of 7.8 and 4.1 kg, respectively. However, similar to DMI and RFI, no significant effect of age in months on daily milk yield was observed (p > 0.05). Given that cows' physiological development varies with age, milk yield typically increases as body functions develop and reach their peak, after which it declines with further age (25). The average parity in each group in this study was the second parity, making the lack of significant differences reasonable. Although the age in months variability within each group was relatively low (overall coefficient of variation, CV = 16%), the lack of significant differences may have been limited by the sample size. Increasing the data size would likely improve the ability to detect age in months effects on dairy cow production performance in future studies.
PCA is a dimensionality reduction algorithm that generates mutually independent composite indicators, called principal components, which explain the relevant variables. This method clusters individuals and variables in loading diagrams to effectively display the data characteristics and reveal the relationships between them (16). In this experiment, PCA was applied to the production performance data of Holstein cows, converting trait variables with different units into uniform linear scores. By simplifying redundant information through dimensionality reduction, differences between groups and individuals were analyzed. As shown in Table 3, the cumulative variance explained by the first two principal components was 85.43%, exceeding the 80% threshold, confirming the representativeness of the loading diagram for these components.
The PCA loading plot (PCA-biplot) illustrates the correlations between component variables and reflects the relationships between the production indices as well as between groups and individuals. As shown in Figures 4, 5, daily milk yield, DMI, RFI, age in months, and DIM—key production indicators for dairy cows—can be reduced to two dimensions, enabling observation of both individual and group distribution based on the principal components, along with the variable loading vectors. For example, daily milk yield and DMI exhibit a strong positive correlation: as one increases, so does the other. This aligns with findings by Zhang et al. using a large group, who described a similar trend in lactating cows, where DMI and milk production evolved together (12). Additionally, it has been reported that as age in months increases, body function development in cows, such as an increase in rumen volume, leads to greater feed intake (12). Figure 4 highlights that feed intake is a critical factor influencing lactation performance in dairy cows, reinforcing the positive correlation between DMI and MY observed in prior studies (30). These results further support the validity of the PCA findings in this study. However, age in months was projected closely to Dim2 which although having acute angle with daily milk yield, DMI, RFI, and DIM, the angle was not close to zero degrees, especially for DIM and daily milk yield. This explained why the difference for those traits were not significant even modeled based on the ANCOVA and WLM algorism as the angle were even reach 90 degrees. Further, the fact that correlation of age with other traits were not significant (Figure 1) also confirmed age in months had no effects on studied productive traits.
Similarly, Figure 5 illustrates that the individual scatters of each group were not clearly distinguishable along the first principal component, suggesting a considerable diversity of samples within each group. However, along the second principal component, the three groups displayed a hierarchical distribution corresponding to their respective age in months. Specifically, the lower age in months in Group A was aligned with the negative direction of Dim2, while the higher age in months in Group C was positioned toward the positive direction of Dim2. Overall, Group A was more concentrated toward the positive direction of the first principal component, indicating higher production performance, while Groups B and C were situated more toward the negative direction, reflecting relatively lower production efficiency. These observations are consistent with the ANCOVA results, which showed that Group A had a lower RFI compared to the other groups, along with a relatively higher milk yield (Table 2).
In summary, based on ANCOVA and the WLM analysis, after correcting for the DIM effect, the DMI, RFI, and MY of the different age in months groups exhibited changes compared to pre-correction values, highlighting the importance of adjusting for DIM. However, despite these corrections, no significant differences in corrected DMI, RFI, or MY were observed across the groups. The ANCOVA results suggest that, at least within the age range of 38.78–53.83 months, age in months does not significantly influence the productive performance of lactating dairy cows. What is more, according to the effect size evaluation from both the ANCOVA and WLM, the age in months effect was small (0.02) to medium (1.36) with 95% confidence intervals across 0 which confirmed the findings above. Additionally, the PCA results revealed clear relationships between performance traits, such as the strong association between DMI and MY with Dim1, emphasizing the role of DMI in lactation performance. The score plot further clarified the distribution of inter-group and inter-individual relations. The lack of clear separation between groups based on age in months served as additional confirmation of the ANCOVA and WLM findings, suggesting that within this age range, age in months is not a major factor influencing the production performance of dairy cows at least at a sub-group level. Due to the on-site limitation, the current research aggregated the individual animals into sub-groups to do the analysis. Although many researches in poultry science and pig science field often adopt the aggregate data as the smallest unit for statistical analysis (i.e., a replicate containing many animals as the smallest unit) (31–33). This increased the risk of Type II errors and preventing individual-level inferences and eliminates valuable within-group variability and raises concerns about ecological fallacy. So, future research could benefit from increasing the sample size (especially the individual level data) or the number of the sub-groups to enhance statistical power and the reliability of conclusions.
5 Conclusions
After accounting for the effect of days in milk (DIM) using ANCOVA and WLM, notable adjustments were observed in the estimates of production traits, underscoring the necessity of controlling for DIM in performance analyses. Although age in months showed effect on DMI, RFI, or MY but not significant in this research, the finding may reflect current data structure limitations. With expanded sample sizes, especially at individual level, in future studies, age in months might potentially be identified as a significant factor. This would support more precise age-based classification of animals, facilitating targeted nutritional and management interventions to optimize production efficiency. So, the data derived from on-site production conditions can provide a reliable reference for farm management, after given appropriate analysis. The dynamic relationships between these traits could inform real-time guidance for precision management at various production stages to optimize economic outcomes.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
This experiment was conducted in accordance with the Guide for the Care and Use of Laboratory Animals, and received approval from the Animal Ethics and Welfare Committee at Shenyang Agricultural University (22030604). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent was obtained from the owners for the participation of their animals in this study.
Author contributions
LZ: Data curation, Formal analysis, Investigation, Methodology, Writing – original draft, Writing – review & editing. C-LL: Data curation, Formal analysis, Investigation, Writing – original draft, Writing – review & editing. QM: Data curation, Formal analysis, Investigation, Writing – original draft, Writing – review & editing. J-CL: Data curation, Formal analysis, Writing – original draft, Writing – review & editing. C-FL: Investigation, Methodology, Writing – review & editing. Y-BM: Investigation, Data curation, Writing – original draft, Writing – review & editing. QS: Investigation, Writing – original draft. J-PZ: Investigation, Validation, Writing – review & editing. J-TL: Data curation, Writing – original draft, Writing – review & editing. S-YZ: Conceptualization, Funding acquisition, Project administration, Supervision, Validation, Visualization, Writing – review & editing. J-MC: Conceptualization, Funding acquisition, Supervision, Validation, Visualization, Writing – review & editing. Y-LZ: Formal analysis, Investigation, Methodology, Writing – original draft, Writing – review & editing.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This research was funded by National Natural Science Foundation of China (32472853) and partly from China Postdoctoral Science Foundation (2022MD723805) and the Science and Technology Plan Project of Shenyang City (No. 24-215-2-08).
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author J-MC declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Pereira PC. Milk nutritional composition and its role in human health. Nutrition. (2014) 30:619–27. doi: 10.1016/j.nut.2013.10.011
2. Singhal Y, Bake RD, Baker SS. A comparison of the nutritional value of cow's milk and nondairy beverages. J Pediatr Gastroenterol Nutr. (2017) 64:799–805. doi: 10.1097/MPG.0000000000001380
3. Adesogan AT, Dahl GE. MILK symposium introduction: dairy production in developing countries. J Dairy Sci. (2020) 103:9677–80. doi: 10.3168/jds.2020-18313
4. Brito LF, Bedere N, Douhard F, Oliveira HR, Arnal M, Peñagaricano F, et al. Review: genetic selection of high-yielding dairy cattle toward sustainable farming systems in a rapidly changing world. Animal. (2021) 15:100292. doi: 10.1016/j.animal.2021.100292
5. Bova TL, Chiavaccini L, Cline GF, Hart CG, Matheny K, Muth AM, et al. Environmental stressors influencing hormones and systems physiology in cattle. Reprod Bio Endocrinol. (2014) 12:58. doi: 10.1186/1477-7827-12-58
6. Kerwin AL, Burhans WS, Mann S, Nydam D, Wall SK, Schoenberg KM, et al. Transition cow nutrition and management strategies of dairy herds in the northeastern United States: Part II-Associations of metabolic- and inflammation-related analytes with health, milk yield, and reproduction. J Dairy Sci. (2022) 105:5349–69. doi: 10.3168/jds.2021-20863
7. Lokhorst C, de Mol RM, Kamphuis C. Invited review: big data in precision dairy farming. Animal. (2019) 13:1519–28. doi: 10.1017/S1751731118003439
8. Lean IJ, Golder HM, LeBlanc SJ, Duffield T, Santos JEP. Increased parity is negatively associated with survival and reproduction in different production systems. J Dairy Sci. (2023) 106:476–99. doi: 10.3168/jds.2021-21672
9. Pushpa; Magotra A, Sindhu V, Chaudhary P. Evaluation of parity and its interaction effect with season of calving, period of calving on production and reproduction traits in hardhenu (Bos taurus × Bos indicus) dairy cattle. Reprod Domestic Animal. (2024) 59:e14537. doi: 10.1111/rda.14537
10. Vrhel M, Duchacek J, Gasparik M, Vacek M, Codl R, Pytlik J. Association between production and reproduction parameters based on parity and breed of dairy cows in the Czech Republi. Arch Anim Breed. (2024) 67:197–205. doi: 10.5194/aab-67-197-2024
11. Li M, Rosa GJM, Reed KF, Cabrera VE. Investigating the effect of temporal, geographic, and management factors on US Holstein lactation curve parameter. J Dairy Sci. (2022) 105:7525–38. doi: 10.3168/jds.2022-21882
12. Zhang L, Gengler N, Dehareng F, Colinet F, Froidmont E, Soyeurt H. Can we observe expected behaviors at large and individual scales for feed efficiency-related traits predicted partly from milk mid-infrared spectra? Animals. (2020) 10:873. doi: 10.3390/ani10050873
13. Al-Marashdeh O, Maxwell TMR, Wheadon NM, Moonsan Y, Edwards GR, Cheng L. How does stage of lactation and breeding worth affect milk solids production and production efficiency of grazing dairy cows? Anim Biotechnol. (2020) 31:217–22. doi: 10.1080/10495398.2019.1583108
14. Piccioli-Cappelli F, Seal CJ, Parker DS, Loor JJ, Minuti A, Lopreiato V, et al. Effect of stage of lactation and dietary starch content on endocrine-metabolic status, blood amino acid concentrations, milk yield, and composition in Holstein dairy cows. J Dairy Sci. (2022) 105:1131–49. doi: 10.3168/jds.2021-20539
15. Stanley P. Analysis of covariance: a useful tool for the pharmacologist to reduce variation and improve precision using fewer animals. Br J Pharmacol. (2022) 179:3645–50. doi: 10.1111/bph.15855
16. Abdi, H, Williams, LJ. Principal component analysis. WIREs Comput Stat. (2010) 2:433–59. doi: 10.1002/wics.101
17. Rybakowska P, Van Gassen S, Quintelier K, Saeys Y, Alarcon-Riquelme ME, Maranon C. Data processing workflow for large-scale immune monitoring studies by mass cytometry. Comput Struct Biotechnol J. (2021) 19:3160–75. doi: 10.1016/j.csbj.2021.05.032
18. Curran PJ, West SG, Finch JF. The robustness of test statistics to nonnormality and specification error in confirmatory factor analysis. Psychol Methods. (1996) 1:16–29. doi: 10.1037//1082-989X.1.1.16
19. Schreiber JB. Issues and recommendations for exploratory factor analysis and principal component analysis. Res Soc Adm Pharm. (2021) 17:1004–11. doi: 10.1016/j.sapharm.2020.07.027
20. Prentice RL, Sheppard L. Aggregate data studies of disease risk factors. Biometrika. (1995) 82:113–25. doi: 10.1093/biomet/82.1.113
21. Wakefield J, Salway R. A statistical framework for ecological and aggregate studies. J R Stat Soc A (Stat Soc). (2001) 164:119–37. doi: 10.1111/1467-985X.00191
22. Torres David J. Damain R. Investigating the ecological fallacy through sampling distributions constructed from finite populations. Monte Carlo Methods Appl. (2024) 30:331–63. doi: 10.1515/mcma-2024-2013
23. ICAR Committee. Section 2—Guidelines for Dairy Cattle Milk Recording. International Organization for Standardization (2017). Available online at: https://www.icar.org/Guidelines/02-Overview-Cattle-Milk-Recording.pdf (Accessed June 1, 2023).
24. De Boever JL, Goossens K, Peiren N, Swanckaert J, Ampe B, Reheul D, et al. The effect of maize silage type on the performances and methane emission of dairy cattle. J Anim Physiol Anim Nutr. (2017) 101:e246–56. doi: 10.1111/jpn.12598
25. Zhang L. Prediction of feed efficiency based on test-day liveweight of dairy cows estimated using animal characteristics and milk mid-infrared spectra-all databases. Doctor's thesis, University of Liege, Liege, Belgium (2021).
26. Reshalaitihan M, Wynn S, Teramura M, Sato T, Hanada M. Effect of parity number on the dry matter intake of dairy cows during the first week after calvin. J Anim Sci. (2020) 91:e13314. doi: 10.1111/asj.13314
27. Potts SB, Boerman JR, Lock AL, Allen MS, VandeHaar MJ. Residual feed intake is repeatable for lactating holstein dairy cows fed high and low starch diet. J Dairy Sci. (2015) 98:4735–47. doi: 10.3168/jds.2014-9019
28. Arthur JPF, Herd RM. Residual feed intake in beef cattle. Rev Bras Zootec. (2008) 37:269–79. doi: 10.1590/S1516-35982008001300031
29. Grandl F. Longevity of dairy cows: biological and production system implications for greenhouse gas emissions. Doctor's thesis, University of ETH Zurich, Zurich, Switzerland (2017).
30. Liang S, Wu C, Peng W, Liu JX, Sun HZ. Predicting daily dry matter intake using feed intake of first two hours after feeding in mid and late lactation dairy cows with fed ration three times per day. Animals. (2021) 11:104. doi: 10.3390/ani11010104
31. Kandel M, Macelline SP, Melwani RR, Toghyani M, Zadoks RN, Selle PH, et al. Increased dietary nutrient density did not restore growth performance in broiler chickens offered diets with high canola inclusion. J Appl Poult Res. (2025) 34:100566. doi: 10.1016/j.japr.2025.100566
32. Dillard RB, Jones MK, Davis AJ. Assessing dried black soldier fly larva as a feed component for poultry production. J Appl Poult Res. (2025) 34:100570. doi: 10.1016/j.japr.2025.100570
Keywords: analysis of covariance, feed intake, milk yield, PCA, weighted linear model
Citation: Zhang L, Luo C-L, Mu Q, Liu J-C, Li C-F, Ma Y-B, Su Q, Zhang J-P, Li J-T, Zhang S-Y, Chai J-M and Zhao Y-L (2026) Does age in months influence Chinese Holstein dairy cow production performance after controlling for days in milk? Front. Vet. Sci. 12:1743799. doi: 10.3389/fvets.2025.1743799
Received: 11 November 2025; Revised: 14 December 2025;
Accepted: 22 December 2025; Published: 23 January 2026.
Edited by:
Tugay Ayasan, Osmaniye Korkut Ata University, TürkiyeReviewed by:
Nazire Mikail, Siirt University, TürkiyeWenxun Chen, Wuhan Polytechnic University, China
Copyright © 2026 Zhang, Luo, Mu, Liu, Li, Ma, Su, Zhang, Li, Zhang, Chai and Zhao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Shu-Yi Zhang, c3poYW5nQHN5YXUuZWR1LmNu; Jian-Min Chai, SmNoYWlAdWFyay5lZHU=; Yao-Lu Zhao, eWFvbHV6aGFvQHN5YXUuZWR1LmNu
†These authors have contributed equally to this work
Cheng-Long Luo2†