ORIGINAL RESEARCH article

Front. Anim. Sci.

Sec. Precision Livestock Farming

Volume 6 - 2025 | doi: 10.3389/fanim.2025.1579974

This article is part of the Research TopicSustainable and Climate Resilient Livestock SystemsView all 3 articles

Machine Learning Approaches for Predicting Feed Intake in Australian Merino, Corriedale, and Dohne Merino Sheep

Provisionally accepted
  • 1Departament of Animal Science, Federal University of Rio Grande do Sul, Porto Alegre, Brazil
  • 2Sistema Ganadero Extensivo, INIA Tacuarembó, Instituto Nacional de Investigación Agropecuaria, Tacuarembó, Uruguay
  • 3Sistema Ganadero Extensivo, INIA Las Brujas, Instituto Nacional de Investigación Agropecuaria, Canelones, Uruguay

The final, formatted version of the article will be published soon.

Feed intake is a challenging trait to measure due to the high costs associated with labor, feeding, and facilities. Applying machine learning approaches, considering traits as potential predictors, offers a cost-effective alternative to direct feed intake measurement. By leveraging existing animal data, these models can optimize resources and enable feed intake estimation across a larger population without the need for labor-intensive trials. This research aimed to test combinations of feature selection and prediction models to find the best feed intake (expressed as metabolizable energy intake) prediction approach for a dataset comprising Australian Merino, Corriedale, and Dohne Merino data. The study dataset with 1,708 observations included 920 Australian Merino, 215 Corriedale, and 337 Dohne Merino sheep from 17 feed intake trials conducted between 2019 and 2022. The dataset was randomly partitioned into two subsets: one for training (80%) the algorithms and the other for direct validation (20%). Feature selection methods included track analysis, stepwise model, and principal components analysis. The prediction models were stepwise, linear regression, nonlinear regression, k-nearest neighbor regression, random forest regression, and support vector machines. The highest R 2 value was found in the support vector machines using the stepwise model for feature selection, with a value of 0.91 in the cross-validation of the training dataset, and Pearson and Spearman correlation coefficients of 0.95 and 0.93, respectively. In direct validation, the k-nearest neighbor model with the stepwise feature selection model presented the highest Pearson and Spearman correlation coefficients, with values of 0.92 and 0.90, respectively. In the confusion matrix, the support vector machines with stepwise feature selection showed the best performance. The model correctly distinguished between high and low metabolic energy intake in all cases, achieving an overall accuracy of 0.76. This indicates that support vector machines effectively captures the underlying patterns of feed intake distribution. The approaches that presented the best performance balance in both cross-validation and direct validation were the k-nearest neighbor model and the support vector machines using the stepwise model for feature selection.

Keywords: K-nearest neighbor, Enteric methane, Carbon Dioxide, random forest, Support Vector Machines

Received: 19 Feb 2025; Accepted: 05 May 2025.

Copyright: © 2025 Amarilho-Silveira, De Barbieri, Navajas, Cobuci and Ciappesoni. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Fernando Amarilho-Silveira, Departament of Animal Science, Federal University of Rio Grande do Sul, Porto Alegre, Brazil

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.