Your new experience awaits. Try the new design now and help us make it even better

BRIEF RESEARCH REPORT article

Front. Sports Act. Living

Sec. Sports Management, Marketing, and Economics

Volume 7 - 2025 | doi: 10.3389/fspor.2025.1638446

This article is part of the Research TopicEvolving Economies in Sports: Management Practices and Market ImpactsView all 5 articles

Advancing NFL Win Prediction: From Pythagorean Formulas to Machine Learning Algorithms

Provisionally accepted
Caroline  WeirichCaroline Weirich1Jun Woo  KimJun Woo Kim1*Youngmin  YoonYoungmin Yoon2Seunghoon  JeongSeunghoon Jeong3
  • 1Arcadia University, Glenside, United States
  • 2University of North Texas, Denton, United States
  • 3Woosuk University, Wanju-gun, Republic of Korea

The final, formatted version of the article will be published soon.

This study evaluates the predictive performance of traditional and machine learning-based models in forecasting NFL team winning percentages over a 21-season dataset (2003–2023). Specifically, we compare the Pythagorean expectation formula—commonly used in sports analytics—with Random Forest regression and a feedforward Neural Network model. Using key performance indicators such as points scored, points allowed, turnovers, rushing and passing efficiency, and penalties, the machine learning models demonstrate superior predictive accuracy. The Neural Network model achieved the highest performance (MAE = 0.052, RMSE = 0.064, R² = 0.891), followed by the Random Forest model, both of which significantly outperformed the Pythagorean method. Feature importance analysis using SHAP values identifies points scored and points allowed as the most influential predictors, supplemented by margin of victory, turnovers, and offensive efficiency metrics. These findings underscore the limitations of fixed-formula models and highlight the flexibility and robustness of data-driven approaches. The study offers practical implications for analysts, coaches, and sports management professionals seeking to optimize strategic decisions and competitive performance. Ultimately, the integration of advanced machine learning models provides a powerful tool for enhancing decision-making processes across the NFL landscape.

Keywords: NFL, Neural Network, Pythagorean theorem, machine learning, Sports analytics, random forest

Received: 30 May 2025; Accepted: 28 Aug 2025.

Copyright: © 2025 Weirich, Kim, Yoon and Jeong. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Jun Woo Kim, Arcadia University, Glenside, United States

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.