Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Endocrinol.

Sec. Obesity

Volume 16 - 2025 | doi: 10.3389/fendo.2025.1661376

Extreme gradient boosting using conventional parameters accurately predicts insulin sensitivity in young and middle-aged Japanese persons

Provisionally accepted
Norimitsu  MuraiNorimitsu Murai1*Naoko  SaitoNaoko Saito2Sayuri  NiiSayuri Nii1Hiroto  NishikawaHiroto Nishikawa1Eriko  KodamaEriko Kodama1Tatsuya  IidaTatsuya Iida1Hideyuki  ImaiHideyuki Imai1Mai  HashizumeMai Hashizume1Rie  TadokoroRie Tadokoro1Chiho  SugisawaChiho Sugisawa1Toru  IizakaToru Iizaka1Fumiko  OtsukaFumiko Otsuka1Shun  IshibashiShun Ishibashi2Shoichiro  NagasakaShoichiro Nagasaka1
  • 1Showa University Fujigaoka Hospital, Yokohama, Japan
  • 2Jichi Ika Daigaku, Shimotsuke, Japan

The final, formatted version of the article will be published soon.

Background: This study tested the hypothesis that insulin sensitivity (SI) can be estimated using machine learning (ML) based only on physical indicators or with the addition of lipid and fasting glucose levels. Methods: In 1268 young (age <40 years, normal glucose tolerance; NGT) and 1723 middle-aged Japanese persons with NGT (n=1276) and glucose intolerance (n=447), the Matsuda index and the 1/homeostasis model assessment of insulin resistance were calculated as SI. In each group, SI was estimated by using 8 ML methods, based only on physical indicators, as well as by using physical indicators together with lipid and fasting glucose levels. Eleven lipid-related estimates for SI were also calculated. Results: Estimates by extreme gradient boosting showed the best correlations with SI indices among 8 ML methods. According to feature importance and SHapley Additive exPlanations values, the contribution of each clinical factor to SI differed greatly by age and glucose tolerance status. Relationships of lipid-related estimates with SI were weaker than those of ML-derived estimates. Conclusions: It was possible to estimate SI using ML based only on physical indicators, or those with lipid and fasting glucose levels. The results also imply that it would be difficult to establish universal and robust estimates for SI using conventional parameters. Further validation studies are necessary in diverse ethnic groups with various body composition.

Keywords: Oral glucose tolerance test, insulin sensitivity, Triglyceride glucose index, machine learning, Extreme gradient boosting

Received: 07 Jul 2025; Accepted: 30 Sep 2025.

Copyright: © 2025 Murai, Saito, Nii, Nishikawa, Kodama, Iida, Imai, Hashizume, Tadokoro, Sugisawa, Iizaka, Otsuka, Ishibashi and Nagasaka. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Norimitsu Murai, uipnpzk2npq3@yahoo.co.jp

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.