Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Microbiol.

Sec. Infectious Agents and Disease

Volume 16 - 2025 | doi: 10.3389/fmicb.2025.1654918

This article is part of the Research TopicRapid and Efficient Analytical Technologies for Pathogen DetectionView all 9 articles

Machine Learning-Based High-Specificity Diagnostic Model for Talaromyces marneffei Infection in Febrile Patients using Routine Clinical Laboratory Data

Provisionally accepted
Yingjun  XiaoYingjun Xiao1Xiling  ChenXiling Chen1Xiping  OuXiping Ou2Zheqing  DongZheqing Dong1Xiaoyan  ZhangXiaoyan Zhang3Wei  LiangWei Liang4Xiaojing  NanXiaojing Nan1Chan  XuChan Xu1Xiaobo  LaiXiaobo Lai2Peng  XuPeng Xu1,2Kui  FangKui Fang1*
  • 1Third Affiliated Hospital of Zhejiang Chinese Medical University, Hangzhou, China
  • 2Zhejiang Chinese Medical University, Hangzhou, China
  • 3The First People's Hospital of Yuhang District, Hangzhou, China
  • 4Taizhou Fourth People's Hospital, Taizhou, China

The final, formatted version of the article will be published soon.

Objective: This study developed and validated a machine learning (ML)-based predictive model utilizing febrile patients' routine clinical laboratory data for the purpose of screening such patients for Talaromyces marneffei infection and to provide reference information for feature selection in the subsequent establishment of a more precise early warning model. Methods: This retrospective study enrolled febrile patients who visited Zhejiang Provincial People's Hospital and the Third Affiliated Hospital of Zhejiang Chinese Medical University from January 2021-April 2025. Patient data, including sex, age, and laboratory test results, were collected. Through sparse partial least squares discriminant analysis, the most informative features were extracted from the dataset. Six classic machine learning algorithms were utilized to develop the optimal predictive model through 1000 bootstrap resamplings. Finally, the model was validated on an independent clinical validation dataset. Results: The training dataset comprised 485 febrile patients (141 with T. marneffei infection). The clinical validation dataset comprised 1,953 febrile patients (13 with T. marneffei infection). The random forest model demonstrated the highest performance in classifying T. marneffeiinfected patients, with an area under the receiver operating characteristic curve of 0.987 in outof-bag validation and 0.989 in clinical validation. The model also exhibited good specificity (0.999) for T. marneffei infection and good sensitivity (0.845) in predicting bacteraemia in clinical validation.4 Conclusion: A random forest model can effectively utilize routine clinical laboratory data to predict T. marneffei infection and bacteraemia in febrile patients, offering a promising early screening tool for individuals at high risk for T. marneffei infection.

Keywords: Talaromyces marneffei, Febrile patients, machine learning, predictive model, feature mining

Received: 30 Jun 2025; Accepted: 16 Aug 2025.

Copyright: © 2025 Xiao, Chen, Ou, Dong, Zhang, Liang, Nan, Xu, Lai, Xu and Fang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Kui Fang, Third Affiliated Hospital of Zhejiang Chinese Medical University, Hangzhou, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.