ORIGINAL RESEARCH article

Front. Plant Sci.

Sec. Plant Pathogen Interactions

Volume 16 - 2025 | doi: 10.3389/fpls.2025.1621604

This article is part of the Research TopicOmics Applications for Pathogen Control and Disease ResistanceView all 5 articles

Integrative Identification of Key Genes Governing Verticillium Wilt Resistance in Gossypium hirsutum Using Machine Learning and WGCNA

Provisionally accepted
Yufeng  LeiYufeng Lei1Jing  ZhaoJing Zhao2Siyuan  HouSiyuan Hou1Fufeng  XuFufeng Xu1Chongbo  ZhangChongbo Zhang1Dongchen  CaiDongchen Cai1Xiaolei  CaoXiaolei Cao1Zhaoqun  YaoZhaoqun Yao1*Sifeng  ZhaoSifeng Zhao1*
  • 1Key Laboratory at the Universities of Xinjiang Uygur Autonomous Region for Oasis Agricultural Pest Management and Plant Protection Resource Utilization, Agriculture College, Shihezi, China
  • 2Cotton Research Institute, Xinjiang Academy of Agricultural and Reclamation Sciences, Shihezi, China

The final, formatted version of the article will be published soon.

Verticillium wilt, caused by Verticillium dahliae, is one of the most devastating diseases threatening global cotton (Gossypium hirsutum) production. Given the limited effectiveness of chemical control measures and the polygenic nature of resistance, elucidating the key genetic determinants is imperative for the development of resistant cultivars. In this study, we aimed to dissect the temporal transcriptional dynamics and regulatory mechanisms underlying Gossypium hirsutum response to V. dahliae infection. Using a Gossypium hirsutum cultivar Jimian 11, we conducted time-course RNA-Seq on root and leaf tissues post-inoculation. Comparative transcriptome analysis revealed that roots displayed more rapid and intense transcriptional responses than leaves. Weighted gene co-expression network analysis (WGCNA) identified infection time-correlated modules enriched in defense-related pathways. Furthermore, machine learning algorithms including LASSO, Random Forest, and SVM were applied to select key candidate genes from DEGs and network modules. A robust set of core genes involved in pathogen recognition (GhRLP6), calcium signaling (GhCIPK6, GhCBP60A), hormone response, and secondary metabolism (GhF3'H) were identified. Our findings provide novel insights into the spatiotemporal regulation of immune responses in cotton and offer valuable candidate genes for molecular breeding of Verticillium wilt resistance.

Keywords: verticillium wilt, Gossypium hirsutum, RNA-Seq, WGCNA, machine learning, Disease Resistance

Received: 01 May 2025; Accepted: 04 Jul 2025.

Copyright: © 2025 Lei, Zhao, Hou, Xu, Zhang, Cai, Cao, Yao and Zhao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Zhaoqun Yao, Key Laboratory at the Universities of Xinjiang Uygur Autonomous Region for Oasis Agricultural Pest Management and Plant Protection Resource Utilization, Agriculture College, Shihezi, China
Sifeng Zhao, Key Laboratory at the Universities of Xinjiang Uygur Autonomous Region for Oasis Agricultural Pest Management and Plant Protection Resource Utilization, Agriculture College, Shihezi, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.