Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Bioinform.

Sec. Single Cell Bioinformatics

Integrating Trajectory Inference and Self-Explainable Predictive Models to Explore Cell State Transitions in Breast Cancer at Single-Cell Resolution

Provisionally accepted
  • 1University of Calabria, Cosenza, Cosenza, Italy
  • 2Ospedale Civile dell'Annunziata, Cosenza, Italy

The final, formatted version of the article will be published soon.

Breast cancer is characterized by a highly heterogeneous cellular environment composed of diverse malignant clones and components of the tumor microenvironment (TME) that collectively influence the progression of the disease. Single-cell RNA sequencing (scRNA-seq) offers a powerful tool to dissect this complexity, enabling high-resolution characterization of tumor heterogeneity and the functional interactions within the TME. Moreover, it supports the discovery of clinically relevant subpopulations and potential therapeutic targets. In this study, we present a novel scRNA-seq dataset from an infiltrating ductal breast cancer, profiling over 5,000 cells and identifying six distinct clusters spanning cancer and TME populations. To explore the molecular drivers of cell state transitions, we integrate pseudotime trajectory inference with interpretable, tree-based machine learning. This approach enables the identification of key genes and expression thresholds associated with dynamic phenotypic shifts. Unlike black-box models, our framework yields transparent, rule-based insights into transcriptional reprogramming during tumor evolution. The resulting dataset, along with an accessible and transparent analytical pipeline, offers a valuable resource for the breast cancer research community and lays the groundwork for future studies aimed at refining molecular classification and precision therapy development.

Keywords: breast cancer, Interpretability, single-cell RNA sequencing, statistical learning, trajectory inference

Received: 24 Jul 2025; Accepted: 29 Jan 2026.

Copyright: © 2026 Verrina, Talia, Cesario, Capalbo, Scordamaglia, Lappano, Miglietta, Maggiolini and Giordano. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Marcello Maggiolini
Sabrina Giordano

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.