ORIGINAL RESEARCH article
Front. Digit. Health
Sec. Digital Mental Health
Volume 7 - 2025 | doi: 10.3389/fdgth.2025.1585309
Supervised machine learning applied in nursing notes for identifying childhood cancer patients' need for psychosocial support
Provisionally accepted- 1Department of Computing, Faculty of Technology, University of Turku, Turku, Finland
- 2Department of Computer Science, School of Science, Aalto University, Otakaari, Ostrobothnia, Finland
- 3Department of Nursing Science, Faculty of Medicine, University of Turku, Turku, Finland
- 4Nursing Administration, Turku University Hospital, Turku, Finland
- 5Department of Paediatrics and Adolescent Medicine, Turku University Hospital, Turku, Finland
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
IntroductionChildhood cancer survivors have a higher risk for mental health and adaptive problems compared, e.g., to their siblings. Assessing the need for psychosocial support is essential for prevention. This project aimed to investigate the use of supervised machine learning in the form of text classification in identifying childhood cancer patients needing psychosocial support from nursing notes when at least one year had passed from their cancer diagnosis.MethodsWe evaluated three well-known machine learning-based models to recognize patients who had outpatient clinic reservations in the mental health-related care units from free-text nursing notes of 1672 patients. For model training, the patients were children diagnosed with diabetes mellitus or cancer, while evaluation of the model was done on the patients diagnosed with cancer. A stratified five-fold nested cross-validation was used. We designed this as a binary classification task, with the labels: no support (0) or psychosocial support (1) was needed. Patients with the latter were identified by having an outpatient appointment reservation in a mental health-related care unit at least one year after their primary diagnosis.ResultsThe Random Forest classification model trained on both cancer and diabetes patients performed best for the cancer patient population in three times repeated nested-cross validation with 0.798 mean area under the receiver operating characteristics curve and was better with 99% probability (credibility interval - 0.2840, - 0.0422) than the neural network-based model using only cancer patients in training when comparing all classifiers pairwise by Bayesian Correlated t-test. ConclusionsUsing machine learning to predict childhood cancer patients needing psychosocial support was possible using nursing notes with a good area under the receiver operating characteristics curve. The reported experiment indicates that machine learning may assist in identifying patients likely to need mental health-related support later in life.
Keywords: Cancer, nursing notes, machine learning, Electronic Health Records, Psychosocial support systems, late effects
Received: 28 Feb 2025; Accepted: 15 Jul 2025.
Copyright: © 2025 Reunamo, Moen, Salanterä and Lähteenmäki. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Akseli Reunamo, Department of Computing, Faculty of Technology, University of Turku, Turku, 20014, Finland
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.