- 1Department of Neurology, Lianyungang Second People’s Hospital Affiliated to Kangda College of Nanjing Medical University, Lianyungang, Jiangsu, China
- 2School of Psychology, Zhejiang Normal University, Jinhua, China
- 3Department of Psychology, School of Humanities and Social Sciences, University of Science and Technology of China, Hefei, China
- 4Department of Philosophy, Zhenjiang Mental Health Center, Zhenjiang, Jiangsu, China
- 5School of Physical Education and Health, Guangzhou City Construction College, Guangzhou, Guangdong, China
- 6Kangda College of Nanjing Medical University, Lianyungang, Jiangsu, China
- 7Shanghai Key Laboratory of Psychotic Disorders, Brain Health Institute, National Centre for Mental Disorders, Shanghai Mental Health Centre, Shanghai Jiao Tong University School of Medicine, Shanghai, China
Background: Post-stroke language impairment affects nearly one-third of acute stroke patients or 30% of ischemic stroke patients, yet predicting its progression remains challenging due to multifactorial recovery processes.
Methods: Predictors were selected using Least Absolute Shrinkage and Selection Operator (LASSO) regression. The interrelationships among these predictors and their mechanistic links to gender and post-stroke language impairment were subsequently explored employing Graphical LASSO (GLASSO) and Bayesian network analysis. Furthermore, Network Outcome Analysis (NOA) was applied to investigate the associations between preoperative predictors and the severity of postoperative language impairment deterioration.
Results: LASSO identified eight predictors for inclusion in the model. The ROC analysis demonstrated favorable predictive efficacy (AUC = 0.80 [95% CI: 0.719–0.876], accuracy = 0.73, sensitivity = 0.78, specificity = 0.69, PPV = 0.71, NPV = 0.76). The DCA results (probability range: 0–0.81, 0.91) further indicated good clinical utility of the model. Preoperative GCS emerged as the primary direct predictor. Although male patients exhibited larger infarct volumes (22.72 mL vs. 15.19 mL in females), this difference was not directly associated with poorer language outcomes.
Discussion: This multimodal model, enhanced by network analysis, accurately predicts language impairment progression and highlights preoperative consciousness as a key mediator, supporting precision stroke rehabilitation by capturing complex predictor interrelationships.
1 Introduction
Stroke remains a leading cause of long-term disability worldwide, with neurological deterioration during the acute phase significantly impacting recovery outcomes and rehabilitation strategies (1). Among the various post-stroke impairments, language impairment—a motor speech disorder affecting articulation—occurs in approximately one-third of acute stroke patients or 30% of ischemic stroke patients, and serves as a sensitive marker of neurological deterioration (2, 3). Early prediction of language impairment worsening is clinically crucial, as timely identification allows for targeted speech therapy interventions and improved communication outcomes (2). However, predicting which patients will experience language impairment progression remains challenging due to the multifactorial nature of stroke recovery.
Traditional stroke severity assessments, including the National Institutes of Health Stroke Scale (NIHSS), Glasgow Coma Scale (GCS), and modified Rankin Scale (mRS), provide valuable baseline measures but demonstrate limited accuracy in predicting specific neurological deterioration patterns (4, 5). While these clinical scales capture acute stroke severity, they fail to account for critical imaging features such as infarct volume and location, which are known to influence speech outcomes (6, 7). Furthermore, most existing prediction models employ univariate approaches that overlook complex interactions among clinical, biochemical, and imaging factors (8). For instance, studies have shown that infarct volume impacts recovery differently across sexes, with males exhibiting stronger associations between lesion size and functional outcomes (9, 10). The lack of models that simultaneously consider these multimodal factors and their interrelationships represents a significant gap in stroke prognostication.
Recent advances in machine learning and network analysis offer promising solutions to address these limitations. Least Absolute Shrinkage and Selection Operator (LASSO) regression enables efficient identification of the most predictive variables from high-dimensional datasets, reducing model complexity while maintaining accuracy (11). Beyond variable selection, network analysis methods including Graphical LASSO (GLASSO) and Bayesian networks can map complex interdependencies among predictors, revealing non-obvious relationships that traditional regression models miss (12). For example, here we use GLASSO to identify whether infarct volume directly influences language impairment deterioration or operates through indirect pathways via consciousness level or functional independence. Similarly, Bayesian networks can establish directional relationships, distinguishing causal predictors from mere correlates (13). Despite their potential, these advanced analytical techniques remain underutilized in stroke recovery research, particularly for predicting specific neurological outcomes like language impairment worsening.
Here we report on the development and validation of a comprehensive predictive model for early language impairment progression following stroke surgery by integrating clinical assessments (NIHSS, GCS, mRS, Activities of Daily Living scale [ADL], Clinical Dementia Rating [CDR], Alberta Stroke Program Early CT Score [ASPECTS], Hachinski Ischemic Scale [HIS]), biochemical markers, and CT-derived infarct volume and affected brain regions. We employ LASSO regression to identify core predictors from this multimodal dataset, then apply GLASSO and Bayesian network analyses to elucidate the mechanistic pathways linking predictors to language impairment outcomes. A particular focus is placed on investigating sex- specific associations with infarct volume, as these relationships may inform personalized rehabilitation strategies. We hypothesize that: (1) a multimodal model incorporating clinical scales, biomarkers, and imaging features will achieve superior predictive accuracy for language impairment deterioration compared to single-domain approaches; (2) network analysis will reveal that infarct volume influences language impairment worsening through indirect pathways mediated by consciousness level and functional status, with sex-specific effects; and (3) preoperative assessments will demonstrate stronger predictive value than postoperative measures, enabling earlier clinical decision-making.
2 Methods
2.1 Study design
This study employed a cross-sectional cohort design. The data for this study were obtained from stroke patients who have undergone emergency cerebrovascular intervention surgery in a hospital in China and encompassed multiple data modalities, including biochemical indicators, cerebral infarct volume, CT imaging, and standardized questionnaire assessments. Data collection spanned 4 years, from January 2022 to September 2025, with a total of 204 patients enrolled. Data were considered valid only when all 34 specified indicators were completely recorded or assessed. A total of 164 patients were included in the analysis. In observational studies, it is generally recommended that the sample size be 10 to 20 times the number of dependent variables. We plan to include the top 10 variables with nonzero coefficients (or all variables if fewer than 10) as predictors. In the most extreme case, with 10 predictors, one demographic variable (gender), and one dependent variable, the minimum required sample size would be 120. Our study sample size meets and exceeds this minimum requirement. CT imaging assessments and certain questionnaires were designed with a pretest-posttest design. Language impairment was assessed using the ninth item (Language) of the NIHSS questionnaire. Patients underwent preoperative assessment within 2 h prior to emergency cerebrovascular intervention surgery, followed by a postoperative assessment on the second day after surgery (approximately 48 h post-intervention). The difference between the postoperative and preoperative assessments was used to evaluate the degree of deterioration in stroke-related neurological function. All personnel involved in the project received professional training to ensure the validity and consistency of data collection. This study was approved by the Institutional Review Board (IRB) of the Second People’s Hospital of Lianyungang (Approval No. 2025K096). The study was conducted in accordance with local regulations and institutional requirements.
The data analysis process consisted of six steps: 1. Comparing of differential indicators between male and female groups. 2. Comparing the differential indicators between groups with different degrees of language impairment progression. 3. Using LASSO to select core predictive factors for language impairment progression and testing them with ROC and DCA, thereby establishing a reliable predictive model for this speech disorder. 4. Using GLASSO network analysis to explore the influencing mechanisms among core predictive factors, gender, and language impairment progression. 5. Building upon the previous step, using Bayesian network analysis to test the directionality of different influencing paths. 6. Exploring the impact of preoperative predictive factors on postoperative language impairment progression through network outcome analysis (NOA).
2.2 Measurement
The NIHSS is used to assess the degree of neurological deterioration in stroke patients (14). The NIHSS comprises 11 subdomains, including level of consciousness, gaze, visual fields, facial palsy, motor function of the upper limb, motor function of the lower limb, limb ataxia, sensory function, language, dysarthria, and neglect. Post-stroke language impairment progression (measured as the deterioration score, LANG) is defined here as the difference between the preoperative and postoperative NIHSS subdomain scores for language (postoperative score minus preoperative score). A LANG value of less than zero was classified as improvement; all other values constituted the non-improvement group.
The factors influencing the progression of post-stroke language impairment.
The patients’ post-stroke self-care ability, level of consciousness, cognitive function, extent of ischemic injury, overall degree of functional impairment, and vascular dementia were evaluated using six assessment tools: ADL (15), GCS (16), CDR (17), ASPECTS (18), mRS (19), and HIS (20). In addition, biochemical data were obtained from routine clinical blood draws upon admission. Cerebral infarct volume was quantified from admission non-contrast CT scans using a validated semi-automated hemispheric-asymmetry workflow implemented in the FMRIB Software Library (FSL). Each scan was first reoriented to the standard neurological convention (fslreorient2std), skull-stripped with a CT-optimized fractional intensity threshold, resampled to 1.0 mm isotropic resolution, and intensity-normalized within a Hounsfield Unit window of −20 to 80. A left–right flipped image was rigidly registered back to the original image using FLIRT (21), and an absolute voxel wise difference map was computed to highlight hemispheric asymmetry. The map was thresholded at the 98th percentile of its intensity distribution, and clusters smaller than 500 mm3 were removed, yielding the final lesion mask. The mask was rigidly aligned to the MNI152 1 mm template for spatial standardization, and infarct volumes were expressed in mL. All lesion masks underwent visual quality control by experienced raters, and the full methodological details are provided in the Supplementary materials.
In this study, the predictive factors were coded using standardized abbreviations for clarity. Functional status at admission was represented by PQ. Postoperative imaging findings were assessed using PSQ3 (ASPECTS), while discharge outcomes were captured by PSQ6 (mRS). Biochemical marker was represented by BIOC. In addition, VOL (infarct volume), CE (damaged brain regions), and LANG (language impairment score) were also incorporated into the analysis.
2.3 Quality control
The original data were obtained from clinical practice, with a total of 204 stroke patients enrolled. As certain predictive factors are measured infrequently in stroke patients, simply excluding missing values would lead to a significant reduction in sample size. To ensure an adequate sample size, factors with excessive missing values were excluded, specifically the postoperative 3-month and 12-month mRS follow-up scores, as well as homocysteine levels. Subsequently, samples with missing data were removed, resulting in a final sample size of 164 valid cases. Finally, the completeness of the data was verified through visualization of missing data, providing a solid foundation for subsequent analysis.
2.4 Establishment of the predictive model
To identify the key variables influencing language impairment progression, the LASSO algorithm was employed for variable selection. LASSO is a machine learning method that applies L1 regularization to shrink some regression coefficients to zero, thereby eliminating unimportant or redundant variables and facilitating the identification of core features with predictive value.
In this study, biochemical indicators (BIOC), infarct volume (VOL), cerebral affected region (CE), preoperative questionnaire scores (PQ), and postoperative questionnaire scores (PSQ) were included as predictive variables, while the LANG was defined as the dependent variable. Variables with non-zero coefficients after LASSO selection were identified as the key influencing factors associated with language impairment progression.
To evaluate the performance of the model, Receiver Operating Characteristic (ROC) analysis and Decision Curve Analysis (DCA) were conducted. In these analyses, LANG values less than 0 were coded as 1, and all others as 0. ROC analysis was used to assess the predictive accuracy of the model, with the area under the curve (AUC) serving as a key metric. An AUC value closer to 1 indicates superior model performance, with the 95% confidence interval calculated using the DeLong method. DCA was used to evaluate the clinical utility and net benefit of the predictive model, employing 2000 bootstrap resampling iterations to assess net benefit, with threshold values ranging from 0 to 1 to cover clinical decision-making scenarios. The model’s curve achievement of a higher net benefit than the “treat-all” and “treat-none” strategies across a broader range of threshold values indicates greater clinical applicability and utility of the model.
2.5 Network analysis
GLASSO network analysis was applied to construct a regularized network structure to evaluate the strength and centrality of associations among the selected core variables, gender, and language impairment progression following stroke by quickNet R package1. This method effectively reveals the interrelationships among core predictive factors, gender, and language impairment progression outcomes. The centrality measures in the network analysis include Strength, Closeness, Betweenness, and Expected Influence. Strength represents the total intensity of connections for a given node, Closeness measures the average distance from a node to all other nodes, Betweenness assesses the bridging role of a node, and Expected Influence is used to assess the potential impact of a node on other nodes. In the visualization of the network analysis, due to the limited sample size obtained from clinical practice, no fixed threshold was set to better represent the relationships among core predictive factors, gender, and language impairment progression. This approach allowed for a more flexible representation of the interconnections and their potential impacts.
Bayesian network analysis using the bnlearn R package was employed to explore the causal relationships among the core predictive factors, gender, and changes in LANG (22). The hc algorithm was utilized to infer the underlying network structure. The obtained structure was subsequently adapted to the data domain, with arc weights estimated via 50,000 bootstrap resampling iterations. The constructed network was visualized using the qgraph R package to illustrate the relationships among variables. The direction of each edge was determined based on the predominant direction observed during resampling. During the resampling process, if at least 51% of the bootstrapped networks indicated a directed edge from node A to node B, this directionality was represented in the final visualization by an arrow pointing from A to B.
Subsequently, based on the above findings, NOA was further conducted using the quickNet R package. NOA integrates baseline predictive variables and follow-up outcome measures within a single network, allowing for a more comprehensive evaluation of the influence of predictive factors on outcome variables. The predictive variables included those selected from the preoperative assessment indicators and incorporated into the final predictive model, while the outcome variable represented the degree of language impairment progression following stroke. The parameter settings and centrality measures for the NOA network analysis were the same as those used in the GLASSO network.
2.6 Statistical analyses
All statistical analyses were performed using R. Except for the biochemical indicators of Cl and infarct volume, the remaining variables were non-normally distributed. Categorical data were presented as n (%), and between-group comparisons were performed using the chi-square test or Fisher’s exact test (when the expected frequency was less than 5), with effect sizes evaluated by Cramer’s V. Continuous data are expressed as mean ± standard deviation, with between-group comparisons analyzed by independent samples t-tests and effect sizes quantified using Cohen’s d. The p-values were adjusted for multiple comparisons using the Benjamini-Hochberg method. LASSO regression was primarily conducted using the glmnet R package, the ROC curve was plotted with pROC and ggplot2 packages, and the DCA curve was drawn using the rmda package. Network analysis was primarily performed using the quickNet integrated package, the bnlearn package, and the qgraph package for computation or visualization. The alpha level was set at 0.05.
3 Results
3.1 Comparison of parameters between patients of different sexes
As shown in Table 1, among the 164 included patients, 101 were male and 63 were female. Compared to females, on average males were younger (69.39 ± 10.72 vs. 73.51 ± 9.38; t = 2.59, p < 0.05, Cohen’s d = 0.40), had lower lactate dehydrogenase levels (208.32 ± 48.95 vs. 234.25 ± 71.64; t = 2.53, p < 0.05, Cohen’s d = 0.44), and had larger infarct volumes (22.72 ± 6.65 vs. 15.19 ± 6.36; t = −7.24, p < 0.001, Cohen’s d = −1.15). The indicator that remained significant after Benjamini-Hochberg correction was infarct volume, with an adjusted p-value of <0.001. No significant differences were observed for other indicators.
3.2 Comparison of parameters among patients with different degrees of language impairment progression
As shown in Table 2, this study included a total of 164 stroke patients who underwent surgery, among whom 45 patients exhibited postoperative improvement in language function (LANG < 0), while 119 patients showed no improvement (LANG ≥ 0). Compared to the non-improvement group, patients in the improvement group demonstrated significantly lower CDR scores at admission (7.41 ± 3.92 vs. 9.69 ± 4.98; t = 3.08, p = 0.003, Cohen’s d = 0.48), lower GCS scores at admission (7.82 ± 2.33 vs. 9.12 ± 2.29; t = 3.20, p = 0.002, Cohen’s d = 0.56), lower HIS scores at admission (10.09 ± 1.83 vs. 10.84 ± 2.25; t = 2.20, p < 0.05, Cohen’s d = 0.35), and lower mRS scores at discharge (3.02 ± 1.14 vs. 3.66 ± 1.18; t = 3.19, p = 0.002, Cohen’s d = 0.55). In addition, their Cl values were significantly lower (101.76 ± 3.91 vs. 103.10 ± 3.54; t = 2.02, p < 0.05, Cohen’s d = 0.37). Conversely, the improvement group demonstrated higher ASPECTS scores at postoperative (7.78 ± 1.91 vs. 6.71 ± 2.96; t = −2.73, p = 0.007, Cohen’s d = −0.40) and larger infarct volumes (22.07 ± 7.75 vs. 18.98 ± 7.23; t = −2.32, p < 0.05, Cohen’s d = −0.42). The three indicators that remained significant after Benjamini-Hochberg correction were the CDR admission score, GCS admission score, and mRS discharge score, with adjusted p-values all equal to 0.030. No significant differences were observed in the remaining parameters between the two groups.
Table 2. Table of comparison of parameters among patients with different degrees of deterioration in language impairment.
3.3 Predictive model
The results of the LASSO analysis revealed eight core predictive variables, including VOL (infarct volume), CE1 (the affected hemisphere), PQ3 (ADL score at admission), PQ5 (GCS score at admission), BIOC4 (C-reactive protein 0–10 mg/L), BIOC19 (glycated hemoglobin 4–6%), PSQ3 (ASPECTS score after surgery), and PSQ6 (mRS score at discharge). These eight variables were subsequently incorporated into the model as core predictors.
Subsequently, the predictive performance of the LANG model was evaluated, demonstrating satisfactory predictive accuracy. The ROC curve analysis yielded an AUC of 0.80 (95% CI: 0.719–0.876), with an accuracy of 73.3%, sensitivity of 77.8%, specificity of 68.9%, positive predictive value (PPV) of 71.4%, and negative predictive value (NPV) of 75.6%, as shown in Figure 1a. The DCA result demonstrated that the model provided superior net benefit compared to the “None” and “All” strategies across a wide threshold probability range of 0–0.81, 0.91, as shown in Figure 1b.
Figure 1. Validation of the language impairment progression prediction model based on LASSO regression. (a) Core predictive factors ROC curve validation plot. (b) Core predictive factors DCA validation plot. The closer the AUC and accuracy values are to 1, the better the predictive performance of the model. In the DCA, the LASSO model demonstrates a wider threshold range above the None and All strategy curves, indicating greater clinical applicability of the model.
3.4 Network results
The result of the GLASSO network analysis is shown in Figure 2a. PQ5 (preoperative GCS score) has a direct positive edge with LANG (weight = 0.08), while PQ3 (preoperative ADL score), PSQ3 (postoperative ASPECTS score), and PSQ6 (discharge mRS score) indirectly influence LANG through PQ5. Notably, VOL (infarct volume) shows a negative association with gender, with a weight of −0.31. The centrality indicators of the network analysis show that gender, PQ3, PQ5, PSQ3, PSQ6, and VOL play relatively central roles in the network, with Strength values greater than 0.3. Among them, PQ3, PQ5, and PSQ6 serve as strong bridges, with bridging occurrences of 8, 6, and 6, respectively (one unit in the figure represents two times). In the Expected Influence analysis, only PQ3 and PQ5 have positive values, specifically 0.29 and 0.43 respectively, as shown in Figure 2b.
Figure 2. Association among core predictive variables, gender, and progression of language impairment. (a) GLASSO network composed of core predictive factors, gender, and the language impairment progression. (b) Centrality property of core predictive factors, gender, and the language impairment progression network. (c) Bayesian network consisting of core predictive factors, gender, and the language impairment progression. The nodes in the figure represent the filtered core predictive factors, gender, and the language impairment progression. The numerical values along the paths indicate the strength of associations or dependencies between connected variables. Blue lines denote positive correlations, while red lines indicate negative correlations; the thickness and boldness of the lines reflect the magnitude of these associations. The direction of the arrows represents the direction of influence between variables. Green lines in the figure illustrate the conditional dependencies among core predictive factors, gender, and language impairment worsening in the Bayesian network. PSQ3, ASPECTS postoperative score; PSQ6, mRS discharge score; BIOC4, C-reactive protein (0–10 mg/L); BIOC19, glycosylated hemoglobin (4–6%); PQ3, ADL admission score; PQ5, GCS admission score; VOL, cerebral infarction volume; CE1, the affected hemisphere; and LANG, degree of language impairment worsening.
The results of the Bayesian network analysis showed that VOL, PQ5, and PSQ6 had causal effects on LANG, with effect strengths of 0.61, 0.89, and 0.56, respectively. Consistent with the GLASSO results, a significant causal relationship was found between gender and VOL, with an effect strength of 1, as shown in Figure 2c.
As shown in Figure 3a, in the NOA analysis, PQ5 exhibited a positive correlation with LANG, with a weight of 0.07, while PQ3 showed an indirect association with LANG. These results are consistent with the GLASSO network analysis, where PQ3 and PQ5 once again demonstrated their important roles. The centrality indicator results from the NOA network analysis revealed that PQ3 and PQ5 play the most central roles, with Strength values of 0.35 and 0.42, respectively, as shown in Figure 3b.
Figure 3. Network outcome analysis (NOA). (a) NOA composed of core predictive factors for language impairment progression prediction. (b) Centrality property of the NOA network analysis for language impairment progression prediction. The nodes in the figure consist of LANG and the core predictive factors measured at different time points from LANG. Blue lines indicate positive correlations, red lines represent negative correlations, and the thickness and boldness of the lines reflect the strength of the associations. Variables include BIOC4, C-reactive protein (0–10 mg/L); PQ3, ADL admission score; PQ5, GCS admission score; VOL, cerebral infarction volume; CE1, the affected hemisphere; and LANG, degree of language impairment worsening.
4 Discussion
This study developed a multimodal predictive model for language impairment progression following stroke surgery, achieving an AUC of 0.80 (95% CI: 0.719–0.876) with balanced clinical utility across threshold probabilities. LASSO regression identified eight core predictors from clinical, biochemical, and imaging domains, with preoperative consciousness level (GCS) and functional independence (ADL) demonstrating the strongest predictive value. Network analyses revealed two key mechanistic insights: first, GCS functions as a central hub through which other predictors influence language impairment outcomes, rather than operating as isolated risk factors; second, infarct volume exhibits a strong sex-specific association (weight = −0.31; Bayesian effect strength = 1.0), with male patients showing systematically larger lesions. These findings confirm our hypotheses that multimodal integration enhances prediction accuracy and that network analysis elucidates complex interrelationships overlooked by traditional univariate approaches.
The central role of preoperative GCS in both GLASSO and NOA networks—serving as the sole direct predictor of language impairment progression (edge weight = 0.08, 0.07) while bridging connections among ADL, ASPECTS, and mRS—revealed a critical mechanistic pathway. This finding is biologically plausible because the consciousness level reflects the integrity of ascending reticular activating systems and distributed cortical networks that also govern motor speech control (23, 24). The Bayesian network’s directional confirmation (GCS → language impairment, effect strength = 0.89) supports a causal interpretation, consistent with clinical observations that impaired arousal compromises compensatory mechanisms for speech production (25). Importantly, this hub architecture suggests that interventions targeting consciousness enhancement—such as minimizing sedation, optimizing sleep–wake cycles, or augmenting arousal pharmacologically—may yield cascading benefits including improved speech outcomes (26). The positive association (higher GCS predicts greater worsening) underscores the prognostic value of bedside consciousness assessment, enabling rapid risk stratification before surgery using readily available clinical tools rather than advanced imaging or biomarkers.
The strong negative association between infarct volume and male sex represents a novel contribution with implications for personalized prognosis. Our differential analysis demonstrated that males on average harbor significantly larger infarcts (mean 22.72 vs. 15.19 mL), yet this volume difference did not translate into proportionally worse language impairment outcomes, as evidenced by the absence of direct sex → language impairment edges in either network model. This dissociation aligns with recent evidence showing sex-specific lesion-outcome relationships, wherein males exhibit stronger volume-outcome associations but females demonstrate greater vulnerability to smaller yet still functionally consequential lesions in certain regions (27). Several mechanisms may explain this pattern: anatomical factors (larger brain volumes and cerebrovascular reserve in males), hormonal neuroprotection in premenopausal women limiting infarct expansion, and sex differences in stroke etiology producing distinct lesion characteristics (10). Clinically, these findings argue for sex-stratified prognostic thresholds and rehabilitation protocols, as traditional volume-based predictions may systematically underestimate risk in female patients with smaller but equally disabling lesions.
This study has several limitations that should be addressed in future research. Although left-hemisphere lesions were included as a core predictor in the LASSO model, no significant association with language impairment scores was observed in the chi-square or network analysis. This may be due to the small sample size in this study, which could have introduced bias, limited statistical power to detect weaker network connections, and precluded comprehensive subgroup analyses. Additionally, the single-center Chinese cohort may limit the generalizability of the findings to populations with different stroke etiologies or healthcare contexts. The cross-sectional design with short-term follow-up (2 days postoperative) cannot assess long-term language impairment trajectories or validate whether early deterioration predicts persistent speech deficits. Network stability could be improved through fixed thresholding with larger samples, and the reliance on the NIHSS language impairment subscale—a coarse 3-point measure—may have reduced sensitivity to subtle speech changes compared to detailed acoustic analyses. Unmeasured confounders including lesion location (e.g., involvement of precentral gyrus vs. subcortical structures), white matter hyperintensity burden, and premorbid cognitive reserve may partly explain observed associations. Future research should prioritize external validation in diverse cohorts, incorporation of advanced neuroimaging (DTI for white matter tract quantification, resting-state fMRI for network connectivity), longitudinal follow-up through rehabilitation phases, and sex-stratified analyses to determine whether distinct predictive models are clinically warranted. Finally, the short-term follow-up (48 h after surgery) captures acute language deterioration but cannot assess long-term aphasia trajectories. Future research should incorporate extended follow-up periods (3, 6, and 12 months) to validate whether early deterioration serves as a predictor of chronic language impairment.
5 Conclusion
In conclusion, this study developed a multimodal predictive model for post-stroke language impairment progression, achieving robust performance (AUC 0.80, 95% CI 0.719–0.876) with broad clinical decision thresholds. LASSO identified eight key predictors, with preoperative GCS and ADL exerting the strongest influence. Network analyses elucidated critical mechanisms: GCS serves as a central hub mediating indirect pathways from ADL, ASPECTS, and mRS to language outcomes, while infarct volume shows a pronounced sex-specific pattern (weight = −0.31). These findings validate multimodal integration for superior prediction and reveal interconnected risk architectures overlooked by conventional models, paving the way for precision interventions targeting consciousness enhancement and sex-stratified rehabilitation strategies.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by Institutional Review Board of the Second People’s Hospital of Lianyungang (2025K096). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
G-DL: Resources, Writing – original draft, Writing – review & editing. Z-HT: Data curation, Writing – review & editing. Y-KC: Data curation, Formal analysis, Validation, Writing – original draft. Z-KW: Investigation, Writing – review & editing. Z-NZ: Data curation, Writing – review & editing. B-HL: Data curation, Writing – review & editing. Y-HC: Resources, Writing – review & editing. S-SM: Resources, Writing – review & editing. BF: Resources, Writing – review & editing. HZ: Conceptualization, Project administration, Supervision, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by the 2024 Lianyungang Municipal Health and Medical Science and Technology Project (No. 202418), Jiangsu Provincial Foreign Specialists Hundred Talents Program Project (BX2021081) and High-end Foreign and Experts Introduction Program Project (G2021014067L).
Acknowledgments
Thank you to all participants and their families.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that Gen AI was used in the preparation of this manuscript. During the preparation of this work, the authors used DeepSeek V3.2, ChatGPT-4o and ChatGPT-5 Series for the initial translation and formatting checks. AI was not employed for scientific insights, data analysis, or interpretation of results. After using these tools, the authors reviewed and edited the content as needed and take full responsibility for the content of the publication.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2025.1728433/full#supplementary-material
Footnotes
References
1. GBD 2019 Stroke Collaborators. Global, regional, and national burden of stroke and its risk factors, 1990-2019: a systematic analysis for the global burden of disease study 2019. Lancet Neurol. (2021) 20:795–820. doi: 10.1016/S1474-4422(21)00252-0
2. Brady, MC, Kelly, H, Godwin, J, Enderby, P, and Campbell, P. Speech and language therapy for aphasia following stroke. Cochrane Database Syst Rev. (2016) 2016:CD000425. doi: 10.1002/14651858.CD000425.pub4
3. Grönberg, A, Henriksson, I, Stenman, M, and Lindgren, AG. Incidence of aphasia in ischemic stroke. Neuroepidemiology. (2022) 56:174–82. doi: 10.1159/000526710
4. Inoa, V, Aron, AW, Staff, I, Fortunato, G, and Sansing, LH. Lower NIH stroke scale scores are required to accurately predict a good prognosis in posterior circulation stroke. Cerebrovasc Dis. (2014) 37:251–5. doi: 10.1159/000358869
5. Quinn, TJ, Dawson, J, Walters, MR, and Lees, KR. Functional outcome measures in contemporary stroke trials. Int J Stroke. (2009) 4:200–5. doi: 10.1111/j.1747-4949.2009.00271.x
6. Etherton, MR, Rost, NS, and Wu, O. Infarct topography and functional outcomes. J Cereb Blood Flow Metab. (2018) 38:1517–32. doi: 10.1177/0271678X17700666
7. Liew, S-L, Zavaliangos-Petropulu, A, Jahanshad, N, Lang, CE, Hayward, KS, Lohse, KR, et al. The ENIGMA stroke recovery working group: big data neuroimaging to study brain-behavior relationships after stroke. Hum Brain Mapp. (2022) 43:129–48. doi: 10.1002/hbm.25015
8. Zu, W, Huang, X, Xu, T, Du, L, Wang, Y, Wang, L, et al. Machine learning in predicting outcomes for stroke patients following rehabilitation treatment: a systematic review. PLoS One. (2023) 18:e0287308. doi: 10.1371/journal.pone.0287308
9. Bonkhoff, AK, Schirmer, MD, Bretzner, M, Hong, S, Regenhardt, RW, Brudfors, M, et al. Outcome after acute ischemic stroke is linked to sex-specific lesion patterns. Nat Commun. (2021) 12:3289. doi: 10.1038/s41467-021-23492-3
10. Carcel, C, Woodward, M, Wang, X, Bushnell, C, and Sandset, EC. Sex matters in stroke: a review of recent evidence on the differences between women and men. Front Neuroendocrinol. (2020) 59:100870. doi: 10.1016/j.yfrne.2020.100870
11. Meng, Z, Wang, M, Guo, S, Zhou, Y, Zheng, M, Liu, M, et al. Development and validation of a LASSO prediction model for better identification of ischemic stroke: a case-control study in China. Front Aging Neurosci. (2021) 13:630437. doi: 10.3389/fnagi.2021.630437
12. Luo, B, Nong, F, Qi, B, Liu, X, He, Y, and Zheng, H. Beyond impulsivity, predicting non-suicidal self-injurious behavior in adolescents: a network approach. J Affect Disord. (2025) 391:119945. doi: 10.1016/j.jad.2025.119945
13. Chen, J, Guo, L, Chen, T-Z, Chen, Y, Xu, C, Zheng, H, et al. Prediction and explanation of the increase in suicide risk of emerging adults: a comprehensive approach combining logistic regression, glasso network analysis, and Bayesian networks. J Affect Disord. (2025) 383:469–79. doi: 10.1016/j.jad.2025.04.171
14. Brott, T, Adams, HP, Olinger, CP, Marler, JR, Barsan, WG, Biller, J, et al. Measurements of acute cerebral infarction: a clinical examination scale. Stroke. (1989) 20:864–70. doi: 10.1161/01.str.20.7.864
15. Katz, S, Ford, AB, Moskowitz, RW, Jackson, BA, and Jaffe, MW. Studies of illness in the aged. The index of ADL: a standardized measure of biological and psychosocial function. JAMA. (1963) 185:914–9. doi: 10.1001/jama.1963.03060120024016
16. Teasdale, G, and Jennett, B. Assessment of coma and impaired consciousness. A practical scale. Lancet. (1974) 2:81–4. doi: 10.1016/s0140-6736(74)91639-0
17. Morris, JC. The clinical dementia rating (CDR): current version and scoring rules. Neurology. (1993) 43:2412–4. doi: 10.1212/wnl.43.11.2412-a
18. Pexman, JH, Barber, PA, Hill, MD, Sevick, RJ, Demchuk, AM, Hudon, ME, et al. Use of the Alberta stroke program early CT score (ASPECTS) for assessing CT scans in patients with acute stroke. AJNR Am J Neuroradiol. (2001) 22:1534–42.
19. Rankin, J. Cerebral vascular accidents in patients over the age of 60. Part II. Prognosis. Scott Med J. (1957) 2:200–15. doi: 10.1177/003693305700200504
20. Hachinski, VC, Iliff, LD, Zilhka, E, Du Boulay, GH, McAllister, VL, Marshall, J, et al. Cerebral blood flow in dementia. Arch Neurol. (1975) 32:632–7. doi: 10.1001/archneur.1975.00490510088009
21. Bao, Q, Mi, S, Gang, B, Yang, W, Chen, J, and Liao, Q. MDAN: Mirror difference aware network for brain stroke lesion segmentation. IEEE J Biomed Health Inform. (2022) 26:1628–39. doi: 10.1109/JBHI.2021.3113460
22. Chen, J, Wang, X-Y, Huang, S-N, Lu, J-X, and Zheng, H. Similarities and differences between internet addiction and nicotine dependence —— motivation network. Proc Natl Acad Sci USA. (2025) 162:108480. doi: 10.1016/j.chb.2024.108480
23. Schiff, ND. Central thalamic contributions to arousal regulation and neurological disorders of consciousness. Ann N Y Acad Sci. (2008) 1129:105–18. doi: 10.1196/annals.1417.029
24. Parvizi, J, and Damasio, AR. Neuroanatomical correlates of brainstem coma. Brain. (2003) 126:1524–36. doi: 10.1093/brain/awg166
25. Fridriksson, J, Fillmore, P, Guo, D, and Rorden, C. Chronic Broca’s aphasia is caused by damage to Broca’s and Wernicke’s areas. Cereb Cortex. (2015) 25:4689–96. doi: 10.1093/cercor/bhu152
26. Giacino, JT, Whyte, J, Bagiella, E, Kalmar, K, Childs, N, Khademi, A, et al. Placebo-controlled trial of amantadine for severe traumatic brain injury. N Engl J Med. (2012) 366:819–26. doi: 10.1056/NEJMoa1102609
Keywords: stroke, language impairment progression, neurological deterioration, predictive model, LASSO regression, network analysis, Glasgow Coma Scale, sex-specific associations
Citation: Li G-D, Tao Z-H, Chen Y-K, Wu Z-K, Zhu Z-N, Luo B-H, Chen Y-H, Ma S-S, Fu B and Zheng H (2025) Sex-specific infarct volume associations and early prediction of language impairment progression following stroke surgery: a network approach. Front. Neurol. 16:1728433. doi: 10.3389/fneur.2025.1728433
Edited by:
Luis Rafael Moscote-Salazar, Colombian Clinical Research Group in Neurocritical Care, ColombiaReviewed by:
Luis Alberto Camputaro, University of Buenos Aires, ArgentinaPratiksha Baliga, Mahatma Gandhi Mission Institute of Health Sciences, India
Copyright © 2025 Li, Tao, Chen, Wu, Zhu, Luo, Chen, Ma, Fu and Zheng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Shan-Shan Ma, bXNzZ3VvZG9uZ0AxNjMuY29t; Bing Fu, ZnJhbmtsZXlAMTYzLmNvbQ==; Hui Zheng, emguZG10ckBnbWFpbC5jb20=
†ORCID: Shan-Shan Ma, https://orcid.org/0000-0002-8605-088X
Bing Fu, https://orcid.org/0000-0002-5861-5708
Hui Zheng, https://orcid.org/0000-0001-9735-5657
†These authors share first authorship
Guo-Dong Li1†