Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Hum. Neurosci.

Sec. Speech and Language

This article is part of the Research TopicAutism Spectrum Disorder: Exploring the speech and language continuumView all 6 articles

Lexicogrammatical profiling of ASD: Cognitive-functional mapping and diagnostic implications

Provisionally accepted
Sumi  KatoSumi Kato1,2*Kazuaki  HanawaKazuaki Hanawa3
  • 1Hirosaki University, Hirosaki, Japan
  • 2Hirosaki Daigaku Igakubu Daigakuin Igaku Kenkyuka, Hirosaki, Japan
  • 3Tohoku Daigaku, Sendai, Japan

The final, formatted version of the article will be published soon.

Introduction: Previous corpus-based study first established an annotated dataset of autism spectrum disorder (ASD) discourse, and subsequent modeling of lexicogrammatical patterns distinguished ASD from non-ASD discourse with high performance (accuracy 80%, precision 82%, sensitivity 73%, specificity 87%). That line of research further identified 46 statistically significant discriminators, of which 20 were analyzed in detail. The present study examines 18 additional discriminators and situates them within cognitive-functional domains to clarify their diagnostic relevance. Findings refine the language–cognition interface in ASD and extend the utility of lexicogrammatical profiling for assessment. Methods: The Tag Linear Model was employed to identify lexicogrammatical features that distinguish ASD and non-ASD discourse. Logistic regression with 10,000 bootstrap iterations was applied to establish statistical significance. Although DNN models yielded higher predictive accuracy, the linear model provided transparent identification of discriminators. Results: Of the 135 items analyzed, 46 were confirmed as statistically significant discriminators (p < 0.05). 18 of these, not previously examined, were analyzed in the present study. The discriminators were mapped onto 12 cognitive-functional domains, including working memory, executive functioning, joint attention, predictive processing, and weak central coherence. The results reveal distinctive patterns across multiple domains, including reduced use of benefactive auxiliaries, relational attributive clauses, obligation modality, evaluative and gradational resources, and mimetic onomatopoeia, reflecting systematic constraints in abstraction, perspective-taking, and pragmatic orientation. Discussion: These findings demonstrate that choice patterns of lexicogrammar in ASD reflect domain-specific cognitive constraints. Interpreting the 18 discriminators within 12 cognitive-functional domains provides a linguistically grounded perspective on the neurocognitive profile of ASD and offers implications for future diagnostic and intervention research.

Keywords: Autism spectrum disorder (ASD), natural language processing (NLP), Machinelearning, Diagnostic assessment, corpus, lexicogrammatical discriminator, systemic functionallinguistics (SFL)

Received: 14 Sep 2025; Accepted: 06 Nov 2025.

Copyright: © 2025 Kato and Hanawa. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Sumi Kato, katoh@aomoricgu.ac.jp

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.