AUTHOR=Bertolini Lorenzo , Consoli Sergio , Weeds Julie TITLE=Dreams are more “predictable” than you think JOURNAL=Frontiers in Sleep VOLUME=Volume 4 - 2025 YEAR=2025 URL=https://www.frontiersin.org/journals/sleep/articles/10.3389/frsle.2025.1625185 DOI=10.3389/frsle.2025.1625185 ISSN=2813-2890 ABSTRACT=IntroductionA growing body of work has used machine learning and AI tools to analyse dream reports, and compare them to other textual content. Since these tools are usually trained on text from the web, researchers have speculated they might not be suited to model dreams reports, often labeled as “unusual” and “bizarre” content.MethodsWe used a set of large language models (LLMs) to encode dream reports from DreamBank and Wikipedia. To estimate the ability of LLMs to model and predict textual reports we adopted perplexity, a measure based on entropy, formally, the exponentiated log-likelihood of a sequence. Intuitively, perplexity indicates how “surprising” a sequence of words is to a model.ResultsIn most models, perplexity scores for dream reports were significantly lower than those for Wikipedia articles. Moreover, we found that perplexity scores were significantly different in reports produced by male vs female participants, and between blind and normally sighted individuals. In one case, we found this difference to be significant between clinical and healthy subjects.DiscussionDream reports were found to be generally easier to model and predict than Wikipedia articles. LLMs were also found to implicitly encode group differences previously observed in the literature based on gender, visual impairment, and clinical population.