AUTHOR=Viola Lorella , Verheul Jaap TITLE=One Hundred Years of Migration Discourse in The Times: A Discourse-Historical Word Vector Space Approach to the Construction of Meaning JOURNAL=Frontiers in Artificial Intelligence VOLUME=Volume 3 - 2020 YEAR=2020 URL=https://www.frontiersin.org/journals/artificial-intelligence/articles/10.3389/frai.2020.00064 DOI=10.3389/frai.2020.00064 ISSN=2624-8212 ABSTRACT=This study proposes an experimental method to trace the historical evolution of media discourse as a means to investigate the construction of collective meaning. Based on distributional semantics theory (Harris 1954, 1956; Firth 1957) and critical discourse theory (Fairclough and Wodak 1997), it explores the value of merging two techniques widely employed to investigate language and meaning in two separate fields: neural word embeddings (computational linguistics) and the discourse-historical approach (DHA – Reisigl and Wodak 2001) (applied linguistics). As a use case, we investigate the historical changes in the semantic space of public discourse of migration in the UK and we use the Times Digital Archive (TDA) from 1900 to 2000 as dataset. For the computational part, we use the publicly available TDA word2vec models (Martinez-Ortiz et al. 2016; Kenter et al. 2015); these models have been trained according to sliding time windows with the specific intention to map conceptual change. We then use DHA to triangulate the results generated by the word vector models with social and historical data to identify plausible explanations for the changes in the public debate. By bringing the focus of the analysis to the level of discourse, with this method, we aim to go beyond mapping different senses expressed by single words and to add the currently missing socio-historical and socio-linguistic depth to the computational results. The study rests on the foundation that social changes will be reflected in changes in public discourse (Couldry 2008). Although correlation does not prove direct causation, we argue that historical events, language and meaning should be considered as a mutually reinforcing cycle in which the language used to describe events shapes explicit meanings which in turn trigger other events, which again will be reflected in the public discourse.