Original Research ARTICLE
Topic Modelling of Everyday Sexism Project Entries
- 1University of Oxford, United Kingdom
- 2Alan Turing Institute, United Kingdom
The Everyday Sexism Project documents everyday examples of sexism reported by volunteer contributors from all around the world. It collected 100,000 entries in 13+ languages within the first 3 years of its existence. The content of reports in various languages submitted to Everyday Sexism is a valuable source of crowdsourced information with great potential for feminist and gender studies. In this paper, we take a computational approach to analyze the content of reports. We use topic-modelling techniques to extract emerging topics and concepts from the reports, and to map the semantic relations between those topics. The resulting picture closely resembles and adds to that arrived at through qualitative analysis, showing that this form of topic modeling could be useful for sifting through datasets that had not previously been subject to any analysis. More precisely, we come up with a map of topics for two different resolutions of our topic model and discuss the connection between the identified topics. In the low-resolution picture, for instance, we found Public space/Street, Online, Work related/Office, Transport, School, Media harassment, and Domestic abuse. Among these, the strongest connection is between Public space/Street harassment and Domestic abuse and sexism in personal relationships. The strength of the relationships between topics illustrates the fluid and ubiquitous nature of sexism, with no single experience being unrelated to another.
Keywords: Sexism, gender, Everyday sexism, Topic Modeling, Content Analysis
Received: 24 Nov 2017;
Accepted: 20 Dec 2018.
Edited by:Tom Crick, Swansea University, United Kingdom
Reviewed by:Judy Robertson, University of Edinburgh, United Kingdom
Jonathan Gillard, Cardiff University, United Kingdom
Copyright: © 2018 Melville, Eccles and Yasseri. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Dr. Taha Yasseri, University of Oxford, Oxford, United Kingdom, email@example.com