Skip to content

Overview of data sets provided in this repository

Teaching text analysis with Twitter data in 2021 and 2022

In 2021 and 2022, the coordinators of the Machines of Knowledge course taught computational text analysis with Twitter data because the Twitter API allowed allowed user-friendly access to a large amount of social media data on current topics. Among the topics that we explore in class in winter 2022 were the attack on a gay bar in Oslo, ethical debates around the FIFA Worldcup in Qatar, and Elon Musk's contested acquisition of Twitter:

Working with these data sets while full academic API access was still possible revealed increasing flaws of Twitter data that made us question whether this type of data was at all suitable for teaching and student research. Among the observations that we made with our students were the high amount of spam and bot posts piggy-backing on every trending hashtag, the many cryptocurrency and NFT advertisements distorting our data, and the problematic tone and quality of many contributions.

Exploring Twitter data from a data feminist perspective

Exploring alternative data sources after 2023

In 2022, the take-over of Twitter by Elon Musk resulted in fundamental changes of the platform's academic and educational policy, which forced us to explore new data sources in the 2023/2024 academic year.

Case studies for teaching since 2024

Since 2024, Monika Barget has integrated data stilled used for teaching the MA Digital Cultures course Machines of Knowledge at Maastricht University into more elaborate case studies that also offer students historical and cultural background information. These case studies, which include recommendations for different data sources per topic, are regularly updated in Github:

How to ingest these data sets into Voyant Tools

In order to analyse the provided data in Voyant Tools, first go to the Voyant Tools website and simply paste the raw URL of the linked data into the "add text" field. The raw URL on Github is displayed when you open an individual data set in a Github repository and then click on the raw button in the top right corner. You can then copy the raw URL from the address bar of your browser. Once you have inserted the raw URL in the "add text" field in Voyant, you can press the blue "reveal" button and start exploring the data set.