Breitfeller et al. Microaggressions Dataset

Dataset of self-reported microaggressions from microaggressions.com. 2,934 posts were collected targeted towards gender (1,314 posts), race (1,278 posts), sexuality (461 posts), and religion (88 posts), among others. Posts were labelled under 4 themes and 11 subthemes by three expert annotators with an overall Fleiss’ κ of 0.4641.

Data and Resources

Additional Info

Field Value
Authors Luke Breitfeller, Emily Ahn, David Jurgens, Yulia Tsvetkov
Author contact email Luke Breitfeller, Emily Ahn, David Jurgens, Yulia Tsvetkov
Publication / paper reference Breitfeller, L., Ahn, E., Muis, A.O., Jurgens, D., & Tsvetkov, Y. (2019). Finding Microaggressions in the Wild: A Case for Locating Elusive Phenomena in Social Media Posts. EMNLP/IJCNLP.
Publication / paper link https://www.aclweb.org/anthology/D19-1176/
Dataset about page https://drive.google.com/drive/folders/1bKf8PQuuOk7z3ehgAcmTLjmK5Cb86ZTz
Language(s) covered English
Source data platform(s) microaggressions.com
Annotation schema description Multi-topic: themes (Attributive, Institutionalized, Teaming, and Othering) + subthemes
Phenomena annotated Microaggression towards gender, race, sexuality and religion + themes (e.g. Attributive, Institutionalized, Teaming, and Othering).
Level of instances Single comment / post
Data statement link
Total umber of instances in dataset 2,934
Proportion of positive/abusive instances 1
Submitter Laila Sprejer
Submitter Email sprejerlaila@gmail.com
State active