Breitfeller et al. Microaggressions Dataset

A dataset of 2,934 self-reported microaggression posts, targeted towards gender (1,314 posts), race (1,278 posts), sexuality (461 posts), and religion (88 posts), among others. Posts were labelled with 4 themes and 11 subthemes by three expert annotators, with an overall Fleiss’ κ of 0.4641.
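For readers unfamiliar with the agreement statistic quoted above, the following is a minimal sketch of how Fleiss’ κ is computed for a fixed number of raters per item. The function name `fleiss_kappa` and the toy rating counts are assumptions made for illustration; they are not taken from the dataset, and this entry does not say exactly how the authors aggregated agreement across themes.

```python
def fleiss_kappa(counts):
    """Fleiss' kappa for a table where counts[i][j] is the number of
    raters who assigned item i to category j (same rater count per item)."""
    n_items = len(counts)
    n_raters = sum(counts[0])
    if any(sum(row) != n_raters for row in counts):
        raise ValueError("every item needs the same number of ratings")

    # Observed agreement: share of agreeing rater pairs, averaged over items.
    p_items = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_observed = sum(p_items) / n_items

    # Chance agreement: sum of squared overall category proportions.
    total_ratings = n_items * n_raters
    n_categories = len(counts[0])
    p_category = [
        sum(row[j] for row in counts) / total_ratings
        for j in range(n_categories)
    ]
    p_expected = sum(p * p for p in p_category)

    return (p_observed - p_expected) / (1 - p_expected)


# Hypothetical toy data: 4 posts, 3 annotators, 2 categories
# (e.g. "theme applies" vs "theme does not apply"). Invented values.
toy_counts = [[3, 0], [0, 3], [2, 1], [1, 2]]
toy_kappa = fleiss_kappa(toy_counts)
print(round(toy_kappa, 4))
```

A value of 1 indicates perfect agreement and 0 indicates agreement no better than chance, so the reported 0.4641 sits between the two.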

Data and Resources

Additional Info

Field Value
Paper Authors Luke Breitfeller, Emily Ahn, David Jurgens, Yulia Tsvetkov
Author contact email Luke Breitfeller, Emily Ahn, David Jurgens, Yulia Tsvetkov
Publication / paper reference Breitfeller, L., Ahn, E., Muis, A.O., Jurgens, D., & Tsvetkov, Y. (2019). Finding Microaggressions in the Wild: A Case for Locating Elusive Phenomena in Social Media Posts. EMNLP/IJCNLP.
Publication / paper link
Publication Year 2019
Dataset about page
Language(s) covered English
Source data platform(s)
Phenomena annotated Microaggressions targeting gender, race, sexuality, and religion, plus themes (e.g. Attributive, Institutionalized, Teaming, and Othering).
Level of instances Single comment / post
Data statement link
Total number of instances in dataset 2,934
Proportion of positive/abusive instances 1 (every instance is a reported microaggression)
Submitter Laila Sprejer
Submitter Email
State active