-
Caselli et al. Implicit/Explicit Expansion on OLID
This dataset expands the OLID/OffensEval (OLID (Zampieri et al., 2019a), Offensive Language Identification Dataset) by adding the explicitness of the message. The OLID data was... -
Ibrohim and Budi Abuse in Indonesian Twitter Dataset
Dataset of abusive tweets sampled with offensive terms. Tweets were annotated by 20 volunteer annotators and labelled by at least 3 people each. Only tweets with 100% annotators... -
Ibrohim and Budi Multi-label Hate Speech and Abusive Language Detection in In...
Dataset of hate speech and abusive language sampled from Twitter by using keywords and keyphrases. The dataset includes posts from March 2018 until September 2018 and integrated... -
Ross et al. Hate Speech Against Refugees
Dataset of German annotated corpus of tweets regarding refugees in Germany. Tweets were sampled using 10 hateful hashtags and labelled by experts with 2 annotators per tweet.... -
Waseem and Hovy Racism and Sexism on Twitter Dataset
Dataset of racist and sexist tweets sampled from Twitter and labelled by a mix of expert annotators and activists. Tweets were sampled in 2016 over 2 months using keywords....