-
Caselli et al. Implicit/Explicit Expansion on OLID
This dataset expands the OLID/OffensEval (OLID (Zampieri et al., 2019a), Offensive Language Identification Dataset) by adding the explicitness of the message. The OLID data was... -
Waseem and Hovy Racism and Sexism on Twitter Dataset
Dataset of racist and sexist tweets sampled from Twitter and labelled by a mix of expert annotators and activists. Tweets were sampled in 2016 over 2 months using keywords.... -
de Gibert et al. Hate Speech from a White Supremacy Forum Dataset
Hate speech dataset composed of thousands of sentences extracted from Stormfront, a white supremacist forum, manually labelled by experts. Annotator agreement for the 1st round...