-
Fernando Hate Speech Dataset in Sinhalese from Twitter
Datasets contain racism and sexism in Sinhalese from Twitter. The data was sampled using pre-identified keywords from surveys and experts. The data was annotated by experts... -
Mubarak et al. Abuse in Arabic Social Media Dataset
Dataset 1 includes offensive Arabic tweets sampled in March 2014 using obscene keywords and hashtags used for pornographic pages (available as a .txt file word list). Dataset 2...