CONAN: Multilingual Dataset of Responses to Fight Online Hate Speech

Dataset of pairs islamophobic hate speech and counter-responses with 3 types of metadata: expert demographics, hate speech sub-topic, counter-narrative type. The dataset is augmented through translation (4,078 pairs original, with argumented data 14,988). Hate speech sentences and counter-narratives were created by NGO trainers firstly and then by the operators.

Data and Resources

Additional Info

Field Value
Authors Chung, Y., Kuzmenko, E., Tekiroglu, S. and Guerini, M
Author contact email Chung, Y., Kuzmenko, E., Tekiroglu, S. and Guerini, M
Publication / paper reference Chung, Y., Kuzmenko, E., Tekiroglu, S. and Guerini, M., 2019. CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Florence, Italy: Association for Computational Linguistics, pp.2819-2829.
Publication / paper link https://www.aclweb.org/anthology/P19-1271.pdf
Dataset about page https://github.com/marcoguerini/CONAN
Language(s) covered English,French,Italian
Source data platform(s) self-authored
Annotation schema description Pair (Islamophobic hate speech, counter sentence)
Phenomena annotated group-directed islamophobic hate speech
Level of instances Single comment / post
Data statement link N/A
Total umber of instances in dataset English: 2819, French: 2819, Italian: 2819
Proportion of positive/abusive instances 0.5
Submitter Philine Zeinert
Submitter Email phze@itu.dk
State active