Mulki et al. A Levantine Twitter Dataset for Hate Speech and Abusive Language (L-HSAB)
Data and Resources
-
L-HSAB: Levantine Twitter Dataset for Hate...TXT
Fields: Tweet, Class
Additional Info
Field | Value |
---|---|
Paper Authors | Mulki, H., Haddad, H., Bechikh, C. and Alshabani, H. |
Author contact email | Mulki, H., Haddad, H., Bechikh, C. and Alshabani, H. |
Publication / paper reference | Mulki, H., Haddad, H., Bechikh, C. and Alshabani, H., 2019. L-HSAB: A Levantine Twitter Dataset for Hate Speech and Abusive Language. In: Proceedings of the Third Workshop on Abusive Language Online. Florence, Italy: Association for Computational Linguistics, pp.111-118. |
Publication / paper link | https://www.aclweb.org/anthology/W19-3512.pdf |
Publication Year | |
Dataset about page | https://github.com/Hala-Mulki/L-HSAB-First-Arabic-Levantine-HateSpeech-Dataset |
Approved | |
Language(s) covered | Arabic |
Source data platform(s) | |
Phenomena annotated | group-directed + person-directed hate speech and abusive language |
Level of instances | Single comment / post |
Data statement link | N/A |
Total number of instances in dataset | 5,846 |
Proportion of positive/abusive instances | 0.38 |
Submitter | Philine Zeinert |
Submitter Email | phze@itu.dk |
State | active |