ViHSD - Vietnamese Hate Speech Detection on Soical Media Texts
Data and Resources
-
codebookCSV
The dataset consists of two properties: 1. free text: the comments of users...
Additional Info
Field | Value |
---|---|
Paper Authors | Son T. Luu; Kiet Van Nguyen; Ngan Luu-Thuy Nguyen |
Author contact email | Son T. Luu; Kiet Van Nguyen; Ngan Luu-Thuy Nguyen |
Publication / paper reference | Son T. Luu, Kiet Van Nguyen and Ngan Luu-Thuy Nguyen, A Large-scale Dataset for Hate Speech Detection on Vietnamese Social Media Texts, ArXiv |
Publication / paper link | https://arxiv.org/abs/2103.11528 |
Publication Year | |
Dataset about page | |
Approved | |
Language(s) covered | Vietnamese |
Source data platform(s) | |
Phenomena annotated | |
Level of instances | Single comment / post |
Data statement link | |
Total number of instances in dataset | 33,400 |
Proportion of positive/abusive instances | |
Submitter | Son T. Luu |
Submitter Email | sonlt@uit.edu.vn |
State | active |