Wulczyn et al. Personal Attacks on Wikipedia Dataset
Data and Resources
- Personal Attack annotations (TSV)
  FROM THE AUTHORS: 100k labeled comments from English Wikipedia by...
- Aggression annotations (TSV)
  FROM THE AUTHORS: 100k labeled comments from English Wikipedia by...
- Toxicity annotations (TSV)
  FROM THE AUTHORS: 160k labeled comments from English Wikipedia by...
Additional Info
Field | Value |
---|---|
Paper Authors | Wulczyn, E., Thain, N. and Dixon, L. |
Author contact email | |
Publication / paper reference | Wulczyn, E., Thain, N. and Dixon, L., 2017. Ex Machina: Personal Attacks Seen at Scale. arXiv:1610.08914. |
Publication / paper link | https://arxiv.org/pdf/1610.08914 |
Publication Year | 2017 |
Dataset about page | https://meta.wikimedia.org/wiki/Research:Detox/Data_Release |
Approved | |
Language(s) covered | English |
Source data platform(s) | Wikipedia |
Phenomena annotated | Person-directed attacks, toxicity, aggression |
Level of instances | Single comment / post |
Data statement link | |
Total number of instances in dataset | 115,737 comments (personal attacks); 100,000 (aggression); 160,000 (toxicity) |
Proportion of positive/abusive instances | 0.12 (personal attacks); NA (aggression); NA (toxicity) |
Submitter | Laila Sprejer |
Submitter Email | sprejerlaila@gmail.com |
State | active |
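
The "Proportion of positive/abusive instances" field can be recomputed from an annotations TSV. The following is a minimal sketch, not the authors' procedure. It assumes one row per (comment, annotator) pair, and the column names `rev_id`, `worker_id` and `attack` are assumptions that this page does not confirm. It also assumes a comment counts as positive when a strict majority of its annotators labeled it so. The sample rows are made up for illustration.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample in the assumed layout: one row per (comment, annotator).
SAMPLE_TSV = (
    "rev_id\tworker_id\tattack\n"
    "1\t10\t0\n"
    "1\t11\t0\n"
    "1\t12\t1\n"
    "2\t10\t1\n"
    "2\t13\t1\n"
    "3\t11\t0\n"
    "3\t12\t0\n"
    "4\t10\t1\n"
    "4\t11\t0\n"
)


def positive_proportion(tsv_file, id_col="rev_id", label_col="attack"):
    """Return the share of comments labeled positive by a strict majority.

    `id_col` and `label_col` are assumed column names; adjust them to the
    header of the file actually downloaded.
    """
    votes = defaultdict(list)
    for row in csv.DictReader(tsv_file, delimiter="\t"):
        votes[row[id_col]].append(float(row[label_col]))
    if not votes:
        raise ValueError("no annotation rows found")
    # A tie (mean exactly 0.5) is treated as negative under this assumption.
    positives = sum(1 for v in votes.values() if sum(v) / len(v) > 0.5)
    return positives / len(votes)


proportion = positive_proportion(io.StringIO(SAMPLE_TSV))
print(proportion)
```

To run it on a downloaded file, pass an open file handle instead of the in-memory sample, for example `open(path, newline="", encoding="utf-8")`, where `path` points to the local copy of the TSV.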