Keyword-Score-List: Refined n-grams in Hate...
FROM THE AUTHORS: The file refined_ngram_dict.csv contains a refined lexicon of n-grams. To get this lexicon we took the set of n-grams of length 1-4 that were contained in our labelled data and for each n-gram calculated the proportion of tweets containing it that were considered as hate speech by the human coders. We then manually went through the lexicon to remove irrelevant terms.
Additional Information
Field | Value |
---|---|
Data last updated | 4 February 2021 |
Metadata last updated | 4 February 2021 |
Created | 4 February 2021 |
Format | CSV |
License | License not specified |
Has views | True |
Id | fd9ba441-c519-424d-9c0d-345258b1180b |
Mimetype | text/csv |
On same domain | True |
Package id | 9d032b4d-c668-4cd1-a405-97557ae9dda0 |
Position | 2 |
Size | 3.1 KiB |
State | active |
Url type | upload |