Update README.md
Browse files
README.md
CHANGED
|
@@ -8,5 +8,24 @@ base_model:
|
|
| 8 |
Not Just Hate (NJH) - Uploaded Version
|
| 9 |
The uploaded balanced model for multi-label harmful speech classification. Labels: Profanity, Insults, Outrage, Character Assassination, Discrimination, Hostility, Incivility, and Intolerance.
|
| 10 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 11 |
|
| 12 |
Bianchi, F., Hills, S., Rossini, P., Hovy, D., Tromble, R., & Tintarev, N. (2022). "It's not just hate": a multi-dimensional perspective on detecting harmful speech online. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics.
|
|
|
|
| 8 |
Not Just Hate (NJH) - Uploaded Version
|
| 9 |
The uploaded balanced model for multi-label harmful speech classification. Labels: Profanity, Insults, Outrage, Character Assassination, Discrimination, Hostility, Incivility, and Intolerance.
|
| 10 |
|
| 11 |
+
## Label Details
|
| 12 |
+
|
| 13 |
+
**Incivility** refers to discourse that is rude, offensive, or use of swear-words/cursing. This includes four sub-types:
|
| 14 |
+
|
| 15 |
+
* Profanity: The use of swear words and vulgar language, not necessarily targeted at individuals/groups
|
| 16 |
+
* Insults: Direct offensive language (e.g. belittling, demeaning, vulgar, and/or insulting language) targeted at individuals or groups, including their character, behaviors, ideas, or opinions.
|
| 17 |
+
* Outrage: Expressions of extreme anger or indignation, using dramatic/emotional/provocative language or negative exaggeration when referencing behaviors, policies, views, political opinions, etc. of a person, group, branch of government, party or organization.
|
| 18 |
+
* Character Assassination: Attacks on someone's moral character, credibility, honesty, or integrity. Character assassination can target individuals or organizations (e.g. NGOs, Government), but not identity-based groups (coded as intolerance/discrimination)
|
| 19 |
+
|
| 20 |
+
**Intolerance** refers to expressions that threaten democratic norms and values, such as equality, diversity, and freedom. This includes two sub-types:
|
| 21 |
+
|
| 22 |
+
* Discrimination: Content that discriminates against individuals or groups based on their identity
|
| 23 |
+
* Hostility: An aggregated label that encompasses several particularly harmful types of speech including "hateful speech, dehumanization, serious threat-personal abuse-harassment, and democratic threat"
|
| 24 |
+
|
| 25 |
+
## Cookbook
|
| 26 |
+
|
| 27 |
+
If you open the **Files** section of this repo you will find the annotation cookbook.
|
| 28 |
+
|
| 29 |
+
## Cite As
|
| 30 |
|
| 31 |
Bianchi, F., Hills, S., Rossini, P., Hovy, D., Tromble, R., & Tintarev, N. (2022). "It's not just hate": a multi-dimensional perspective on detecting harmful speech online. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics.
|