Hate speech detection in social media communication has become one of the primary concerns to avoid conflicts and curb undesired activities. In an environment where multilingual speakers switch among multiple languages, hate speech detection becomes a challenging task using methods that are designed for monolingual corpora. In our work, we attempt to analyze, detect and provide a comparative study of hate speech in a code-mixed social media text. We also provide a Hindi-English code-mixed data set consisting of Facebook and Twitter posts and comments. Our experiments show that deep learning models trained on this code-mixed corpus perform better.

Rani, P., Suryawanshi, S., Goswami, K., Chakravarthi, B. R., Fransen, T., Mccrae, J. P., A Comparative Study of Different State-of-the-Art Hate Speech Detection Methods in Hindi-English Code-Mixed Data, in Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, (Marseille, France, 11-16 May 2020), European Language Resources Association (ELRA), Marseille 2020: 42-48 [https://hdl.handle.net/10807/270177]

A Comparative Study of Different State-of-the-Art Hate Speech Detection Methods in Hindi-English Code-Mixed Data

Fransen, Theodorus;
2020

Abstract

Hate speech detection in social media communication has become one of the primary concerns to avoid conflicts and curb undesired activities. In an environment where multilingual speakers switch among multiple languages, hate speech detection becomes a challenging task using methods that are designed for monolingual corpora. In our work, we attempt to analyze, detect and provide a comparative study of hate speech in a code-mixed social media text. We also provide a Hindi-English code-mixed data set consisting of Facebook and Twitter posts and comments. Our experiments show that deep learning models trained on this code-mixed corpus perform better.
2020
Inglese
Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying
Second Workshop on Trolling, Aggression and Cyberbullying
Marseille, France
11-mag-2020
16-mag-2020
979-10-95546-56-6
European Language Resources Association (ELRA)
Rani, P., Suryawanshi, S., Goswami, K., Chakravarthi, B. R., Fransen, T., Mccrae, J. P., A Comparative Study of Different State-of-the-Art Hate Speech Detection Methods in Hindi-English Code-Mixed Data, in Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, (Marseille, France, 11-16 May 2020), European Language Resources Association (ELRA), Marseille 2020: 42-48 [https://hdl.handle.net/10807/270177]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/270177
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact