Detecting and Countering Hate Speech Messages on Social Media
Abstract: In today’s digital age, the sheer volume of abusive content and hate speech on social media platforms presents a significant challenge. Natural Language Processing (NLP) methods have largely focused on detecting explicit forms of hate speech, often overlooking more nuanced and implicit instances. To address this gap, we aim to enhance the detection and understanding of implicit and subtle hate speech. Once hate speech has been detected, the most promising method to counteract it is counter-speech. The potential effectiveness of counter-speech as a hate speech mitigation strategy is attracting increasing interest in the Natural Language Generation research community, particularly in the task of producing it automatically. However, automatically generated responses often lack the argumentative richness that characterises expert-produced counter-speech. For this reason, we focus on two aspects of counter-speech generation to produce more cogent responses. First, by investigating the tension between the helpfulness and harmlessness of LLMs, we test whether the presence of safety guardrails hinders the quality of the generated responses. Second, we assess whether attacking a specific component of the hate speech results in a more effective argumentative strategy to fight online hate.