Empathy-based counterspeech can reduce racist hate speech in a social media field experiment

Dominik Hangartner

Gloria Gennaro

Sary Alasiri

Nicholas Bahrich

Alexandra Bornhoft

Joseph Boucher

Buket Buse Demirci

Laurenz Derksen

Aldo Hall

Matthias Jochum

Maria Murias Munoz

Marc Richter

Franziska Vogel

Salomé Wittwer

Felix Wüthrich

Fabrizio Gilardi

Karsten Donnay

Proceedings of the National Academy of Sciences 118(50).

Despite heightened awareness of the detrimental impact of hate speech on social media platforms on affected communities and public discourse, there is little consensus on approaches to mitigate it. While content moderation—either by governments or social media companies—can curb online hostility, such policies may suppress valuable as well as illicit speech and might disperse rather than reduce hate speech. As an alternative strategy, an increasing number of international and nongovernmental organizations (I/NGOs) are employing counterspeech to confront and reduce online hate speech. Despite their growing popularity, there is scant experimental evidence on the effectiveness and design of counterspeech strategies (in the public domain). Modeling our interventions on current I/NGO practice, we randomly assign English‐speaking Twitter users who have sent messages containing xenophobic (or racist) hate speech to one of three counterspeech strategies—empathy, warning of consequences, and humor—or a control group. Our intention‐to‐treat analysis of 1,350 Twitter users shows that empathy‐based counterspeech messages can increase the retrospective deletion of xenophobic hate speech by 0.2 SD and reduce the prospective creation of xenophobic hate speech over a 4‐wk follow‐up period by 0.1 SD. We find, however, no consistent effects for strategies using humor or warning of consequences. Together, these results advance our understanding of the central role of empathy in reducing exclusionary behavior and inform the design of future counterspeech interventions.Anonymized text data, analysis code, and additional replication materials have been deposited in a Harvard Dataverse at https://doi.org/10.7910/DVN/ARZ9PU (16).

URL: https://www.pnas.org/content/118/50/e2116310118

DOI: 10.1073/pnas.2116310118

Hangartner, Dominik, Gloria Gennaro, Sary Alasiri, Nicholas Bahrich, Alexandra Bornhoft, Joseph Boucher, Buket Buse Demirci, Laurenz Derksen, Aldo Hall, Matthias Jochum, Maria Murias Munoz, Marc Richter, Franziska Vogel, Salomé Wittwer, Felix Wüthrich, Fabrizio Gilardi, and Karsten Donnay. 2021. “Empathy-Based Counterspeech Can Reduce Racist Hate Speech in a Social Media Field Experiment.” Proceedings of the National Academy of Sciences 118(50).

@article{empathy-based-counterspeech-can-reduce-racist-hate-speech-in-a-social-media-field-experiment,
   author = {Hangartner, Dominik and Gennaro, Gloria and Alasiri, Sary and Bahrich, Nicholas and Bornhoft, Alexandra and Boucher, Joseph and Demirci, Buket Buse and Derksen, Laurenz and Hall, Aldo and Jochum, Matthias and Murias Munoz, Maria and Richter, Marc and Vogel, Franziska and Wittwer, Salom{\'e} and W{\"u}thrich, Felix and Gilardi, Fabrizio and Donnay, Karsten},
   title = {Empathy-based counterspeech can reduce racist hate speech in a social media field experiment},
   volume = {118},
   number = {50},
   elocation-id = {e2116310118},
   year = {2021},
   doi = {10.1073/pnas.2116310118},
   publisher = {National Academy of Sciences},
   abstract = {Despite heightened awareness of the detrimental impact of hate speech on social media platforms on affected communities and public discourse, there is little consensus on approaches to mitigate it. While content moderation{\textemdash}either by governments or social media companies{\textemdash}can curb online hostility, such policies may suppress valuable as well as illicit speech and might disperse rather than reduce hate speech. As an alternative strategy, an increasing number of international and nongovernmental organizations (I/NGOs) are employing counterspeech to confront and reduce online hate speech. Despite their growing popularity, there is scant experimental evidence on the effectiveness and design of counterspeech strategies (in the public domain). Modeling our interventions on current I/NGO practice, we randomly assign English-speaking Twitter users who have sent messages containing xenophobic (or racist) hate speech to one of three counterspeech strategies{\textemdash}empathy, warning of consequences, and humor{\textemdash}or a control group. Our intention-to-treat analysis of 1,350 Twitter users shows that empathy-based counterspeech messages can increase the retrospective deletion of xenophobic hate speech by 0.2 SD and reduce the prospective creation of xenophobic hate speech over a 4-wk follow-up period by 0.1 SD. We find, however, no consistent effects for strategies using humor or warning of consequences. Together, these results advance our understanding of the central role of empathy in reducing exclusionary behavior and inform the design of future counterspeech interventions.Anonymized text data, analysis code, and additional replication materials have been deposited in a Harvard Dataverse at https://doi.org/10.7910/DVN/ARZ9PU (16).},
   issn = {0027-8424},
   URL = {https://www.pnas.org/content/118/50/e2116310118},
   eprint = {https://www.pnas.org/content/118/50/e2116310118.full.pdf},
   journal = {Proceedings of the National Academy of Sciences}
}