Author of the publication

ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection.

, , , , , and . ACL (1), page 3309-3326. Association for Computational Linguistics, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Predicting Individual Well-Being Through the Language of Social Media., , , , , , , , , and 3 other author(s). PSB, page 516-527. (2016)Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models., , , , , , , and . EACL (1), page 2257-2273. Association for Computational Linguistics, (2024)Challenges in Automated Debiasing for Toxic Language Detection., , , , and . EACL, page 3143-3155. Association for Computational Linguistics, (2021)NORMAD: A Benchmark for Measuring the Cultural Adaptability of Large Language Models., , , , and . CoRR, (2024)WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models., , , , , , , , , and 1 other author(s). CoRR, (2024)Misinfo Reaction Frames: Reasoning about Readers' Reactions to News Headlines., , , , , , and . ACL (1), page 3108-3127. Association for Computational Linguistics, (2022)On the Resilience of Multi-Agent Systems with Malicious Agents., , , , , , , , and . CoRR, (2024)Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty., , , and . ACL (1), page 3623-3643. Association for Computational Linguistics, (2024)SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization., , , , , , , , , and 2 other author(s). EMNLP, page 12930-12949. Association for Computational Linguistics, (2023)Exploring the Effect of Author and Reader Identity in Online Story Writing: the STORIESINTHEWILD Corpus., , , , and . NUSE@ACL, page 46-54. Association for Computational Linguistics, (2020)