Liwei Jiang | 姜力炜
Liwei Jiang | 姜力炜
Home
Publications
Honors
CV
Maarten Sap
Latest
An Empirical Investigation of Machines' Capabilities for Moral Judgment with the Delphi Experiment
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models
HAICOSYSTEM: An Ecosystem for Sandboxing Safety Risks in Human-AI Interactions
Particip-AI: Anticipating Future AI Use Cases and Impacts with Lay Users
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties
BiasX: "Thinking Slow" in Toxic Content Moderation with Explanations of Implied Social Biases
SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization
ProsocialDialog: A Prosocial Backbone for Conversational Agents
Aligning to Social Norms and Values in Interactive Narratives
Cite
×