Liwei Jiang | 姜力炜


Maarten Sap

Latest

An Empirical Investigation of Machines' Capabilities for Moral Judgment with the Delphi Experiment
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models
HAICOSYSTEM: An Ecosystem for Sandboxing Safety Risks in Human-AI Interactions
Particip-AI: Anticipating Future AI Use Cases and Impacts with Lay Users
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties
BiasX: "Thinking Slow" in Toxic Content Moderation with Explanations of Implied Social Biases
SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization
ProsocialDialog: A Prosocial Backbone for Conversational Agents
Aligning to Social Norms and Values in Interactive Narratives

Copyright © 2024 Liwei Jiang

