Allyson Ettinger

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models
AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text
WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs
The Generative AI Paradox: "What It Can Create, It May Not Understand"
Faith and Fate: Limits of Transformers on Compositionality
