High-Stakes Alignment via Adversarial Training [Redwood Research Report] | BlueDot Narrated | Podwise