Stanford Seminar - Connecting Robotics and Foundation Models, Brian Ichter of Google DeepMind | Stanford Online

Foundation models offer transformative potential for robotics by providing general reasoning and semantic knowledge, yet they require physical grounding to be actionable. Bridging this gap involves techniques like SACAN and Grounded Decoding, which align model outputs with environmental constraints and robot capabilities. Generating executable code enables robots to perform complex, long-horizon tasks that are difficult to specify through natural language alone. Training on heterogeneous datasets—including simulated and multi-embodiment data—facilitates cross-domain generalization and prevents catastrophic forgetting, as seen in models like RT1 and PaLM-E. While foundation models have significantly improved high-level planning, the primary bottleneck in robotics remains the physical interaction and low-level control. Future advancements depend on integrating these models with robust, high-frequency control systems to achieve reliable, real-world manipulation.

Outlines

Sign in to continue reading, translating and more.

Open full episode in Podwise

Stanford Seminar - Connecting Robotics and Foundation Models, Brian Ichter of Google DeepMind

Stanford Online

Leveraging Foundation Models for Robotic Task Execution

Core Challenges in Integrating Foundation Models with Robotics

Grounding Language Models through SACAN and Feedback Loops

Code as Policies for Complex Robotic Reasoning

Scaling Robotic Data with RT1 and Imagined Experience

Unified Vision-Language-Robotics Models with PALMI

Addressing Robotic Failures and Future Research Directions

Stanford Seminar - Connecting Robotics and Foundation Models, Brian Ichter of Google DeepMind

Stanford Online

00:10Leveraging Foundation Models for Robotic Task Execution

Leveraging Foundation Models for Robotic Task Execution

05:04Core Challenges in Integrating Foundation Models with Robotics

Core Challenges in Integrating Foundation Models with Robotics

08:56Grounding Language Models through SACAN and Feedback Loops

Grounding Language Models through SACAN and Feedback Loops

20:25Code as Policies for Complex Robotic Reasoning

Code as Policies for Complex Robotic Reasoning

24:12Scaling Robotic Data with RT1 and Imagined Experience

Scaling Robotic Data with RT1 and Imagined Experience

30:48Unified Vision-Language-Robotics Models with PALMI

Unified Vision-Language-Robotics Models with PALMI

38:02Addressing Robotic Failures and Future Research Directions

Addressing Robotic Failures and Future Research Directions