[LIVE] Anthropic Distillation & How Models Cheat (SWE-Bench Dead) | Nathan Lambert & Sebastian Raschka | Latent Space: The AI Engineer Podcast

The podcast explores the ethics and practicalities of "distillation attacks" on large language models (LLMs), where smaller models are trained on the outputs of larger, proprietary models. The discussion covers the challenges of detecting such attacks versus legitimate evaluation, noting that scale and pattern analysis are key detection methods. The participants debate whether companies should restrict model access via APIs to prevent distillation, with some arguing for product-exclusive models. The conversation shifts to the saturation and inherent flaws of coding benchmarks like SWE-Bench, including the discovery of unsolvable tasks and models memorizing solutions. They highlight the need for updated, private benchmarks and discuss the surprising capacity of LLMs to memorize data from a single pass, underscoring the understudied information theory of LLMs.

Outlines

Part 1: Introduction, Distillation Basics

Part 2: Industry Analysis, API Business Models

Part 3: SWE-Bench, Code Benchmarking

Part 1: Introduction, Distillation Basics

00:00 Introduction to SAIL Live and Distillation in Machine Learning

05:04 Detecting Distillation Attacks and Privacy Concerns with LLM Usage

Part 2: Industry Analysis, API Business Models

12:08 Analyzing Anthropic's Distillation Claims and the Timing of Data Collection

17:10 The Complexities of Distillation Data and API Business Models

25:32 API Business Models, Codex, and Future Discussion Topics

Part 3: SWE-Bench, Code Benchmarking

28:41 Defining SWE-Bench and the Challenges of Code Benchmarking

37:12 Unintentional Cheating and the Information Theory of LLMs

42:48 SWE-Bench Pro, Frontier Evals, and the Future of Benchmarking