Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer | Latent Space: The AI Engineer Podcast

Simon Eskildsen recounts Turbopuffer's origin story and explains its architecture as a search engine for unstructured data. He details the database's design, which combines NVMe SSDs with object storage and relies on S3's consistency guarantees for consensus. After leaving Shopify he consulted for Readwise, where the high cost of embedding articles sparked the idea for a more cost-effective solution. The conversation covers Turbopuffer's early customers, including Cursor and Notion, and the lengths the team went to, such as buying dark fiber, to meet Notion's latency requirements. Eskildsen also discusses Turbopuffer's pricing strategy, a team-building philosophy centered on "P99 engineers," and future plans, including expanded full-text search and scaling to massive datasets.

Outlines

Part 1: Origins, Philosophy, and the Aarhus Connection

Part 2: Technical Genesis and Architectural Innovation

Part 3: Market Adoption and Real-World Use Cases

Part 4: Business Strategy and Pricing Evolution

Part 5: The P99 Engineer and Team Culture

Part 6: Future Roadmap and Personal Interests




Part 1: Origins, Philosophy, and the Aarhus Connection

00:00 A Risky Proposition: Promising to Return Capital if Product-Market Fit Fails

00:48 Simon Eskildsen's Danish Roots and the Aarhus Mafia's Programming Prowess

02:10 Turbopuffer Defined: A Search Engine for Unstructured Data and AI Connectivity

Part 2: Technical Genesis and Architectural Innovation

06:26 From Shopify to Readwise: The Genesis of Turbopuffer's AI-Powered Search

12:26 Napkin Math and Object Storage: The Architectural Foundation of Turbopuffer

17:12 S3 Consistency, Compare and Swap, and Dark Fiber: Overcoming Cloud Infrastructure Limitations

Part 3: Market Adoption and Real-World Use Cases

23:32 Notion's Workloads and the Buy vs. Build Decision in the Age of AI

25:59 Cursor's Early Adoption: A 95% Cost Reduction and the Power of All-In Support

28:55 Code as Data: Cursor's Security Posture and the Hybrid Nature of Workloads

Part 4: Business Strategy and Pricing Evolution

31:17 The Evolution of Search: From Context Building to Concurrent Agent Queries

34:22 Turbopuffer's Pricing Evolution: From Vibe-Based to Hardware-Driven

38:17 Why Locky? Honesty, Authenticity, and the Value of a Generalist Investor

Part 5: The P99 Engineer and Team Culture

41:27 Building a P99 Engineering Team: Traits, Talent Density, and the Default "No"

45:26 Defining the P99 Engineer: Bending Software to Your Will and a Love of Maps

48:57 Trade-offs, First Principles, and the High-Agency P99 Engineer

Part 6: Future Roadmap and Personal Interests

51:13 The Future of Turbopuffer: Full-Text Search, Scale, and a Better Dashboard

54:30 Act 3 Candidates: Simpler OLAP Queries, Traces and Logging, and Time Series

57:05 Yabukita Kamairicha: A Tea Obsession and the Pursuit of the Perfect Cup

58:48 P99 Live: A Potential New Venture with Sam Lambert of PlanetScale