12 Mar 2026
1h 0m

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer

Podcast cover

Latent Space: The AI Engineer Podcast

Turbopuffer's origin story and architecture are explored with Simon Eskildsen, delving into its evolution as a search engine for unstructured data. Eskildsen details Turbopuffer's unique approach to database architecture, leveraging NVMe SSDs and object storage, and its reliance on S3's consistency for consensus. He recounts leaving Shopify to consult for Readwise, where the high cost of embedding articles sparked the idea for a more cost-effective solution. The conversation highlights Turbopuffer's early customers, including Cursor and Notion, and the lengths they went to, such as buying dark fiber, to meet Notion's latency requirements. Eskildsen also shares insights into Turbopuffer's pricing strategy, team-building philosophy centered on "P99 engineers," and future plans, including expanding full-text search capabilities and scaling to handle massive datasets.

Outlines

Part 1: Origins, Philosophy, and the Aarhus Connection

Part 2: Technical Genesis and Architectural Innovation

Part 3: Market Adoption and Real-World Use Cases

Part 4: Business Strategy and Pricing Evolution

Part 5: The P99 Engineer and Team Culture

Part 6: Future Roadmap and Personal Interests

Sign in to continue reading, translating and more.

Open full episode in Podwise