This episode explores the development and capabilities of OpenAI's Deep Research, an agentic product released in February 2025. Isa Fulford, one of its lead developers, traces its origins to internal advances in reinforcement learning algorithms that were initially applied to math and coding problems. The team then shifted its focus to more user-centric tasks, such as online research and information synthesis, aiming to build a tool useful to knowledge workers while advancing OpenAI's broader AGI goals. Training tasks ranged from finding every paper co-authored by specific researchers to locating a coworker's middle name. The discussion also delves into the challenges of data creation, tool development (a text-based browser with access to PDFs and Python tools), and the complexities of reinforcement fine-tuning, and it touches on safety concerns, the potential for hallucinations, and the future of agents capable of both research and action-taking. Ultimately, the episode highlights the rapid pace of progress in AI and the potential for agents to become increasingly sophisticated and integrated into everyday workflows.