Optimizing for efficiency with IBM’s Granite

In this episode of the Practical AI podcast, Kate Soule, Director of Technical Product Management for Granite at IBM, discusses Granite, IBM's family of large language models (LLMs). The conversation covers IBM's decision to open-source Granite under the Apache 2.0 license, the rationale for developing models of different sizes (1B to 8B parameters), and the use of a mixture-of-experts architecture for efficient inference. Soule also describes Granite's multimodal capabilities, including vision and time series models, and the role of Granite Guardian in supporting responsible AI. Listeners gain insight into the practical considerations of building and deploying LLMs: the trade-offs between model size, performance, and cost, and the importance of safety and security features.

Outline

Part 1: Introduction to IBM's LLM Strategy

Part 2: Granite Model Details and Capabilities

Part 3: Responsible AI and Future Outlook

Practical AI

Part 1: Introduction to IBM's LLM Strategy

00:43 Introduction and Guest Introduction

03:33 IBM's Approach to Large Language Models

10:10 Architectural Choices and Model Development

Part 2: Granite Model Details and Capabilities

13:33 Mixture of Experts and Model Sizes

18:26 Chain of Thought Reasoning and Model Capabilities

22:03 The Trend Towards Smaller Models

25:19 Overview of the Granite Model Family

Part 3: Responsible AI and Future Outlook

30:05 Granite Guardian and Responsible AI

34:12 Granite, Agents, and the Future of AI

42:50 Conclusion and Call to Action