IBM AI →
The Community for AI architects and builders to learn, share ideas and connect with others
Join/Log In
IBM TechXchange 2025 conference is accepting
Session proposals through April 11
The Granite Guardian model collection is designed to detect risks in user prompts and LLM The Granite Guardian models are a collection of models designed to detect risks in prompts and responses. Trained on instruction fine-tuned Granite languages models, these models can help with risk detection along many key dimensions catalogued in the IBM Risk Atlas. The models are trained on unique data comprising human annotations from socioeconomically diverse people and synthetic data informed by internal red-teaming. They outperform similar models on standard benchmarks.
Granite Guardian is useful for risk detection use-cases which are applicable across a wide-range of enterprise applications:
Granite Guardian is available in 2B and 8B variants. These are enterprise-grade models trained in a transparent manner, and according to IBM’s AI Ethics principles and released with Apache 2.0 license for research and commercial use.