Granite 4.0 Tiny Preview is extremely compact and compute-efficient: at FP8 precision, several concurrent sessions performing long-context (128K) tasks can run on consumer-grade hardware, including GPUs commonly available for under $350 USD. Though the model is only partially trained (it has seen only 2.5T of a planned 15T or more training tokens), it already offers performance rivaling that of IBM Granite 3.3 2B Instruct, despite fewer active parameters and a roughly 72% reduction in memory requirements. We anticipate that Granite 4.0 Tiny's performance will be on par with that of Granite 3.3 8B Instruct by the time it has completed training and post-training.
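As a rough illustration of running the preview on commodity hardware, the sketch below loads the checkpoint with Hugging Face Transformers and generates a short response. The model ID, the bf16 dtype, and the chat-template usage are assumptions rather than details confirmed by this article; FP8 serving of many concurrent long-context sessions would typically go through a dedicated inference stack (for example vLLM) rather than a snippet like this.

```python
# Minimal sketch, assuming the preview is published on Hugging Face under the
# ID below. The hybrid architecture may require a recent build of transformers.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ibm-granite/granite-4.0-tiny-preview"  # assumed model ID

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # bf16 here; FP8 would use a quantized serving path
    device_map="auto",
)

# Build a chat prompt and generate a short completion.
messages = [{"role": "user", "content": "Summarize what Granite 4.0 Tiny Preview is."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(outputs[0][input_ids.shape[-1]:], skip_special_tokens=True))
```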