Co-authors: @Elina priyadarshinee @Madhu Tadiparthi @Guangya Liu vLLMs provide observability features that help in maintaining efficient and reliable serving of large language models. These features include detailed runtime metrics such as encompassing throughput, latency, and...
Jaxon’s Domain-Specific AI Language (DSAIL) is the formal fact checker that makes AI trustworthy. Based on R&D with the DoD, Jaxon's proprietary DSAIL technology uses advanced reasoning techniques to check if AI-generated answers align with established policies, regulations, and known...
Jaxon watsonx Deminar June 2025.mp4
Authors: @Divya Pathak @Harshit Kumar Co-Authors: @Jayanth Putta @Guangya Liu @Madhu Tadiparthi @Adharsh H Introduction As AI evolves, so does the way it understands and responds to our questions. A popular...
I am putting this entry together to describe a project our IBM Storage SWAT team for the Americas have been tasked with to assist the setup of a Kubernetes based infrastructure leveraging IBM ESS at a GPU cloud provider. Context : Kubernetes community version Ubuntu Linux distribution ...
From Mixture of Experts podcast series How long until Anthropic drops Claude 5.0? On today’s bonus episode of Mixture of Experts, guest host Bryan Casey is joined by Chris Hay, Marina Danilevsky and Shobhit Varshney to analyze the newly released Claude 4.0 family: Opus 4 and Sonnet 4. ...
A compendium of tips, tricks, and techniques for implementing and optimizing Retrieval Augmented Generation (RAG) solutions. Aimed to provide solution architects, data engineers, developers, and 'hands on' technical folks with practical advice on implementing and optimizing...
We’re excited to present IBM Granite 4.0 Tiny Preview, a preliminary version of the smallest model in the upcoming Granite 4.0 family of language models, to the open source community. Granite 4.0 Tiny Preview is extremely compact and compute efficient: at FP8 precision, several concurrent...