IBM Sovereign Core

IBM Sovereign Core

IBM Sovereign Core delivers a cohesive ready-to-run sovereignty software stack — combining an AI control plane, continuous compliance evidence, and governed agentic workflows across any hybrid environment.


#Automation
#AI
#Data
#Cloud
#Storage
 View Only

Open by Design: Generalist and Pre‑Built Agents in the Sovereign Core

By Shikha Srivastava posted Thu April 30, 2026 05:49 PM

  

Authors: 

@Shikha Srivastava - Distinguished Engineer, Sovereign Core and MCSP, Master Inventor, IBM Software
@SAMI MARREED - Staff Research Engineer - Intelligent Automation

@Gegi Thomas - Senior Technical Staff Member


1. Overview

As Generative AI shifts from simple chat interfaces to autonomous workflows, enterprises face a critical barrier: Governance at Scale. While agents reduce manual work, the risks of data egress and non-deterministic behavior in regulated industries remain a hard barrier to adoption.

The Customer Challenge

Our struggle isn't building an agent; it's proving that the agent respects our data boundaries every single time it reasons.

This challenge is what Sovereign Core is built to solve. We are introducing Agent as a Service—a framework where sovereignty isn't a bolt-on, but the fundamental execution layer.


Agent provisioning workflow


2. Out-of-the-Box Agents

We've introduced four specialized agent templates designed for rapid deployment without sacrificing domain-specific logic.

Knowledge Agent

Focused on RAG (Retrieval-Augmented Generation) within private networks. Suited for teams that need accurate retrieval over sensitive internal documents.

Healthcare Assistant

Pre-loaded with 23 medical-domain policies. Handles claims and provider lookups with HIPAA-aligned guardrails.

CRM Agent

Connects to private CRM instances to automate lead nurturing and ticket triaging without data ever leaving the VNET.

Live Docs Agent

Syncs with protected documentation servers on a configurable schedule, so the agent's retrieval pool reflects the current state of your docs.

Model as a Service — Specialized Agents

Several of the out-of-the-box agents — including the Healthcare Assistant and CRM Agent — are powered by gpt-oss-20b, an open-source 20B-parameter model deployed on Sovereign Core's Model as a Service infrastructure. This keeps all inference within the Sovereign Core boundary while giving those agents a dedicated reasoning engine tuned for their domain.


3. Architectural Spotlight: Boundary Isolation

As we designed this platform, we prioritized Boundary Isolation. In standard environments, agents call out to external reasoning engines, creating a leaky perimeter. Sovereign Core keeps the data, the control plane, and the execution engine within the same logical boundary.

Execution

Agents run in transient, isolated containers within the tenant's workspace.

Control

Policies define what the agent can decide, when it must escalate, and what it must never do — expressed in plain language that compliance teams can read and sign off on, not just engineers.

Connectivity

Uses MCP (Model Context Protocol) and OpenAPI for secure tool integration.

Agent Harness

A thin orchestration wrapper that bootstraps agent context, injects guardrails, and wires tools before any reasoning begins — ensuring every run starts from a trusted, policy-compliant baseline.

Observability

Distributed traces for every agent run are emitted to Grafana Tempo — an open-source, cost-effective tracing backend. Each reasoning step, tool call, and retrieval event becomes a queryable span, giving operators full end-to-end visibility without data leaving the cluster.


4. Built to Be Configured, Not Customized Around

Most agent platforms lock you into their model choice and their defaults. CUGA inverts this — the platform has no opinion on your domain, your model, your tools, or your data sources. Model selection is a first-class configuration: point the agent at any model-as-a-service endpoint you've approved, or on Sovereign Core, run open-source models entirely under your own infrastructure. The model you choose directly shapes the agent's reasoning quality, latency, and cost — so that decision belongs to you, not the platform. Policies, tools, and knowledge follow the same pattern, each independently configurable with no dependency between them.

Policies

Governance layer

Policies govern exactly how the agent reasons, responds, and uses its tools — without requiring code changes. Each policy type targets a specific layer of agent behavior, from what the agent is allowed to intend, to how it formats its output.

  • Intent Guard — blocks off-topic or policy-violating requests before reasoning begins
  • Use Case Playbooks — domain-specific instruction sets that shape how the agent handles a given scenario
  • Tool Guides — define when and how the agent is permitted to invoke each connected tool
  • Output Formatter — enforces response structure, tone, and content boundaries on every reply
  • Tool Approvals — require explicit human or system approval before the agent executes sensitive tool actions

Tools

Integration layer

Connect your authorized tools to the agent without writing a custom integration from scratch. Point CUGA at an OpenAPI spec or an MCP server, and it discovers available actions automatically — so the same agent can talk to your CRM, your ticketing system, and your data warehouse. Every tool connection is scoped to what you've explicitly permitted; no changes to the core runtime required.

  • OpenAPI auto-discovery
  • MCP servers
  • MCP server management (coming soon)
  • Private VNETs only
  • Per-tool approval controls

Knowledge

Context layer

Attach one or many private knowledge bases to an agent at deploy time. The agent retrieves only what it needs, only from sources you've authorized — no training, no fine-tuning, and retrieval is scoped per tenant so one agent cannot access another's data. Swap or update a knowledge base and the agent picks it up on the next run.

  • Private RAG
  • Docling support
  • Multi-document type ingestion
  • Agent-level knowledge
  • Session-level knowledge
  • Tenant-isolated retrieval
  • Live doc sync

Each dimension is independently configurable — change your tools without touching your policies, or swap your knowledge base without redeploying.

All of this configurability is only credible if you can trust what's underneath. That's why the runtime powering every agent in Sovereign Core is open source.


5. CUGA — Open Source Agent Runtime

Open Sourcegithub.com/cuga-project/cuga-agent

The execution layer powering these agentic capabilities is built on CUGA (Configurable Agent-Harness for the Enterprise) — an open-source framework that runs every agent in Sovereign Core. Because CUGA is open source, enterprises can inspect every line of reasoning logic, audit the graph execution, and extend the framework to meet their specific compliance requirements.

This is not incidental: open source is a sovereignty guarantee. When an agent's runtime is a black box, sovereignty is a promise. When it's open code, sovereignty is verifiable. CUGA's LangGraph-based execution graph gives regulated industries the auditability they need — every reasoning step and tool call is traceable and tied to an open, inspectable execution trace.

For general-purpose CUGA deployments, the default backbone is gpt-oss-120b — a high-capacity open-source model that handles complex, multi-step reasoning while remaining fully air-gapped within your infrastructure. Because CUGA is model-agnostic by design, the model layer is swappable: teams already running gpt-oss-20b on Model as a Service can reuse the same configuration, and additional open-source models are coming soon.

The observability stack is equally open: CUGA emits OpenTelemetry traces for every reasoning step directly into Grafana Tempo, an open-source distributed tracing backend. Operators get per-run flame graphs, tool-call latencies, and retrieval timings — all stored and queried inside the tenant's own infrastructure, with no telemetry phoning home.

Runtime highlights: LangGraph execution; MCP tool integration; policy-driven reasoning; gpt-oss-120b (default); gpt-oss-20b (MaaS); more models coming soon; Grafana Tempo (OpenTelemetry tracing).


Video demo: Provisioning to Deployment


6. Sovereignty & Value

Sovereignty has practical consequences beyond compliance. Because agents are defined by portable JSON manifests, a team can export an agent's full configuration — policies, tools, knowledge references — and redeploy it to a different sovereign node without rebuilding from scratch.

Efficiency & Performance

Eliminating external LLM round-trips reduces per-request latency, and using pre-built templates cuts the time to get a governed agent running compared to building one from scratch.

Cost Improvements

Centralized resource management for agents allows MSPs to optimize compute costs by dynamically scaling agent instances based on workload demand.


7. Key Takeaways

  • Sovereignty by Design: Data never leaves the workspace, and reasoning is strictly guided by policies.
  • Auditable by default: Every configuration change and agent interaction is captured in a full audit history you can export for review.
  • Faster deployment: Pre-built agent templates give teams a working, governed starting point rather than a blank slate — reducing the time from design to production.
  • Open Source = Verifiable Sovereignty: The underlying CUGA runtime is open source. Customers can inspect, audit, and extend the execution graph — turning sovereignty from a vendor promise into a technical fact.
  • Open-Source Models, Zero Data Egress: General CUGA deployments run on gpt-oss-120b by default, while specialized agents use gpt-oss-20b via Model as a Service. More open-source models are on the roadmap — the platform is model-agnostic by design.
  • Open-Source Observability with Grafana Tempo: Every agent run emits OpenTelemetry traces into Grafana Tempo, giving operators full visibility into reasoning steps, tool calls, and retrieval latencies — all inside the tenant's own infrastructure.


8. What's Next

The capabilities shipped today are the foundation. The roadmap extends the agent framework across four areas — making agents more context-aware, more capable, and more deeply integrated with enterprise event systems.

Memory Integration

Agents will persist context across sessions — retaining user preferences, prior decisions, and interaction history to enable continuity in long-running workflows.

Skills

Reusable, composable capabilities that can be attached to any agent — letting teams share and standardize tested behaviors across deployments without duplicating configuration.

Event-Driven Integrations

Agents triggered by external events — webhooks, message queues, scheduled jobs — rather than only responding to direct user input, enabling fully autonomous background workflows.

Policies & Knowledge Improvements

Richer policy primitives, versioned playbook management, and expanded knowledge ingestion — including deeper Docling integration and broader document type support for more complex retrieval scenarios.

Links to Sovereign Core to Learn More:

0 comments
88 views

Permalink