Cloud Platform as a Service

Cloud Platform as a Service

Join us to learn more from a community of collaborative experts and IBM Cloud product users to share advice and best practices with peers and stay up to date regarding product enhancements, regional user group meetings, webinars, how-to blogs, and other helpful materials.

 View Only

Introducing Kubernetes and OpenShift clusters on IBM Cloud with NVIDIA H200 GPUs

By Elvin Galarza posted Mon February 24, 2025 01:32 PM

  

At IBM, we’re dedicated to offering state-of-the-art technology to our customers as the world continues to evolve. That’s why we’re excited to expand our GX3 family with another flavor.


On February 10, we made the
NVIDIA H200 Tensor Core GPU generally available for IBM Cloud Kubernetes Service (IKS) and Red Hat OpenShift on IBM Cloud (ROKS) clusters running on IBM Cloud VPC. While both the H100 and the H200 are suited for demanding artificial intelligence workloads and are available as part of the GX3 family in IBM Cloud, the H200 boasts larger memory capacity, higher memory bandwidth, and improved performance GPU-to-GPU than that of the H100.  Offering 141 GB of HBM3e memory at 4.8 TB/s, the H200 nearly doubles the capacity of the H100 of 80 GB of at 3.35 TB/s. The substantial increase in memory capacity and enhanced bandwidth enables the H200 to handle larger datasets and more complex models, especially Large Language Models (LLMs), translating to faster data transfer and processing. Learn more about the NVIDIA H200 Tensor Core GPU and explore how it leads in performance here

New GX3 flavor now available 

The following H200 GPU flavor is available for IBM Cloud VPC clusters that support RHEL 8 or RHCOS for Red Hat OpenShift and Ubuntu 24 for Kubernetes. Access to this flavor can be obtained here.

Here are the specs for the new GX3 flavor:

Flavor size

vCPUs

Memory

# of GPUs (H200)

Instance storage

Network bandwidth

gx3-160x1792x8h200

160

1.8 TB

8

8 x 7.7 TB

32 Gbps


Getting started with GX3 flavors on IBM Cloud Kubernetes Service

Once approved for access, enjoy a plug-and-play experience with IBM Cloud Kubernetes Service when provisioning a cluster. GPU drivers are automatically installed, and you can get started immediately by provisioning a new cluster at 1.31 or later with GX3 worker nodes. No additional configuration is required to set up the GPU. If you already have a 1.31+ cluster, simply add a worker pool that uses the GX3 nodes to your existing cluster. For more information, see Deploying an app on a GPU machine for IBM Cloud Kubernetes Service.

Getting started with GX3 flavors on Red Hat OpenShift on IBM Cloud 

Once approved for access, with Red Hat OpenShift on IBM Cloud, installing the NVIDIA GPU Operator automates the management of all the necessary NVIDIA software components. Once complete, provision a new cluster at 4.15 or later with the GX3 worker nodes. If you already have a 4.15+ cluster, simply add a worker pool that uses the GX3 nodes to your existing cluster. For more information, see Deploying an app on a GPU machine for Red Hat OpenShift on IBM Cloud. 

0 comments
39 views

Permalink