At IBM, we’re dedicated to offering state-of-the-art technology to our customers as the world continues to evolve. That’s why we’re excited to expand our GX3 family with another flavor.
On February 10, we made the NVIDIA H200 Tensor Core GPU generally available for IBM Cloud Kubernetes Service (IKS) and Red Hat OpenShift on IBM Cloud (ROKS) clusters running on IBM Cloud VPC. While both the H100 and the H200 are suited for demanding artificial intelligence workloads and are available as part of the GX3 family in IBM Cloud, the H200 offers larger memory capacity, higher memory bandwidth, and better GPU-to-GPU performance than the H100. With 141 GB of HBM3e memory at 4.8 TB/s, the H200 nearly doubles the capacity of the H100, which provides 80 GB at 3.35 TB/s. The substantial increase in memory capacity and bandwidth enables the H200 to handle larger datasets and more complex models, especially Large Language Models (LLMs), translating to faster data transfer and processing. Learn more about the NVIDIA H200 Tensor Core GPU and explore how it leads in performance here.
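To make the comparison concrete, the short sketch below works out the ratios from the per-GPU figures quoted above and applies a rough, weights-only sizing rule. The model sizes (34B and 70B parameters) and the 2-bytes-per-parameter (FP16/BF16) assumption are illustrative choices, not figures from this announcement, and the estimate ignores KV cache, activations, and framework overhead.

```python
# Per-GPU figures as quoted in this post.
H100 = {"memory_gb": 80, "bandwidth_tb_per_s": 3.35}
H200 = {"memory_gb": 141, "bandwidth_tb_per_s": 4.8}

capacity_ratio = H200["memory_gb"] / H100["memory_gb"]
bandwidth_ratio = H200["bandwidth_tb_per_s"] / H100["bandwidth_tb_per_s"]


def weights_gb(params_billion: float, bytes_per_param: int = 2) -> float:
    """Rough weight footprint in GB: (params_billion * 1e9 * bytes) / 1e9.

    Assumption: weights only, 2 bytes per parameter by default (FP16/BF16).
    """
    return params_billion * bytes_per_param


print(f"capacity: {capacity_ratio:.2f}x, bandwidth: {bandwidth_ratio:.2f}x")

# Hypothetical model sizes, chosen only to illustrate the sizing rule.
for size in (34, 70):
    need = weights_gb(size)
    fits_h100 = need <= H100["memory_gb"]
    fits_h200 = need <= H200["memory_gb"]
    print(f"{size}B params: ~{need:.0f} GB weights, "
          f"fits one H100: {fits_h100}, fits one H200: {fits_h200}")
```

Under these assumptions, the weights of a 70B-parameter model (about 140 GB) exceed a single H100 but sit just inside a single H200, which is one way to read the "larger models" claim. Real deployments need additional headroom beyond the weights.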
New GX3 flavor now available
The following H200 GPU flavor is available for IBM Cloud VPC clusters that support RHEL 8 or RHCOS for Red Hat OpenShift and Ubuntu 24 for Kubernetes. Access to this flavor can be obtained here.
Here are the specs for the new GX3 flavor:
| Flavor size | vCPUs | Memory | # of GPUs (H200) | Instance storage | Network bandwidth |
|---|---|---|---|---|---|
| gx3-160x1792x8h200 | 160 | 1.8 TB | 8 | 8 x 7.7 TB | 32 Gbps |
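The flavor name appears to encode the main specs in the table. The sketch below parses it under the assumption that the pattern is `<family>-<vCPUs>x<memory in GB>x<GPU count><GPU model>`; that reading is inferred from the table above rather than taken from IBM documentation, and `parse_flavor` is a hypothetical helper, not part of any IBM tool.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Flavor:
    family: str
    vcpus: int
    memory_gb: int
    gpu_count: int
    gpu_model: str


# Assumed naming pattern: <family>-<vcpus>x<memory>x<gpu count><gpu model>
_FLAVOR_RE = re.compile(
    r"^(?P<family>[a-z0-9]+)-(?P<vcpus>\d+)x(?P<memory>\d+)"
    r"x(?P<gpus>\d+)(?P<model>[a-z]\w*)$"
)


def parse_flavor(name: str) -> Flavor:
    """Split a flavor name into its components, or raise ValueError."""
    match = _FLAVOR_RE.match(name)
    if match is None:
        raise ValueError(f"unrecognized flavor name: {name!r}")
    return Flavor(
        family=match["family"],
        vcpus=int(match["vcpus"]),
        memory_gb=int(match["memory"]),
        gpu_count=int(match["gpus"]),
        gpu_model=match["model"],
    )


flavor = parse_flavor("gx3-160x1792x8h200")
print(flavor)

# Aggregate GPU memory per worker node, using 141 GB per H200 from this post.
total_gpu_memory_gb = flavor.gpu_count * 141
print(f"total GPU memory per node: {total_gpu_memory_gb} GB")
```

Read this way, the 1792 in the name lines up with the 1.8 TB of memory in the table, and eight H200 GPUs give 1,128 GB of GPU memory per worker node.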
Getting started with GX3 flavors on IBM Cloud Kubernetes Service
Once you are approved for access, IBM Cloud Kubernetes Service offers a plug-and-play experience when provisioning a cluster. GPU drivers are installed automatically, and no additional configuration is required to set up the GPU. To get started, provision a new cluster at version 1.31 or later with GX3 worker nodes. If you already have a cluster at version 1.31 or later, add a worker pool that uses the GX3 nodes to your existing cluster. For more information, see Deploying an app on a GPU machine for IBM Cloud Kubernetes Service.
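As a rough illustration of what a GPU workload request can look like once GX3 workers are available, the sketch below builds a minimal Pod manifest in Python and prints it as JSON, which `kubectl apply -f` accepts. It assumes the cluster exposes GPUs through the standard NVIDIA device plugin resource name `nvidia.com/gpu`; the pod name, the image placeholder, and the `gpu_pod_manifest` helper are hypothetical, and running `nvidia-smi` assumes the container runtime makes that tool available inside the image you choose.

```python
import json

MAX_GPUS_PER_NODE = 8  # gx3-160x1792x8h200 has 8 H200 GPUs per worker


def gpu_pod_manifest(name: str, image: str, gpus: int = 1) -> dict:
    """Build a minimal Pod manifest that requests `gpus` NVIDIA GPUs."""
    if not 1 <= gpus <= MAX_GPUS_PER_NODE:
        raise ValueError(f"gpus must be between 1 and {MAX_GPUS_PER_NODE}")
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "restartPolicy": "Never",
            "containers": [
                {
                    "name": name,
                    "image": image,
                    # Lists the GPUs visible to the container, then exits.
                    "command": ["nvidia-smi"],
                    # Extended resources such as GPUs are requested via limits.
                    "resources": {"limits": {"nvidia.com/gpu": gpus}},
                }
            ],
        },
    }


# Hypothetical name and image placeholder; substitute your own CUDA-capable image.
manifest = gpu_pod_manifest("gpu-smoke-test", "<your-cuda-image>", gpus=1)
print(json.dumps(manifest, indent=2))
```

One possible way to use this is to redirect the output to a file and apply it with `kubectl apply -f <file>`, then check the pod logs for the GPU listing. The linked IBM Cloud documentation remains the authoritative reference for deploying on GPU workers.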
Getting started with GX3 flavors on Red Hat OpenShift on IBM Cloud
Once you are approved for access, installing the NVIDIA GPU Operator on Red Hat OpenShift on IBM Cloud automates the management of all the necessary NVIDIA software components. To get started, provision a new cluster at version 4.15 or later with GX3 worker nodes. If you already have a cluster at version 4.15 or later, add a worker pool that uses the GX3 nodes to your existing cluster. For more information, see Deploying an app on a GPU machine for Red Hat OpenShift on IBM Cloud.