AI on IBM Z & IBM LinuxONE

Leverage AI on IBM Z & LinuxONE to enable real-time AI decisions at scale, accelerating your time-to-value, while ensuring trust and compliance

View Only

Back to Blog List

IBM Z Deep Learning Compiler 4.2.0

By Charles Volzka posted Thu April 18, 2024 02:35 PM

IBM Z Deep Learning Compiler 4.2.0

Today we are excited to announce the 4.2.0 update for the IBM Z Deep Learning Compiler (IBM zDLC). The icr.io/ibmz/zdlc:4.2.0 image is available now from the IBM Z and LinuxONE Container Image Registry. An updated tutorial is also available at https://github.com/IBM/zDLC. For those interested for enterprise level support for mission critical workloads, IBM zDLC is included in the AI Toolkit for IBM Z and IBM LinuxONE.

Changes in this release:

New compile time options
- New flags to control placement of individual operators between NNPA and CPU
  - Removed old -execNodesOnCpu flag
Updated ONNX-MLIR and ONNX
ONNX Operators Updates
Bug fixes and Performance Improvements

New Compile Time Options

--nnpa-save-device-placement-file=<json_file> and --nnpa-load-device-placement-file=<json_file> replaces previous --execNodesOnCpu=<node-name>

The flag to control whether and individual operator goes to the NNPA or the CPU has changed this release. The previous --execNodesOnCpu=<node_name> flag has been removed and replaced by --nnpa-save-device-placement-file=<file> and --nnpa-load-device-placement-file=<file>. These new flags give greater control over device placement, allow saving placement configuration for reuse, and are compatible with models where nodes are not individually named. An example of how to use the new flag is available at https://github.com/IBM/zDLC?tab=readme-ov-file#nnpa-device-placement

Updated to the latest ONNX and ONNX-MLIR

The IBM zDLC is based on on cutting edge open source technology. These updates enable the many features in this release. In zDLC 4.2.0 we've updated to include the latest released versions of the following:

ONNX Operators Updates

CPU Operators
- New
  - Gelu
  - CastLike
  - DFT
  - GroupNormalization
  - LayerNormaliztion
- Updated for ONNX OpSet 20
  - AveragePool
  - Cast
  - Constant
  - IsInf
  - IsNaN
  - Pad
  - QuantizeLinear
  - Resize
  - Slice
  - Split
NNPA / Integrated Accelerator for AI Operators
- GRU added support for optional sequence_lens input

If you have questions on getting started with AI on IBM Z, reach out to us at aionz@us.ibm.com

0 comments

26 views

Permalink

https://community.ibm.com/community/user/blogs/charles-volzka/2024/04/15/ibm-z-deep-learning-compiler-420

AI on IBM Z & IBM LinuxONE

AI on IBM Z & IBM LinuxONE

IBM Z Deep Learning Compiler 4.2.0

By Charles Volzka posted Thu April 18, 2024 02:35 PM

IBM Z Deep Learning Compiler 4.2.0

Changes in this release:

New Compile Time Options

Updated to the latest ONNX and ONNX-MLIR

ONNX Operators Updates

Permalink

Additional
Resources

Office

Quick Links

AI on IBM Z & IBM LinuxONE

AI on IBM Z & IBM LinuxONE

IBM Z Deep Learning Compiler 4.2.0

By Charles Volzka posted Thu April 18, 2024 02:35 PM

IBM Z Deep Learning Compiler 4.2.0

Changes in this release:

New Compile Time Options

Updated to the latest ONNX and ONNX-MLIR

ONNX Operators Updates

Permalink

Additional Resources

Office

Quick Links

Additional
Resources