IBM Storage Ceph

Announcing the IBM Redbooks publication “Unlocking Data Insights and AI: IBM Storage Ceph as a Data Lakehouse Platform for IBM watsonx.data and Beyond”

By Gucer Vasfi posted 11 hours ago

This practical book starts with a foundational overview of modern data lake architecture. Then, it explores how IBM Storage Ceph's storage and IBM watsonx.data's analytics capabilities combine to create a powerful Data and AI platform for unlocking valuable insights from data. Finally, the book dives deep into a real-world customer scenario using an S3 data lake.
 
Table of Contents
Chapter 1.  Introduction
Chapter 2.  The modern data lake architecture
Chapter 3.  Building a scalable data lake with Ceph Object Storage
Chapter 4.  Replacing Hadoop Distributed File System (HDFS) with IBM Storage Ceph Object Storage
Chapter 5.  IBM Storage Ceph with IBM watsonx.data
Chapter 6.  Retail use case
Chapter 7.  Ingest: Landing and raw zones
Chapter 8.  Transform: Staging and curated zones
Chapter 9.  Consume: Curated zone
 
Readers can explore each stage of the data pipeline (ingestion, transformation, and consumption) through step-by-step instructions and hands-on examples. All code samples created for the book's scenarios are available on the Redbooks GitHub, so customers can download them and experiment with the scenarios in their own environments. To enhance the learning experience, we added “Key Takeaways” sections throughout the book that summarize key points and offer best-practice suggestions.
 
If you are interested in learning more about IBM Storage Ceph, you can also refer to the following Redbooks:
 
 
I hope you enjoy these Redbooks. If you have any comments or questions, please use the Comments section of this entry. Thank you.