watsonx.data

Read about the latest and greatest, how-tos, best practices, and use cases from our experts and experienced product users.

We would love for you to blog with us! IBMers can blog without approval. All other users please apply here to become a blogger.

 View Only

Search Blogs

Apache Arrow Flight is a general-purpose client-server framework that simplifies high performance transport of large datasets over network interfaces. In this article, we will see how to extend the Arrow Flight OSS module in watsonx.data Presto (Java) to connect to an Arrow Flight based data sources. We will also see an example of Presto IBM Arrow ...
0 comments
Imagine a world where querying complex data feels seamless, where exploring datasets becomes intuitive, and where operational bottlenecks are no longer part of the equation. That’s the vision behind the latest tech preview features of watsonx.data . While data lakes and lakehouses are transformative, they often bring challenges like slow query ...
0 comments
Getting Started with IBM watsonx.data Milvus 1. Introduction With Milvus integrated into IBM watsonx.data , users can now leverage advanced vector search capabilities with enterprise-grade scalability, reliability and security. To kick off your Milvus journey on watsonx.data, please follow these prerequisites: ...
0 comments
Introduction to Milvus: The Foundation of GenAI From creating realistic virtual assistants to automating complex content generation tasks, GenAI adoption is accelerating across industries. Crafting human-like text, generating stunning visuals, or creating immersive audio experiences, GenAI is rapidly redefining how we interact with technology across ...
1 comment
Purpose In this article, we’ll see how to add remote data sources to IBM watsonx.data, and how to add tables from these remote sources to an IBM Cloud Pak for Data governed catalog. We’ll additionally apply a data protection rule to mask some of that data, protecting sensitive information whether the table is viewed in IBM Cloud Pak for Data or in ...
0 comments
This document is designed to guide developers in selecting similarity metrics and In-Memory indexes when using the Milvus vector database. It provides a concise overview of key concepts and parameters, encourages iterative testing with your dataset, and invites users to explore the resource links for an in-depth understanding of the topics discussed. ...
0 comments
This document is designed to guide developers in selecting similarity metrics and in-memory indexes when using the Milvus vector database. It provides a concise overview of key concepts and parameters, encourages iterative testing with your dataset, and invites users to explore the resource links for an in-depth understanding of the topics discussed. ...
0 comments
Introducing Open Unified Data Governance with watsonx.data Thu, Dec 12, 2024 11:00 AM EST Summary IBM watsonx.data announces the latest enhancement to its Common Policy Gateway, enabling seamless integration with third-party policy engines like Apache Ranger. This new feature provides customers with unprecedented flexibility in managing ...
0 comments
As businesses embrace data-driven strategies, the complexity of managing data governance across diverse environments has grown exponentially. Organizations now require governance solutions that not only enforce robust access controls but also ensure compliance with regulatory standards across hybrid and multi-cloud landscapes. IBM watsonx.data’s latest ...
0 comments
Making the right decisions for your data architecture is crucial for efficient performance and optimal cost. Open lakehouse architectures have become popular due to their increased flexibility by storing data in open source storage formats. Additionally, open data lakehouses leverage a data catalog to allow multiple, governed access points and manage ...
0 comments
The white paper discusses augmenting Db2 and Netezza workloads with watsonx.data, a transformative approach to handling data workloads. It highlights two approaches: co-existing and augmentation. The co-existing approach involves integrating Db2/Netezza with watsonx.data, enabling seamless interaction between platforms. The augmentation approach identifies ...
3 comments
IBM’s watsonx.data is stepping up with two exciting features in tech preview: Spark C++ (Gluten) and Materialized Views. Here’s what you need to know about these features and how you can be one of the first few to try it. What’s New? 1. Materialized View We’re thrilled to announce that Materialized View marks the first public ...
0 comments
Streamline Your Data Pipelines with dbt, Airflow, and VSCode Integration Building and managing complex data pipelines is a common pain point for organizations. Switching between tools and languages slows progress, while coordinating workflows adds inefficiencies. watsonx.data now offers a powerful suite of tools for the modern dataops stack: ...
0 comments

What's New watsonx.data 2.0.2

What's New watsonx.data 2.0.2 The new version 2.0.2 of watsonx.data has been released July 2024! There are many new exciting features and enhancements related to data sources, storage, integration, and access management. You can now store your data using Azure Data Lake Storage (ADLS) with the new Data Access Service (previously Content ...
1 comment
Presto C++: Revolutionizing Data Analytics Discover the Future of Presto with Presto C++! Join us for an in-depth exploration of the next-generation Presto engine. Built on the powerful Velox library, Presto C++ delivers unmatched performance and reliability. In this webinar, we'll delve into: Unleashing Performance: Learn how ...
0 comments
Get more from your IBM Z data with Data Gate for watsonx Data from IBM Z can now be seamlessly synchronized into watsonx.data for deeper, more trustworthy AI insights. Utilize transactional data from IBM Z combined with other lakehouse data to enhance your enterprise machine learning and AI initiatives. Here are 5 Key Benefits of Data Gate ...
0 comments
Introduction With the ever changing, fast paced IT industry where the product/project deliverables have become more agile and with more emphasis on exploratory testing, one begins to wonder whether Test design techniques still hold any importance. Test design technique Before we delve into the topic, two of the standard ...
0 comments
Installing OpenShift Data Foundation on OpenShift Container Platform using local storage devices 1) Login to RH Openshift console using kubeadmin and corresponding password. 2) On the left had side of the menu, select Operators --> Operator Hub. And in the search bar search for Local Storage. ...
0 comments
PrestoCon Day 2024 Recap: A Celebration of Community and Innovation The annual PrestoCon Day brought together hundreds of data enthusiasts for a jam-packed day of learning and celebrating the open-source Presto project. Here are some key takeaways: Growth and Achievements: The Presto community has grown significantly, with a 3x increase ...
0 comments
IBM now has several supported embedding models available with watsonx.ai. Embedding models are encoder-only foundation models that create text embeddings. A text embedding encodes the meaning of a sentence or passage in an array of numbers known as a vector. The following embedding models are available in watsonx.ai: slate-30m-english-rtvr ...
0 comments