Yesterday , we used ETL to build a clean data and well governed data warehouse based on operational data, the sources can be multiple but they are well optimized in datamart and data reconciliation allow us to build a model that can answer a large variety of business questions.
This DWH is a single source of truth and multiple users cans easily compare their results and take the correct decision
This DWH since it's already clean and keep all history , it can be used to build predictive results , data scientist can leverage it to build their models.
==> the total amount of tool needed : 4 (ETL, DB, Data science tool, data visualization tool (or a Business intelligence suite if we need to address governance))
==> this approach, I believe was, and still suitable for the majority of companies
Today : we are providing a huge number of tool with different level of complexity, and the marketing behind it seems to focus mostly on AI and data science , multiplying the way to access data , and to process it , so one tool for governance , one tool for lineage, one tool for datascience one tool for this and one tool for that
I don't understand now :
==> where is the single source of truth ? how do you know from where the info is from and is the result is showing is accurate (or we need a tool on top of the others tools to make tat possible ?)
==> is data warehousing is still , the way we was doing it or its outdated ? the is still a huge amount of companies that do not have real time data, or Very large volume of data ? they will be lost in a data fabric as we are presenting it
==> there is a point in time where DWH and business intelligence have been replaced with data science and data fabric and a total end user independence that is in my opinion a non achievable objective , or at least for the second part to be achieved , the old architecture is still mandatory , I'm I mistaken ?
------------------------------
Mhamed Ben Jmaa
------------------------------
Original Message:
Sent: Tue May 10, 2022 03:56 PM
From: Shannon Rouiller
Subject: Questions for AMA: Data Fabric, Data Mesh, Data Lake, Data Warehouse, which one do you need?
We'd like to answer your questions about the differences between Data Fabric, Data Mesh, Data Lakes, and Data Warehouses.
We've arranged for experts from across IBM to answer your questions right here in this forum thread on on May 26 at 2pm Eastern/11am Pacific for a whole hour of AMA (Ask Me Anything). Our topic is Data Fabric, Data Mesh, Data Lake, Data Warehouse, which one do you need or do you need all of them? If you have questions, please start posting them as a response to this post.
Our experts will hop on the Cloud Pak for Data Community discussion forum on May 26 at 2pm Eastern/11am Pacific and start answering your questions right here in this thread.
------------------------------
Shannon Rouiller
Content strategist, Cloud Pak for Data
------------------------------
#CloudPakforDataGroup