Cloud Pak for Data

 View Only
  • 1.  General Data Fabric questions

    Posted Wed September 21, 2022 11:10 AM

    New to the community. Trying to get a handle on the concepts behind CP4D

    1) Are all data assets in the data fabric have to be virtualized? Ex. If I created an ETL to load a table, can someone access the table directly from the data fabric or would I have to virtualize it first? Or perhaps a flat file on a network share, does someone access it directly or do I need to first virtualize?

    2)  when it comes to exposing data ​what's the difference between the catalog and the data fabric?


    thanks 



    ------------------------------
    glenn garze
    ------------------------------

    #CloudPakforDataGroup


  • 2.  RE: General Data Fabric questions

    Posted Thu September 22, 2022 01:08 PM
    Glenn, 

    1. Virtualized tables are just one of the ways to access data on Cloud Pak for Data. You can also create connections directly to remote data sources, like Db2, and add tables as data assets. You can add files, like CSVs, from your local system. See: Adding data to a project (Watson Studio and Watson Knowledge Catalog). For ETL, you can use DataStage: Transforming data (DataStage)

    2. A data fabric is an architecture for providing access to data without moving the data and an integrated set of tools for getting value from data. A catalog is the part of the data fabric where you share data assets. Here's more information about a data fabric on Cloud Pak for Data: Data fabric use cases.  

    I hope that helps!

    ------------------------------
    Inge Halilovic
    ------------------------------



  • 3.  RE: General Data Fabric questions

    Posted Thu September 29, 2022 03:59 PM
    Thank you Inge. This is helpful.

    ------------------------------
    glenn garze
    ------------------------------