Cloud Pak for Data

 View Only
Expand all | Collapse all

Data refining of virtualised data?

  • 1.  Data refining of virtualised data?

    Posted Mon September 19, 2022 10:26 AM

    Hi,

    I virtualized some data with the help of a snowflake connection, now I want to prepare the data without posting the data to the project. So can we prepare the virtualized data without posting it into any project or catalog.



    ------------------------------
    Akshay Nirmal
    ------------------------------

    #CloudPakforDataGroup


  • 2.  RE: Data refining of virtualised data?

    Posted Mon September 19, 2022 01:53 PM

    Hi @Akshay Nirmal
    ​You can work with visualized snowflake data without copying data to project but you still need to register "connected" data asset or at least "connection" to snowflake in Project. This only captures metadata about where the data is, without physically moving data.

    In CP4D 4.5, this is done via https://www.ibm.com/docs/en/cloud-paks/cp-data/4.5.x?topic=project-adding-data-from-connection

    Once you "register" your virtual data, you can use data refinery tool to refine it.

    When describing your Refinary job, you can indicate target table that is also remote https://www.ibm.com/docs/en/cloud-paks/cp-data/4.5.x?topic=data-copying-from-source-target



    ------------------------------
    Lena Woolf
    Senior Technical Staff Member
    IBM
    ------------------------------