Transformation on dataset at source

Solved!
piyushk
Level 4
Transformation on dataset at source

How can we transform a dataset at the source level from Dataiku? For example, I am extracting data from a table and would like to perform cleaning operations at the source level using the computation power of the source. Can we do it through DSS?

1 Solution
VinceDS
Dataiker

Hi, you can use DSS visual recipes to perform cleaning / wrangling operations, if properly configured they will leverage the underlying execution engines (SQL, Spark, Hive...)

https://academy.dataiku.com/latest/concepts/where-compute-happens.html

You can also use code recipes and queries and write your own scripts to manage data extraction and cleaning operations.

Hope this helps

 

View solution in original post

1 Reply
VinceDS
Dataiker

Hi, you can use DSS visual recipes to perform cleaning / wrangling operations, if properly configured they will leverage the underlying execution engines (SQL, Spark, Hive...)

https://academy.dataiku.com/latest/concepts/where-compute-happens.html

You can also use code recipes and queries and write your own scripts to manage data extraction and cleaning operations.

Hope this helps