Community Conundrum 25:Feature Visualization is now live! Read More

Transformation on dataset at source

Level 3
Transformation on dataset at source

How can we transform a dataset at the source level from Dataiku? For example, I am extracting data from a table and would like to perform cleaning operations at the source level using the computation power of the source. Can we do it through DSS?

1 Reply
Dataiker
Dataiker

Hi, you can use DSS visual recipes to perform cleaning / wrangling operations, if properly configured they will leverage the underlying execution engines (SQL, Spark, Hive...)

https://academy.dataiku.com/latest/concepts/where-compute-happens.html

You can also use code recipes and queries and write your own scripts to manage data extraction and cleaning operations.

Hope this helps