Sign up to take part
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Added on May 6, 2021 1:17AM
Likes: 3
Replies: 3
The prepare recipe is one of the most complex and useful aspects of Dataiku. But currently, the only way to create a prepare recipe programmatically is to define it using json. This can be a lot to manage for more complex recipes, especially since the developer is left needing to manually create recipes to find out what options are valid for the json document.
An API could make interacting with these recipes substantially easier, especially when it comes to debugging and knowing which options are valid with which processors.
It looks like other Dataiku users are also looking for this functionality.
Some features that would be really cool to have:
An example use-case:
I have a data warehouse I'd like to create semantic views for. After completing the necessary joins and stacks, the data needs to be transformed so it can be consumed by human users. This workflow involves renaming columns, changing column types and meanings, transforming strings, and mapping enums. Every recipe is slightly different depending on the configuration of the source dataset. For more than 7,700 datasets, I need to create prepare recipes that will perform and validate the necessary transformations, providing user-friendly datasets on the other side. These can then be exposed for integration into other projects where the data will be used. Currently, this can be done with a Python recipe, which will iterate through the datasets and generate recipes. However, the process is rather fragile since for every dataset, a complex json object defining all the processors for each prepare recipe must be constructed. With an API, developing and maintaining scripts to programmatically define these prepare recipes would be much easier.
I believe @tomas
(post) and @adamnieto
(post linked above) might be interested in this feature as well based on their posts.
Thank you, @natejgardner
!
Thanks for your idea, @natejgardner
. Your idea meets the criteria for submission, we'll reach out should we require more information.
If you’re reading this and think an API for the Prepare Recipe would be a great capability to add to DSS, be sure to kudos the original post!
Take care,
Ashley