-
Provide ability to export Insights to images in Scenario Steps and the Python API
Currently only Dashboards can be exported to images in Scenario Steps (Export Dashboard step). While there is an export option in the GUI to export Insights to images this is not possible to do via Scenario Steps nor the Python API. So please add support for this. And also extend the Python API to allow Dashboard exports…
-
Custom recipe: Default value not work for values from API
In plugin I cannot see default value for field in recipe.json, when value comes from API. In recipe.json I have field: { "name": "field", "label": "Field", "type": "SELECT", "getChoicesFromPython": true, "mandatory": true, "defaultValue": "Default", }, In above code default value is set to "Default", but it is not set in…
-
How to execute a recipe after an empty dataset ?
Is there any possible way of checking readyness of a dataset? I have a dataset that might be empty after a Hive query, it shouldn't be a problem but since it is (I cannot use it in a left join...) I decided to build another dataset that would contain either the result if it exists or a dummy line if it does not. All this…
-
Can the hyper-parameter change for each new training model with each new dataset
Dear dataikuler thanks for reading my question. Hi, so my problem is when i re-train my model with different dataset (like my first dataset is from 12/10 /2024 to 12/10/2025 and my second dataset is 30/11/2024 to 30/11/2025) and then i deploy the second model i check the hyperparameter of each version and i see all of them…
-
Append a pandas dataframe to an already existing Dataset within a plugin
I'm creating a custom plugin containing a recipe that evaluates a machine learning model and outputs a DSS Dataset with performance metrics (it is very similar to the in-built Evaluate recipe). However, each time I train the model, I would like to append the new performance record to the already-existing Dataset rather…
-
Can a project variable be used as a scenario variable?
Hi, I have a SQL query that when it returns no data, I need to be able to change the email content of my scenario reporter. I am currently setting a project variable and have a couple questions: 1. Can I use the project variable to conditional change which email reporter to use? 2. If no to #1, what is the best way…
-
Problem using python recipe
I have a problem when using a python recipe that uses the kubernetes engine when running. I checked the log step by step, the python process should have been completed, but for some reason the kubernetes engine does not show a sign of completion and looks like an unfinished loop. Please help with this error.
-
The recipe execution is taking long time due to handling a large volume of data in dataiku
We are experiencing long execution times for a recipe in Dataiku due to handing large datasets, while we have implemented partitioning using a filter on a specific column, it still takes 1.5-2 hours to partitioning 30M records. Is there a more efficient way to handle and process this data quickly and effectively because…
-
How to run integration tests on flows with Python recipes
I've recently started to use the "Run integration test" scenario step for testing. It's definitely some work to create the test reference datasets but it once set up it's great to be able to run this test after later code changes to confirm the process works as expected. Our flows typically mostly use SQL script recipes.…
-
Add PEP8 validation for Python code
It would be very helpful to have PEP8 formatting validation of Python code integrated into the UI in places where Python code is used. As this is a standard that our code, and I guess many peoples', needs to abide by. Most useful examples would be Python code recipes (maybe as an extra validate option) and Project…