-
Backup execution for Kubernetes
I ran into a Microsoft connectivity issue in Azure, which forced me to switch to local execution manually. Idea: when an execution is set to run in a pod but fails, retry it 'n' times, then, if it still fails, fall back to local execution. Error handling by default from local execution. Option to send admin an email…
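To make the retry-then-fallback idea concrete, here is a minimal Python sketch. It is only an illustration of the requested behaviour, not anything DSS exposes today: `run_with_fallback`, `run_in_pod`, `run_locally` and `notify_admin` are all hypothetical names, and "retry 'n' times" is assumed to mean one initial attempt plus n reattempts.

```python
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def run_with_fallback(
    run_in_pod: Callable[[], T],      # hypothetical: runs the job in a Kubernetes pod
    run_locally: Callable[[], T],     # hypothetical: runs the same job locally
    retries: int = 3,                 # 'n' reattempts after the first failure (assumption)
    notify_admin: Optional[Callable[[str], None]] = None,  # hypothetical email hook
) -> T:
    """Try the pod execution, reattempt `retries` times, then fall back to local."""
    last_error: Optional[Exception] = None
    for attempt in range(1 + retries):
        try:
            return run_in_pod()
        except Exception as exc:  # any pod-side failure triggers a reattempt
            last_error = exc

    # Every pod attempt failed: optionally tell the admin, then run locally.
    if notify_admin is not None:
        notify_admin(
            f"Pod execution failed after {1 + retries} attempts "
            f"({last_error!r}); falling back to local execution."
        )
    # Errors raised here propagate as they would for a normal local run.
    return run_locally()
```

With `retries=2`, a pod that is unreachable would be tried three times in total before the local run starts. A real implementation would probably also want a delay between attempts.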
-
Idea: Include Associated Objects When Duplicating Dataset / Flows
Hello. When duplicating parts of a flow in Dataiku, the associated datasets are duplicated, but the charts built on those datasets are not included. This means that users have to manually recreate or copy these charts, which can be time-consuming and prone to errors. Benefits: Including the duplicate feature for…
-
Improve Tasks in Project Todo List
I would really like to use the todo list on a Dataiku DSS project home page to communicate with my team. However, it is not always easy to use. Here are a couple of suggested improvements: Allow tasks to be moved / reordered. The text editing area has a fixed maximum height, which makes it difficult to edit,…
-
Being able to set User Settings via Python API
There are now more user settings than ever in the user's profile page (DSS/profile/). In v12.3.2 there are now 10 different email notification settings. We would like to be able to customise these to our preferred defaults via the Dataiku Python API. Currently the Dataiku Python API does not support this. Thanks
-
Easier Undo Actions in Dataiku DSS
Hello, Dataiku users. In my daily use of Dataiku, I find it very convenient overall, but the lack of an "undo" feature is often inconvenient. Currently, Dataiku does not have a direct "undo" button or a "Ctrl+Z" function to immediately revert mistakenly deleted steps or recipes. If such a feature were available, I believe…
-
Add support for Pandas 2.0
Pandas 2.0 can bring great performance improvements when using the pyarrow backend: https://towardsdatascience.com/pandas-2-0-a-game-changer-for-data-scientists-3cd281fcc4b4
-
Improve Code Env Rebuild Process for On Prem Upgrades
When upgrading Dataiku in place, it can take 10+ hours to rebuild images and all code envs. Workarounds such as including core packages in base images or caching PyPI indices help only marginally. Upgrade time scales linearly with the number of envs on nodes, which is unfortunate. We cannot execute an actual blue/green…
-
Auto Sync Jupyter Notebook and Corresponding Recipe Code
I often find myself editing Python Jupyter Notebooks that have an associated recipe. If I save the notebook but forget to "Save Back to Recipe," there is a "Mismatch between recipe and notebook." In turn, running the recipe as a standalone script, or running the recipe through scenario automation results in an unintended…
-
Prevent deletion of objects that are used in Scenarios
The stability of our workloads is very important to us. We now have significant dependencies in our Dataiku platform so we are very sensitive to any changes that can impact our scheduled scenarios. After being burnt multiple times we have identified a series of user actions that can result in scenarios failing without the…
-
Provide an API to access SQL Notebooks
One of the great things about DSS is the ability to access virtually any aspect of the platform via the API. SQL Notebooks are a major exception. The request is to close this gap and make SQL Notebooks accessible via the API. It'd be great if through the UI one could export and import these notebooks like one can with…