-
Easier Undo Actions in Dataiku DSS
Hello, Dataiku users. In my daily use of Dataiku, I find it very convenient overall, but the lack of an "undo" feature is often inconvenient. Currently, Dataiku does not have a direct "undo" button or a "Ctrl+Z" function to immediately revert mistakenly deleted steps or recipes. If such a feature were available, I believe…
-
Add support for Pandas 2.0
Pandas 2.0 can bring great performance improvements when using the pyarrow backend: https://towardsdatascience.com/pandas-2-0-a-game-changer-for-data-scientists-3cd281fcc4b4
-
Add option to support non-pandas dataframes (e.g. polars) in Python recipes
Hi, There are many pandas alternatives. One that is new and very fast is polars. Polars is built on Rust so it is memory safe and runs in parallel by design. I use polars in one of my recipes but have to convert it to pandas to write the dataset. thx
-
Improve Code Env Rebuild Process for On Prem Upgrades
When upgrading Dataiku in place, it can take 10+ hours to rebuild images and all code envs. Workarounds include including core packages in base images or caching pypi indices but these are minor. Upgrade time scales linearly with the number of envs on nodes, which is unfortunate. We cannot execute an actual blue/green…
-
Auto Synch Jupyter Notebook and Corresponding Recipe Code
I often find myself editing Python Jupyter Notebooks that have an associated recipe. If I save the notebook but forget to "Save Back to Recipe," there is a "Mismatch between recipe and notebook." In turn, running the recipe as a standalone script, or running the recipe through scenario automation results in an unintended…
-
Prevent deletion of objects that are used in Scenarios
The stability of our workloads is very important to us. We now have significant dependencies in our Dataiku platform so we are very sensitive to any changes that can impact our scheduled scenarios. After being burnt multiple times we have identified a series of user actions that can result in scenarios failing without the…
-
Provide an API to access to SQL Notebooks
One of the great things about DSS is the ability to access virtually any aspect of the platform via the API. SQL Notebooks are a major exception. The request is to close this gap and make SQL Notebooks accessible via the API. It'd be great if through the UI one could export and import these notebooks like one can with…
-
Add test button for SQL query change and Custom trigger scenario triggers
Users love the flexibility of having SQL query change and custom trigger (Python) scenario triggers. However these two types of triggers involve custom user code and they are not easy to test since the user would need to either wait for Dataiku to execute them and check the logs or take the SQL / Python code outside of the…
-
RSS feed for Dataiku releases, Dataiku Plugin releases and Dataiku Security Advisories
As a Dataiku Administrator I like to keep up-to-date on new Dataiku releases, Dataiku Plugin releases and Dataiku Security Advisories. At the moment there is no automated way of doing so. The most common way to programmatically publish information in a computer readable format is via RSS (Really Simple Syndication). Having…
-
Auto-generated visual recipe concept
Problem Statement: Right now, there are great, no-code visual recipes in DSS for select, common machine learning algorithms and data wrangling tools. In other cases, code recipes are required and users repeatedly have to both code the algorithms and add the same lines of code wrappers in DSS code recipes to map to datasets…