-
Remote kernel for notebook/recipe
Similar to the remote kernel capability in Jupyter notebooks. The reason is that some libraries/capabilities are already available on a remote server/terminal, so the code would execute on the remote kernel and only the result would be returned.
-
Document conversion to source RAG
Support libraries such as docling (https://ds4sd.github.io/docling/) and markitdown (https://github.com/microsoft/markitdown).
-
Add Auto Syncing Mode for Code Studios
As an end user of DSS, I want to be able to auto-sync my changes in the Code Studio to DSS so that I don't lose my work if the Code Studio crashes or automatically shuts down. Auto-syncing would keep me from losing any work if the Code Studio gets turned off or I forget to sync my changes back…
-
Enhance Dataiku - Snowflake interoperability
I have encountered several challenges involving column name handling and data type management while integrating with Snowflake. I noted a few issues during a project to integrate the platform with Snowflake. It was no mean feat, especially when it came to managing schemas and types. I noticed that the problem is becoming…
-
Enhance Code Studios Templates APIs to support automated administration
Hi, the current Code Studios Templates APIs (see links below) don't support certain capabilities that we need. We would like to have Python APIs to: obtain the full list of build IDs related to a Code Studios Template, as shown in the Code Studios Template ⇒ Build History ⇒ Show Build drop-down. This is needed to be able to…
-
Dataiku Input recipe using Encoding
Please add an option for character encoding when specifying input files; even if I want to specify UTF-8, I can't do so on Dataiku and have to use another tool to convert the character encoding before it can be imported into the Dataiku flow.
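Until such an option exists, the workaround mentioned above (converting the file with another tool before import) could look roughly like the sketch below. It is plain Python with no Dataiku API involved; the function name `convert_encoding`, the file names, and the Shift_JIS source encoding in the usage comment are hypothetical, chosen only for illustration.

```python
from pathlib import Path


def convert_encoding(src: Path, dst: Path,
                     source_encoding: str,
                     target_encoding: str = "utf-8") -> None:
    """Re-encode a text file from source_encoding to target_encoding."""
    # newline="" disables newline translation on both sides, so the
    # original line endings (e.g. CRLF) are preserved in the output.
    with open(src, "r", encoding=source_encoding, newline="") as fin, \
         open(dst, "w", encoding=target_encoding, newline="") as fout:
        # Stream line by line so large files do not need to fit in memory.
        for line in fin:
            fout.write(line)


# Hypothetical usage: re-encode a Shift_JIS CSV as UTF-8 before upload.
# convert_encoding(Path("input_sjis.csv"), Path("input_utf8.csv"), "shift_jis")
```

A decoding error (`UnicodeDecodeError`) here usually means the assumed source encoding is wrong, which is worth catching before the file reaches the flow.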
-
End to End Possibility for Dev-Ops Implementation with Best Practices
Right now, Dataiku possesses a lot of unique abilities to develop scalable ML / Deep Learning / GenAI algorithms. In addition, Dataiku has facilitated collaborative development using flow zones etc. There are even a lot of data quality checks and metrics to facilitate operational efficiency and drift detection. But all…
-
Marimo Notebooks Integration in DSS
I'd like to propose the integration of Marimo notebooks alongside the existing Jupyter notebooks in DSS. Marimo is an innovative notebook environment that addresses several limitations of traditional Jupyter notebooks while maintaining compatibility. Here are some key advantages of Marimo notebooks: Code quality: Marimo…
-
Move objects and Zones on the Flow
I think it would be very useful to be able to move objects and flow zones around in the flow display. It appears Dataiku determines where each recipe, dataset, etc. goes in the flow, and I cannot edit that. I have used Alteryx in the past and it had that ability, which I liked. It allows me to organize the flow however I see…
-
Vertical Scrolling for Datasets
It would boost my productivity significantly if I could use "Shift" + "Scrollwheel" to vertically scroll, instead of finding the small scrollbar at the bottom of the dataset each time.