-
Enhance Dataiku - Snowflake interoperability
I have encountered several challenges involving column name handling and data type management while integrating with Snowflake. I'd pointed out a few things during a mission to integrate the platform on SF. It was no mean feat, especially when it came to managing schemas and types. I noticed that the problem is becoming…
-
Enhance Code Studios Templates APIs to support automated administration
Hi, The current Code Studios Templates APIs (see links below) don't support certain capabilities that we need. We would like to have Python APIs to: Obtain the full list of build IDs related to a Code Studios Template as shown in the Code Studios Template ⇒ Build History ⇒ Show Build drop down. This is needed to be able to…
-
Dataiku Input recipe using Encoding
Please add an option for character encoding when specifying input files; even if I want to specify UTF-8, I can't do so on Dataiku and have to use another tool to convert the character encoding before it can be imported into the Dataiku flow.
-
End to End Possibility for Dev-Ops Implementation with Best Practices
Write now Dataiku possess a lot of unique abilities to develop scalable ML / Deep Learning / GenAI algorithms. In addition to that Dataiku has facilitated collaborative development using flow-zones etc. Even there are a lot of Data quality checks and metrics to facilitate operational efficiency and drift detection. But all…
-
Move objects and Zones on the Flow
I think it would be very useful to be able to move objects and flow zones around in the flow display. It appears Dataiku determines where each recipe, dataset, etc go in the flow and I cannot edit that. I have used Alteryx in the past and it had that ability, which I liked. It allows me to organize the flow however I see…
-
Vertical Scrolling for Datasets
It would boost my productivity significantly if I could use "Shift" + "Scrollwheel" to vertically scroll. Instead of finding the small scrollbar in the bottom of the dataset each time.
-
Automated alerts from Unified Monitoring on bundle or API endpoint failure
We find the Unified Monitoring (UM) feature extremely useful as it allows us to see the health of our bundle and real-time prediction APIs. However, the is no way to be alerted if a deployment fails or if an API endpoint is down. We currently have some Python scripts that scrape the data from UM and then identify any…
-
googlesheets plugin feature: Ignore top n rows on import
Reading a google sheet with the plugin currently requires that header columns are in row 1. In the wild, a lot of users don't build sheets like that and the data begins some rows down the sheet. I suggest to add a feature of ignoring a number of top rows to correctly set the header row and table data.
-
Show all data points in Charts even when "Automatic" date range and Zoom are enabled
As per the title, the Chart seems to truncate at the 3rd or 2nd last data point when the Automatic date range and Zoom are enabled (see first screenshot). If I instead select the actual granularity of my data (e.g. Day from the X axis Date Range drop-down menu) then the last data points appear on the chart, BUT I lose the…
-
automatically remove obsolete versions of code envs on Automation Nodes
This product idea addresses the issue discussed here: Remove old versioned environments and kernels after importing a bundle. Recently, we faced an issue with one of our automation nodes. New deployments were failing because there was no space left on disk. Upon investigation, we discovered that a code environment was…