-
Reorder column list in a Prepare Recipe Move Step
User Story: As a data analyst who likes their columns in a certain order for easy data evaluation, I would like to be able to reorder the column names in a move recipe step after initial recipe creation, in the same way I can reorder text replacements in a text replacement step. This would save time when I don't get the…
-
Allow project variables to be overridden at the Flow Zone level
Hi, we routinely have to override project variables at the Python recipe level. This gets tedious, and it would be great if it could be done at the flow zone level instead. Thanks.
-
Allow export of datasets to parquet file format
Hi, it would be great, from both a file-size and an export-time perspective, to allow users to export datasets as Parquet. Thanks.
-
Enhance Managed Folders APIs to be able to handle local-vs-non-local folders automatically
Dataiku supports creating Managed Folders over different storage layers, including local storage, network storage, cloud storage (i.e. buckets) or even SharePoint. However, the way you deal with these folders depends on where the API client is located (inside or outside DSS) and where the storage is:…
-
Add filter capability to the run Scenario step / run after Scenario trigger
The run Scenario step / run after Scenario trigger shows all available scenarios, which in a large Dataiku instance makes for a very long drop-down list (see first image below). This idea is to add a filter box like the one used in the Run As scenario Settings so that you can easily search and find the desired…
-
Scenario re-run capability
While Scenario steps allow for automatic retries, there is no built-in feature for Scenarios themselves. One possible workaround is to use a wrapper Scenario which runs a Scenario via the Run Scenario step and can have auto-retries defined. However, this is not a scalable solution as it means duplicating all scenarios to…
-
Easier access to Determine Data Set Rebuild Behavior Setting
User Story: As a Dataiku Flow designer who periodically uses the Rebuild Behavior setting (explicit or write_protected) to keep some datasets from automatically updating when flows are rebuilt, I can find it difficult to see which data objects are going to rebuild when I rebuild a flow zone, or why some data objects have not…
-
Include VI, PR, AE, AA, and AP in US States
The "US State" datatype doesn't recognize PR (Puerto Rico), VI (Virgin Islands), AP (military bases in the Pacific), AA (military in the Americas outside of Canada and the US), or AE (military bases in the rest of the world). While these aren't actually states, they are valid state codes for USPS.
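
To illustrate the request, here is a minimal Python sketch of a check that accepts the five extra codes alongside the 50 state abbreviations. The names `US_STATE_CODES`, `USPS_EXTRA_CODES` and `is_usps_state` are hypothetical and are not part of Dataiku; how the built-in "US State" meaning is actually implemented is not known here.

```python
# Hypothetical sketch; this is NOT how Dataiku implements the "US State" meaning.

# The 50 two-letter state abbreviations.
US_STATE_CODES = frozenset("""
AL AK AZ AR CA CO CT DE FL GA HI ID IL IN IA KS KY LA ME MD MA MI MN MS MO
MT NE NV NH NJ NM NY NC ND OH OK OR PA RI SC SD TN TX UT VT VA WA WV WI WY
""".split())

# The extra codes named in this idea: two territories and three military codes.
USPS_EXTRA_CODES = frozenset({"PR", "VI", "AP", "AA", "AE"})


def is_usps_state(value: str) -> bool:
    """Return True if value is a state code or one of the extra USPS codes."""
    code = value.strip().upper()
    return code in US_STATE_CODES or code in USPS_EXTRA_CODES
```

With a check like this, values such as `"PR"` or `"ae"` would validate, while an unknown code such as `"ZZ"` would still be rejected.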
-
Add support for Connection as SQL Recipe Input
The challenge: When we write SQL recipes, we sometimes have several tables attached (10 tables or more). It can be very tedious to add all the tables in the flow area (Dataset > SQL Databases > Oracle then configure table) and can lead to a messy workflow. The solution: Add the possibility of selecting the entire connection…
-
Update the Data "Meaning" of Email Addresses to accept RFC 6531 addresses, which allow some UTF-8
User Story: As a data analyst who works with people from around the world, I find it challenging that the email address meaning in data views does not currently take into account local parts of email addresses (the part before the @) that include characters beyond ASCII. The use of UTF-8 strings has been…