-
Count number of rows depending on a condition
Hello, I would like to increment a counter over rows grouped by some variables, but only when a condition between a date and its lag is true. The idea is the following: if it is the first time we encounter an id, then var = 1; else if id = id_lag and dat - dat_lag > 30, then var = var + 1. I tried to do this with a window recipe but…
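The rule described in this question can be sketched in plain Python. This is only an illustration, not a window recipe: it assumes the rows are already sorted by id and then by date, that the gap is measured in days, and the names `count_with_gap` and `max_gap_days` are made up for the example.

```python
from datetime import date


def count_with_gap(rows, max_gap_days=30):
    """Assign a counter to each (id, date) row.

    Assumes rows are sorted by id, then by date.
    The counter restarts at 1 for a new id and grows by 1
    whenever the gap to the previous row of the same id exceeds max_gap_days.
    """
    out = []
    prev_id, prev_date, var = None, None, 0
    for row_id, dat in rows:
        if row_id != prev_id:
            var = 1  # first time this id is encountered
        elif (dat - prev_date).days > max_gap_days:
            var += 1  # same id, gap larger than the threshold
        out.append((row_id, dat, var))
        prev_id, prev_date = row_id, dat
    return out


# Hypothetical sample data, not taken from the question.
rows = [
    ("a", date(2024, 1, 1)),
    ("a", date(2024, 1, 15)),
    ("a", date(2024, 3, 1)),
    ("b", date(2024, 1, 1)),
]
result = count_with_gap(rows)
```

The same logic could live in a Python recipe after sorting the input; whether a window recipe alone can express the running increment is left open here.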
-
Add an automatic timestamp to a dataset name
Hello! I would like to automatically add a timestamp to my output dataset names (and then export them to folders). Does someone know how to do that? For example, on September 10th, my dataset would be named "Dataset_100924".
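Building the name itself is straightforward in Python. The sketch below assumes that "100924" means day, month, two-digit year (DDMMYY) and that the year in the example is 2024; `dated_name` is a hypothetical helper, and how the resulting name is applied to a dataset in DSS is not covered.

```python
from datetime import date


def dated_name(base, day=None):
    """Return base + "_" + DDMMYY for the given day (today if omitted)."""
    day = day or date.today()
    return f"{base}_{day.strftime('%d%m%y')}"


# Assumed date for the example in the question: 10 September 2024.
example = dated_name("Dataset", date(2024, 9, 10))
```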
-
Export datasets to folders
Hello! I would like to export several datasets of my project to the same folder (one folder for each required date). Does someone know how to do that? Thanks!
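As a generic illustration of "one folder per date", here is a plain-Python sketch that writes several small tables as CSV files into a folder named after a date. It does not use any Dataiku API: the tables are in-memory lists, and `export_tables`, the table names, and the ISO folder naming are all assumptions made for the example.

```python
import csv
import tempfile
from datetime import date
from pathlib import Path


def export_tables(tables, root, day):
    """Write each table (name -> list of rows) as CSV into root/<YYYY-MM-DD>/."""
    folder = Path(root) / day.isoformat()
    folder.mkdir(parents=True, exist_ok=True)
    written = []
    for name, rows in tables.items():
        path = folder / f"{name}.csv"
        with path.open("w", newline="") as handle:
            csv.writer(handle).writerows(rows)
        written.append(path)
    return written


# Hypothetical tables standing in for project datasets.
tables = {
    "customers": [["id", "name"], [1, "Sally"]],
    "orders": [["id", "amount"], [10, 42.0]],
}

with tempfile.TemporaryDirectory() as root:
    paths = export_tables(tables, root, date(2024, 9, 10))
    names = sorted(p.name for p in paths)
    parent = paths[0].parent.name
    all_exist = all(p.exists() for p in paths)
```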
-
How to go from flat relational data to nested object oriented data
I am trying to combine multiple rows into a single nested JSON object. I know how to do the opposite (i.e. flatten), but cannot find the right tool to go the other direction. As an example, I start with this data:
Class, Student, Grade
1, Sally, A
1, Matt, A
1, Phil, C
What I want as an output is a single record: Class,…
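One way to nest the sample rows is to group them by Class in Python. The desired output is cut off in the question, so the shape used below (one record per class holding a list of student objects under a key called "Students") is purely an assumption, as is the helper name `nest_by`.

```python
import json
from collections import defaultdict

# The three sample rows from the question.
rows = [
    {"Class": 1, "Student": "Sally", "Grade": "A"},
    {"Class": 1, "Student": "Matt", "Grade": "A"},
    {"Class": 1, "Student": "Phil", "Grade": "C"},
]


def nest_by(rows, key, child_field="Students"):
    """Group rows by `key`; the remaining columns become child objects."""
    groups = defaultdict(list)
    for row in rows:
        child = {k: v for k, v in row.items() if k != key}
        groups[row[key]].append(child)
    return [{key: value, child_field: children} for value, children in groups.items()]


nested = nest_by(rows, "Class")
as_json = json.dumps(nested[0])  # JSON text for the single nested record
```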
-
Fuzzy Match in Alteryx equivalent tool in Dataiku
Hi Team, I am migrating a workflow from Alteryx to Dataiku, where I encountered an Alteryx tool called Fuzzy Match, which compares 3 columns and generates a new column that satisfies partial matching criteria. Below is the input data. Below is the configuration in the Alteryx tool. Below is the sample output in…
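For readers unfamiliar with partial matching, the sketch below shows one simple form of it using Python's standard `difflib`. It is not a reproduction of the Alteryx Fuzzy Match tool or its configuration (which is not visible here): the similarity measure, the 0.8 threshold, and the rule that all three columns must match are assumptions chosen for illustration.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Similarity ratio in [0, 1], ignoring case and surrounding spaces."""
    return SequenceMatcher(None, a.lower().strip(), b.lower().strip()).ratio()


def is_partial_match(left, right, threshold=0.8):
    """Compare two records column by column.

    left and right are tuples of column values; the records count as a
    match only if every column pair reaches the threshold.
    """
    return all(similarity(a, b) >= threshold for a, b in zip(left, right))


# Hypothetical records with three columns each.
match = is_partial_match(("Jon Smith", "Paris", "ACME"), ("John Smith", "Paris", "Acme"))
no_match = is_partial_match(("Jon Smith", "Paris", "ACME"), ("Maria Lopez", "London", "Globex"))
```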
-
Question about the if formula using a generic character string
Hello, I would like to use an "if" formula and apply a condition only if the character string begins with "Adults". Thanks for your help.
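The condition being asked about is a prefix test. In Python it looks like the sketch below; the function name `label` and the two output labels are placeholders, and the equivalent expression in a Dataiku formula is not shown.

```python
def label(value, prefix="Adults"):
    """Return one label when the text starts with the prefix, another otherwise."""
    return "adult group" if str(value).startswith(prefix) else "other"


# Hypothetical values for illustration.
labels = [label(v) for v in ["Adults 18-25", "Young Adults", "Children"]]
```

Note that the test is case-sensitive and only looks at the start of the string, so "Young Adults" does not qualify.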
-
Who should be able to deploy bundles to the Production Automation node?
A user must be Admin on the Automation node project in order to be able to deploy a bundle there. But Admin permissions in Production seem like too much for an ordinary user to have, given the risk of impact on production assets. Should deployment then be restricted to a smaller set of users, such as project leads or a…
-
How can I add data to an existing dataset using a Dataiku DSS API?
Operating system used: Dataiku
-
Same input and output external database table in python recipe
Hi there, I am using an external Snowflake table to capture metrics, say Table ABC. During training/retraining, I would like the flow to read the same table ABC before updating the model for predictions, and then write the new metrics and the path to the model to the same table ABC. I have written the Python recipe to do this…
-
Oops: an unexpected error occurred java.lang.IllegalStateException: Expected a double but was BEGIN
Hi, I am working on the project "Advanced Designer Assessment". After modifying the Prepare recipe to add the column "qualifies", I get this error message when I open the dataset "Online_Retail_Prepared": Oops: an unexpected error occurred java.lang.IllegalStateException: Expected a double but was BEGIN_ARRAY at line 377 column 21 path…