-
Difference in minutes and seconds between two dates
Hi everyone, how can I get the difference in hours, minutes, and seconds (example: 14:30:24) between two dates? Example: Col 1: 2018-01-01T09:50:15.000Z Col 2: 2018-01-01T10:07:55.000Z Col 3 (difference): 00:17:40 Thanks so much for your help, Maxime
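One possible approach outside the DSS formula language is plain Python, for example inside a Python recipe. The sketch below assumes both columns hold ISO 8601 strings in exactly the format shown in the question; `hms_difference` is a hypothetical helper name, not a Dataiku function.

```python
from datetime import datetime

# Assumed input format, taken from the sample values in the question.
ISO_FORMAT = "%Y-%m-%dT%H:%M:%S.%fZ"


def hms_difference(start_iso: str, end_iso: str) -> str:
    """Return the absolute difference between two timestamps as HH:MM:SS."""
    start = datetime.strptime(start_iso, ISO_FORMAT)
    end = datetime.strptime(end_iso, ISO_FORMAT)

    # Work in whole seconds; sub-second precision is dropped.
    total_seconds = int(abs((end - start).total_seconds()))

    # Hours are not wrapped at 24, so multi-day gaps show as e.g. 49:00:00.
    hours, remainder = divmod(total_seconds, 3600)
    minutes, seconds = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}"


print(hms_difference("2018-01-01T09:50:15.000Z", "2018-01-01T10:07:55.000Z"))
```

With the sample values from the question, this prints `00:17:40`.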
-
Using sample.py after exporting a model to Python
Hello, I’m trying to use the sample.py file after unzipping the archive of a model I exported. The model is a LightGBM model with a feature selection step. The DSS version is 12.6.5. However, the Python script crashes after the dummifier step with the error: Indexed_matrix.py Line 35 in _ remap _ key Remapped_key = (key[0],…
-
"Fold" processors in visual recipe - Implement In-Database engine
Today, fold processors require the DSS engine because they are not supported for in-database processing, which forces Dataiku designers to implement SQL recipes to perform fold operations. Most modern databases support "unpivot" syntax, which would enable fold processors to be converted to SQL.…
-
min between 2 dates
Hello, I'm trying to find the earlier of two date fields. I was thinking of using a min formula, but there may be missing values in these fields and the formula doesn't seem to work in that case. Is there another solution other than an "if then" formula?
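One way to handle the missing values, sketched in plain Python rather than the DSS formula language: drop the empty values first, then take the minimum of whatever remains. `earliest_date` is a hypothetical helper name, and missing values are assumed to arrive as `None`.

```python
from datetime import date
from typing import Optional


def earliest_date(a: Optional[date], b: Optional[date]) -> Optional[date]:
    """Return the earlier of two dates, ignoring missing (None) values."""
    # Keep only the values that are actually present.
    present = [d for d in (a, b) if d is not None]

    # If both fields are missing, there is no date to return.
    return min(present) if present else None


print(earliest_date(date(2024, 3, 1), None))              # only one value present
print(earliest_date(date(2024, 3, 1), date(2023, 12, 31)))  # both present
```

When only one field is filled, that field is returned; when both are empty, the result stays empty.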
-
Recursive query in Dataiku
Hello all, is there any way to use a recursive query in Dataiku with the DSS engine? Thanks all
-
Is it possible to write multiple datasources at a time using python?
I have a data processing task that requires python. Specifically, I'm reading data from a proprietary file format, then writing the extracted data out to the database. I want to split this data into multiple datasources, one for training data, one for holdouts, and one for bad data (so I can analyze corrupted data from the…
-
Question about the behaviour of the "Split and Fold" prepare processor and NULL handling
Let's say I have a table containing the following data:
ID | Fruit | Other Random Data
1 | Apple, Pear, Cherry | aksdhkajshda
2 | NULL | kasdhjkasjhkas
3 | Watermelon | ajshdgjashgdjashg
If I run the Split and Fold prepare step on the Fruit column, I will get the following result:
ID | Fruit | Other Random…
-
How to use regex in DSS Formula replace()?
I'm trying to use a regex in a replace() using the Formula processor in a prepare recipe. According to the documentation/docstring this should work, but I can't get it to recognize the text to replace parameter as a regex and not as literal text. I'm using a formula as below. How can I get the formula language to interpret…
-
Scenario halting when no new data is available
I've got a data pipeline that runs on scenarios. There may or may not be new data every time the scenario runs. I have been doing some inefficient things each time the scenario runs. 1. I have been guaranteeing that I get at least one old record every time the scenario runs, in order to ensure a stable schema. I have…
-
Trouble with IF logic on formula step
I'm getting some unexpected results with this formula step: some retired customers are showing the max of L12/2019/2020/2021/2022 when they should be showing L12. What I'm looking to do is this: - If a customer is retired (using Retired Flag = TRUE) or if they have bad debt (using Bad Debt Status = TRUE), I want to always…