-
How to stack columns from one dataset
Hi, Here is a simplified schema of a basic dataset structure I need to reshape: firstname name vote col4 col5 col6 col7 col8 col9 etc.. ARTHAUD Nathalie 5 ARMAND Thierry 9 ARNAUD Bernard 6 etc.. ARTHAUD Nathalie 7 ARMAND Thierry 3 ARNAUD Bernard 8 etc.. The number of columns in this is variable but it will always be a…
-
Setting up Stages in Snowflake to work with Dataiku
In Dataiku DSS when working with Snowflake there is an option to use a stage. This apparently speeds up performance by increasing the number of different types of processes one can do inside Snowflake without having to ship data back to the DSS server for processing. Are folks using this feature? What has your experience…
-
Select Columns Outside of Join Recipe
I would like to be able to select the columns of data outside of a join recipe. A couple of examples: 1 - Usage of "unmatched rows". The column selection occurs after the join does not apply to data that isn't joined. In this case I am using both sets of data so need the option to select columns from both sets. 2 - Removal…
-
Configurable Timezone Display for Date Columns (Beyond UTC-only)
Current Situation Dataiku DSS has specific behaviors when handling time columns: When it recognizes time-related columns (e.g., date, timestamp_tz, or timestamp_ntz), it displays them as Date columns, rendering them in timestamp format (with both date and time components). A significant limitation is that Date columns…
-
Difference in minutes and seconds between two date
Hi everyone, How can I get the difference in "hours" "minutes" AND "seconds" (example: 14:30:24) from two date? Example: Col 1: 2018-01-01T09:50:15.000Z Col 2: 2018-01-01T10:07:55.000Z Col 3 (difference): 00:17:40 Thank so much for your help Maxime
-
"Fold" processors in visual recipe - Implement In-Database engine
Today, fold processors require the DSS engine because they are not supported as in-database processing, which forces dataiku designers to implement SQL recipes to perform fold operations. Most modern databases support "unpivot" syntax, which enable fold processors to be converted to SQL.…
-
Remove duplicate rows in one column
How can I remove duplicated rows in one column ?
-
Extract text from html stored in column
How would one extract the text and strip all the html. parseHTML() gives me just the html back, and htmlText() gives me the html as text (no brackets)
-
How to remove rows with all emptly values across all columns
Pretty sure this is a basic question, however I was not able to find a solution so far. After removing some columns, I ended up with rows that have an empty value in each column (tooltip shows "null"), which I want to remove now. I tried the following processor ("filter rows/cells on value"): However, the empty rows are…
-
How to replace scientific notation
Bonjour, I have a dataset with scientif value , i can't find the solution to convert them in numeric. Thanks