-
Looking to replicate a SUM(COUNTIF) formula in Dataiku
I am working on a scorecard in Dataiku and I would like to calculate the percentage of completion in a set number of columns. Basically, I would like to replicate this formula in excel: =SUM(COUNTIF(ColumnX:ColumnXX,"*")/Total Number of Columns) and am having issues. The columns are a mix of strings, integers, and text,…
-
Prevent Y / N column from being Auto-Detected as Boolean column
For my dataset I have over 200 columns with many of them Y/N flag columns. I would like to keep them as text and not infer the type. There is a checkbox that I am leaving unchecked: Normalize booleans Normalize all possible boolean values (0, 1, yes, no, …) to 'true' and 'false'
-
How do i create a categorisation model for a reviews dataset
Hi there - new to dataiku, Lets say i have an excel sheet of 2 columns where one has app reviews and the other has dates they were posted. Is there a video tutorial anywhere or example where i can create a model to categorise the app reviews into categories eg) ux/ui problem or customer service problem as well as include…
-
How to correctly do time conversions
I have a column that has been parsed and is in UTC, when I try to format the date to be in eastern / New York time I get a new column that is -5 hours, but isn't the current the current difference -4 hours? I'm sure this has something to do with daylight savings time vs normal time, but I just want to ensure that my…
-
How to stop keep formatting using prepare / formula recipe
I have string value called hours, formatted as "00" and a value called minutes formatted as "07". Im trying to use the concat function to create an HH:mm value = 00:07 but for some reason when using concat, it gets rid of trailing 0s and results in 0:7. is there a way around this? if not is there any other way I can create…
-
Schema changes from double to integer all the time.
I have a column with round floats (e.g. only 1.0, 100.0, etc.) and manually defined it as type "double" in a visual recipe. However, once the dataset is loaded into another visual recipe, the visual recipe automatically changes all the column types back to "integer" causing a failure everytime there appear a non-round…
-
Using date in DataIKU
Hi, Despite going through documentation multiple times, I still don't really understand how dates work in DSS. I'm importing dataset from a connection. Without turning on any of the options in Date & Time handling, this is how data looks like: It says that the data type is string, while in the database itself it is, in…
-
How to scrub for keywords in an excel sheet for an email inbox
I currently have a excelsheet (sample data in image attached) showing the emails in my email with their body, subject, date sent etc. How do i make Dataiku scrub through the bodies to retrieve common keywords? eg) in the 9 emails there, dataiku will have 3 of them show up as "marketing enquiries" and so on. I believe text…
-
Influx DB to Dataiku
Hi all! Do anyone have solution to connect Influx DB to Dataiku? Operating system used: Linux
-
How to add a months to given date
Hi Team. Is there a dataprocessor or function to add months to a given date. Something like for ex : dateadd function in SQL. ex 1: for todays date(24-Aug-2020), i would like to add 4 months then it would be 24-Dec-2020 ex2: for todays date(24-Aug-2020), i would like to add - 4 months then it would be 24-Apr-2020. I know a…