-
Clustering datasets having both numerical and categorical variables
Hello World! Is there a way to run clustering directly on a dataset that has both numerical and categorical variables in Dataiku DSS? Thank you!
-
How to fill the column with the value in a row based on a condition
Hi Team, I have a use case where I have to replace a value based on a row value and another condition. For instance, I have 3 columns (A, B, C) and have to generate a new column (D):

Input:
A  B      C
2  red    red
3  red    blue
4  green  pink

Output:
A  B      C     D
2  red    red   2
3  red    blue  2
4  green  pink  2

In the above example I have…
-
Running your ML flow from a Bokeh webapp submit button in Dataiku
Hello everyone! I am working on a project where we have a webapp in Bokeh and an ML pipeline in the Flow section of Dataiku. The application has several input fields whose values should be supplied to the ML pipeline residing in the Flow. Once the user presses the submit button in the Bokeh webapp, the ML pipeline in the Flow should…
-
Date formula
Hello dataikers, I would like to know if one date falls between two distinct dates. For example, if this date is later than one date and earlier than another, then put a 1 in the column, or a 0 if it is not. I know that I have to use an IF function, but it doesn't seem to work! Thanks for your help.
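The check described in this question can be sketched in plain Python. This is only an illustration of the logic, not DSS formula syntax; the function name `in_range_flag` and the choice of strict bounds (the date must be strictly later than the start and strictly earlier than the end) are assumptions.

```python
from datetime import date


def in_range_flag(d, start, end):
    """Return 1 if d lies strictly between start and end, else 0.

    Hypothetical helper: the name and the strict bounds are assumptions.
    """
    return 1 if start < d < end else 0


# Usage: flag a date against a lower and an upper bound.
flag = in_range_flag(date(2021, 5, 10), date(2021, 1, 1), date(2021, 12, 31))
print(flag)
```

If the bounds should be inclusive, replacing `<` with `<=` is the only change needed.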
-
Bulk replace a connection for datasets
Hello, we want to import multiple projects and, while importing into a new environment, we would like to bulk replace the connection from "file system" to "HDFS". The standard export/import only allows choosing or replacing connections of the same type, but in our case we would like to change the connection type itself. Since there are…
-
How to force dataset icon to be blue in Flow (when table is not empty)
When a dataset is created, the icon is a white square (with a blue dashed line). Then, when the table is populated by a recipe, it becomes a blue square. However, I have a dataset which is populated by a custom library I am using. Hence, the dataset is not empty anymore, but the square stays white. I guess it only becomes blue…
-
Date Difference
Hi, I want to calculate the date difference between 2 columns, excluding weekends. Is there any formula with which I can calculate this in a Visual Recipe? I managed to solve it using a Python Code recipe, but I want to do the same in a Visual Recipe. Thanks
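For reference, the Python side of this question could look roughly like the sketch below. It is not the poster's code and not a Visual Recipe formula; the function name `weekday_diff` and the half-open interval convention (start included, end excluded) are assumptions.

```python
from datetime import date, timedelta


def weekday_diff(start, end):
    """Count Monday-to-Friday days in the half-open interval [start, end).

    Hypothetical helper: the name and the interval convention are assumptions.
    """
    if end < start:
        return -weekday_diff(end, start)
    full_weeks, extra = divmod((end - start).days, 7)
    count = full_weeks * 5  # every full week contributes 5 weekdays
    # The leftover days have the same weekdays as the first `extra` days
    # after `start`, because whole weeks do not shift the weekday.
    for i in range(extra):
        if (start + timedelta(days=i)).weekday() < 5:  # 0-4 is Mon-Fri
            count += 1
    return count


# Usage: Friday 2024-01-05 to Monday 2024-01-08 skips the weekend.
print(weekday_diff(date(2024, 1, 5), date(2024, 1, 8)))
```

Public holidays are not handled here; they would need a separate calendar.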
-
Is there a way to extract the test set of a model to run comparative experiments?
I would like to score my own prediction method against the same test set that DSS generated to test a model. Because the test set is generated by a randomized sampling method, this is quite tricky. Is there a simple way to do that? Maybe by extracting the test set to a dataset on which I could do some analysis?…
-
Can you fix the K-means train results?
We fixed the seed in K-means, but the clusters we got from each training run were different. Specifically, the variable importance and silhouette were different. Are there any other settings needed besides the seed to fix the results? We think that results changing every time under the same conditions is a big problem for business…
-
How do I import an IPython notebook?