-
Run Macro xlsm file on Dataset
I have dataset in my project called abc_stacked, I have Macro file called xyz.xlsm which has Macro enabled, please let me know how to run macro on dataset in project?
-
SQL help case statement
Greetings all, Obviously there is a syntax error, is there some one who can correct? I am expecting a result set with columns acct, name, country, splind1, splindnot1, splind_all. Much appreciated! SELECT `acct`, `name`,`country`, case when `splind = '1' then sum((`transactions`)/`MV`)*100 as "splind1" when `splind` != '1'…
-
Parallel Processing Multiple Run in Single Code recipe
Hi Everyone, Is it possible in dataiku dss to run same code multiple time in parallel with same datasets in inputs and outputs? I am trying to execute single DSS Code Recipe(Python and SQL) multiple times in in parallel to perform on the 100 crores of data for 7 time duration with 200 number of brands with 500 columns. So…
-
Smoothing in charts, useless?
The Line > Smoothing option gives some weird artifacts with loops and multiple values of Y for a single value of X, which I found hilarious : The same line chart without smoothing. I'm guessing it's some kind of bezier curve but doing something with the control points that I don't understand. Does anybody know for sure…
-
The forum software is missing the Dataiku Version 12
The new post form has Dataiku Version 11-5, version 12 is not there,
-
Efficient Data Cleaning Techniques in Dataiku?
Hi all, How do you handle missing values and outliers in Dataiku? Any plugins or workflows you'd recommend for efficient data cleaning? Thanks for your tips!
-
Reverse Geocoding plugin installation
Hi community, I'm trying to install the Reverse Geocoding plugin but it takes forever. Is it very long to install or am I having a problem ? Thank you for your help ! Operating system used: Mac
-
Optimize Data read in time of the recipe in the Flow
Hi, I have a recipe flow which reads data from s3 and does some filtration, calculation on the input data and produces the final output. If the input dataset is huge, the DSS recipe takes a lot of time to read it, is there any parallel processing methodologies to read the data at higher speed?
-
Metrics display by partition in dashboard
Hello, If i have a partitionned dataset and want to display metrics for specific partitions instead of whole datases, what should i do? Because apparently it only allows for metrics for whole datasets and not for partitions. Thank you!
-
Export Isolation Forest Individual Interpretation / Heatmap
The Isolation Forest Report, under "Clusters" & Anomalies has a very nice heat maps highlighting which datapoints are anomalies. As far as I can tell there is no way to export this heatmap and no way to export a dataset that says which datapoints contributed most to the anomaly score. You can only export the dataset which…