-
The forum software is missing the Dataiku Version 12
The new post form has Dataiku Version 11-5, version 12 is not there,
-
Efficient Data Cleaning Techniques in Dataiku?
Hi all, How do you handle missing values and outliers in Dataiku? Any plugins or workflows you'd recommend for efficient data cleaning? Thanks for your tips!
-
Reverse Geocoding plugin installation
Hi community, I'm trying to install the Reverse Geocoding plugin but it takes forever. Is it very long to install or am I having a problem ? Thank you for your help ! Operating system used: Mac
-
Optimize Data read in time of the recipe in the Flow
Hi, I have a recipe flow which reads data from s3 and does some filtration, calculation on the input data and produces the final output. If the input dataset is huge, the DSS recipe takes a lot of time to read it, is there any parallel processing methodologies to read the data at higher speed?
-
Metrics display by partition in dashboard
Hello, If i have a partitionned dataset and want to display metrics for specific partitions instead of whole datases, what should i do? Because apparently it only allows for metrics for whole datasets and not for partitions. Thank you!
-
Export Isolation Forest Individual Interpretation / Heatmap
The Isolation Forest Report, under "Clusters" & Anomalies has a very nice heat maps highlighting which datapoints are anomalies. As far as I can tell there is no way to export this heatmap and no way to export a dataset that says which datapoints contributed most to the anomaly score. You can only export the dataset which…
-
Schema for ML prediction analysis for Recommendation system
Hello, I am new to ML and I'm trying to create a very basic recommendation system for a very simple dataset I have. My dataset only contains productID and customerID and I have performed auto collaborative filtering (using the recommendation system plugin) on this dataset to generate a score. I want an item based…
-
Suggestion of the handling of Support Cases
I notice that the support ticket are only visible to the reporter. In my case, we are two administrators and it would be nice that tickets are shared. I know that I can "Add people to the conversation" so that they get emails with the updates. I would like really to be about to open the…
-
What formula is used for the fuzzy values clustering in the prepare recipe on DSS?
According to: https://knowledge.dataiku.com/latest/kb/data-prep/prepare-recipe/How-to-standardize-text-fields-using-fuzzy-values-clustering.html You can choose a clustering strategy of “Fuzzy” or “Highly fuzzy” to cluster and merge similar text in the dataset. What is this fuzzy matching based on? Damerau–Levenshtein? If…
-
Regular Expression Replace double spaces in a string with one space.
Hi, I have a string 'Ranjith Jose' in column 'Name', which has double space between first and last name. I have used Replace prepare recipe. Matching Mode: Regular Expression Normalization Mode: Exact Replacements \s+ --> No Value (I need to keep one space instead of 'No Value'.) Please assist. Thanks, Ranjith Jose.