The new post form has Dataiku Version 11-5, version 12 is not there,
Hi all, How do you handle missing values and outliers in Dataiku? Any plugins or workflows you'd recommend for efficient data cleaning? Thanks for your tips!
Hi community, I'm trying to install the Reverse Geocoding plugin but it takes forever. Is it very long to install or am I having a problem ? Thank you for your help ! Operating system used: Mac
Hi, I have a recipe flow which reads data from s3 and does some filtration, calculation on the input data and produces the final output. If the input dataset is huge, the DSS recipe takes a lot of time to read it, is there any parallel processing methodologies to read the data at higher speed?
Hello, If i have a partitionned dataset and want to display metrics for specific partitions instead of whole datases, what should i do? Because apparently it only allows for metrics for whole datasets and not for partitions. Thank you!
The Isolation Forest Report, under "Clusters" & Anomalies has a very nice heat maps highlighting which datapoints are anomalies. As far as I can tell there is no way to export this heatmap and no way to export a dataset that says which datapoints contributed most to the anomaly score. You can only export the dataset which…
Hello, I am new to ML and I'm trying to create a very basic recommendation system for a very simple dataset I have. My dataset only contains productID and customerID and I have performed auto collaborative filtering (using the recommendation system plugin) on this dataset to generate a score. I want an item based…
I notice that the support ticket are only visible to the reporter. In my case, we are two administrators and it would be nice that tickets are shared. I know that I can "Add people to the conversation" so that they get emails with the updates. I would like really to be about to open the…
According to: https://knowledge.dataiku.com/latest/kb/data-prep/prepare-recipe/How-to-standardize-text-fields-using-fuzzy-values-clustering.html You can choose a clustering strategy of “Fuzzy” or “Highly fuzzy” to cluster and merge similar text in the dataset. What is this fuzzy matching based on? Damerau–Levenshtein? If…
Hi, I have a string 'Ranjith Jose' in column 'Name', which has double space between first and last name. I have used Replace prepare recipe. Matching Mode: Regular Expression Normalization Mode: Exact Replacements \s+ --> No Value (I need to keep one space instead of 'No Value'.) Please assist. Thanks, Ranjith Jose.
Create an account to contribute great content, engage with others, and show your appreciation.