Dataiku remove duplicate data
Options

Chandrima
Registered Posts: 10 ✭✭
Hi all in dataiku after doing a left join operation I am removing all the duplicate values, but when I exported the dataset as an excel I noticed there are few rows which are exactly matching but are not removed or considered duplicate. Once I did the TRIM operation in excel then duplicate values were considered and removed. How can we tackle this in dataiku.
Answers
-
Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 2,528 Neuron
Use a Prepare recipe with the Transform String processor and change the mode to "Remove leading/trailing white spaces".
-
Can you please explain a little bit more?
-
Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 2,528 Neuron
Follow the steps shown on this documentation page:
Use the Transform String processor and change the mode to "Remove leading/trailing white spaces".