Dataiku compare data between two columns
I have two datasets that I uploaded from my local machine. I join them on a key and merge the columns. When I then compare two column values, I use a formula like:
if(colA=ColB, "Match", "No Match")
Even though some column values are exactly the same, the result still shows as No Match.
How can I fix this?
Answers
-
Turribeach
The most likely scenario is that the column values are NOT exactly the same. They probably contain spaces or other hard-to-see differences. Post some samples of your files if you can't find the differences.
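As an illustration of the hidden-difference scenario described above, here is a minimal Python sketch (for example, something one might run in a notebook on a few sample values). The helper name `describe_difference` and the sample values are hypothetical and not from the original datasets; the checks cover only whitespace and letter case, not every possible invisible difference.

```python
def describe_difference(a: str, b: str) -> str:
    """Explain why two cell values differ (hypothetical helper for illustration)."""
    if a == b:
        return "Match"
    # strip() removes leading/trailing whitespace, including tabs and newlines.
    if a.strip() == b.strip():
        return "Differs only by leading/trailing whitespace"
    # casefold() normalizes letter case for comparison.
    if a.strip().casefold() == b.strip().casefold():
        return "Differs only by letter case"
    return "No Match"


# repr() makes otherwise invisible characters (trailing spaces, tabs) visible.
print(repr("ABC123 "), repr("ABC123"))
print(describe_difference("ABC123 ", "ABC123"))
```

If the values turn out to differ only by whitespace or case, trimming or normalizing both columns before the comparison is one possible way to get them to match.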
-
Please take a look at the column values: even though they are exactly the same, I am unable to get a "Match" with the formula used.
Revised formula:
if(toNumber(format("%.20f",ColA))==toNumber(format("%.20f",ColB)), "Match", "No Match")
-
Turribeach
This is likely an issue with floating-point numbers. Have a read of this post:
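To illustrate the floating-point point in general terms, here is a minimal Python sketch showing why two numbers that display identically can fail an exact equality test, and how a tolerance-based comparison behaves instead. The function name `compare` and the tolerance values are assumptions chosen for illustration; appropriate tolerances depend on the data.

```python
import math


def compare(a: float, b: float, rel_tol: float = 1e-9, abs_tol: float = 1e-12) -> str:
    """Return "Match" if a and b are equal within a tolerance (illustrative defaults)."""
    # math.isclose treats values as equal when their difference is within
    # the relative or absolute tolerance, instead of requiring bit-for-bit equality.
    return "Match" if math.isclose(a, b, rel_tol=rel_tol, abs_tol=abs_tol) else "No Match"


# Exact equality fails because 0.1 + 0.2 is stored as 0.30000000000000004.
print(0.1 + 0.2 == 0.3)         # False
print(compare(0.1 + 0.2, 0.3))  # Match
```

The same idea, comparing the absolute difference against a small threshold or rounding both values to a fixed number of decimals first, may carry over to the formula used in the thread, though the exact functions available there are not confirmed here.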
