Added on November 2, 2022 5:07PM
DSS supports fuzzy joins with four built-in distance algorithms: Damerau–Levenshtein, Hamming, Jaccard, and Cosine (see https://doc.dataiku.com/dss/latest/other_recipes/fuzzy-join.html?highlight=damerau#text-columns).
Is it possible to compute the distance between two strings with those algorithms without performing a fuzzy join, simply adding a new column that holds the distance to the dataset?
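To make the request concrete, here is a rough sketch of the kind of per-row computation I have in mind, written as plain Python helpers. All function names are hypothetical, and this is not DSS's own implementation: the Damerau–Levenshtein part uses the restricted "optimal string alignment" variant, and Jaccard and Cosine are computed on whitespace-separated tokens, which may differ from how the fuzzy join recipe tokenizes text.

```python
from collections import Counter
from math import sqrt


def osa_distance(a: str, b: str) -> int:
    """Optimal string alignment distance (restricted Damerau-Levenshtein)."""
    rows, cols = len(a) + 1, len(b) + 1
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i
    for j in range(cols):
        d[0][j] = j
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # substitution
            )
            # Transposition of two adjacent characters
            if i > 1 and j > 1 and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]:
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[-1][-1]


def hamming_distance(a: str, b: str) -> int:
    """Number of differing positions; only defined for equal-length strings."""
    if len(a) != len(b):
        raise ValueError("Hamming distance requires strings of equal length")
    return sum(x != y for x, y in zip(a, b))


def jaccard_distance(a: str, b: str) -> float:
    """1 - |A & B| / |A | B| on whitespace tokens (tokenization is an assumption)."""
    set_a, set_b = set(a.split()), set(b.split())
    union = set_a | set_b
    if not union:
        return 0.0  # two empty strings are treated as identical
    return 1.0 - len(set_a & set_b) / len(union)


def cosine_distance(a: str, b: str) -> float:
    """1 - cosine similarity of token-count vectors (tokenization is an assumption)."""
    count_a, count_b = Counter(a.split()), Counter(b.split())
    if not count_a and not count_b:
        return 0.0
    if not count_a or not count_b:
        return 1.0
    dot = sum(count_a[t] * count_b[t] for t in count_a)
    norm_a = sqrt(sum(v * v for v in count_a.values()))
    norm_b = sqrt(sum(v * v for v in count_b.values()))
    return 1.0 - dot / (norm_a * norm_b)
```

In a Python recipe, something like this could presumably be applied row by row to two text columns to fill a new distance column, but I would prefer a built-in processor if one exists.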
Operating system used: Windows 10