Fuzzy Match in Alteryx equivalent tool in Dataiku

Registered Posts: 4 ✭✭

Hi Team,

I am migrating a workflow from Alteryx to Dataiku where I encountered a tool called Fuzzy Match in the Alteryx which actually compared 3 columns and generate a new column that satisfies partial matching criteria.

Below is the input data

Below is the configuration in Alteryx tool

Below is the sample output in Alteryx

How to achieve this in DataIku? I tried Fuzzy Join but of no luck as I was unable to configure it and didn't find the similar options in the configuration settings.

Please anyone help me in doing this and it will be great if explained with screenshots of configuration from DataIku or share the sample workflow here

Thanks & Regards

Dilver

Operating system used: Windows

Answers

  • PartnerApplicant, Dataiku DSS Core Designer, Dataiku DSS ML Practitioner, Dataiku DSS Adv Designer, Registered Posts: 103 ✭✭✭✭✭✭
    edited September 2024

    Hello,

    I don't know exactly what alteryx does, but if I understand your description correctly, it's like a relative column matching, which exists on Dataiku :

    Site faviconFuzzy join: joining two datasets — Dataiku DSS 13 documentation

    So you can match columns with this.

    But as a developper plus what I can see in your screenshots, may it's more advanced, I know there are python libraries where in two lines you can define a treshold and match columns with conditions.

    I checked this one, which you can apply the same treshold params as in your config screenshot above;

    Site faviconfuzzywuzzy

    Best,

Welcome!

It looks like you're new here. Sign in or register to get started.

Welcome!

It looks like you're new here. Sign in or register to get started.