Fuzzy Match in Alteryx equivalent tool in Dataiku

Dilver
Dilver Registered Posts: 4 ✭✭

Hi Team,

I am migrating a workflow from Alteryx to Dataiku where I encountered a tool called Fuzzy Match in the Alteryx which actually compared 3 columns and generate a new column that satisfies partial matching criteria.

Below is the input data

Below is the configuration in Alteryx tool

Below is the sample output in Alteryx

How to achieve this in DataIku? I tried Fuzzy Join but of no luck as I was unable to configure it and didn't find the similar options in the configuration settings.

Please anyone help me in doing this and it will be great if explained with screenshots of configuration from DataIku or share the sample workflow here

Thanks & Regards

Dilver

Operating system used: Windows

Answers

  • Grixis
    Grixis PartnerApplicant, Dataiku DSS Core Designer, Dataiku DSS ML Practitioner, Dataiku DSS Adv Designer, Registered Posts: 80 ✭✭✭✭✭
    edited September 10

    Hello,

    I don't know exactly what alteryx does, but if I understand your description correctly, it's like a relative column matching, which exists on Dataiku :

    https://doc.dataiku.com/dss/latest/other_recipes/fuzzy-join.html

    So you can match columns with this.

    But as a developper plus what I can see in your screenshots, may it's more advanced, I know there are python libraries where in two lines you can define a treshold and match columns with conditions.

    I checked this one, which you can apply the same treshold params as in your config screenshot above;

    https://pypi.org/project/fuzzywuzzy/

    Best,

Setup Info
    Tags
      Help me…