What's your use case? Cartesian product is a common issue when joining dataset with a bad key. It's not always easy to detect and users can even forget to check for it because they think they know the…
I have 2 datasets that I'm trying to left join using a contains. but when I try to join them, I'm getting a string index out of bounds error, does this have something to do with the sized of my data s…
I'm starting to work with the Fuzzy Joins and having good luck. However, I'm trying to figure out when I might want to use a Relative Threshold related to the Right or Left Table when doing a overall …
Hi community, Pretty new to Dataiku but not to analytics and data visualization, integration etc. But I'm having an issue in the course. I'm doing the tutorial for the Fuzzy Join and am working on our…
Hi All, i have two tables for eg: Table 1 : RULEBP1BP1_PK123 table 2: RULEBP2BP2_PK156 i need to append these two tables to get the output as: RULEBP1BP1_PKBP2BP2_PK123 1 45 Any help would be apprecia…
Hello, Is it possible to join datasets using an array or list attributein one of these datasets ? I have a dataset with a list type attribute, I want to keep only the rows whose this attribute contain…
i need to do fuzzy match based on jaro distance .I have two columns (X, Y). I have two unique values in the Y column.The fuzzy match need to take the shortest string from the X column and it should co…
I'm experiencing something unexpected with the join recipe using a left join and post-join computer columns. I'm joining 2 datasets on a single column and then computing 4 additional columns after the…
Hello, My data looks like this: Recordsvaluesrecords_0_NameJimmyrecords_0_Number1records_0_StatusStudentrecords_1_NamesMarierecords_1_Number2records_1_StatusWorker And i want it looks like this: NameN…
Hi All, i'm using Hive engine, Is there a way to check the joining table is not empty before running the join recipe? scenario: Table A left Join Table B. Table A is empty. i'm getting below com.datai…