Comparing two data sets at Column level

bhakuniv

Hi All

I am trying to compare two datasets that have the same columns and find the differences at the column level. The goal is to count, for each column, the number of rows (identified by a unique key) whose values match exactly in both datasets.

Please refer to the sample data below.

Dataset 1

ID  Name  Age  Country
1   ABC   21   USA
2   XYZ   23   UK
3   DEF   67   CHN

Dataset 2

ID  Name  Age  Country
1   ABC   22   USA
2   XYZ   23   UK
3   DEF   67   SWZ

Output

Column   Count of ID
Name     3
Age      2
Country  2

Thanks

ATsao
Dataiker

Hi,

This is probably best handled by writing your own code in a Python or R recipe. Add both datasets as inputs to the code recipe and write the result to an output dataset. More information about Python and R recipes in DSS can be found in our documentation:
https://doc.dataiku.com/dss/latest/code_recipes/python.html
https://doc.dataiku.com/dss/latest/code_recipes/r.html
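
As a rough illustration, here is a minimal pandas sketch of the comparison, assuming both datasets share the same columns and a unique key column named ID. The function name count_column_matches and the inline sample frames are made up for this example; in a DSS Python recipe you would typically load the inputs with dataiku.Dataset(...).get_dataframe() and write the result with write_with_schema(...) instead. Treating two missing values as a match is also an assumption, so adjust it if that is not what you want.

```python
import pandas as pd


def count_column_matches(df1, df2, key="ID"):
    """For each non-key column, count the keys whose values are equal in both frames.

    Hypothetical helper; assumes df1 and df2 have the same columns and that
    `key` is unique in each frame.
    """
    # Inner join on the key, so only keys present in both datasets are compared.
    merged = df1.merge(df2, on=key, suffixes=("_1", "_2"))

    counts = {}
    for col in df1.columns:
        if col == key:
            continue
        left = merged[f"{col}_1"]
        right = merged[f"{col}_2"]
        # Assumption: two missing values count as a match (plain == would not).
        same = (left == right) | (left.isna() & right.isna())
        counts[col] = int(same.sum())

    return pd.DataFrame(
        {"Column": list(counts.keys()), "Count of ID": list(counts.values())}
    )


# Sample data copied from the question above.
dataset_1 = pd.DataFrame(
    {
        "ID": [1, 2, 3],
        "Name": ["ABC", "XYZ", "DEF"],
        "Age": [21, 23, 67],
        "Country": ["USA", "UK", "CHN"],
    }
)
dataset_2 = pd.DataFrame(
    {
        "ID": [1, 2, 3],
        "Name": ["ABC", "XYZ", "DEF"],
        "Age": [22, 23, 67],
        "Country": ["USA", "UK", "SWZ"],
    }
)

result = count_column_matches(dataset_1, dataset_2)
print(result)
```

On the sample data this gives 3 for Name, 2 for Age and 2 for Country, which matches the expected output in the question. Because of the inner join, IDs that exist in only one of the datasets are not counted for any column.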

Thanks,
Andrew
