I have two datasets and am using the join recipe. I want the output dataset to be populated as follows:
keep only those records that are present in dataset 1 (say, the left-hand side) and not present in dataset 2.
I set the join condition on the two datasets' common column, using "join on" with the "is different" operator. But the operation runs for a very long time. Yes, it is a huge dataset, with 500k records.
Is there a faster way to get this kind of result?
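
To make the expected result concrete, here is a minimal pandas sketch of what I am after (usually called a left anti-join). The column names `id` and `value`, the sample data, and the helper `left_anti_join` are made up for illustration; this only shows the output I want, not how the join recipe works internally.

```python
import pandas as pd


def left_anti_join(left, right, key):
    """Return the rows of `left` whose `key` value does not appear in `right`."""
    # Keep only distinct key values from the right side so the merge
    # cannot duplicate rows from the left side.
    right_keys = right[[key]].drop_duplicates()
    # Equality join on the key; `indicator=True` adds a `_merge` column
    # saying whether each row matched ("both") or not ("left_only").
    merged = left.merge(right_keys, on=key, how="left", indicator=True)
    unmatched = merged.loc[merged["_merge"] == "left_only"]
    return unmatched.drop(columns="_merge").reset_index(drop=True)


# Hypothetical sample data standing in for dataset 1 and dataset 2.
dataset1 = pd.DataFrame({"id": [1, 2, 3, 4], "value": ["a", "b", "c", "d"]})
dataset2 = pd.DataFrame({"id": [2, 4, 5]})

result = left_anti_join(dataset1, dataset2, "id")
print(result)
```

With this sample data, only the rows with `id` 1 and 3 remain. Note that the sketch joins on equality and then filters out the matched rows. A join on "is different" instead pairs each left row with every right row whose key differs, which could approach 500k × 500k pairs if dataset 2 is of similar size, so that may be why my job runs so long.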