I am new to DSS and struggling to start. So any kind of help appreciate.
I have 2 columns, Company and Product group. What I want to do is to filter/keep only those rows in which I have my company's product group.
Attached a screenshot in excel with an example. Of course in real life I have like couple of thousands of rows and hundrets of product groups it diffucult to use simple "filter by value" or at least im not sure how to do it.
You can achieve that in 2 steps in a Prepare recipe:
1. Flag rows where product group is from your company. That will generate a new "flag" column with 1 and empty values
2. Remove rows where the flag column is empty
For further information on how to go from Excel to Dataiku, this tutorial can be very helpful: https://academy.dataiku.com/latest/guide/analyst/excel-to-dss/index.html
I'm not sure if I understand. I mean I can flag product group based on my company name, but how could I add another flag I created in column D, so product groups that overlapps with those from my company?
Sorry, I had slightly misunderstood your requirements. My recommendation was missing a step, which is a Group recipe to compute the distinct list of companies by product.
Please find below a video walkthrough on how to do it. I have used specifically the dataset you have shared as an example.
I hope it helps,
PS: I have also attached the project archive if you want to import it on your Dataiku DSS instance
Thank you for your help, but video is not working for some reason, is it an issue on my side?
I'm getting this error: "There is an error. Please try again later.... More information"
Also attached project - where I can find it?
Sorry for the trouble & Thank you
I have edited my post, it should work now. Can you confirm? The video link is here: