Delete/flag rows... how to start

Solved!
emate
Level 5
Delete/flag rows... how to start

Hi All,

I am new to DSS and struggling to start. So any kind of help appreciate.

I have 2 columns, Company and Product group. What I want to do is to filter/keep only those rows in which I have my company's product group.
Attached a screenshot in excel with an example. Of course in real life I have like couple of thousands of rows and hundrets of product groups it diffucult to use simple "filter by value" or at least im not sure how to do it.

 

Thanks

0 Kudos
2 Solutions
Alex_Combessie
Dataiker Alumni

Hi,

Sorry, I had slightly misunderstood your requirements. My recommendation was missing a step, which is a Group recipe to compute the distinct list of companies by product.

Please find below a video walkthrough on how to do it. I have used specifically the dataset you have shared as an example.

I hope it helps,

Alex

PS: I have also attached the project archive if you want to import it on your Dataiku DSS instance

View solution in original post

Alex_Combessie
Dataiker Alumni

Hi,

I have edited my post, it should work now. Can you confirm? The video link is here:

https://www.youtube.com/watch?v=RLYqtX2k87g

Cheers,

Alex

View solution in original post

5 Replies
Alex_Combessie
Dataiker Alumni

Hi,

You can achieve that in 2 steps in a Prepare recipe:

1. Flag rows where product group is from your company. That will generate a new "flag" column with 1 and empty values

2. Remove rows where the flag column is empty

For further information on how to go from Excel to Dataiku, this tutorial can be very helpful: https://academy.dataiku.com/latest/guide/analyst/excel-to-dss/index.html

Cheers,

Alex

0 Kudos
emate
Level 5
Author

Hi,

I'm not sure if I understand. I mean I can flag product group based on my company name, but how could I add another flag I created in column D, so product groups that overlapps with those from my company?

Thanks

0 Kudos
Alex_Combessie
Dataiker Alumni

Hi,

Sorry, I had slightly misunderstood your requirements. My recommendation was missing a step, which is a Group recipe to compute the distinct list of companies by product.

Please find below a video walkthrough on how to do it. I have used specifically the dataset you have shared as an example.

I hope it helps,

Alex

PS: I have also attached the project archive if you want to import it on your Dataiku DSS instance

emate
Level 5
Author

Thank you for your help, but video is not working for some reason, is it an issue on my side?

I'm getting this error: "There is an error. Please try again later.... More information" 

Also attached project - where I can find it?

emate_0-1584103364196.png

 

Sorry for the trouble & Thank you

0 Kudos
Alex_Combessie
Dataiker Alumni

Hi,

I have edited my post, it should work now. Can you confirm? The video link is here:

https://www.youtube.com/watch?v=RLYqtX2k87g

Cheers,

Alex