Filter data based on distinct row names
Hi Dataiku,
I have thousands of rows and about 20 columns with info. In the first column, I have patient names, some appearing more than once. I am interested to filter data based on distinct patient names, so that all patients appear only once.
It should be fairly straightforward but I am not that experienced in DSS.
Any ideas?
Thank you,
Leonardo
Operating system used: windows
Best Answer
-
Manuel Alpha Tester, Dataiker Alumni, Dataiku DSS Core Designer, Dataiku DSS ML Practitioner, Dataiku DSS Core Concepts, Dataiku DSS Adv Designer, Registered Posts: 193 ✭✭✭✭✭✭✭
Hi,
Have a look at the Group recipe, it allows you to indicate the keys to group by (e.g. customer name) and then specify the aggregations.
See this video as an introduction https://www.youtube.com/watch?v=DGAuPmAgWZo.
If you are new to DSS, you should consider completing one of our free academy courses to get you started, https://academy.dataiku.com/path/core-designer.
I hope this helps.