Need to do stratified sampling in a Sample/Filter recipe
Yasmine
Dataiku DSS Core Designer, Dataiku DSS Adv Designer, Registered Posts: 14 ✭✭✭
Hello,
I created a Sample/Filter recipe in Dataiku and I need to do stratified sampling on my dataset. I found that this type of sampling is only available when you explore the data.
Is this type of sampling available in Dataiku? If not, how can I do it in Python code, please?
Thank you in advance for your reply,
Kind regards
Best Answer
Hi,
Check out this other post: https://community.dataiku.com/t5/Using-Dataiku/Split-dataset-by-stratified-sampling/m-p/2151
It has Python examples in the comments.
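In case it helps, here is a minimal sketch of stratified sampling with pandas, as it might be used in a Python recipe. It is an illustration only and is not taken from the linked post; the helper name `stratified_sample`, the column names, and the toy data are made up for the example. In a real recipe you would presumably read and write the DataFrame through the `dataiku` package instead of building it by hand.

```python
import pandas as pd


def stratified_sample(df, strata_col, frac, seed=0):
    """Draw the same fraction of rows from every group of `strata_col`.

    Hypothetical helper: relies on pandas >= 1.1 for GroupBy.sample.
    """
    return df.groupby(strata_col, group_keys=False).sample(
        frac=frac, random_state=seed
    )


# Toy data (assumption): 80 rows of class "a" and 20 rows of class "b".
df = pd.DataFrame({"label": ["a"] * 80 + ["b"] * 20, "value": range(100)})

# Keep 50% of each class, so the 80/20 class balance is preserved.
sample = stratified_sample(df, "label", frac=0.5)
counts = sample["label"].value_counts().to_dict()
print(counts)
```

Because the fraction is applied per group, the sample keeps the class proportions of the input, which is the point of stratifying. To fix the number of rows per class instead, `GroupBy.sample` also accepts `n=` in place of `frac=`.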
Best,
Pat
Answers
Marlan Neuron 2020, Neuron, Registered, Dataiku Frontrunner Awards 2021 Finalist, Neuron 2021, Neuron 2022, Dataiku Frontrunner Awards 2021 Participant, Neuron 2023 Posts: 320 Neuron
Hi @Yasmine,
FYI, I submitted a product idea to provide an easy stratify option for train/test splits in the visual ML tool:
Consider voting for it if it would be useful to you.
Regards,
Marlan