Sign up to take part
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Hello @MRvLuijpen,
Thank you so much for posting a question on Dataiku Community.
A seed cannot be set for the "Random subset of column(s) value" mode, but in this mode, the behaviour is actually deterministic and the results will be the same as long as the input data and the Recipe's settings are not changed (so, a seed is not needed to be set for this mode).
I hope this would help. Please let us know if you have any further questions about this topic.
Sincerely,
Keiji
Hello @MRvLuijpen,
Thank you so much for posting a question on Dataiku Community.
A seed cannot be set for the "Random subset of column(s) value" mode, but in this mode, the behaviour is actually deterministic and the results will be the same as long as the input data and the Recipe's settings are not changed (so, a seed is not needed to be set for this mode).
I hope this would help. Please let us know if you have any further questions about this topic.
Sincerely,
Keiji
Hello @KeijiY ,
Thank you for your response. It is clear.
Would it be an idea to implement the possibility of introducing a seed in there.
For our business case we would like to set a seed in order to create multiple random, but different, subsets.
A reference: sklearn groupshufflesplit parameter random_state
Thanks again.
Sincerely,
Marc Robert