Submit your use case or success story to the 2023 edition of the Dataiku Frontrunner Awards ENTER YOUR SUBMISSION

Split the table in DSS into separate test and train set table

Sam648
Level 2
Split the table in DSS into separate test and train set table
Hi,

I want to split the data table in the workflow into train and test set with split done randomly. How can we do that? I tried using "split" recipe but i didnt get the desired output.. Can any one help me with this ?



Sam
0 Kudos
1 Reply
cperdigou
Dataiker Alumni

These settings will let you achieve what you desire:



- define a custom variable "rand()"



- add two filters on that variable, one <0.8, the other >=0.8



0 Kudos

Labels

?
Labels (1)
A banner prompting to get Dataiku