Sign up to take part
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Added on April 20, 2016 3:54PM
Likes: 0
Replies: 1
We do not have a direct recipe to do so.
The fastest way is probably as a SQL recipe. For instance in Hive:
SELECT * FROM train_set WHERE target = 1
UNION ALL
SELECT * FROM (SELECT * FROM train_set WHERE target = 0 ORDER BY rand() LIMIT 1000000) foo