Ability to train a model on the entire dataset

It is currently not possible to train a deployed model on the entire dataset, because Dataiku forces the user to specify a test set.

See this thread detailing Dataiku's current limitations when training a model: https://community.dataiku.com/t5/Using-Dataiku/when-training-a-model-with-a-visual-recipe-does-datai...

However, once the best combination of hyperparameters has been found, it is considered best practice to retrain the model on all available data. This matters most when the model has not yet reached its performance plateau, which a learning curve would show.
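To illustrate the workflow I have in mind, here is a minimal sketch using scikit-learn outside of Dataiku. It is not Dataiku's API and not how the train recipe works internally; the synthetic dataset, the logistic regression, and the `C` grid are all assumptions chosen only to keep the example short. The hyperparameters are tuned on the training split, the test split gives the performance estimate, and then a fresh model with the same hyperparameters is fitted on every row.

```python
from sklearn.base import clone
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic data standing in for the real dataset (assumption for illustration).
X, y = make_classification(n_samples=500, n_features=10, random_state=0)

# Step 1: hold out a test set, as the train recipe requires today.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Step 2: search hyperparameters with cross-validation on the training split only.
search = GridSearchCV(
    LogisticRegression(max_iter=1000),
    param_grid={"C": [0.01, 0.1, 1.0, 10.0]},
    cv=5,
)
search.fit(X_train, y_train)

# Step 3: estimate generalization performance on the held-out test set.
held_out_score = search.score(X_test, y_test)

# Step 4: refit an unfitted copy with the chosen hyperparameters on ALL rows.
# The held-out score from step 3 is the estimate reported for this final model.
final_model = clone(search.best_estimator_).fit(X, y)

print("best params:", search.best_params_)
print("held-out accuracy:", held_out_score)
```

Step 4 is the part that is missing today: the final model sees the test rows too, so it can no longer be scored on them, and the score from step 3 serves as a (typically slightly pessimistic) estimate of its performance.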

Adding this option to the train visual recipe would be highly appreciated.