
Ability to train a model on the entire dataset

It is currently not possible to train a deployed model on the entire dataset, because Dataiku forces the user to specify a test set.

See this thread detailing Dataiku's current limitations when training a model:

However, it is common practice to retrain a model on all available data once the best combination of hyperparameters has been found. This matters most when the model has not yet reached its performance plateau (which can be seen on a learning curve), since every additional row is still likely to improve it.
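To make the workflow concrete, here is a minimal sketch of what I mean, written with scikit-learn outside of Dataiku. It is not Dataiku's API: the synthetic dataset, the logistic regression model, and the `C` grid are all assumptions chosen only for illustration.

```python
from sklearn.base import clone
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic data stands in for a real dataset (assumption for illustration).
X, y = make_classification(n_samples=500, n_features=10, random_state=0)

# Step 1: hold out a test set, as Dataiku requires today.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Step 2: search hyperparameters with cross-validation on the training part only.
search = GridSearchCV(
    LogisticRegression(max_iter=1000),
    param_grid={"C": [0.01, 0.1, 1.0, 10.0]},  # hypothetical grid
    cv=5,
)
search.fit(X_train, y_train)

# Step 3: estimate generalization performance on the held-out test set.
holdout_score = search.score(X_test, y_test)

# Step 4: refit a fresh copy of the best configuration on ALL rows.
# This is the step the train recipe does not currently allow.
final_model = clone(search.best_estimator_).fit(X, y)

print("best params:", search.best_params_)
print("hold-out accuracy:", holdout_score)
```

The hold-out score from step 3 is still the performance estimate one would report. The model from step 4 has no test set of its own, so its score cannot be measured directly; the expectation is simply that it does at least as well because it has seen more data.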

Adding this feature to the train visual recipe would be highly appreciated.