Sign up to take part
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Added on January 7, 2020 5:06PM
Likes: 0
Replies: 0
You may sometimes be interested in building a prediction model on different subgroups of your dataset, rather than the overall dataset. These models, called stratified models (or partitioned models), can lead to better predictions when relevant predictors for a target variable are different across subgroups of the dataset. For example, customers in different data subgroups may have different purchasing patterns that contribute to how much they spend.
When you create a visual machine learning (prediction) model on a partitioned dataset, you have the option to create partitioned models.
The following results show partitioned models.
When you select algorithms to use for training, Dataiku DSS trains a partitioned model for each algorithm. Each partitioned model consists of one sub-model (or model partition) per data partition. For example, the previous screenshot shows two partitioned models (Logistic Regression - Partitioned and Decision Tree - Partitioned). Each of these models has three model partitions, one for each partition that was trained.