How to get model performance on the train set ?

MatthieuPx
MatthieuPx Registered Posts: 6 ✭✭✭

Within a lab/visual analyses/classification task, after a model was trained, how to get the model performance on the train set ? Is there a direct way to get it without deploying the model and evaluating it on the train set ?

I believe that all displayed metrics in the Results tab are (obviously) related to the test set. What if we need to look at some metrics (AUC or else) on the train set, to check at the overfitting ?

Thank you.

Best Answer

  • AdamN
    AdamN Dataiker, Registered Posts: 2 Dataiker
    Answer ✓

    Hi @MatthieuPx
    , thank you for your question.

    You will indeed need to deploy the model to do this.

    You can use an Evaluate recipe to compute the model's performance on the train set. In the settings for the Evaluate recipe, you can choose which metrics to compute.

    In case you do not have the train and test sets in separate datasets, you can export the train and test sets from the model results, as shown here:

    7AC3B896-0BA3-4A63-93B7-126D3A791259_1_201_a.jpeg

    Also, we are currently working on adding learning curve charts which will show the model's performance on the train and test set. This will be available in a future version of Dataiku.

Setup Info
    Tags
      Help me…