Output explanations of a subset of data

Options
Anders
Anders Registered Posts: 7 ✭✭✭

Hi,

I have a large dataset, and my model estimates probabilities for each record. I would like to limit "ouput explanations" to the 1 % highest probabilities, since the calculation is quite time and resource consuming. I know that I can calculate all probabilities once, then filter the dataset, and finally calculate probabilities with explanations for the filtered subset. But I hope that it was a more elegant way to fix this.

-anders-

Tagged:

Best Answer

  • AdrienL
    AdrienL Dataiker, Alpha Tester Posts: 196 Dataiker
    Answer ✓
    Options

    Hi,

    There is no option in the scoring recipe to selectively explain rows based on some predicate. The filtering you suggested would be the way to go.

    There is such a feature in the model's report though, where the Individual explanations tab offers to explain the prediction for the rows from the test set with the highest and/or lowest probabilities.

Setup Info
    Tags
      Help me…