Survey banner
Share your feedback on the Dataiku documentation with this 5 min survey. Thanks! TAKE THE SURVEY

Meaning of 'others' in variable importance inside Machine learning modelling in DSS

kautuk
Level 2
Meaning of 'others' in variable importance inside Machine learning modelling in DSS

Hi, 

I have built a binary classification model using RandomForest. After the model is built, I am getting few variable value as 'others' inside variable importance. I do not have any value as 'others' in these variables and hence, I wanted to know what value DSS categorizes as 'others' on running the model.

Thanks,

Kautuk

0 Kudos
1 Reply
Muennighoff
Dataiker

Hey! Variable importance shows importances for individual features after preprocessing is applied. The preprocessing may add new columns.

In your case, you are likely preprocessing features like SN_PN with Dummy encoding, which creates an Other category for column values that do not fall within the specified Max nb. of categories. Increasing Max nb. of categories in the preprocessing window prior to training should remove the Other.

Hope this helps 🙂

0 Kudos