Variables importance

Highlighted
tifo
Level 1
Variables importance
Jump to solution
I did Clustering with K-MEANS model and I wish to understand how the variables importance percentages in the histogram are calculated? what does it measure?

Thanks
0 Kudos
1 Solution

Accepted Solutions
Alex_Combessie Dataiker
Dataiker
Re: Variables importance
Jump to solution
We fit a simple random forest supervised model to the output classes of the kmeans. This allows us to derive variable importances, as per the random forest standard method (implemented in scikit-learn).

View solution in original post

3 Replies
Alex_Combessie Dataiker
Dataiker
Re: Variables importance
Jump to solution
We fit a simple random forest supervised model to the output classes of the kmeans. This allows us to derive variable importances, as per the random forest standard method (implemented in scikit-learn).

View solution in original post

nuvitu
Level 2
Re: Variables importance
Jump to solution
I can see a feature with 10%, another is 5%. What is the meaning of % in variable importances?
0 Kudos
Alex_Combessie Dataiker
Dataiker
Re: Variables importance
Jump to solution
We use the definition of variable importance in percentage from the random forest model in scikit-learn.
0 Kudos