Advanced Designer Learning Path is now live! Read More

Can you fix the K-means train results?

Level 1
Level 1
Can you fix the K-means train results?

We fixed the seed in K-means, but the clusters we got for each train were different. Specifically, the Variables importance and silhouette were different.

Are there any other settings needed besides seed to fix the results? We think that changing results every time under the same conditions is a big problem for business use.

How to reproduce it is as follows.

  • Create a project by importing Sample project(name=Predicting Churn).
  • Changing the setting of Visual Analysis(name=Clustering customers into segments).
    • Algorithms > KMeans > Seed = 1000
  • Run the TRAIN of "Clustering customers into segments".
0 Kudos
1 Reply
Community Manager
Community Manager

Hi, @kamegai_satoshi ! Can you provide any further details on the thread to assist users in helping you find a solution (insert examples like DSS version etc.) Also, can you let us know if you’ve tried any fixes already?This should lead to a quicker response from the community.

Looking for more resources to help you use DSS effectively and upskill your knowledge? Check out these great resources: Dataiku Academy | Documentation | Knowledge Base

A reply answered your question? Mark as ‘Accepted Solution’ to help others like you!
0 Kudos
A banner prompting to get Dataiku DSS