You now have until September 15th to submit your use case or success story to the 2022 Dataiku Frontrunner Awards!ENTER YOUR SUBMISSION

Model Scoring takes too long when compared to training.

MarcioCoelho
Level 1
Model Scoring takes too long when compared to training.

Hey everyone,

I've recently ran into a small issue, which is the time it takes to dataiku to score a model.

As you can see in provided screenshot, it took the model roughly 9min to do everything from loading to saving, while the remainder 51min were spent in scoring it.

Is there anything I can do to speed this up? I imagine that, for a binary classification, it calculates the results using different cut-off thresholds, but I'm not sure.

The algorithm I am using is a custom python model, based on a standard scikit-learn algorithm which is BalancedRandomForest Classifier.

Any help would be appreciated.

Best regards,

Márcio


Operating system used: Windows

0 Kudos
2 Replies
mkamal1
Level 1

@MarcioCoelho I have the same problem after upgrading DSS 9.0 to 10.0.4 on Linux. It happens across all visual ML models (Random Forests, XGBoost, LightGBM, etc). 

The model is training a binary classifier. Training takes about 5 minutes, but the scoring is stuck for over an hour. I am using DSS's builtin environment for visual ML LightGBM.

Did you find a fix?

0 Kudos
MarcioCoelho
Level 1
Author

Unfortunately no, the issue still persists, even after upgrading to DSS 10.0.4 (I had the issue in version 9.0).

0 Kudos