Error modeling

Level 1
Error modeling

Hello I got this error while modeling 

Kubernetes pod failed: Kubernetes pod failed, exitCode=119, reason=OOMKilled


I dont know if anyone knows how to fix this issue 

0 Kudos
1 Reply

This issue usually occurs when the chosen configuration doesn't have enough memory allocated.

For model training you can define this under the Design tab in Runtime environment. You can try to gradually increase this. The default is 4 GB and you can try to move to 8 GB and retry. If that still fails try with16 GB.

The other way to reduce memory usage for some models is to reduce parallelism for the algorithm. The default is 4 for parallelism.

Screenshot 2023-11-30 at 10.49.58 AM.png

You can try to change it to 2  and 16 GB container and then check if this is sufficient.


0 Kudos