Error modeling

Srovalo
Srovalo Registered Posts: 2

Hello I got this error while modeling

Kubernetes pod failed: Kubernetes pod failed, exitCode=119, reason=OOMKilled

I dont know if anyone knows how to fix this issue

Answers

  • Catalina
    Catalina Dataiker, Dataiku DSS Core Designer, Registered Posts: 135 Dataiker

    This issue usually occurs when the chosen configuration doesn't have enough memory allocated.

    For model training you can define this under the Design tab in Runtime environment. You can try to gradually increase this. The default is 4 GB and you can try to move to 8 GB and retry. If that still fails try with16 GB.

    The other way to reduce memory usage for some models is to reduce parallelism for the algorithm. The default is 4 for parallelism.

    Screenshot 2023-11-30 at 10.49.58 AM.png

    You can try to change it to 2 and 16 GB container and then check if this is sufficient.

Setup Info
    Tags
      Help me…