How to check GPU pods and increase the capacity?

Rachel0000 Registered Posts: 1

When I start a Code Studio on the GPU cluster, I get the error below:

Normal NotTriggerScaleUp 4m54s (x28 over 9m57s) cluster-autoscaler pod didn't trigger scale-up: 1 Insufficient nvidia.com/gpu, 2 max node group size reached

How can I increase the number of GPU pods?

Answers

  • Grixis
    Grixis PartnerApplicant, Dataiku DSS Core Designer, Dataiku DSS ML Practitioner, Dataiku DSS Adv Designer, Registered Posts: 47 ✭✭✭✭✭

    Hello @Rachel0000,

    It is hard to say what the root cause is. It seems that either your cluster configuration is not suited for autoscaling, or you have reached the maximum capacity of your cluster.
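    To make the event easier to read, here is a minimal Python sketch that splits the cluster-autoscaler message into its individual reasons. The function name `parse_not_trigger_scale_up` is hypothetical, and the reading of each number as "how many node groups were rejected for that reason" is my understanding of the autoscaler's wording, not something confirmed for this cluster.

```python
import re


def parse_not_trigger_scale_up(message: str) -> dict[str, int]:
    """Split a NotTriggerScaleUp message into {reason: count}.

    Assumption: each count is the number of node groups the
    autoscaler rejected for that reason.
    """
    # Keep only the part after the fixed prefix; empty if the prefix is absent.
    _, _, tail = message.partition("pod didn't trigger scale-up:")
    reasons: dict[str, int] = {}
    for part in tail.split(","):
        match = re.match(r"\s*(\d+)\s+(.+?)\s*$", part)
        if match:
            reasons[match.group(2)] = int(match.group(1))
    return reasons


# Message copied from the event in the question.
message = (
    "pod didn't trigger scale-up: 1 Insufficient nvidia.com/gpu, "
    "2 max node group size reached"
)
reasons = parse_not_trigger_scale_up(message)
for reason, count in reasons.items():
    print(f"{count} x {reason}")
```

    Read this way, one node group lacks free `nvidia.com/gpu` resources and two node groups are already at their configured maximum size, so the autoscaler has nowhere left to add a GPU node.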

    Could you share more information about that? Also, which kind of Dataiku installation are you on, and which version?
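    One useful piece of information is how many GPUs each node currently exposes. The sketch below assumes you can export the node list with `kubectl get nodes -o json` and that the GPUs are advertised under the `nvidia.com/gpu` resource name, as in the event above. The function name `gpu_summary` and the sample node names are hypothetical and only there for illustration.

```python
import json


def gpu_summary(nodes_json: str) -> dict[str, int]:
    """Map node name -> allocatable nvidia.com/gpu count.

    Expects the JSON text produced by `kubectl get nodes -o json`.
    Nodes without the resource are reported with 0 GPUs.
    """
    nodes = json.loads(nodes_json)["items"]
    summary: dict[str, int] = {}
    for node in nodes:
        name = node["metadata"]["name"]
        allocatable = node.get("status", {}).get("allocatable", {})
        # Kubernetes reports resource quantities as strings, e.g. "1".
        summary[name] = int(allocatable.get("nvidia.com/gpu", "0"))
    return summary


# Hypothetical sample shaped like `kubectl get nodes -o json` output.
sample = json.dumps({
    "items": [
        {"metadata": {"name": "gpu-node-1"},
         "status": {"allocatable": {"cpu": "8", "nvidia.com/gpu": "1"}}},
        {"metadata": {"name": "cpu-node-1"},
         "status": {"allocatable": {"cpu": "4"}}},
    ]
})
summary = gpu_summary(sample)
total_gpus = sum(summary.values())
print(summary, "total GPUs:", total_gpus)
```

    If the total matches the number of GPU pods already running, the cluster is simply full, and more capacity would have to come from raising the maximum size of the GPU node group in the cluster configuration.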

    Martin
