Added on November 15, 2023 3:36PM
Replies: 2
Is it possible to control the rate at which a prompt recipe issues requests? I ran my first one yesterday and very quickly hit the tokens-per-minute (TPM) rate limit on our OpenAI model deployment.
Thanks! I set "Max parallelism" to 1 and that seems to be working. I also upped the retry delay to 2 seconds.
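For anyone who finds this thread later, here is a rough sketch of what those two settings amount to conceptually: one request in flight at a time, with a fixed 2-second wait before retrying a rate-limited call. This is not the recipe's actual implementation. `run_sequentially`, `RateLimitError`, `call`, and `max_retries` are hypothetical names made up for illustration, and the retry count of 5 is an assumption.

```python
import time
from typing import Callable, Iterable, List, TypeVar

T = TypeVar("T")
R = TypeVar("R")


class RateLimitError(Exception):
    """Hypothetical stand-in for a provider's rate limit (HTTP 429) error."""


def run_sequentially(
    items: Iterable[T],
    call: Callable[[T], R],
    retry_delay_s: float = 2.0,   # mirrors the 2-second retry delay above
    max_retries: int = 5,         # assumed value, not taken from the thread
    sleep: Callable[[float], None] = time.sleep,
) -> List[R]:
    """Process items one at a time (parallelism of 1), retrying on rate limits."""
    results: List[R] = []
    for item in items:
        for attempt in range(max_retries + 1):
            try:
                results.append(call(item))
                break  # success, move on to the next item
            except RateLimitError:
                if attempt == max_retries:
                    raise  # out of retries, surface the error
                sleep(retry_delay_s)  # fixed wait before the next attempt
    return results
```

A fixed delay is the simplest option. If the limit is still hit, a longer delay or an exponential backoff is a common next step.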