Prompt recipe - controlling request rate?

TrevorHall · November 2023

Is it possible to control the rate at which a prompt recipe issues requests? I ran my first one yesterday and very quickly hit the TPM rate cap on our OpenAI model deployment.

AdrienL · November 2023

Hi,
On the connection, there are some settings to control the rate of queries. There is no direct limit for TPM, but reducing parallelism will usually fix it.

TrevorHall · November 2023

Thanks! I set "Max parallelism" to 1 and that seems to be working. I also upped the retry delay to 2 seconds.

Prompt recipe - controlling request rate?

Best Answer

Answers

Categories

Setup Info

Tags