It would be nice if the chatCompletion method could optionally accept an array of requests, as described below:
Batching requests
The OpenAI API has separate limits for requests per minute and tokens per minute.
If you're hitting the limit on requests per minute, but have available capacity on tokens per minute, you can increase your throughput by batching multiple tasks into each request. This will allow you to process more tokens per minute, especially with our smaller models.
Sending in a batch of prompts works exactly the same as a normal API call, except you pass in a list of strings to the prompt parameter instead of a single string.
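For reference, the quoted docs describe the legacy completions endpoint, where the prompt parameter already accepts an array. Below is a minimal sketch of what that looks like with this library, assuming the installed version's CompletionRequest.Prompt field accepts a []string; the model name and prompts are just placeholders:

```go
package main

import (
	"context"
	"fmt"
	"log"
	"os"

	openai "github.com/sashabaranov/go-openai"
)

func main() {
	client := openai.NewClient(os.Getenv("OPENAI_API_KEY"))

	// One request carrying several prompts. The Prompt field is typed loosely
	// enough to take either a string or a slice of strings; worth verifying
	// that your installed version accepts a []string.
	resp, err := client.CreateCompletion(context.Background(), openai.CompletionRequest{
		Model: "gpt-3.5-turbo-instruct",
		Prompt: []string{
			"Write a haiku about Go.",
			"Write a haiku about Rust.",
			"Write a haiku about Zig.",
		},
		MaxTokens: 64,
	})
	if err != nil {
		log.Fatalf("completion error: %v", err)
	}

	// All choices come back in a single response; each choice's Index maps it
	// back to the prompt at that position in the slice.
	for _, choice := range resp.Choices {
		fmt.Printf("prompt %d: %s\n", choice.Index, choice.Text)
	}
}
```

As far as I know, the chat completions endpoint has no equivalent array-of-prompts parameter, which is presumably why the request above is for chatCompletion to accept an array of requests instead.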
Hi, @sashabaranov,
I noticed that this feature seems different from the Batch API as described in the Batch API FAQ. Could you clarify how it would support batch processing, including job statuses, status checks, and the JSONL request format?
I've also noticed a PR that mentions support for it.
There is also an issue asking for the same feature.
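For contrast, here is a rough sketch of the Batch API flow as I understand it from the public docs: each line of an uploaded JSONL file is a self-contained request, the file is registered as a batch job, and the job is polled until it reaches a terminal status. The snippet below only builds the JSONL lines in Go; the custom IDs, model name, and prompts are made up for illustration:

```go
package main

import (
	"encoding/json"
	"fmt"
)

// batchLine mirrors one line of a Batch API input file: a caller-chosen ID,
// the HTTP method, the target endpoint, and the request body.
type batchLine struct {
	CustomID string         `json:"custom_id"`
	Method   string         `json:"method"`
	URL      string         `json:"url"`
	Body     map[string]any `json:"body"`
}

func main() {
	prompts := []string{"Summarize Go generics.", "Summarize Go channels."}

	// One JSONL line per request. The resulting file would then be uploaded,
	// a batch job created against it, and the job polled until its output
	// file is ready.
	for i, p := range prompts {
		line := batchLine{
			CustomID: fmt.Sprintf("request-%d", i+1),
			Method:   "POST",
			URL:      "/v1/chat/completions",
			Body: map[string]any{
				"model":    "gpt-4o-mini",
				"messages": []map[string]string{{"role": "user", "content": p}},
			},
		}
		out, err := json.Marshal(line)
		if err != nil {
			panic(err)
		}
		fmt.Println(string(out))
	}
}
```

Supporting that in the library would mean file upload, batch creation, and status-polling helpers, which is a different surface from batching multiple prompts into a single request.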