LLM Batch inference #1202
Merged
Conversation
Signed-off-by: Henry Lindeman <hmlindeman@yahoo.com>
karanataryn approved these changes on Feb 28, 2025
```diff
      return res
  elif llm_mode == LLMMode.BATCH:
-     raise NotImplementedError("Haven't done batch yet")
+     return llm.generate_batch(prompts=prompts)
```
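The dispatch in the diff can be sketched as follows. `LLMMode` and `generate_batch(prompts=...)` follow the names in the diff; the `StubLLM` class and the `generate_all` wrapper are hypothetical stand-ins, since the surrounding code is not shown in this PR view.

```python
from enum import Enum


class LLMMode(Enum):
    SYNC = "sync"
    BATCH = "batch"


class StubLLM:
    """Hypothetical stand-in for an LLM client exposing per-mode entry points."""

    def generate(self, *, prompt: str) -> str:
        # One request per prompt.
        return f"sync:{prompt}"

    def generate_batch(self, *, prompts: list[str]) -> list[str]:
        # A real client would submit a single batch job covering all prompts
        # and poll for its results; here we just echo.
        return [f"batch:{p}" for p in prompts]


def generate_all(llm, prompts: list[str], llm_mode: LLMMode) -> list[str]:
    """Dispatch a list of prompts to the client entry point for the given mode."""
    if llm_mode == LLMMode.SYNC:
        return [llm.generate(prompt=p) for p in prompts]
    elif llm_mode == LLMMode.BATCH:
        return llm.generate_batch(prompts=prompts)
    raise ValueError(f"unsupported mode: {llm_mode}")
```

With this shape, adding a new mode means one more branch in the dispatcher plus one entry point on the client, which is essentially what the diff does for `BATCH`.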
Adds batch inference modes for the OpenAI and Anthropic clients.
I didn't do Bedrock or Gemini because those involve dealing with S3 and GCS/BigQuery.
OpenAI batch is pretty slow; to be able to test it I ended up using GPT-3.5 Turbo, since it sees far less demand and batch requests are low priority (they only expire after 24 hours). Anthropic's Claude 3 Haiku was decently fast (competitive with async!), although that may be the same effect. I did not test with a more modern/powerful Claude.
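For context on the OpenAI side, the Batch API takes a JSONL file where each line wraps one chat-completion request in an envelope with a `custom_id` (results come back unordered, so this is how you match responses to prompts). A minimal sketch of building those lines is below; the function name is hypothetical, and the file upload plus `batches.create(..., completion_window="24h")` step (the source of the 24-hour expiry mentioned above) is omitted.

```python
import json


def build_openai_batch_lines(prompts: list[str], model: str = "gpt-3.5-turbo") -> str:
    """Build the JSONL payload for OpenAI's Batch API.

    Each line is one request envelope targeting /v1/chat/completions;
    custom_id ties each result back to its prompt, since batch output
    order is not guaranteed.
    """
    lines = []
    for i, prompt in enumerate(prompts):
        envelope = {
            "custom_id": f"request-{i}",
            "method": "POST",
            "url": "/v1/chat/completions",
            "body": {
                "model": model,
                "messages": [{"role": "user", "content": prompt}],
            },
        }
        lines.append(json.dumps(envelope))
    return "\n".join(lines)
```

Anthropic's Message Batches API is similar in spirit but takes the request list inline (no file upload), which makes the client-side plumbing a bit simpler.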