[Serve] add support for custom batch size function #59059
Conversation
Signed-off-by: abrar <abrar@anyscale.com>
python/ray/serve/batching.py
Outdated
```python
# Put deferred item back in queue for next batch
if deferred_item is not None:
    self.queue.put_nowait(deferred_item)
```
this would put the request at the back of the queue, correct?
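For reference, assuming `self.queue` is an `asyncio.Queue` (or anything with the same FIFO semantics), `put_nowait` appends to the tail, so the re-enqueued request lands behind anything already waiting. A minimal sketch:

```python
import asyncio

# FIFO demonstration: put_nowait appends to the tail, so a re-enqueued
# item ends up behind requests that are already waiting in the queue.
queue: asyncio.Queue = asyncio.Queue()
queue.put_nowait("already_waiting")  # request that was already queued
queue.put_nowait("deferred")         # item put back for the next batch

assert queue.get_nowait() == "already_waiting"
assert queue.get_nowait() == "deferred"  # comes out last, i.e. from the back
```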
Signed-off-by: abrar <abrar@anyscale.com>
python/ray/serve/batching.py
Outdated
```python
batch.append(self.queue.get_nowait())

# Put deferred item back in queue for next batch
if deferred_item is not None:
```
nit: we can put this block after the loop below (see the sketch after this snippet):

```python
while not self.queue.empty():
    next_item = self.queue.get_nowait()
    # Temporarily add to check size
    batch.append(next_item)
    new_size = self._compute_batch_size(batch)
    if new_size > max_batch_size:
        # Would exceed limit, remove it and save for later
        batch.pop()
        deferred_item = next_item
        break
    # Size is OK, keep it in the batch (already added above)
```
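A self-contained sketch of that arrangement, using a plain `asyncio.Queue` and a stand-in `compute_batch_size`; the `drain_batch` helper is illustrative, not the actual method in `batching.py`:

```python
import asyncio
from typing import Any, Callable, List, Optional


def drain_batch(
    queue: "asyncio.Queue[Any]",
    max_batch_size: int,
    compute_batch_size: Callable[[List[Any]], int],
) -> List[Any]:
    """Drain items until adding one more would exceed max_batch_size."""
    batch: List[Any] = []
    deferred_item: Optional[Any] = None

    while not queue.empty():
        next_item = queue.get_nowait()
        batch.append(next_item)  # temporarily add to check the size
        if compute_batch_size(batch) > max_batch_size:
            batch.pop()  # would exceed the limit; remove it and save for later
            deferred_item = next_item
            break

    # Per the nit above: the re-enqueue happens once, after the loop exits.
    if deferred_item is not None:
        queue.put_nowait(deferred_item)

    return batch
```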
Signed-off-by: abrar <abrar@anyscale.com>
```python
args, kwargs = recover_args(request.flattened_args)
# The batch function expects a single positional argument (the item)
# after 'self' has been extracted (if it was a method)
items.append(args[0])
```
Bug: Potential `IndexError` when using keyword arguments with `batch_size_fn`

The `_compute_batch_size` method assumes that requests are always passed as positional arguments, accessing `args[0]` without checking if `args` is empty. If a user defines a batch method with keyword-only parameters (e.g., `async def handle_batch(self, *, request)`) and calls it with keyword arguments, `recover_args` will return an empty `args` list, causing an `IndexError: list index out of range`. This would result in a confusing error message rather than a clear explanation that `batch_size_fn` requires the batched argument to be passed positionally.
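A minimal sketch of the failure mode and one possible guard; the `positional_call`/`keyword_call` stand-ins and the `extract_item` helper are illustrative, not taken from the PR:

```python
from typing import Any, Dict, List

# Illustrative stand-ins for the (args, kwargs) that recover_args returns for
# each call style; the real request objects in batching.py are assumed, not copied.
positional_call = ([{"value": 5}], {})          # handle_batch(item)
keyword_call = ([], {"request": {"value": 5}})  # handle_batch(request=item)


def extract_item(args: List[Any], kwargs: Dict[str, Any]) -> Any:
    # One possible guard: raise a descriptive error instead of a bare IndexError.
    if not args:
        raise ValueError(
            "batch_size_fn requires the batched argument to be passed "
            "positionally, not as a keyword argument."
        )
    return args[0]


print(extract_item(*positional_call))
# extract_item(*keyword_call)  # would raise the descriptive ValueError above
```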
fixes ray-project#58956

- [x] update documentation
- [x] add tests
- [x] add code examples and link to docs
- [x] compare performance with `len(batch)` baseline

### script

```python
from ray import serve
from typing import List


@serve.deployment(max_ongoing_requests=200)
class BatchSizeFnExample:
    @serve.batch(
        max_batch_size=50,
        batch_wait_timeout_s=0.5,
        # batch_size_fn=lambda items: len(items),
    )
    async def handle_batch(self, requests: List):
        return [req["value"] * 2 for req in requests]

    async def __call__(self, request):
        body = await request.json()
        return await self.handle_batch(body)


app = BatchSizeFnExample.bind()
```

### load test

```bash
# First, create a file with the JSON payload
echo '{"value": 5}' > /tmp/post_data.json

# Run Apache Bench: 1000 requests, 100 concurrent connections
ab -n 1000 -c 100 -p /tmp/post_data.json -T "application/json" http://localhost:8000/
```

### results

```
batch size 10

master
Requests per second:    317.68 [#/sec] (mean)
Time per request:       314.780 [ms] (mean)
Time per request:       3.148 [ms] (mean, across all concurrent requests)

PR
Requests per second:    307.80 [#/sec] (mean)
Time per request:       324.891 [ms] (mean)
Time per request:       3.249 [ms] (mean, across all concurrent requests)

batch size 50

master
Requests per second:    328.21 [#/sec] (mean)
Time per request:       304.684 [ms] (mean)
Time per request:       3.047 [ms] (mean, across all concurrent requests)

pr
Requests per second:    329.03 [#/sec] (mean)
Time per request:       303.922 [ms] (mean)
Time per request:       3.039 [ms] (mean, across all concurrent requests)
```

---------

Signed-off-by: abrar <abrar@anyscale.com>
Signed-off-by: peterxcli <peterxcli@gmail.com>
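Since the script above only shows the `len(items)` baseline (commented out), here is a sketch of a non-trivial custom size function; the token-counting metric, the 512 budget, and the `TokenBudgetedBatcher` name are illustrative, assuming the same `serve.batch` signature used in the script above:

```python
from typing import Dict, List

from ray import serve


def total_tokens(requests: List[Dict]) -> int:
    # Illustrative size metric: whitespace-separated tokens across the batch.
    return sum(len(req.get("text", "").split()) for req in requests)


@serve.deployment(max_ongoing_requests=200)
class TokenBudgetedBatcher:
    @serve.batch(
        max_batch_size=512,          # treated as a token budget by batch_size_fn
        batch_wait_timeout_s=0.5,
        batch_size_fn=total_tokens,
    )
    async def handle_batch(self, requests: List[Dict]) -> List[int]:
        return [len(req.get("text", "").split()) for req in requests]

    async def __call__(self, request):
        body = await request.json()
        return await self.handle_batch(body)


app = TokenBudgetedBatcher.bind()
```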