[grpc] Refactor openai module by CatherineSue · Pull Request #16511 · sgl-project/sglang

CatherineSue · 2026-01-05T18:08:40Z

Motivation

openai router module has the following problems:

route_responses() is 270 lines of inline logic (830-1097)

route_responses() in router.rs:
├── Metrics recording
├── Worker selection
├── Previous response chain loading (~30 lines)
├── Conversation history loading (~90 lines)
├── Payload transformation
├── Context building
└── Finally: dispatch to streaming/non-streaming

Compare to gRPC's route_responses_impl() (~50 lines):

// gRPC just validates and delegates
validate_worker_availability(&self.worker_registry, model);
if is_harmony {
    serve_harmony_responses(&harmony_ctx, body.clone()).await
} else {
    responses::route_responses(&self.responses_context, ...).await
}

handle_non_streaming_response() is inline in router.rs (371-491)

This 120-line function should be in a separate file like Harmony's non_streaming.rs.

Circular dependency between files:

accumulator.rs ──imports──> streaming.rs (extract_output_index, get_event_type)

These helpers should be in a common utilities file, not streaming.rs.

Mixed responsibilities:

File	What it has	What it SHOULD have
router.rs	Route + non-streaming handler + conversation loading	Just routing (like gRPC)
responses.rs	Tiny utils only (225 lines)	Could be the entry point
streaming.rs	Handlers + SSE helpers	Just handlers
mcp.rs	Non-streaming loop + streaming execution	Tool execution only

Modifications

1. Moved files from openai/ into openai/responses/ subdirectory
2. Extracted handle_non_streaming_response() from router.rs to responses/non_streaming.rs
3. Created common.rs with shared SSE helpers used by both streaming and accumulator
4. Removed unused re-exports of Provider, ProviderError, ProviderRegistry

Accuracy Tests

Benchmarking and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.
Follow the SGLang code style guidance.

Review Process

Ping Merge Oncalls to start the PR flow. See the PR Merge Process.
Get approvals from CODEOWNERS and other reviewers.
Trigger CI tests with comments (/tag-run-ci-label, /rerun-failed-ci, /tag-and-rerun-ci) or contact authorized users to do so.
After green CI and required approvals, ask Merge Oncalls to merge.

Key Changes 1. Moved files from openai/ into openai/responses/ subdirectory 2. Extracted handle_non_streaming_response() from router.rs to responses/non_streaming.rs 3. Created common.rs with shared SSE helpers used by both streaming and accumulator 4. Removed unused re-exports of Provider, ProviderError, ProviderRegistry 5. Updated documentation in openai_mcp_flow.md

gemini-code-assist · 2026-01-05T18:08:44Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

CatherineSue added 2 commits January 5, 2026 10:01

Fix cargo clippy

9b3e66f

CatherineSue requested review from key4ng and slin1237 as code owners January 5, 2026 18:08

github-actions bot added the model-gateway label Jan 5, 2026

CatherineSue added the run-ci label Jan 5, 2026

CatherineSue added 2 commits January 5, 2026 10:13

Simplify re-export

a8c8739

Tighten visibilities in openai/responses

cf90ede

slin1237 approved these changes Jan 5, 2026

View reviewed changes

slin1237 merged commit 5154140 into main Jan 5, 2026
68 of 69 checks passed

slin1237 deleted the chang/resp-refactor-4 branch January 5, 2026 19:39

jamesjxliu pushed a commit to jamesjxliu/sglang that referenced this pull request Jan 6, 2026

[grpc] Refactor openai module (sgl-project#16511)

77081f1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[grpc] Refactor openai module#16511

[grpc] Refactor openai module#16511
slin1237 merged 4 commits intomainfrom
chang/resp-refactor-4

CatherineSue commented Jan 5, 2026 •

edited

Loading

Uh oh!

gemini-code-assist bot commented Jan 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

CatherineSue commented Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Review Process

Uh oh!

gemini-code-assist bot commented Jan 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

CatherineSue commented Jan 5, 2026 •

edited

Loading