fix(pipeline): handle duplicate finish_reason chunks from OpenRouter by simon100500 · Pull Request #2403 · QwenLM/qwen-code

simon100500 · 2026-03-15T21:26:47Z

Problem

Some OpenRouter model providers (e.g. google/gemini-3.1-flash-lite-preview) send two consecutive SSE chunks with finish_reason: "tool_calls". The second chunk arrives after streamingToolCallParser.reset() has already been called, so it carries empty parts — no functionCall entries.

handleChunkMerging treated every finish chunk as authoritative and overwrote pendingFinishResponse with the empty duplicate, discarding the functionCall parts correctly assembled from the first finish chunk.

This caused processStreamResponse to see hasToolCall=false and throw:

Model stream ended with empty response text.
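
The failure can be reproduced in isolation. The sketch below is a simplified model, not the actual qwen-code code: the `Chunk` shape, the `overwriteOnFinish` and `hasToolCall` helpers, and the `read_file` tool name are all hypothetical and chosen only for illustration.

```typescript
// Hypothetical, simplified chunk shape; the real response type has more fields.
interface Part {
  text?: string;
  functionCall?: { name: string; args: Record<string, unknown> };
}
interface Chunk {
  finishReason?: string;
  parts: Part[];
}

// Models the old behavior: every finish chunk replaces the pending one.
function overwriteOnFinish(chunks: Chunk[]): Chunk | undefined {
  let pending: Chunk | undefined;
  for (const chunk of chunks) {
    if (chunk.finishReason !== undefined) {
      pending = chunk;
    }
  }
  return pending;
}

// True when at least one part of the chunk carries a functionCall.
function hasToolCall(chunk: Chunk | undefined): boolean {
  return chunk?.parts.some((p) => p.functionCall !== undefined) ?? false;
}

const stream: Chunk[] = [
  // First finish chunk: carries the assembled tool call.
  {
    finishReason: "tool_calls",
    parts: [{ functionCall: { name: "read_file", args: { path: "a.txt" } } }],
  },
  // Duplicate finish chunk: arrives after the parser reset, so parts is empty.
  { finishReason: "tool_calls", parts: [] },
];

const finalChunk = overwriteOnFinish(stream);
console.log(hasToolCall(finalChunk)); // false
```

Because the empty duplicate wins, the tool call from the first finish chunk is no longer visible to whatever inspects the final response.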

Fix

In handleChunkMerging: when a second finish chunk arrives and a pendingFinishResponse already exists, only merge usageMetadata (if present) and keep the candidates from the first finish chunk.

if (isFinishChunk) {
  if (hasPendingFinish) {
    // Duplicate finish chunk — keep candidates from first, merge only metadata
    const lastResponse = collectedGeminiResponses[...];
    if (response.usageMetadata) lastResponse.usageMetadata = response.usageMetadata;
    setPendingFinish(lastResponse);
  } else {
    collectedGeminiResponses.push(response);
    setPendingFinish(response);
  }
  return false;
}
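
For readers who want to run the idea outside the repository, here is a self-contained sketch of the same rule. It assumes simplified response types and wraps the state in a hypothetical `FinishChunkMerger` class; the handling of non-finish chunks and the meaning of the boolean return value are assumptions, not taken from the actual `handleChunkMerging`.

```typescript
// Hypothetical, simplified shapes; the real types in qwen-code are richer.
interface Part {
  text?: string;
  functionCall?: { name: string; args: Record<string, unknown> };
}
interface Candidate {
  finishReason?: string;
  content: { parts: Part[] };
}
interface UsageMetadata {
  totalTokenCount?: number;
}
interface StreamResponse {
  candidates: Candidate[];
  usageMetadata?: UsageMetadata;
}

class FinishChunkMerger {
  readonly collected: StreamResponse[] = [];
  private pendingFinish: StreamResponse | undefined;

  // Assumption: true means "emit this chunk now", false means "held as pending finish".
  handle(response: StreamResponse): boolean {
    const isFinishChunk = response.candidates.some(
      (c) => c.finishReason !== undefined,
    );
    if (!isFinishChunk) {
      this.collected.push(response);
      return true;
    }
    if (this.pendingFinish !== undefined) {
      // Duplicate finish chunk: keep the first chunk's candidates,
      // take only usageMetadata from the duplicate when it has any.
      if (response.usageMetadata !== undefined) {
        this.pendingFinish.usageMetadata = response.usageMetadata;
      }
      return false;
    }
    // First finish chunk: record it and hold it as the pending finish.
    this.collected.push(response);
    this.pendingFinish = response;
    return false;
  }

  get pending(): StreamResponse | undefined {
    return this.pendingFinish;
  }
}

const merger = new FinishChunkMerger();
merger.handle({
  candidates: [{
    finishReason: "tool_calls",
    content: { parts: [{ functionCall: { name: "read_file", args: {} } }] },
  }],
});
merger.handle({
  candidates: [{ finishReason: "tool_calls", content: { parts: [] } }],
  usageMetadata: { totalTokenCount: 42 },
});
console.log(merger.pending?.candidates[0].content.parts.length); // 1
```

The duplicate is never pushed to `collected`, so downstream consumers see exactly one finish response, now carrying the usage metadata from the later chunk.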

Testing

The existing pipeline.test.ts suite should cover regressions. A new test case can be added for the duplicate-finish-chunk scenario if desired.
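
A dedicated test for the duplicate-finish-chunk scenario could look roughly like the table-driven sketch below. It does not use the project's test framework or the real pipeline: `mergeFinishChunks` is a hypothetical stand-in for the fixed merging rule, and the `FinishChunk` shape, the `read_file` tool name, and the token count are made up for illustration.

```typescript
// Hypothetical, simplified shapes for illustration only.
interface Part {
  functionCall?: { name: string };
}
interface FinishChunk {
  parts: Part[];
  usage?: { totalTokens: number };
}

// Stand-in for the fixed rule: the first finish chunk wins,
// later finish chunks only contribute usage information.
function mergeFinishChunks(chunks: FinishChunk[]): FinishChunk | undefined {
  let pending: FinishChunk | undefined;
  for (const chunk of chunks) {
    if (pending === undefined) {
      pending = { ...chunk };
    } else if (chunk.usage !== undefined) {
      pending = { ...pending, usage: chunk.usage };
    }
  }
  return pending;
}

interface Case {
  name: string;
  chunks: FinishChunk[];
  expectToolCall: boolean;
  expectTokens?: number;
}

const toolCall: Part = { functionCall: { name: "read_file" } };
const cases: Case[] = [
  { name: "single finish chunk", chunks: [{ parts: [toolCall] }], expectToolCall: true },
  {
    name: "empty duplicate finish chunk",
    chunks: [{ parts: [toolCall] }, { parts: [] }],
    expectToolCall: true,
  },
  {
    name: "duplicate carrying usage",
    chunks: [{ parts: [toolCall] }, { parts: [], usage: { totalTokens: 42 } }],
    expectToolCall: true,
    expectTokens: 42,
  },
];

// Returns a description of every failed expectation; an empty array means all cases passed.
function runCases(): string[] {
  const failures: string[] = [];
  for (const c of cases) {
    const merged = mergeFinishChunks(c.chunks);
    const sawToolCall =
      merged?.parts.some((p) => p.functionCall !== undefined) ?? false;
    if (sawToolCall !== c.expectToolCall) {
      failures.push(`${c.name}: tool call mismatch`);
    }
    if (merged?.usage?.totalTokens !== c.expectTokens) {
      failures.push(`${c.name}: usage mismatch`);
    }
  }
  return failures;
}

console.log(runCases());
```

In the real suite the same three cases would feed chunks through the pipeline and assert on the final response, rather than calling a stand-in function.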


@qwen-code

…uter OpenRouter sends two SSE chunks with finish_reason=tool_calls. The second, empty chunk overwrote pendingFinishResponse and dropped the functionCall parts, so the SDK threw "Model stream ended with empty response text". Patch to handleChunkMerging: on a repeated finish chunk, keep the candidates from the first one and merge only usageMetadata.
- patches/@qwen-code+sdk+0.1.5.patch: persistent patch
- package.json: postinstall: patch-package
- patch-package added to devDependencies
- PR: QwenLM/qwen-code#2403
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Mingholy · 2026-03-16T07:44:36Z

Thanks for the contribution!
This is a core change and has some conflicts with #2404. I'm merging both into a single test branch to validate them. This may take some time, and the PR will be merged once validation is done!

simon100500 requested review from DennisYu07, DragonnZhang, LaZzyMan, Mingholy, gwinthis, pomelo-nwu and tanzhenxin as code owners March 15, 2026 21:26

github-actions bot mentioned this pull request Mar 16, 2026

📊 AI CLI Tools Community Daily Report 2026-03-16 gsscsd/big_model_radar#43

Open

Mingholy approved these changes Mar 16, 2026


Mingholy added the scope/content-generation AI content generation label Mar 16, 2026

Mingholy self-assigned this Mar 16, 2026

github-actions bot mentioned this pull request Mar 17, 2026

📊 AI CLI Tools Community Daily Report 2026-03-17 gsscsd/big_model_radar#50

Open

tanzhenxin linked an issue Mar 18, 2026 that may be closed by this pull request

qwen-code finishes session abruptly after trying to call a tool #2449

Closed

tanzhenxin merged commit a60fadd into QwenLM:main Mar 18, 2026
15 checks passed

This was referenced Mar 18, 2026

fix: prevent tool call loss from late-arriving names and duplicate finish chunks #2404

Open

fix: OpenAI API compliance for tool response format #2450

Open

This was referenced Mar 21, 2026

📊 AI CLI Tools Community Daily Report 2026-03-21 gsscsd/big_model_radar#70

Open

📊 AI CLI Tools Daily Bulletin 2026-03-23 compasify/agents-radar#75

Open

tanzhenxin merged 1 commit into QwenLM:main from simon100500:fix/duplicate-finish-chunk-tool-calls
