
Add detok in chat completion fn for non stream mode when VLLM_DETOKENIZE_ON_OPENAI_SERVER=true #1768

Merged
michalkuligowski merged 3 commits into habana_main from separk/detok_nonstream on Aug 29, 2025
Conversation

@shepark shepark commented Aug 18, 2025

Adding the part missing from #1741 for chat_completion_full_generator(), for the case when VLLM_DETOKENIZE_ON_OPENAI_SERVER=true.
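
For context, a minimal sketch of the kind of change this PR covers: when the flag is set, the non-streaming (full) chat-completion path detokenizes the generated token IDs on the OpenAI-compatible server before building the response. The flag handling, the CompletionOutput stand-in, and the maybe_detokenize helper below are illustrative assumptions, not the exact vllm-fork code.

```python
import os
from dataclasses import dataclass
from typing import List

# Illustrative flag check; the fork itself reads the env var through vLLM's config machinery.
DETOKENIZE_ON_SERVER = os.environ.get(
    "VLLM_DETOKENIZE_ON_OPENAI_SERVER", "false").lower() in ("1", "true")


@dataclass
class CompletionOutput:
    """Simplified stand-in for an engine output: token IDs plus (possibly empty) text."""
    token_ids: List[int]
    text: str = ""  # empty when the engine skipped detokenization


def maybe_detokenize(tokenizer, output: CompletionOutput) -> CompletionOutput:
    """Fill in output.text from its token IDs when detokenization was deferred
    to the server (hypothetical helper, for illustration only)."""
    if DETOKENIZE_ON_SERVER and not output.text:
        output.text = tokenizer.decode(output.token_ids, skip_special_tokens=True)
    return output
```

In the fork, the analogous detokenization step would sit inside chat_completion_full_generator() in serving_chat.py, mirroring what #1741 already added for the streaming path.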
@xuechendi

There are two files, serving_chat.py and serving_completion.py; does the other one also need the same change?

@shepark shepark (Author) commented Aug 18, 2025

There are two files, serving_chat.py and serving_completion.py; does the other one also need the same change?

Actually, it's needed only for our customer support for now.
But it would be better to have one PR cover all locations.
I will close this PR.

@shepark shepark closed this Aug 18, 2025
@shepark shepark reopened this Aug 25, 2025
@xuechendi

/run-gaudi-tests


@xuechendi xuechendi left a comment


Needed by a customer; OK to have it partially supported per the author's request.

@xuechendi

/run-gaudi-tests

@michalkuligowski michalkuligowski enabled auto-merge (squash) August 28, 2025 17:51
@michalkuligowski michalkuligowski merged commit 06a6d5d into habana_main Aug 29, 2025
47 checks passed
@michalkuligowski michalkuligowski deleted the separk/detok_nonstream branch August 29, 2025 05:55