Embedding fix: warmup failure in embedding model by shepark · Pull Request #1510 · HabanaAI/vllm-fork

shepark · 2025-07-02T03:03:50Z

Fix the failures at warmup stage in pooling mode

--
due to.
[rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line 2904, in warmup_model
[rank0]: self.warmup_graphs(
[rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line 2714, in warmup_graphs
[rank0]: self.warmup_scenario(batch_size,
[rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line 2561, in warmup_scenario
[rank0]: inputs = self.prepare_model_input_align_worker(
[rank0]: File "/wm/vllm-fork/vllm/worker/model_runner_base.py", line 233, in prepare_model_input_align_worker
[rank0]: raise NotImplementedError
[rank0]: NotImplementedError

kdamaszk · 2025-07-03T07:44:17Z

@shepark please fix pre-commit

shepark · 2025-07-03T19:39:02Z

@kdamaszk I fixed previous pre-commit failure related to return type.
But, this time there's new pre-commit failure in pip-compile section.
I ran same pre-commit in local, and no issue found. (python 3.10)
I think the failure here is related to python version in CI, the version we run currently is 3.10 even 1.22.
So, we still need modules in test.txt changes.
Can you check it?
cc: @libinta

madamczyk-intel · 2025-07-08T05:33:30Z

/run-gaudi-tests

michalkuligowski

Triggered rerun for failing gsm8k_small_g3_tp1_fp8 Test Failed!

michalkuligowski · 2025-07-09T07:01:24Z

/run-gaudi-tests

Fix the failures at warmup stage in pooling mode -- due to. [rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line 2904, in warmup_model [rank0]: self.warmup_graphs( [rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line 2714, in warmup_graphs [rank0]: self.warmup_scenario(batch_size, [rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line 2561, in warmup_scenario [rank0]: inputs = self.prepare_model_input_align_worker( [rank0]: File "/wm/vllm-fork/vllm/worker/model_runner_base.py", line 233, in prepare_model_input_align_worker [rank0]: raise NotImplementedError [rank0]: NotImplementedError Co-authored-by: Libin Tang <litang@habana.ai>

Merge changes from habana_main for embedding fix #1510 ---- details ---- Fix the failures at warmup stage in pooling mode -- due to. [rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line 2904, in warmup_model [rank0]: self.warmup_graphs( [rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line 2714, in warmup_graphs [rank0]: self.warmup_scenario(batch_size, [rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line 2561, in warmup_scenario [rank0]: inputs = self.prepare_model_input_align_worker( [rank0]: File "/wm/vllm-fork/vllm/worker/model_runner_base.py", line 233, in prepare_model_input_align_worker [rank0]: raise NotImplementedError [rank0]: NotImplementedError Co-authored-by: Libin Tang <litang@habana.ai> Co-authored-by: Michał Kuligowski <mkuligowski@habana.ai>

shepark force-pushed the dev/shepark/fix_warmup_in_pooling branch 2 times, most recently from a24d250 to 3518a32 Compare July 2, 2025 16:34

libinta marked this pull request as ready for review July 3, 2025 04:35

libinta requested review from PatrykWo, afierka-intel, jikunshang, kzawora-intel, madamczyk-intel, mgawarkiewicz-intel, michalkuligowski, mswiniarsk, vivekgoe and xuechendi as code owners July 3, 2025 04:36

libinta changed the title ~~Fix warmup failure in pooling mode~~ Embedding fix: warmup failure in embedding model Jul 3, 2025

shepark force-pushed the dev/shepark/fix_warmup_in_pooling branch 2 times, most recently from 126d667 to 7941fe3 Compare July 3, 2025 18:05

shepark force-pushed the dev/shepark/fix_warmup_in_pooling branch from 7941fe3 to 92e33d0 Compare July 7, 2025 03:06

Fix warmup failure in pooling mode

2dd0233

shepark force-pushed the dev/shepark/fix_warmup_in_pooling branch from 92e33d0 to 2dd0233 Compare July 7, 2025 03:40

kdamaszk reviewed Jul 7, 2025

View reviewed changes

Comment thread vllm/worker/hpu_model_runner.py

michalkuligowski reviewed Jul 7, 2025

View reviewed changes

Comment thread vllm/worker/hpu_pooling_model_runner.py

Merge branch 'habana_main' into dev/shepark/fix_warmup_in_pooling

e034222

michalkuligowski reviewed Jul 8, 2025

View reviewed changes

michalkuligowski approved these changes Jul 8, 2025

View reviewed changes

michalkuligowski enabled auto-merge (squash) July 8, 2025 11:29

Merge branch 'habana_main' into dev/shepark/fix_warmup_in_pooling

764f5ac

michalkuligowski merged commit 3d16ae3 into habana_main Jul 9, 2025
53 checks passed

michalkuligowski deleted the dev/shepark/fix_warmup_in_pooling branch July 9, 2025 10:01

shepark mentioned this pull request Jul 9, 2025

Embedding fix: warmup failure in embedding model (#1510) #1557

Closed

shepark mentioned this pull request Jul 9, 2025

Embedding fix: warmup failure in embedding model (#1510) #1559

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Embedding fix: warmup failure in embedding model#1510

Embedding fix: warmup failure in embedding model#1510
michalkuligowski merged 3 commits intohabana_mainfrom
dev/shepark/fix_warmup_in_pooling

shepark commented Jul 2, 2025 •

edited by github-actions Bot

Loading

Uh oh!

kdamaszk commented Jul 3, 2025

Uh oh!

shepark commented Jul 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

madamczyk-intel commented Jul 8, 2025

Uh oh!

michalkuligowski left a comment

Uh oh!

michalkuligowski commented Jul 9, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

shepark commented Jul 2, 2025 • edited by github-actions Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kdamaszk commented Jul 3, 2025

Uh oh!

shepark commented Jul 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

madamczyk-intel commented Jul 8, 2025

Uh oh!

michalkuligowski left a comment

Choose a reason for hiding this comment

Uh oh!

michalkuligowski commented Jul 9, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

shepark commented Jul 2, 2025 •

edited by github-actions Bot

Loading

shepark commented Jul 3, 2025 •

edited

Loading