Skip to content

Embedding fix: warmup failure in embedding model#1510

Merged
michalkuligowski merged 3 commits intohabana_mainfrom
dev/shepark/fix_warmup_in_pooling
Jul 9, 2025
Merged

Embedding fix: warmup failure in embedding model#1510
michalkuligowski merged 3 commits intohabana_mainfrom
dev/shepark/fix_warmup_in_pooling

Conversation

@shepark
Copy link
Copy Markdown

@shepark shepark commented Jul 2, 2025

Fix the failures at warmup stage in pooling mode

--
due to.
[rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line 2904, in warmup_model
[rank0]: self.warmup_graphs(
[rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line 2714, in warmup_graphs
[rank0]: self.warmup_scenario(batch_size,
[rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line 2561, in warmup_scenario
[rank0]: inputs = self.prepare_model_input_align_worker(
[rank0]: File "/wm/vllm-fork/vllm/worker/model_runner_base.py", line 233, in prepare_model_input_align_worker
[rank0]: raise NotImplementedError
[rank0]: NotImplementedError

@shepark shepark force-pushed the dev/shepark/fix_warmup_in_pooling branch 2 times, most recently from a24d250 to 3518a32 Compare July 2, 2025 16:34
@libinta libinta marked this pull request as ready for review July 3, 2025 04:35
@libinta libinta changed the title Fix warmup failure in pooling mode Embedding fix: warmup failure in embedding model Jul 3, 2025
@kdamaszk
Copy link
Copy Markdown

kdamaszk commented Jul 3, 2025

@shepark please fix pre-commit

@shepark shepark force-pushed the dev/shepark/fix_warmup_in_pooling branch 2 times, most recently from 126d667 to 7941fe3 Compare July 3, 2025 18:05
@shepark
Copy link
Copy Markdown
Author

shepark commented Jul 3, 2025

@kdamaszk I fixed previous pre-commit failure related to return type.
But, this time there's new pre-commit failure in pip-compile section.
I ran same pre-commit in local, and no issue found. (python 3.10)
I think the failure here is related to python version in CI, the version we run currently is 3.10 even 1.22.
So, we still need modules in test.txt changes.
Can you check it?
cc: @libinta

@shepark shepark force-pushed the dev/shepark/fix_warmup_in_pooling branch from 7941fe3 to 92e33d0 Compare July 7, 2025 03:06
@shepark shepark force-pushed the dev/shepark/fix_warmup_in_pooling branch from 92e33d0 to 2dd0233 Compare July 7, 2025 03:40
Comment thread vllm/worker/hpu_model_runner.py
Comment thread vllm/worker/hpu_pooling_model_runner.py
@madamczyk-intel
Copy link
Copy Markdown

/run-gaudi-tests

Copy link
Copy Markdown

@michalkuligowski michalkuligowski left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Triggered rerun for failing gsm8k_small_g3_tp1_fp8 Test Failed!

@michalkuligowski michalkuligowski enabled auto-merge (squash) July 8, 2025 11:29
@michalkuligowski
Copy link
Copy Markdown

/run-gaudi-tests

@michalkuligowski michalkuligowski merged commit 3d16ae3 into habana_main Jul 9, 2025
53 checks passed
@michalkuligowski michalkuligowski deleted the dev/shepark/fix_warmup_in_pooling branch July 9, 2025 10:01
shepark added a commit that referenced this pull request Jul 9, 2025
Fix the failures at warmup stage in pooling mode

--
due to.
[rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line
2904, in warmup_model
[rank0]: self.warmup_graphs(
[rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line
2714, in warmup_graphs
[rank0]: self.warmup_scenario(batch_size,
[rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line
2561, in warmup_scenario
[rank0]: inputs = self.prepare_model_input_align_worker(
[rank0]: File "/wm/vllm-fork/vllm/worker/model_runner_base.py", line
233, in prepare_model_input_align_worker
[rank0]: raise NotImplementedError
[rank0]: NotImplementedError

Co-authored-by: Libin Tang <litang@habana.ai>
shepark added a commit that referenced this pull request Jul 9, 2025
Fix the failures at warmup stage in pooling mode

--
due to.
[rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line
2904, in warmup_model
[rank0]: self.warmup_graphs(
[rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line
2714, in warmup_graphs
[rank0]: self.warmup_scenario(batch_size,
[rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line
2561, in warmup_scenario
[rank0]: inputs = self.prepare_model_input_align_worker(
[rank0]: File "/wm/vllm-fork/vllm/worker/model_runner_base.py", line
233, in prepare_model_input_align_worker
[rank0]: raise NotImplementedError
[rank0]: NotImplementedError

Co-authored-by: Libin Tang <litang@habana.ai>
wpyszka pushed a commit that referenced this pull request Jul 22, 2025
Merge changes from habana_main for embedding fix
#1510

---- details ----
Fix the failures at warmup stage in pooling mode

--
due to.
[rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line
2904, in warmup_model
[rank0]: self.warmup_graphs(
[rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line
2714, in warmup_graphs
[rank0]: self.warmup_scenario(batch_size,
[rank0]: File "/wm/vllm-fork/vllm/worker/hpu_model_runner.py", line
2561, in warmup_scenario
[rank0]: inputs = self.prepare_model_input_align_worker( [rank0]: File
"/wm/vllm-fork/vllm/worker/model_runner_base.py", line 233, in
prepare_model_input_align_worker
[rank0]: raise NotImplementedError
[rank0]: NotImplementedError

Co-authored-by: Libin Tang <litang@habana.ai>
Co-authored-by: Michał Kuligowski <mkuligowski@habana.ai>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants