Skip to content

[SPEC_DECODE][V0] fix for spec decode eagle after rebase#1150

Merged
xuechendi merged 4 commits intohabana_mainfrom
dev/rebase_spec_decode_fix
Apr 30, 2025
Merged

[SPEC_DECODE][V0] fix for spec decode eagle after rebase#1150
xuechendi merged 4 commits intohabana_mainfrom
dev/rebase_spec_decode_fix

Conversation

@xuechendi
Copy link
Copy Markdown

@xuechendi xuechendi commented Apr 24, 2025

Now all spec decode tests are passed.

This PR not only fixed the rebase error, also refactored spec decode impl, includes

  1. remove unnecessary change to common codes: batch_expansion.py
  2. remove hpu_draft_model_runner.py and fix the previous_hidden_states in hpu_model_runner

image
image
image

Signed-off-by: Chendi Xue <chendi.xue@intel.com>
Signed-off-by: Chendi Xue <chendi.xue@intel.com>
@xuechendi
Copy link
Copy Markdown
Author

@michalkuligowski , please check. I have verified in my own env.
Meanwhile, I have started spec_decode_upstream PR, so the code change here is aligned with upstream PR.
I reverted HPU if_statement in common shared spec decode path, so future rebase can be easier.

Comment thread vllm/model_executor/layers/vocab_parallel_embedding.py
Comment thread vllm/spec_decode/batch_expansion.py
Comment thread vllm/spec_decode/draft_model_runner.py
Comment thread vllm/worker/hpu_model_runner.py
Comment thread vllm/worker/hpu_model_runner.py
Comment thread vllm/worker/hpu_model_runner.py
Comment thread vllm/worker/hpu_model_runner.py
Comment thread vllm/worker/model_runner_base.py
Comment thread vllm/worker/hpu_worker.py
@michalkuligowski
Copy link
Copy Markdown

/run-gaudi-tests

@xuechendi xuechendi closed this Apr 24, 2025
@xuechendi xuechendi reopened this Apr 24, 2025
@xuechendi
Copy link
Copy Markdown
Author

xuechendi commented Apr 24, 2025

@michalkuligowski , I saw failed UT seems went timeout because the shutdown blocked the resource release
image
image

@xuechendi xuechendi enabled auto-merge (squash) April 25, 2025 16:22
@michalkuligowski
Copy link
Copy Markdown

/run-gaudi-tests

@michalkuligowski
Copy link
Copy Markdown

I retriggered failed tests

@michalkuligowski
Copy link
Copy Markdown

/skip-gaudi-tests due to false negative in mulimodal tests

@xuechendi xuechendi merged commit 2a06ffa into habana_main Apr 30, 2025
39 of 41 checks passed
@xuechendi xuechendi deleted the dev/rebase_spec_decode_fix branch April 30, 2025 14:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants