TimeoutError during startup with Pipeline Parallelism (PP) on Qwen3-8B #245

I am testing Qwen3-8B with vLLM and kvcached on a 2-GPU node. With Pipeline Parallelism (PP) enabled (e.g., PP=2, TP=1), the engine fails to initialize and raises a TimeoutError: the KVCacheManager does not detect KV tensor creation within its 10-second threshold.

<img width="2388" height="393" alt="Image" src="https://github.com/user-attachments/assets/9226b214-8d9e-4410-9d63-7dca2a9e6ad3" />
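
For anyone triaging, the failure looks like a poll-with-deadline check expiring before the KV tensors show up. Below is a minimal sketch of that pattern. It is not kvcached's actual code: `wait_for_kv_tensors`, `is_created`, and the polling interval are hypothetical names chosen for illustration, and only the 10-second threshold comes from the error reported above.

```python
import time
from typing import Callable


def wait_for_kv_tensors(
    is_created: Callable[[], bool],
    timeout_s: float = 10.0,        # threshold reported in this issue
    poll_interval_s: float = 0.01,  # assumed value, for illustration only
) -> float:
    """Poll until is_created() returns True and return the seconds waited.

    Raises TimeoutError if the condition is still false once timeout_s
    has elapsed. Hypothetical helper, not part of kvcached or vLLM.
    """
    start = time.monotonic()
    deadline = start + timeout_s
    while True:
        if is_created():
            return time.monotonic() - start
        if time.monotonic() >= deadline:
            raise TimeoutError(
                f"KV tensors not created within {timeout_s} seconds"
            )
        time.sleep(poll_interval_s)
```

If the real check works this way, a pipeline stage that creates its KV tensors late, or never registers them, would hit the deadline regardless of how long the rest of startup takes. That is an assumption about the cause, not something confirmed here.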
