-
Notifications
You must be signed in to change notification settings - Fork 4.8k
Closed
Labels
Description
Checklist
- 1. If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose Otherwise, it will be closed.
- 2. Please use English, otherwise it will be closed.
Motivation
i test, find it seems like that pipeline parallel with mixed chunk worked well?
so why assert this:
if self.pp_size > 1: assert ( self.disable_overlap_schedule and self.speculative_algorithm is None and not self.enable_mixed_chunk ), "Pipeline parallelism is not compatible with overlap schedule, speculative decoding, mixed chunked prefill."
Related resources
No response
Reactions are currently unavailable