Skip to content

[Feature] pipeline parallel with mixed chunk #10589

@MichoChan

Description

@MichoChan

Checklist

Motivation

i test, find it seems like that pipeline parallel with mixed chunk worked well?
so why assert this:
if self.pp_size > 1: assert ( self.disable_overlap_schedule and self.speculative_algorithm is None and not self.enable_mixed_chunk ), "Pipeline parallelism is not compatible with overlap schedule, speculative decoding, mixed chunked prefill."

Related resources

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions