Skip to content

[Feature] Fully Overlap with spec v2 + Constrained Decoding #13019

@JustinTong0323

Description

@JustinTong0323

Required when tool_choice="required"

related: #13106

Demonstration

Naive version (#13425) has been merged. Here is a simple tutorial about how overlap scheduling works with constrained decoding.

Normal Overlap Scheduling

Image

Delay Sampling + Overlap Ccheduling

Image

Advanced Delay Launching + Overlap Scheduling

Image

Metadata

Metadata

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions