Skip to content

[Feature] 2:4 sparse marlin #13597

@hjlee1371

Description

@hjlee1371

Checklist

Motivation

Hello, are there any plans to integrate 2:4 sparse marlin (w4a16) into sglang?

It seems to have been removed in #10750 , and I'm asking because I couldn't find it in the quantization roadmap #8180. Thank you.

Related resources

No response

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions