-
Notifications
You must be signed in to change notification settings - Fork 4.8k
Closed
Description
Checklist
- 1. If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose Otherwise, it will be closed.
- 2. Please use English, otherwise it will be closed.
Motivation
Tencent released this new model:
https://huggingface.co/tencent/Hunyuan-A13B-Instruct
It matches bigger models on benchmarks. It has a decent size to run locally and the MoE architecture should make it pretty fast.
It has 256K context too.
The tencent team released a docker version compatible with sglang, but it would be nice if native support was added.
Related resources
No response
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels