HPZ10

Follow

HPZ10

Follow

3 followers · 57 following

Pinned Loading

CARE-COVARIANCE-AWARE-AND-RANK-ENHANCED-DECOMPOSITION-FOR-ENABLING-MULTI-HEAD-LATENT-ATTENTION- CARE-COVARIANCE-AWARE-AND-RANK-ENHANCED-DECOMPOSITION-FOR-ENABLING-MULTI-HEAD-LATENT-ATTENTION- Public

Forked from FutureMLS-Lab/CARE

CARE: Covariance-Aware and Rank-Enhanced Decomposition for Enabling Multi-Head Latent Attention

Python
SpecForge SpecForge Public

Forked from sgl-project/SpecForge

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 1