7 of 8 issues completed
Checklist
- 1. If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose Otherwise, it will be closed.
- 2. Please use English, otherwise it will be closed.
Features
CY25H2
- Overlapped LoRA updates (Support overlapped lora updates #8213) @lifuhuang
- Compatibility with radix attention ([Bug] Why can't I use multi-lora adapter and radix attention together? #2880, [Feature] Further support for Lora Radix Cache #9144, Support radix cache for Lora feature #7216) @Fridge003
- Adapter GPU pinning ([Feature] LRU Eviction Strategy for Lora Adapters: Evicting Adapters with Priority #8053, Support GPU pinning for LoRA #8697, Support pinning adapter via server args. #9249) @lifuhuang
- LRU cache support for the LoRA memory pool ([Feature] LRU Eviction Strategy for Lora Adapters: Evicting Adapters with Priority #8053, Implement LRU eviction policy for LoRA adapters #11041) @ConnorLi96
- FlashInfer deprecation ([Refactor] Deprecate FlashInfer lora backend #7809) @lifuhuang
- Perf: LoRA batch preparation optimization ([Perf] Speed up LoRA Batch Initialization #6961) @lifuhuang @Fridge003
- Perf: kernel optimization ([Perf] LoRA Kernel benchmark & optimization #9040, [Feature] Cutlass kernels for LoRA #7910, [2/4] Introduce Chunked-SGMV kernels and corresponding LoRA backend for improved performance #10286) @Qiaolin-Yu @Fridge003 @lifuhuang
- Perf: async LoRA prefetch ([Feature] Asynchronous LoRA prefetch #8712)
- Support LoRA for speculative decoding @ConnorLi96
- Support LoRA for embedding layers ([Feature] Add LoRA support for embedding layers #14177)
- Support LoRA for MoE layers ([Feature] Comprehensive LoRA Adapter Support for MOE Models: Including Expert Weights Integration #9897, [Feature] Comprehensive LoRA Adapter Support for MOE Models #11894) @ConnorLi96
- Unified paging, i.e. support for LoRA adapters with different ranks ([Feature] Support unified paging in multi-lora serving #3647) @Sunt-ing @jcbjcbjc
- OpenAI-compatible API ([Feature] OpenAI compatible API in LoRA #11551, [FEATURE] Add OpenAI-Compatible LoRA Adapter Selection #11570) @ConnorLi96 @neelabhsinha
- LRU offloading ([Feature] Optimize LoRA Loading Mechanism to Decouple User Limits from CPU Memory Constraints #10266)
- Support PDL for LoRA shrink & expand kernels ([LoRA] Add PDL to LoRA shrink and expand #14346, https://www.databricks.com/blog/fast-peft-serving-scale)
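The GPU-pinning and LRU-eviction items above combine naturally: pinned adapters stay resident while unpinned ones are evicted in least-recently-used order. A minimal sketch of that policy is below; the `AdapterPool` class and its method names are illustrative assumptions for this roadmap, not SGLang's actual implementation.

```python
from collections import OrderedDict
from typing import Optional


class AdapterPool:
    """Illustrative LRU pool for LoRA adapter slots.

    Pinned adapters are never evicted; unpinned adapters are evicted
    in least-recently-used order when the pool is full.
    (Sketch only -- not SGLang's LoRAMemoryPool.)
    """

    def __init__(self, capacity: int):
        self.capacity = capacity
        self._slots: "OrderedDict[str, bool]" = OrderedDict()  # name -> pinned?

    def acquire(self, name: str, pin: bool = False) -> Optional[str]:
        """Ensure `name` is resident; return the evicted adapter's name, if any."""
        if name in self._slots:
            self._slots.move_to_end(name)  # mark as most recently used
            self._slots[name] = self._slots[name] or pin
            return None
        evicted = None
        if len(self._slots) >= self.capacity:
            # Scan from the LRU end for the first unpinned victim.
            victim = next(
                (k for k, pinned in self._slots.items() if not pinned), None
            )
            if victim is None:
                raise RuntimeError("all slots are pinned; cannot evict")
            del self._slots[victim]
            evicted = victim
        self._slots[name] = pin
        return evicted
```

With a two-slot pool, pinning `base` and then loading two other adapters evicts only the unpinned one, which is the behavior the pinning feature is meant to guarantee.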
CY25H1
- Triton kernel & benchmark ([Feature] Define backends and add Triton backend for Lora #3161) @Fridge003
- Dynamic load/unload (Refactor LoRAManager and LoRAMemoryPool state management logic for dynamic LoRA loading support #7412, Support dynamic LoRA loading / unloading in engine/server API #7446) @lifuhuang @Fridge003
- Accuracy alignment ([Bug] HuggingFace and SGLang inference don't match #2671, [Fix] Fix accuracy bug and refactor codes for lora #3413) @Fridge003
- Test case enhancement ([Feature] Test case enhancement for Lora features #3414, [Fix] Fix bugs and refactor codes in lora for better scalability. #3652, [Feature] add multi-rank support for Lora #4492, [Fix] Improve Lora tests and reduce CI runtime #4925) @aoshen524 @jcbjcbjc
- Support multi-rank adapters ([Feature] add multi-rank support for Lora #4492) @jcbjcbjc
- Support tensor parallelism ([Bug] tensor_model_parallel_all_reduce' is not defined #2931, [Feature] Support Tensor Parallelism and Weight Slicing for Lora #4274) @aoshen524
- Compatibility with CUDA graph ([Feature] Support compatibility between Cuda Graph and Lora #3282, Feat: support cuda graph for LoRA #4115) @Qiaolin-Yu @Beichen-Ma
- Support Phi-4-MM ([Feature] Phi-4-MM support #6544) @lifuhuang
- Documentation (Add document for LoRA serving #5521) @Fridge003
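The dynamic load/unload item above amounts to runtime state management: the server must register a new adapter, reject invalid transitions, and resolve adapter names on each request. A minimal registry sketch is below; the `LoRARegistry` class and its method names are hypothetical, not SGLang's actual LoRAManager API.

```python
from typing import Dict


class LoRARegistry:
    """Minimal sketch of dynamic adapter load/unload state management.

    Tracks which adapters are registered and rejects invalid transitions
    (double-load, unload of an unknown adapter).
    Illustrative only -- not SGLang's LoRAManager.
    """

    def __init__(self):
        self._paths: Dict[str, str] = {}  # adapter name -> weight path

    def load(self, name: str, path: str) -> None:
        if name in self._paths:
            raise ValueError(f"adapter {name!r} is already loaded")
        # A real server would fetch and validate the weights here.
        self._paths[name] = path

    def unload(self, name: str) -> None:
        if name not in self._paths:
            raise ValueError(f"adapter {name!r} is not loaded")
        del self._paths[name]

    def resolve(self, name: str) -> str:
        """Return the weight path used to serve requests for `name`."""
        if name not in self._paths:
            raise KeyError(f"unknown adapter {name!r}")
        return self._paths[name]
```

The point of the refactor tracked in #7412 was to make this kind of state transition explicit so that loading and unloading can happen safely while the engine is serving traffic.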
Related resources