Add pipeline parallelism for Qwen2 and Qwen3 Model#6250
Add pipeline parallelism for Qwen2 and Qwen3 Model#6250zhyncs merged 14 commits intosgl-project:mainfrom
Conversation
|
After change the model to Qwen/Qwen3-8B, here is the result for the pipeline parallelism test cases. |
|
@Ying1123 This PR can be reviewed quickly, thanks! |
|
also test for the Qwen3-30B-A3B model |
|
this is a following improve for the pp impl. We want to test the Qwen model for the PP |
|
ping @zhyncs , @merrymercy if have time for this small PR. |
Ying1123
left a comment
There was a problem hiding this comment.
HI @libratiger, thanks for the PR. Could you also add an accuracy test for PP on these models? Also, could you resolve the conflicts and pass the CI tests?
I fix the conflicts and add a new accuracy test case as suggested。 Here is the result: |
|
In the previous CI result, I just noticed some flaky failed test is timeout on DeepSeekV3 model. |
|
@libratiger Great work. let me rerun the CI and review it. Do not need to rebase on your own unless we ask. thank1 |
Motivation
Modifications
Checklist