[NV] Add GB200 MegaMOE throughput curve recipes by alec-flowers · Pull Request #1223 · SemiAnalysisAI/InferenceX

alec-flowers · 2026-04-29T06:21:54Z

Summary

Add the validated GB200 Dynamo vLLM MegaMOE mid-curve recipe at conc=256/512/1024.
Rename and update the validated 2P/1D DEP8 MegaMOE high-throughput recipe at conc=4096.
Remove the old no-MegaMOE max-tpt recipe from the GB200 Dynamo vLLM matrix.

Validation

python3 utils/matrix_logic/generate_sweep_configs.py full-sweep --config-files .github/configs/nvidia-master.yaml --framework dynamo-vllm --model-prefix dsv4 --runner-type gb200
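
For anyone scripting this validation step, the command above can be assembled as an argument list. This is a minimal sketch, not part of the PR: the helper name `build_sweep_command` and its keyword parameters are hypothetical, and only the script path, subcommand, and flags are taken from the command above.

```python
import shlex


def build_sweep_command(
    config_file=".github/configs/nvidia-master.yaml",
    framework="dynamo-vllm",
    model_prefix="dsv4",
    runner_type="gb200",
):
    """Return the validation command as an argv list.

    Hypothetical helper: the defaults mirror the values in the PR's
    validation command; nothing here inspects the repository itself.
    """
    return [
        "python3",
        "utils/matrix_logic/generate_sweep_configs.py",
        "full-sweep",
        "--config-files", config_file,
        "--framework", framework,
        "--model-prefix", model_prefix,
        "--runner-type", runner_type,
    ]


if __name__ == "__main__":
    # Print the shell-quoted command line for copy/paste or logging.
    print(shlex.join(build_sweep_command()))
```

Assuming the working directory is the repository root, the list could be passed to `subprocess.run(build_sweep_command(), check=True)`. Using an argv list instead of a single string avoids shell quoting issues if a different config path is substituted.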

github-actions · 2026-04-29T06:22:02Z

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes are similar to the official vLLM recipes and/or the SGLang cookbook.

If it is not, please create a PR first before we can merge your PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow

As a rule of thumb, generally, PR authors should request a review & get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers.

If additional help is needed, PR authors can reach out to core maintainers over Slack.

alec-flowers requested a review from a team April 29, 2026 06:21

alec-flowers requested review from jgangani and kedarpotdar-nv as code owners April 29, 2026 06:21

github-project-automation Bot added this to InferenceMAX Board Apr 29, 2026

alec-flowers force-pushed the codex/inferencex-gb200-megamoe-curves branch from ea7a22c to 6d44e60 April 29, 2026 06:22

Add GB200 MegaMOE throughput curve recipes

750e6dd

alec-flowers force-pushed the codex/inferencex-gb200-megamoe-curves branch from 6d44e60 to 750e6dd April 29, 2026 06:28

alec-flowers added the full-sweep-enabled label Apr 29, 2026

claude Bot reviewed Apr 29, 2026

Comment thread perf-changelog.yaml Outdated

Trigger GB200 MegaMOE sweep rerun

45acc7e

alec-flowers wants to merge 2 commits into main from codex/inferencex-gb200-megamoe-curves
