Commit 0f7e8e8
File tree
- 3rdparty
  - Megatron-Bridge-workspace
  - Megatron-LM-workspace
- docs
  - guides
- examples/configs
  - recipes/llm
- nemo_rl
  - algorithms
  - data/datasets/response_datasets
  - environments
  - models
    - generation
      - vllm
    - policy
  - utils
- tests
  - test_suites
    - llm
  - unit
    - algorithms
    - models
      - generation
      - policy
- tools
Submodule Megatron-Bridge updated 70 files
- .github/workflows/cicd-main.yml (-5)
- README.md (+1 -1)
- docs/models/vlm/index.md (-1)
- docs/models/vlm/nemotron-nano-v2-vl.md (-192)
- examples/conversion/compare_hf_and_megatron/compare.py (+2 -4)
- examples/conversion/hf_megatron_roundtrip.py (+1 -1)
- pyproject.toml (-2)
- scripts/performance/configs/deepseek/deepseek_v3_llm_pretrain.yaml (-3)
- scripts/performance/configs/llama3/llama3_70b_llm_pretrain.yaml (-3)
- scripts/performance/configs/llama3/llama3_8b_llm_pretrain.yaml (-3)
- scripts/performance/configs/llama31/llama31_405b_llm_pretrain.yaml (-3)
- scripts/performance/configs/qwen3/qwen3_235b_a22b_llm_pretrain.yaml (-3)
- scripts/performance/configs/qwen3/qwen3_30b_a3b_llm_pretrain.yaml (-3)
- src/megatron/bridge/data/loaders.py (+2 -3)
- src/megatron/bridge/models/conversion/auto_bridge.py (+17 -11)
- src/megatron/bridge/models/conversion/param_mapping.py (+5 -193)
- src/megatron/bridge/models/conversion/utils.py (+30 -7)
- src/megatron/bridge/models/deepseek/deepseek_provider.py (-2)
- src/megatron/bridge/models/gemma/gemma3_provider.py (+4 -1)
- src/megatron/bridge/models/gpt_oss/gpt_oss_bridge.py (+1)
- src/megatron/bridge/models/gpt_provider.py (+6 -7)
- src/megatron/bridge/models/llama/llama_provider.py (+8 -10)
- src/megatron/bridge/models/mamba/mamba_provider.py (+1 -14)
- src/megatron/bridge/models/model_provider.py (+11 -12)
- src/megatron/bridge/models/nemotron/nemotron_provider.py (+4 -3)
- src/megatron/bridge/models/nemotronh/nemotron_h_bridge.py (-1)
- src/megatron/bridge/models/nemotronh/nemotron_h_provider.py (+3 -4)
- src/megatron/bridge/models/qwen/__init__.py (-5)
- src/megatron/bridge/models/qwen/qwen3_next_bridge.py (-222)
- src/megatron/bridge/models/qwen/qwen_provider.py (+1 -70)
- src/megatron/bridge/recipes/nemotronh/nemotronh.py (+2 -2)
- src/megatron/bridge/recipes/qwen/qwen3_moe.py (+2)
- src/megatron/bridge/training/config.py (+10 -13)
- src/megatron/bridge/training/initialize.py (+1 -18)
- src/megatron/bridge/training/utils/train_utils.py (-10)
- src/megatron/bridge/utils/fusions.py (+106 -12)
- tests/functional_tests/data/test_loaders.py (-3)
- tests/functional_tests/models/test_deepseek_conversion.py (+3 -20)
- tests/functional_tests/models/test_gemma2_conversion.py (+12 -2)
- tests/functional_tests/models/test_gemma2_provider.py (+10 -1)
- tests/functional_tests/models/test_gemma3_conversion.py (+11 -3)
- tests/functional_tests/models/test_gemma3_provider.py (+7 -1)
- tests/functional_tests/models/test_gemma_conversion.py (+12 -2)
- tests/functional_tests/models/test_gemma_provider.py (+10 -1)
- tests/functional_tests/models/test_glm45_provider.py (+10 -1)
- tests/functional_tests/models/test_nemotron_h_conversion.py (+3 -14)
- tests/functional_tests/models/test_qwen3_next_conversion.py (-281)
- tests/functional_tests/models/test_qwen3_next_provider.py (-51)
- tests/functional_tests/recipes/test_llama_recipes_pretrain_1b.py (+11 -16)
- tests/functional_tests/recipes/test_mamba_recipes_pretrain.py (+22 -13)
- tests/functional_tests/recipes/test_qwen_recipes_pretrain.py (+7 -13)
- tests/functional_tests/recipes/test_qwen_vl_recipes_finetune.py (+4 -13)
- tests/functional_tests/recipes/utils.py (+3 -13)
- tests/functional_tests/training/test_inprocess_restart.py (+2 -3)
- tests/functional_tests/training/test_megatron_fsdp.py (+5 -5)
- tests/functional_tests/training/test_pretrain.py (+4 -5)
- tests/functional_tests/training/test_pretrain_resume.py (+3 -3)
- tests/functional_tests/training/test_sample_based_training.py (-1)
- tests/unit_tests/models/llama/test_llama_provider.py (+1)
- tests/unit_tests/models/nemotronh/test_nemotron_h_bridge.py (+17 -2)
- tests/unit_tests/models/nemotronh/test_nemotron_h_provider.py (+1 -1)
- tests/unit_tests/models/qwen/test_qwen3_next_bridge.py (-453)
- tests/unit_tests/models/test_utils_auto_map.py (-53)
- tests/unit_tests/recipes/nemotronh/test_nemotronh.py (+5 -10)
- tests/unit_tests/training/test_config.py (-146)
- tests/unit_tests/training/test_initialize.py (-311)
- tests/unit_tests/training/test_tokenizer.py (+1 -250)
- tests/unit_tests/training/utils/test_train_utils.py (-68)
- tests/unit_tests/utils/test_fusions.py (+391 -41)
- uv.lock (-2)