Add TRITON_PROFILE_COMPILE knob for compilation time profiling by dshi7 · Pull Request #905 · facebookexperimental/triton

dshi7 · 2026-02-13T21:23:43Z

When enabled, prints per-stage compilation time breakdowns to stderr for each kernel compilation (ir_init, ttir, ttgir, llir, ptx, cubin, store), and benchmark timing summaries after autotuning completes. Default off.

Test plan:

TRITON_ALWAYS_COMPILE=1 TRITON_PROFILE_COMPILE=1 CUDA_VISIBLE_DEVICES=7 third_party/tlx/denoise.sh python third_party/tlx/tutorials/testing/test_blackwell_gemm_perf.py --version ws

LOG: https://www.internalfb.com/intern/paste/P2186293125/

Authored with Claude.

When enabled, prints per-stage compilation time breakdowns to stderr for each kernel compilation (ir_init, ttir, ttgir, llir, ptx, cubin, store) along with config info (block sizes, warps, stages), and benchmark timing summaries with compile vs bench breakdown after autotuning completes. Default off. Test plan: - Unit tests: `pytest python/test/unit/runtime/test_compilation_listener.py` - test_profile_compile: verifies stage breakdowns on cache miss, "cache hit" label on cache hit - test_profile_compile_off_by_default: verifies no output when knob is off - Perf test: `TRITON_PROFILE_COMPILE=1 CUDA_VISIBLE_DEVICES=<gpu> \ third_party/tlx/denoise.sh python \ third_party/tlx/tutorials/testing/test_blackwell_gemm_perf.py \ --version ws` Authored with Claude.

meta-codesync · 2026-02-13T21:31:30Z

@dshi7 has imported this pull request. If you are a Meta employee, you can view this in D93278081.

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Feb 13, 2026

dshi7 force-pushed the daohang/profile_compiling branch from 6bca499 to 68c8e8a Compare February 13, 2026 21:24

dshi7 force-pushed the daohang/profile_compiling branch from 68c8e8a to 85be855 Compare February 13, 2026 21:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add TRITON_PROFILE_COMPILE knob for compilation time profiling#905

Add TRITON_PROFILE_COMPILE knob for compilation time profiling#905
dshi7 wants to merge 1 commit intomainfrom
daohang/profile_compiling

dshi7 commented Feb 13, 2026 •

edited

Loading

Uh oh!

meta-codesync bot commented Feb 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

dshi7 commented Feb 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

meta-codesync bot commented Feb 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

dshi7 commented Feb 13, 2026 •

edited

Loading