llvm: Fix alloca alignment and type selection in AllocOpt by gbaraldi · Pull Request #60699 · JuliaLang/julia

gbaraldi · 2026-01-15T12:49:05Z

Inherit alignment from the original GC allocation with JL_SMALL_BYTE_ALIGNMENT
as the minimum. Use alignment-sized integer chunks for the alloca type
(matching emit_static_alloca) so SROA splits allocations into aligned pieces
for better performance and vectorization.

Also adds the missing setAlignment call in splitOnStack.

Co-Authored-By: Claude Opus 4.5 noreply@anthropic.com

Inherit alignment from the original GC allocation with JL_SMALL_BYTE_ALIGNMENT as the minimum. Use alignment-sized integer chunks for the alloca type (matching emit_static_alloca) so SROA splits allocations into aligned pieces for better performance and vectorization. Also adds the missing setAlignment call in splitOnStack. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

vtjnash · 2026-01-15T18:14:24Z

src/llvm-alloc-opt.cpp

-    if (sz > 1)
-        align = MinAlign(JL_SMALL_BYTE_ALIGNMENT, NextPowerOf2(sz));
+    // Inherit alignment from the original allocation, with GC alignment as minimum.
+    Align align(std::max((unsigned)orig_inst->getRetAlign().valueOrOne().value(), (unsigned)JL_SMALL_BYTE_ALIGNMENT));


This feels loosely unsound, since this is the minimum known alignment, and not the required alignment. The JL_SMALL_BYTE_ALIGNMENT value is the largest value that julia.gc_alloc_obj is permitted to return, so it is sometimes reasonable that we can use this as a hint, but we should be sure to clarify that this overalignment is merely a hint to the layout (although being more than 16 will penalize performance since it requires a more expensive stack adjustment on entry)

(unsigned)orig_inst->getRetAlign().valueOrOne().value()

But if the allocation required a larger alignment wouldn't we inherit that, that's why we prefer to inherit and just baseline to gc align

getRetAlign is not the required alignment, it is the minimum, so if it requires it, this would introduce a bug here

but that said, the function isn't capable of giving more than JL_SMALL_BYTE_ALIGNMENT (16) so having getRetAlign return more than 16 here would be a miscompile, so this always gives the correct answer anyways (and increasing from there is only a runtime performance penalty, not a correctness issue)

It’s the alignment we emitted no? If it required more it would already be a bug no? Unless you mean a gc alloc aligned that wouldn’t tell LLVM

vchuravy · 2026-01-15T19:13:29Z

test/llvmpasses/alloc-opt-gcframe.ll


 ; CHECK-LABEL: @ccall_ptr
-; CHECK: alloca i64
+; CHECK: alloca i128, align 16


cc: @maleadt

This might cause issues for the intel backend? If I recall correctly, they don't like i128?

Pretty sure Our Int128 emits i128.

So this i128 should be storage only most of the time, I just followed whatever we do for base Julia

Some backends don't support integer types larger than 64 bits, so cap the element size used in emit_static_alloca and AllocOpt at i64. For allocations larger than 8 bytes, use arrays of i64 instead of i128/i256. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Inherit alignment from the original GC allocation with JL_SMALL_BYTE_ALIGNMENT as the minimum. Use alignment-sized integer chunks for the alloca type (matching emit_static_alloca) so SROA splits allocations into aligned pieces for better performance and vectorization. Also adds the missing setAlignment call in splitOnStack. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com> (cherry picked from commit 54fde7e)

oscardssmith added the compiler:codegen Generation of LLVM IR and native code label Jan 15, 2026

gbaraldi requested review from vchuravy and vtjnash January 15, 2026 14:27

vtjnash reviewed Jan 15, 2026

View reviewed changes

vtjnash approved these changes Jan 15, 2026

View reviewed changes

vchuravy reviewed Jan 15, 2026

View reviewed changes

DilumAluthge mentioned this pull request Jan 15, 2026

Backports for 1.12.5 #60612

Merged

40 tasks

gbaraldi mentioned this pull request Jan 16, 2026

Restore alloca type EnzymeAD/Enzyme.jl#2902

Merged

Merge branch 'master' into gb/alloca-align

9880647

gbaraldi added the merge me PR is reviewed. Merge when all tests are passing label Jan 20, 2026

oscardssmith merged commit 54fde7e into master Jan 21, 2026
9 checks passed

oscardssmith deleted the gb/alloca-align branch January 21, 2026 05:44

oscardssmith removed the merge me PR is reviewed. Merge when all tests are passing label Jan 21, 2026

KristofferC mentioned this pull request Jan 26, 2026

Backports for 1.13.0-beta2 #60614

Merged

43 tasks

KristofferC removed backport 1.12 Change should be backported to release-1.12 backport 1.13 Change should be backported to release-1.13 labels Feb 3, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

llvm: Fix alloca alignment and type selection in AllocOpt#60699

llvm: Fix alloca alignment and type selection in AllocOpt#60699
oscardssmith merged 3 commits intomasterfrom
gb/alloca-align

gbaraldi commented Jan 15, 2026

Uh oh!

vtjnash Jan 15, 2026

Uh oh!

gbaraldi Jan 15, 2026

Uh oh!

vtjnash Jan 15, 2026 •

edited

Loading

Uh oh!

gbaraldi Jan 15, 2026

Uh oh!

vchuravy Jan 15, 2026

Uh oh!

oscardssmith Jan 15, 2026

Uh oh!

gbaraldi Jan 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

gbaraldi commented Jan 15, 2026

Uh oh!

vtjnash Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

gbaraldi Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

vtjnash Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

gbaraldi Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

vchuravy Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

oscardssmith Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

gbaraldi Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

vtjnash Jan 15, 2026 •

edited

Loading