Fix sret alloca alignment to match callee's preferred type alignment by maleadt · Pull Request #61192 · JuliaLang/julia

maleadt · 2026-02-27T14:37:42Z

The caller's sret alloca used julia_alignment (union_align) which can be smaller than the LLVM preferred type alignment that the callee uses for its loads/stores. For example, a struct of floats gets julia_alignment=4 but the callee uses DL.getPrefTypeAlign()=8, generating 8-byte-aligned memcpy operations. On strict-alignment targets (NVPTX), the resulting misaligned access causes CUDA_ERROR_MISALIGNED_ADDRESS.

Fix by computing the sret type's preferred alignment from the callee's StructRet attribute and taking the max with union_align, matching the alignment the callee computes for its sret parameter.

Fixes JuliaGPU/CUDA.jl#3034
Regression introduced in 1.12 by #55730

src/codegen.cpp

gbaraldi · 2026-03-02T18:13:20Z

LGTM

src/ccall.cpp

The sret parameter's alignment attribute was set to LLVM's preferred type alignment (getPrefTypeAlign), which can exceed julia_alignment. This caused misaligned memory accesses on strict-alignment targets like NVPTX, since the caller's alloca uses julia_alignment. Fix by setting the sret alignment to julia_alignment and not overriding it in the function definition, so that caller and callee agree on the same alignment. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

topolarity

Thanks @maleadt !

The sret parameter's alignment attribute was set to LLVM's preferred type alignment (getPrefTypeAlign), which can exceed julia_alignment. This caused misaligned memory accesses on strict-alignment targets like NVPTX, since the caller's alloca uses julia_alignment. Fix by setting the sret alignment to julia_alignment and not overriding it in the function definition, so that caller and callee agree on the same alignment. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

maleadt requested a review from vtjnash February 27, 2026 14:37

maleadt added compiler:codegen Generation of LLVM IR and native code gpu Affects running Julia on a GPU backport 1.12 Change should be backported to release-1.12 backport 1.13 Change should be backported to release-1.13 labels Feb 27, 2026

vtjnash reviewed Feb 27, 2026

View reviewed changes

src/codegen.cpp Outdated Show resolved Hide resolved

maleadt force-pushed the tb/sret_align branch 2 times, most recently from a908e3c to 2464148 Compare March 2, 2026 14:43

gbaraldi added the merge me PR is reviewed. Merge when all tests are passing label Mar 2, 2026

maleadt removed the merge me PR is reviewed. Merge when all tests are passing label Mar 2, 2026

maleadt marked this pull request as draft March 3, 2026 08:38

maleadt force-pushed the tb/sret_align branch from 2464148 to 6ade58e Compare March 3, 2026 09:07

maleadt marked this pull request as ready for review March 3, 2026 09:12

maleadt requested a review from vtjnash March 3, 2026 09:13

KristofferC mentioned this pull request Mar 3, 2026

Backports for 1.13.0-beta3 #60920

Merged

56 tasks

topolarity reviewed Mar 3, 2026

View reviewed changes

src/ccall.cpp Outdated Show resolved Hide resolved

maleadt force-pushed the tb/sret_align branch from 6ade58e to 6d0fa69 Compare March 3, 2026 19:41

maleadt requested a review from topolarity March 3, 2026 19:41

topolarity reviewed Mar 3, 2026

View reviewed changes

src/ccall.cpp Outdated Show resolved Hide resolved

maleadt force-pushed the tb/sret_align branch from 6d0fa69 to 868656f Compare March 3, 2026 19:59

topolarity approved these changes Mar 3, 2026

View reviewed changes

topolarity added the merge me PR is reviewed. Merge when all tests are passing label Mar 3, 2026

vtjnash approved these changes Mar 3, 2026

View reviewed changes

maleadt merged commit f519f3e into master Mar 4, 2026
8 of 9 checks passed

maleadt deleted the tb/sret_align branch March 4, 2026 07:05

DilumAluthge removed the merge me PR is reviewed. Merge when all tests are passing label Mar 4, 2026

maleadt removed the backport 1.13 Change should be backported to release-1.13 label Mar 6, 2026

maleadt mentioned this pull request Mar 6, 2026

Backports for 1.12.6 #61154

Merged

37 tasks

KristofferC removed the backport 1.12 Change should be backported to release-1.12 label Mar 25, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix sret alloca alignment to match callee's preferred type alignment#61192

Fix sret alloca alignment to match callee's preferred type alignment#61192
maleadt merged 1 commit intomasterfrom
tb/sret_align

maleadt commented Feb 27, 2026

Uh oh!

Uh oh!

gbaraldi commented Mar 2, 2026

Uh oh!

Uh oh!

Uh oh!

topolarity left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Uh oh!

Conversation

maleadt commented Feb 27, 2026

Uh oh!

Uh oh!

gbaraldi commented Mar 2, 2026

Uh oh!

Uh oh!

Uh oh!

topolarity left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants