Releases: JuliaGPU/GPUCompiler.jl
Releases · JuliaGPU/GPUCompiler.jl
v1.9.1
GPUCompiler v1.9.1
v1.9.0
GPUCompiler v1.9.0
Merged pull requests:
- Enable Ahead-of-Time Compilation by hiding the runtime functions in the
GLOBAL_METHOD_TABLE(#749) (@apozharski) - [NFC] Typos (#763) (@christiangnrd)
- Switch to FileCheck.jl. (#764) (@maleadt)
- [Metal] Add attributes for some simd permute intrinsics (#766) (@simsurace)
- Fix some tests (#767) (@christiangnrd)
- Move to ParallelTestRunner v2 (#769) (@christiangnrd)
- Fixup test filtering (#770) (@christiangnrd)
- Fix nightly (#771) (@christiangnrd)
- GCN: use byref instead of byval+lower_byval for kernel arguments (#772) (@gbaraldi)
- Default to the job's optimize parameter in
code_llvm(#773) (@christiangnrd) - Remove deprecation warning in native test (#774) (@christiangnrd)
- Adjust broken test predicate so 1.13 test pass (#775) (@christiangnrd)
- Derive the deferred world from the parent one. (#776) (@maleadt)
Closed issues:
- Dwarf warnings with Julia 1.13 (#765)
v1.8.2
GPUCompiler v1.8.2
Merged pull requests:
- Revert "Allow semi-concrete interpretation" (#760) (@christiangnrd)
Closed issues:
- Metal.jl
sincosintrinsic produces invalid LLVM IR on Julia 1.11+ (GPUCompiler 1.7.6+) (#761)
v1.8.1
GPUCompiler v1.8.1
Merged pull requests:
v1.8.0
v1.7.6
GPUCompiler v1.7.6
Merged pull requests:
- [Metal] Add some simdgroup and barrier intrinsic attributes (#744) (@christiangnrd)
- Bump actions/checkout from 5 to 6 (#747) (@dependabot[bot])
- Allow semi-concrete interpretation (#750) (@maleadt)
- Enable users to map from GV to Julia value and initialize them whenever possible (#752) (@wsmoses)
- Fix codeinstance mapping when using nested interpretation (#755) (@wsmoses)
- Fixes for 1.13 and 1.14 (#757) (@maleadt)
Closed issues:
- Re-enable semi-concrete evaluation (#382)
v1.7.5
v1.7.4
GPUCompiler v1.7.4
Merged pull requests:
- [CI] set 10 minute timeout for GH actions jobs (#739) (@simeonschaub)
- fix
GPUInterpreter(interp; ...)constructor (#740) (@simeonschaub) - use
CC.MAX_INLINE_COSTinstead oftypemax(Int)(#741) (@simeonschaub)
v1.7.3
GPUCompiler v1.7.3
Merged pull requests:
- Add option for backends to run vectorization passes (#716) (@gbaraldi)
- Update Test workflows for 1.12 (#729) (@kshyatt)
- Enable Metal testing on 1.12. (#730) (@maleadt)
- Use ParallelTestRunner.jl (#731) (@vchuravy)
- Bump peter-evans/find-comment from 3 to 4 (#732) (@dependabot[bot])
- Bump peter-evans/create-or-update-comment from 4 to 5 (#733) (@dependabot[bot])
- [SPIRV] convert i128 allocas to <2 x i64> (#734) (@simeonschaub)