-
Notifications
You must be signed in to change notification settings - Fork 6
Description
We observed some specific problems when going from CPUSummary.jl v0.1.8 to v0.1.14 at Trixi.jl. Everything is fine with the old version of CPUSummary.jl. CI also passes with the new version unless the GitHub CI runner happens to use LLVM: libLLVM-12.0.1 (ORCJIT, skylake-avx512) (either ubuntu-latest or windows-latest).
I could reduce this problem at https://github.com/trixi-framework/TrixiDebug.jl. Using the latest version of CPUSummary.jl, CI fails on
ubuntu-latest(e.g., https://github.com/trixi-framework/TrixiDebug.jl/runs/5492313195?check_suite_focus=true#step:6:357)windows-latest(e.g., https://github.com/trixi-framework/TrixiDebug.jl/runs/5492410761?check_suite_focus=true#step:6:356)
Restricting CPUSummary.jl to v0.1.8 let's CI pass on
ubuntu-latestwithLLVM: libLLVM-12.0.1 (ORCJIT, skylake-avx512)(https://github.com/trixi-framework/TrixiDebug.jl/runs/5493268766?check_suite_focus=true#step:6:358)windows-latestwithLLVM: libLLVM-12.0.1 (ORCJIT, skylake-avx512)(https://github.com/trixi-framework/TrixiDebug.jl/runs/5493268841?check_suite_focus=true#step:6:357)
So far, we have not been able to reproduce this locally...
For context: We use some matrix multiplications based on matmul! from Octavian.jl. To me, it seems like these multiplications fail catastrophically, resulting in the errors shown in CI.
CC @sloede