-
Notifications
You must be signed in to change notification settings - Fork 265
Closed
Labels
Description
System: Dell Wyse 5070 Intel Celeron J4105 (Gemini Lake)
Intel Compute Runtime 23.30.26918.9 installed with official instructions.
What works
- Ubuntu 20.04 kernel 5.4: Works out of the box
- Ubuntu 22.04 kernel 5.15: Doesn't work out of the box, but works with
i915.enable_hangcheck=0 i915.request_timeout_ms=100000, see Broadwell iGPU hangs running clpeak with 5.13+ kernels #497 - Without the above params, clpeak hangs with the following kernel logs:
[ 101.920695] Fence expiration time out i915-0000:00:02.0:clpeak[969]:454!
[ 101.920754] Fence expiration time out i915-0000:00:02.0:clpeak[969]:452!
[ 101.920760] Fence expiration time out i915-0000:00:02.0:clpeak[969]:450!
What doesn't work
- Ubuntu 22.04 kernel 6.2
- clpeak fails, and produces error clFinish (-5) with kernel logs (note that these logs were never seen with kernel 5.15):
[ 70.897984] i915 0000:00:02.0: [drm] Resetting rcs0 for preemption time out
[ 70.898106] i915 0000:00:02.0: [drm] clpeak[910] context reset due to GPU hang
[ 70.907884] i915 0000:00:02.0: [drm] GPU HANG: ecode 9:1:e757fefe, in clpeak [910]
...
[ 170.596540] i915 0000:00:02.0: [drm] Resetting rcs0 for CS error
[ 170.596659] i915 0000:00:02.0: [drm] clpeak[910] context reset due to GPU hang
[ 170.606125] i915 0000:00:02.0: [drm] GPU HANG: ecode 9:1:ac045407, in clpeak [910]
- Setting the params that fixed 5.15 (
i915.enable_hangcheck=0 i915.request_timeout_ms=100000) or increasing to 200000 doesn't appear to help. - Vulkan compute (vkpeak) works fine. So this appears to be a problem specific to Intel Compute Runtime.
Reactions are currently unavailable