Conversation
cc7c289 to
793393a
Compare
did you observe this? looks like stopping logics has some problems? |
|
commit: eval command: result: commit : |
I can reproduce, its likely due to something related to inconsistent AR state across ranks: kernel dump on rank 0: kernel dump on rank 2: python stack trace when hang (same across ranks) |
Keep aligning with the main to track the compatibility. From #8490
all
ACC_LEN=3.02