Closed
Labels
type/bug: An issue about a bug
Description
🐛 Describe the bug
There is a significant discrepancy in the initial loss between olmo v0.3.0 and v0.4.0, both with and without loading the step-738020 checkpoint. This suggests a potential issue with model initialization or checkpoint handling in version 0.4.0. I believe the following results are reproducible; this bug has cost me a week.
Task:
- Training from scratch / fine-tuning on BIoMed
Results
- olmo v0.4.0 : w/ step-738020 ckpt -- initial loss is 71
- olmo v0.4.0 : w/o step-738020 ckpt -- initial loss is 32
- olmo v0.3.0 : w/ step-738020 ckpt -- initial loss is 2
- olmo v0.3.0 : w/o step-738020 ckpt -- initial loss is 11
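For reference, a rough sanity check on these numbers: a model that predicts uniformly over the vocabulary starts at a cross-entropy of ln(V), where V is the vocabulary size. The sketch below assumes V is about 50k tokens (the exact value 50280 is an assumption, not something measured here; substitute the tokenizer's actual size), and the function name is hypothetical.

```python
import math


def uniform_baseline_loss(vocab_size: int) -> float:
    """Cross-entropy (in nats) of a uniform prediction over vocab_size tokens."""
    if vocab_size <= 0:
        raise ValueError("vocab_size must be positive")
    return math.log(vocab_size)


# ASSUMPTION: vocabulary size of about 50k tokens; replace with the real value.
assumed_vocab_size = 50280
print(f"uniform baseline loss: {uniform_baseline_loss(assumed_vocab_size):.2f}")  # roughly 10.8
```

Under that assumption, an initial loss of about 11 without a checkpoint is close to the uniform baseline, while 32 and 71 are well above it, which would point to something other than an ordinary random initialization.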
Versions
Built from source:
- olmo v0.4.0
- olmo v0.3.0
