Skip to content

Conversation

@tomaarsen
Copy link
Member

Hello!

Pull Request overview

Details

The train method for the distillation trainer is very similar to the train method on the regular trainer. As a result, it also contained the bug from #280 and the corresponding fix could be copied from there without any issues. For additional details, see that PR or #268 (comment) which originally showcases the bug.

  • Tom Aarsen

@tomaarsen tomaarsen added the bug Something isn't working label Jan 23, 2023
@tomaarsen tomaarsen merged commit 0cb8ffd into huggingface:main Jan 23, 2023
@tomaarsen tomaarsen deleted the squared_steps_distillation branch January 23, 2023 12:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug Something isn't working

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant