Train via the Sentence Transformers Trainer from ST v3 #554

tomaarsen · 2024-09-17T14:26:01Z

Hello!

Pull Request overview

Replace the embedding finetuning training methods with the Sentence Transformers v3 Trainer

Details

In v1 of SetFit, I moved from the old model.fit training in Sentence Transformers (which lacked features such as loss logging) to a custom training loop that has them. Since then, Sentence Transformers v3 has been released, which added all of the features that were previously lacking. To simplify training going forward, it is now (once again) deferred to Sentence Transformers.

Because both the old and new training approaches are based on the transformers Trainer, few changes were needed. I've tried to prevent breaking changes, e.g. by updating the Sentence Transformers callbacks so that they still return a SetFitModel like before.
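
For context, here is a minimal sketch of the user-facing training flow that this change is meant to leave intact. It is not taken from the PR diff: the toy dataset, the checkpoint name, and the function name train_setfit_sketch are assumptions chosen for illustration, and running it requires setfit and datasets to be installed and a model download.

```python
def train_setfit_sketch():
    """Fine-tune a small SetFit model on toy data (illustrative only)."""
    # Imports are local so that defining this function does not require the libraries.
    from datasets import Dataset
    from setfit import SetFitModel, Trainer, TrainingArguments

    # Toy data invented for this sketch; "text" and "label" are SetFit's default column names.
    train_dataset = Dataset.from_dict(
        {
            "text": ["great movie", "loved it", "terrible plot", "waste of time"],
            "label": [1, 1, 0, 0],
        }
    )

    # Example checkpoint (an assumption); other Sentence Transformers models should also work.
    model = SetFitModel.from_pretrained("sentence-transformers/paraphrase-MiniLM-L3-v2")

    # SetFit's own TrainingArguments and Trainer remain the public entry points.
    args = TrainingArguments(batch_size=4, num_epochs=1)
    trainer = Trainer(model=model, args=args, train_dataset=train_dataset)
    trainer.train()
    return model
```

If the PR behaves as described, code like this should run the same way before and after the change, with the embedding finetuning phase handled by the Sentence Transformers Trainer internally.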

Tom Aarsen

HuggingFaceDocBuilderDev · 2024-09-17T14:54:12Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

tomaarsen added 7 commits September 17, 2024 09:59

Train via the Sentence Transformers Trainer from ST v3

8c73e71

Simplify some init code; docstring

a69a7c6

Prevent breaking changes by updating TrainerCallback

e6ba567

Replace ST Training Args with SetFit Training Args

d63236e

Remove unused properties

d40e702

Require 'accelerate' when training SetFit models

5a90879

Remove log in docs as it is no longer used

4d1f505

tomaarsen added 5 commits September 18, 2024 09:23

Fix docs issue

fd4ce1a

Require installing sentence-transformers[train]

5a952ec

Keep not having to override metric_for_best_model by default

0c6d0f0

It'll just keep using the loss of whatever trainer you're using.

Ensure logs directory is made in Callbacks example

6a0f3fe

Fix outdated docstring

a9bb3f5

tomaarsen merged commit fb91f67 into huggingface:main Sep 18, 2024

tomaarsen deleted the feat/use_st_trainer branch September 18, 2024 10:42
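
Since the commits above make 'accelerate' and the sentence-transformers[train] extra a requirement for training, a small pre-flight check can be useful. The sketch below is not part of this PR: the helper name missing_training_dependencies is hypothetical, and the default module list (accelerate, datasets, sentence_transformers) is an assumption about what the train extra provides.

```python
from importlib.util import find_spec


def missing_training_dependencies(
    modules=("accelerate", "datasets", "sentence_transformers"),
):
    """Return the top-level modules from `modules` that cannot be found.

    The default list is an assumption about what training needs; adjust as required.
    """
    # find_spec returns None for a top-level module that is not installed.
    return [name for name in modules if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_training_dependencies()
    if missing:
        print("Missing modules:", ", ".join(missing))
        print('Try: pip install "sentence-transformers[train]"')
    else:
        print("All assumed training dependencies were found.")
```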
