Migrate transformers cli to Typer #41487

Merged
LysandreJik merged 32 commits into main from switch-transformers-cli-to-typer
Oct 16, 2025

Conversation

@Wauplin
Contributor

@Wauplin Wauplin commented Oct 9, 2025

This PR migrates the transformers CLI to Typer.

Typer is a package built on top of click by the creator of FastAPI. It is already a dependency of huggingface_hub, which means it is also a dependency of transformers. Typer simplifies argument definition and enforces consistency through type annotations, which should help with maintenance. The benefit for users is the built-in autocompletion feature that lets someone type transformers chat [TAB][TAB] to see the available options. The --help output is also improved.

By migrating to Typer, a longer-term goal is to delegate some aspects of CLI installation and auto-update to huggingface_hub. That will come at a later stage and doesn't have to be tied to the v5 release.
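To illustrate the kind of argument definition Typer enables, here is a minimal sketch. The command names, options, and version string below are illustrative only, not the actual transformers implementation:

```python
# Minimal Typer sketch: commands are plain functions, and type annotations
# drive parsing, validation, --help generation, and shell completion.
# Names below are illustrative, not the real transformers CLI code.
import typer

app = typer.Typer(help="Transformers CLI (sketch)")


@app.command()
def download(model_id: str, revision: str = "main") -> None:
    """Download a model and its tokenizer from the Hub."""
    typer.echo(f"Downloading {model_id} at revision {revision}")


@app.command()
def version() -> None:
    """Print CLI version."""
    typer.echo("5.0.0 (sketch)")


if __name__ == "__main__":
    app()
```

With this layout, `--help` lists both commands with their docstrings, and the `--install-completion` / `--show-completion` options shown above come for free from Typer.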

CLI --help

transformers --help
Usage: transformers [OPTIONS] COMMAND [ARGS]...

  Transformers CLI

Options:
  --install-completion  Install completion for the current shell.
  --show-completion     Show completion for the current shell, to copy it or
                        customize the installation.
  --help                Show this message and exit.

Commands:
  add-fast-image-processor  Add a fast image processor to a model.
  add-new-model-like        Add a new model to the library, based on an...
  chat                      Chat with a model from the command line.
  download                  Download a model and its tokenizer from the Hub.
  env                       Print information about the environment.
  run                       Run a pipeline on a given input file.
  serve                     Run a FastAPI server to serve models...
  version                   Print CLI version.

Side notes

Noted down some stuff while working on it. Can be addressed in later PRs.

  1. Any command, even a simple transformers env, is currently very slow in both the previous and the new CLI. This is due to the torch import, whether or not torch is actually used. I do think this is poor UX, especially if we want something like transformers chat as the entrypoint for any OpenAI-compatible server.
    This is not really specific to the CLI, but to lazy imports in general. I narrowed it down to is_torch_available actually importing the package, not just checking for its existence.

  2. The current transformers serve + transformers chat twin commands are really nice: one starts a server, the other launches a chat interface. However, I feel the current UX for chat is too bloated, since it covers both the case where a server is already running AND starting a new server from a model id (or path). I do think transformers chat should only consume an existing API. It would make the whole implementation much cleaner and the interface leaner for the end user (currently there are 4-5 arguments just to provide the model name, path, address, port, and host, instead of a single "url" argument).
    Since this would be a breaking change, v5 is the perfect timing to address it.

=> EDIT: this is now done in this PR. transformers chat no longer serves a model (making it much simpler):

transformers chat https://router.huggingface.co/v1 HuggingFaceTB/SmolLM3-3B
  3. The transformers serve feature is currently only available as a CLI. I do believe it would be best to move it to its own module so that someone could call it programmatically (e.g. from transformers import serve in a notebook).
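The availability check mentioned in note 1 can be done without importing the package at all, using importlib.util.find_spec. This is a hedged sketch of the idea, not the actual transformers implementation (which caches results and handles more edge cases):

```python
# Sketch: check that torch is installed without importing (and paying for) it.
# find_spec only queries the import machinery's finders; it does not execute
# the package, so the check stays fast even for heavy dependencies.
import importlib.util


def is_torch_available() -> bool:
    """Return True if torch is importable, without actually importing it."""
    return importlib.util.find_spec("torch") is not None
```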

TODO

  • (minor) do not import typer_factory from private internal (requires an update in huggingface_hub first)
  • delete ./commands folder and remove transformers-legacy CLI (that I currently use for testing)
  • adapt remaining CLI tests
  • transformers chat UI-only (not serving)
  • do not use classes for chat and serve? + expose them as modules (e.g. from transformers import chat, serve)

@Wauplin Wauplin requested review from a team and LysandreJik October 9, 2025 16:28
@Wauplin Wauplin added the for_v5? label Oct 9, 2025
@Wauplin Wauplin marked this pull request as draft October 9, 2025 16:28
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@gante
Contributor

gante commented Oct 11, 2025

@Wauplin in general LGTM 👍

Regarding your notes:

  1. imports were reworked very recently in #41268 -- maybe it helped?
  2. imo makes sense to have chat being consume-only :)
  3. also makes sense to have serve as its own module (it would also simplify testing)

@LysandreJik LysandreJik self-assigned this Oct 13, 2025
@Wauplin
Contributor Author

Wauplin commented Oct 13, 2025

imports were reworked very recently #41268 -- maybe it helped?

Good to know! The changes in that PR look nice. I tried again to run a simple transformers version and we still import torch by default, at least in src/transformers/utils/generic.py (L45 and L369). I tried to hack it to import torch only when needed, but then I didn't know what to do with _torch_pytree. So torch is still not lazy-loaded, even though we are getting closer. I won't work on it in this PR (happy to help later).

imo makes sense to have chat being consume-only :)

Yay! Will work on that since I also got informal approval from @LysandreJik

also makes sense to have serve as its own module (it would also simplify testing)

Will check that again but might be for a future PR.

@Wauplin
Contributor Author

Wauplin commented Oct 13, 2025

@gante @LysandreJik should now be ready for review. The revamped transformers chat is now UI-only (it no longer includes the serving part). I tried to keep most of the logic intact, though I do think some parts were a bit broken/untested. The interface can be tested like this:

transformers chat https://router.huggingface.co/v1 HuggingFaceTB/SmolLM3-3B

@Wauplin Wauplin marked this pull request as ready for review October 13, 2025 16:07
@Wauplin
Contributor Author

Wauplin commented Oct 13, 2025

Note: I'm not 100% sure that the slow tests are running. @ydshieh would it be possible to trigger them please? (or anyone else with the permissions?)

@LysandreJik
Member

LysandreJik commented Oct 14, 2025

Reviewing it in batches:

  • The changes from the previous arg parsing to typer are very welcome
  • The chat changes are also coherent from an overview
  • Need to play with chat
  • Need to check the serve changes
  • Need to play with serve

@LysandreJik
Member

While I agree we can remove chat's support for launching models, I think we should raise helpful errors when doing so; for example, if I run the following command, which used to work:

transformers chat google/vaultgemma-1b

I get the following error:

Usage: transformers chat [OPTIONS] BASE_URL MODEL_ID [GENERATE_FLAGS]...
Try 'transformers chat --help' for help.

Error: Missing argument 'MODEL_ID'.

which isn't super clear: I'd tell the user that this path isn't supported anymore and to please launch a transformers serve session alongside it.

@LysandreJik
Member

For transformers serve x chat to work well we likely need to merge the following in this PR: #41446

It currently throws an error that is fixed by the above ^ I'll merge it into main shortly

@LysandreJik LysandreJik force-pushed the switch-transformers-cli-to-typer branch from d299b27 to 0756797 Compare October 15, 2025 09:39
tests + fixup
@LysandreJik LysandreJik force-pushed the switch-transformers-cli-to-typer branch from 0756797 to cee44ab Compare October 15, 2025 09:42
@Wauplin
Contributor Author

Wauplin commented Oct 15, 2025

While I agree we can remove chat's support for launching models, I think we should raise helpful errors when doing so; for example, if I run the following command, which used to work:

transformers chat google/vaultgemma-1b

I get the following error:

Usage: transformers chat [OPTIONS] BASE_URL MODEL_ID [GENERATE_FLAGS]...
Try 'transformers chat --help' for help.

Error: Missing argument 'MODEL_ID'.

which isn't super clear: I'd tell the user that this path isn't supported anymore and to please launch a transformers serve session alongside it.

@LysandreJik I addressed this comment in 05d9515. It's not entirely straightforward, but it works. I didn't want to simply make the argument optional and then raise an error when it's not passed, as that would have changed the --help section in a misleading way.

The error message is now:

> transformers chat google/vaultgemma-1b
Error: Missing argument 'MODEL_ID'.

Launching a server directly from the `transformers chat` command is no longer supported. Please use `transformers serve` to launch a server. Use --help for more information.
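One way to produce such a message with click (the layer beneath Typer) is to intercept MissingParameter while parsing, keeping the argument required so --help stays accurate. This is a hypothetical sketch of the approach; the actual fix in 05d9515 may differ:

```python
# Sketch: append migration guidance when MODEL_ID is missing, without making
# the argument optional (so --help still shows it as required).
# Illustrative only, not the actual transformers implementation.
import click

GUIDANCE = (
    "Launching a server directly from the `transformers chat` command is no "
    "longer supported. Please use `transformers serve` to launch a server. "
    "Use --help for more information."
)


class ChatCommand(click.Command):
    """Command subclass that enriches the 'Missing argument' error."""

    def parse_args(self, ctx, args):
        try:
            return super().parse_args(ctx, args)
        except click.MissingParameter as exc:
            if exc.param is not None and exc.param.name == "model_id":
                # Re-raise with the original message plus the guidance.
                raise click.UsageError(
                    exc.format_message() + "\n\n" + GUIDANCE, ctx=exc.ctx
                )
            raise


@click.command(cls=ChatCommand)
@click.argument("base_url")
@click.argument("model_id")
def chat(base_url: str, model_id: str) -> None:
    click.echo(f"Connecting to {base_url} with model {model_id}")
```

Running `chat google/vaultgemma-1b` then fills base_url with the model id, leaves model_id missing, and fails with both the standard "Missing argument 'MODEL_ID'." line and the migration guidance.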

@LysandreJik
Member

Awesome, thanks @Wauplin !

@LysandreJik LysandreJik merged commit af2a66c into main Oct 16, 2025
23 checks passed
@LysandreJik LysandreJik deleted the switch-transformers-cli-to-typer branch October 16, 2025 11:29
@Wauplin Wauplin mentioned this pull request Oct 16, 2025
ngazagna-qc pushed a commit to ngazagna-qc/transformers that referenced this pull request Oct 23, 2025
* Add typer-slim as explicit dependency

* Migrate CLI to Typer

* code quality

* bump release candidate

* adapt test_cli.py

* Remove ./commands + adapt tests

* fix quality

* consistency

* doctested

* do not serve model in chat

* style

* will it fix them?

* fix test

* capitalize classes

* Rebase

* Rebase

* tests + fixup

tests + fixup

* csutom error message

* fix ?

* should be good

* fix caplog globally

* inner caplog

* last attempt

* Retry

* Let's try with capsys disabled

---------

Co-authored-by: Lysandre <hi@lysand.re>
SangbumChoi pushed a commit to SangbumChoi/transformers that referenced this pull request Jan 23, 2026