
Mention that InferenceClient also works with local models + other tweaks #1322


Merged: 3 commits merged into main on May 15, 2025

Conversation

julien-c (Member)


Other tweaks:

  • Mention that the default provider for InferenceClient is now "auto" (i.e. the user's favorite provider that's available for the model)
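The "auto" behavior described above can be sketched in a few lines. This is a hypothetical illustration of the selection rule (pick the first provider in the user's preference order that actually serves the requested model), not the huggingface_hub implementation; all function and variable names here are made up for the example.

```python
def resolve_provider(provider, model, user_order, availability):
    """Return a concrete provider name for a request.

    provider     -- "auto" or an explicit provider name
    model        -- a model id, e.g. "meta-llama/Llama-3.1-8B-Instruct"
    user_order   -- provider names sorted by the user's preference
                    (as configured on https://hf.co/settings/inference-providers)
    availability -- mapping: provider name -> set of model ids it serves
    """
    if provider != "auto":
        # An explicit provider always wins over automatic selection.
        return provider
    for candidate in user_order:
        if model in availability.get(candidate, set()):
            return candidate
    raise ValueError(f"No provider serves {model!r}")
```

For example, with `user_order = ["together", "hf-inference", "novita"]`, a model served only by "hf-inference" and "novita" resolves to "hf-inference", the first preferred provider that has it.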

Co-authored-by: célina <hanouticelina@gmail.com>
@julien-c (Member Author)

will wait for a review from a maintainer @albertvillanova @aymeric-roucher given this PR bumps a dependency version

@albertvillanova (Member) left a comment

Thank you!

Nice improvement in the documentation about model and base_url parameters.

However, if I understand correctly, the bump in the minimum version of huggingface-hub is intended to ensure that the default value of the provider parameter is "auto" (instead of the previous default, "hf-inference"). But this is a breaking change.

Would it make sense to initiate a deprecation cycle instead? That way, we could warn users about the upcoming change and give them time to adapt, minimizing disruption for those with currently working code in production.

@Wauplin (Contributor) commented May 14, 2025

AFAIK the breaking change has already been introduced (it's true that there was no prior notice). Bumping the minimum version ensures that all users get the same behavior, which is currently not the case: it depends on which huggingface_hub version they install.

Regarding why there was no prior notice: we decided to default to "auto" instead of "hf-inference" because we thought it was a minimal change for most users. Nothing breaks in the wild; the data is just processed by a different provider. It was also quite needed after the recent rework of the HF Inference API, which reduced the number of models served by HF (no cold-start models anymore).

@julien-c (Member Author)

if anything it's a less-breaking change, i.e. it makes things work out of the box in more cases

Co-authored-by: Merve Noyan <merveenoyan@gmail.com>
@julien-c julien-c requested a review from albertvillanova May 14, 2025 17:04
@albertvillanova (Member) left a comment

Thanks for your feedback. I understand your point.

My concern wasn't so much about all users having identical behavior, but rather about avoiding a poor experience for even a single user.

The use case I had in mind (something similar happened to me) is: a user, already using smolagents in production, updates it to a new minor version to benefit from recent fixes or enhancements. After the update, their code suddenly breaks with an error due to a new, automatically assigned provider:

Error: Provider 'featherless-ai' not supported. Available values: 'auto' or any provider from ['black-forest-labs', 'cerebras', 'cohere', 'fal-ai', 'fireworks-ai', 'hf-inference', 'hyperbolic', 'nebius', 'novita', 'openai', 'replicate', 'sambanova', 'together']. Passing 'auto' (default value) will automatically select the first provider available for the model, sorted by the user's order in https://hf.co/settings/inference-providers.

I just wanted to ensure that users who trust the project enough to pin only minor versions don't suddenly hit unexpected errors that require them to dig into internal changes.

That said, if this potential disruption is acceptable from your point of view, I'm OK with merging as is.

Maybe it could also help to update the PR title to mention the change in default InferenceClientModel provider: that might help others quickly spot the root cause if they run into related issues.

  • I will also add a comment about this to the Release notes to make it more visible to users.
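As an aside, a downstream project worried about the pinned-minor-version scenario described above could gate its behavior on the installed huggingface_hub version. The sketch below is illustrative only: the changeover version is a hypothetical parameter of the example, not the exact release where the default actually changed, and the helper names are invented.

```python
def parse_version(v):
    """Parse 'X.Y.Z' into a comparable tuple of ints.

    Non-digit characters within each segment are stripped, which is a rough
    way of tolerating pre-release suffixes; a real project would use
    packaging.version instead.
    """
    parts = []
    for piece in v.split("."):
        digits = "".join(ch for ch in piece if ch.isdigit())
        parts.append(int(digits) if digits else 0)
    return tuple(parts)

def default_provider_for(hub_version, changeover="0.31.0"):
    """Return the default provider a given huggingface_hub version would use.

    `changeover` is a hypothetical boundary for this sketch: versions at or
    above it default to "auto", older ones to "hf-inference".
    """
    if parse_version(hub_version) >= parse_version(changeover):
        return "auto"
    return "hf-inference"
```

A project could use such a check at import time to emit a warning (or pass an explicit provider) when the default is about to differ from what its users previously relied on.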

@julien-c merged commit f11a04e into main on May 15, 2025. 2 of 4 checks passed.

5 participants