add model_config support in TransformersModel #1168

Jonnathanz · 2025-04-10T07:00:16Z

This pull request adds support for the model_config parameter in the TransformersModel class. With this change, it's now possible to pass a dictionary containing specific configuration options for model loading (via AutoModelForCausalLM.from_pretrained or AutoModelForImageTextToText.from_pretrained), separating these settings from the kwargs used in the generate() method.

Highlights:

Quantization support: Enables the use of configurations such as quantization_config (e.g., for 4-bit quantization using BitsAndBytes), as well as other parameters like torch_dtype and device_map.

Flexible model initialization: Users can now customize model loading with a wide range of parameters without interfering with generation-specific arguments.

Clear separation of concerns: Model configuration is handled through the model_config dictionary, while generation parameters remain in **kwargs during the generate() call.

This update improves customization options during model initialization, making the framework more versatile and suitable for models requiring specific loading configurations.

Open to feedback — happy to refine the implementation as needed.

Example:
```python
>>> from transformers import BitsAndBytesConfig
>>> from smolagents import CodeAgent, TransformersModel

>>> model_id = "Qwen/Qwen2.5-Coder-32B-Instruct"

>>> bnb_config = BitsAndBytesConfig(
...     load_in_4bit=True,
...     bnb_4bit_compute_dtype="float16",
...     bnb_4bit_use_double_quant=True,
...     bnb_4bit_quant_type="nf4"
... )

>>> model = TransformersModel(
...     model_id,
...     device_map="auto",
...     torch_dtype="auto",
...     trust_remote_code=True,
...     model_config={'quantization_config': bnb_config},
...     max_new_tokens=2000
... )

>>> agent = CodeAgent(tools=[], model=model)

>>> result = agent.run("Explain quantum mechanics in simple terms.")
>>> print(result)
"Quantum mechanics is a branch of physics that studies the behavior of particles at the smallest scales, such as atoms and subatomic particles. Unlike classical physics, which..."

gabaric · 2025-07-24T09:18:25Z

Replace :
model_config={'quantization_config': bnb_config},
By :
kwargs=bnb_config,

add model_config support in TransformersModel

22e3518

Jonnathanz mentioned this pull request Apr 10, 2025

[Feature Request] Allow passing configuration parameters to model loading (from_pretrained) at TransformersModel #1167

Open

albertvillanova mentioned this pull request Jul 24, 2025

ENH: Support passing model_kwargs to TransformersModel #1608

Merged

albertvillanova closed this in #1608 Jul 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add model_config support in TransformersModel #1168

add model_config support in TransformersModel #1168

Jonnathanz commented Apr 10, 2025 •

edited

Loading

Uh oh!

gabaric commented Jul 24, 2025

Uh oh!

Uh oh!

add model_config support in TransformersModel #1168

add model_config support in TransformersModel #1168

Conversation

Jonnathanz commented Apr 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gabaric commented Jul 24, 2025

Uh oh!

Uh oh!

Jonnathanz commented Apr 10, 2025 •

edited

Loading