Change code tags to xml #1442

aymeric-roucher · 2025-06-15T15:30:34Z

Change code tags to XML to align with the widespread usage of XML tags within chat template for most LLMs.

For instance gemma-3 chat template uses XML formatting: check it out here. Many other models also use XML: Qwen for instance.

Testing XML tags on models like GPT-4o, Claude-4-Sonnet, Llama-3.1 8B & 70B, or Mistral 7B, didn't show any performance degradation.

HuggingFaceDocBuilderDev · 2025-06-17T00:44:19Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

docs/source/en/conceptual_guides/intro_agents.mdx

docs/source/en/tutorials/building_good_agents.mdx

albertvillanova · 2025-06-17T04:57:30Z

src/smolagents/prompts/code_agent.yaml

@@ -205,7 +198,7 @@ planning:
    Then for the given task, develop a step-by-step high-level plan taking into account the above inputs and list of facts.
    This plan should involve individual tasks based on the available tools, that if executed correctly will yield the correct answer.
    Do not skip steps, do not add any superfluous steps. Only write the high-level plan, DO NOT DETAIL INDIVIDUAL TOOL CALLS.
-    After writing the final step of the plan, write the '\n<end_plan>' tag and stop there.
+    After writing the final step of the plan, write the '<end_plan>' tag and stop there.


Why not aligning this tag as well, and use </plan> instead of <end_plan>?

Here since there's no opening tag , maybe alone would be strange?
The idea of <end_plan> is to cut generation and prevent the model from doing anything else after generating its plan.

albertvillanova · 2025-06-17T05:06:47Z

src/smolagents/prompts/code_agent.yaml

@@ -180,6 +168,11 @@ system_prompt: |-
  9. The state persists between code executions: so if in one step you've created variables or imported modules, these will all persist.
  10. Don't give up! You're in charge of solving the task, not providing directions to solve it.

+  {%- if custom_instructions %}
+  Here are custom instructions:


Here are custom instructions:

Does this sentence add value to the prompt? Does the LLM treat differently "custom instructions" versus "regular"(?) instructions?

Good point, I've removed this sentence since it adds little value.

aymeric-roucher · 2025-06-17T20:32:35Z

tests/test_gradio_ui.py

-            assert "```python" in tool_message.content
-        else:
-            assert expected in tool_message.content
+        assert expected in tool_message.content


Removing the if/else here since I didn't see why it was needed!

albertvillanova

Thanks.

docs/source/en/tutorials/building_good_agents.mdx

Co-authored-by: Albert Villanova del Moral <8515462+albertvillanova@users.noreply.github.com>

aymeric-roucher added 4 commits June 15, 2025 17:30

Change code tags to xml

f043da9

Fix

f7c11e5

Add custom instructions for structured code agent

66dd4f7

Improve prompts and fix parsing issue

31517d2

Fix tests

411de70

albertvillanova reviewed Jun 17, 2025

View reviewed changes

docs/source/en/conceptual_guides/intro_agents.mdx Outdated Show resolved Hide resolved

albertvillanova reviewed Jun 17, 2025

View reviewed changes

docs/source/en/tutorials/building_good_agents.mdx Outdated Show resolved Hide resolved

albertvillanova reviewed Jun 17, 2025

View reviewed changes

aymeric-roucher added 4 commits June 17, 2025 17:34

Remove 'Here are custom instructions'

cf90446

Reword prompts

72b91cb

Fix tests

10f4fdb

Fix test

935885d

aymeric-roucher mentioned this pull request Jun 17, 2025

Fix and refactor final answer checks #1448

Merged

aymeric-roucher commented Jun 17, 2025

View reviewed changes

albertvillanova approved these changes Jun 18, 2025

View reviewed changes

docs/source/en/tutorials/building_good_agents.mdx Outdated Show resolved Hide resolved

Update docs/source/en/tutorials/building_good_agents.mdx

1b973a0

Co-authored-by: Albert Villanova del Moral <8515462+albertvillanova@users.noreply.github.com>

aymeric-roucher merged commit e0cc2cc into main Jun 18, 2025
5 checks passed

albertvillanova mentioned this pull request Jun 27, 2025

WIP: Fence code with ```python instead of <code> #1491

Closed

aymeric-roucher mentioned this pull request Jun 27, 2025

Allow markdown or custom formatting for code blocks #1493

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Change code tags to xml #1442

Change code tags to xml #1442

Uh oh!

aymeric-roucher commented Jun 15, 2025 •

edited by albertvillanova

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Jun 17, 2025

Uh oh!

Uh oh!

Uh oh!

albertvillanova Jun 17, 2025

Uh oh!

aymeric-roucher Jun 17, 2025

Uh oh!

albertvillanova Jun 17, 2025

Uh oh!

aymeric-roucher Jun 17, 2025

Uh oh!

aymeric-roucher Jun 17, 2025

Uh oh!

albertvillanova left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Change code tags to xml #1442

Change code tags to xml #1442

Uh oh!

Conversation

aymeric-roucher commented Jun 15, 2025 • edited by albertvillanova Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Jun 17, 2025

Uh oh!

Uh oh!

Uh oh!

albertvillanova Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

aymeric-roucher Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

albertvillanova Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

aymeric-roucher Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

aymeric-roucher Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

albertvillanova left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

aymeric-roucher commented Jun 15, 2025 •

edited by albertvillanova

Loading