Enable image output for Tool.from_space #1510

aymeric-roucher · 2025-07-02T14:03:27Z

gradio_client.Client.predict() returns path to generated images/audio files instead of the real image/audio object: this PR fixes it. It also adds testing to make sure that agents do return image objects when they call final_answer on an image.

aymeric-roucher · 2025-07-02T14:08:21Z

Requesting review from @A-Mahla since we have been discussing how to integrate different tools!

(failing wikipedia tests are unrelated)

A-Mahla · 2025-07-07T10:22:54Z

src/smolagents/tools.py

-                    return output[
+                    if isinstance(output[1], str):
+                        raise ValueError("The space returned this message: " + output[1])
+                    output = output[
                        0
                    ]  # Sometime the space also returns the generation seed, in which case the result is at index 0
+                IMAGE_EXTENTIONS = [".png", ".jpg", ".jpeg", ".gif", ".webp"]
+                AUDIO_EXTENTIONS = [".mp3", ".wav", ".ogg", ".m4a", ".flac"]
+                if isinstance(output, str) and any([output.endswith(ext) for ext in IMAGE_EXTENTIONS]):
+                    output = AgentImage(output)
+                elif isinstance(output, str) and any([output.endswith(ext) for ext in AUDIO_EXTENTIONS]):
+                    output = AgentAudio(output)
                return output


It’s not clear what type output is exactly. AgentImage | AgentAudio | str ?

aymeric-roucher added 2 commits July 2, 2025 16:02

Enable image output for Tool.from_space

0f62daf

Format

08d94d4

aymeric-roucher requested a review from A-Mahla July 2, 2025 14:07

aymeric-roucher added 3 commits July 2, 2025 16:31

Remove print statements and format

1ceb44b

Fix handle_agent_output_types

cf3f76d

Format

11e69d1

A-Mahla approved these changes Jul 7, 2025

View reviewed changes

aymeric-roucher merged commit d832f6d into main Jul 8, 2025
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Enable image output for Tool.from_space #1510

Enable image output for Tool.from_space #1510

Uh oh!

aymeric-roucher commented Jul 2, 2025 •

edited

Loading

Uh oh!

aymeric-roucher commented Jul 2, 2025 •

edited

Loading

Uh oh!

A-Mahla Jul 7, 2025

Uh oh!

Uh oh!

Uh oh!

Enable image output for Tool.from_space #1510

Enable image output for Tool.from_space #1510

Uh oh!

Conversation

aymeric-roucher commented Jul 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aymeric-roucher commented Jul 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

A-Mahla Jul 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

aymeric-roucher commented Jul 2, 2025 •

edited

Loading

aymeric-roucher commented Jul 2, 2025 •

edited

Loading