Working streaming Gradio chatbot outputs #1246

aymeric-roucher · 2025-04-24T21:30:49Z

This finalizes the streaming refactoring: makes step functions generators, and adds some intermediate yield to yield intermediate completions.

Also I've set all examples to stream outputs by default when the underlying Model class allows it. Progressively we should switch wherever we can to stream_ouputs because it's much more user-friendly than waiting until the end of message generation.

src/smolagents/agents.py

aymeric-roucher · 2025-04-25T08:09:04Z

src/smolagents/gradio_ui.py

+        yield gr.ChatMessage(role="assistant", content="**Planning step**", metadata={"status": "done"})
+        yield gr.ChatMessage(role="assistant", content=step_log.plan, metadata={"status": "done"})
+        yield gr.ChatMessage(
+            role="assistant", content=get_step_footnote_content(step_log, "Planning step"), metadata={"status": "done"}


Using metadata field to note when a step is done, will help us in the gradio app when receiving a completion delta to differentiate between "previous message is complete: start a new one" or "append this completion delta's text to the previous pending message".

Maybe there's a simpler way thought, WDYT?

Not clear to me how to use this param. However, it seems it requires a "title" key.

Filling the "title" key makes a title show up, which wouldn't look very good in our case.
This title field does not seem mandatory in practice, since the previous implementaiton of gr.ChatMessage in smolagents comes from @yvrjsharma and does not have title fields in every message.

So it appears to be a misalignment in the doc of this param.

aymeric-roucher · 2025-04-25T08:09:57Z

src/smolagents/gradio_ui.py

-            <a target="_blank" href="https://github.com/huggingface/smolagents"><b>huggingface/smolagents</b></a>
-            </div>""")
+                gr.HTML(
+                    "<br><br><h4><center>Powered by <a target='_blank' href='https://github.com/huggingface/smolagents'><b>smolagents</b></a></center></h4>"


Making the branding a bit less present, because for now it's too big. Maybe could be removed altogether.

src/smolagents/gradio_ui.py

albertvillanova · 2025-04-25T15:30:04Z

src/smolagents/gradio_ui.py

+        yield gr.ChatMessage(role="assistant", content="**Planning step**", metadata={"status": "done"})
+        yield gr.ChatMessage(role="assistant", content=step_log.plan, metadata={"status": "done"})
+        yield gr.ChatMessage(
+            role="assistant", content=get_step_footnote_content(step_log, "Planning step"), metadata={"status": "done"}


Not clear to me how to use this param. However, it seems it requires a "title" key.

albertvillanova · 2025-04-25T15:36:58Z

src/smolagents/agents.py

-        pass
+    def step(self, memory_step: ActionStep) -> Generator[Any]:
+        """To be implemented in children classes. Should yield either None if the step is not final, or the final answer."""
+        yield None


I think the conversion of step to generator is a breaking change.

albertvillanova

I solved the conflicts with he main branch and aligned the Generator type hint.

Also made a direct change in the PR instead of discussing it: feel free to revert.

aymeric-roucher · 2025-04-28T09:34:45Z

@yvrjsharma could you take a look at this comment? The way've I've modified the method pull_messages_from_step to detect pending streaming outputs it to put them gr.ChatMessages objects with "status" : "pending" in their metadata: however the metadata typed dict in gradio seems to need a title key and accept no other, is it legit to use status?

…ssue

aymeric-roucher · 2025-04-29T12:32:51Z

src/smolagents/agents.py

@@ -1007,10 +1035,10 @@ def initialize_system_prompt(self) -> str:
        )
        return system_prompt

-    def step(self, memory_step: ActionStep) -> None | Any:
+    def _step(self, memory_step: ActionStep) -> Generator[Any]:


@albertvillanova here's how I modified the step method.

albertvillanova

Thank you!

Just a comment: what about aligning the naming of the streaming versions of run and step? Currently:

_run_stream
_step

aymeric-roucher · 2025-04-30T14:15:40Z

@albertvillanova I went for _step_stream and _run_stream.

aymeric-roucher added 7 commits April 24, 2025 23:30

Working streaming Gradio chatbot outputs

a29d0ff

Simplify structure

9e0db6e

Gradio UI: make model output display depend on stream_outputs

276aff9

Use output streaming in Gradio UI example

32ec0bb

Stream model outputs in examples

b9a98e6

Merge branch 'main' into enable-streaming-outputs-gradio-ui

afba30f

Hopefully pas tests

95f4c4f

aymeric-roucher commented Apr 25, 2025

View reviewed changes

src/smolagents/agents.py Show resolved Hide resolved

aymeric-roucher commented Apr 25, 2025

View reviewed changes

aymeric-roucher marked this pull request as ready for review April 25, 2025 08:11

aymeric-roucher requested a review from albertvillanova April 25, 2025 08:12

albertvillanova added 4 commits April 25, 2025 17:03

Merge branch 'main' into enable-streaming-outputs-gradio-ui

fa0692b

Fix deprecated typing.Generator

97bbd70

Remove MultiStepAgent.stream_outputs

a8b2a9d

Fix accessing optional stream_outputs attribute

14f80c7

albertvillanova reviewed Apr 25, 2025

View reviewed changes

aymeric-roucher added 8 commits April 29, 2025 12:28

Make change of stepo function not breaking

0c0ddf2

Stream planning outputs

5b12e0a

Merge branch 'main' into enable-streaming-outputs-gradio-ui

c60e479

Add docstring on skip_model_outputs

a5f59d5

Remove image output callback from gradio UI to avoid context length i…

1229d5f

…ssue

Make MultiStepAgent instantiable by itself

cca59fa

Pass tests in agents.py

6c21e3c

Adjust test output length

aa23b38

aymeric-roucher commented Apr 29, 2025

View reviewed changes

albertvillanova approved these changes Apr 29, 2025

View reviewed changes

Harmonize naming for generator run and step functions

21637ee

Format

02212d9

aymeric-roucher merged commit d02f0cc into main Apr 30, 2025
4 checks passed

This was referenced May 9, 2025

Fix thought yield in GradioUI for streaming and non-streaming #1311

Merged

Fix duplicate plan display in GradioUI when streaming #1317

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Working streaming Gradio chatbot outputs #1246

Working streaming Gradio chatbot outputs #1246

Uh oh!

aymeric-roucher commented Apr 24, 2025 •

edited

Loading

Uh oh!

Uh oh!

aymeric-roucher Apr 25, 2025

Uh oh!

albertvillanova Apr 25, 2025

Uh oh!

aymeric-roucher Apr 29, 2025

Uh oh!

albertvillanova Apr 29, 2025

Uh oh!

aymeric-roucher Apr 25, 2025

Uh oh!

Uh oh!

albertvillanova Apr 25, 2025

Uh oh!

albertvillanova Apr 25, 2025

Uh oh!

albertvillanova left a comment •

edited

Loading

Uh oh!

aymeric-roucher commented Apr 28, 2025

Uh oh!

aymeric-roucher Apr 29, 2025

Uh oh!

albertvillanova left a comment

Uh oh!

aymeric-roucher commented Apr 30, 2025

Uh oh!

Uh oh!

Uh oh!

Working streaming Gradio chatbot outputs #1246

Working streaming Gradio chatbot outputs #1246

Uh oh!

Conversation

aymeric-roucher commented Apr 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

albertvillanova left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

aymeric-roucher commented Apr 28, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

albertvillanova left a comment

Choose a reason for hiding this comment

Uh oh!

aymeric-roucher commented Apr 30, 2025

Uh oh!

Uh oh!

Uh oh!

aymeric-roucher commented Apr 24, 2025 •

edited

Loading

albertvillanova left a comment •

edited

Loading