Skip to content

Conversation

whoisj
Copy link
Collaborator

@whoisj whoisj commented May 29, 2025

This change corrects the README.md file in the examples/multimodal folder:

  • Correct "vllm worker" to "decode worker"
  • Correct assertion that data is moved via NATS when embeddings are moved via RDMA.

Additionally, this change updates the textual graphs with Mermaid graphs for improved presentation on github.com.

DIS-100

Summary by CodeRabbit

  • Documentation
    • Updated the README for multimodal deployment examples with clarified worker names and improved descriptions of inter-worker communication.
    • Enhanced flow diagrams to reflect updated terminology and communication paths.
    • Made minor formatting improvements, including consistent capitalization and improved response examples.

@whoisj whoisj requested a review from indrajit96 as a code owner May 29, 2025 19:35
@whoisj whoisj added the documentation Improvements or additions to documentation label May 29, 2025
@whoisj whoisj requested a review from krishung5 as a code owner May 29, 2025 19:35
Copy link

copy-pr-bot bot commented May 29, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Copy link

👋 Hi whoisj! Thank you for contributing to ai-dynamo/dynamo.

Just a reminder: The NVIDIA Test Github Validation CI runs an essential subset of the testing framework to quickly catch errors.Your PR reviewers may elect to test the changes comprehensively before approving your changes.

🚀

@github-actions github-actions bot added the external-contribution Pull request is from an external contributor label May 29, 2025
@whoisj whoisj changed the title examples/multimodal: update README docs: Update Multimodal Example README May 29, 2025
@github-actions github-actions bot added the docs label May 29, 2025
Copy link
Contributor

coderabbitai bot commented May 29, 2025

Walkthrough

The README for multimodal deployment examples was updated to clarify worker naming, specifying "decode_worker" instead of "vllm_worker". Communication details were enhanced, outlining the use of NATS and RDMA for embedding transfers. Flow diagrams and formatting were revised for consistency and accuracy, with no changes to code or APIs.

Changes

File(s) Change Summary
examples/multimodal/README.md Updated worker names, clarified communication details, revised flow diagrams with Mermaid syntax, and improved formatting (e.g., HTTP capitalization, JSON response blocks).

Poem

In README fields where diagrams dwell,
"vllm" to "decode"—the story to tell.
NATS and RDMA now clearly in view,
With diagrams crisp and formatting new.
A rabbit hops by, documentation in tow,
Cheering for clarity as knowledge will grow!
🐇✨


🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>, please review it.
    • Explain this complex logic.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai explain this code block.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and explain its main purpose.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai generate sequence diagram to generate a sequence diagram of the changes in this PR.
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 7d0c938 and 17dd623.

📒 Files selected for processing (1)
  • examples/multimodal/README.md (4 hunks)
🧰 Additional context used
🪛 LanguageTool
examples/multimodal/README.md

[grammar] ~37-~37: The verb ‘encode’ does not usually follow articles like ‘the’. Check that ‘encode’ is spelled correctly; using ‘encode’ as a noun may be non-standard.
Context: ..../llm/README.md) example. By separating the encode from the prefill and decode stages, we ...

(A_INFINITIVE)


[grammar] ~38-~38: The usual collocation for “independently” is “of”, not “from”. Did you mean “independently of”?
Context: ... deployment and scale the encode worker independently from the prefill and decode workers if neede...

(INDEPENDENTLY_FROM_OF)

⏰ Context from checks skipped due to timeout of 90000ms (1)
  • GitHub Check: Build and Test - vllm
🔇 Additional comments (10)
examples/multimodal/README.md (10)

27-27: Corrected worker terminology for aggregated serving
The update replaces the ambiguous “vllm worker” label with explicit references to decode_worker, improving clarity.


29-29: Front-end bullet is clear and consistent
The frontend component is properly described as the HTTP endpoint. No issues found.


33-36: Updated deployment description for aggregated serving
The narrative clearly outlines the role of encode_worker and decode_worker, along with NATS + RDMA for data/embeddings transfer.


41-49: Mermaid diagram replaces ASCII flow
Switching to a Mermaid flowchart improves readability on GitHub. The syntax and arrow directions correctly reflect the described pipeline.


64-84: Formatted JSON request sample
The multi-line JSON payload is well-indented and uses proper quoting inside a single-quoted curl -d block.


96-98: Corrected worker list for disaggregated serving
The bullet list accurately enumerates encode_worker, decode_worker, and prefill_worker, and the frontend is still clearly defined.


102-106: Updated deployment narrative for disaggregated serving
The roles of the three workers and the data flow via NATS + RDMA are clearly articulated.


110-119: Mermaid diagram for disaggregated flow
The new Mermaid block correctly visualizes the extended pipeline (including prefill_worker).


131-154: Formatted JSON request sample (disaggregated)
Consistent with the aggregated example, the JSON payload is correctly formatted for curl.


158-158: Language tag for JSON response block
Changing the response example to a json code fence enables syntax highlighting, which is a nice documentation enhancement.

@whoisj whoisj removed the external-contribution Pull request is from an external contributor label May 29, 2025
@whoisj whoisj force-pushed the whoisj/mmdocupdate branch from 17dd623 to 5eb7d7e Compare May 29, 2025 19:49
Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🧹 Nitpick comments (1)
examples/multimodal/README.md (1)

37-38: Consider refining sentence for clarity
Instead of:
“By separating the encode from the prefill and decode stages, we can have a more flexible deployment and scale the encode worker independently from the prefill and decode workers if needed.”

Consider:

  • “By separating encoding from the prefill and decode stages, …”
  • Use “independently of” instead of “independently from.”
🧰 Tools
🪛 LanguageTool

[grammar] ~37-~37: The verb ‘encode’ does not usually follow articles like ‘the’. Check that ‘encode’ is spelled correctly; using ‘encode’ as a noun may be non-standard.
Context: ..../llm/README.md) example. By separating the encode from the prefill and decode stages, we ...

(A_INFINITIVE)


[grammar] ~38-~38: The usual collocation for “independently” is “of”, not “from”. Did you mean “independently of”?
Context: ... deployment and scale the encode worker independently from the prefill and decode workers if neede...

(INDEPENDENTLY_FROM_OF)

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 17dd623 and 5eb7d7e.

📒 Files selected for processing (1)
  • examples/multimodal/README.md (4 hunks)
🧰 Additional context used
🪛 LanguageTool
examples/multimodal/README.md

[grammar] ~37-~37: The verb ‘encode’ does not usually follow articles like ‘the’. Check that ‘encode’ is spelled correctly; using ‘encode’ as a noun may be non-standard.
Context: ..../llm/README.md) example. By separating the encode from the prefill and decode stages, we ...

(A_INFINITIVE)


[grammar] ~38-~38: The usual collocation for “independently” is “of”, not “from”. Did you mean “independently of”?
Context: ... deployment and scale the encode worker independently from the prefill and decode workers if neede...

(INDEPENDENTLY_FROM_OF)

⏰ Context from checks skipped due to timeout of 90000ms (1)
  • GitHub Check: Build and Test - vllm
🔇 Additional comments (7)
examples/multimodal/README.md (7)

27-28: Terminology updated correctly
The references to vllm_worker have been successfully replaced with decode_worker in both the workers and processor bullets.


33-36: Aggregated deployment description is clear and accurate
The steps correctly describe how encode_worker and decode_worker interact via NATS and RDMA, and the terminology aligns with the rest of the document.


41-49: Mermaid diagram for aggregated serving looks correct
The flowchart accurately represents the interactions and is valid Mermaid syntax.


64-84: Client request example formatting is consistent
The curl command and embedded JSON payload are well-formatted and clear.


96-99: Disaggregated components list is consistent
The bullets correctly enumerate encode_worker, decode_worker, and prefill_worker, with updated terminology throughout.


111-120: Mermaid diagram for disaggregated serving is accurate
The flowchart correctly shows encode_workerprefill_workerdecode_worker along with the other interactions.


158-160: Response example uses proper JSON block
Labeling the response snippet with json enhances readability and clarity.

This change corrects the README.md file in the examples/multimodal folder:
- Correct "vllm worker" to "decode worker"
- Correct assertion that data is moved via NATS when embeddings are moved via RDMA.

Additionally, this change updates the textual graphs with Mermaid graphs for improved presentation on github.com.
@whoisj whoisj force-pushed the whoisj/mmdocupdate branch from 5eb7d7e to 0c4a230 Compare May 29, 2025 20:34
Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 2

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 5eb7d7e and 0c4a230.

📒 Files selected for processing (1)
  • examples/multimodal/README.md (3 hunks)
🧰 Additional context used
🪛 LanguageTool
examples/multimodal/README.md

[grammar] ~37-~37: The verb ‘encode’ does not usually follow articles like ‘the’. Check that ‘encode’ is spelled correctly; using ‘encode’ as a noun may be non-standard.
Context: ..../llm/README.md) example. By separating the encode from the prefill and decode stages, we ...

(A_INFINITIVE)


[grammar] ~38-~38: The usual collocation for “independently” is “of”, not “from”. Did you mean “independently of”?
Context: ... deployment and scale the encode worker independently from the prefill and decode workers if neede...

(INDEPENDENTLY_FROM_OF)

⏰ Context from checks skipped due to timeout of 90000ms (1)
  • GitHub Check: Build and Test - vllm
🔇 Additional comments (6)
examples/multimodal/README.md (6)

27-29: Terminology updated to decode_worker
The component bullets now consistently replace “vllm_worker” with “decode_worker” in the aggregated section.


33-36: Aggregated deployment description is clear
The narrative correctly outlines the roles of encode_worker and decode_worker, and specifies NATS/RDMA usage.


64-77: Aggregated client JSON formatting is correct
The curl request JSON is properly formatted within the bash snippet, using single quotes for shell and double quotes for JSON.

Also applies to: 79-84


96-97: Disaggregated terminology updated to decode_worker
The components list consistently uses decode_worker in place of the old generic term.


102-102: Disaggregated deployment description is accurate
The text now correctly describes the encode→prefill→decode workflow and NATS/RDMA communication channels.

Also applies to: 104-106


134-146: Disaggregated client JSON formatting is correct
The curl example JSON is well-formatted for shell use and consistent with the aggregated example.

Also applies to: 149-154

@whoisj whoisj enabled auto-merge (squash) May 29, 2025 21:38
@whoisj whoisj disabled auto-merge May 29, 2025 21:38
@whoisj whoisj merged commit fb4bf25 into ai-dynamo:main May 29, 2025
7 checks passed
@whoisj whoisj deleted the whoisj/mmdocupdate branch May 29, 2025 21:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
docs documentation Improvements or additions to documentation size/L
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants