[Doc] Added ray-serve llm doc #52832

Blaze-DSP · 2025-05-07T05:29:25Z

Why are these changes needed?

Add example of serving a Large Language Model using Ray Serve LLM on Kubernetes

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

kevin85421 · 2025-05-07T21:35:09Z

Can you fix the CI error? In addition, each commit needs to commit with git commit .... -s to avoid "DCO" failing.

pcmoritz

Thanks a lot for contributing this, it looks good to me. Before merging this, we first need to merge ray-project/kuberay#3517 :)

@kevin85421 Can you drive that PR forward?

doc/source/cluster/kubernetes/examples/rayserve-llm-example.md

Signed-off-by: DPatel_7 <dpatel@gocommotion.com>

Blaze-DSP · 2025-05-12T15:00:00Z

made updates. @kevin85421

Signed-off-by: DPatel_7 <dpatel@gocommotion.com>

doc/source/cluster/kubernetes/examples/rayserve-llm-example.md

Signed-off-by: DPatel_7 <dpatel@gocommotion.com>

kevin85421

Would you mind verifying whether the YAML (ray-project/kuberay#3517 (review)) still works or not? This doc has removed the step of creating a namespace, but the YAML still uses the namespace.

Signed-off-by: DPatel_7 <dpatel@gocommotion.com>

eicherseiji

Taking a closer look this afternoon; might push some edits

doc/source/cluster/kubernetes/examples/rayserve-llm-example.md

Signed-off-by: Seiji Eicher <seiji@anyscale.com>

doc/source/cluster/kubernetes/examples/rayserve-llm-example.md

Signed-off-by: Seiji Eicher <seiji@anyscale.com>

eicherseiji · 2025-06-09T23:51:27Z

@kevin85421 Ready to merge when green

doc/source/cluster/kubernetes/examples/rayserve-llm-example.md

Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com> Co-authored-by: Kai-Hsun Chen <kaihsun@apache.org> Signed-off-by: Seiji Eicher <58963096+eicherseiji@users.noreply.github.com>

Signed-off-by: Seiji Eicher <seiji@anyscale.com>

kevin85421

Looks great!

kevin85421 · 2025-06-12T18:13:49Z

doc/source/cluster/kubernetes/examples/rayserve-llm-example.md

+For additional security, instead of passing the HF Access Token directly as an environment variable, create a Kubernetes Secret containing your Hugging Face access token. Download the Ray Serve LLM service config .yaml file using the following command:
+
+```sh
+curl -o ray-service.llm-serve.yaml https://raw.githubusercontent.com/ray-project/kuberay/master/ray-operator/config/samples/ray-service.llm-serve.yaml


We can use the release branch https://github.com/ray-project/kuberay/tree/release-1.4.

Seems like ray-project/kuberay#3517 needs to be cherry-picked

https://github.com/ray-project/kuberay/tree/release-1.4/ray-operator/config/samples/ray-service.llm-serve.yaml

doc/source/cluster/kubernetes/examples/rayserve-llm-example.md

angelinalg

Just have some style nits. I'd appreciate you fixing prior to merge to decrease tech debt. Thanks!

Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com> Signed-off-by: Seiji Eicher <58963096+eicherseiji@users.noreply.github.com>

kevin85421 · 2025-06-12T23:48:42Z

cc @jjyao @edoakes would you mind merging this PR? Thanks!

Add example of serving a Large Language Model using Ray Serve LLM on Kubernetes Signed-off-by: DPatel_7 <dpatel@gocommotion.com> Signed-off-by: Seiji Eicher <seiji@anyscale.com> Signed-off-by: Seiji Eicher <58963096+eicherseiji@users.noreply.github.com> Co-authored-by: DPatel_7 <dpatel@gocommotion.com> Co-authored-by: Seiji Eicher <seiji@anyscale.com> Co-authored-by: Seiji Eicher <58963096+eicherseiji@users.noreply.github.com> Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com> Co-authored-by: Kai-Hsun Chen <kaihsun@apache.org> Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>

Blaze-DSP requested review from pcmoritz, kevin85421 and a team as code owners May 7, 2025 05:29

Blaze-DSP mentioned this pull request May 7, 2025

Added Ray-Serve Config For LLMs ray-project/kuberay#3517

Merged

hainesmichaelc added the community-contribution Contributed by the community label May 7, 2025

Blaze-DSP force-pushed the ray-serve/llm branch 2 times, most recently from 1ff1edc to b49a3e0 Compare May 8, 2025 17:19

pcmoritz approved these changes May 8, 2025

View reviewed changes

kevin85421 reviewed May 10, 2025

View reviewed changes

kevin85421 reviewed May 11, 2025

View reviewed changes

doc/source/cluster/kubernetes/examples/rayserve-llm-example.md Outdated Show resolved Hide resolved

doc/source/cluster/kubernetes/examples/rayserve-llm-example.md Show resolved Hide resolved

DPatel_7 added 6 commits May 12, 2025 20:26

added ray-serve llm doc

c3802ca

Signed-off-by: DPatel_7 <dpatel@gocommotion.com>

ci error fix

7cdb371

Signed-off-by: DPatel_7 <dpatel@gocommotion.com>

fix

b9a81d1

Signed-off-by: DPatel_7 <dpatel@gocommotion.com>

updates

b001abc

Signed-off-by: DPatel_7 <dpatel@gocommotion.com>

ray dashboard documentation

3ff7079

Signed-off-by: DPatel_7 <dpatel@gocommotion.com>

updates

f5c50e5

Signed-off-by: DPatel_7 <dpatel@gocommotion.com>

Blaze-DSP force-pushed the ray-serve/llm branch from c9e154a to f5c50e5 Compare May 12, 2025 14:56

fix

9e1ea64

Signed-off-by: DPatel_7 <dpatel@gocommotion.com>

masoudcharkhabi added serve Ray Serve Related Issue docs An issue or change related to documentation labels May 12, 2025

kevin85421 reviewed May 13, 2025

View reviewed changes

reference updates

412f85d

Signed-off-by: DPatel_7 <dpatel@gocommotion.com>

kevin85421 reviewed May 13, 2025

View reviewed changes

hainesmichaelc added community-backlog and removed community-backlog labels May 22, 2025

akshay-anyscale added the llm label May 23, 2025

updates

c1f4e6a

Signed-off-by: DPatel_7 <dpatel@gocommotion.com>

DPatel_7 added 2 commits May 28, 2025 12:38

updates

841512f

Signed-off-by: DPatel_7 <dpatel@gocommotion.com>

updates

6fba5b6

Signed-off-by: DPatel_7 <dpatel@gocommotion.com>

eicherseiji self-assigned this Jun 5, 2025

eicherseiji requested changes Jun 9, 2025

View reviewed changes

doc/source/cluster/kubernetes/examples/rayserve-llm-example.md Outdated Show resolved Hide resolved

P0 edits

1b2d8b6

Signed-off-by: Seiji Eicher <seiji@anyscale.com>

eicherseiji approved these changes Jun 9, 2025

View reviewed changes

dstrodtman reviewed Jun 9, 2025

View reviewed changes

doc/source/cluster/kubernetes/examples/rayserve-llm-example.md Outdated Show resolved Hide resolved

Remove verbose output

1e59108

Signed-off-by: Seiji Eicher <seiji@anyscale.com>

eicherseiji added the go add ONLY when ready to merge, run all tests label Jun 9, 2025

kevin85421 reviewed Jun 10, 2025

View reviewed changes

angelinalg reviewed Jun 10, 2025

View reviewed changes

eicherseiji and others added 5 commits June 10, 2025 12:54

Apply suggestions from code review

e8043e0

Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com> Co-authored-by: Kai-Hsun Chen <kaihsun@apache.org> Signed-off-by: Seiji Eicher <58963096+eicherseiji@users.noreply.github.com>

Vale

dc187df

Signed-off-by: Seiji Eicher <seiji@anyscale.com>

Respond to comments

ffb2c8a

Signed-off-by: Seiji Eicher <seiji@anyscale.com>

Explain serve config and add API documernation link

5caac80

Signed-off-by: Seiji Eicher <seiji@anyscale.com>

Fix vale

7cf914f

Signed-off-by: Seiji Eicher <seiji@anyscale.com>

kevin85421 approved these changes Jun 12, 2025

View reviewed changes

angelinalg reviewed Jun 12, 2025

View reviewed changes

doc/source/cluster/kubernetes/examples/rayserve-llm-example.md Outdated Show resolved Hide resolved

doc/source/cluster/kubernetes/examples/rayserve-llm-example.md Outdated Show resolved Hide resolved

angelinalg approved these changes Jun 12, 2025

View reviewed changes

eicherseiji and others added 2 commits June 12, 2025 13:00

Apply suggestions from code review

6c86e98

Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com> Signed-off-by: Seiji Eicher <58963096+eicherseiji@users.noreply.github.com>

Merge branch 'master' into ray-serve/llm

a57761a

jjyao changed the title ~~added ray-serve llm doc~~ [Doc] added ray-serve llm doc Jun 12, 2025

jjyao changed the title ~~[Doc] added ray-serve llm doc~~ [Doc] Added ray-serve llm doc Jun 12, 2025

jjyao merged commit d3c025c into ray-project:master Jun 12, 2025
5 checks passed

[Doc] Added ray-serve llm doc #52832

[Doc] Added ray-serve llm doc #52832

Uh oh!

Conversation

Blaze-DSP commented May 7, 2025 • edited by jjyao Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why are these changes needed?

Related issue number

Checks

Uh oh!

kevin85421 commented May 7, 2025

Uh oh!

pcmoritz left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Blaze-DSP commented May 12, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kevin85421 left a comment

Choose a reason for hiding this comment

Uh oh!

eicherseiji left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

eicherseiji commented Jun 9, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kevin85421 left a comment

Choose a reason for hiding this comment

Uh oh!

kevin85421 Jun 12, 2025

Choose a reason for hiding this comment

Uh oh!

eicherseiji Jun 12, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Blaze-DSP commented May 7, 2025 •

edited by jjyao

Loading