Popular repositories Loading
-
ms-k8s-vgpu-scheduler
ms-k8s-vgpu-scheduler PublicForked from 4paradigm/k8s-vgpu-scheduler
OpenAIOS vGPU device plugin for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory, in order to allow applications to access larger memory space than its physical ca…
Go
-
go-streams
go-streams PublicForked from reugn/go-streams
A lightweight stream processing library for Go
Go
-
lorax
lorax PublicForked from predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Python
-
langgraph
langgraph PublicForked from langchain-ai/langgraph
Build resilient language agents as graphs.
Python
-
qdrant-operator
qdrant-operator PublicForked from ganochenkodg/qdrant-operator
Kubernetes operator for Qdrant
JavaScript
-
vllm
vllm PublicForked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
Repositories
- sglang Public Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
nstream-ai/sglang’s past year of commit activity - vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
nstream-ai/vllm’s past year of commit activity - lorax Public Forked from predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
nstream-ai/lorax’s past year of commit activity - ms-k8s-vgpu-scheduler Public Forked from 4paradigm/k8s-vgpu-scheduler
OpenAIOS vGPU device plugin for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory, in order to allow applications to access larger memory space than its physical capacity. It is designed for ease of use of extended device memory for AI workloads.
nstream-ai/ms-k8s-vgpu-scheduler’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…