vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs
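For orientation, a minimal sketch of vLLM's offline batch-inference API follows; the model name and prompts are placeholders, and this assumes vLLM is installed with a working GPU backend (CUDA or ROCm).

    from vllm import LLM, SamplingParams

    # Placeholder prompts and model; any supported Hugging Face causal LM works.
    prompts = ["What is ROCm?", "Summarize paged attention in one sentence."]
    params = SamplingParams(temperature=0.8, max_tokens=64)

    llm = LLM(model="facebook/opt-125m")  # illustrative model choice
    for out in llm.generate(prompts, params):
        print(out.prompt, "->", out.outputs[0].text)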
Open deep learning compiler stack for CPU, GPU, and specialized accelerators
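As a rough sketch of how such a compiler stack is driven, here is TVM's documented Relay import-and-build flow; the ONNX file name and input shape are assumptions.

    import onnx
    import tvm
    from tvm import relay
    from tvm.contrib import graph_executor

    # Load a model and convert it to Relay IR (file name and shape are placeholders).
    onnx_model = onnx.load("model.onnx")
    mod, params = relay.frontend.from_onnx(onnx_model, shape={"input": (1, 3, 224, 224)})

    # Compile for a plain CPU target; swap "llvm" for "cuda" or "rocm" as needed.
    with tvm.transform.PassContext(opt_level=3):
        lib = relay.build(mod, target="llvm", params=params)

    dev = tvm.cpu()
    module = graph_executor.GraphModule(lib["default"](dev))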
Simple, scalable AI model deployment on GPU clusters
Stable Diffusion web UI
A deep learning package for many-body potential energy representation and molecular dynamics
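This is deepmd-kit; a minimal sketch of its documented Python inference API follows, with the frozen-model file name and the toy two-atom frame as placeholders.

    import numpy as np
    from deepmd.infer import DeepPot

    # Load a frozen model (placeholder file name) and evaluate one frame of two atoms.
    dp = DeepPot("graph.pb")
    coords = np.array([[0.0, 0.0, 0.0, 0.0, 0.0, 1.0]])  # shape (nframes, natoms*3), Angstrom
    cells = (np.eye(3) * 10.0).reshape(1, 9)             # periodic box per frame
    atom_types = [0, 1]                                  # model-specific type indices
    energy, force, virial = dp.eval(coords, cells, atom_types)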
Self-host the powerful Chatterbox TTS model. This server offers a user-friendly web UI, flexible API endpoints (including OpenAI-compatible ones), predefined voices, voice cloning, and audiobook-scale text processing. Runs accelerated on NVIDIA (CUDA) and AMD (ROCm) GPUs, with a CPU fallback.
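Since the server advertises OpenAI-compatible endpoints, the stock OpenAI client's speech call should work against it, roughly as sketched below; the base URL, port, model, and voice names are assumptions, not values from this project.

    from openai import OpenAI

    # Point the standard OpenAI client at the local server's OpenAI-compatible route.
    # Base URL, api_key, model, and voice are placeholders; check the project's docs.
    client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed")
    resp = client.audio.speech.create(
        model="chatterbox",
        voice="default",
        input="Hello from a self-hosted TTS server.",
    )
    resp.write_to_file("hello.wav")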
Set up inference profiling in 60 seconds
Horizon chart for CPU/GPU/Neural Engine utilization monitoring. Supports Apple M1-M4, NVIDIA GPUs, and AMD GPUs.
ROCm Install Utilities: a rocminstall.py script for installing a specific ROCm release version/revision.
AMD RAD's experimental RMA library for Triton.