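This project shares the technical principles behind large language models along with hands-on experience (LLM engineering and deploying LLM applications in production).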
Updated Aug 3, 2025 - HTML
List of software that allows searching the web with the assistance of AI: https://hf.co/spaces/felladrin/awesome-ai-web-search
Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs
A curated collection of noteworthy MLSys bloggers (Algorithms/Systems)
🚀 LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases.
A static-page vanilla-js interface for various LLM APIs (OpenAI, Claude, Gemini, Together).
A small VLM that sees everything
AI access made free for everyone!
Welcome to our AI Battle! Ask a question and let our two AI models battle it out
Web Client For Ollama - Llama LLM
Template for building microservice-based apps with a frontend, backend, LLM serving engine (e.g., vllm), and nginx.
GUI for GGML Alpaca models
Your 1-click podcast :)
Personal website, dedicated to sharing some useful computer learning knowledge!
LLM-based international M&A negotiation simulator.
Chat with webpages using AI and RAG
My expanding collection of scripts and tools designed to aid in working with large language models, understanding their performance characteristics and context limitations.