qwen3

Here are 39 public repositories matching this topic...

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Updated Aug 15, 2025
Python

1Panel-dev / MaxKB

Star

🔥 MaxKB is an open-source platform for building enterprise-grade agents. MaxKB 是强大易用的开源企业级智能体平台。

agent chatbot knowledgebase rag llm langchain pgvector ollama maxkb llama3 agentic-ai mcp-server deepseek-r1 qwen3

Updated Aug 15, 2025
Python

sgl-project / sglang

Star

SGLang is a fast serving framework for large language models and vision language models.

Updated Aug 15, 2025
Python

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4v, Phi4, ...) (AAAI 2025).

Updated Aug 15, 2025
Python

zilliztech / deep-searcher

Star

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

agent openai grok claude rag milvus vector-database llm zilliz deepseek agentic-rag grok3 reasoning-models deepseek-r1 deep-research qwen3 llama4

Updated Jul 10, 2025
Python

OpenPipe / ART

Star

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

agent reinforcement-learning rl lora llms qwen kimi-ai agentic-ai grpo qwen3

Updated Aug 15, 2025
Python

xlite-dev / Awesome-LLM-Inference

Star

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

mla vllm llm-inference awesome-llm flash-attention tensorrt-llm paged-attention deepseek flash-attention-3 deepseek-v3 minimax-01 deepseek-r1 flash-mla qwen3

Updated Aug 6, 2025
Python

NetEase-Media / grps_trtllm

Star

Higher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GPRS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function call, AI agents, distributed multi-GPU inference, multimodal capabilities, and a Gradio chat interface.

Updated May 14, 2025
Python

Zeyi-Lin / Qwen3-Medical-SFT

Star

Qwen3 Fine-tuning: Medical R1 Style Chat

r1 fine-tuning sft qwen3

Updated May 31, 2025
Python

aws-samples / easy-model-deployer

Star

Deploy open-source LLMs on AWS in minutes — with OpenAI-compatible APIs and a powerful CLI/SDK toolkit.

Updated Aug 15, 2025
Python

AaronFeng753 / Better-Qwen3

Star

Auto Thinking Mode switch for Qwen3 in Open webui

qwen open-webui qwen3

Updated May 8, 2025
Python

bold84 / cot_proxy

Star

Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and <think> tag filtering. Perfect for using advanced models with apps that lack parameter customization.

llm qwen3

Updated May 19, 2025
Python

adamjen / Prompt_Maker

Star

Makes a improved prompts from a basic prompt

agents crewai agentic-workflow qwen3

Updated Jun 19, 2025
Python

NVIDIA-NeMo / Automodel

Star

Fine-tune any Hugging Face LLM or VLM on day-0 using PyTorch-native features for GPU-accelerated distributed training with superior performance and memory efficiency.

python machine-learning ai pytorch openai llama mistral vlm finetuning huggingface llm llm-training finetuning-llms qwen llama3 gemma3 qwen3 gemma3n

Updated Aug 15, 2025
Python

gty111 / gLLM

Star

gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling

pipeline-parallelism tensor-parallelism llm-serving llm-inference pagedattention continuous-batching qwen3 token-throttling chunked-prefill

Updated Aug 15, 2025
Python

QwenLM / PolyMath

Star

Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"

multilingual mathematical-reasoning large-language-models qwen3

Updated May 22, 2025
Python

taishan1994 / LLM-Quantization

Star

记录量化LLM中的总结。

quantization llm gptq quarot qwen3

Updated Aug 13, 2025
Python

BaohaoLiao / frac-cot

Star

An efficient sampling method for long-CoT LLM with fractured CoT.

efficiency reasoning sampling-methods chain-of-thought llm-inference deepseek deepseek-r1 qwen3

Updated May 25, 2025
Python

Shuyib / tool_calling_api

Star

This project demonstrates function-calling with Python and Ollama, utilizing the Africa's Talking API to send airtime and messages to phone numbers using natural language prompts. Ollama + LLM w/ functions + Natural language = User Interface for non-coders.

Updated Jul 24, 2025
Python

DAILtech / Qwen3-deploy-for-developer

Star

Local deployment guidance of Qwen3 for developer, and CLI script implementation.

linux cli deployment cuda developer llm qwen3

Updated May 1, 2025
Python

Improve this page

Add a description, image, and links to the qwen3 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the qwen3 topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

qwen3

Here are 39 public repositories matching this topic...

unslothai / unsloth

1Panel-dev / MaxKB

sgl-project / sglang

modelscope / ms-swift

zilliztech / deep-searcher

OpenPipe / ART

xlite-dev / Awesome-LLM-Inference

NetEase-Media / grps_trtllm

Zeyi-Lin / Qwen3-Medical-SFT

aws-samples / easy-model-deployer

AaronFeng753 / Better-Qwen3

bold84 / cot_proxy

adamjen / Prompt_Maker

NVIDIA-NeMo / Automodel

gty111 / gLLM

QwenLM / PolyMath

taishan1994 / LLM-Quantization

BaohaoLiao / frac-cot

Shuyib / tool_calling_api

DAILtech / Qwen3-deploy-for-developer

Improve this page

Add this topic to your repo