deepseek-r1

Here are 247 public repositories matching this topic...

xtekky / gpt4free

The official gpt4free repository | various collection of powerful language models | o4, o3 and deepseek r1, gpt-4.1, gemini 2.5

Updated Aug 10, 2025
Python

unslothai / unsloth

Sponsor

Star

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Updated Aug 9, 2025
Python

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin993886460 (Beware of fake account)

ai agents autonomous-agents voice-assistant llm llm-agents agentic-ai deepseek-r1

Updated Jul 13, 2025
Python

1Panel-dev / MaxKB

Star

🔥 MaxKB is an open-source platform for building enterprise-grade agents. MaxKB 是强大易用的开源企业级智能体平台。

agent chatbot knowledgebase rag llm langchain pgvector ollama maxkb llama3 agentic-ai mcp-server deepseek-r1 qwen3

Updated Aug 8, 2025
Python

sgl-project / sglang

Star

SGLang is a fast serving framework for large language models and vision language models.

Updated Aug 10, 2025
Python

modelscope / ms-swift

Star

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4v, Phi4, ...) (AAAI 2025).

Updated Aug 10, 2025
Python

zilliztech / deep-searcher

Star

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

agent openai grok claude rag milvus vector-database llm zilliz deepseek agentic-rag grok3 reasoning-models deepseek-r1 deep-research qwen3 llama4

Updated Jul 10, 2025
Python

om-ai-lab / VLM-R1

Star

Solve Visual Understanding with Reinforced VLMs

reinforcement-learning vlm multimodal llm qwen deepseek-r1 grpo r1-zero vlm-r1 multimodal-r1

Updated Jun 26, 2025
Python

xlite-dev / Awesome-LLM-Inference

Star

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

mla vllm llm-inference awesome-llm flash-attention tensorrt-llm paged-attention deepseek flash-attention-3 deepseek-v3 minimax-01 deepseek-r1 flash-mla qwen3

Updated Aug 6, 2025
Python

SkyworkAI / Skywork-R1V

Star

Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI (Kunlun Inc.), specializing in vision-language reasoning.

reinforcement-learning reasoning vlm llm multimodal-understanding deepseek-r1 grpo vlm-r1 multimodal-r1 r1v skywork-r1v

Updated Aug 2, 2025
Python

sunnynexus / WebThinker

Star

🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability

research gaia reasoning hle reportgen o3 qwq webwalker o1 deepsearch deepseek-r1 gpqa deepresearch

Updated Jul 30, 2025
Python

ScienceOne-AI / DeepSeek-671B-SFT-Guide

Star

An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions. (DeepSeek-V3/R1 满血版 671B 全参数微调的开源解决方案，包含从训练到推理的完整代码和脚本，以及实践中积累一些经验和结论。)

python moe sft llm deepseek-r1

Updated Mar 13, 2025
Python

langfengQ / verl-agent

Star

Official code for paper "Group-in-Group Policy Optimization for LLM Agent Training". This codebase supports training LLM/VLM agents via RL.

reinforcement-learning agent-framework large-language-models llm-training llm-agents deepseek-r1 grpo gigpo

Updated Aug 10, 2025
Python

turningpoint-ai / VisualThinker-R1-Zero

Star

Explore the Multimodal “Aha Moment” on 2B Model

reinforcement-learning reasoning r1 post-training multimodal deepseek deepseek-r1 grpo deepseek-r1-zero r1-zero multimodal-journey multimodal-r1

Updated Mar 18, 2025
Python

ModelTC / LightCompress

Star

[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".