qwen3

Star

Here are 130 public repositories matching this topic...

vllm-project / vllm

Sponsor

Star

A high-throughput and memory-efficient inference and serving engine for LLMs

Updated Oct 10, 2025
Python

Mintplex-Labs / anything-llm

Sponsor

Star

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

mcp web-scraping no-code ai-agents kimi multimodal rag moonshot vector-database llm localai local-llm ollama lmstudio deepseek llama3 custom-ai-agents mcp-servers qwen3

Updated Oct 9, 2025
JavaScript

unslothai / unsloth

Sponsor

Star

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Updated Oct 8, 2025
Python

sgl-project / sglang

Star

SGLang is a fast serving framework for large language models and vision language models.

Updated Oct 10, 2025
Python

1Panel-dev / MaxKB

Star

🔥 MaxKB is an open-source platform for building enterprise-grade agents. MaxKB 是强大易用的开源企业级智能体平台。

agent chatbot knowledgebase rag llm langchain pgvector ollama maxkb llama3 agentic-ai mcp-server deepseek-r1 qwen3

Updated Oct 10, 2025
Python

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).

Updated Oct 10, 2025
Python

OpenPipe / ART

Star

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

agent reinforcement-learning rl lora llms qwen agentic-ai grpo qwen3

Updated Oct 9, 2025
Python

zilliztech / deep-searcher

Star

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

agent openai grok claude rag milvus vector-database llm zilliz deepseek agentic-rag grok3 reasoning-models deepseek-r1 deep-research qwen3 llama4

Updated Jul 10, 2025
Python

NexaAI / nexa-sdk

Star

Run the latest LLMs and VLMs across GPU, NPU, and CPU with PC (Python/C++) & mobile (Android & iOS) support, running quickly with OpenAI gpt-oss, Granite4, Qwen3VL, Gemma 3n and more.

go sdk llama vlm on-device-ai llm stable-diffusion llama3 phi3 gemma3 qwen3 gpt-oss granite4 qwen3vl

Updated Oct 10, 2025
Go

xlite-dev / Awesome-LLM-Inference

Star

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

mla vllm llm-inference awesome-llm flash-attention tensorrt-llm paged-attention deepseek flash-attention-3 deepseek-v3 minimax-01 deepseek-r1 flash-mla qwen3

Updated Aug 19, 2025
Python

johnbean393 / Sidekick

Star

A native macOS app that allows users to chat with a local LLM that can respond with information from files, folders and websites on your Mac without installing any other software. Powered by llama.cpp.

macos swift ai chatbot llama agents ai-agents swiftui rag aichat llm qwen deepseek agentic-ai deepseek-r1 deep-research gemma3 qwen3 llama4

Updated Jul 29, 2025
Swift

papersgpt / papersgpt-for-zotero

Star

A powerful Zotero AI and MCP plugin with ChatGPT, Gemini, Claude, Grok, DeepSeek, OpenRouter, Kimi, GLM, SiliconFlow, GPT-oss, Gemma 3, Qwen 3

Updated Oct 1, 2025
JavaScript

kubewall / kubewall

Star

kubewall - Single-Binary Kubernetes Dashboard with Multi-Cluster Management & AI Integration. (OpenAI / Claude 4 / Gemini / DeepSeek / OpenRouter / Ollama / Qwen / LMStudio)

Updated Oct 6, 2025
TypeScript

coderonion / awesome-llm-and-aigc

Star

🚀🚀🚀A collection of some awesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applications.