frotms

frotms

Amoy

Achievements

Stars

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 11,066 1,600 Updated Jul 20, 2025

HW-whistleblower / True-Story-of-Pangu

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,219 1,380 Updated Jul 9, 2025

bytedance / trae-agent

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Python 8,091 763 Updated Jul 17, 2025

MoonshotAI / Kimi-Dev

open-source coding LLM for software engineering tasks

Python 847 103 Updated Jun 27, 2025

BeehiveInnovations / zen-mcp-server

The power of Claude Code + [Gemini / OpenAI / Grok / OpenRouter / Ollama / Custom Model / All Of The Above] working as one.

Python 4,890 451 Updated Jun 30, 2025

pyenv / pyenv

Simple Python version management

Roff 42,657 3,193 Updated Jul 19, 2025

ollama / ollama

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 147,029 12,459 Updated Jul 19, 2025

mudler / LocalAI

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…

Go 33,973 2,642 Updated Jul 20, 2025

bentoml / OpenLLM

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

Python 11,585 752 Updated Jul 14, 2025

langgenius / dify

Production-ready platform for agentic workflow development.

TypeScript 107,704 16,351 Updated Jul 20, 2025

skyzh / tiny-llm

A course of learning LLM inference serving on Apple Silicon for systems engineers.

Python 2,780 156 Updated Jun 14, 2025

xlite-dev / LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 5,607 595 Updated Jul 16, 2025

langflow-ai / langflow

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python 88,394 7,120 Updated Jul 20, 2025

ned14 / quickcpplib

Eliminate all the tedious hassle when making state-of-the-art C++ 14 - 23 libraries!

C 181 24 Updated May 21, 2025

Fosowl / agenticSeek

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…

Python 20,186 2,007 Updated Jul 13, 2025

zilliztech / deep-searcher

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Python 6,551 644 Updated Jul 10, 2025

NVIDIA / nvbandwidth

A tool for bandwidth measurements on NVIDIA GPUs.

C++ 485 47 Updated Apr 15, 2025

xlite-dev / Awesome-LLM-Inference

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,254 294 Updated Jul 14, 2025

QwenLM / Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,341 256 Updated Jun 12, 2025

frotms / PaddleOCR2Pytorch

PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)

Python 1,014 195 Updated Jun 3, 2025

apple2333cream / t5-ort-cpp

T5 onnxruntime cpp

C++ 8 Updated Jul 28, 2024

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 3,389 387 Updated Jul 20, 2025

harry0703 / MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 38,175 5,506 Updated Jun 11, 2025

pythonstock / stock

stock，股票系统。使用python进行开发。

Python 7,245 2,311 Updated Mar 4, 2025

bytedance / deer-flow

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 15,419 1,915 Updated Jul 20, 2025

evalstate / fast-agent

Define, Prompt and Test MCP enabled Agents and Workflows

Python 2,767 283 Updated Jul 20, 2025

punkpeye / awesome-mcp-servers

A collection of MCP servers.

62,467 4,902 Updated Jul 13, 2025

microsoft / mscclpp

MSCCL++: A GPU-driven communication stack for scalable AI applications

C++ 387 61 Updated Jul 18, 2025

tadata-org / fastapi_mcp

Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!

Python 6,646 551 Updated Jul 14, 2025

fastapi / fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 87,473 7,618 Updated Jul 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

frotms

Achievements

Achievements

Block or report frotms

Stars

NVIDIA / TensorRT-LLM

HW-whistleblower / True-Story-of-Pangu

bytedance / trae-agent

MoonshotAI / Kimi-Dev

BeehiveInnovations / zen-mcp-server

pyenv / pyenv

ollama / ollama

mudler / LocalAI

bentoml / OpenLLM

langgenius / dify

skyzh / tiny-llm

xlite-dev / LeetCUDA

langflow-ai / langflow

ned14 / quickcpplib

Fosowl / agenticSeek

zilliztech / deep-searcher

NVIDIA / nvbandwidth

xlite-dev / Awesome-LLM-Inference

QwenLM / Qwen2.5-Omni

frotms / PaddleOCR2Pytorch

apple2333cream / t5-ort-cpp

flashinfer-ai / flashinfer

harry0703 / MoneyPrinterTurbo

pythonstock / stock

bytedance / deer-flow

evalstate / fast-agent

punkpeye / awesome-mcp-servers

microsoft / mscclpp

tadata-org / fastapi_mcp

fastapi / fastapi