+
Skip to content
View frotms's full-sized avatar

Block or report frotms

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 11,066 1,600 Updated Jul 20, 2025

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,219 1,380 Updated Jul 9, 2025

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Python 8,091 763 Updated Jul 17, 2025

open-source coding LLM for software engineering tasks

Python 847 103 Updated Jun 27, 2025

The power of Claude Code + [Gemini / OpenAI / Grok / OpenRouter / Ollama / Custom Model / All Of The Above] working as one.

Python 4,890 451 Updated Jun 30, 2025

Simple Python version management

Roff 42,657 3,193 Updated Jul 19, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 147,029 12,459 Updated Jul 19, 2025

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…

Go 33,973 2,642 Updated Jul 20, 2025

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

Python 11,585 752 Updated Jul 14, 2025

Production-ready platform for agentic workflow development.

TypeScript 107,704 16,351 Updated Jul 20, 2025

A course of learning LLM inference serving on Apple Silicon for systems engineers.

Python 2,780 156 Updated Jun 14, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 5,607 595 Updated Jul 16, 2025

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python 88,394 7,120 Updated Jul 20, 2025

Eliminate all the tedious hassle when making state-of-the-art C++ 14 - 23 libraries!

C 181 24 Updated May 21, 2025

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…

Python 20,186 2,007 Updated Jul 13, 2025

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Python 6,551 644 Updated Jul 10, 2025

A tool for bandwidth measurements on NVIDIA GPUs.

C++ 485 47 Updated Apr 15, 2025

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,254 294 Updated Jul 14, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,341 256 Updated Jun 12, 2025

PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)

Python 1,014 195 Updated Jun 3, 2025

T5 onnxruntime cpp

C++ 8 Updated Jul 28, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 3,389 387 Updated Jul 20, 2025

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 38,175 5,506 Updated Jun 11, 2025

stock,股票系统。使用python进行开发。

Python 7,245 2,311 Updated Mar 4, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 15,419 1,915 Updated Jul 20, 2025

Define, Prompt and Test MCP enabled Agents and Workflows

Python 2,767 283 Updated Jul 20, 2025

A collection of MCP servers.

62,467 4,902 Updated Jul 13, 2025

MSCCL++: A GPU-driven communication stack for scalable AI applications

C++ 387 61 Updated Jul 18, 2025

Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!

Python 6,646 551 Updated Jul 14, 2025

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 87,473 7,618 Updated Jul 14, 2025
Next
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载