Stars
A python package to build AI-powered real-time audio applications
Real-time & local speech-to-text server.
Full python interactive 3D Gaussian Splatting viewer for real-time editing and analyzing.
[SIGGRAPH 2025] LAM: Large Avatar Model for One-shot Animatable Gaussian Head
EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
Production-grade 3D gaussian splatting with CPU/GPU support for Windows, Mac and Linux 🚀
FlashInfer: Kernel Library for LLM Serving
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Digital Human Resource: 2D/3D/4D Human Modeling, Avatar Generation & Animation, Clothed People Digitalization, Virtual Try-On, and Others.
Accelerate inference in Flux and Sana for ComfyUI.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
AIGCPanel 是一个简单易用的一站式AI数字人系统,支持视频合成、声音合成、声音克隆,简化本地模型管理、一键导入和使用AI模型。
Real time interactive streaming digital human
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching
Combining Teacache with xDiT to Accelerate Visual Generation Models
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Model Compression Toolbox for Large Language Models and Diffusion Models
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术