Highlights
- Pro
Stars
自建Claude Code镜像,一站式开源中转服务,让 Claude、OpenAI、Gemini、Droid 订阅统一接入,支持拼车共享,更高效分摊成本,原生工具无缝使用。
Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.
Real-time & local speech-to-text server.
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
🌊 The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade archite…
基于 Playwright 和AI过滤的闲鱼多任务实时/定时监控与智能分析工具,配备了功能完善的后台管理界面。帮助用户节省闲鱼商品过滤,能及时找到心仪商品。
Context engineering is the new vibe coding - it's the way to actually make AI coding assistants work. Claude Code is the best for this so that's what this repo is centered around, but you can apply…
🌐 The open-source Agentic browser; privacy-first alternative to Perplexity Comet, Arc, Dia
Fast-TTS 是一个基于异步框架的文本到语音转换(TTS)生成器项目。该项目利用了异步编程技术来高效处理请求和响应,实现了快速、秒级的流式生成长文本语音播放服务。Fast-TTS 可以快速地将长文本转换为语音流,并实时播放,适用于多种应用场景,如语音合成、智能助手、内容朗读等。
Open Source framework for voice and multimodal conversational AI
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A CLI tool for analyzing Claude Code/Codex CLI usage from local JSONL files.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
PyTorch code and models for VJEPA2 self-supervised learning from video.
The ultimate training toolkit for finetuning diffusion models
✨✨VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model
Official PyTorch implementation for "Large Language Diffusion Models"
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…
FlowGram is a node-based flow building engine that helps developers quickly create workflows in either fixed layout or free connection layout modes
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Have a natural, spoken conversation with AI!
一个基于Python和Flask的Web应用,整合了多维度股票分析能力和人工智能辅助决策功能。系统通过AKShare获取股票数据,结合技术分析、基本面分析和资金面分析,为投资者提供全方位的投资决策支持。
Making a mini version of the BDX droid. https://discord.gg/UtJZsgfQGe
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.