Lists (1)
Sort Name ascending (A-Z)
Stars
Incredibly fast Whisper-large-v3
React Flow | Svelte Flow - Powerful open source libraries for building node-based UIs with React (https://reactflow.dev) or Svelte (https://svelteflow.dev). Ready out-of-the-box and infinitely cust…
Generate audiobooks from e-books, voice cloning & 1107+ languages!
A free, open source, and extensible speech-to-text application that works completely offline.
High performance self-hosted photo and video management solution.
A enterprise-grade Chinese-English code switch punctuator from funasr.
CapsWriter 的离线版,一个好用的 PC 端的语音输入工具
Awesome curated collection of images and prompts generated by gemini-2.5-flash-image (aka Nano Banana) state-of-the-art image generation and editing model. Explore AI generated visuals created with…
Chrome DevTools for coding agents
A personalized language-learning tool that combines Duolingo-style lessons with your own curated vocabulary lists. Seamlessly add words from books, articles, or videos, and revisit them through in…
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
💫 Toolkit to help you get started with Spec-Driven Development
Recursive-Open-Meta-Agent v0.1 (Beta). A meta-agent framework to build high-performance multi-agent systems.
⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…
A minimalistic cross-platform eBook reader built with Tauri, Epub.js, and Typescript
A multi-voice TTS system trained with an emphasis on quality
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
The glamourous AI coding agent for your favourite terminal 💘
Kronos: A Foundation Model for the Language of Financial Markets
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
Text-audio foundation model from Boson AI