An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 3,473 288 Updated Aug 14, 2025

fastapi / full-stack-fastapi-template

Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.

TypeScript 38,259 7,353 Updated Oct 8, 2025

hcengineering / platform

Huly — All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)

TypeScript 23,290 1,598 Updated Oct 11, 2025

hatchet-dev / hatchet

🪓 Run Background Tasks at Scale

Go 6,097 266 Updated Oct 11, 2025

resemble-ai / chatterbox

SoTA open-source TTS

Python 13,791 1,794 Updated Sep 25, 2025

mostlygeek / llama-swap

Model swapping for llama.cpp (or any local OpenAI API compatible server)

Go 1,664 110 Updated Oct 11, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 24,969 1,741 Updated Sep 28, 2025

SCZwangxiao / video-FlexReduc

Official implementation of paper AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understanding

Python 85 6 Updated Apr 23, 2025

pydantic / pydantic-ai

GenAI Agent Framework, the Pydantic way

Python 12,868 1,294 Updated Oct 10, 2025

merlresearch / tf-locoformer

Transformer with Local Modeling by Convolution for Speech Separation and Enhancement

Python 97 7 Updated Aug 8, 2025

nari-labs / dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,563 1,597 Updated Jul 6, 2025

Lex-au / Orpheus-FastAPI

High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.

Python 568 114 Updated Jul 5, 2025

Portkey-AI / gateway

A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.

TypeScript 9,648 749 Updated Oct 11, 2025

QuixiAI / dolphin-mcp

Python 519 56 Updated May 21, 2025

canopyai / Orpheus-TTS

Towards Human-Sounding Speech

Python 5,615 469 Updated May 6, 2025

python-arq / arq

Fast job queuing and RPC in python with asyncio and redis.

Python 2,688 195 Updated Jan 6, 2025

pydantic / logfire

Uncomplicated Observability for Python and beyond! 🪵🔥

Python 3,642 169 Updated Oct 9, 2025

AgentDeskAI / browser-tools-mcp

Monitor browser logs directly from Cursor and other MCP compatible IDEs.

JavaScript 6,692 499 Updated Mar 26, 2025

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 27,401 2,701 Updated Apr 30, 2025

google / speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Python 431 39 Updated Aug 12, 2025

getzep / graphiti

Build Real-Time Knowledge Graphs for AI Agents

Python 18,944 1,745 Updated Oct 9, 2025

gabrielchua / open-notebooklm

Forked from knowsuchagency/pdf-to-podcast

Convert any PDF into a podcast episode!

Python 2,477 279 Updated Dec 7, 2024

Daniel Zimmermann d-zimmermann

Lists (2)

AI

microcontroller

Stars

d-zimmermann / joinly

DendyLusus / miyoo355-drivers

Abdess / retroarch_system

joinly-ai / joinly

modelscope / ClearerVoice-Studio