The best ChatGPT that $100 can buy.

Python 22,877 2,242 Updated Oct 16, 2025

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Python 1,811 188 Updated Oct 9, 2025

AI Edge Quantizer: flexible post training quantization for LiteRT models.

Python 71 13 Updated Oct 15, 2025

Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.

Python 107 6 Updated Aug 13, 2025

Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)

Python 334 36 Updated Apr 10, 2025
Jupyter Notebook 184 24 Updated Oct 13, 2025

Frontier Open-Source Text-to-Speech

9,622 1,194 Updated Sep 5, 2025

Open Source Text-To-Speech Portuguese Dataset

177 16 Updated Feb 2, 2024

Spotify Scraper to extract all the information from spotify, download mp3 with cover of the song

Makefile 191 20 Updated Oct 16, 2025

Running any GGUF SLMs/LLMs locally, on-device in Android

Kotlin 536 76 Updated Sep 19, 2025

Awesome speech/audio LLMs, representation learning, and codec models

1,152 70 Updated Aug 13, 2025

DiFlow-TTS: Compact and Low-Latency Zero-Shot Text-to-Speech with Factorized Discrete Flow Matching

Python 56 8 Updated Sep 25, 2025

SoTA open-source TTS

Python 99 14 Updated Jun 7, 2025

SoTA open-source TTS

Python 13,928 1,820 Updated Sep 25, 2025

Fine-tune the LLM part of the Spark-TTS model

Python 111 18 Updated Mar 25, 2025
Python 465 42 Updated May 19, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 126 20 Updated Jul 25, 2025

An AI-powered interactive avatar engine using Live2D, LLM, ASR, TTS, and RVC. Ideal for VTubing, streaming, and virtual assistant applications.

C# 916 98 Updated Apr 23, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,595 1,602 Updated Jul 6, 2025

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 314 45 Updated Jul 21, 2025

A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.

Python 397 28 Updated Sep 15, 2025

Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995

Python 79 7 Updated Dec 3, 2024

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis

Python 140 16 Updated Jan 1, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 1,913 237 Updated Oct 16, 2025

GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code

Python 1,763 250 Updated Oct 18, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,426 1,965 Updated Oct 9, 2025

Official implementation for the SIGGRAPH Asia 2024 paper SPARK: Self-supervised Personalized Real-time Monocular Face Capture

Python 382 19 Updated Jun 30, 2025

A list of publicly available room impulse response datasets and scripts to download them.

Shell 513 46 Updated Oct 11, 2025

[Official Implementation] Acoustic Autoregressive Modeling 🔥

Python 71 6 Updated Aug 24, 2024