freds0

Frederico S. Oliveira freds0

Researcher in the area of NLP, Ph.D. student at UFG, focusing on speech synthesis and recognition using deep learning and also professor at UFMT.

91 followers · 96 following

UFMT
Cuiabá, Mato Grosso - Brazil
https://www.fredso.com.br
@fred_s0

Achievements

Highlights

Developer Program Member

Lists (12)

Sort

Speech-to-Text

Text-to-Speech

19 repositories

video-super-resolution

2 repositories

Voice-Conversion

9 repositories

Stars

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 22,877 2,242 Updated Oct 16, 2025

OpenBMB / VoxCPM

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Python 1,811 188 Updated Oct 9, 2025

google-ai-edge / ai-edge-quantizer

AI Edge Quantizer: flexible post training quantization for LiteRT models.

Python 71 13 Updated Oct 15, 2025

NVIDIA / diffusion-audio-restoration

Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.

Python 107 6 Updated Aug 13, 2025

freddyaboulton / orpheus-cpp

Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)

Python 334 36 Updated Apr 10, 2025

Vyvo-Labs / VyvoTTS

Jupyter Notebook 184 24 Updated Oct 13, 2025

microsoft / VibeVoice

Frontier Open-Source Text-to-Speech

9,622 1,194 Updated Sep 5, 2025

Edresson / TTS-Portuguese-Corpus

Open Source Text-To-Speech Portuguese Dataset

177 16 Updated Feb 2, 2024

AliAkhtari78 / SpotifyScraper

Spotify Scraper to extract all the information from spotify, download mp3 with cover of the song

Makefile 191 20 Updated Oct 16, 2025

shubham0204 / SmolChat-Android

Running any GGUF SLMs/LLMs locally, on-device in Android

Kotlin 536 76 Updated Sep 19, 2025

ga642381 / speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

1,152 70 Updated Aug 13, 2025

DiFlow-TTS / DiFlow-TTS

DiFlow-TTS: Compact and Low-Latency Zero-Shot Text-to-Speech with Factorized Discrete Flow Matching

Python 56 8 Updated Sep 25, 2025

stlohrey / chatterbox-finetuning

Forked from resemble-ai/chatterbox

SoTA open-source TTS

Python 99 14 Updated Jun 7, 2025

resemble-ai / chatterbox

SoTA open-source TTS

Python 13,928 1,820 Updated Sep 25, 2025

tuanh123789 / Spark-TTS-finetune

finetune llm part for spark-tts model

Python 111 18 Updated Mar 25, 2025

MYZY-AI / Muyan-TTS

Python 465 42 Updated May 19, 2025

stlohrey / dia-finetuning

Forked from nari-labs/dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 126 20 Updated Jul 25, 2025

fagenorn / handcrafted-persona-engine

An AI-powered interactive avatar engine using Live2D, LLM, ASR, TTS, and RVC. Ideal for VTubing, streaming, and virtual assistant applications.

C# 916 98 Updated Apr 23, 2025

nari-labs / dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,595 1,602 Updated Jul 6, 2025

zhenye234 / X-Codec-2.0

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 314 45 Updated Jul 21, 2025

Stability-AI / stable-codec

A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.

Python 397 28 Updated Sep 15, 2025

cantabile-kwok / vec2wav2.0

Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995

Python 79 7 Updated Dec 3, 2024

WangHelin1997 / SSR-Speech

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis

Python 140 16 Updated Jan 1, 2025

idiap / coqui-ai-TTS

Forked from coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 1,913 237 Updated Oct 16, 2025

Frederico S. Oliveira freds0

Highlights

Lists (12)

Avatar

datasets

LipSync

Pitch-Extractor

Singing-Voice-Conversion

Speech-Enhancement

Speech-Metrics

speech-to-speech-translation

Speech-to-Text

Text-to-Speech

video-super-resolution

Voice-Conversion

Stars