Stars
cshaa / filtrex
Forked from joewalnes/filtrex. A simple, safe, JavaScript Filter Expression compiler for end-users
Parse and evaluate MS Excel formulas in JavaScript.
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
A production ready, scalable starter template for Vite + React
Ubuntu for Rockchip RK35XX Devices
[ICLR 2024] Official implementation of "🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
Official PyTorch Implementation of CleanUNet (ICASSP 2022)
Easy-to-use stem (e.g. instrumental/vocals) separation from the CLI or as a Python package, using a variety of amazing pre-trained models (primarily from UVR)
ML-powered speech recognition directly in your browser
Productive, portable, and performant GPU programming in Python.
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
Noise suppression using deep filtering
An Open Source text-to-speech system built by inverting Whisper.
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in PyTorch
Scenic: A JAX Library for Computer Vision Research and Beyond
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
so-vits-svc fork with realtime support, improved interface and more features.
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
Predicts the level of noise and reverberation in your audio files
An implementation of Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning