Stars
哔哩下载姬downkyi,哔哩哔哩网站视频下载工具,支持批量下载,支持8K、HDR、杜比视界,提供工具箱(音视频提取、去水印等)。
中文法律LLaMA (LLaMA for Chinese legel domain)
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Code and documentation to train Stanford's Alpaca models, and generate the data.
LLaMA: Open and Efficient Foundation Language Models
Code for the paper "Jukebox: A Generative Model for Music"
🦜🔗 Build context-aware reasoning applications
LangChain 的中文入门教程
so-vits-svc fork with realtime support, improved interface and more features.
GUI for a Vocal Remover that uses Deep Neural Networks.
SoftVC VITS Singing Voice Conversion
Use Genetic Algorithm and Simulate Anneal for feature selection. 用遗传算法/模拟退火算法进行特征选择.
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
🛠「Watt Toolkit」是一个开源跨平台的多功能 Steam 工具箱。
中文语音识别; Mandarin Automatic Speech Recognition;
Production First and Production Ready End-to-End Speech Recognition Toolkit
A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis
A tool that can extract and pack krkr2 and krkrz's xp3 files
[Unofficial] qBittorrent Enhanced, based on qBittorrent
A C library for reading and writing sound files containing sampled audio data.
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding a…
kaldi-asr/kaldi is the official location of the Kaldi project.