Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 2,554 133 Updated Oct 9, 2025

JusperLee / Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

TypeScript 842 139 Updated Aug 11, 2025

openai / baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,470 4,941 Updated Aug 1, 2024

NVIDIA-NeMo / RL

Scalable toolkit for efficient model reinforcement

Python 921 152 Updated Oct 10, 2025

InternLM / SIM-CoT

An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"

Python 87 2 Updated Sep 28, 2025

datawhalechina / daily-interview

Datawhale成员整理的面经，内容包括机器学习，CV，NLP，推荐，开发等，欢迎大家star

3,218 478 Updated Aug 27, 2025

AaronZ345 / StyleSinger

PyTorch Implementation of StyleSinger(AAAI 2024): Style Transfer for Out-of-Domain Singing Voice Synthesis

Python 409 26 Updated Aug 15, 2025

xiquan-li / MeanAudio

MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows

Python 94 7 Updated Sep 2, 2025

xiaomi-research / dasheng-lm

Efficient audio understanding with general audio captions

Jupyter Notebook 365 37 Updated Oct 9, 2025

inworld-ai / tts

Inworld TTS

Python 505 43 Updated Sep 19, 2025

stepfun-ai / StepMesh

C++ 303 26 Updated Oct 1, 2025

AI-S2-Lab / GPT-Talker

[ACMMM'2024] Generative Expressive Conversational Speech Synthesis

39 2 Updated Oct 28, 2024

facebookresearch / blt

Code for BLT research paper

Python 1,989 178 Updated May 22, 2025

stepfun-ai / Step3

428 9 Updated Aug 10, 2025

Diffusion-CoT / ReflectionFlow

[ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning

Python 194 12 Updated Jun 26, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 9,891 1,000 Updated Sep 19, 2025

DonArtkins / MetaGPT

Forked from FoundationAgents/MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 2 Updated Jun 30, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

forwiat

Achievements

Achievements

Block or report forwiat

Starred repositories

XiaomiMiMo / MiMo-Audio

FireRedTeam / FireRedTTS2

kwsong0113 / diffusion-forcing-transformer

Done-0 / fuck-u-code

OpenBMB / VoxCPM

tile-ai / tilelang

youngsheen / GPST

Alibaba-NLP / DeepResearch

SpenserCai / ComfyUI-FunAudioLLM

facebookresearch / DiT

QwenLM / Qwen3-Omni