Stars
Awesome Unified Multimodal Models
[🚀ICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing!
Ming - facilitating advanced multimodal understanding and generation capabilities built upon the Ling LLM.
Qwen2.5-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.
A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-of-the-art methods, innovative applications, and key advanceme…
[Support 0.49.x](Reset Cursor AI MachineID & Bypass Higher Token Limit) Cursor AI: automatically reset the machine ID and upgrade to Pro features for free: You've reached your trial request limit. / Too many free trial accounts used on this machi…