Stars
[ICLR 2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Qwen-Image-Lightning: Speed up the Qwen-Image model with distillation
An open source implementation of CLIP.
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image (see the usage sketch after this list)
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW 2025 Resource)
Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]
A state-of-the-art open-source image editing model that aims to provide performance comparable to closed-source models like GPT-4o and Gemini 2 Flash.
(CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Qwen2.5-Omni is an end-to-end multimodal model from the Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, and video, and of generating speech in real time.
Qwen3-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.
ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025]
Fine-Tuning Dataset Auto-Generation for Graph Query Languages.
Chat2Graph: Graph Native Agentic System.
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…
[ICCV 2025] DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models (official implementation)
GPT-ImgEval: Evaluating GPT-4o’s state-of-the-art image generation capabilities
Official implementation of the paper "AnyDoor: Zero-Shot Object-Level Image Customization"
[CVPR 2025 Highlight] Official implementation of "MangaNinja: Line Art Colorization with Precise Reference Following"
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model
The image prompt adapter (IP-Adapter) is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt (see the sketch after this list).
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
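For the two CLIP entries above, here is a minimal zero-shot image-text matching sketch against the openai/CLIP Python API; the image path and the candidate captions are placeholder assumptions:

```python
import torch
import clip
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

# Placeholder inputs: one image and two candidate captions.
image = preprocess(Image.open("cat.jpg")).unsqueeze(0).to(device)
text = clip.tokenize(["a photo of a cat", "a photo of a dog"]).to(device)

with torch.no_grad():
    # Similarity logits between the image and each caption.
    logits_per_image, logits_per_text = model(image, text)
    probs = logits_per_image.softmax(dim=-1)

print(probs)  # the higher-probability caption is the more relevant snippet
```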
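For the IP-Adapter entry above, a minimal sketch of attaching an image prompt adapter to a pretrained Stable Diffusion pipeline, assuming a recent diffusers release with built-in IP-Adapter loading; the model IDs, reference image, and prompt are placeholders:

```python
import torch
from diffusers import StableDiffusionPipeline
from diffusers.utils import load_image

# Load a pretrained text-to-image pipeline, then attach IP-Adapter weights
# so a reference image can steer generation alongside the text prompt.
pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")
pipe.load_ip_adapter(
    "h94/IP-Adapter", subfolder="models", weight_name="ip-adapter_sd15.bin"
)
pipe.set_ip_adapter_scale(0.6)  # trade-off between image prompt and text prompt

ip_image = load_image("style_reference.png")  # placeholder reference image
result = pipe(
    prompt="a dog sitting on the beach, best quality",
    ip_adapter_image=ip_image,
    num_inference_steps=50,
).images[0]
result.save("output.png")
```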