+
Skip to content
View LIU423's full-sized avatar

Highlights

  • Pro

Block or report LIU423

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
TeX 620 102 Updated Sep 17, 2025

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Python 3,173 177 Updated Oct 7, 2025

Qwen-Image-Lightning: Speed up Qwen-Image model with distillation

Python 790 32 Updated Oct 10, 2025

Digital Mind Extension

JavaScript 6,505 1,005 Updated Jul 20, 2025

An open source implementation of CLIP.

Python 12,731 1,171 Updated Sep 21, 2025

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 31,007 3,786 Updated Jul 23, 2024

ATP Tennis Rankings, Results, and Stats

1,343 669 Updated Dec 30, 2024

Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.

C++ 18,016 722 Updated Jun 30, 2025
Python 20 4 Updated Sep 12, 2025
Python 4,296 409 Updated Sep 14, 2025

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

Python 3,039 260 Updated Sep 25, 2025

Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]

Python 22 1 Updated Aug 13, 2024

A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

Python 1,668 77 Updated Sep 8, 2025

(CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis

Jupyter Notebook 195 16 Updated Jul 13, 2025

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,847 90 Updated Oct 31, 2024
Python 386 38 Updated Jan 19, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,698 291 Updated Jun 12, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 13,968 1,064 Updated Oct 11, 2025

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025]

Python 233 26 Updated Jul 8, 2025

Fine-Tuning Dataset Auto-Generation for Graph Query Languages.

Python 78 15 Updated Aug 25, 2025

Chat2Graph: Graph Native Agentic System.

Python 355 42 Updated Oct 10, 2025

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…

90,777 24,665 Updated Oct 2, 2025

[ICCV 2025] DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models (official implement)

Jupyter Notebook 139 11 Updated May 21, 2025

GPT-ImgEval: Evaluating GPT-4o’s state-of-the-art image generation capabilities

Python 300 6 Updated May 3, 2025

Official implementations for paper: Anydoor: zero-shot object-level image customization

Python 4,185 372 Updated Apr 8, 2024

unet for image segmentation

Jupyter Notebook 4,820 2,011 Updated Apr 10, 2024

[CVPR 2025 Highlight] Official implementation of "MangaNinja: Line Art Colorization with Precise Reference Following"

Python 665 52 Updated Mar 2, 2025

VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model

Python 345 15 Updated Apr 17, 2025

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 6,255 400 Updated Jun 28, 2024

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 5,517 717 Updated Aug 5, 2024
Next
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载