[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,434 537 Updated May 18, 2025

ChaofanTao / Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey of autoregressive models in vision.

723 18 Updated Oct 15, 2025

AMAP-ML / NarrLV

NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation Models

Python 110 Updated Jul 28, 2025

AMAP-ML / FluxText

Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing"

Python 402 27 Updated Oct 10, 2025

hustvl / ControlAR

[ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models

Python 299 8 Updated Apr 24, 2025

Davinci-XLab / STAR-T2I

Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"

Python 38 1 Updated Mar 11, 2025

JitengMu / EditAR

EditAR: Unified Conditional Generation with Autoregressive Models (CVPR 2025)

Python 32 2 Updated Jun 13, 2025

VARGPT-family / VARGPT-v1.1

VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning

Python 267 15 Updated Apr 15, 2025

FoundationVision / BitVAE

Official training and inference code for the bitwise tokenizer

Python 46 2 Updated May 18, 2025

AMAP-ML / UniVG-R1

UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning

Python 147 6 Updated Jun 2, 2025

Ascend-Research / Turtle

[ NeurIPS 2024 ] The official PyTorch implementation for Learning Truncated Causal History Model for Video Restoration.

Python 107 8 Updated May 27, 2025

EdisonLeeeee / Awesome-Masked-Autoencoders

A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He et al.).

853 54 Updated Jul 10, 2024

lxa9867 / ControlVAR

This is the official implementation for ControlVAR.

Python 122 7 Updated Dec 10, 2024

lxa9867 / ImageFolder

High-performance Image Tokenizers for VAR and AR

Python 291 5 Updated Apr 25, 2025

JiuhaiChen / BLIP3o

Official implementation of BLIP3o-Series

Python 1,504 65 Updated Oct 3, 2025

lxa9867 / Awesome-Autoregressive-Visual-Generation

This is a repo to track the latest autoregressive visual generation papers.

405 5 Updated Jun 25, 2025

LTH14 / fractalgen

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,176 63 Updated Feb 25, 2025

facefusion / facefusion

Industry leading face manipulation platform

Python 25,468 4,035 Updated Oct 15, 2025

FoundationVision / Infinity

[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,464 76 Updated Jun 24, 2025

deepinsight / insightface

State-of-the-art 2D and 3D Face Analysis Project

Python 26,758 5,784 Updated Sep 27, 2025

ziqipang / RandAR

[CVPR 2025 (Oral)] Open implementation of "RandAR"

Python 198 6 Updated Jul 14, 2025

XduSyL / EventGPT

🔥[CVPR2025] EventGPT: Event Stream Understanding with Multimodal Large Language Models

Python 75 7 Updated Jul 26, 2025

lixiaowen-xw / DiffuEraser

DiffuEraser is a diffusion model for video inpainting that achieves strong content completeness and temporal consistency while maintaining acceptable efficiency.

Python 540 47 Updated Apr 17, 2025

ZeroRF

Stars

VectorSpaceLab / OmniGen

modelscope / DiffSynth-Engine

yandex-research / switti

bytedance / DreamO

facebookresearch / dinov3

jdyjjj / All-in-One-Gait

baiyanlali / CS309-OOAD-Nikomon-Unity

FoundationVision / VAR