+
Skip to content
/ MVAR Public

Offical implementation of "Auto-Regressively Generating Multi-View Consistent Images". (ICCV 2025)

Notifications You must be signed in to change notification settings

MILab-PKU/MVAR

Repository files navigation

Auto-Regressively Generating Multi-View Consistent Images

arXiv 

JiaKui Hu*, Yuxiao Yang*, Jialun Liu, Jinbo Wu, Chen Zhao, Yanye Lu
PKU, BaiduVis, THU

blog (Chinese)

🚀️🚀️ News:

  • 2025-06-26: MV-AR is accepted by ICCV 2025 !!!

Introduction

overview

Diffusion-based multi-view image generation methods use a specific reference view for predicting subsequent views, which becomes problematic when overlap between the reference view and the predicted view is minimal, affecting image quality and multi-view consistency. Our MV-AR addresses this by using the preceding view with significant overlap for conditioning.

Updates

  • Rendered GSO test set.
  • Sampling codes for text-to-multi-view and image-to-multi-view.

TODO-lists:

  • Sampling codes for text-image-to-multi-view and text-shape-to-multi-view.
  • Training codes.

Results

Text to Multiview images

t2mv

Image to Multiview images

i2mv

Text + Geometric to Multiview images

ts2mv

Quick Start

Requirements

CUDA 12.4, Pytorch >= 2.4.0

pip install -r requirements.txt

Reproduce

  1. Please download flan-t5-xl in ./pretrained_models;
  2. Please download Cap3D_automated_Objaverse_full.csv in dataset/captions;
  3. Please download models from here, put them in ./pretrained_models;
  4. Run:
# For t2mv on objaverse
sh sample_tcam2i.sh
# For t2mv on GSO
sh sample_icam2i_gso.sh
# For i2mv on GSO
sh sample_icam2i_gso.sh

The generated images will be saved to samples_objaverse_nv_ray/.

Train

Coming soon.

Acknowledgement

This repository is heavily based on LlamaGen. We would like to thank the authors of these work for publicly releasing their code.

For help or issues using this git, please feel free to submit a GitHub issue.

For other communications related to this git, please contact jkhu29@stu.pku.edu.cn.

Citation

@inproceedings{hu2025mvar,
  title={Auto-Regressively Generating Multi-View Consistent Images},
  author={Hu, JiaKui and Yang, Yuxiao and Liu, Jialun and Wu, Jinbo and Zhao, Chen and Lu, Yanye},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  year={2025}
}

About

Offical implementation of "Auto-Regressively Generating Multi-View Consistent Images". (ICCV 2025)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载