+
Skip to content
View thuwzy's full-sized avatar
🎃
Focusing
🎃
Focusing

Block or report thuwzy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Qwen-Image-Lightning: Speed up Qwen-Image model with distillation

Python 848 33 Updated Oct 14, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,034 36 Updated Oct 4, 2025

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Python 1,131 35 Updated Oct 11, 2025

Official code of RDT 2

Python 522 19 Updated Oct 11, 2025

Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets

Python 418 26 Updated Oct 17, 2025

ViPE: Video Pose Engine for Geometric 3D Perception

Python 1,422 112 Updated Oct 13, 2025

A curated collection of fun and creative examples generated with Nano Banana🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the community's development…

14,754 1,542 Updated Sep 24, 2025

Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.

Python 1,274 112 Updated Oct 17, 2025

4DNeX: Feed-Forward 4D Generative Modeling Made Easy

Python 780 7 Updated Oct 2, 2025

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Python 605 68 Updated Oct 16, 2025

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python 1,678 176 Updated Oct 4, 2025

Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.

Python 534 39 Updated Sep 21, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 5,715 308 Updated Sep 30, 2025

Code for "Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers" (NeurIPS 2024)

Python 195 10 Updated Mar 30, 2025

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

Python 2,286 186 Updated Oct 17, 2025

PhysX: Physical-Grounded 3D Asset Generation (NeurIPS 2025, Spotlight)

Jupyter Notebook 285 13 Updated Sep 19, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 11,303 1,147 Updated Oct 11, 2025

Towards a Generative 3D World Engine for Embodied Intelligence

Python 317 18 Updated Oct 2, 2025

Mesh Silksong: Auto-Regressive Mesh Generation as Weaving Silk

Python 88 3 Updated Jul 11, 2025

Code implementation for: From Virtual Games to Real-World Play

38 1 Updated Jun 23, 2025

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

1,996 120 Updated Sep 30, 2025

LATTICE: Scalable High-Fidelity 3D Generation via ?

122 Updated Oct 10, 2025

Efficient Part-level 3D Object Generation via Dual Volume Packing

Python 756 62 Updated Jun 26, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,698 193 Updated Sep 12, 2025

[NeurIPS 2025 Spotlight] A Native Multimodal LLM for 3D Generation and Understanding

Python 488 26 Updated Sep 25, 2025

Open-source unified multimodal model

Python 5,183 446 Updated Aug 22, 2025

Ongoing research training transformer models at scale

Python 13,878 3,162 Updated Oct 19, 2025

Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

Python 789 49 Updated Sep 8, 2025

Official repository for BrickGPT, the first approach for generating physically stable toy brick models from text prompts.

Python 1,339 81 Updated Sep 19, 2025
Next
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载