- Zhejiang University
- Hangzhou, China
- mingweili@zju.edu.cn
Starred repositories
[SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
[RAL 2022 & ICRA 2023] TransCG: A Large-Scale Real-World Dataset for Transparent Object Depth Completion and A Grasping Baseline
Official implementation of "VeGaS: Video Gaussian Splatting"
[NeurIPS 2024] Official code for "Splatter a Video: Video Gaussian Representation for Versatile Processing"
Official implementation of the paper "GenCompositor: Generative Video Compositing with Diffusion Transformer"
The official repository of SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization
[NeurIPS 2025] Jasmine: Harnessing Diffusion Prior for Self-Supervised Depth Estimation
[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
[NeurIPS 2025] Pixel-Perfect Depth
Seeing Glass: Joint Point-Cloud and Depth Completion for Transparent Objects. CoRL 2021
(ICLR25 Oral) Do as We Do, Not as You Think: the Conformity of Large Language Models
[NeurIPS 2025 Spotlight] Official implementation of SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment
StreamingVLM: Real-Time Understanding for Infinite Video Streams
PhysX: Physical-Grounded 3D Asset Generation (NeurIPS 2025, Spotlight)
Enjoy the magic of Diffusion models!
Wan: Open and Advanced Large-Scale Video Generative Models
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
Fast and memory-efficient exact attention
High-Quality Text-to-Video Generation with Alpha Channel
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
A library for efficient similarity search and clustering of dense vectors.
This project aims to enhance the working environment on Windows
Unified message conversion system supporting ROS2, Pydantic, Dataclass, JSON, YAML, Dict, and MCP schema inter-conversion
Code for FastVGGT: Training-Free Acceleration of Visual Geometry Transformer
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer