- Sun Yat-sen University
- (UTC +08:00)
- https://scholar.google.com/citations?hl=zh-CN&user=T6LJd-8AAAAJ
Starred repositories
[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Unofficial PyTorch implementation of the paper "Mean Flows for One-step Generative Modeling" by Geng et al.
[CVPR 2025] Blurry-Edges: Photon-Limited Depth Estimation from Defocused Boundaries
Official PyTorch implementation of "GaussianLSS - Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting" (CVPR 2025).
[CVPR 2025] DarkIR: Robust Low-Light Image Restoration - State-of-the-art low-light deblurring. NTIRE 2025 Best Method. [Official PyTorch Implementation]
Shortcut flow matching PyTorch implementation
[CVPR 2024] SinSR: Diffusion-Based Image Super-Resolution in a Single Step
Code for AAAI 2024 paper: "DiffusionEdge: Diffusion Probabilistic Model for Crisp Edge Detection"
⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)
Official repo of the Griffon series, including v1 (ECCV 2024), v2 (ICCV 2025), G, and R, as well as the RL tool Vision-R1.
Frequency Autoregressive Image Generation with Continuous Tokens
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
[NeurIPS 2025] ⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.
[ICLR'25] City-scale 3D Visual Grounding with Multi-modality LLMs
[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding
[CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
[Lumina Embodied AI] Embodied AI Technical Guide (Embodied-AI-Guide)
[CVPR 2025 Highlight] Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving
(ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life
Official Implementation of STG-Mamba: Spatial-Temporal Graph Learning via Selective State Space Model.
[ICCV2023] DETRDistill: A Universal Knowledge Distillation Framework for DETR-families
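A PyTorch implementation of the vector quantized variational autoencoder (https://arxiv.org/abs/1711.00937)
This project shares course materials, notes, final exam papers, and other useful related resources from the undergraduate and graduate programs of the School of Computer Science at Sun Yat-sen University. We hope it helps with your studies ❤️. If you like it, remember to give it a star 🌟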