- The University of Hong Kong
- Hong Kong
- UTC +08:00
- https://happinesslz.github.io
Starred repositories
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
Devkit and documentation for the NVIDIA Physical AI Autonomous Vehicles Dataset
Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"
CoRL2025 UniFP: Learning a Unified Policy for Position and Force Control in Legged Loco-Manipulation
Web-based 3D visualization + Python
OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer
The official implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"
Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.
Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
Native Multimodal Models are World Learners
[NeurIPS'25] Official repository of Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations
Official implementation of "From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction"
ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving
OmniNWM: Omniscient Navigation World Models for Autonomous Driving
Code implementation of the paper "World-in-World: World Models in a Closed-Loop World"
Automatically crawls arXiv papers daily, summarizes them using AI, and presents them on GitHub Pages.
Dexbotic: Open-Source Vision-Language-Action Toolbox