Stars
[ECCV 2024] Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
Autoware - the world's leading open-source software project for autonomous driving
[CoRL '25] Pseudo-Simulation for Autonomous Driving; [NeurIPS '24] NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking
Eclipse SUMO is an open source, highly portable, microscopic and continuous traffic simulation package designed to handle large networks. It allows for intermodal simulation including pedestrians a…
OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
Fast-BEV: A Fast and Strong Bird’s-Eye View Perception Baseline
The Kalibr visual-inertial calibration toolbox
Open-source Windows and Office activator featuring HWID, Ohook, TSforge, KMS38, and Online KMS activation methods, along with advanced troubleshooting.
Effortless data labeling with AI support from Segment Anything and other awesome models.
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything
⚡ InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
iBVP Dataset: RGB-Thermal rPPG Dataset with High Resolution Signal Quality Labels [Electronics 2024]
A tool to discover hidden variation in video.
Code for some of my co-authored journal/conference papers
Official PyTorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)
[CVPR 2025] Multiple Object Tracking as ID Prediction
[ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking
[BMVC2023] Widely Applicable Strong Baseline for Sports Ball Detection and Tracking
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
Official code for PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking (ICCV 2023)
A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.
CoTracker is a model for tracking any point (pixel) on a video.
2024 up-to-date list of DATASETS, CODEBASES and PAPERS on Multi-Task Learning (MTL), from a Machine Learning perspective.
Object Detection Metrics. 14 object detection metrics: mean Average Precision (mAP), Average Recall (AR), Spatio-Temporal Tube Average Precision (STT-AP). This project supports different bounding b…
A collection of papers on transformers for vision. Awesome Transformer with Computer Vision (CV)