+
Skip to content
View jbwang1997's full-sized avatar

Block or report jbwang1997

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
61 Updated Oct 15, 2025

The best ChatGPT that $100 can buy.

Python 24,163 2,376 Updated Oct 16, 2025

[NeurIPS 2025] Official implementation for "Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling"

44 7 Updated Oct 14, 2025

[ICCV 2025] SuperDec: 3D Scene Decomposition with 
Superquadric Primitives.

Python 118 5 Updated Oct 14, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,105 23 Updated Oct 15, 2025

Official Implementation of DA^2: Depth Anything in Any Direction

Python 168 14 Updated Oct 12, 2025

Infinite-Forcing: Towards Infinite-Long Video Generation

Python 74 2 Updated Oct 4, 2025

[NeurIPS 2025 (Spotlight)] The implementation for the paper "4DGT Learning a 4D Gaussian Transformer Using Real-World Monocular Videos"

Python 283 4 Updated Sep 19, 2025

[NeurIPS'25 Spotlight] GeoSVR: Taming Sparse Voxels for Geometrically Accurate Surface Reconstruction

Python 115 7 Updated Sep 28, 2025

[NeurIPS 2025] RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning

Python 72 1 Updated Oct 6, 2025

A minimal implementation of DeepMind's Genie world model

Python 979 66 Updated Sep 28, 2025

[CVPR 2024 Highlight] Visual Point Cloud Forecasting

Python 336 23 Updated Jul 2, 2025

Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)

Python 638 22 Updated Sep 24, 2025

Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion (ICCV 2025)

Python 65 5 Updated Sep 18, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 14,947 1,150 Updated Oct 15, 2025

[NeurIPS 2025]"Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"

Python 73 4 Updated Sep 26, 2025

Offical implementation of "Visual Instruction Pretraining for Domain-Specific Foundation Models"

Python 73 1 Updated Oct 13, 2025

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 334 17 Updated Aug 26, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,128 1,210 Updated Oct 11, 2025

[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 1,831 111 Updated Oct 16, 2025

OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling

Python 376 6 Updated Oct 15, 2025

Official implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Python 742 45 Updated Apr 3, 2025

Code for FastVGGT: Training-Free Acceleration of Visual Geometry Transformer

Python 542 23 Updated Oct 14, 2025

[ICLR'23 Spotlight & ECCV'24 & IJCV'24] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction

Python 1,392 219 Updated Mar 3, 2025

[ICLR2025] A PyTorch implementation for STORM: Spatiotemporal Reconstruction Model for Large-Scale Outdoor Scenes

Python 255 8 Updated May 22, 2025

Scalable and Generalizable Autonomous Driving Scene Synthesis

Python 44 5 Updated Oct 2, 2025

The offical repo for paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)

Python 85 Updated Aug 12, 2025

MiroThinker is open-source agentic models trained for deep research and complex tool use scenarios.

Python 456 40 Updated Oct 15, 2025
Next
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载