NWPU -> NKU
Tianjin, China
https://jbwang1997.github.io/
Stars
Official code of "MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction"
[ICCV 2025] The official PyTorch code for the paper: "DiT4SR: Taming Diffusion Transformer for Real-World Image Super-Resolution"
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
Open-source simulator for autonomous driving research.
Official code for the paper: Depth Anything At Any Condition
[ICCV 2023] MatrixCity: A Large-scale City Dataset for City-scale Neural Rendering and Beyond.
[CVPR 2025 Oral & Award Candidate] Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models
[ICCV 2025] DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation
The official code for the paper: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs
Dingo: A Comprehensive AI Data Quality Evaluation Tool
Adding Scene-Centric Forecasting Control to Occupancy World Model
SUPIR aims to develop practical algorithms for photo-realistic image restoration in the wild. Our new online demo is also released at suppixel.ai.
[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model
Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
[ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models
The official repo of Qwen-VL (通义千问-VL), the chat & pretrained large vision-language model proposed by Alibaba Cloud.
[ICCV 2025] Nexus: Decoupled Diffusion Sparks Adaptive Scene Generation
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
A Paper List for Humanoid Robot Learning.
Enhancing Representations through Heterogeneous Self-Supervised Learning (TPAMI 2025)
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Wan: Open and Advanced Large-Scale Video Generative Models
A SOTA open-source image editing model, which aims to provide performance comparable to closed-source models such as GPT-4o and Gemini 2 Flash.