whwu95

♥️

I may be slow to respond.

Wenhao Wu whwu95

♥️

I may be slow to respond.

Scientist @ Amazon | Ex-Ph.D. @ USYD

146 followers · 29 following

Amazon AGI
Bellevue, WA, US
07:52 (UTC -07:00)
whwu95.github.io
@dr_wenhao
in/wenhao-w-usyd

Achievements

Highlights

Stars

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,371 2,030 Updated Jul 17, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 3,739 281 Updated Oct 11, 2025

InternLM / Intern-S1

A Scientific Multimodal Foundation Model

582 26 Updated Sep 30, 2025

hshjerry / VideoEspresso

[CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

Python 119 4 Updated Jul 28, 2025

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,305 278 Updated Oct 4, 2025

HJYao00 / Awesome-Reasoning-MLLM

Awesome Reasoning in MLLMs: Papers and Projects about learning to reason with MLLMs, including Chain-of-Thought (CoT), OpenAl o1, and DeepSeek-R1

57 3 Updated Mar 18, 2025

alexeyinkin / eb-1a

TeX 90 40 Updated Jan 29, 2025

HJYao00 / Mulberry

[NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS

Python 1,216 110 Updated Sep 19, 2025

swordlidev / Efficient-Multimodal-LLMs-Survey

Efficient Multimodal Large Language Models: A Survey

373 21 Updated Apr 29, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 13,981 1,066 Updated Oct 11, 2025

QwenLM / Qwen2.5-Math

A series of math-specific large language models of our Qwen2 series.

Python 1,015 142 Updated Jan 11, 2025

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,152 611 Updated Oct 11, 2025

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,894 144 Updated Apr 21, 2025

autogluon / autogluon-rag

Retrieval-Augmented Generation in 3 Lines of Code!

Python 48 6 Updated Feb 3, 2025

AudioLLMs / AudioBench

AudioBench: A Universal Benchmark for Audio Large Language Models

Python 264 11 Updated Jun 17, 2025

takomc / amp

【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"

Python 19 1 Updated Sep 26, 2024

HJYao00 / DenseConnector

【NeurIPS 2024】Dense Connector for MLLMs

Python 177 8 Updated Oct 14, 2024

whwu95 / FreeVA

FreeVA: Offline MLLM as Training-Free Video Assistant

Python 63 1 Updated Jun 9, 2024

RayeRen / acad-homepage.github.io

AcadHomepage: A Modern and Responsive Academic Personal Homepage

SCSS 2,305 4,694 Updated Oct 7, 2025

johnnyhwu / Awesome-LLM-Tabular

Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data

416 31 Updated Dec 22, 2024

whwu95 / GPT4Vis

GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?

Python 185 18 Updated May 22, 2024

whwu95 / ATM

【ICCV'2023】What Can Simple Arithmetic Operations Do for Temporal Modeling?

Python 74 6 Updated Jan 26, 2024

MisterBooo / LeetCodeAnimation

Demonstrate all the questions on LeetCode in the form of animation.（用动画的形式呈现解LeetCode题目的思路）

Java 76,476 14,018 Updated Aug 14, 2023

progfay / shields-with-icon

Enjoy https://shields.io

Go 452 250 Updated Oct 1, 2025

nerfies / nerfies.github.io

JavaScript 3,601 1,528 Updated Jun 21, 2024

mdyao / CSNorm

[ICCV 2023] Official Implementation of "Generalized Lightness Adaptation with Channel Selective Normalization"

Python 84 6 Updated Jan 22, 2024

mdyao / Awesome-3D-AIGC

A curated list of papers and open-source resources focused on 3D AIGC.

331 17 Updated Sep 1, 2024

Ileriayo / markdown-badges

Badges for your personal developer branding, profile, and projects.

SCSS 15,643 1,759 Updated Jun 11, 2025

PKU-YuanGroup / Video-LLaVA

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 3,369 242 Updated Dec 3, 2024

HJYao00 / Side4Video

Python 40 3 Updated Apr 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Wenhao Wu whwu95

Achievements

Achievements

Highlights

Block or report whwu95

Stars

Wan-Video / Wan2.1

hiyouga / EasyR1

InternLM / Intern-S1

hshjerry / VideoEspresso

PeterGriffinJin / Search-R1

HJYao00 / Awesome-Reasoning-MLLM

alexeyinkin / eb-1a

HJYao00 / Mulberry

swordlidev / Efficient-Multimodal-LLMs-Survey

QwenLM / Qwen3-VL

QwenLM / Qwen2.5-Math

InternLM / lmdeploy

QwenLM / Qwen2-Audio

autogluon / autogluon-rag

AudioLLMs / AudioBench

takomc / amp

HJYao00 / DenseConnector

whwu95 / FreeVA

RayeRen / acad-homepage.github.io

johnnyhwu / Awesome-LLM-Tabular

whwu95 / GPT4Vis

whwu95 / ATM

MisterBooo / LeetCodeAnimation

progfay / shields-with-icon

nerfies / nerfies.github.io

mdyao / CSNorm

mdyao / Awesome-3D-AIGC

Ileriayo / markdown-badges

PKU-YuanGroup / Video-LLaVA

HJYao00 / Side4Video