+
Skip to content
View whwu95's full-sized avatar
♥️
I may be slow to respond.
♥️
I may be slow to respond.

Highlights

  • Pro

Block or report whwu95

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,371 2,030 Updated Jul 17, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 3,739 281 Updated Oct 11, 2025

A Scientific Multimodal Foundation Model

582 26 Updated Sep 30, 2025

[CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

Python 119 4 Updated Jul 28, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,305 278 Updated Oct 4, 2025

Awesome Reasoning in MLLMs: Papers and Projects about learning to reason with MLLMs, including Chain-of-Thought (CoT), OpenAl o1, and DeepSeek-R1

57 3 Updated Mar 18, 2025
TeX 90 40 Updated Jan 29, 2025

[NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS

Python 1,216 110 Updated Sep 19, 2025

Efficient Multimodal Large Language Models: A Survey

373 21 Updated Apr 29, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 13,981 1,066 Updated Oct 11, 2025

A series of math-specific large language models of our Qwen2 series.

Python 1,015 142 Updated Jan 11, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,152 611 Updated Oct 11, 2025

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,894 144 Updated Apr 21, 2025

Retrieval-Augmented Generation in 3 Lines of Code!

Python 48 6 Updated Feb 3, 2025

AudioBench: A Universal Benchmark for Audio Large Language Models

Python 264 11 Updated Jun 17, 2025

【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"

Python 19 1 Updated Sep 26, 2024

【NeurIPS 2024】Dense Connector for MLLMs

Python 177 8 Updated Oct 14, 2024

FreeVA: Offline MLLM as Training-Free Video Assistant

Python 63 1 Updated Jun 9, 2024

AcadHomepage: A Modern and Responsive Academic Personal Homepage

SCSS 2,305 4,694 Updated Oct 7, 2025

Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data

416 31 Updated Dec 22, 2024

GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?

Python 185 18 Updated May 22, 2024

【ICCV'2023】What Can Simple Arithmetic Operations Do for Temporal Modeling?

Python 74 6 Updated Jan 26, 2024

Demonstrate all the questions on LeetCode in the form of animation.(用动画的形式呈现解LeetCode题目的思路)

Java 76,476 14,018 Updated Aug 14, 2023

Enjoy https://shields.io

Go 452 250 Updated Oct 1, 2025
JavaScript 3,601 1,528 Updated Jun 21, 2024

[ICCV 2023] Official Implementation of "Generalized Lightness Adaptation with Channel Selective Normalization"

Python 84 6 Updated Jan 22, 2024

A curated list of papers and open-source resources focused on 3D AIGC.

331 17 Updated Sep 1, 2024

Badges for your personal developer branding, profile, and projects.

SCSS 15,643 1,759 Updated Jun 11, 2025

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 3,369 242 Updated Dec 3, 2024
Python 40 3 Updated Apr 7, 2024
Next
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载