University of California, Santa Barbara
Stars
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
[NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models
[EMNLP 2025] Official code for the paper "SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning"
Official code for NeurIPS 2025 paper "GRIT: Teaching MLLMs to Think with Images"
Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"
Agent S: an open agentic framework that uses computers like a human
Universal memory layer for AI agents; announcing OpenMemory MCP for local, secure memory management.
[ICLR 2025] EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing
[ACL 2025 Findings] "Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models"
Official repo for the paper "Mojito: Motion Trajectory and Intensity Control for Video Generation"
Qwen3-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.
Large Concept Models: Language modeling in a sentence representation space
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
A simple screen parsing tool towards pure vision based GUI agent
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by the OpenAI Solution team.
[ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"
[ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
This is the implementation of the ACL 2024 Findings paper "ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models"
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"
Official repo of the ICLR 2025 paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"
[ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"
Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time.
Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"