XuecWu

🎯

Focusing

Conna XuecWu

🎯

Focusing

Multimodal Deep Learning & Cross-Media Perception Computing.

35 followers · 110 following

Xi'an Jiaotong University
Xi'an, China
05:14 (UTC +08:00)
@XuecWu

Achievements

Lists (1)

Sort

🚀 My stack

Stars

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 3,179 110 Updated Oct 20, 2025

shinshin86 / oh-my-logo

Display giant ASCII-art logos with colorful gradients in your terminal — like Claude Code or Gemini CLI.

TypeScript 1,178 50 Updated Oct 5, 2025

EzioBy / Ditto

[Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Python 109 6 Updated Oct 20, 2025

thuml / MiniVeo3-Reasoner

Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.

Python 154 2 Updated Oct 12, 2025

hzlsaber / So-Fake

The offical repository of "So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection"

16 1 Updated Oct 4, 2025

OneIG-Bench / OneIG-Benchmark

[NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models across multiple dimensions, including subject-element alignment,…

Python 76 3 Updated Oct 2, 2025

EvolvingLMMs-Lab / NEO

NEO Series: Native Vision-Language Models from First Principles

Python 160 7 Updated Oct 18, 2025

nnnth / UniLIP

Official implementation of "UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing"

Python 20 Updated Oct 10, 2025

yeyupiaoling / zsxq

本项目是博主在知识星球编写的文章项目，包含了多个项目的部署教程

1 Updated Oct 17, 2025

IST-DASLab / HALO

HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arxiv.org/abs/2501.02625

Python 26 Updated Feb 17, 2025

Vchitect / Uni-MMMU

Python 9 Updated Oct 16, 2025

BDML-lab / llm-inductive-reasoning-survey

This is the repository for the paper ‘A Survey of Inductive Reasoning for Large Language Models’

26 Updated Oct 20, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 28,389 2,937 Updated Oct 20, 2025

openvla / openvla

Forked from TRI-ML/prismatic-vlms

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 4,149 500 Updated Mar 23, 2025

shangshang-wang / Tora

Forked from meta-pytorch/torchtune

Tora: Torchtune-LoRA for RL

Python 66 7 Updated Oct 20, 2025

SakanaAI / AI-Scientist-v2

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

Python 1,668 318 Updated Aug 24, 2025

nvidia-cosmos / cosmos-predict2.5

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python 207 11 Updated Oct 16, 2025

Eyeline-Labs / VChain

VChain: Chain-of-Visual-Thought for Reasoning in Video Generation

90 1 Updated Oct 7, 2025

Espere-1119-Song / VideoNSA

VideoNSA: Native Sparse Attention Scales Video Understanding

Python 50 1 Updated Oct 8, 2025

showlab / Paper2Video

Automatic Video Generation from Scientific Papers

Python 1,164 148 Updated Oct 20, 2025

WECENG / ticket-purchase

大麦自动抢票，支持人员、城市、日期场次、价格选择

Python 5,217 642 Updated Sep 16, 2025

index-tts / index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 13,734 1,510 Updated Oct 10, 2025

inclusionAI / Ming-UniVision

Code release for Ming-UniVision: Joint Image Understanding and Geneation with a Continuous Unified Tokenizer

Python 106 4 Updated Oct 14, 2025

rdi-berkeley / awesome-RLVR-boundary

A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Language Models (LLMs).

64 4 Updated Oct 7, 2025

PRIME-RL / TTRL

[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

Python 864 64 Updated Sep 26, 2025

zhengchen1999 / DOVE

[NeurIPS'25] DOVE: Efficient One-Step Diffusion Model for Real-World Video Super-Resolution

Python 89 2 Updated Oct 16, 2025

shim0114 / T2V-Diffusion-Search

[NeurIPS 2025] Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search

Python 8 Updated Oct 4, 2025

XueZeyue / DanceGRPO

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,043 50 Updated Oct 16, 2025

posquit0 / Awesome-CV

📄 Awesome CV is LaTeX template for your outstanding job application

TeX 25,503 5,096 Updated Sep 5, 2025

ai-forever / Kandinsky-5

Kandinsky 5.0: A family of diffusion models for Video & Image generation

Python 164 8 Updated Oct 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Conna XuecWu

Achievements

Achievements

Block or report XuecWu

Lists (1)

🚀 My stack

Stars

deepseek-ai / DeepSeek-OCR

shinshin86 / oh-my-logo

EzioBy / Ditto

thuml / MiniVeo3-Reasoner

hzlsaber / So-Fake

OneIG-Bench / OneIG-Benchmark

EvolvingLMMs-Lab / NEO

nnnth / UniLIP

yeyupiaoling / zsxq

IST-DASLab / HALO

Vchitect / Uni-MMMU

BDML-lab / llm-inductive-reasoning-survey

karpathy / nanochat

openvla / openvla

shangshang-wang / Tora

SakanaAI / AI-Scientist-v2

nvidia-cosmos / cosmos-predict2.5

Eyeline-Labs / VChain

Espere-1119-Song / VideoNSA

showlab / Paper2Video

WECENG / ticket-purchase

index-tts / index-tts

inclusionAI / Ming-UniVision

rdi-berkeley / awesome-RLVR-boundary

PRIME-RL / TTRL

zhengchen1999 / DOVE

shim0114 / T2V-Diffusion-Search

XueZeyue / DanceGRPO

posquit0 / Awesome-CV

ai-forever / Kandinsky-5