default search action
Yao Hu 0002
Person information
- affiliation: Xiaohongshu Inc., Beijing, China
Other persons with the same name
- Yao Hu — disambiguation page
- Yao Hu 0001
— City University of Hong Kong, Department of Data Science and Artificial Intelligence, Hong Kong
- Yao Hu 0003
— University of Illinois at Urbana-Champaign, Ven Te Chow Hydrosystems Laboratory, Department of Civil and Environmental Engineering, Urbana, IL, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j16]Yutao Hu
, Xiaolong Jiang, Xuhui Liu, Xiaoyan Luo
, Yao Hu, Xianbin Cao
, Baochang Zhang
, Jun Zhang
:
Hierarchical Self-Distilled Feature Learning for Fine-Grained Visual Categorization. IEEE Trans. Neural Networks Learn. Syst. 36(3): 4005-4018 (2025) - [c53]Chen Zhang, Qiuchi Li, Dawei Song, Zheyu Ye, Yan Gao, Yao Hu:
Towards the Law of Capacity Gap in Distilling Language Models. ACL (1) 2025: 22504-22528 - [c52]Meizhi Zhong, Xikai Liu, Chen Zhang, Yikun Lei, Yan Gao, Yao Hu, Kehai Chen, Min Zhang:
ZigZagKV: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty. COLING 2025: 8897-8907 - [c51]Meizhi Zhong, Chen Zhang, Yikun Lei, Xikai Liu, Yan Gao, Yao Hu, Kehai Chen, Min Zhang:
Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective. COLING 2025: 8955-8962 - [c50]Yikun Liu, Yajie Zhang
, Jiayin Cai, Xiaolong Jiang, Yao Hu, Jiangchao Yao, Yanfeng Wang, Weidi Xie:
LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant. CVPR 2025: 4015-4025 - [c49]Zehao Xiao, Shilin Yan, Jack Hong, Jiayin Cai, Xiaolong Jiang, Yao Hu, Jiayi Shen, Cheems Wang, Cees G. M. Snoek:
DynaPrompt: Dynamic Test-Time Prompt Tuning. ICLR 2025 - [c48]Shilin Yan, Ouxiang Li, Jiayin Cai, Yanbin Hao, Xiaolong Jiang, Yao Hu, Weidi Xie:
A Sanity Check for AI-generated Image Detection. ICLR 2025 - [c47]Ouxiang Li
, Jiayin Cai
, Yanbin Hao
, Xiaolong Jiang
, Yao Hu
, Fuli Feng
:
Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective. KDD (1) 2025: 2405-2414 - [c46]Chao Zhang
, Haoxin Zhang
, Shiwei Wu
, Di Wu
, Tong Xu
, Xiangyu Zhao
, Yan Gao
, Yao Hu
, Enhong Chen
:
NoteLLM-2: Multimodal Large Representation Models for Recommendation. KDD (1) 2025: 2815-2826 - [c45]Chen Zhang, Meizhi Zhong, Qimeng Wang, Xuantao Lu, Zheyu Ye, Chengqiang Lu, Yan Gao, Yao Hu, Kehai Chen, Min Zhang, Dawei Song:
MoDification: Mixture of Depths Made Easy. NAACL (Long Papers) 2025: 5137-5149 - [c44]Xu Zhao
, Ruibo Ma
, Jiaqi Chen
, Weiqi Zhao
, Ping Yang
, Yao Hu
:
Multi-Granularity Distribution Modeling for Video Watch Time Prediction via Exponential-Gaussian Mixture Network. RecSys 2025: 309-318 - [c43]Jia Chen
, Qian Dong
, Haitao Li
, Xiaohui He
, Yan Gao
, Shaosheng Cao
, Yi Wu
, Ping Yang
, Chen Xu
, Yao Hu
, Qingyao Ai
, Yiqun Liu
:
Qilin: A Multimodal Information Retrieval Dataset with APP-level User Sessions. SIGIR 2025: 3670-3680 - [c42]Zihan Niu
, Zheyong Xie
, Shaosheng Cao
, Chonggang Lu
, Zheyu Ye
, Tong Xu
, Zuozhu Liu
, Yan Gao
, Jia Chen
, Zhe Xu
, Yi Wu
, Yao Hu
:
PaRT: Enhancing Proactive Social Chatbots with Personalized Real-Time Retrieval. SIGIR 2025: 4269-4274 - [c41]Yang Shi
, Yiping Sun
, Jiaolong Du
, Xiaocheng Zhong
, Zhiyong Wang
, Yao Hu
:
Scalable Overload-Aware Graph-Based Index Construction for 10-Billion-Scale Vector Similarity Search. WWW (Companion Volume) 2025: 1303-1307 - [i69]Runqi Wang, Sijie Xu, Tianyao He, Yang Chen, Wei Zhu, Dejia Song, Nemo Chen, Xu Tang, Yao Hu:
DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors. CoRR abs/2501.08553 (2025) - [i68]Kaituo Xu, Feng-Long Xie, Xu Tang, Yao Hu:
FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration. CoRR abs/2501.14350 (2025) - [i67]Zehao Xiao, Shilin Yan, Jack Hong, Jiayin Cai, Xiaolong Jiang, Yao Hu, Jiayi Shen, Qi Wang, Cees G. M. Snoek:
DynaPrompt: Dynamic Test-Time Prompt Tuning. CoRR abs/2501.16404 (2025) - [i66]Jack Hong, Shilin Yan, Jiayin Cai, Xiaolong Jiang, Yao Hu, Weidi Xie:
WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs. CoRR abs/2502.04326 (2025) - [i65]Jia Chen, Qian Dong, Haitao Li, Xiaohui He, Yan Gao, Shaosheng Cao, Yi Wu, Ping Yang, Chen Xu, Yao Hu, Qingyao Ai, Yiqun Liu:
Qilin: A Multimodal Information Retrieval Dataset with APP-level User Sessions. CoRR abs/2503.00501 (2025) - [i64]Wenxuan Huang, Bohan Jia, Zijie Zhai, Shaosheng Cao, Zheyu Ye, Fei Zhao, Zhe Xu, Yao Hu, Shaohui Lin:
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models. CoRR abs/2503.06749 (2025) - [i63]Zhichao Sun, Huazhang Hu, Yidong Ma, Gang Liu, Nemo Chen, Xu Tang, Yao Hu, Yongchao Xu:
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection. CoRR abs/2503.18430 (2025) - [i62]Haohan Guo, Kun Xie, Yi-Chen Wu, Feng-Long Xie, Xu Tang, Yao Hu:
FireRedTTS-1S: An Upgraded Streamable Foundation Text-to-Speech System. CoRR abs/2503.20499 (2025) - [i61]Hongcheng Guo, Fei Zhao, Shaosheng Cao, Xinze Lyu, Ziyan Liu, Yue Wang, Boyang Wang, Zhoujun Li, Chonggang Lu, Zhe Xu, Yao Hu:
Redefining Machine Translation on Social Network Services with Large Language Models. CoRR abs/2504.07901 (2025) - [i60]Zihan Niu, Zheyong Xie, Shaosheng Cao, Chonggang Lu, Zheyu Ye, Tong Xu, Zuozhu Liu, Yan Gao, Jia Chen, Zhe Xu, Yi Wu, Yao Hu:
PaRT: Enhancing Proactive Social Chatbots with Personalized Real-Time Retrieval. CoRR abs/2504.20624 (2025) - [i59]Zhaopeng Feng, Yupu Liang, Shaosheng Cao, Jiayuan Su, Jiahan Ren, Zhe Xu, Yao Hu, Wenxuan Huang, Jian Wu, Zuozhu Liu:
MT3: Scaling MLLM-based Text Image Machine Translation via Multi-Task Reinforcement Learning. CoRR abs/2505.19714 (2025) - [i58]Jack Hong, Shilin Yan, Zehao Xiao, Jiayin Cai, Xiaolong Jiang, Yao Hu, Henghui Ding:
Progressive Scaling Visual Object Tracking. CoRR abs/2505.19990 (2025) - [i57]Dongjie Yang, Chengqiang Lu, Qimeng Wang, Xinbei Ma, Yan Gao, Yao Hu, Hai Zhao:
Plan Your Travel and Travel with Your Plan: Wide-Horizon Planning and Evaluation via LLM. CoRR abs/2506.12421 (2025) - [i56]Zhouhong Gu, Xiaoxuan Zhu, Yin Cai, Hao Shen, Xingzhou Chen, Qingyi Wang, Jialin Li, Xiaoran Shi, Haoran Guo, Wenxuan Huang, Hongwei Feng, Yanghua Xiao, Zheyu Ye, Yao Hu, Shaosheng Cao:
AgentGroupChat-V2: Divide-and-Conquer Is What LLM-Based Multi-Agent System Need. CoRR abs/2506.15451 (2025) - [i55]Tianyao He, Runqi Wang, Yang Chen, Dejia Song, Nemo Chen, Xu Tang, Yao Hu:
Flux-Sculptor: Text-Driven Rich-Attribute Portrait Editing through Decomposed Spatial Flow Control. CoRR abs/2507.03979 (2025) - [i54]Fei Zhao, Chonggang Lu, Yue Wang, Zheyong Xie, Ziyan Liu, Haofu Qian, JianZhao Huang, Fangcheng Shi, Zijie Meng, Hongcheng Guo, Mingqian He, Xinze Lyu, Yiming Lu, Ziyang Xiang, Zheyu Ye, Chengqiang Lu, Zhe Xu, Yi Wu, Yao Hu, Yan Gao, Jun Fan, Xiaolong Jiang, Weiting Liu, Boyang Wang, Shaosheng Cao:
RedOne: Revealing Domain-specific LLM Post-Training in Social Networking Services. CoRR abs/2507.10605 (2025) - [i53]Qian Dong, Jia Chen, Qingyao Ai, Hongning Wang, Haitao Li, Yi Wu, Yao Hu, Yiqun Liu, Shaoping Ma:
SelfRACG: Enabling LLMs to Self-Express and Retrieve for Code Generation. CoRR abs/2507.19033 (2025) - [i52]Haochen Wang, Qirui Chen, Cilin Yan, Jiayin Cai, Xiaolong Jiang, Yao Hu, Weidi Xie, Stratis Gavves:
Object-centric Video Question Answering with Visual Grounding and Referring. CoRR abs/2507.19599 (2025) - [i51]Xiaowei Yuan, Lei Jin, Haoxin Zhang, Yan Gao, Yi Wu, Yao Hu, Ziyang Huang, Jun Zhao, Kang Liu:
Decomposed Reasoning with Reinforcement Learning for Relevance Assessment in UGC Platforms. CoRR abs/2508.02506 (2025) - [i50]Xu Zhao, Ruibo Ma, Jiaqi Chen, Weiqi Zhao, Ping Yang, Yao Hu:
Multi-Granularity Distribution Modeling for Video Watch Time Prediction via Exponential-Gaussian Mixture Network. CoRR abs/2508.12665 (2025) - [i49]Kun Xie, Feiyu Shen, Junjie Li, Fenglong Xie, Xu Tang, Yao Hu:
FireRedTTS-2: Towards Long Conversational Speech Generation for Podcast and Chatbot. CoRR abs/2509.02020 (2025) - [i48]Yuqing Huang, Rongyang Zhang, Qimeng Wang, Chengqiang Lu, Yan Gao, Yi Wu, Yao Hu, Xuyang Zhi, Guiquan Liu, Xin Li, Hao Wang, Enhong Chen:
SelfAug: Mitigating Catastrophic Forgetting in Retrieval-Augmented Generation via Distribution Self-Alignment. CoRR abs/2509.03934 (2025) - [i47]Junjie Chen, Yao Hu, Junjie Li, Kangyue Li, Kun Liu, Wenpeng Li, Xu Li, Ziyuan Li, Feiyu Shen, Xu Tang, Manzhen Wei, Yichen Wu, Fenglong Xie, Kaituo Xu, Kun Xie:
FireRedChat: A Pluggable, Full-Duplex Voice Interaction System with Cascaded and Semi-Cascaded Implementations. CoRR abs/2509.06502 (2025) - [i46]Wenxuan Huang, Shuang Chen, Zheyong Xie, Shaosheng Cao, Shixiang Tang, Yufan Shen, Qingyu Yin, Wenbo Hu, Xiaoman Wang, Yuntian Tang, Junbo Qiao, Yue Guo, Yao Hu, Zhenfei Yin, Philip Torr, Yu Cheng, Wanli Ouyang, Shaohui Lin:
Interleaving Reasoning for Better Text-to-Image Generation. CoRR abs/2509.06945 (2025) - [i45]Qiang Xiang, Shuang Sun, Binglei Li, Dejia Song, Huaxia Li, Nemo Chen, Xu Tang, Yao Hu, Junping Zhang:
InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention. CoRR abs/2509.16691 (2025) - [i44]Fei Zhao, Chengqiang Lu, Yufan Shen, Qimeng Wang, Yicheng Qian, Haoxin Zhang, Yan Gao, Yi Wu, Yao Hu, Zhen Wu, Shangyu Xing, Xinyu Dai:
RealBench: A Chinese Multi-image Understanding Benchmark Close to Real-world Scenarios. CoRR abs/2509.17421 (2025) - 2024
- [j15]Haochen Wang, Cilin Yan, Keyan Chen, Xiaolong Jiang, Xu Tang, Yao Hu, Guoliang Kang, Weidi Xie
, Efstratios Gavves:
OV-VIS: Open-Vocabulary Video Instance Segmentation. Int. J. Comput. Vis. 132(11): 5048-5065 (2024) - [j14]Keyan Chen, Xiaolong Jiang, Haochen Wang, Cilin Yan, Yan Gao, Xu Tang, Yao Hu, Weidi Xie
:
OV-DAR: Open-Vocabulary Object Detection and Attributes Recognition. Int. J. Comput. Vis. 132(11): 5387-5409 (2024) - [j13]Cilin Yan
, Haochen Wang, Jie Liu, Xiaolong Jiang, Yao Hu, Xu Tang, Guoliang Kang
, Efstratios Gavves:
PiClick: Picking the desired mask from multiple candidates in click-based interactive segmentation. Neurocomputing 599: 128083 (2024) - [j12]Yunkai Chen
, Qimeng Wang
, Shiwei Wu
, Yan Gao
, Tong Xu
, Yao Hu
:
TOMGPT: Reliable Text-Only Training Approach for Cost-Effective Multi-modal Large Language Model. ACM Trans. Knowl. Discov. Data 18(7): 171 (2024) - [c40]Bohan Zeng, Shanglin Li, Xuhui Liu, Sicheng Gao, Xiaolong Jiang, Xu Tang, Yao Hu, Jianzhuang Liu, Baochang Zhang:
Controllable Mind Visual Diffusion Model. AAAI 2024: 6935-6943 - [c39]Runqi Wang, Huixin Sun, Linlin Yang, Shaohui Lin, Chuanjian Liu, Yan Gao, Yao Hu, Baochang Zhang:
AQ-DETR: Low-Bit Quantized Detection Transformer with Auxiliary Queries. AAAI 2024: 15598-15606 - [c38]Dongjie Yang, Xiaodong Han, Yan Gao, Yao Hu, Shilin Zhang, Hai Zhao:
PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference. ACL (Findings) 2024: 3258-3270 - [c37]Shanglin Li, Bohan Zeng, Yutang Feng, Sicheng Gao, Xiuhui Liu, Jiaming Liu, Lin Li, Xu Tang, Yao Hu, Jianzhuang Liu, Baochang Zhang:
ZONE: Zero-Shot Instruction-Guided Local Editing. CVPR 2024: 6254-6263 - [c36]Yuxuan Zhang, Yiren Song, Jiaming Liu, Rui Wang, Jinpeng Yu, Hao Tang, Huaxia Li, Xu Tang, Yao Hu, Han Pan, Zhongliang Jing:
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation. CVPR 2024: 8069-8078 - [c35]Cilin Yan, Haochen Wang, Shilin Yan, Xiaolong Jiang, Yao Hu, Guoliang Kang, Weidi Xie, Efstratios Gavves:
VISA: Reasoning Video Object Segmentation via Large Language Models. ECCV (15) 2024: 98-115 - [c34]Yejing Wang
, Dong Xu, Xiangyu Zhao
, Zhiren Mao, Peng Xiang, Ling Yan, Yao Hu, Zijian Zhang, Xuetao Wei, Qidong Liu
:
Bi-Level User Modeling for Deep Recommenders. ICDM 2024: 510-519 - [c33]Suyuan Huang, Haoxin Zhang, Yanyu Xu, Yan Gao, Yao Hu, Zengchang Qin:
Caseg: Clip-Based Action Segmentation With Learnable Text Prompt. ICIP 2024: 2201-2207 - [c32]Zihan Niu, Zheyong Xie, Tong Xu, Xiangfeng Wang, Yao Hu, Ying Yu, Enhong Chen:
Knowledge-Enhanced Multi-perspective Incongruity Perception Network for Multimodal Sarcasm Detection. ICME 2024: 1-6 - [c31]Lijun Zhang, Haomin Bai, Wei-Wei Tu, Ping Yang, Yao Hu:
Efficient Stochastic Approximation of Minimax Excess Risk Optimization. ICML 2024 - [c30]Wenhao Yang, Wei Jiang, Yibo Wang, Ping Yang, Yao Hu, Lijun Zhang:
Small-loss Adaptive Regret for Online Convex Optimization. ICML 2024 - [c29]Shiwei Wu, Joya Chen, Kevin Qinghong Lin, Qimeng Wang, Yan Gao, Qianli Xu, Tong Xu, Yao Hu, Enhong Chen, Mike Zheng Shou:
VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation. NeurIPS 2024 - [c28]Dongjie Yang, Suyuan Huang, Chengqiang Lu, Xiaodong Han, Haoxin Zhang, Yan Gao, Yao Hu, Hai Zhao:
Vript: A Video Is Worth Thousands of Words. NeurIPS 2024 - [c27]Chao Zhang
, Shiwei Wu
, Haoxin Zhang
, Tong Xu
, Yan Gao
, Yao Hu
, Enhong Chen
:
NoteLLM: A Retrievable Large Language Model for Note Recommendation. WWW (Companion Volume) 2024: 170-179 - [i43]Chao Zhang, Shiwei Wu, Haoxin Zhang, Tong Xu, Yan Gao, Yao Hu, Di Wu, Enhong Chen:
NoteLLM: A Retrievable Large Language Model for Note Recommendation. CoRR abs/2403.01744 (2024) - [i42]Yuxuan Zhang, Lifu Wei, Qing Zhang, Yiren Song, Jiaming Liu, Huaxia Li, Xu Tang, Yao Hu, Haibo Zhao:
Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model. CoRR abs/2403.07764 (2024) - [i41]Rui Wang, Hailong Guo, Jiaming Liu, Huaxia Li, Haibo Zhao, Xu Tang, Yao Hu, Hao Tang, Peipei Li:
StableGarment: Garment-Centric Generation via Stable Diffusion. CoRR abs/2403.10783 (2024) - [i40]Zhouhong Gu, Xiaoxuan Zhu, Haoran Guo, Lin Zhang, Yin Cai, Hao Shen, Jiangjie Chen, Zheyu Ye, Yifei Dai, Yan Gao, Yao Hu, Hongwei Feng, Yanghua Xiao:
Agent Group Chat: An Interactive Group Chat Simulacra For Better Eliciting Collective Emergent Behavior. CoRR abs/2403.13433 (2024) - [i39]Suyuan Huang, Haoxin Zhang, Yan Gao, Yao Hu, Zengchang Qin:
From Image to Video, what do we need in multimodal LLMs? CoRR abs/2404.11865 (2024) - [i38]Dongjie Yang, Xiaodong Han, Yan Gao, Yao Hu, Shilin Zhang, Hai Zhao:
PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference. CoRR abs/2405.12532 (2024) - [i37]Chao Zhang, Haoxin Zhang, Shiwei Wu, Di Wu, Tong Xu, Yan Gao, Yao Hu, Enhong Chen:
NoteLLM-2: Multimodal Large Representation Models for Recommendation. CoRR abs/2405.16789 (2024) - [i36]Dongjie Yang, Suyuan Huang, Chengqiang Lu, Xiaodong Han, Haoxin Zhang, Yan Gao, Yao Hu, Hai Zhao:
Vript: A Video Is Worth Thousands of Words. CoRR abs/2406.06040 (2024) - [i35]Shiwei Wu, Chao Zhang, Joya Chen, Tong Xu, Likang Wu, Yao Hu, Enhong Chen:
From a Social Cognitive Perspective: Context-aware Visual Social Relationship Recognition. CoRR abs/2406.08358 (2024) - [i34]Cilin Yan, Haochen Wang, Xiaolong Jiang, Yao Hu, Xu Tang, Guoliang Kang, Efstratios Gavves:
Mining Open Semantics from CLIP: A Relation Transition Perspective for Few-Shot Learning. CoRR abs/2406.11252 (2024) - [i33]Meizhi Zhong, Chen Zhang, Yikun Lei, Xikai Liu, Yan Gao, Yao Hu, Kehai Chen, Min Zhang:
Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective. CoRR abs/2406.13282 (2024) - [i32]Shilin Yan, Ouxiang Li, Jiayin Cai, Yanbin Hao, Xiaolong Jiang, Yao Hu, Weidi Xie:
A Sanity Check for AI-generated Image Detection. CoRR abs/2406.19435 (2024) - [i31]Cilin Yan, Haochen Wang, Shilin Yan, Xiaolong Jiang, Yao Hu, Guoliang Kang, Weidi Xie, Efstratios Gavves:
VISA: Reasoning Video Object Segmentation via Large Language Models. CoRR abs/2407.11325 (2024) - [i30]Ouxiang Li, Jiayin Cai, Yanbin Hao, Xiaolong Jiang, Yao Hu, Fuli Feng:
Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective. CoRR abs/2408.06741 (2024) - [i29]Shiwei Wu, Joya Chen, Kevin Qinghong Lin, Qimeng Wang, Yan Gao, Qianli Xu, Tong Xu, Yao Hu, Enhong Chen, Mike Zheng Shou:
VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation. CoRR abs/2408.16730 (2024) - [i28]Cunzheng Wang, Ziyuan Guo, Yuxuan Duan, Huaxia Li, Nemo Chen, Xu Tang, Yao Hu:
Target-Driven Distillation: Consistency Distillation with Target Timestep Selection and Decoupled Guidance. CoRR abs/2409.01347 (2024) - [i27]Huixin Sun, Runqi Wang, Yanjing Li, Xianbin Cao, Xiaolong Jiang, Yao Hu, Baochang Zhang:
P4Q: Learning to Prompt for Quantization in Visual-language Models. CoRR abs/2409.17634 (2024) - [i26]Shiwei Wu, Chen Zhang, Yan Gao, Qimeng Wang, Tong Xu, Yao Hu, Enhong Chen:
Benchmarking Large Language Models for Conversational Question Answering in Multi-instructional Documents. CoRR abs/2410.00526 (2024) - [i25]Chen Zhang, Meizhi Zhong, Qimeng Wang, Xuantao Lu, Zheyu Ye, Chengqiang Lu, Yan Gao, Yao Hu, Kehai Chen, Min Zhang, Dawei Song:
MoDification: Mixture of Depths Made Easy. CoRR abs/2410.14268 (2024) - [i24]Yejing Wang, Dong Xu, Xiangyu Zhao, Zhiren Mao, Peng Xiang, Ling Yan, Yao Hu, Zijian Zhang, Xuetao Wei, Qidong Liu:
GPRec: Bi-level User Modeling for Deep Recommenders. CoRR abs/2410.20730 (2024) - [i23]Suyuan Huang, Chao Zhang, Yuanyuan Wu, Haoxin Zhang, Yuan Wang, Maolin Wang, Shaosheng Cao, Tong Xu, Xiangyu Zhao, Zengchang Qin, Yan Gao, Yunhan Bai, Jun Fan, Yao Hu, Enhong Chen:
ScalingNote: Scaling up Retrievers with Large Language Models for Real-World Dense Retrieval. CoRR abs/2411.15766 (2024) - [i22]Yikun Liu, Pingan Chen, Jiayin Cai, Xiaolong Jiang, Yao Hu, Jiangchao Yao, Yanfeng Wang, Weidi Xie:
LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant. CoRR abs/2412.01720 (2024) - [i21]Meizhi Zhong, Xikai Liu, Chen Zhang, Yikun Lei, Yan Gao, Yao Hu, Kehai Chen, Min Zhang:
ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty. CoRR abs/2412.09036 (2024) - [i20]Sijie Xu, Runqi Wang, Wei Zhu, Dejia Song, Nemo Chen, Xu Tang, Yao Hu:
Single Trajectory Distillation for Accelerating Image and Video Style Transfer. CoRR abs/2412.18945 (2024) - 2023
- [j11]Chao Xiang, Zhongming Jin, Zhengxu Yu, Xian-Sheng Hua, Yao Hu, Wei Qian, Kaili Zhu, Deng Cai, Xiaofei He:
Optimizing traffic efficiency via a reinforcement learning approach based on time allocation. Int. J. Mach. Learn. Cybern. 14(10): 3381-3391 (2023) - [c26]Keyan Chen, Xiaolong Jiang, Yao Hu, Xu Tang, Yan Gao, Jianqi Chen, Weidi Xie:
OvarNet: Towards Open-Vocabulary Object Attribute Recognition. CVPR 2023: 23518-23527 - [c25]Jiasheng Zhang, Xikai Liu, Xinyi Lai, Yan Gao, Shusen Wang, Yao Hu, Yiqing Lin:
2INER: Instructive and In-Context Learning on Few-Shot Named Entity Recognition. EMNLP (Findings) 2023: 3940-3951 - [c24]Haochen Wang, Xiaolong Jiang, Xu Tang, Yao Hu, Cilin Yan, Weidi Xie, Shuai Wang
, Efstratios Gavves:
Towards Open-Vocabulary Video Instance Segmentation. ICCV 2023: 4034-4043 - [i19]Keyan Chen, Xiaolong Jiang, Yao Hu, Xu Tang, Yan Gao, Jianqi Chen, Weidi Xie:
OvarNet: Towards Open-vocabulary Object Attribute Recognition. CoRR abs/2301.09506 (2023) - [i18]Haochen Wang, Shuai Wang
, Cilin Yan, Xiaolong Jiang, Xu Tang, Yao Hu, Weidi Xie, Efstratios Gavves:
Towards Open-Vocabulary Video Instance Segmentation. CoRR abs/2304.01715 (2023) - [i17]Jie Guo, Qimeng Wang, Yan Gao, Xiaolong Jiang, Xu Tang, Yao Hu, Baochang Zhang:
MVP-SEG: Multi-View Prompt Learning for Open-Vocabulary Semantic Segmentation. CoRR abs/2304.06957 (2023) - [i16]Cilin Yan, Haochen Wang, Jie Liu, Xiaolong Jiang, Yao Hu, Xu Tang, Guoliang Kang, Efstratios Gavves:
PiClick: Picking the desired mask in click-based interactive segmentation. CoRR abs/2304.11609 (2023) - [i15]Bohan Zeng, Shanglin Li, Xuhui Liu, Sicheng Gao, Xiaolong Jiang, Xu Tang, Yao Hu, Jianzhuang Liu, Baochang Zhang:
Controllable Mind Visual Diffusion Model. CoRR abs/2305.10135 (2023) - [i14]Yuxuan Zhang, Jiaming Liu, Yiren Song, Rui Wang, Hao Tang, Jinpeng Yu, Huaxia Li, Xu Tang, Yao Hu, Han Pan, Zhongliang Jing:
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation. CoRR abs/2312.16272 (2023) - [i13]Shanglin Li, Bohan Zeng, Yutang Feng, Sicheng Gao, Xuhui Liu, Jiaming Liu, Li Lin, Xu Tang, Yao Hu, Jianzhuang Liu, Baochang Zhang:
ZONE: Zero-Shot Instruction-Guided Local Editing. CoRR abs/2312.16794 (2023) - 2022
- [j10]Jiyang Qi, Yan Gao, Yao Hu, Xinggang Wang
, Xiaoyu Liu, Xiang Bai, Serge J. Belongie
, Alan L. Yuille, Philip H. S. Torr, Song Bai
:
Occluded Video Instance Segmentation: A Benchmark. Int. J. Comput. Vis. 130(8): 2022-2039 (2022) - [j9]Xiaolong Liu
, Qimeng Wang
, Yao Hu, Xu Tang, Shiwei Zhang, Song Bai
, Xiang Bai
:
End-to-End Temporal Action Detection With Transformer. IEEE Trans. Image Process. 31: 5427-5441 (2022) - [j8]Tun Zhu, Daoxin Zhang, Yao Hu, Tianran Wang
, Xiaolong Jiang, Jianke Zhu
, Jiawei Li:
Horizontal-to-Vertical Video Conversion. IEEE Trans. Multim. 24: 3036-3048 (2022) - [c23]Shiyin Lu, Yuan Miao, Ping Yang, Yao Hu, Lijun Zhang:
Non-stationary Dueling Bandits for Online Learning to Rank. APWeb/WAIM (2) 2022: 166-174 - [i12]Yan Gao, Qimeng Wang, Xu Tang, Haochen Wang, Fei Ding, Jing Li, Yao Hu:
Decoupled IoU Regression for Object Detection. CoRR abs/2202.00866 (2022) - 2021
- [j7]Tong Xu, Peilun Zhou, Linkang Hu, Xiangnan He, Yao Hu, Enhong Chen
:
Socializing the Videos: A Multimodal Approach for Social Relation Recognition. ACM Trans. Multim. Comput. Commun. Appl. 17(1): 23:1-23:23 (2021) - [c22]Shiyin Lu, Yao Hu, Lijun Zhang:
Stochastic Bandits with Graph Feedback in Non-Stationary Environments. AAAI 2021: 8758-8766 - [c21]Haochen Wang, Xiaolong Jiang, Haibing Ren, Yao Hu, Song Bai:
SwiftNet: Real-Time Video Object Segmentation. CVPR 2021: 1296-1305 - [c20]Xiaolong Liu
, Yao Hu, Song Bai, Fei Ding, Xiang Bai, Philip H. S. Torr:
Multi-Shot Temporal Event Localization: A Benchmark. CVPR 2021: 12596-12606 - [c19]Hao Fang, Daoxin Zhang, Yi Zhang, Minghao Chen, Jiawei Li, Yao Hu, Deng Cai, Xiaofei He:
Salient Object Ranking with Position-Preserved Attention. ICCV 2021: 16311-16321 - [c18]Cheng Chen, Jiayin Cai, Yao Hu, Xu Tang, Xinggang Wang
, Chun Yuan, Xiang Bai, Song Bai:
Deep Interactive Video Inpainting: An Invisibility Cloak for Harry Potter. ACM Multimedia 2021: 862-870 - [c17]Shiwei Wu, Joya Chen
, Tong Xu, Liyi Chen
, Lingfei Wu, Yao Hu, Enhong Chen
:
Linking the Characters: Video-oriented Social Graph Generation via Hierarchical-cumulative GCN. ACM Multimedia 2021: 4716-4724 - [c16]Yan Gao, Qimeng Wang, Xu Tang, Haochen Wang, Fei Ding, Jing Li, Yao Hu:
Decoupled IoU Regression for Object Detection. ACM Multimedia 2021: 5628-5636 - [c15]Jiyang Qi, Yan Gao, Yao Hu, Xinggang Wang, Xiaoyu Liu, Xiang Bai, Serge J. Belongie, Alan L. Yuille, Philip H. S. Torr, Song Bai:
Occluded Video Instance Segmentation: Dataset and ICCV 2021 Challenge. NeurIPS Datasets and Benchmarks 2021 - [c14]Jiyang Qi, Xinggang Wang
, Yao Hu, Xu Tang, Wenyu Liu:
Pyramid Self-attention for Semantic Segmentation. PRCV (1) 2021: 480-492 - [i11]Tun Zhu, Daoxin Zhang, Tianran Wang, Xiaolong Jiang, Jiawei Li, Yao Hu, Jianke Zhu:
Horizontal-to-Vertical Video Conversion. CoRR abs/2101.04051 (2021) - [i10]Jiyang Qi, Yan Gao, Yao Hu, Xinggang Wang, Xiaoyu Liu, Xiang Bai, Serge J. Belongie
, Alan L. Yuille, Philip H. S. Torr, Song Bai:
Occluded Video Instance Segmentation. CoRR abs/2102.01558 (2021) - [i9]Haochen Wang, Xiaolong Jiang, Haibing Ren, Yao Hu, Song Bai:
SwiftNet: Real-time Video Object Segmentation. CoRR abs/2102.04604 (2021) - [i8]Hao Fang, Daoxin Zhang, Yi Zhang, Minghao Chen, Jiawei Li, Yao Hu, Deng Cai, Xiaofei He:
Salient Object Ranking with Position-Preserved Attention. CoRR abs/2106.05047 (2021) - [i7]Xiaolong Liu
, Qimeng Wang, Yao Hu, Xu Tang, Song Bai, Xiang Bai:
End-to-end Temporal Action Detection with Transformer. CoRR abs/2106.10271 (2021) - [i6]Jiyang Qi, Yan Gao, Yao Hu, Xinggang Wang, Xiaoyu Liu, Xiang Bai, Serge J. Belongie, Alan L. Yuille, Philip H. S. Torr, Song Bai:
Occluded Video Instance Segmentation: Dataset and ICCV 2021 Challenge. CoRR abs/2111.07950 (2021) - 2020
- [c13]Guanghui Wang, Shiyin Lu, Yao Hu, Lijun Zhang:
Adapting to Smoothness: A More Universal Algorithm for Online Convex Optimization. AAAI 2020: 6162-6169 - [i5]Jia Guo, Minghao Chen, Yao Hu, Chen Zhu, Xiaofei He, Deng Cai:
Spherical Knowledge Distillation. CoRR abs/2010.07485 (2020) - [i4]Jia Guo, Chen Zhu, Yilun Zhao, Heda Wang, Yao Hu, Xiaofei He, Deng Cai:
LAMP: Label Augmented Multimodal Pretraining. CoRR abs/2012.04446 (2020) - [i3]Xiaolong Liu
, Yao Hu, Song Bai, Fei Ding, Xiang Bai, Philip H. S. Torr:
Multi-shot Temporal Event Localization: a Benchmark. CoRR abs/2012.09434 (2020)
2010 – 2019
- 2019
- [c12]Shiyin Lu, Guanghui Wang, Yao Hu, Lijun Zhang:
Optimal Algorithms for Lipschitz Bandits with Heavy-tailed Rewards. ICML 2019: 4154-4163 - [c11]Shiyin Lu, Guanghui Wang, Yao Hu, Lijun Zhang:
Multi-Objective Generalized Linear Bandits. IJCAI 2019: 3080-3086 - [i2]Shiyin Lu, Guanghui Wang, Yao Hu, Lijun Zhang:
Multi-Objective Generalized Linear Bandits. CoRR abs/1905.12879 (2019) - [i1]Shuai Zhao, Boxi Wu, Wenqing Chu, Yao Hu, Deng Cai:
Correlation Maximized Structural Similarity Loss for Semantic Segmentation. CoRR abs/1910.08711 (2019) - 2016
- [j6]Bin Hong, Long Wei, Yao Hu, Deng Cai, Xiaofei He:
Online robust principal component analysis via truncated nuclear norm regularization. Neurocomputing 175: 216-222 (2016) - [j5]Wenqing Chu, Yao Hu, Chen Zhao, Haifeng Liu, Deng Cai:
Atom Decomposition Based Subgradient Descent for matrix classification. Neurocomputing 205: 222-228 (2016) - [j4]Yao Hu, Chen Zhao, Deng Cai, Xiaofei He, Xuelong Li
:
Atom Decomposition with Adaptive Basis Selection Strategy for Matrix Completion. ACM Trans. Multim. Comput. Commun. Appl. 12(3): 43:1-43:25 (2016) - 2015
- [j3]Yao Hu, Zhongming Jin
, Yi Shi, Debing Zhang, Deng Cai, Xiaofei He:
Large scale multi-class classification with truncated nuclear norm regularization. Neurocomputing 148: 310-317 (2015) - [c10]Debing Zhang, Long Wei, Bin Hong, Yao Hu, Deng Cai, Xiaofei He:
Event Recovery by Faster Truncated Nuclear Norm Minimization. IScIDE (2) 2015: 171-180 - 2014
- [j2]Zhongming Jin, Debing Zhang, Yao Hu, Shiding Lin, Deng Cai, Xiaofei He:
Fast and Accurate Hashing Via Iterative Nearest Neighbors Expansion. IEEE Trans. Cybern. 44(11): 2167-2177 (2014) - [c9]Weizhong Zhang, Lijun Zhang, Yao Hu, Rong Jin, Deng Cai, Xiaofei He:
Sparse Learning for Stochastic Composite Optimization. AAAI 2014: 893-900 - [c8]Yao Hu, Zhongming Jin, Hongyi Ren
, Deng Cai, Xiaofei He:
Iterative Multi-View Hashing for Cross Media Indexing. ACM Multimedia 2014: 527-536 - [c7]Zheng Yang, Yao Hu, Haifeng Liu, Huajun Chen, Zhaohui Wu:
Matrix Completion for Cross-view Pairwise Constraint Propagation. ACM Multimedia 2014: 897-900 - 2013
- [j1]Yao Hu, Debing Zhang, Jieping Ye, Xuelong Li
, Xiaofei He:
Fast and Accurate Matrix Completion via Truncated Nuclear Norm Regularization. IEEE Trans. Pattern Anal. Mach. Intell. 35(9): 2117-2130 (2013) - [c6]Zhongming Jin, Yao Hu, Yue Lin, Debing Zhang, Shiding Lin, Deng Cai, Xuelong Li
:
Complementary Projection Hashing. ICCV 2013: 257-264 - [c5]Debing Zhang, Genmao Yang, Yao Hu, Zhongming Jin, Deng Cai, Xiaofei He:
A Unified Approximate Nearest Neighbor Search Scheme by Combining Data Structure and Hashing. IJCAI 2013: 681-688 - [c4]Yao Hu, Debing Zhang, Zhongming Jin, Deng Cai, Xiaofei He:
Active Learning Based on Local Representation. IJCAI 2013: 1415-1421 - [c3]Chuhang Zou, Yao Hu, Deng Cai, Xiaofei He:
Salient Object Detection via Fast Iterative Truncated Nuclear Norm Recovery. IScIDE 2013: 238-245 - 2012
- [c2]Debing Zhang, Yao Hu, Jieping Ye, Xuelong Li
, Xiaofei He:
Matrix completion by Truncated Nuclear Norm Regularization. CVPR 2012: 2192-2199 - [c1]Yao Hu, Debing Zhang, Jun Liu, Jieping Ye, Xiaofei He:
Accelerated singular value thresholding for matrix completion. KDD 2012: 298-306
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-11-10 00:26 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint