default search action
Haoqin Sun
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j5]Hui Wang
, Yifan Yang
, Shujie Liu, Jinyu Li
, Lingwei Meng, Tie-Yan Liu, Jiaming Zhou
, Haoqin Sun, Yan Lu, Yong Qin:
StreamMel: Real-Time Zero-Shot Text-to-Speech Via Interleaved Continuous Autoregressive Modeling. IEEE Signal Process. Lett. 32: 3530-3534 (2025) - [c12]Jiaming Zhou, Shiyao Wang, Shiwan Zhao, Jiabei He, Haoqin Sun, Hui Wang, Cheng Liu, Aobo Kong, Yujie Guo, Xi Yang, Yequan Wang, Yonghua Lin, Yong Qin:
ChildMandarin: A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5. ACL (1) 2025: 12524-12537 - [c11]Jiabei He, Shiwan Zhao, Jiaming Zhou, Haoqin Sun, Hui Wang, Yong Qin:
Emotion-Preserving Prosody Anonymization Network for Voice Privacy Protection. ICASSP 2025: 1-5 - [c10]Cheng Liu, Hui Wang, Jinghua Zhao, Shiwan Zhao, Hui Bu, Xin Xu, Jiaming Zhou, Haoqin Sun, Yong Qin:
MusicEval: A Generative Music Dataset with Expert Ratings for Automatic Text-to-Music Evaluation. ICASSP 2025: 1-5 - [c9]Haoqin Sun, Shiwan Zhao, Shaokai Li, Xiangyu Kong, Xuechen Wang, Jiaming Zhou, Aobo Kong, Yong Chen, Wenjia Zeng, Yong Qin:
Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework. ICASSP 2025: 1-5 - [c8]Xuechen Wang, Shiwan Zhao, Haoqin Sun, Hui Wang, Jiaming Zhou, Yong Qin:
Enhancing Multimodal Emotion Recognition through Multi-Granularity Cross-Modal Alignment. ICASSP 2025: 1-5 - [c7]Jiaming Zhou, Shiwan Zhao, Jiabei He, Hui Wang, Wenjia Zeng, Yong Chen, Haoqin Sun, Aobo Kong, Yong Qin:
M2R-Whisper: Multi-stage and Multi-scale Retrieval Augmentation for Enhancing Whisper. ICASSP 2025: 1-5 - [c6]Jiaming Zhou, Shiwan Zhao, Hui Wang, Tian-Hao Zhang, Haoqin Sun, Xuechen Wang, Yong Qin:
Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores. ICASSP 2025: 1-5 - [i24]Cheng Liu, Hui Wang, Jinghua Zhao, Shiwan Zhao, Hui Bu, Xin Xu, Jiaming Zhou, Haoqin Sun, Yong Qin:
MusicEval: A Generative Music Corpus with Expert Ratings for Automatic Text-to-Music Evaluation. CoRR abs/2501.10811 (2025) - [i23]Hui Wang, Shujie Liu, Lingwei Meng, Jinyu Li, Yifan Yang, Shiwan Zhao, Haiyang Sun, Yanqing Liu, Haoqin Sun, Jiaming Zhou, Yan Lu, Yong Qin:
FELLE: Autoregressive Speech Synthesis with Token-Wise Coarse-to-Fine Flow Matching. CoRR abs/2502.11128 (2025) - [i22]Jiaming Zhou, Yujie Guo, Shiwan Zhao, Haoqin Sun, Hui Wang, Jiabei He, Aobo Kong, Shiyao Wang, Xi Yang, Yequan Wang, Yonghua Lin, Yong Qin:
CS-Dialogue: A 104-Hour Dataset of Spontaneous Mandarin-English Code-Switching Dialogues for Speech Recognition. CoRR abs/2502.18913 (2025) - [i21]Jingguang Tian, Haoqin Sun, Xinhui Hu, Xinkang Xu:
Discrete Audio Representations for Automated Audio Captioning. CoRR abs/2505.14989 (2025) - [i20]Haoqin Sun, Jingguang Tian, Jiaming Zhou, Hui Wang, Jiabei He, Shiwan Zhao, Xiangyu Kong, Desheng Hu, Xinkang Xu, Xinhui Hu, Yong Qin:
RA-CLAP: Relation-Augmented Emotional Speaking Style Contrastive Language-Audio Pretraining For Speech Retrieval. CoRR abs/2505.19437 (2025) - [i19]Haoqin Sun, Xuechen Wang, Jinghua Zhao, Shiwan Zhao, Jiaming Zhou, Hui Wang, Jiabei He, Aobo Kong, Xi Yang, Yequan Wang, Yonghua Lin, Yong Qin:
EmotionTalk: An Interactive Chinese Multimodal Emotion Dataset With Rich Annotations. CoRR abs/2505.23018 (2025) - [i18]Hui Wang, Yifan Yang, Shujie Liu, Jinyu Li, Lingwei Meng, Yanqing Liu, Jiaming Zhou, Haoqin Sun, Yan Lu, Yong Qin:
StreamMel: Real-Time Zero-shot Text-to-Speech via Interleaved Continuous Autoregressive Modeling. CoRR abs/2506.12570 (2025) - [i17]Jiaming Zhou, Hongjie Chen, Shiwan Zhao, Jian Kang, Jie Li, Enzhi Wang, Yujie Guo, Haoqin Sun, Hui Wang, Aobo Kong, Yong Qin, Xuelong Li:
DIFFA: Large Language Diffusion Models Can Listen and Understand. CoRR abs/2507.18452 (2025) - [i16]Xiangyu Kong, Hengde Zhu, Haoqin Sun, Zhihao Guo, Jiayan Gu, Xinyi Ni, Wei Zhang, Shizhe Liu, Siyang Song:
Learning Personalised Human Internal Cognition from External Expressive Behaviours for Real Personality Recognition. CoRR abs/2508.00205 (2025) - [i15]Fengping Tian, Chenyang Lyu, Xuanfan Ni, Haoqin Sun, Qingjuan Li, Zhiqiang Qian, Haijun Li, Longyue Wang, Zhao Xu, Weihua Luo, Kaifu Zhang:
Marco-Voice Technical Report. CoRR abs/2508.02038 (2025) - [i14]Hui Wang, Cheng Liu, Junyang Chen, Haoze Liu, Yuhang Jia, Shiwan Zhao, Jiaming Zhou, Haoqin Sun, Hui Bu, Yong Qin:
TTA-Bench: A Comprehensive Benchmark for Evaluating Text-to-Audio Models. CoRR abs/2509.02398 (2025) - [i13]Jinghua Zhao, Hang Su, Lichun Fan, Zhenbo Luo, Hui Wang, Haoqin Sun, Yong Qin:
Omni-CLST: Error-aware Curriculum Learning with guided Selective chain-of-Thought for audio question answering. CoRR abs/2509.12275 (2025) - [i12]Haoqin Sun, Chenyang Lyu, Xiangyu Kong, Shiwan Zhao, Jiaming Zhou, Hui Wang, Aobo Kong, Jinghua Zhao, Longyue Wang, Weihua Luo, Kaifu Zhang, Yong Qin:
MECap-R1: Emotion-aware Policy with Reinforcement Learning for Multimodal Emotion Captioning. CoRR abs/2509.18729 (2025) - 2024
- [c5]Haoqin Sun, Shiwan Zhao, Xuechen Wang, Wenjia Zeng, Yong Chen, Yong Qin:
Fine-Grained Disentangled Representation Learning For Multimodal Emotion Recognition. ICASSP 2024: 11051-11055 - [c4]Haoqin Sun, Shiwan Zhao, Xiangyu Kong, Xuechen Wang, Hui Wang, Jiaming Zhou, Yong Qin:
Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition. INTERSPEECH 2024 - [c3]Hui Wang, Shiwan Zhao, Jiaming Zhou, Xiguang Zheng, Haoqin Sun, Xuechen Wang, Yong Qin:
Uncertainty-Aware Mean Opinion Score Prediction. INTERSPEECH 2024 - [i11]Jiaming Zhou, Shiwan Zhao, Hui Wang, Tian-Hao Zhang, Haoqin Sun, Xuechen Wang, Yong Qin:
Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores. CoRR abs/2406.03814 (2024) - [i10]Aobo Kong, Shiwan Zhao, Hao Chen, Qicheng Li, Yong Qin, Ruiqi Sun, Xin Zhou, Jiaming Zhou, Haoqin Sun:
Self-Prompt Tuning: Enable Autonomous Role-Playing in LLMs. CoRR abs/2407.08995 (2024) - [i9]Haoqin Sun, Shiwan Zhao, Shaokai Li, Xiangyu Kong, Xuechen Wang, Aobo Kong, Jiaming Zhou, Yong Chen, Wenjia Zeng, Yong Qin:
Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework. CoRR abs/2407.09029 (2024) - [i8]Haoqin Sun, Shiwan Zhao, Xiangyu Kong, Xuechen Wang, Hui Wang, Jiaming Zhou, Yong Qin:
Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition. CoRR abs/2408.00325 (2024) - [i7]Hui Wang, Shiwan Zhao, Jiaming Zhou, Xiguang Zheng, Haoqin Sun, Xuechen Wang, Yong Qin:
Uncertainty-Aware Mean Opinion Score Prediction. CoRR abs/2408.12829 (2024) - [i6]Jiaming Zhou, Shiwan Zhao, Jiabei He, Hui Wang, Wenjia Zeng, Yong Chen, Haoqin Sun, Aobo Kong, Yong Qin:
M2R-Whisper: Multi-stage and Multi-scale Retrieval Augmentation for Enhancing Whisper. CoRR abs/2409.11889 (2024) - [i5]Jiaming Zhou, Shiyao Wang, Shiwan Zhao, Jiabei He, Haoqin Sun, Hui Wang, Cheng Liu, Aobo Kong, Yujie Guo, Yong Qin:
ChildMandarin: A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5. CoRR abs/2409.18584 (2024) - [i4]Shaokai Li, Yixuan Ji, Peng Song, Haoqin Sun, Wenming Zheng:
Feature distribution Adaptation Network for Speech Emotion Recognition. CoRR abs/2410.22023 (2024) - [i3]Xuechen Wang, Shiwan Zhao, Haoqin Sun, Hui Wang, Jiaming Zhou, Yong Qin:
Enhancing Multimodal Emotion Recognition through Multi-Granularity Cross-Modal Alignment. CoRR abs/2412.20821 (2024) - 2023
- [j4]Yang Liu, Yuqi Xia, Haoqin Sun, Xiaolei Meng, Jianxiong Bai, Wenbo Guan, Zhen Zhao, Yongwei Li:
A Multitask Learning Approach Based on Cascaded Attention Network and Self-Adaption Loss for Speech Emotion Recognition. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 106(6): 876-885 (2023) - [j3]Yang Liu
, Haoqin Sun
, Wenbo Guan, Yuqi Xia, Zhen Zhao
:
Speech Emotion Recognition Using Cascaded Attention Network with Joint Loss for Discrimination of Confusions. Mach. Intell. Res. 20(4): 595-604 (2023) - [j2]Yang Liu
, Haoqin Sun
, Wenbo Guan
, Yuqi Xia
, Yongwei Li
, Masashi Unoki
, Zhen Zhao
:
A Discriminative Feature Representation Method Based on Cascaded Attention Network With Adversarial Strategy for Speech Emotion Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1063-1074 (2023) - [c2]Yang Liu, Haoqin Sun, Geng Chen, Qingyue Wang, Zhen Zhao, Xugang Lu, Longbiao Wang:
Multi-Level Knowledge Distillation for Speech Emotion Recognition in Noisy Conditions. INTERSPEECH 2023: 1893-1897 - [i2]Yang Liu, Haoqin Sun, Geng Chen, Qingyue Wang, Zhen Zhao, Xugang Lu, Longbiao Wang:
Multi-Level Knowledge Distillation for Speech Emotion Recognition in Noisy Conditions. CoRR abs/2312.13556 (2023) - [i1]Haoqin Sun, Shiwan Zhao, Xuechen Wang, Wenjia Zeng, Yong Chen, Yong Qin:
Fine-grained Disentangled Representation Learning for Multimodal Emotion Recognition. CoRR abs/2312.13567 (2023) - 2022
- [j1]Yang Liu
, Haoqin Sun, Wenbo Guan, Yuqi Xia, Zhen Zhao:
Multi-modal speech emotion recognition using self-attention mechanism and multi-scale fusion framework. Speech Commun. 139: 1-9 (2022) - [c1]Yang Liu
, Haoqin Sun, Wenbo Guan, Yuqi Xia, Zhen Zhao:
Discriminative Feature Representation Based on Cascaded Attention Network with Adversarial Joint Loss for Speech Emotion Recognition. INTERSPEECH 2022: 4750-4754
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-10-19 21:52 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint