default search action
Haoqin Sun
- > Home > Persons > Haoqin Sun
Publications
- 2025
- [j5]Hui Wang
, Yifan Yang
, Shujie Liu, Jinyu Li
, Lingwei Meng, Tie-Yan Liu, Jiaming Zhou
, Haoqin Sun, Yan Lu, Yong Qin:
StreamMel: Real-Time Zero-Shot Text-to-Speech Via Interleaved Continuous Autoregressive Modeling. IEEE Signal Process. Lett. 32: 3530-3534 (2025) - [c12]Jiaming Zhou, Shiyao Wang, Shiwan Zhao, Jiabei He, Haoqin Sun, Hui Wang, Cheng Liu, Aobo Kong, Yujie Guo, Xi Yang, Yequan Wang, Yonghua Lin, Yong Qin:
ChildMandarin: A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5. ACL (1) 2025: 12524-12537 - [c11]Jiabei He, Shiwan Zhao, Jiaming Zhou, Haoqin Sun, Hui Wang, Yong Qin:
Emotion-Preserving Prosody Anonymization Network for Voice Privacy Protection. ICASSP 2025: 1-5 - [c10]Cheng Liu, Hui Wang, Jinghua Zhao, Shiwan Zhao, Hui Bu, Xin Xu, Jiaming Zhou, Haoqin Sun, Yong Qin:
MusicEval: A Generative Music Dataset with Expert Ratings for Automatic Text-to-Music Evaluation. ICASSP 2025: 1-5 - [c8]Xuechen Wang, Shiwan Zhao, Haoqin Sun, Hui Wang, Jiaming Zhou, Yong Qin:
Enhancing Multimodal Emotion Recognition through Multi-Granularity Cross-Modal Alignment. ICASSP 2025: 1-5 - [c7]Jiaming Zhou, Shiwan Zhao, Jiabei He, Hui Wang, Wenjia Zeng, Yong Chen, Haoqin Sun, Aobo Kong, Yong Qin:
M2R-Whisper: Multi-stage and Multi-scale Retrieval Augmentation for Enhancing Whisper. ICASSP 2025: 1-5 - [c6]Jiaming Zhou, Shiwan Zhao, Hui Wang, Tian-Hao Zhang, Haoqin Sun, Xuechen Wang, Yong Qin:
Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores. ICASSP 2025: 1-5 - [i24]Cheng Liu, Hui Wang, Jinghua Zhao, Shiwan Zhao, Hui Bu, Xin Xu, Jiaming Zhou, Haoqin Sun, Yong Qin:
MusicEval: A Generative Music Corpus with Expert Ratings for Automatic Text-to-Music Evaluation. CoRR abs/2501.10811 (2025) - [i23]Hui Wang, Shujie Liu, Lingwei Meng, Jinyu Li, Yifan Yang, Shiwan Zhao, Haiyang Sun, Yanqing Liu, Haoqin Sun, Jiaming Zhou, Yan Lu, Yong Qin:
FELLE: Autoregressive Speech Synthesis with Token-Wise Coarse-to-Fine Flow Matching. CoRR abs/2502.11128 (2025) - [i22]Jiaming Zhou, Yujie Guo, Shiwan Zhao, Haoqin Sun, Hui Wang, Jiabei He, Aobo Kong, Shiyao Wang, Xi Yang, Yequan Wang, Yonghua Lin, Yong Qin:
CS-Dialogue: A 104-Hour Dataset of Spontaneous Mandarin-English Code-Switching Dialogues for Speech Recognition. CoRR abs/2502.18913 (2025) - [i20]Haoqin Sun, Jingguang Tian, Jiaming Zhou, Hui Wang, Jiabei He, Shiwan Zhao, Xiangyu Kong, Desheng Hu, Xinkang Xu, Xinhui Hu, Yong Qin:
RA-CLAP: Relation-Augmented Emotional Speaking Style Contrastive Language-Audio Pretraining For Speech Retrieval. CoRR abs/2505.19437 (2025) - [i19]Haoqin Sun, Xuechen Wang, Jinghua Zhao, Shiwan Zhao, Jiaming Zhou, Hui Wang, Jiabei He, Aobo Kong, Xi Yang, Yequan Wang, Yonghua Lin, Yong Qin:
EmotionTalk: An Interactive Chinese Multimodal Emotion Dataset With Rich Annotations. CoRR abs/2505.23018 (2025) - [i18]Hui Wang, Yifan Yang, Shujie Liu, Jinyu Li, Lingwei Meng, Yanqing Liu, Jiaming Zhou, Haoqin Sun, Yan Lu, Yong Qin:
StreamMel: Real-Time Zero-shot Text-to-Speech via Interleaved Continuous Autoregressive Modeling. CoRR abs/2506.12570 (2025) - [i17]Jiaming Zhou, Hongjie Chen, Shiwan Zhao, Jian Kang, Jie Li, Enzhi Wang, Yujie Guo, Haoqin Sun, Hui Wang, Aobo Kong, Yong Qin, Xuelong Li:
DIFFA: Large Language Diffusion Models Can Listen and Understand. CoRR abs/2507.18452 (2025) - [i14]Hui Wang, Cheng Liu, Junyang Chen, Haoze Liu, Yuhang Jia, Shiwan Zhao, Jiaming Zhou, Haoqin Sun, Hui Bu, Yong Qin:
TTA-Bench: A Comprehensive Benchmark for Evaluating Text-to-Audio Models. CoRR abs/2509.02398 (2025) - [i13]Jinghua Zhao, Hang Su, Lichun Fan, Zhenbo Luo, Hui Wang, Haoqin Sun, Yong Qin:
Omni-CLST: Error-aware Curriculum Learning with guided Selective chain-of-Thought for audio question answering. CoRR abs/2509.12275 (2025) - [i12]Haoqin Sun, Chenyang Lyu, Xiangyu Kong, Shiwan Zhao, Jiaming Zhou, Hui Wang, Aobo Kong, Jinghua Zhao, Longyue Wang, Weihua Luo, Kaifu Zhang, Yong Qin:
MECap-R1: Emotion-aware Policy with Reinforcement Learning for Multimodal Emotion Captioning. CoRR abs/2509.18729 (2025) - 2024
- [c4]Haoqin Sun, Shiwan Zhao, Xiangyu Kong, Xuechen Wang, Hui Wang, Jiaming Zhou, Yong Qin:
Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition. INTERSPEECH 2024 - [c3]Hui Wang, Shiwan Zhao, Jiaming Zhou, Xiguang Zheng, Haoqin Sun, Xuechen Wang, Yong Qin:
Uncertainty-Aware Mean Opinion Score Prediction. INTERSPEECH 2024 - [i11]Jiaming Zhou, Shiwan Zhao, Hui Wang, Tian-Hao Zhang, Haoqin Sun, Xuechen Wang, Yong Qin:
Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores. CoRR abs/2406.03814 (2024) - [i8]Haoqin Sun, Shiwan Zhao, Xiangyu Kong, Xuechen Wang, Hui Wang, Jiaming Zhou, Yong Qin:
Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition. CoRR abs/2408.00325 (2024) - [i7]Hui Wang, Shiwan Zhao, Jiaming Zhou, Xiguang Zheng, Haoqin Sun, Xuechen Wang, Yong Qin:
Uncertainty-Aware Mean Opinion Score Prediction. CoRR abs/2408.12829 (2024) - [i6]Jiaming Zhou, Shiwan Zhao, Jiabei He, Hui Wang, Wenjia Zeng, Yong Chen, Haoqin Sun, Aobo Kong, Yong Qin:
M2R-Whisper: Multi-stage and Multi-scale Retrieval Augmentation for Enhancing Whisper. CoRR abs/2409.11889 (2024) - [i5]Jiaming Zhou, Shiyao Wang, Shiwan Zhao, Jiabei He, Haoqin Sun, Hui Wang, Cheng Liu, Aobo Kong, Yujie Guo, Yong Qin:
ChildMandarin: A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5. CoRR abs/2409.18584 (2024) - [i3]Xuechen Wang, Shiwan Zhao, Haoqin Sun, Hui Wang, Jiaming Zhou, Yong Qin:
Enhancing Multimodal Emotion Recognition through Multi-Granularity Cross-Modal Alignment. CoRR abs/2412.20821 (2024)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-10-19 21:52 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint