Fan Yu 0002

Person information

affiliation: Alibaba Group, Speech Lab of DAMO Academy, China

Conference and Workshop Papers

2025
[c23]
Ziyang Ma, Guanrou Yang, Yifan Yang, Zhifu Gao, Jiaming Wang, Zhihao Du, Fan Yu, Qian Chen, Siqi Zheng, Shiliang Zhang, Xie Chen:
Speech Recognition Meets Large Language Model: Benchmarking, Models, and Exploration. AAAI 2025: 24840-24848
[c22]
Guanrou Yang, Fan Yu, Ziyang Ma, Zhihao Du, Zhifu Gao, Shiliang Zhang, Xie Chen:
Enhancing Low-Resource ASR through Versatile TTS: Bridging the Data Gap. ICASSP 2025: 1-5
2024
[c21]
Fan Yu, Haoxu Wang, Ziyang Ma, Shiliang Zhang:
Hourglass-AVSR: Down-Up Sampling-Based Computational Efficiency Model for Audio-Visual Speech Recognition. ICASSP 2024: 7940-7944
[c20]
Fan Yu, Haoxu Wang, Xian Shi, Shiliang Zhang:
LCB-Net: Long-Context Biasing for Audio-Visual Speech Recognition. ICASSP 2024: 10621-10625
[c19]
Haoxu Wang, Fan Yu, Xian Shi, Yuezhang Wang, Shiliang Zhang, Ming Li:
SlideSpeech: A Large Scale Slide-Enriched Audio-Visual Corpus. ICASSP 2024: 11076-11080
[c18]
Guanrou Yang, Ziyang Ma, Fan Yu, Zhifu Gao, Shiliang Zhang, Xie Chen:
MaLa-ASR: Multimedia-Assisted LLM-Based ASR. INTERSPEECH 2024
2023
[c17]
Mohan Shi, Jie Zhang, Zhihao Du, Fan Yu, Qian Chen, Shiliang Zhang, Li-Rong Dai:
A Comparative Study on Multichannel Speaker-Attributed Automatic Speech Recognition in Multi-party Meetings. APSIPA ASC 2023: 1943-1948
[c16]
Peikun Chen, Fan Yu, Yuhao Liang, Hongfei Xue, Xucheng Wan, Naijun Zheng, Huan Zhou, Lei Xie:
BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition. ASRU 2023: 1-7
[c15]
Yangze Li, Fan Yu, Yuhao Liang, Pengcheng Guo, Mohan Shi, Zhihao Du, Shiliang Zhang, Lei Xie:
SA-Paraformer: Non-Autoregressive End-To-End Speaker-Attributed ASR. ASRU 2023: 1-7
[c14]
Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu:
The Second Multi-Channel Multi-Party Meeting Transcription Challenge (M2MeT 2.0): A Benchmark for Speaker-Attributed ASR. ASRU 2023: 1-8
[c13]
Mohan Shi, Zhihao Du, Qian Chen, Fan Yu, Yangze Li, Shiliang Zhang, Jie Zhang, Li-Rong Dai:
CASA-ASR: Context-Aware Speaker-Attributed ASR. INTERSPEECH 2023: 411-415
[c12]
Yuhao Liang, Fan Yu, Yangze Li, Pengcheng Guo, Shiliang Zhang, Qian Chen, Lei Xie:
BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR. INTERSPEECH 2023: 3487-3491
2022
[c11]
Fan Yu, Shiliang Zhang, Yihui Fu, Lei Xie, Siqi Zheng, Zhihao Du, Weilong Huang, Pengcheng Guo, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge. ICASSP 2022: 6167-6171
[c10]
Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. ICASSP 2022: 9156-9160
[c9]
Fan Yu, Zhihao Du, Shiliang Zhang, Yuxiao Lin, Lei Xie:
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings. INTERSPEECH 2022: 560-564
[c8]
Yuxiao Lin, Zhihao Du, Shiliang Zhang, Fan Yu, Zhou Zhao, Fei Wu:
Separate-to-Recognize: Joint Multi-target Speech Separation and Speech Recognition for Speaker-attributed ASR. ISCSLP 2022: 150-154
[c7]
Ao Zhang, Fan Yu, Kaixun Huang, Lei Xie, Longbiao Wang, Eng Siong Chng, Hui Bu, Binbin Zhang, Wei Chen, Xin Xu:
The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results. ISCSLP 2022: 507-511
[c6]
Yuhao Liang, Peikun Chen, Fan Yu, Xinfa Zhu, Tianyi Xu, Yingying Gao, Lei Xie:
The NPU-ASLP System for The ISCSLP 2022 Magichub Code-Swiching ASR Challenge. ISCSLP 2022: 532-536
[c5]
Fan Yu, Shiliang Zhang, Pengcheng Guo, Yuhao Liang, Zhihao Du, Yuxiao Lin, Lei Xie:
MFCCA: Multi-Frame Cross-Channel Attention for Multi-Speaker ASR in Multi-Party Meeting Scenario. SLT 2022: 144-151
2021
[c4]
Fan Yu, Haoneng Luo, Pengcheng Guo, Yuhao Liang, Zhuoyuan Yao, Lei Xie, Yingying Gao, Leijing Hou, Shilei Zhang:
Boundary and Context Aware Training for CIF-Based Non-Autoregressive End-to-End ASR. ASRU 2021: 328-334
[c3]
Xian Shi, Fan Yu, Yizhou Lu, Yuhao Liang, Qiangze Feng, Daliang Wang, Yanmin Qian, Lei Xie:
The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods. ICASSP 2021: 6918-6922
[c2]
Zhuoyuan Yao, Di Wu, Xiong Wang, Binbin Zhang, Fan Yu, Chao Yang, Zhendong Peng, Xiaoyu Chen, Lei Xie, Xin Lei:
WeNet: Production Oriented Streaming and Non-Streaming End-to-End Speech Recognition Toolkit. Interspeech 2021: 4054-4058
[c1]
Fan Yu, Zhuoyuan Yao, Xiong Wang, Keyu An, Lei Xie, Zhijian Ou, Bo Liu, Xiulin Li, Guanqiong Miao:
The SLT 2021 Children Speech Recognition Challenge: Open Datasets, Rules and Baselines. SLT 2021: 1117-1123

Informal and Other Publications

2025
[i26]
Qian Chen, Yafeng Chen, Yanni Chen, Mengzhe Chen, Yingda Chen, Chong Deng, Zhihao Du, Ruize Gao, Changfeng Gao, Zhifu Gao, Yabin Li, Xiang Lv, Jiaqing Liu, Haoneng Luo, Bin Ma, Chongjia Ni, Xian Shi, Jialong Tang, Hui Wang, Hao Wang, Wen Wang, Yuxuan Wang, Yunlan Xu, Fan Yu, Zhijie Yan, Yexin Yang, Baosong Yang, Xian Yang, Guanrou Yang, Tianyu Zhao, Qinglin Zhang, Shiliang Zhang, Nan Zhao, Pei Zhang, Chong Zhang, Jinren Zhou:
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction. CoRR abs/2501.06282 (2025)
[i25]
Guanrou Yang, Chen Yang, Qian Chen, Ziyang Ma, Wenxi Chen, Wen Wang, Tianrui Wang, Yifan Yang, Zhikang Niu, Wenrui Liu, Fan Yu, Zhihao Du, Zhifu Gao, Shiliang Zhang, Xie Chen:
EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting. CoRR abs/2504.12867 (2025)
[i24]
Zhihao Du, Changfeng Gao, Yuxuan Wang, Fan Yu, Tianyu Zhao, Hao Wang, Xiang Lv, Hui Wang, Chongjia Ni, Xian Shi, Keyu An, Guanrou Yang, Yabin Li, Yanni Chen, Zhifu Gao, Qian Chen, Yue Gu, Mengzhe Chen, Yafeng Chen, Shiliang Zhang, Wen Wang, Jieping Ye:
CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-training. CoRR abs/2505.17589 (2025)
2024
[i23]
Fan Yu, Haoxu Wang, Xian Shi, Shiliang Zhang:
LCB-net: Long-Context Biasing for Audio-Visual Speech Recognition. CoRR abs/2401.06390 (2024)
[i22]
Ziyang Ma, Guanrou Yang, Yifan Yang, Zhifu Gao, Jiaming Wang, Zhihao Du, Fan Yu, Qian Chen, Siqi Zheng, Shiliang Zhang, Xie Chen:
An Embarrassingly Simple Approach for LLM with Strong ASR Capacity. CoRR abs/2402.08846 (2024)
[i21]
Guanrou Yang, Ziyang Ma, Fan Yu, Zhifu Gao, Shiliang Zhang, Xie Chen:
MaLa-ASR: Multimedia-Assisted LLM-Based ASR. CoRR abs/2406.05839 (2024)
[i20]
Guanrou Yang, Fan Yu, Ziyang Ma, Zhihao Du, Zhifu Gao, Shiliang Zhang, Xie Chen:
Enhancing Low-Resource ASR through Versatile TTS: Bridging the Data Gap. CoRR abs/2410.16726 (2024)
[i19]
Zhihao Du, Yuxuan Wang, Qian Chen, Xian Shi, Xiang Lv, Tianyu Zhao, Zhifu Gao, Yexin Yang, Changfeng Gao, Hui Wang, Fan Yu, Huadai Liu, Zhengyan Sheng, Yue Gu, Chong Deng, Wen Wang, Shiliang Zhang, Zhijie Yan, Jingren Zhou:
CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models. CoRR abs/2412.10117 (2024)
2023
[i18]
Mohan Shi, Zhihao Du, Qian Chen, Fan Yu, Yangze Li, Shiliang Zhang, Jie Zhang, Li-Rong Dai:
CASA-ASR: Context-Aware Speaker-Attributed ASR. CoRR abs/2305.12459 (2023)
[i17]
Yuhao Liang, Fan Yu, Yangze Li, Pengcheng Guo, Shiliang Zhang, Qian Chen, Lei Xie:
BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR. CoRR abs/2305.13716 (2023)
[i16]
Haoxu Wang, Fan Yu, Xian Shi, Yuezhang Wang, Shiliang Zhang, Ming Li:
SlideSpeech: A Large-Scale Slide-Enriched Audio-Visual Corpus. CoRR abs/2309.05396 (2023)
[i15]
Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu:
The second multi-channel multi-party meeting transcription challenge (M2MeT 2.0): A benchmark for speaker-attributed ASR. CoRR abs/2309.13573 (2023)
[i14]
Peikun Chen, Fan Yu, Yuhao Liang, Hongfei Xue, Xucheng Wan, Naijun Zheng, Huan Zhou, Lei Xie:
BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition. CoRR abs/2310.02629 (2023)
[i13]
Yangze Li, Fan Yu, Yuhao Liang, Pengcheng Guo, Mohan Shi, Zhihao Du, Shiliang Zhang, Lei Xie:
SA-Paraformer: Non-autoregressive End-to-End Speaker-Attributed ASR. CoRR abs/2310.04863 (2023)
[i12]
Fan Yu, Haoxu Wang, Ziyang Ma, Shiliang Zhang:
Hourglass-AVSR: Down-Up Sampling-based Computational Efficiency Model for Audio-Visual Speech Recognition. CoRR abs/2312.08850 (2023)
2022
[i11]
Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. CoRR abs/2202.03647 (2022)
[i10]
Fan Yu, Zhihao Du, Shiliang Zhang, Yuxiao Lin, Lei Xie:
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings. CoRR abs/2203.16834 (2022)
[i9]
Fan Yu, Shiliang Zhang, Pengcheng Guo, Yuhao Liang, Zhihao Du, Yuxiao Lin, Lei Xie:
MFCCA: Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario. CoRR abs/2210.05265 (2022)
[i8]
Yuhao Liang, Peikun Chen, Fan Yu, Xinfa Zhu, Tianyi Xu, Lei Xie:
The NPU-ASLP System for The ISCSLP 2022 Magichub Code-Swiching ASR Challenge. CoRR abs/2210.14448 (2022)
[i7]
Ao Zhang, Fan Yu, Kaixun Huang, Lei Xie, Longbiao Wang, Eng Siong Chng, Hui Bu, Binbin Zhang, Wei Chen, Xin Xu:
The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results. CoRR abs/2211.01585 (2022)
2021
[i6]
Binbin Zhang, Di Wu, Chao Yang, Xiaoyu Chen, Zhendong Peng, Xiangming Wang, Zhuoyuan Yao, Xiong Wang, Fan Yu, Lei Xie, Xin Lei:
WeNet: Production First and Production Ready End-to-End Speech Recognition Toolkit. CoRR abs/2102.01547 (2021)
[i5]
Xian Shi, Fan Yu, Yizhou Lu, Yuhao Liang, Qiangze Feng, Daliang Wang, Yanmin Qian, Lei Xie:
The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods. CoRR abs/2102.10233 (2021)
[i4]
Fan Yu, Haoneng Luo, Pengcheng Guo, Yuhao Liang, Zhuoyuan Yao, Lei Xie, Yingying Gao, Leijing Hou, Shilei Zhang:
Boundary and Context Aware Training for CIF-based Non-Autoregressive End-to-end ASR. CoRR abs/2104.04702 (2021)
[i3]
Fan Yu, Shiliang Zhang, Yihui Fu, Lei Xie, Siqi Zheng, Zhihao Du, Weilong Huang, Pengcheng Guo, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge. CoRR abs/2110.07393 (2021)
2020
[i2]
Fan Yu, Zhuoyuan Yao, Xiong Wang, Keyu An, Lei Xie, Zhijian Ou, Bo Liu, Xiulin Li, Guanqiong Miao:
The SLT 2021 children speech recognition challenge: Open datasets, rules and baselines. CoRR abs/2011.06724 (2020)
[i1]
Binbin Zhang, Di Wu, Zhuoyuan Yao, Xiong Wang, Fan Yu, Chao Yang, Liyong Guo, Yaguang Hu, Lei Xie, Xin Lei:
Unified Streaming and Non-streaming Two-pass End-to-end Model for Speech Recognition. CoRR abs/2012.05481 (2020)
