Advancing acoustic-to-word CTC model J Li, G Ye, A Das, R Zhao, Y Gong 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 121 | 2018 |
Deep Convolutional Neural Networks with Layer-Wise Context Expansion and Attention. D Yu, W Xiong, J Droppo, A Stolcke, G Ye, J Li, G Zweig Interspeech, 17-21, 2016 | 119 | 2016 |
Towards code-switching ASR for end-to-end CTC models K Li, J Li, G Ye, R Zhao, Y Gong ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 106 | 2019 |
Developing far-field speaker system via teacher-student learning J Li, R Zhao, Z Chen, C Liu, X Xiao, G Ye, Y Gong 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 69 | 2018 |
Low latency end-to-end streaming speech recognition with a scout network C Wang, Y Wu, S Liu, J Li, L Lu, G Ye, M Zhou arXiv preprint arXiv:2003.10369, 2020 | 61 | 2020 |
Acoustic-to-word model without OOV J Li, G Ye, R Zhao, J Droppo, Y Gong 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 59 | 2017 |
End-to-end speaker-attributed ASR with Transformer N Kanda, G Ye, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka arXiv preprint arXiv:2104.02128, 2021 | 53 | 2021 |
Semantic mask for transformer based end-to-end speech recognition C Wang, Y Wu, Y Du, J Li, S Liu, L Lu, S Ren, G Ye, S Zhao, M Zhou arXiv preprint arXiv:1912.03010, 2019 | 50 | 2019 |
Large-scale pre-training of end-to-end multi-talker ASR for meeting transcription with single distant microphone N Kanda, G Ye, Y Wu, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka arXiv preprint arXiv:2103.16776, 2021 | 44 | 2021 |
Advancing acoustic-to-word CTC model with attention and mixed-units A Das, J Li, G Ye, R Zhao, Y Gong IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (12 …, 2019 | 34 | 2019 |
Adaptation of rnn transducer with text-to-speech technology for keyword spotting E Sharma, G Ye, W Wei, R Zhao, Y Tian, J Wu, L He, E Lin, Y Gong ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 26 | 2020 |
Rapid Speaker Adaptation for Conformer Transducer: Attention and Bias Are All You Need. Y Huang, G Ye, J Li, Y Gong Interspeech, 1309-1313, 2021 | 22 | 2021 |
On-device custom wake word detection E Stoimenov, R Zhao, KP Kalgaonkar, IA Enchev, K Shahid, AP Stark, ... US Patent 11,132,992, 2021 | 20 | 2021 |
Minimum word error rate training with language model fusion for end-to-end speech recognition Z Meng, Y Wu, N Kanda, L Lu, X Chen, G Ye, E Sun, J Li, Y Gong arXiv preprint arXiv:2106.02302, 2021 | 20 | 2021 |
Generating and using text-to-speech data for speech recognition models YE Guoli, Y Huang, W Wei, L He, E Sharma, J Wu, T Yao, EC Lin, Y Gong, ... US Patent 11,587,569, 2023 | 17 | 2023 |
Adapting large language model with speech for fully formatted end-to-end speech recognition S Ling, Y Hu, S Qian, G Ye, Y Qian, Y Gong, E Lin, M Zeng ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 12 | 2024 |
Fast GMM computation for speaker verification using scalar quantization and discrete densities. G Ye, B Mak, MW Mak INTERSPEECH, 2327-2330, 2009 | 10 | 2009 |
Have best of both worlds: Two-pass hybrid and E2E cascading framework for speech recognition G Ye, V Mazalov, J Li, Y Gong ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 9 | 2022 |
Wake word selection assistance architectures and methods E Stoimenov, K Shahid, YE Guoli, HA Khalil, Y Gong US Patent 11,222,622, 2022 | 6 | 2022 |
Exploring sequential characteristics in speaker bottleneck feature for text-dependent speaker verification L Chen, Y Zhao, SX Zhang, J Li, G Ye, F Soong 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 6 | 2018 |