Deep contextualized acoustic representations for semi-supervised speech recognition S Ling, Y Liu, J Salazar, K Kirchhoff ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 171 | 2020 |
DeCoAR 2.0: Deep contextualized acoustic representations with vector quantization S Ling, Y Liu arXiv preprint arXiv:2012.06659, 2020 | 113 | 2020 |
Word embeddings with limited memory S Ling, Y Song, D Roth Proceedings of the 54th Annual Meeting of the Association for Computational …, 2016 | 35 | 2016 |
BERTphone: Phonetically-aware encoder representations for utterance-level speaker and language recognition S Ling, J Salazar, Y Liu, K Kirchhoff arXiv preprint arXiv:1907.00457, 2019 | 31 | 2019 |
CogCompNLP: Your Swiss army knife for NLP D Khashabi, M Sammons, B Zhou, T Redman, C Christodoulopoulos, ... Proceedings of the eleventh international conference on Language Resources …, 2018 | 22 | 2018 |
Adapting large language model with speech for fully formatted end-to-end speech recognition S Ling, Y Hu, S Qian, G Ye, Y Qian, Y Gong, E Lin, M Zeng ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 12 | 2024 |
Improving pseudo-label training for end-to-end speech recognition using gradient mask S Ling, C Shen, M Cai, Z Ma ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 7 | 2022 |
Contextual phonetic pre-training for end-to-end utterance-level language and speaker recognition S Ling, J Salazar, K Kirchhoff | 2 | 2019 |
Hybrid Attention-Based Encoder-Decoder Model for Efficient Language Model Adaptation S Ling, G Ye, R Zhao, Y Gong 2024 IEEE Spoken Language Technology Workshop (SLT), 49-55, 2024 | 1 | 2024 |
Efficient Long-Form Speech Recognition for General Speech In-Context Learning H Yen, S Ling, G Ye arXiv preprint arXiv:2409.19757, 2024 | | 2024 |