Streaming end-to-end speech recognition for mobile devices Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 772 | 2019 |
A streaming on-device end-to-end model surpassing server-side conventional model quality and latency TN Sainath, Y He, B Li, A Narayanan, R Pang, A Bruguier, S Chang, W Li, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 236 | 2020 |
Deep context: end-to-end contextual speech recognition G Pundak, TN Sainath, R Prabhavalkar, A Kannan, D Zhao 2018 IEEE Spoken Language Technology Workshop (SLT), 418-425, 2018 | 210 | 2018 |
Shallow-Fusion End-to-End Contextual Biasing D Zhao, TN Sainath, D Rybach, P Rondon, D Bhatia, B Li, R Pang Interspeech, 1418-1422, 2019 | 175 | 2019 |
Dynamic sparsity neural networks for automatic speech recognition Z Wu, D Zhao, Q Liang, J Yu, A Gulati, R Pang ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 50 | 2021 |
Using context information with end-to-end models for speech recognition D Zhao, B Li, R Pang, TN Sainath, D Rybach, D Bhatia, Z Wu US Patent 11,545,142, 2023 | 19 | 2023 |
A unified cascaded encoder ASR model for dynamic model sizes S Ding, W Wang, D Zhao, TN Sainath, Y He, R David, R Botros, X Wang, ... arXiv preprint arXiv:2204.06164, 2022 | 18 | 2022 |
Federated pruning: Improving neural network efficiency with federated learning R Lin, Y Xiao, TJ Yang, D Zhao, L Xiong, G Motta, F Beaufays arXiv preprint arXiv:2209.06359, 2022 | 11 | 2022 |
Personalized keyphrase detection using speaker and environment information R Rikhye, Q Wang, Q Liang, Y He, D Zhao, A Narayanan, I McGraw arXiv preprint arXiv:2104.13970, 2021 | 10 | 2021 |
Sharing low rank conformer weights for tiny always-on ambient speech recognition models SM Hernandez, D Zhao, S Ding, A Bruguier, R Prabhavalkar, TN Sainath, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 9 | 2023 |
Multi-output RNN-T joint networks for multi-task learning of ASR and auxiliary tasks W Wang, D Zhao, S Ding, H Zhang, SY Chang, D Rybach, TN Sainath, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 6 | 2023 |
Contextual biasing with the Knuth-Morris-Pratt matching algorithm W Wang, Z Wu, D Caseiro, T Munkhdalai, KC Sim, P Rondon, G Pundak, ... arXiv preprint arXiv:2310.00178, 2023 | 3 | 2023 |
Multi-Output Decoders for Multi-Task Learning of ASR and Auxiliary Tasks W Wang, D Zhao, S Ding, H Zhang, S Chang, DJ Rybach, TN Sainath, ... US Patent App. 18/494,984, 2024 | | 2024 |
Unified Cascaded Encoder ASR model for Dynamic Model Sizes S Ding, Y He, X Wang, W Wang, T Strohman, TN Sainath, ... US Patent App. 18/182,925, 2023 | | 2023 |