Device-robust acoustic scene classification based on two-stage categorization and data augmentation H Hu, CHH Yang, X Xia, X Bai, X Tang, Y Wang, S Niu, L Chai, J Li, H Zhu, ... arXiv preprint arXiv:2007.08389, 2020 | 78 | 2020 |
A gender mixture detection approach to unsupervised single-channel speech separation based on deep neural networks Y Wang, J Du, LR Dai, CH Lee IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (7), 1535 …, 2017 | 52 | 2017 |
A two-stage approach to device-robust acoustic scene classification H Hu, CHH Yang, X Xia, X Bai, X Tang, Y Wang, S Niu, L Chai, J Li, H Zhu, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 47 | 2021 |
A Maximum Likelihood Approach to Deep Neural Network Based Nonlinear Spectral Mapping for Single-Channel Speech Separation. Y Wang, J Du, LR Dai, CH Lee Interspeech, 1178-1182, 2017 | 25 | 2017 |
A lottery ticket hypothesis framework for low-complexity device-robust neural acoustic scene classification H Yen, CHH Yang, H Hu, SM Siniscalchi, Q Wang, Y Wang, X Xia, Y Zhao, ... arXiv preprint arXiv:2107.01461, 2021 | 24 | 2021 |
Unsupervised single-channel speech separation via deep neural network for different gender mixtures Y Wang, J Du, LR Dai, CH Lee 2016 Asia-Pacific Signal and Information Processing Association Annual …, 2016 | 21 | 2016 |
Tensor-to-vector regression for multi-channel speech enhancement based on tensor-train network J Qi, H Hu, Y Wang, CHH Yang, SM Siniscalchi, CH Lee ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 20 | 2020 |
Interspeech 2021 conferencingspeech challenge: Towards far-field multi-channel speech enhancement for video conferencing W Rao, Y Fu, Y Hu, X Xu, Y Jv, J Han, Z Jiang, L Xie, Y Wang, ... arXiv preprint arXiv:2104.00960, 2021 | 19 | 2021 |
Speech enhancement with fullband-subband cross-attention network J Chen, W Rao, Z Wang, Z Wu, Y Wang, T Yu, S Shang, H Meng arXiv preprint arXiv:2211.05432, 2022 | 15 | 2022 |
Exploring deep hybrid tensor-to-vector network architectures for regression based speech enhancement J Qi, H Hu, Y Wang, CHH Yang, SM Siniscalchi, CH Lee arXiv preprint arXiv:2007.13024, 2020 | 13 | 2020 |
Tea-pse 3.0: Tencent-ethereal-audio-lab personalized speech enhancement system for icassp 2023 dns-challenge Y Ju, J Chen, S Zhang, S He, W Rao, W Zhu, Y Wang, T Yu, S Shang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 12 | 2023 |
Improving channel decorrelation for multi-channel target speech extraction J Han, W Rao, Y Wang, Y Long arXiv preprint arXiv:2106.03113, 2021 | 10 | 2021 |
Inter-subnet: Speech enhancement with subband interaction J Chen, W Rao, Z Wang, J Lin, Z Wu, Y Wang, S Shang, H Meng ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 9 | 2023 |
Relational teacher student learning with neural label embedding for device adaptation in acoustic scene classification H Hu, SM Siniscalchi, Y Wang, CH Lee arXiv preprint arXiv:2008.00110, 2020 | 9 | 2020 |
Mc-spex: Towards effective speaker extraction with multi-scale interfusion and conditional speaker modulation J Chen, W Rao, Z Wang, J Lin, Y Ju, S He, Y Wang, Z Wu arXiv preprint arXiv:2306.16250, 2023 | 8 | 2023 |
Gesper: A unified framework for general speech restoration J Chen, Y Shi, W Liu, W Rao, S He, A Li, Y Wang, Z Wu, S Shang, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 8 | 2023 |
An attention based speaker-independent audio-visual deep learning model for speech enhancement Z Sun, Y Wang, L Cao MultiMedia Modeling: 26th International Conference, MMM 2020, Daejeon, South …, 2020 | 8 | 2020 |
A progressive deep learning approach to child speech separation X Wang, J Du, L Sun, Q Wang, CH Lee 2018 11th International Symposium on Chinese Spoken Language Processing …, 2018 | 7 | 2018 |
A fusion approach to spoken language identification based on combining multiple phone recognizers and speech attribute detectors Y Wang, J Du, L Dai, CH Lee The 9th International Symposium on Chinese Spoken Language Processing, 158-162, 2014 | 7 | 2014 |
A LSTM-based joint progressive learning framework for simultaneous speech dereverberation and denoising X Tang, J Du, L Chai, Y Wang, Q Wang, CH Lee 2019 Asia-Pacific Signal and Information Processing Association Annual …, 2019 | 6 | 2019 |