Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding S Han, H Mao, WJ Dally International Conference on Learning Representations (ICLR'16 best paper award), 2015 | 11139 | 2015 |
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and< 0.5MB model size FN Iandola, S Han, MW Moskewicz, K Ashraf, WJ Dally, K Keutzer arXiv preprint arXiv:1602.07360, 2016 | 10443 | 2016 |
Learning both Weights and Connections for Efficient Neural Network S Han, J Pool, J Tran, W Dally Advances in Neural Information Processing Systems (NIPS), 1135-1143, 2015 | 8249 | 2015 |
EIE: Efficient Inference Engine on Compressed Deep Neural Network S Han, X Liu, H Mao, J Pu, A Pedram, MA Horowitz, WJ Dally International Symposium on Computer Architecture (ISCA 2016), 2016 | 3290 | 2016 |
Deep leakage from gradients L Zhu, Z Liu, S Han Advances in neural information processing systems 32, 2019 | 2496 | 2019 |
TSM: Temporal shift module for efficient video understanding J Lin, C Gan, S Han Proceedings of the IEEE International Conference on Computer Vision, 7083-7093, 2019 | 2264 | 2019 |
ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware H Cai, L Zhu, S Han International Conference on Learning Representations (ICLR) 2019, 2018 | 2243 | 2018 |
AMC: Automl for model compression and acceleration on mobile devices Y He, J Lin, Z Liu, H Wang, LJ Li, S Han Proceedings of the European Conference on Computer Vision (ECCV), 784-800, 2018 | 1762 | 2018 |
Deep gradient compression: Reducing the communication bandwidth for distributed training Y Lin, S Han, H Mao, Y Wang, WJ Dally International Conference on Learning Representations (ICLR) 2018, 2017 | 1647 | 2017 |
Once-for-all: Train one network and specialize it for efficient deployment H Cai, C Gan, T Wang, Z Zhang, S Han International Conference on Learning Representations (ICLR) 2020, 2019 | 1481 | 2019 |
Trained Ternary Quantization C Zhu, S Han, H Mao, WJ Dally International Conference on Learning Representations (ICLR) 2017, 2016 | 1348 | 2016 |
HAQ: Hardware-aware automated quantization with mixed precision K Wang, Z Liu, Y Lin, J Lin, S Han Proceedings of the IEEE conference on computer vision and pattern …, 2019 | 1158 | 2019 |
Model compression and hardware acceleration for neural networks: A comprehensive survey L Deng, G Li, S Han, L Shi, Y Xie Proceedings of the IEEE 108 (4), 485-532, 2020 | 922 | 2020 |
ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA. S Han, J Kang, H Mao, Y Hu, X Li, Y Li, D Xie, H Luo, S Yao, Y Wang, ... International Symposium on Field-Programmable Gate Arrays (FPGA'17), 75-84, 2017 | 850 | 2017 |
Bevfusion: Multi-task multi-sensor fusion with unified bird's-eye view representation Z Liu, H Tang, A Amini, X Yang, H Mao, DL Rus, S Han 2023 IEEE international conference on robotics and automation (ICRA), 2774-2781, 2023 | 818 | 2023 |
Point-voxel cnn for efficient 3d deep learning Z Liu, H Tang, Y Lin, S Han Advances in neural information processing systems 32, 2019 | 777 | 2019 |
Searching efficient 3d architectures with sparse point-voxel convolution H Tang, Z Liu, S Zhao, Y Lin, J Lin, H Wang, S Han European conference on computer vision, 685-702, 2020 | 715 | 2020 |
Differentiable augmentation for data-efficient gan training S Zhao, Z Liu, J Lin, JY Zhu, S Han NeurIPS'20, 2020 | 686 | 2020 |
Smoothquant: Accurate and efficient post-training quantization for large language models G Xiao, J Lin, M Seznec, H Wu, J Demouth, S Han International Conference on Machine Learning, 38087-38099, 2023 | 685 | 2023 |
Angel-eye: A complete design flow for mapping CNN onto embedded FPGA K Guo, L Sui, J Qiu, J Yu, J Wang, S Yao, S Han, Y Wang, H Yang IEEE transactions on computer-aided design of integrated circuits and …, 2017 | 621 | 2017 |