Follow
Changho Hwang
Changho Hwang
Microsoft Research
Verified email at microsoft.com - Homepage
Title
Cited by
Cited by
Year
APUNet: Revitalizing GPU as Packet Processing Accelerator
Y Go, MA Jamshed, YG Moon, C Hwang, KS Park
NSDI (the 14th USENIX Symposium on Networked Systems Design and Implementation), 2017
1252017
Tutel: Adaptive mixture-of-experts at scale
C Hwang, W Cui, Y Xiong, Z Yang, Z Liu, H Hu, Z Wang, R Salas, J Jose, ...
MLSys (the 6th Conference on Machine Learning and Systems), 2023
115*2023
Elastic Resource Sharing for Distributed Deep Learning
C Hwang, T Kim, S Kim, J Shin, KS Park
NSDI (the 18th USENIX Symposium on Networked Systems Design and Implementation), 2021
852021
Confident Multiple Choice Learning
K Lee, C Hwang, KS Park, J Shin
ICML (the 34th International Conference on Machine Learning), 2017
652017
Pre-gated MoE: An algorithm-system co-design for fast and scalable mixture-of-expert inference
R Hwang, J Wei, S Cao, C Hwang, X Tang, T Cao, M Yang
ISCA (the 51st ACM/IEEE Annual International Symposium on Computer …, 2024
40*2024
Accelerating GNN training with locality-aware partial execution
T Kim, C Hwang, KS Park, Z Lin, P Cheng, Y Miao, L Ma, Y Xiong
APSys (the 12th ACM SIGOPS Asia-Pacific Workshop on Systems), 2021
112021
ARK: GPU-driven Code Execution for Distributed Deep Learning
C Hwang, KS Park, R Shu, X Qu, P Cheng, Y Xiong
NSDI (the 20th USENIX Symposium on Networked Systems Design and Implementation), 2023
92023
A case for two-stage inference with knowledge caching
G Park, C Hwang, KS Park
EMDL@MobiSys (the 3rd International Workshop on Deep Learning for Mobile …, 2019
32019
Alchemist: Towards the Design of Efficient Online Continual Learning System
Y Huang, Y Liu, HS Gunawi, B Li, C Hwang
arXiv preprint arXiv:2503.01066, 2025
2025
Mixture-of-experts layer with dynamic gating
Y Xiong, C Hwang, W Cui, Y Ziyue, Z Liu, H Hu, Z Wang, RO Salas, J Jose, ...
US Patent App. 18/054,451, 2024
2024
Mixture-of-experts layer with switchable parallel modes
Y Xiong, C Hwang, W Cui, Y Ziyue, Z Liu, H Hu, Z Wang, RO Salas, J Jose, ...
US Patent App. 18/054,446, 2024
2024
Collective communication phases at mixture-of-experts layer
Y Xiong, C Hwang, W Cui, Y Ziyue, Z Liu, H Hu, Z Wang, RO Salas, J Jose, ...
US Patent App. 18/054,452, 2024
2024
Sparse encoding and decoding at mixture-of-experts layer
Y Xiong, C Hwang, W Cui, Y Ziyue, Z Liu, H Hu, Z Wang, RO Salas, J Jose, ...
US Patent App. 18/318,436, 2024
2024
Immediate Communication for Distributed AI Tasks
J Xin, S Bae, KS Park, M Canini, C Hwang
HotInfra@SOSP (the 2nd Workshop on Hot Topics in System Infrastructure), 2024
2024
Towards GPU-driven Code Execution for Distributed Deep Learning
C Hwang, KS Park, R Shu, X Qu, P Cheng, Y Xiong
MLArchSys@ISCA (the 3rd Machine Learning for Computer Architecture and Systems), 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–15