Follow
Hao WANG
Title
Cited by
Cited by
Year
Rat-resilient allreduce tree for distributed machine learning
X Wan, H Zhang, H Wang, S Hu, J Zhang, K Chen
Proceedings of the 4th Asia-Pacific Workshop on Networking, 52-57, 2020
382020
Tacc: A full-stack cloud computing infrastructure for machine learning tasks
K Xu, X Wan, H Wang, Z Ren, X Liao, D Sun, C Zeng, K Chen
arXiv preprint arXiv:2110.01556, 2021
322021
Domain-specific communication optimization for distributed DNN training
H Wang, J Chen, X Wan, H Tian, J Xia, G Zeng, W Wang, K Chen, W Bai, ...
arXiv preprint arXiv:2008.08445, 2020
222020
Efficient data-plane memory scheduling for in-network aggregation
H Wang, Y Qin, CL Lao, Y Le, W Wu, K Chen
arXiv preprint arXiv:2201.06398, 2022
202022
Fine-grained probability counting for cardinality estimation of data streams
L Wang, T Yang, H Wang, J Jiang, Z Cai, B Cui, X Li
World Wide Web 22, 2065-2081, 2019
122019
Towards {Domain-Specific} Network Transport for Distributed {DNN} Training
H Wang, H Tian, J Chen, X Wan, J Xia, G Zeng, W Bai, J Jiang, Y Wang, ...
21st USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2024
112024
FID-sketch: An accurate sketch to store frequencies in data streams
T Yang, H Zhang, H Wang, M Shahzad, X Liu, Q Xin, X Li
World Wide Web 22, 2675-2696, 2019
112019
Autobyte: Automatic configuration for optimal communication scheduling in dnn training
Y Ma, H Wang, Y Zhang, K Chen
IEEE INFOCOM 2022-IEEE Conference on Computer Communications, 760-769, 2022
92022
Fine-grained probability counting: Refined loglog algorithm
L Wang, Z Cai, H Wang, J Jiang, T Yang, B Cui, X Li
2018 IEEE International Conference on Big Data and Smart Computing (BigComp …, 2018
92018
Qcluster: Clustering packets for flow scheduling
T Yang, J Li, Y Zhao, K Yang, H Wang, J Jiang, Y Zhang, N Zhang
Proceedings of the ACM Web Conference 2022, 1752-1763, 2022
82022
ID bloom filter: Achieving faster multi-set membership query in network applications
P Liu, H Wang, S Gao, T Yang, L Zou, L Uden, X Li
2018 IEEE International Conference on Communications (ICC), 1-6, 2018
52018
Preemptive Switch Memory Usage to Accelerate Training Jobs with Shared In-Network Aggregation
H Wang, Y Qin, CL Lao, Y Le, W Wu, K Chen
2023 IEEE 31th International Conference on Network Protocols (ICNP), 2023
42023
Herald: An embedding scheduler for distributed embedding model training
C Zeng, X Cheng, H Tian, H Wang, K Chen
Proceedings of the 6th Asia-Pacific Workshop on Networking, 50-56, 2022
32022
Automatic configuration for optimal communication scheduling in DNN training
Y Ma, H Wang, Y Zhang, K Chen
arXiv preprint arXiv:2112.13509, 2021
32021
Accelerating Neural Recommendation Training with Embedding Scheduling
C Zeng, X Liao, X Cheng, H Tian, X Wan, H Wang, K Chen
21st USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2024
22024
AutoPipe: Automatic Configuration of Pipeline Parallelism in Shared GPU Cluster
J Hu, Y Liu, H Wang, J Wang
Proceedings of the 53rd International Conference on Parallel Processing, 443-452, 2024
12024
The system can't perform the operation now. Try again later.
Articles 1–16