DeepSpeed-Chat: Easy, fast and affordable RLHF training of ChatGPT-like models at all scales Z Yao, RY Aminabadi, O Ruwase, S Rajbhandari, X Wu, AA Awan, ... arXiv preprint arXiv:2308.01320, 2023 | 28 | 2023 |
Flash-LLM: Enabling cost-effective and highly-efficient large generative model inference with unstructured sparsity H Xia, Z Zheng, Y Li, D Zhuang, Z Zhou, X Qiu, Y Li, W Lin, SL Song arXiv preprint arXiv:2309.10285, 2023 | 13 | 2023 |
RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model F Bie, Y Yang, Z Zhou, A Ghanem, M Zhang, Z Yao, X Wu, C Holmes, ... arXiv preprint arXiv:2309.00810, 2023 | 6 | 2023 |
Binary neural network for automated visual surface defect detection W Liu, J Zhang, Z Su, Z Zhou, L Liu Sensors 21 (20), 6868, 2021 | 6 | 2021 |
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design H Xia, Z Zheng, X Wu, S Chen, Z Yao, S Youn, A Bakhtiari, M Wyatt, ... arXiv preprint arXiv:2401.14112, 2024 | 1 | 2024 |
DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies SL Song, B Kruft, M Zhang, C Li, S Chen, C Zhang, M Tanaka, X Wu, ... arXiv preprint arXiv:2310.04610, 2023 | 1 | 2023 |
JSidentify: A hybrid framework for detecting plagiarism among JavaScript code in online mini games Q Xia, Z Zhou, Z Li, B Xu, W Zou, Z Chen, H Ma, G Liang, H Lu, S Guo, ... Proceedings of the ACM/IEEE 42nd International Conference on Software …, 2020 | 1 | 2020 |