Cited By
View all- Guo ZTang YZhai JYuan TJin JWang LZhao YLi R(2024)A Survey on Performance Modeling and Prediction for Distributed DNN TrainingIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2024.347639035:12(2463-2478)Online publication date: Dec-2024
- Tang YYuan TCao FWang LGuo ZZhao YLi R(2024)Simulating LLM Training in CXL-Based Heterogeneous Computing ClusterIEEE INFOCOM 2024 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)10.1109/INFOCOMWKSHPS61880.2024.10620705(1-6)Online publication date: 20-May-2024