Cited By
View all- Unnikrishnan NParhi K(2023)InterGrad: Energy-Efficient Training of Convolutional Neural Networks via Interleaved Gradient SchedulingIEEE Transactions on Circuits and Systems I: Regular Papers10.1109/TCSI.2023.324646870:5(1949-1962)Online publication date: May-2023
- Qian RCao BGao MShi QWang YXu YHuo QQiu K(2023)EagerReuse: An Efficient Memory Reuse Approach for Complex Computational Graph2023 IEEE 29th International Conference on Parallel and Distributed Systems (ICPADS)10.1109/ICPADS60453.2023.00041(223-229)Online publication date: 17-Dec-2023
- Zhou QWang HYu XLi CBai YYan FXu Y(2023)MPress: Democratizing Billion-Scale Model Training on Multi-GPU Servers via Memory-Saving Inter-Operator Parallelism2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA)10.1109/HPCA56546.2023.10071077(556-569)Online publication date: Feb-2023
- Show More Cited By