Cited By
- Xu M, Cai D, Yin W, Wang S, Jin X, Liu X (2025). Resource-efficient Algorithms and Systems of Foundation Models: A Survey. ACM Computing Surveys 57:5 (1-39). DOI: 10.1145/3706418. Online publication date: 9-Jan-2025.
- Liu S, Luo H, Li X, Li Y, Guo B, Yu Z, Wang Y, Ma K, Ding Y, Yao Y (2025). AdaKnife: Flexible DNN Offloading for Inference Acceleration on Heterogeneous Mobile Devices. IEEE Transactions on Mobile Computing 24:2 (736-748). DOI: 10.1109/TMC.2024.3466931. Online publication date: Feb-2025.
- Xie J, Yan Y, Saxena A, Qiu Q, Chen J, Sun H, Chen R, Bhattacharyya S (2025). ShaderNN: A lightweight and efficient inference engine for real-time applications on mobile GPUs. Neurocomputing 611 (128628). DOI: 10.1016/j.neucom.2024.128628. Online publication date: Jan-2025.