Computer Science > Networking and Internet Architecture

arXiv:2304.09961 (cs)

[Submitted on 19 Apr 2023 (v1), last revised 2 May 2023 (this version, v2)]

Title:Adaptive Scheduling for Edge-Assisted DNN Serving

Authors:Jian He, Chenxi Yang, Zhaoyuan He, Ghufran Baig, Lili Qiu

View PDF

Abstract:Deep neural networks (DNNs) have been widely used in various video analytic tasks. These tasks demand real-time responses. Due to the limited processing power on mobile devices, a common way to support such real-time analytics is to offload the processing to an edge server. This paper examines how to speed up the edge server DNN processing for multiple clients. In particular, we observe batching multiple DNN requests significantly speeds up the processing time. Based on this observation, we first design a novel scheduling algorithm to exploit the batching benefits of all requests that run the same DNN. This is compelling since there are only a handful of DNNs and many requests tend to use the same DNN. Our algorithms are general and can support different objectives, such as minimizing the completion time or maximizing the on-time ratio. We then extend our algorithm to handle requests that use different DNNs with or without shared layers. Finally, we develop a collaborative approach to further improve performance by adaptively processing some of the requests or portions of the requests locally at the clients. This is especially useful when the network and/or server is congested. Our implementation shows the effectiveness of our approach under different request distributions (e.g., Poisson, Pareto, and Constant inter-arrivals).

Subjects:	Networking and Internet Architecture (cs.NI); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
Cite as:	arXiv:2304.09961 [cs.NI]
	(or arXiv:2304.09961v2 [cs.NI] for this version)
	https://doi.org/10.48550/arXiv.2304.09961

Submission history

From: Zhaoyuan He [view email]
[v1] Wed, 19 Apr 2023 20:46:50 UTC (576 KB)
[v2] Tue, 2 May 2023 19:05:35 UTC (576 KB)

Computer Science > Networking and Internet Architecture

Title:Adaptive Scheduling for Edge-Assisted DNN Serving

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Networking and Internet Architecture

Title:Adaptive Scheduling for Edge-Assisted DNN Serving

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators