Article

Minimizing Delay and Power Consumption at the Edge

1 Institute of Theoretical & Applied Informatics, Polish Academy of Sciences (IITiS-PAN), 44-100 Gliwice, Poland
2 Université Côte d’Azur, CNRS I3S, 06107 Nice, France
3 Department of Engineering, King’s College, London SE1 8WA, UK
Sensors 2025, 25(2), 502; https://doi.org/10.3390/s25020502
Submission received: 16 October 2024 / Revised: 3 January 2025 / Accepted: 9 January 2025 / Published: 16 January 2025
(This article belongs to the Special Issue Feature Papers in the 'Sensor Networks' Section 2024)

Abstract

Edge computing systems must offer low latency at low cost and low power consumption for sensors and other applications, including the IoT, smart vehicles, smart homes, and 6G. Thus, substantial research has been conducted to identify optimum task allocation schemes in this context using non-linear optimization, machine learning, and market-based algorithms. Prior work has mainly focused on two methodologies: (i) formulating non-linear optimizations that lead to NP-hard problems, which are processed via heuristics, and (ii) using AI-based formulations, such as reinforcement learning, that are then tested with simulations. These prior approaches have two shortcomings: (a) there is no guarantee that optimum solutions are achieved, and (b) they do not provide an explicit formula for the fraction of tasks that are allocated to the different servers to achieve a specified optimum. This paper offers a radically different and mathematically based principled method that explicitly computes the optimum fraction of jobs that should be allocated to the different servers to (1) minimize the average latency (delay) of the jobs that are allocated to the edge servers and (2) minimize the average energy consumption of these jobs at the set of edge servers. These results are obtained with a mathematical model of a multiple-server edge system that is managed by a task distribution platform, whose equations are derived and solved using methods from stochastic processes. This approach has low computational cost and provides simple linear complexity formulas to compute the fraction of tasks that should be assigned to the different servers to achieve minimum latency and minimum energy consumption.

1. Introduction

The advent of the Internet of Things (IoT) and related technologies, such as smart homes, smart vehicles, 5th generation (5G) networks, and beyond 5G, increases the need for high throughput, low task delays, and low energy consumption through the development of systems that provide computing and communication services at the edge [1,2]. While radio access networks (RANs) and mobile base stations can massively increase the bandwidth and throughput that are offered to end users through these technologies, applications are also being moved from cloud computing platforms to the edge of the Internet [3,4,5] to achieve high throughput with low latency and lower energy consumption [6,7]. Motivated by these developments, much research has been conducted to allocate tasks in edge systems in a manner that attempts to minimize latency and energy consumption using non-linear optimization techniques [8,9] leading to NP-hard problems, which are processed with various heuristics and approximations, or with AI-based approaches [10,11], such as reinforcement learning. These previous approaches have some shortcomings: (a) there is no guarantee that optimum solutions are achieved, and (b) they do not provide a clear indication of the fraction of tasks that should be allocated to the different servers to achieve a specified optimum. Also, the parameters that are used by these methods must be measured and updated to construct the required algorithms; the methods are computationally costly, with additional overhead and energy consumption required for lightweight edge systems. In addition, these approaches do not provide insight into the key parameters, such as the task allocation rates or proportion of tasks that should be sent to different servers, to guarantee that the system will operate at or near its optimum point.
Thus, this paper proposes a radically different, mathematically based, and principled approach that explicitly computes the optimum fraction of jobs that should be allocated to the different servers to either (1) minimize the average latency (delay) of the jobs that are allocated to the edge servers or (2) minimize the average energy consumption of the jobs that use the edge servers. To achieve these objectives, this paper develops a mathematical model of a multiple-server system that is managed by a task dispatching platform (DP). The model equations are derived and solved using methods from stochastic processes. We then use this theoretical framework to explicitly derive the optimum workload distribution that minimizes latency. The paper then uses a similar approach to derive an explicit expression for the share of workload that should be allocated to each edge server that minimizes the system’s additional energy consumption per task. The analytical approach we develop has a low computational cost and provides detailed insight into the fraction of tasks that are allocated to the different servers to achieve minimum latency and minimum energy consumption.

1.1. The Main Results Presented in This Paper

After the review of related work on the design of task-dispatching algorithms that optimize edge performance discussed in Section 1.2, the architecture of an edge system that includes a Dispatching Platform (DP) that dispatches incoming external tasks to a set of n servers is presented in Section 2. Then, the notation and symbols used in the paper are summarized in Section 2.1. All the proofs related to the theoretical developments in the paper are presented in detail in separate appendix sections that are clearly linked to the sections where the results are presented.
A novel mathematical model of an edge system composed of the DP that sends tasks to n servers is presented in Section 3. The Key Product Form Result for this model is stated and proved in Theorem 1, and Lemma 1 shows that its solution accounts for the processing of all the tasks that enter the system. Then, in Section 4, we show how the decision variables $C_i$, $1 \le i \le n$, which combine the requests from the n servers with the task assignment decisions that are made by the DP to each server, affect the average latency of externally arriving tasks at the DP.
Then, Section 5 derives the task allocation policy that minimizes the average response time for all tasks being processed at the n servers in the system. Section 6 discusses the power consumption of edge servers based on power measurements that were made on NUCs and other processors, and we derive a policy that depends on the known parameters of each server to share the tasks between servers to guarantee that the average energy consumption for incoming tasks at the edge is minimized.
Finally, Section 7 provides conclusions and directions for further work.

1.2. Related Work

There has been considerable work on the design of algorithms for distributed system management and task distribution to reduce response times for tasks and maximize data transfer throughputs [12,13]. Real-time techniques have been developed to this effect [14], and various heuristics have often been tested in simulated environments to balance load and reduce response times [15,16]. Energy consumption has been of increasing concern over the last decade because of the steady increase observed over this period in the power consumption of ICT [5,17,18].
Recent research in this area has been primarily motivated by the need for low-cost distributed systems that offer computation and data-intensive applications close to the network edge to achieve low latency [19] for mobile technologies, the IoT, and smart vehicles [20]. Another motivation is the need for distributed computing facilities that locally serve small-scale applications, such as smart homes [21], and in some recent work [22], a system was considered where tasks that arrive at an edge server are either directly executed there or off-loaded to a different server.
As early as the 1990s, the research community proposed AI-based dynamic network management techniques [23,24,25,26] that were later facilitated by the introduction of Software Defined Networks [27] to achieve improvements in network performance and security [28,29]. Attempts have been made to use reinforcement learning or, more broadly, machine learning [30,31,32] as a tool to reduce latency and achieve power savings for tasks that are sensitive to the “quality of service” [33]. Other work has integrated security needs by managing tasks and flows of data so that insecure servers and networks may be dynamically avoided [34,35]. Market-based bidding techniques and games have also been used to design low-computational-cost algorithms that have been shown to offer fast solutions at low cost in simulations [36,37]. Some practical experiments have tested AI in distributed edge systems using Software Defined Networks to reduce latency and improve power consumption [38]. Since edge systems often fulfill multiple functions and support a variety of users, the resulting optimization problems are often NP-hard, and heuristic approximations are often investigated [39].

2. System Description

We consider an edge distributed computing system composed of a Dispatching Platform (DP) that resides on a separate server, together with n machines or servers, $S_1, \ldots, S_n$, that form a cluster that is accessible through the Internet. Each $S_i$ receives local tasks to execute, as well as tasks that are allocated to it by the DP. External tasks to be executed by the edge system are received by the DP and assigned to the servers based on requests from the servers.
The base station or external user shown in Figure 1 sends tasks to the DP, where they are stored in an input queue as they wait for task requests from the n edge servers.
  • When any $S_i$ completes the current task that it is executing, it makes a task request from the DP with probability $p_i$, $0 \le p_i \le 1$. If the DP task input queue is empty, then the request is simply rejected by the DP. If the DP task input queue contains at least one task, then the DP assigns the task to $S_i$ with probability $a_i$, $0 \le a_i \le 1$.
  • Thus, when $S_i$ terminates an ongoing task, a task from the incoming pool is dispatched by the DP to $S_i$ with probability $C_i = p_i a_i$, provided that the input queue at the DP is not empty. If the DP queue is empty, obviously, no task can be sent. This is equivalent to assuming that when a server $S_i$ informs the DP that it has terminated a task, then the DP allocates a task to $S_i$ with probability $0 \le C_i \le 1$ if the DP has a task waiting at its input. If there are no tasks waiting at the DP, then the request from $S_i$ is rejected.
  • Note that task endings at the different servers occur asynchronously with each other, and the decision of the DP is simply to send or not to send a new task to S i .
  • Thus, each server has a queue of tasks, some of which have been sent by the DP and others are local tasks that it receives and executes.
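The request-and-dispatch rule described above can be sketched in a few lines of code. This is purely an illustrative simulation, not part of the paper's system; the function name dp_dispatch and its arguments are our own.

```python
import random

def dp_dispatch(dp_queue, p_i, a_i, rng):
    """One DP decision at the moment server S_i completes a task.

    S_i issues a request with probability p_i; if the DP input queue is
    non-empty, the DP grants the request with probability a_i, so the
    effective dispatch probability is C_i = p_i * a_i when work is waiting.
    Returns True if a task is handed to S_i, False otherwise.
    """
    if rng.random() >= p_i:       # S_i does not request a task
        return False
    if not dp_queue:              # DP queue empty: request is rejected
        return False
    if rng.random() < a_i:        # DP accepts the request
        dp_queue.pop(0)           # the task leaves the DP queue for S_i
        return True
    return False                  # request made but refused by the DP

# Empirical check that the dispatch frequency approaches C_i = p_i * a_i
rng = random.Random(42)
p_i, a_i, trials = 0.5, 0.8, 100_000
hits = sum(dp_dispatch(["task"], p_i, a_i, rng) for _ in range(trials))
print(round(hits / trials, 3))    # close to C_i = 0.4
```

Note that when the DP queue is empty, the outcome is False regardless of $a_i$, which matches the rejection rule in the first bullet above.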
Figure 1. Architecture of an edge system that allocates incoming tasks to a set of locally connected servers for edge computing [40]. It is composed of a Dispatching Platform (DP) that dynamically exploits the n distinct servers’ available capacity to allocate tasks to minimize average task delay or to minimize total power consumption. Each server has its own incoming local flow of tasks, and each server requests and receives tasks from the DP.
External tasks arrive at the DP at a rate $\Lambda > 0$ (tasks per second), while each $S_i$ receives “locally generated tasks”, e.g., from its local owner or user or as part of its operating system, at rate $\lambda_i \ge 0$, where $\lambda = \sum_{i=1}^{n}\lambda_i$. The average execution time of each task at $S_i$ is denoted by $\mu_i^{-1}$.
The DP’s objective is to minimize the total average waiting time at the DP and the average response time at all the n servers, while also reducing the overall energy consumption of the system. Each $S_i$ must execute all the tasks it has received locally, as well as those that it has requested from the DP and that the DP has allocated to it. $S_i$ may need to generate income from the external tasks it receives from the DP, while it also needs to provide low latency (i.e., low response time) for all the tasks it receives. The DP, as well as each $S_i$, also aims to keep the overall average energy consumption as low as possible because of the cost of energy and to achieve greater sustainability.

2.1. Summary of Notation, Symbols, and Abbreviations

In this sub-section, we present and define all the symbols that are used throughout this paper.
  • $t \ge 0$ is the real-valued time variable.
  • DP is the task dispatching platform that transfers tasks from the end users to the servers.
  • $S_i$ denotes a server that receives tasks assigned by the DP, as well as “locally generated tasks”, e.g., from its local owner or user or as part of its operating system.
  • $\Lambda > 0$ is the rate of arrival of external tasks to the DP.
  • $\lambda_i$ is the rate of arrival of locally generated tasks to $S_i$.
  • $\mu_i > 0$ is the average service rate for tasks at the server $S_i$. Thus, the average service time per task at $S_i$ is $\frac{1}{\mu_i}$.
  • We define $\rho_i = \frac{\lambda_i}{\mu_i}$, $\lambda = \sum_{i=1}^{n}\lambda_i$, and $\mu = \sum_{i=1}^{n}\mu_i$.
  • $p_i$, $0 \le p_i \le 1$, is the probability that, when $S_i$ completes the current task that it is executing, it requests to receive a task from the DP.
  • $a_i$, $0 \le a_i \le 1$, is the probability that the DP accepts $S_i$’s request when the DP’s input queue is non-empty.
  • $C_i = p_i a_i$ is the probability that, when $S_i$ asks for a new task from the DP, it receives it, provided that a new task is available at the DP.
  • $y(t) \ge 0$ is the non-negative integer-valued length of the queue of externally arriving tasks waiting at the Dispatching Platform (DP) at time t.
  • $y_i(t) \ge 0$ is the integer-valued total number (queue length) of all the tasks that are in the queue at $S_i$ at time t.
  • $k$ is a particular value of $y(t)$.
  • $k_i$ is a particular value of $y_i(t)$, and we define the vectors as follows:
    $Y(t) = (y(t), y_1(t), \ldots, y_n(t))$, $K = (k, k_1, \ldots, k_n)$.
  • The following vectors are related to $K = (k, k_1, \ldots, k_n)$, where $k \ge 0$, $k_i \ge 0$:
    $K_{-0} = (k-1, k_1, \ldots, k_n)$ if $k > 0$, $K_{+0} = (k+1, k_1, \ldots, k_n)$, $K_{-i} = (k, k_1, \ldots, k_i - 1, \ldots, k_n)$ if $k_i > 0$, $K_{+i} = (k, k_1, \ldots, k_i + 1, \ldots, k_n)$.
  • $\Phi_i$ is the fraction of external user tasks that the DP allocates to $S_i$.
  • $\Phi_i^{+}$ is the fraction of external user tasks that the DP allocates to $S_i$ to minimize the average task response time of the edge system.
  • $\Phi_i^{*}$ is the fraction of external user tasks that the DP allocates to $S_i$ to minimize the average energy consumption per external task assigned to the edge system.
  • $X_i = \lambda_i + \Phi_i\Lambda$ is the total arrival rate of tasks to server $S_i$, i.e., the load of $S_i$.
  • $X_{i1}$ is the upper bound for the linear approximation of the power consumption of $S_i$, and $X_{i1} < \mu_i$.
  • $q_i = \frac{\lambda_i + \Phi_i\Lambda}{\mu_i}$ is the utilization rate of server $S_i$. If $q_i < 1$, it can be interpreted as the probability that $S_i$ is busy processing tasks.
  • $R_{DP}$ is the average response time at the DP for externally arriving tasks.
  • $R_S$ is the average response time of all tasks at the n servers.
  • $\pi_{i0}$ is the power consumption of server $S_i$ when the server is idle, i.e., when $X_i = 0$.
  • $\pi_{iM}$ is the maximum power consumption of server $S_i$. It is attained when $X_i$ is just under the value $\mu_i$.
  • $\alpha_i > 0$ is the approximately linear rate of increase in the power consumption of $S_i$ as a function of the load $X_i$.
  • $\pi_i(X_i) = \pi_{i0} + \alpha_i X_i$ is the approximate power consumption of $S_i$ when its load is $X_i$, for $X_i < \mu_i$.
  • $\pi_i'$ is the derivative of $\pi_i(X_i)$ with respect to $\Phi_i$.
  • $\pi_i''$ is the second derivative of $\pi_i(X_i)$ with respect to $\Phi_i$.
  • $E$ is the average energy consumption of the externally arriving tasks that are assigned by the DP to the different servers, and $E = \sum_{i=1}^{n}\Phi_i\,\pi_i(X_i)\,\mu_i^{-1}$.

3. Analytical Solution for the Dispatching Platform (DP) and Its n Servers

In this section, we construct a G-Network with triggered customer movement [41,42], where the service times at all $S_i$ are mutually independent and exponentially distributed random variables, with parameter $\mu_i$ for $S_i$, and the arrivals of external tasks to the DP form a Poisson process of rate $\Lambda$. The arrivals of local tasks at each $S_i$ constitute mutually independent Poisson processes with rates $\lambda_i$ and are independent of all the service times at the servers. Thus, in a small time interval of length $\Delta t$, an external task arrival occurs at the DP with probability $\Lambda\Delta t + o(\Delta t)$, a local task arrives at any server $S_i$ with probability $\lambda_i\Delta t + o(\Delta t)$, and provided that there is a task at $S_i$ (i.e., $k_i > 0$), a task ends its service at $S_i$ with probability $\mu_i\Delta t + o(\Delta t)$. Here, $o(\Delta t)$ represents a function that tends to zero faster than $\Delta t$, i.e., $\lim_{\Delta t \to 0}\frac{o(\Delta t)}{\Delta t} = 0$.
Also, when a service completes at $S_i$, the server requests to receive a new task from the DP with probability $p_i$. If $k > 0$, the request is granted instantaneously with probability $a_i$ or refused with probability $(1 - a_i)$; if $k = 0$, the request is made but no task can be allocated. Thus, the following state transitions occur:
  • $K \to K_{+0}$ with probability $\Lambda\Delta t + o(\Delta t)$.
  • $K \to K_{+i}$ with probability $\lambda_i\Delta t + o(\Delta t)$.
  • $K_{+0} \to K$ with probability $\mu_i C_i\Delta t + o(\Delta t)$ when $k_i > 0$ (a task at $S_i$ departs but is immediately replaced by a task from the DP).
  • $K_{+i} \to K$ with probability $\mu_i C_i\Delta t + o(\Delta t)$ when $k = 0$ (a task at $S_i$ departs; the request for a new task is made, but the DP queue is empty, i.e., $k = 0$, and, therefore, the DP has no task to send to $S_i$).
  • $K_{+i} \to K$ with probability $\mu_i(1 - C_i)\Delta t + o(\Delta t)$, obtained from
    $[\mu_i(1 - p_i) + \mu_i p_i(1 - a_i)]\Delta t + o(\Delta t) = \mu_i(1 - C_i)\Delta t + o(\Delta t)$,
    independently of the value of $k$ or $k_i$; note that these values refer to the quantities in the vector $K = (k, k_1, \ldots, k_n)$.
  • $K \to K$ with probability $1 - \big(\Lambda + \sum_{i=1}^{n}(\lambda_i + \mu_i 1[k_i > 0])\big)\Delta t + o(\Delta t)$.
Then, the probability $p(K, t) = \mathrm{Prob}[Y(t) = K]$ satisfies the following system (4) of Chapman–Kolmogorov differential-difference equations:
$$\frac{dp(K,t)}{dt} = -\,p(K,t)\Big[\Lambda + \sum_{i=1}^{n}\big(\mu_i 1[k_i > 0] + \lambda_i\big)\Big] + \Lambda\, p(K_{-0}, t)\,1[k > 0] + \sum_{i=1}^{n}\Big[\lambda_i\, p(K_{-i}, t)\,1[k_i > 0] + \mu_i C_i\, p(K_{+0}, t)\,1[k_i > 0] + \mu_i C_i\, p(K_{+i}, t)\,1[k = 0] + \mu_i(1 - C_i)\, p(K_{+i}, t)\Big]. \tag{4}$$
We now state the following result, which we use throughout this paper. The proof of Theorem 1 is detailed in Appendix A.
Theorem 1 (Key Product Form Result).
Assume that the arrival processes whose rates are $\Lambda, \lambda_1, \ldots, \lambda_n$ are all independent Poisson processes and that the service rates $\mu_i$, $1 \le i \le n$, are parameters of independent exponentially distributed random variables, which are also independent of the inter-arrival times. Then, if the system of simultaneous non-linear equations
$$q = \frac{\Lambda}{\sum_{i=1}^{n} q_i\mu_i C_i}, \qquad q_i = \frac{\lambda_i + q\,q_i\mu_i C_i}{\mu_i} = \frac{\rho_i}{1 - qC_i}, \quad 1 \le i \le n, \tag{5}$$
has a solution that satisfies $0 < q < 1$, $0 < q_i < 1$, then this solution is unique, and
$$\lim_{t\to\infty}\mathrm{Prob}\big[y(t) = k,\; y_1(t) = k_1, \ldots, y_n(t) = k_n\big] = q^k(1-q)\prod_{i=1}^{n} q_i^{k_i}(1 - q_i), \tag{6}$$
where
$$q = \lim_{t\to\infty}\mathrm{Prob}[y(t) > 0], \qquad q_i = \lim_{t\to\infty}\mathrm{Prob}[y_i(t) > 0]. \tag{7}$$
Note: The denominator of the expression for $q$ in (5) represents the fact that each server $S_i$ will notify the DP with probability $p_i$, when $S_i$’s ongoing job ends, that it is ready to receive a task from the DP, and that the DP will respond by sending a task to $S_i$ with probability $a_i$, so that $C_i = p_i \cdot a_i$. The rate at which such requests arrive at the DP from $S_i$ is, therefore, $q_i\mu_i p_i$, and the rate at which the DP sends tasks to $S_i$ is $q_i\mu_i C_i$. Note that both of the equations in (5) are non-linear, contrary to those of an ordinary “Jackson” (open) or “Gordon–Newell” (closed) product-form queueing network [43,44].
Corollary 1. 
From (6), it is easy to show that when $q < 1$, the average total number of jobs at steady state $N_{DP}$ in the input queue of the DP is
$$N_{DP} = \frac{q}{1-q}, \tag{8}$$
and the average total number of jobs at steady state $N_i$ that are in the input queue of $S_i$ is
$$N_i = \frac{q_i}{1-q_i}. \tag{9}$$
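For concreteness, the non-linear system (5) can be solved numerically. The sketch below is our own illustration, not part of the paper: the damped fixed-point iteration scheme and the function name solve_fixed_point are assumptions, and the flow-conservation identity of Lemma 1 serves as a sanity check on the result.

```python
def solve_fixed_point(Lam, lam, mu, C, damping=0.3, iters=100_000, tol=1e-12):
    """Solve q = Lam / sum_i q_i*mu_i*C_i and q_i = rho_i / (1 - q*C_i)
    (Equation (5)) by damped fixed-point iteration; returns (q, [q_i])."""
    n = len(lam)
    rho = [lam[i] / mu[i] for i in range(n)]
    q = 0.5
    for _ in range(iters):
        qi = [rho[i] / (1.0 - q * C[i]) for i in range(n)]
        q_new = Lam / sum(qi[i] * mu[i] * C[i] for i in range(n))
        # Damping keeps the iteration stable; clip q below 1 as required.
        q_next = (1.0 - damping) * q + damping * min(q_new, 0.999999)
        if abs(q_next - q) < tol:
            q = q_next
            break
        q = q_next
    qi = [rho[i] / (1.0 - q * C[i]) for i in range(n)]
    return q, qi

# Example: two servers, all C_i = 1
Lam, lam, mu = 5.0, [2.0, 1.0], [10.0, 8.0]
q, qi = solve_fixed_point(Lam, lam, mu, [1.0, 1.0])
print(q)                                      # DP busy probability
print(sum(qi[i] * mu[i] for i in range(2)))   # = Lam + lambda (Lemma 1)
print(q / (1 - q))                            # N_DP, by Corollary 1
```

In this example, the total output rate $\sum_i q_i\mu_i$ matches the total input rate $\Lambda + \lambda = 8$ tasks/sec, as Lemma 1 below requires.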
The expression for $q_i$ in (5) has the intuitive property that we now prove; namely, when the stationary solution exists, the total incoming flow of jobs to the DP and to the servers $S_i$ is identical to the outgoing flow of jobs whose service ends at the n servers. This property is used in the proof of Theorem 1 given in Appendix A.
Lemma 1. 
Let us denote
$$\lambda = \sum_{i=1}^{n}\lambda_i. \tag{10}$$
Then, if $0 < q_i < 1$ and $0 < q < 1$, it follows that
$$\sum_{i=1}^{n} q_i\mu_i = \Lambda + \lambda. \tag{11}$$
Remark 1. 
The expression (11) is an intuitive “flow conservation” identity at steady state for a stable system, which states that all the work that arrives at the DP or that arrives locally to the n servers is eventually processed by one of the n servers.
Proof of Lemma 1. 
As a consequence of the expressions for $q$ and $q_i$ in (5), we can write
$$\sum_{i=1}^{n} q_i\mu_i = \sum_{i=1}^{n}\lambda_i\Big[1 + \sum_{l=1}^{\infty}(qC_i)^l\Big] = \lambda + q\sum_{i=1}^{n} q_i\mu_i C_i, \tag{12}$$
and using the expression for $q$ in (5), we obtain
$$\sum_{i=1}^{n} q_i\mu_i = \lambda + \frac{\Lambda}{\sum_{j=1}^{n} q_j\mu_j C_j}\cdot\sum_{j=1}^{n} q_j\mu_j C_j = \lambda + \Lambda, \tag{13}$$
which completes the proof. □
Corollary of Lemma 1. Since we assume that $0 < q_i < 1$, $1 \le i \le n$, the following holds: denoting $\rho_i = \frac{\lambda_i}{\mu_i}$, we have $\rho_i < 1 - qC_i$, and hence $C_i < \frac{1 - \rho_i}{q}$.

4. Minimizing the Average Response Time or Average Delay at the DP

The well-known “Little’s Formula” [45] can be used to compute the average response time of tasks entering through the DP and of tasks entering the edge system composed of n servers. Here, $\Lambda$ is the total arrival rate of externally arriving tasks to the DP, and $q\,q_i\mu_i C_i$ is the arrival rate of tasks from the DP to server $S_i$.
Since $\Lambda$ is the total arrival rate of such tasks, if $R_{DP}$ denotes the average response time of tasks at the DP before they are assigned to a server, then by Little’s Formula and Equation (8) in Corollary 1, we have
$$R_{DP} = \frac{N_{DP}}{\Lambda} = \frac{1}{\Lambda}\cdot\frac{q}{1-q}, \tag{14}$$
and we would like to know how we should choose $C_i$, $i = 1, \ldots, n$, to minimize $R_{DP}$. To this effect, the following result is needed:
Theorem 2. 
Let $0 < q_i < 1$, and denote $D_i = \frac{dq}{dC_i}$ and $d_{ij} = \frac{dD_i}{dC_j}$. It follows that $D_i < 0$, $d_{ij} < 0$ for $j \ne i$, and $d_{ii} > 0$, for $i, j = 1, \ldots, n$.
The proof of Theorem 2 is given in Appendix B. 
Using (14), we can derive
$$\frac{dR_{DP}}{dC_i} = \frac{1}{\Lambda}\cdot\frac{D_i}{(1-q)^2}, \qquad \frac{d^2 R_{DP}}{dC_i^2} = \frac{1}{\Lambda}\cdot\frac{d_{ii}(1-q) + 2D_i^2}{(1-q)^3}. \tag{15}$$
Then, also using Theorem 2, we have $\frac{dR_{DP}}{dC_i} < 0$ and $\frac{d^2R_{DP}}{dC_i^2} > 0$ for $i = 1, \ldots, n$.
Theorem 3. 
Using (14), (15), and Theorem 2, it follows that for fixed $\Lambda$, the average response time $R_{DP}$ for a task that arrives from the mobile base station (MBS) or an external user to the DP, until it is assigned to one of the server input queues, is minimized with respect to $0 \le C_i \le 1$ by taking the largest possible value of $C_i$, which is $C_i = 1$. When all the $C_i$, $1 \le i \le n$, are set to $C_i = 1$, then $R_{DP}$ attains its minimum value with respect to the vector $C = (C_1, \ldots, C_n)$.

5. Minimizing the Average Response Time R S at the Edge Servers

Different edge servers have different task processing rates $\mu_i$ and different local task arrival rates $\lambda_i$. Therefore, it is worth understanding how the DP should share the tasks that it receives among the edge servers to achieve a minimum average response time $R_S$ for all the tasks, both those that arrive locally at each server and those that are assigned by the DP. Let $\Phi_i$ denote the proportion of incoming external tasks that the DP assigns to server $S_i$:
$$\Phi_i = \frac{q_i\mu_i C_i}{\sum_{j=1}^{n} q_j\mu_j C_j}, \qquad \sum_{j=1}^{n}\Phi_j = 1, \tag{16}$$
so that the total arrival rate of tasks reaching $S_i$ is $\lambda_i + \Lambda\Phi_i$. As a result, when $q < 1$ and $q_i < 1$, $i = 1, \ldots, n$, in steady state, the average number of tasks $N_S$ at the n servers can be obtained from (6) in Theorem 1 as
$$N_S = \sum_{i=1}^{n} N_i = \sum_{i=1}^{n}\frac{q_i}{1 - q_i}, \quad \text{where } q_i = \frac{\lambda_i + \Lambda\Phi_i}{\mu_i}, \tag{17}$$
and by Little’s Theorem, we have
$$R_S = \frac{1}{\Lambda + \lambda}\sum_{i=1}^{n}\frac{q_i}{1 - q_i} = \frac{1}{\Lambda + \lambda}\sum_{i=1}^{n}\frac{\lambda_i + \Lambda\Phi_i}{\mu_i - \lambda_i - \Lambda\Phi_i}, \quad \text{where } \lambda = \sum_{i=1}^{n}\lambda_i. \tag{18}$$
We can now state the following result, whose proof is given in Appendix C.
Theorem 4. 
Let $0 \le q < 1$ and $0 \le q_j < 1$ for $1 \le j \le n$. Then, the average response time at steady state for all tasks that are processed by the n servers, denoted by $R_S$, attains its global minimum with respect to the vector $\Phi = (\Phi_1, \ldots, \Phi_n)$ when $\Phi_j$ is equal to $\Phi_j^*$:
$$\Phi_j^* = \frac{\mu_j - \lambda_j}{\Lambda} - \frac{\mu - \Lambda - \lambda}{\Lambda}\cdot\frac{\sqrt{\mu_j\mu^{-1}}}{\sum_{i=1}^{n}\sqrt{\mu_i\mu^{-1}}} = \frac{\sqrt{\mu_j\mu^{-1}}}{\sum_{i=1}^{n}\sqrt{\mu_i\mu^{-1}}} + \frac{1}{\Lambda}\Big[\mu_j - \lambda_j - (\mu - \lambda)\frac{\sqrt{\mu_j\mu^{-1}}}{\sum_{i=1}^{n}\sqrt{\mu_i\mu^{-1}}}\Big], \quad 1 \le j \le n, \tag{19}$$
where $\mu = \sum_{j=1}^{n}\mu_j$.
Communication Overhead and Computational Cost. From (19), we see that the terms
$$\mu \quad \text{and} \quad \frac{\sqrt{\mu_j\mu^{-1}}}{\sum_{i=1}^{n}\sqrt{\mu_i\mu^{-1}}} \tag{20}$$
can be computed in advance, once and for all, for a given set of n servers, since they only depend on the server speed parameters $\mu_i$, $i = 1, \ldots, n$, and do not need to be re-computed for each decision. $\Lambda$ is known by the DP, which locally monitors the external arrival rate of tasks, and no communication is needed to update $\Lambda$. The parameters $\lambda_j$ must be updated in (19) and should be sent by each $S_j$ to the DP (where the task assignment decision is taken) each time $\lambda_j$ changes. This boils down to a periodic communication overhead of, at most, a total of n packets that are sent from the servers to the DP. From a computational standpoint, evaluating (19) only requires four additions and subtractions and two multiplications for each of the n values $\Phi_j^*$.
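As a sketch of the computation just described, the closed-form shares in (19) can be evaluated directly. The function name below is ours, and we use the algebraically equivalent weights $\sqrt{\mu_j}/\sum_i\sqrt{\mu_i}$, since the factors $\mu^{-1}$ in (19) cancel between numerator and denominator.

```python
import math

def optimal_shares(Lam, lam, mu):
    """Phi_j* from Eq. (19): the fraction of external tasks that the DP
    should send to S_j to minimize the average response time R_S.

    Phi_j* = (mu_j - lam_j)/Lam - ((mu_tot - Lam - lam_tot)/Lam) * w_j,
    where w_j = sqrt(mu_j) / sum_i sqrt(mu_i).
    """
    mu_tot, lam_tot = sum(mu), sum(lam)
    s = sum(math.sqrt(m) for m in mu)
    return [(mu[j] - lam[j]) / Lam
            - (mu_tot - Lam - lam_tot) / Lam * (math.sqrt(mu[j]) / s)
            for j in range(len(mu))]

# Example: two heterogeneous servers
Lam, lam, mu = 5.0, [2.0, 1.0], [10.0, 8.0]
phi = optimal_shares(Lam, lam, mu)
print(phi, sum(phi))   # the shares sum to 1
```

The faster, lightly loaded server receives the larger share, and the resulting loads $\lambda_j + \Lambda\Phi_j^*$ stay strictly below the service rates $\mu_j$.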
Corollary 2. 
The minimum value of $R_S$, denoted $R_S^*$, is
$$R_S^* = \frac{1}{\Lambda + \lambda}\sum_{j=1}^{n}\frac{\lambda_j + \Lambda\Phi_j^*}{\mu_j - \lambda_j - \Lambda\Phi_j^*} = \frac{1}{\Lambda + \lambda}\sum_{j=1}^{n}\Big[\frac{\mu_j}{\mu_j - \lambda_j - \Lambda\Phi_j^*} - 1\Big]. \tag{21}$$
Corollary 3. 
In many cases of interest, an edge system is composed of the DP and n identical servers $S_i$, which, in general, have different local loads $\lambda_i$, so that we have $\mu_i = \mu$, $1 \le i \le n$. In this case, $R_S$ is minimized when
$$\Phi_i^* = \Phi_1^* + \frac{\lambda_1 - \lambda_i}{\Lambda}, \quad 2 \le i \le n, \qquad \Phi_1^* = \frac{1}{n}\Big[1 + \sum_{i=2}^{n}\frac{\lambda_i - \lambda_1}{\Lambda}\Big]. \tag{22}$$
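For the identical-server case of Corollary 3, (22) reduces to a load-balancing rule. The sketch below (with our own helper name) makes explicit that the optimum equalizes each server's total load $\lambda_i + \Lambda\Phi_i^*$ at $(\lambda + \Lambda)/n$.

```python
def identical_server_shares(Lam, lam):
    """Phi_i* of Eq. (22) for n identical servers with local loads lam[i]."""
    n = len(lam)
    phi1 = (1.0 + sum((lam[i] - lam[0]) / Lam for i in range(1, n))) / n
    return [phi1] + [phi1 + (lam[0] - lam[i]) / Lam for i in range(1, n)]

Lam, lam = 6.0, [1.0, 2.0, 3.0]
phi = identical_server_shares(Lam, lam)
print(phi)                                        # shares sum to 1
print([lam[i] + Lam * phi[i] for i in range(3)])  # equal total loads
```

With $\Lambda = 6$ and local loads $(1, 2, 3)$, every server ends up with a total load of $(\lambda + \Lambda)/n = 4$ tasks/sec, i.e., the DP compensates for the imbalance in the local traffic.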

6. Minimizing Energy Consumption

An important performance metric is the energy consumption of the system. As an example, the measured power and energy consumption characteristics of an Intel NUC processor [46], which is widely used in edge systems, are shown in Figure 2, based on accurate measurements that were reported in [47].
Let us note from (11) and (12) that $\Lambda$ is the total arrival rate of external tasks to the DP; these are, in turn, assigned by the DP to the n edge servers. Also, we define $X_i = \lambda_i + \Lambda\Phi_i$, where (as previously in this paper) $\lambda_i$ is the local arrival rate of tasks to $S_i$ and $\Phi_i$ is the fraction of externally arriving tasks that are allocated by the DP to $S_i$.
The curve on the left in Figure 2 shows the rise in the NUC’s power consumption as a function of its load, expressed as the arrival rate of workload to the NUC, starting from a value of roughly 19 Watts when the NUC is idle and attaining a maximum value of approximately 30 Watts when the NUC is fully loaded. The curve on the right in Figure 2 shows the energy consumption in Joules per arriving request as a function of the total arrival rate of tasks $X_i$ to server $S_i$.
Indeed, the curve on the left-hand-side of Figure 2 and the different measurement curves shown in Figure 3 also suggest the following representation for the power consumption π i ( X i ) of server S i (23), where X i = λ i + Λ Φ i , rising from the power consumption π i 0 when S i is idle, up to its maximum power consumption denoted by π i M . Thus, these measurement results indicate that the power versus workload characteristics of a server may be represented by a piece-wise linear approximation consisting of a straight line from X i = 0 to X i = X i 1 with a positive slope and a second flat (nearly zero slope) straight line from X i 1 to higher values of X i . Also, X i 1 is smaller than the maximum processing or service rate μ i of server i. We, therefore, use this observation to express the approximation for 0 X i X i 1 with π i ( X i 1 ) = π i M as
$$\pi_i(X_i) = \begin{cases} \pi_{i0}, & \text{if } X_i = 0,\\ \pi_{i0} + \alpha_i X_i, & \text{if } 0 \le X_i \le X_{i1} < \mu_i, \end{cases}$$
where α_i > 0 is a positive constant that depends on the specific server being considered. We can then define the first and second derivatives of π_i(X_i) with respect to Φ_i:

$$\pi_i' = \frac{d\pi_i(X_i)}{d\Phi_i}, \qquad \pi_i'' = \frac{d^2\pi_i(X_i)}{d\Phi_i^2}.$$
When i ≠ 1, we have, for X_i < μ_i,

$$\pi_i' = \alpha_i\Lambda, \qquad \pi_i'' = 0, \quad \text{for } \alpha_i > 0, \text{ when } 0 \le X_i < X_{i1}.$$
Also, since Φ_1 = 1 − Σ_{i=2}^n Φ_i, we have dΦ_1/dΦ_i = −1 for i ≠ 1. Thus, the first and second derivatives of π_1(X_1) with respect to Φ_i for i ≠ 1 are

$$\frac{d\pi_1(X_1)}{d\Phi_i} = -\alpha_1\Lambda \quad \text{for } \alpha_1 > 0, \qquad \frac{d^2\pi_1(X_1)}{d\Phi_i^2} = 0, \quad \text{for } 0 \le X_1 < X_{11}.$$
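As a concrete illustration, the piecewise-linear power model (23) can be sketched in a few lines of Python. The idle power of 19 W and the 30 W ceiling follow the NUC of Figure 2, while the saturation load and service rate below are illustrative assumptions, not measured values.

```python
def power_consumption(x, pi0=19.0, pi_max=30.0, mu=100.0, x1=80.0):
    """Piecewise-linear power model of (23): pi0 at idle, a linear rise of
    slope alpha up to the saturation load x1 < mu, then flat at pi_max.
    The load parameters (mu, x1) are illustrative assumptions."""
    assert x1 < mu                       # the knee lies below the service rate
    alpha = (pi_max - pi0) / x1          # slope of the linear segment
    if x <= 0.0:
        return pi0                       # idle power
    return min(pi0 + alpha * x, pi_max)  # linear rise, capped at pi_max

print(power_consumption(0.0))    # idle: 19.0 W
print(power_consumption(40.0))   # mid-load: about 24.5 W
print(power_consumption(95.0))   # saturated: 30.0 W
```

The `min` with `pi_max` reproduces the nearly flat second segment of the measured curves in Figure 3.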
Figure 3. Measured power consumption π_i(X_i) in Watts (y-axis) versus load X_i in tasks/s (x-axis) for several different servers, showing the approximately linear increase in power consumption at a rate α_i > 0 that depends on the characteristics of the different processors, between the zero-load level π_{i0} (no task arrivals, server idle) and values close to the maximum power consumption π_{iM}. Note that the value X_{i1} cannot exceed the maximum job processing rate μ_i of S_i. The linear characteristic is displayed as a straight red line on top of the measured data. The rightmost curve refers to the NUC whose characteristics are discussed in Figure 2.

Allocating Incoming Tasks to Minimize the Average Additional Energy Consumed by the Servers

If the DP sends an externally arriving task to server S_i, the task waits for some time and is then processed for μ_i^{-1} time units on average. If the power consumption of S_i is π_i and Φ_i is the probability that the DP has chosen to send the task to S_i, then the energy consumed by the task is simply π_i × μ_i^{-1}.
Therefore, the expected average energy consumption E for executing a task sent from the DP to the edge system composed of n servers is

$$E = \sum_{i=1}^{n} \Phi_i\,\frac{\pi_i(X_i)}{\mu_i}.$$
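The expression for E above can be evaluated directly on the linear segment of the power model (23). The sketch below does so for a small hypothetical cluster; all parameter values are assumptions chosen for illustration.

```python
def expected_energy(Phi, lam, Lambda, mu, pi0, alpha):
    """Expected energy per DP task, E = sum_i Phi_i * pi_i(X_i) / mu_i,
    on the linear segment of the power model (23). Values are illustrative."""
    E = 0.0
    for i in range(len(Phi)):
        X_i = lam[i] + Lambda * Phi[i]      # total arrival rate at server S_i
        p_i = pi0[i] + alpha[i] * X_i       # power draw on the linear segment
        E += Phi[i] * p_i / mu[i]           # energy contribution of S_i
    return E

Phi    = [0.5, 0.3, 0.2]     # DP allocation probabilities (sum to 1)
lam    = [2.0, 1.0, 1.5]     # local arrival rates (tasks/s)
Lambda = 10.0                # external arrival rate at the DP (tasks/s)
mu     = [50.0, 40.0, 60.0]  # service rates (tasks/s)
pi0    = [19.0, 15.0, 22.0]  # idle powers (W)
alpha  = [0.14, 0.20, 0.10]  # power slopes (W per task/s)
print(expected_energy(Phi, lam, Lambda, mu, pi0, alpha))  # about 0.3928 J per task
```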
This leads us directly to the following result, whose proof is given in Appendix D.
Theorem 5. 
Assuming the power consumption characteristic given in (23), the proportion of incoming traffic that should be allocated to server S_j to minimize E, for j = 2, …, n, is

$$\Phi_j^+ = \Phi_1^+\,\frac{\alpha_1\mu_j}{\alpha_j\mu_1} + \frac{1}{2\Lambda\alpha_j}\Big[\pi_{10}\,\frac{\mu_j}{\mu_1} - \pi_{j0}\Big],$$

where

$$\Phi_1^+ = \frac{1 + \frac{1}{2\Lambda}\sum_{i=2}^{n}\Big[\frac{\pi_{i0}}{\alpha_i} - \frac{\pi_{10}}{\mu_1}\,\frac{\mu_i}{\alpha_i}\Big]}{1 + \frac{\alpha_1}{\mu_1}\sum_{i=2}^{n}\frac{\mu_i}{\alpha_i}}.$$
As would be expected, when all the servers are identical, with π_{i0} = π_{10}, α_i = α_1, and μ_i = μ_1 for i = 2, …, n, we have Φ_1^+ = 1/n and Φ_j^+ = Φ_1^+ for 2 ≤ j ≤ n.
Communication Overhead and Computational Overhead.  Since the parameters α_j, μ_j, π_{j0} are fixed and can be known in advance for the servers S_j, j = 1, …, n, the terms Σ_{i=2}^n [π_{i0}/α_i − (π_{10}/μ_1)(μ_i/α_i)], 1 + (α_1/μ_1)Σ_{i=2}^n μ_i/α_i, (α_1μ_j)/(α_jμ_1), and (1/(2α_j))[π_{10}μ_j/μ_1 − π_{j0}] can be computed just once in advance for j = 2, …, n. The only parameter in (28) and (29) that must be measured is Λ; it is measured directly by the DP, which uses it to compute the values of Φ_j that minimize E. Therefore, there is no communication overhead involved in choosing the fraction of externally arriving tasks assigned to each server to minimize the additional average energy consumption E. As for the computational overhead, the computation of Φ_1^+ involves one additional addition and two divisions, while each of the remaining Φ_j^+ requires one additional multiplication, one division, and one addition. Thus, the number of arithmetic operations needed to compute all n values of Φ_j^+ is 3n for each new value of Λ.
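A minimal sketch of (28) and (29) follows, with the Λ-independent terms gathered once as the text describes; the parameter values are illustrative assumptions. For identical servers the computed allocation reduces to 1/n, and in the general case the fractions sum to 1 by construction.

```python
def optimal_energy_allocation(Lambda, mu, alpha, pi0):
    """Energy-minimizing allocation of (28)-(29); server 1 is the reference."""
    n = len(mu)
    # Lambda-independent terms, computed once in advance
    num_sum = sum(pi0[i] / alpha[i] - (pi0[0] / mu[0]) * (mu[i] / alpha[i])
                  for i in range(1, n))
    denom = 1.0 + (alpha[0] / mu[0]) * sum(mu[i] / alpha[i] for i in range(1, n))
    Phi = [0.0] * n
    Phi[0] = (1.0 + num_sum / (2.0 * Lambda)) / denom           # Eq. (29)
    for j in range(1, n):                                       # Eq. (28)
        Phi[j] = (Phi[0] * alpha[0] * mu[j] / (alpha[j] * mu[0])
                  + (pi0[0] * mu[j] / mu[0] - pi0[j]) / (2.0 * Lambda * alpha[j]))
    return Phi

# Identical servers: every fraction reduces to 1/n
print(optimal_energy_allocation(10.0, [50.0]*4, [0.14]*4, [19.0]*4))

# Heterogeneous servers: the fractions still sum to 1
Phi = optimal_energy_allocation(10.0, [50.0, 40.0, 60.0], [0.14, 0.2, 0.1],
                                [19.0, 15.0, 22.0])
print(sum(Phi))
```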

7. Conclusions

Edge computing systems, composed of clusters of processors, are particularly important for supporting the low latency, high throughput, and low power consumption needs of mobile base stations and other communication systems. Their aim is to provide crucial low-latency and sustainable, energy-efficient services for the Internet of Things, and to support the transition of communications to 5G and sixth-generation (6G) mobile networks. Thus, considerable work has been devoted to the design of different types of algorithms for configuring them, dynamically or statically, to optimize the allocation of tasks to edge system servers.
Much prior work has used machine learning, including reinforcement learning, non-linear optimization methods, and market-based mechanisms, and some of these methods have been tested in experimental environments. Although this work has been very useful in building experience of how edge systems can be implemented, it comes at the cost of extensive simulations and time-consuming real-system experimentation. Furthermore, machine learning-based approaches, such as that in our earlier work [10,47], do not provide insight into the fraction of tasks that should be allocated to different servers to achieve optimality.
Thus, in the present work, we address the edge computing design process through an analytical model that yields explicit formulas for optimal task allocation, minimal task latency, and minimal energy consumption of the system as a whole. We show that this approach leads to simple formulas that provide the optimum share of externally arriving tasks that should be assigned to each edge server, and we observe that these formulas are computationally very simple and incur very low communication overhead. In future work, we plan to introduce priorities between the execution of locally generated and remote tasks, and to include the effect of different types of tasks being executed in the system.
We also plan to implement the proposed algorithms in an experimental test bed and compare various machine learning-based algorithms and other simple heuristics (such as greedy algorithms) to see how close they can get to achieving the optimum performance obtained via the analytical approach.

Funding

This research was funded by the European Union’s Horizon Europe research and innovation programme, DOSS Project under Grant Agreement No. 101120270, and by the UKRI Project No. 10034722.

Data Availability Statement

The data presented in this study are available on request from the author.

Acknowledgments

The author would like to thank the editors and anonymous reviewers for their valuable comments and suggestions.

Conflicts of Interest

The author declares no conflicts of interest.

Appendix A

Proof of Theorem 1 (Key Product Form Result). 
Writing Equation (4) at steady state, we set dp(K,t)/dt = 0 and drop the dependence on t to obtain
$$p(K)\Big[\Lambda + \sum_{i=1}^{n}\big(\mu_i 1[k_i>0] + \lambda_i\big)\Big] = \Lambda\,p(K_{-0})\,1[k>0] + \sum_{i=1}^{n}\Big[\lambda_i\,p(K_{-i})\,1[k_i>0] + \mu_i C_i\,p(K_{+0})\,1[k_i>0] + \mu_i C_i\,p(K_{+i})\,1[k=0] + \mu_i(1-C_i)\,p(K_{+i})\Big].$$
Then, we divide both sides of (A1) by p ( K ) and substitute the expression from (6), to obtain
$$\Lambda + \sum_{i=1}^{n}\big(\mu_i 1[k_i>0] + \lambda_i\big) = \frac{\Lambda}{q}\,1[k>0] + \sum_{i=1}^{n}\Big[\frac{\lambda_i}{q_i}\,1[k_i>0] + \mu_i C_i q\,1[k_i>0] + \mu_i C_i q_i\,1[k=0] + \mu_i(1-C_i)\,q_i\Big].$$
Now, substituting μ_i q_i = λ_i/(1 − qC_i) from the expression for q_i in (5), and the expression q = Λ/(Σ_{i=1}^n q_iμ_iC_i), we have
$$\Lambda + \sum_{i=1}^{n}\big(\mu_i 1[k_i>0] + \lambda_i\big) = \sum_{i=1}^{n} q_i\mu_i C_i\,1[k>0] + \sum_{i=1}^{n}\Big[\mu_i(1-qC_i)\,1[k_i>0] + \mu_i C_i q\,1[k_i>0] + \mu_i C_i q_i\,1[k=0] + \mu_i(1-C_i)\,q_i\Big],$$
or, canceling identical terms with opposite signs and summing the identical terms for k > 0 and k = 0, we obtain
$$\Lambda + \sum_{i=1}^{n}\big(\mu_i 1[k_i>0] + \lambda_i\big) = \sum_{i=1}^{n} q_i\mu_i C_i + \sum_{i=1}^{n}\Big[\mu_i 1[k_i>0] + \mu_i(1-C_i)\,q_i\Big].$$
Now, canceling identical terms on both sides of the equation and also canceling identical terms with opposite signs on the right-hand side, we are left with
$$\Lambda + \sum_{i=1}^{n}\lambda_i = \sum_{i=1}^{n}\mu_i q_i.$$
However, by Lemma 1, the right-hand and left-hand sides of the above equation are identical; hence, the solution (5) and (6) has now been proved. The uniqueness of the solution of the non-linear Equation (5) follows from the known uniqueness of the stationary solution of the Chapman–Kolmogorov differential-difference Equation (4) [48,49]. This completes the proof of the Key Product Form (Theorem 1). □
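As a numerical sanity check, the non-linear fixed point of (5) can be solved by damped iteration; at the solution, the balance identity Λ + Σ_i λ_i = Σ_i μ_i q_i of Lemma 1 holds. The sketch below assumes q, q_i, and C_i have the meanings given in (5), and all parameter values are illustrative.

```python
def solve_fixed_point(Lambda, lam, mu, C, iters=2000):
    """Solve q_i = lam_i / (mu_i * (1 - q*C_i)), q = Lambda / sum_i q_i*mu_i*C_i
    (the form of Eq. (5)) by damped fixed-point iteration."""
    n = len(lam)
    q = 0.5
    for _ in range(iters):
        q_i = [lam[i] / (mu[i] * (1.0 - q * C[i])) for i in range(n)]
        q_new = Lambda / sum(q_i[i] * mu[i] * C[i] for i in range(n))
        q = 0.5 * q + 0.5 * q_new          # damping aids convergence
    q_i = [lam[i] / (mu[i] * (1.0 - q * C[i])) for i in range(n)]
    return q, q_i

Lambda = 2.0                  # external arrival rate at the DP
lam = [1.0, 0.5, 0.8]         # local arrival rates
mu  = [10.0, 8.0, 12.0]       # service rates
C   = [0.6, 0.7, 0.5]         # per-server parameters C_i, as in (5)
q, q_i = solve_fixed_point(Lambda, lam, mu, C)

# Lemma 1: total arrival rate equals total departure rate
print(Lambda + sum(lam), sum(mu[i] * q_i[i] for i in range(3)))
```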

Appendix B

Proof of Theorem 2. 
We use (5) to derive
$$D_i = -\frac{\Lambda\Big[\sum_{j=1}^{n} d_{ji}\,\mu_j C_j + q_i\mu_i\Big]}{\Big[\sum_{j=1}^{n} q_j\mu_j C_j\Big]^2} = -\frac{q^2}{\Lambda}\Big[\sum_{j=1}^{n} d_{ji}\,\mu_j C_j + q_i\mu_i\Big],$$

$$d_{ji} = \frac{\rho_j\big[D_i C_j + q\,1[i=j]\big]}{\big[1-qC_j\big]^2} = \frac{q_j^2}{\rho_j}\,\big[D_i C_j + q\,1[i=j]\big].$$
As a consequence, we can write
$$D_i = -\frac{q^2}{\Lambda}\Big[\sum_{j=1}^{n}\frac{q_j^2}{\rho_j}\,D_i\,\mu_j C_j^2 + \frac{q_i^2}{\rho_i}\,q\,\mu_i C_i + q_i\mu_i\Big] = -\frac{q^2\,q_i\mu_i\big[1+\frac{q_i}{\rho_i}\,qC_i\big]}{\Lambda + q^2\sum_{j=1}^{n}\frac{q_j^2}{\rho_j}\,\mu_j C_j^2} = -\frac{q\,q_i\mu_i\big[1+\frac{q_i}{\rho_i}\,qC_i\big]}{\sum_{j=1}^{n} q_j\mu_j C_j\big[1+\frac{q\,q_j}{\rho_j}\,C_j\big]} = -\frac{q\,\frac{\lambda_i}{(1-qC_i)^2}}{\sum_{j=1}^{n}\frac{\lambda_j C_j}{(1-qC_j)^2}}.$$
Thus, (A4) tells us that if q > 0 and all the q_i > 0, then all D_i < 0.
Now, substituting (A4) back into (A3), we have
$$d_{ji} = q\Big[\frac{q_j^2}{\rho_j}\,1[i=j] - \frac{q_j^2}{\rho_j}\cdot\frac{\lambda_i C_j}{(1-qC_i)^2\,\sum_{l=1}^{n}\frac{\lambda_l C_l}{(1-qC_l)^2}}\Big] = q\Big[\frac{q_j^2}{\rho_j}\,1[i=j] - \frac{q_i^2\mu_i}{\rho_i}\cdot\frac{\frac{q_j^2 C_j}{\rho_j}}{\sum_{l=1}^{n}\frac{q_l^2\mu_l C_l}{\rho_l}}\Big] = q\,\frac{q_i^2}{\rho_i}\Big[1[i=j] - \mu_i\,\frac{\frac{q_j^2 C_j}{\rho_j}}{\sum_{l=1}^{n}\frac{q_l^2\mu_l C_l}{\rho_l}}\Big].$$
Since the first term (which is non-negative) in (A5) vanishes when i ≠ j, we can see that d_{ji} < 0 for i ≠ j.
The last part of the proof must establish that d_{ii} > 0. Using (A5), we write

$$d_{ii} = q\,\frac{q_i^2}{\rho_i}\Big[1 - \frac{\frac{q_i^2\mu_i C_i}{\rho_i}}{\sum_{l=1}^{n}\frac{q_l^2\mu_l C_l}{\rho_l}}\Big],$$
so that d_{ii} > 0 is obvious as long as n > 1, 0 < q_l < 1, and all C_l > 0. Hence, under these conditions, we have d_{ii} > 0. This completes the proof of Theorem 2. □
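The sign pattern just proved can also be checked by finite differences, assuming (consistently with (A2) and (A3)) that D_i = dq/dC_i and d_{ji} = dq_j/dC_i; the fixed-point solver and parameter values below are illustrative.

```python
def fixed_point(Lambda, lam, mu, C, iters=5000):
    """Solve the fixed point of the form of Eq. (5) by damped iteration."""
    n = len(lam)
    q = 0.5
    for _ in range(iters):
        q_i = [lam[k] / (mu[k] * (1.0 - q * C[k])) for k in range(n)]
        q = 0.5 * q + 0.5 * Lambda / sum(q_i[k] * mu[k] * C[k] for k in range(n))
    return q, [lam[k] / (mu[k] * (1.0 - q * C[k])) for k in range(n)]

Lambda, lam, mu = 2.0, [1.0, 0.5, 0.8], [10.0, 8.0, 12.0]
C = [0.6, 0.7, 0.5]
h = 1e-6                                    # finite-difference step
q0, qi0 = fixed_point(Lambda, lam, mu, C)
Cp = C[:]; Cp[0] += h                       # perturb C_1
q1, qi1 = fixed_point(Lambda, lam, mu, Cp)
D1  = (q1 - q0) / h                         # approximates dq/dC_1   (< 0)
d11 = (qi1[0] - qi0[0]) / h                 # approximates dq_1/dC_1 (> 0)
d21 = (qi1[1] - qi0[1]) / h                 # approximates dq_2/dC_1 (< 0)
print(D1 < 0, d11 > 0, d21 < 0)             # True True True
```

Increasing C_1 lets server 1 pull more work from the DP, so q falls, q_1 rises, and the other servers' q_j fall, exactly the pattern of Theorem 2.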

Appendix C

Proof of Theorem 3. 
We start from (16) and (18) to write
$$R_S = \frac{1}{\Lambda+\lambda}\sum_{i=1}^{n}\frac{\lambda_i + \Lambda\Phi_i}{\mu_i - \lambda_i - \Lambda\Phi_i}, \quad \text{with } \Phi_1 = 1 - \sum_{i=2}^{n}\Phi_i,$$
so that for 2 ≤ i ≤ n we have dΦ_1/dΦ_i = −1 and
$$\frac{dR_S}{d\Phi_i} = \frac{1}{\Lambda+\lambda}\Big[\frac{\Lambda(\mu_i-\lambda_i-\Lambda\Phi_i)+\Lambda(\lambda_i+\Lambda\Phi_i)}{(\mu_i-\lambda_i-\Lambda\Phi_i)^2} - \frac{\Lambda(\mu_1-\lambda_1-\Lambda\Phi_1)+\Lambda(\lambda_1+\Lambda\Phi_1)}{(\mu_1-\lambda_1-\Lambda\Phi_1)^2}\Big] = \frac{\Lambda}{\Lambda+\lambda}\Big[\frac{\mu_i}{(\mu_i-\lambda_i-\Lambda\Phi_i)^2} - \frac{\mu_1}{(\mu_1-\lambda_1-\Lambda\Phi_1)^2}\Big],$$

$$\frac{d^2R_S}{d\Phi_i^2} = \frac{2\Lambda^2}{\Lambda+\lambda}\Big[\frac{\mu_i}{(\mu_i-\lambda_i-\Lambda\Phi_i)^3} + \frac{\mu_1}{(\mu_1-\lambda_1-\Lambda\Phi_1)^3}\Big].$$
Since q_i < 1 for all 1 ≤ i ≤ n, it follows from (A8) that d²R_S/dΦ_i² > 0. Therefore, the minimum of R_S with respect to the Φ_i, i = 1, …, n, is obtained from (A7) when
$$\frac{dR_S}{d\Phi_i} = 0, \quad \text{or} \quad \big(\mu_1-\lambda_1-\Lambda\Phi_1^*\big)\sqrt{\frac{\mu_i}{\mu_1}} = \mu_i - \lambda_i - \Lambda\Phi_i^*.$$
Using Φ_1^* = 1 − Σ_{i=2}^n Φ_i^* and summing both sides of (A9) over 2 ≤ i ≤ n, we have
$$\big(\mu_1-\lambda_1-\Lambda\Phi_1^*\big)\sum_{i=2}^{n}\sqrt{\frac{\mu_i}{\mu_1}} = \mu - \mu_1 - \lambda + \lambda_1 - \Lambda + \Lambda\Phi_1^*, \quad\text{or}$$

$$\Lambda\Phi_1^*\Big[1 + \sum_{i=2}^{n}\sqrt{\frac{\mu_i}{\mu_1}}\Big] = (\mu_1-\lambda_1)\Big[1 + \sum_{i=2}^{n}\sqrt{\frac{\mu_i}{\mu_1}}\Big] - (\mu-\Lambda-\lambda), \quad\text{or}$$

$$\Lambda\Phi_1^* = \mu_1 - \lambda_1 - \frac{\mu-\Lambda-\lambda}{1+\sum_{i=2}^{n}\sqrt{\mu_i/\mu_1}}, \quad\text{and}\quad \Phi_i^* = \frac{\mu_i-\lambda_i}{\Lambda} - \frac{\mu-\Lambda-\lambda}{\Lambda}\cdot\frac{\sqrt{\mu_i/\mu_1}}{\sum_{j=1}^{n}\sqrt{\mu_j/\mu_1}},$$
and the proof is complete. □
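The closed form of Theorem 3 is easy to evaluate numerically. The sketch below, with illustrative rates, checks that the resulting fractions sum to 1 and that the first-order condition of (A9) holds, i.e., μ_i/(μ_i − λ_i − ΛΦ_i^*)² is the same for every server.

```python
import math

def delay_optimal_allocation(Lambda, lam, mu):
    """Square-root allocation of Theorem 3; server 1 is the reference."""
    n = len(mu)
    mu_tot, lam_tot = sum(mu), sum(lam)
    S = sum(math.sqrt(m / mu[0]) for m in mu)    # sum_j sqrt(mu_j / mu_1)
    slack = mu_tot - Lambda - lam_tot            # spare capacity, > 0 for stability
    return [(mu[i] - lam[i]) / Lambda
            - (slack / Lambda) * math.sqrt(mu[i] / mu[0]) / S
            for i in range(n)]

Lambda = 5.0                  # external arrival rate (illustrative)
lam = [1.0, 0.5, 0.8]         # local arrival rates
mu  = [10.0, 8.0, 12.0]       # service rates
Phi = delay_optimal_allocation(Lambda, lam, mu)

# The fractions sum to 1, and mu_i / (mu_i - lam_i - Lambda*Phi_i)^2
# is identical across servers, as required by the first-order condition
ratios = [mu[i] / (mu[i] - lam[i] - Lambda * Phi[i]) ** 2 for i in range(3)]
print(sum(Phi), ratios)
```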

Appendix D

Proof of Theorem 5. 
Let us use the notation E_i′, E_i″, and π_i′ to denote dE/dΦ_i, d²E/dΦ_i², and dπ_i/dΦ_i, respectively, for 1 ≤ i ≤ n. Using the fact that Σ_{j=1}^n Φ_j = 1, we obtain the following expressions for i ≠ 1:
$$E_i' = \frac{\pi_i}{\mu_i} + \Phi_i\,\frac{\pi_i'}{\mu_i} - \frac{\pi_1}{\mu_1} - \Phi_1\,\frac{\pi_1'}{\mu_1},$$

$$E_i'' = 2\,\frac{\pi_i'}{\mu_i} + \Phi_i\,\frac{\pi_i''}{\mu_i} + 2\,\frac{\pi_1'}{\mu_1} + \Phi_1\,\frac{\pi_1''}{\mu_1}.$$
We see easily that E_i″ > 0 when 0 ≤ X_i < X_{i1} for i ≠ 1. Thus, for i ≠ 1, the value Φ_i^+ of Φ_i that minimizes E is attained by setting E_i′ = 0 in (A11), leading to
$$\Phi_i^+\,\frac{\pi_i'}{\mu_i} = \Phi_1^+\,\frac{\pi_1'}{\mu_1} + \frac{\pi_1}{\mu_1} - \frac{\pi_i}{\mu_i}, \quad\text{or}$$

$$\Phi_i^+ = \Phi_1^+\,\frac{\alpha_1\mu_i}{\alpha_i\mu_1} + \frac{\mu_i\big[\pi_{10}+\alpha_1\lambda_1+\alpha_1\Phi_1^+\Lambda\big]}{\alpha_i\Lambda\mu_1} - \frac{\pi_{i0}+\alpha_i\lambda_i+\alpha_i\Phi_i^+\Lambda}{\alpha_i\Lambda},$$

$$2\,\Phi_i^+ = 2\,\Phi_1^+\,\frac{\mu_i\alpha_1}{\mu_1\alpha_i} + \frac{\lambda_1\mu_i\alpha_1}{\Lambda\mu_1\alpha_i} - \frac{\lambda_i}{\Lambda} + \frac{\frac{\mu_i}{\mu_1}\pi_{10}-\pi_{i0}}{\alpha_i\Lambda},$$

yielding

$$\Phi_i^+ = \Phi_1^+\,\frac{\mu_i\alpha_1}{\mu_1\alpha_i} + \frac{\frac{\mu_i}{\mu_1}\pi_{10}-\pi_{i0}}{2\,\alpha_i\Lambda}.$$
Summing both sides of (A13) from 2 to n, we obtain
$$1-\Phi_1^+ = \Phi_1^+\,\frac{\alpha_1}{\mu_1}\sum_{i=2}^{n}\frac{\mu_i}{\alpha_i} + \frac{\pi_1}{\Lambda\mu_1}\sum_{i=2}^{n}\frac{\mu_i}{\alpha_i} - \sum_{i=2}^{n}\frac{\pi_i}{\Lambda\alpha_i} = \Phi_1^+\,\frac{\alpha_1}{\mu_1}\sum_{i=2}^{n}\frac{\mu_i}{\alpha_i} + \frac{\pi_1}{\Lambda\mu_1}\sum_{i=2}^{n}\frac{\mu_i}{\alpha_i} - \sum_{i=2}^{n}\frac{\pi_{i0}}{\Lambda\alpha_i} - \sum_{i=2}^{n}\Phi_i^+,$$

implying that

$$2\,(1-\Phi_1^+) = \Phi_1^+\,\frac{\alpha_1}{\mu_1}\sum_{i=2}^{n}\frac{\mu_i}{\alpha_i} + \Big(\frac{\pi_{10}}{\Lambda\mu_1} + \Phi_1^+\,\frac{\alpha_1}{\mu_1}\Big)\sum_{i=2}^{n}\frac{\mu_i}{\alpha_i} - \sum_{i=2}^{n}\frac{\pi_{i0}}{\Lambda\alpha_i},$$

or

$$2\,(1-\Phi_1^+) = 2\,\Phi_1^+\,\frac{\alpha_1}{\mu_1}\sum_{i=2}^{n}\frac{\mu_i}{\alpha_i} + \frac{\pi_{10}}{\Lambda\mu_1}\sum_{i=2}^{n}\frac{\mu_i}{\alpha_i} - \sum_{i=2}^{n}\frac{\pi_{i0}}{\Lambda\alpha_i},$$

which yields
$$\Phi_1^+ = \frac{1 + \frac{1}{2\Lambda}\sum_{i=2}^{n}\Big[\frac{\pi_{i0}}{\alpha_i} - \frac{\pi_{10}}{\mu_1}\,\frac{\mu_i}{\alpha_i}\Big]}{1 + \frac{\alpha_1}{\mu_1}\sum_{i=2}^{n}\frac{\mu_i}{\alpha_i}}.$$
Finally, (A13) and (A14) provide us with the expression
$$\Phi_i^+ = \Phi_1^+\,\frac{\mu_i\alpha_1}{\mu_1\alpha_i} + \frac{\pi_{10}\,\mu_i}{\Lambda\alpha_i\mu_1} + \Phi_1^+\,\frac{\alpha_1\mu_i}{\alpha_i\mu_1} - \frac{\pi_{i0}}{\Lambda\alpha_i} - \Phi_i^+, \quad\text{or}\quad \Phi_i^+ = \Phi_1^+\,\frac{\alpha_1\mu_i}{\alpha_i\mu_1} + \frac{1}{2\Lambda\alpha_i}\Big[\frac{\pi_{10}\,\mu_i}{\mu_1} - \pi_{i0}\Big]. \qquad \square$$

References

  1. Juniper-Networks. Expel Complexity with a Self-Driving Network: Soon, Your Network Will Adaptively Meet Your Business Goals All by Itself. 2020. Available online: https://www.juniper.net/us/en/dm/the-selfdriving-network/ (accessed on 15 October 2024).
  2. Apostolos, J. Improving Networks with Artificial Intelligence. 2019. Available online: https://blogs.cisco.com/networking/improving-networks-with-ai (accessed on 15 October 2024).
  3. Kompany, R. Huawei’s ‘Autonomous Driving’ Mobile Networks Strategy Aims to Increase Automation and Reduce Costs; Knowledge Centre: Analysis Mason Ltd.: London, UK, 2018. [Google Scholar]
  4. Weiss, P. Making the ICT Sector Energy Efficient: The Information and Communication Technology Sector Is a Major Energy Consumer, But It Also Offers the Potential for Savings…If Used Properly. Let’s Work Smarter. 2022. Available online: https://www.theparliamentmagazine.eu/news/article/energy-efficient-and-energy-smart (accessed on 15 October 2024).
  5. Gelenbe, E. Electricity Consumption by ICT: Facts, Trends, and Measurements. Ubiquity 2023, 2023, 1–15. [Google Scholar] [CrossRef]
  6. Ishtiaq, M.; Saeed, N.; Khan, M.A. Edge Computing in IoT: A 6G Perspective. arXiv 2022, arXiv:2111.08943. [Google Scholar]
  7. Al-Ansi, A.; Al-Ansi, A.M.; Muthanna, A.; Elgendy, I.A.; Koucheryavy, A. Survey on Intelligence Edge Computing in 6G: Characteristics, Challenges, Potential Use Cases, and Market Drivers. Future Internet 2021, 13, 118. [Google Scholar] [CrossRef]
  8. Nguyen, T.A.; Thang, N.K.; Trystram, D. One gradient Frank-Wolfe for decentralized online convex and submodular optimization. In Proceedings of the ACML 2022—14th Asian Conference in Machine Learning, Hyderabad, India, 12–14 December 2022; pp. 1–33. [Google Scholar]
  9. Sadatdiynov, K.; Cui, L.; Zhang, L.; Huang, J.Z.; Salloum, S.; Mahmud, M.S. A review of optimization methods for computation offloading in edge computing networks. Digit. Commun. Netw. 2023, 9, 450–461. [Google Scholar] [CrossRef]
  10. Fröhlich, P.; Gelenbe, E.; Nowak, M. Reinforcement Learning and Energy-Aware Routing. In Proceedings of the 4th FlexNets Workshop on Flexible Networks Artificial Intelligence Supported Network Flexibility and Agility, New York, NY, USA, 26–27 August 2021; FlexNets ’21. pp. 26–31. [Google Scholar] [CrossRef]
  11. Safri, H.; Kandi, M.M.; Miloudi, Y.; Bortolaso, C.; Trystram, D.; Desprez, F. Towards Developing a Global Federated Learning Platform for IoT. In Proceedings of the 2022 IEEE 42nd International Conference on Distributed Computing Systems (ICDCS), Bologna, Italy, 10–13 July 2022; pp. 1312–1315. [Google Scholar] [CrossRef]
  12. Kim, C.; Kameda, H. An algorithm for optimal static load balancing in distributed computer systems. IEEE Trans. Comput. 1992, 41, 381–384. [Google Scholar]
  13. Topcuoglu, H.; Hariri, S.; Wu, M.Y. Performance-effective and low-complexity task scheduling for heterogeneous computing. IEEE Trans. Parallel Distrib. Syst. 2002, 13, 260–274. [Google Scholar] [CrossRef]
  14. Zhu, X.; Qin, X.; Qiu, M. Qos-aware fault-tolerant scheduling for real-time tasks on heterogeneous clusters. IEEE Trans. Comput. 2011, 60, 800–812. [Google Scholar]
  15. Tian, W.; Zhao, Y.; Zhong, Y.; Xu, M.; Jing, C. A dynamic and integrated load-balancing scheduling algorithm for cloud datacenters. In Proceedings of the 2011 IEEE International Conference on Cloud Computing and Intelligence Systems, Beijing, China, 15–17 September 2011; pp. 311–315. [Google Scholar]
  16. Zhang, Z.; Zhang, X. A load balancing mechanism based on ant colony and complex network theory in open cloud computing federation. In Proceedings of the 2nd International Conference on Mechatronics and Automation, Wuhan, China, 30–31 May 2010; Volume 2, pp. 240–243. [Google Scholar]
  17. Gelenbe, E.; Morfopoulou, C. A framework for energy-aware routing in packet networks. Comput. J. 2011, 54, 850–859. [Google Scholar] [CrossRef]
  18. Gelenbe, E.; Mahmoodi, T. Energy-aware routing in the cognitive packet network. In Proceedings of the Energy 2011: The First International Conference on Smart Grids, Green Communications and IT Energy-Aware Technologies, IARIA, Venice, Italy, 22–27 May 2011; pp. 7–11. Available online: https://www.researchgate.net/publication/289245202_Energy-Aware_Routing_in_the_Cognitive_Packet_Network (accessed on 15 October 2024).
  19. Sakr, A.; Schuster, R. Edge Resource Allocation Based on End-to-End Latency. In Proceedings of the HotEdge ’20, USENIX Association, Online, 25–26 June 2020; Available online: https://www.usenix.org/system/files/hotedge20_poster_sakr_0.pdf (accessed on 15 October 2024).
  20. Sarah, A.; Nencioni, G.; Khan, M.M.I. Resource Allocation in Multi-access Edge Computing for 5G-and-beyond networks. Comput. Netw. 2023, 227, 109720. [Google Scholar] [CrossRef]
  21. Liu, H.; Li, S.; Sun, W. Resource Allocation for Edge Computing without Using Cloud Center in Smart Home Environment: A Pricing Approach. Sensors 2020, 20, 6545. [Google Scholar] [CrossRef]
  22. Zheng, K.; Jiang, G.; Liu, X.; Chi, K.; Yao, X.; Liu, J. DRL-Based Offloading for Computation Delay Minimization in Wireless-Powered Multi-Access Edge Computing. IEEE Trans. Commun. 2023, 71, 1755–1770. [Google Scholar] [CrossRef]
  23. Boyan, J.A.; Littman, M.L. Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach. In Advances in Neural Information Processing Systems 6, Proceedings of the 7th NIPS Conference, Denver, CO, USA, 29 November–2 December 1993; Cowan, J.D., Tesauro, G., Alspector, J., Eds.; Morgan Kaufmann: Burlington, MA, USA, 1993; pp. 671–678. [Google Scholar]
  24. Tennenhouse, D.L.; Wetherall, D.J. Towards an active network architecture. Comput. Commun. Rev. 1996, 26, 5–18. [Google Scholar] [CrossRef]
  25. Tsarouchis, C.; Denazis, S.; Kitahara, C.; Vivero, J.; Salamanca, E.; Magana, E.; Galis, A.; Manas, J.L.; Carlinet, L.; Mathieu, B.; et al. A policy-based management architecture for active and programmable networks. IEEE Netw. 2003, 17, 22–28. [Google Scholar] [CrossRef]
  26. Gelenbe, E.; Xu, Z.; Seref, E. Cognitive Packet Networks. In Proceedings of the 11th IEEE International Conference on Tools with Artificial Intelligence, ICTAI ’99, Chicago, IL, USA, 8–10 November 1999; IEEE Computer Society: New York, NY, USA, 1999; pp. 47–54. [Google Scholar] [CrossRef]
  27. Masoudi, R.; Ghaffari, A. Software defined networks: A survey. J. Netw. Comput. Appl. 2016, 67, 1–25. [Google Scholar] [CrossRef]
  28. Tuncer, D.; Charalambides, M.; Clayman, S.; Pavlou, G. On the Placement of Management and Control Functionality in Software Defined Networks. In Proceedings of the 2015 11th International Conference on Network and Service Management (CNSM), Barcelona, Spain, 9–13 November 2015; pp. 360–365. [Google Scholar] [CrossRef]
  29. Montazerolghaem, A. Software-defined load-balanced data center: Design, implementation and performance analysis. Clust. Comput. 2021, 24, 591–610. [Google Scholar] [CrossRef]
  30. Liu, X.; Qin, Z.; Gao, Y. Resource Allocation for Edge Computing in IoT Networks via Reinforcement Learning. In Proceedings of the ICC 2019—2019 IEEE International Conference on Communications (ICC), Shanghai, China, 20–24 May 2019; pp. 1–6. [Google Scholar] [CrossRef]
  31. Wang, J.; Zhao, L.; Liu, J.; Kato, N. Smart Resource Allocation for Mobile Edge Computing: A Deep Reinforcement Learning Approach. IEEE Trans. Emerg. Top. Comput. 2021, 9, 1529–1541. [Google Scholar] [CrossRef]
  32. Huang, J.; Wan, J.; Lv, B.; Ye, Q.; Chen, Y. Joint Computation Offloading and Resource Allocation for Edge-Cloud Collaboration in Internet of Vehicles via Deep Reinforcement Learning. IEEE Syst. J. 2023, 17, 2500–2511. [Google Scholar] [CrossRef]
  33. You, C.; Huang, K.; Chae, H.; Kim, B.H. Energy-Efficient Resource Allocation for Mobile-Edge Computation Offloading. IEEE Trans. Wirel. Commun. 2017, 16, 1397–1411. [Google Scholar] [CrossRef]
  34. Domanska, J.; Gelenbe, E.; Czachorski, T.; Drosou, A.; Tzovaras, D. Research and Innovation Action for the Security of the Internet of Things: The SerIoT Project. In Recent Cybersecurity Research in Europe, Proceedings of the 2018 ISCIS Security Workshop, London, UK, 26–27 February 2018; Lecture Notes CCIS No. 821; Springer: Berlin/Heidelberg, Germany, 2018; Volume 821. [Google Scholar]
  35. Gelenbe, E.; Domanska, J.; Fröhlich, P.; Nowak, M.P.; Nowak, S. Self-Aware Networks That Optimize Security, QoS, and Energy. Proc. IEEE 2020, 108, 1150–1167. [Google Scholar] [CrossRef]
  36. Rublein, C.; Mehmeti, F.; Towers, M.; Stein, S.; Porta, T.L. Online resource allocation in edge computing using distributed bidding approaches. In Proceedings of the 2021 IEEE 18th International Conference on Mobile Ad Hoc and Smart Systems (MASS), Denver, CO, USA, 4–7 October 2021. [Google Scholar]
  37. Nguyen, D.; Le, L.; Bhargava, V. Price-Based Resource Allocation for Edge Computing: A Market Equilibrium Approach. IEEE Trans. Cloud Comput. 2021, 9, 302–317. [Google Scholar] [CrossRef]
  38. Zhao, Z.; Schiller, E.; Kalogeiton, E.; Braun, T.; Stiller, B.; Garip, M.T.; Joy, J.; Gerla, M.; Akhtar, N.; Matta, I. Autonomic Communications in Software-Driven Networks. IEEE J. Sel. Areas Commun. 2017, 35, 2431–2445. [Google Scholar] [CrossRef]
  39. Ben-Ameur, A.; Araldo, A.; Chahed, T. Multiple Resource Allocation in Multi-Tenant Edge Computing via Sub-modular Optimization. arXiv 2023, arXiv:2302.09888. [Google Scholar]
  40. Hamilton, E. What Is Edge Computing? Definition, Examples and Use Cases Explained in 2025. Available online: https://www.cloudwards.net/what-is-edge-computing/ (accessed on 15 October 2024).
  41. Gelenbe, E. G-networks with signals and batch removal. Probab. Eng. Inform. Sci. 1993, 7, 335–342. [Google Scholar] [CrossRef]
  42. Gelenbe, E. G-networks with instantaneous customer movement. J. Appl. Probab. 1993, 30, 742–748. [Google Scholar] [CrossRef]
  43. Gelenbe, E.; Mitrani, I. Analysis and Synthesis of Computer Systems, 2nd ed.; World Scientific: Singapore, 2010. [Google Scholar]
  44. Ross, S.M. Introduction to Probability Models, 11th ed.; Academic Press: Cambridge, MA, USA, 2014; Chapter 4.2. [Google Scholar]
  45. Sigman, K. Stationary Marked Point Processes: An Intuitive Approach; Chapman and Hall: New York, NY, USA; London, UK; CRC Press: Boca Raton, FL, USA, 1995. [Google Scholar]
  46. Intel. NUC— Small Form Factor Mini PC. 2021. Available online: https://en.wikipedia.org/wiki/Next-Unit-of-Computing (accessed on 23 March 2021).
  47. Fröhlich, P.; Gelenbe, E.; Fiołka, J.; Checinski, J.; Nowak, M.; Filus, Z. Smart SDN Management of Fog Services to Optimize QoS and Energy. Sensors 2021, 21, 3105. [Google Scholar] [CrossRef]
  48. Feller, W. An Introduction to Probability Theory and Its Applications, 3rd ed.; John Wiley & Sons: Hoboken, NJ, USA, 1968; Volume I. [Google Scholar]
  49. Feller, W. An Introduction to Probability Theory and Its Applications, 2nd ed.; John Wiley & Sons: Hoboken, NJ, USA, 1971; Volume II. [Google Scholar]
Figure 2. The curve on the left shows the power consumption that was measured on an NUC versus its overall arrival rate of workload. There is a substantial power consumption of close to 63% of its maximum value when the NUC is idle. We observe that the power consumption attains its maximum value of 30 Watts as the workload increases. The curve on the right shows the corresponding energy consumption per arriving request in Joules as a function of the load.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gelenbe, E. Minimizing Delay and Power Consumption at the Edge. Sensors 2025, 25, 502. https://doi.org/10.3390/s25020502


