[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN113891481A - Throughput-oriented cellular network D2D communication dynamic resource allocation method - Google Patents

Throughput-oriented cellular network D2D communication dynamic resource allocation method Download PDF

Info

Publication number
CN113891481A
CN113891481A CN202111140067.1A CN202111140067A CN113891481A CN 113891481 A CN113891481 A CN 113891481A CN 202111140067 A CN202111140067 A CN 202111140067A CN 113891481 A CN113891481 A CN 113891481A
Authority
CN
China
Prior art keywords
user
resource block
throughput
spectrum resource
cellular
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202111140067.1A
Other languages
Chinese (zh)
Other versions
CN113891481B (en
Inventor
郑军
姜书瑞
张源
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Southeast University
Original Assignee
Southeast University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Southeast University filed Critical Southeast University
Priority to CN202111140067.1A priority Critical patent/CN113891481B/en
Publication of CN113891481A publication Critical patent/CN113891481A/en
Application granted granted Critical
Publication of CN113891481B publication Critical patent/CN113891481B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W72/00Local resource management
    • H04W72/50Allocation or scheduling criteria for wireless resources
    • H04W72/54Allocation or scheduling criteria for wireless resources based on quality criteria
    • H04W72/541Allocation or scheduling criteria for wireless resources based on quality criteria using the level of interference
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B17/00Monitoring; Testing
    • H04B17/30Monitoring; Testing of propagation channels
    • H04B17/382Monitoring; Testing of propagation channels for resource allocation, admission control or handover
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W16/00Network planning, e.g. coverage or traffic planning tools; Network deployment, e.g. resource partitioning or cells structures
    • H04W16/02Resource partitioning among network components, e.g. reuse partitioning
    • H04W16/10Dynamic resource partitioning
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W72/00Local resource management
    • H04W72/04Wireless resource allocation
    • H04W72/044Wireless resource allocation based on the type of the allocated resource
    • H04W72/0453Resources in frequency domain, e.g. a carrier in FDMA
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W72/00Local resource management
    • H04W72/50Allocation or scheduling criteria for wireless resources
    • H04W72/54Allocation or scheduling criteria for wireless resources based on quality criteria
    • H04W72/542Allocation or scheduling criteria for wireless resources based on quality criteria using measured or perceived quality

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • Electromagnetism (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

The invention discloses a throughput-oriented dynamic resource allocation method for cellular network D2D communication, which comprises the following steps: step 1, judging the type of a newly arrived user; step 2, respectively calculating the link signal-to-interference-and-noise ratio of the user on each frequency spectrum resource block and the obtained throughput according to the user type, and allocating the frequency spectrum resource block which can currently provide the maximum user throughput for the newly arrived user; and 3, after the distribution of the spectrum resource blocks is completed, distributing the transmission power for the newly arrived users. The invention can effectively improve the total throughput of the cellular network on the premise of ensuring the communication quality of cellular users.

Description

Throughput-oriented cellular network D2D communication dynamic resource allocation method
Technical Field
The invention relates to the technical field of wireless communication, in particular to a throughput-oriented dynamic resource allocation method for cellular network D2D communication.
Background
With the rapid spread of mobile devices, more and more local applications, such as content sharing, interactive games, etc., require data to be transmitted between nearby users, and the demand is increasing. Meanwhile, with the rapid rise of the application of the internet of things, a large number of mobile terminals need to access a network. All this poses a significant challenge to communication resource management in cellular networks. D2D communication is a new type of communication where users communicate directly without going through a base station. The D2D communication is introduced into the cellular network, so that the frequency spectrum utilization rate of the network can be effectively improved, and the problem of network resource shortage is solved. Meanwhile, since the transmission distance of D2D communication is usually much shorter than that of cellular communication, it also has the advantages of improving energy efficiency, reducing transmission delay, reducing network load, etc. To improve the spectrum utilization of cellular networks, the D2D user pairs typically employ a sharing mode. In the shared mode, the pair of D2D users multiplexes spectrum resource blocks of the cellular users, in which case mutual interference between the two cannot be avoided. Therefore, how to effectively alleviate the interference between the D2D link and the cellular link under the premise of ensuring the communication quality of the cellular user becomes an important issue in the D2D communication. To address this problem, current solutions mainly include power control, spectrum allocation, and the like. In an actual network scenario, the cellular user and the D2D user both arrive and leave the network dynamically, and the interference between links is in a changing process, so it is necessary to research and design the interference control algorithm in the dynamic scenario.
Disclosure of Invention
The technical problem to be solved by the present invention is to provide a throughput-oriented dynamic resource allocation method for D2D communication in a cellular network, which is used to solve the problem of interference among a base station, a cellular user, and a D2D user caused by introducing D2D communication in a single-cell uplink scenario, and improve the network throughput at the same time.
In order to solve the technical problem, the invention provides a throughput-oriented dynamic resource allocation method for cellular network D2D communication, which comprises the following steps:
step 1, judging the type of a newly arrived user, wherein the user type comprises a cellular user and a D2D user pair.
Step 2, distributing frequency spectrum resource blocks for newly arrived users; the method specifically comprises the following steps:
step 2.1, respectively calculating link signal-to-interference-and-noise ratios of users on each frequency spectrum resource block and the obtained throughput according to the types of the users;
step 2.2, sequencing the throughputs obtained by the users on each frequency spectrum resource block from large to small; allocating a spectrum resource block which can currently provide the maximum user throughput for a newly arrived cellular user or a D2D user pair; for a cellular user, if the available throughput is less than the minimum throughput requirement of the cellular user, denying the cellular user access to the network;
step 3, distributing the sending power for the newly arrived user;
if the newly arrived user is a cellular user, allocating fixed transmission power for the newly arrived user;
if the newly arrived user is a D2D user pair, calling a power control algorithm based on Q learning to distribute the transmission power for the D2D user pair, and dynamically adjusting the transmission power of other D2D user pairs sharing the same spectrum resource block so as to maximize the total network throughput; and if all the available D2D users cannot meet the minimum throughput requirement of the cellular users occupying the same spectrum resource block, rejecting the D2D users to access the network.
Further, in step 2.1, according to the user type, the link signal-to-interference-and-noise ratio of the user on each spectrum resource block and the throughput that can be obtained are respectively calculated; the method specifically comprises the following steps:
1) if the newly arrived user type is a cellular user, the calculation process of the link signal-to-interference-and-noise ratio and the obtained throughput of the cellular user on each frequency spectrum resource block is as follows:
the calculation formula of the link signal-to-interference-and-noise ratio of the cellular user on each frequency spectrum resource block is as follows:
Figure BDA0003283490080000021
wherein, CiDenotes the ith cellular subscriber (i ═ 1,2, …), DjRepresents the j-th D2D user pair (j is 1,2, …), r is 1,2, …, K represents the number of spectrum resource blocks in the network;
Figure BDA0003283490080000027
to representA set of all D2D user pairs sharing an r-th spectrum resource block;
Figure BDA0003283490080000022
indicating a cellular user C occupying the r-th spectral resource blockiThe transmission power of the transmitter,
Figure BDA0003283490080000023
indicating D2D user pair D occupying the r-th spectral resource blockjThe transmission power of (a);
Figure BDA0003283490080000024
indicating a cellular user C occupying the r-th spectral resource blockiThe channel gain with the base station is,
Figure BDA0003283490080000025
indicating D2D user pair D occupying the r-th spectral resource blockjChannel gain, σ, between the transmitting end and the base station2Representing the noise power.
According to the shannon theorem, the calculation formula of the throughput obtained by the cellular user on each spectrum resource block is as follows:
Figure BDA0003283490080000026
where W represents the bandwidth of one spectrum resource block.
2) If the newly arrived user type is D2D user, the link signal-to-interference-and-noise ratio and the obtainable throughput of the D2D user on each spectrum resource block are calculated as follows:
the signal-to-interference-and-noise ratio calculation formula of the D2D link is as follows:
Figure BDA0003283490080000031
wherein,
Figure BDA0003283490080000032
indicating occupation of the r-th spectrum resourceBlock D2D user pairs DjThe channel gain between the transmitting end and the receiving end,
Figure BDA0003283490080000033
indicating a cellular user C occupying the r-th spectral resource blockiAnd D2D user pair DjThe channel gain between the receiving ends is,
Figure BDA0003283490080000034
representing different D2D user pairs D sharing an r-th spectrum resource blockj'Transmitting terminal and DjChannel gain between receiving ends.
According to shannon's theorem, the throughput calculation formula of the D2D user pair is:
Figure BDA0003283490080000035
further, in step 3, if the newly arrived user is the D2D user pair, the Q-learning-based power control algorithm is invoked to allocate the transmission power to the D2D user pair, specifically, the transmission power is allocated to the D2D user pair according to the Q-value table output by the Q-learning-based power control algorithm.
Further, in step 3, if the newly arrived user is a D2D user pair, a power control algorithm based on Q learning is invoked to allocate transmission power to the D2D user pair, and the transmission power of other D2D user pairs sharing the same spectrum resource block is dynamically adjusted to maximize the total network throughput; the method comprises the following specific steps:
step 3.1, for N sharing newly arrived D2D user pair allocated spectrum resource blockrD2D user pair Dj,j∈{1,2,…,NrInitializing the values of Q value tables output by all the power control algorithms based on Q learning to be 0;
step 3.2, selecting the jth D2D user pair sharing the spectrum resource block;
3.3, based on the current Q value table, selecting an action a according to an epsilon-greedy strategy; wherein action a is defined as selecting one transmission work for each pair of D2D users sharing the spectrum resource blockThe rate p ∈ { p }1,p2,…,pLIn which p is1,p2,…,pLIs the alternative transmit power. Specifically, a random number of 0-1 is generated, and if the random number is smaller than epsilon, the action is randomly selected, and if the random number is larger than epsilon, the action with the maximum Q value is selected.
Step 3.4, executing action a, and calculating a reward function R;
the reward function R is defined as follows:
Figure BDA0003283490080000041
wherein, tau0Representing a minimum throughput requirement of a cellular user occupying the spectrum resource block;
the above equation shows that when the throughput of the cellular user is higher than the minimum throughput requirement, the reward function is the total throughput of all users sharing the spectrum resource block, that is, the optimization goal of the algorithm is to maximize the total network throughput, otherwise, the reward function is-1, which represents a penalty value.
Step 3.5, updating the Q value table according to the following formula:
Figure BDA0003283490080000042
wherein Q' (s, a) represents an updated value of the Q value table, Q (s, a) represents a current value of the Q value table, a represents a learning rate, 0. ltoreq. a.ltoreq.1, γ represents an attenuation factor, 0. ltoreq. γ.ltoreq.1,
Figure BDA0003283490080000043
represents the maximum value in the current Q value table;
step 3.6, repeating the steps 3.3-3.5 until the Q value table is converged;
step 3.7, repeating steps 3.2-3.6 until all D2D user pairs sharing the spectrum resource block are traversed;
and 3.8, assigning j to be 1, and repeating the steps 3.2-3.7 until the Q value tables of all the D2D user pairs sharing the spectrum resource block converge to the same optimal solution.
The invention has the beneficial effects that: on the premise of ensuring the communication quality of cellular users, the total throughput of the cellular network can be effectively improved.
Drawings
Fig. 1 is a schematic diagram of a cellular network D2D communication uplink system model in the invention.
Fig. 2 is a schematic flow chart of a step of allocating a spectrum resource block to a newly arrived user in the present invention.
Fig. 3 is a flow chart illustrating a procedure of allocating transmission power to a newly arrived user according to the present invention.
FIG. 4 is a schematic diagram of a power control algorithm based on Q learning according to the present invention.
FIG. 5 is a schematic flow chart of the method of the present invention.
Detailed Description
The embodiment of the invention discloses a throughput-oriented cellular network D2D communication dynamic resource allocation method, which is applied to a single-cell scene. Within the cell there is a base station BS, with both cellular and D2D user pairs dynamically arriving and leaving the network. There are K spectrum resource blocks in the system, noted
Figure BDA0003283490080000044
The D2D user pairs multiplex the frequency spectrum resource blocks of the cellular user uplink, the cellular user and the D2D user pairs are distributed uniformly in the cell area at random, and the base station can obtain the channel state information of all links. There are two link modes in a cell: a cellular link mode between a base station and a cellular user; D2D user is directed to direct link mode between the sender and receiver.
Because the D2D user multiplexes the spectrum resource of the uplink, there are three kinds of interference in the system, as shown in fig. 1: (1) the signals transmitted by the cellular users to the base station are received by the D2D user pair receiving end, and generate interference to the D2D user pair; (2) signals transmitted by a D2D user to a transmitting end to a D2D user to a receiving end are received by a base station, and interference is generated on the base station; (3) signals transmitted by the D2D user pair transmitting terminal to the D2D user pair receiving terminal are received by other D2D users in the same cell to the receiving terminal, and generate interference to other D2D user pairs.
The cellular network D2D communication dynamic resource allocation method for throughput mainly comprises 3 steps: (1) judging the type of a newly arrived user; (2) allocating spectrum resource blocks for newly arrived users; (3) the transmission power is allocated to the newly arrived user.
Specifically, as shown in fig. 5, the throughput-oriented dynamic resource allocation method for cellular network D2D communication according to the present invention includes the following steps:
step 1, judging the type of a newly arrived user, wherein the user type comprises a cellular user and a D2D user pair.
Step 2, distributing frequency spectrum resource blocks for newly arrived users; as shown in fig. 2, the method specifically includes the following steps:
step 2.1, respectively calculating link signal-to-interference-and-noise ratios of users on each frequency spectrum resource block and the obtained throughput according to the types of the users;
1) the user type is a cellular user, and the calculation process of the link signal-to-interference-and-noise ratio and the obtained throughput of the cellular user on each frequency spectrum resource block is as follows:
the channel gains between the base station and the cellular users, between the base station and the D2D users ' receivers, between the D2D users ' transmitters and the cellular users, and between the D2D users ' pairs in the cell are respectively expressed as:
Figure BDA0003283490080000051
Figure BDA0003283490080000052
Figure BDA0003283490080000053
Figure BDA0003283490080000054
Figure BDA0003283490080000055
wherein,
Figure BDA0003283490080000056
respectively representing cellular subscribers CiWith base station, D2D user pair DjPath loss between the transmitting end and the base station, beta denotes a gain exponent, mu denotes a path loss exponent,
Figure BDA0003283490080000061
indicating cellular user CiAnd D2D user pair DjThe distance between the receiving ends is such that,
Figure BDA0003283490080000062
representing D2D user pair DjThe distance between the transmitting end and the receiving end,
Figure BDA0003283490080000063
representing different D2D user pairs Dj'Transmitting terminal and DjThe distance between the receiving ends.
The calculation formula of the link signal-to-interference-and-noise ratio of the cellular user on each frequency spectrum resource block is as follows:
Figure BDA0003283490080000064
wherein, CiDenotes the ith cellular subscriber (i ═ 1,2, …), DjRepresents the j-th D2D user pair (j is 1,2, …), r is 1,2, …, K represents the number of spectrum resource blocks in the network;
Figure BDA0003283490080000065
represents a set of all D2D user pairs sharing an r-th spectrum resource block;
Figure BDA0003283490080000066
indicating a cellular user C occupying the r-th spectral resource blockiThe transmission power of the transmitter,
Figure BDA0003283490080000067
indicating D2D user pair D occupying the r-th spectral resource blockjThe transmission power of (a);
Figure BDA0003283490080000068
indicating a cellular user C occupying the r-th spectral resource blockiThe channel gain with the base station is,
Figure BDA0003283490080000069
indicating D2D user pair D occupying the r-th spectral resource blockjChannel gain, σ, between the transmitting end and the base station2Representing the noise power.
According to the shannon theorem, the calculation formula of the throughput obtained by the cellular user on each spectrum resource block is as follows:
Figure BDA00032834900800000610
where W represents the bandwidth of one spectrum resource block.
2) The user type is D2D user, and the steps of calculating the link signal-to-interference-and-noise ratio and the obtainable throughput of the D2D user on each spectrum resource block are as follows:
the signal-to-interference-and-noise ratio calculation formula of the D2D link is as follows:
Figure BDA00032834900800000611
wherein,
Figure BDA00032834900800000612
indicating D2D user pair D occupying the r-th spectral resource blockjThe channel gain between the transmitting end and the receiving end,
Figure BDA00032834900800000613
indicating a cellular user C occupying the r-th spectral resource blockiAnd D2D user pairsDjThe channel gain between the receiving ends is,
Figure BDA0003283490080000071
representing different D2D user pairs D sharing an r-th spectrum resource blockj'Transmitting terminal and DjChannel gain between receiving ends.
According to shannon's theorem, the throughput calculation formula of the D2D user pair is:
Figure BDA0003283490080000072
step 2.2, sequencing the throughputs obtained by the users on each frequency spectrum resource block from large to small; allocating a spectrum resource block which can currently provide the maximum user throughput for a newly arrived cellular user or a D2D user pair;
for a cellular user, if the available throughput is less than the minimum throughput requirement of the cellular user, denying the cellular user access to the network;
step 3, distributing the transmission power for the newly arrived user, as shown in fig. 3;
if the newly arrived user is a cellular user, allocating fixed transmission power for the newly arrived user;
if the newly arrived user is a D2D user pair, calling a power control algorithm based on Q learning, distributing the transmission power for the D2D user pair according to a Q value table output by the power control algorithm, and dynamically adjusting the transmission power of other D2D user pairs sharing the same spectrum resource block so as to maximize the total throughput of the network; and if all the available D2D users cannot meet the minimum throughput requirement of the cellular users occupying the same spectrum resource block within a certain constraint range, rejecting the D2D users to access the network.
Specifically, as shown in fig. 4, if the arriving user is a pair of D2D users, the specific steps of allocating transmission power to the arriving user are as follows:
step 3.1, for N sharing newly arrived D2D user pair allocated spectrum resource blockrD2D user pair Dj,j∈{1,2,…,Nr}, beginningInitializing the values of Q value tables output by all the power control algorithms based on Q learning to be 0, and assigning j to be 1;
step 3.2, selecting the jth D2D user pair sharing the spectrum resource block;
3.3, based on the current Q value table, selecting an action a according to an epsilon-greedy strategy; wherein action a is defined as selecting a transmit power p e p for each pair of D2D users sharing the spectrum resource block1,p2,…,pLIn which p is1,p2,…,pLIs the alternative transmit power. Specifically, a random number of 0-1 is generated, and if the random number is smaller than epsilon, the action is randomly selected, and if the random number is larger than epsilon, the action with the maximum Q value is selected.
Step 3.4, executing action a, and calculating a reward function R;
the reward function R is defined as follows:
Figure BDA0003283490080000081
wherein, tau0Representing a minimum throughput requirement of a cellular user occupying the spectrum resource block;
the above equation shows that when the throughput of the cellular user is higher than the minimum throughput requirement, the reward function is the total throughput of all users sharing the spectrum resource block, that is, the optimization goal of the algorithm is to maximize the total network throughput, otherwise, the reward function is-1, which represents a penalty value.
Step 3.5, updating the Q value table according to the following formula:
Figure BDA0003283490080000082
wherein Q' (s, a) represents an updated value of the Q value table, Q (s, a) represents a current value of the Q value table, α represents a learning rate, 0. ltoreq. a.ltoreq.1, γ represents an attenuation factor, 0. ltoreq. γ.ltoreq.1,
Figure BDA0003283490080000083
representing the maximum value in the current Q-value table;
Step 3.6, repeating the steps 3.3-3.5 until the Q value table is converged;
step 3.7, repeating steps 3.2-3.6 until all D2D user pairs sharing the spectrum resource block are traversed;
and 3.8, re-assigning j to 1, and repeating the steps 3.2-3.7 until the Q value tables of all the D2D user pairs sharing the spectrum resource block converge to the same optimal solution.
While the invention has been shown and described with respect to the embodiments, it will be understood by those skilled in the art that various changes and modifications may be made without departing from the scope of the invention as defined in the following claims.

Claims (4)

1. A dynamic resource allocation method for throughput-oriented cellular network D2D communication, comprising the steps of:
step 1, judging the type of a newly arrived user, wherein the user type comprises a cellular user and a D2D user pair;
step 2, distributing frequency spectrum resource blocks for newly arrived users; the method specifically comprises the following steps:
step 2.1, respectively calculating link signal-to-interference-and-noise ratios of users on each frequency spectrum resource block and the obtained throughput according to the types of the users;
step 2.2, sequencing the throughputs obtained by the users on each frequency spectrum resource block from large to small; allocating a spectrum resource block which can currently provide the maximum user throughput for a newly arrived cellular user or a D2D user pair;
for a cellular user, if the available throughput is less than the minimum throughput requirement of the cellular user, denying the cellular user access to the network;
step 3, distributing the sending power for the newly arrived user;
if the newly arrived user is a cellular user, allocating fixed transmission power for the newly arrived user;
if the newly arrived user is a D2D user pair, calling a power control algorithm based on Q learning to distribute the transmission power for the D2D user pair, and dynamically adjusting the transmission power of other D2D user pairs sharing the same spectrum resource block so as to maximize the total network throughput; and when all available D2D users cannot meet the minimum throughput requirement of the cellular users occupying the same spectrum resource block for the selected power, rejecting the D2D users to access the network.
2. A throughput-oriented dynamic resource allocation method for cellular network D2D communication according to claim 1, wherein in step 2.1, the link sir and the achievable throughput of the user on each spectrum resource block are calculated respectively according to the user type; the method specifically comprises the following steps:
1) if the newly arrived user type is a cellular user, the calculation process of the link signal-to-interference-and-noise ratio and the obtained throughput of the cellular user on each frequency spectrum resource block is as follows:
the calculation formula of the link signal-to-interference-and-noise ratio of the cellular user on each frequency spectrum resource block is as follows:
Figure FDA0003283490070000011
wherein, CiDenotes the ith cellular subscriber (i ═ 1,2, …), DjRepresents the j-th D2D user pair (j is 1,2, …), r is 1,2, …, K represents the number of spectrum resource blocks in the network;
Figure FDA0003283490070000015
represents a set of all D2D user pairs sharing an r-th spectrum resource block;
Figure FDA0003283490070000012
indicating a cellular user C occupying the r-th spectral resource blockiThe transmission power of the transmitter,
Figure FDA0003283490070000013
indicating D2D user pair D occupying the r-th spectral resource blockjThe transmission power of (a);
Figure FDA0003283490070000014
indicating a cellular user C occupying the r-th spectral resource blockiThe channel gain with the base station is,
Figure FDA0003283490070000021
indicating D2D user pair D occupying the r-th spectral resource blockjChannel gain, σ, between the transmitting end and the base station2Representing the noise power;
according to the shannon theorem, the calculation formula of the throughput obtained by the cellular user on each spectrum resource block is as follows:
Figure FDA0003283490070000022
wherein, W represents the bandwidth of one spectrum resource block;
2) if the newly arrived user type is D2D user, the link signal-to-interference-and-noise ratio and the obtainable throughput of the D2D user on each spectrum resource block are calculated as follows:
the signal-to-interference-and-noise ratio calculation formula of the D2D link is as follows:
Figure FDA0003283490070000023
wherein,
Figure FDA0003283490070000024
indicating D2D user pair D occupying the r-th spectral resource blockjThe channel gain between the transmitting end and the receiving end,
Figure FDA0003283490070000025
indicating a cellular user C occupying the r-th spectral resource blockiAnd D2D user pair DjThe channel gain between the receiving ends is,
Figure FDA0003283490070000026
representing different D2D user pairs sharing an r-th spectrum resource blockDj'Transmitting terminal and DjChannel gain between the receiving ends;
according to shannon's theorem, the throughput calculation formula of the D2D user pair is:
Figure FDA0003283490070000027
3. the method as claimed in claim 1, wherein in step 3, if the newly arrived user is the D2D user pair, the Q-learning based power control algorithm is invoked to allocate the transmission power for the D2D user pair, and specifically, the transmission power is allocated for the D2D user pair according to the Q-value table outputted by the Q-learning based power control algorithm.
4. The dynamic D2D communication resource allocation method for a throughput-oriented cellular network, according to claim 1, wherein in step 3, if the newly arrived user is the D2D user pair, the power control algorithm based on Q learning is invoked to allocate the transmission power for the D2D user pair, and the transmission power of other D2D user pairs sharing the same spectrum resource block is dynamically adjusted to maximize the total network throughput; the method comprises the following specific steps:
step 3.1, for N sharing newly arrived D2D user pair allocated spectrum resource blockrD2D user pair Dj,j∈{1,2,…,NrInitializing the values of Q value tables output by all the power control algorithms based on Q learning to be 0; assigning j to 1;
step 3.2, selecting the jth D2D user pair sharing the spectrum resource block;
3.3, based on the current Q value table, selecting an action a according to an epsilon-greedy strategy; wherein action a is defined as selecting a transmit power p e p for each pair of D2D users sharing the spectrum resource block1,p2,…,pLIn which p is1,p2,…,pLIs an alternative transmit power; specifically, a random number of 0-1 is generated,if the Q value is larger than the epsilon, selecting the action randomly;
step 3.4, executing action a, and calculating a reward function R;
the reward function R is defined as follows:
Figure FDA0003283490070000031
wherein, tau0Representing a minimum throughput requirement of a cellular user occupying the spectrum resource block;
the above equation represents that when the throughput of a cellular user is higher than its minimum throughput requirement, the reward function is the total throughput of all users sharing the spectrum resource block, i.e. the optimization goal of the algorithm is to maximize the network total throughput; otherwise, the reward function is-1 and represents a penalty value;
step 3.5, updating the Q value table according to the following formula:
Figure FDA0003283490070000032
wherein Q' (s, a) represents an updated value of the Q value table, Q (s, a) represents a current value of the Q value table, α represents a learning rate, 0. ltoreq. α.ltoreq.1, γ represents an attenuation factor, 0. ltoreq. γ.ltoreq.1,
Figure FDA0003283490070000033
represents the maximum value in the current Q value table;
step 3.6, repeating the steps 3.3-3.5 until the Q value table is converged;
step 3.7, repeating steps 3.2-3.6 until all D2D user pairs sharing the spectrum resource block are traversed;
and 3.8, re-assigning j to 1, and repeating the steps 3.2-3.7 until the Q value tables of all the D2D user pairs sharing the spectrum resource block converge to the same optimal solution.
CN202111140067.1A 2021-09-28 2021-09-28 Throughput-oriented cellular network D2D communication dynamic resource allocation method Active CN113891481B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111140067.1A CN113891481B (en) 2021-09-28 2021-09-28 Throughput-oriented cellular network D2D communication dynamic resource allocation method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111140067.1A CN113891481B (en) 2021-09-28 2021-09-28 Throughput-oriented cellular network D2D communication dynamic resource allocation method

Publications (2)

Publication Number Publication Date
CN113891481A true CN113891481A (en) 2022-01-04
CN113891481B CN113891481B (en) 2024-08-27

Family

ID=79007292

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111140067.1A Active CN113891481B (en) 2021-09-28 2021-09-28 Throughput-oriented cellular network D2D communication dynamic resource allocation method

Country Status (1)

Country Link
CN (1) CN113891481B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114513854A (en) * 2022-01-29 2022-05-17 北京邮电大学 Uplink resource sharing method for cellular network and D2D communication and related equipment
CN116828618A (en) * 2023-08-16 2023-09-29 四川轻化工大学 D2D communication resource allocation method based on user energy efficiency control

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104486744A (en) * 2014-12-29 2015-04-01 重庆邮电大学 D2D (device-to-device) mode selection method in small heterogeneous cellular network
WO2017024967A1 (en) * 2015-08-13 2017-02-16 中兴通讯股份有限公司 D2d communication resource multiplexing selection method, base station and terminal
CN107484244A (en) * 2017-08-01 2017-12-15 东南大学 The cellular network D2D communication spectrum resource allocation algorithms that a kind of QoS is perceived
CN113301638A (en) * 2021-05-20 2021-08-24 东南大学 D2D communication spectrum allocation and power control algorithm based on Q learning

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104486744A (en) * 2014-12-29 2015-04-01 重庆邮电大学 D2D (device-to-device) mode selection method in small heterogeneous cellular network
WO2017024967A1 (en) * 2015-08-13 2017-02-16 中兴通讯股份有限公司 D2d communication resource multiplexing selection method, base station and terminal
CN107484244A (en) * 2017-08-01 2017-12-15 东南大学 The cellular network D2D communication spectrum resource allocation algorithms that a kind of QoS is perceived
CN113301638A (en) * 2021-05-20 2021-08-24 东南大学 D2D communication spectrum allocation and power control algorithm based on Q learning

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
王倩;聂秀山;耿蕾蕾;尹义龙: "D2D通信中基于Q学习的联合资源分配与功率控制算法", 南京大学学报(自然科学), no. 006, 31 December 2018 (2018-12-31) *
陈文泰: "《基于机器学习的蜂窝网络D2D通信频谱分配与功率控制算法研究》", 《中国优秀博硕士学位论文》, 15 January 2021 (2021-01-15) *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114513854A (en) * 2022-01-29 2022-05-17 北京邮电大学 Uplink resource sharing method for cellular network and D2D communication and related equipment
CN114513854B (en) * 2022-01-29 2023-05-23 北京邮电大学 Uplink resource sharing method for cellular network and D2D communication and related equipment
CN116828618A (en) * 2023-08-16 2023-09-29 四川轻化工大学 D2D communication resource allocation method based on user energy efficiency control
CN116828618B (en) * 2023-08-16 2024-02-09 四川轻化工大学 D2D communication resource allocation method based on user energy efficiency control

Also Published As

Publication number Publication date
CN113891481B (en) 2024-08-27

Similar Documents

Publication Publication Date Title
CN111447619B (en) Joint task unloading and resource allocation method in mobile edge computing network
US7209711B2 (en) Dynamic channel quality measurement procedures for optimizing data transmission in a wireless digital communication system
KR100680020B1 (en) Packet scheduling
CN103249007B (en) D2D multi-casting communication in IMT-A system is set up and its wireless resource allocation methods
CN101415207B (en) Method, apparatus and base station for balancing load of honeycomb collaboration relay network
CN111884696B (en) Relay cooperation mobile edge calculation method based on multiple carriers
CN108600999B (en) FD-D2D joint optimization method based on channel allocation and power control
CN1399833A (en) Acceleration dependent channel swithing in mobile telecommunications
CN107484244A (en) The cellular network D2D communication spectrum resource allocation algorithms that a kind of QoS is perceived
CN106454920A (en) Resource allocation optimization algorithm based on time delay guarantee in LTE (Long Term Evolution) and D2D (Device-to-Device) hybrid network
CN113891481A (en) Throughput-oriented cellular network D2D communication dynamic resource allocation method
US20230143476A1 (en) Methods and systems for reducing fronthaul bandwidth in a wireless communication system
CN113301638A (en) D2D communication spectrum allocation and power control algorithm based on Q learning
Pichna et al. A medium-access control protocol for a cellular packet CDMA carrying multirate traffic
CN110139281A (en) A method of the full duplex D2D sub-clustering resource allocation based on K-means
CN113301637A (en) D2D communication power control algorithm based on Q learning and neural network
CN110049473A (en) The associated wireless channel distribution and Poewr control method of relaying enhancing D2D communication
Yuan et al. A cascaded channel-power allocation for D2D underlaid cellular networks using matching theory
CN105072686B (en) A kind of wireless resource allocation methods based on OFDMA junction network
CN107835514B (en) Modeling method for wireless network resource matching in multi-host service scene
CN112770343B (en) D2D-NOMA resource allocation method and system based on HAGA
Kim et al. Spectrum allocation with power control in LBS based D2d cellular mobile networks
CN113301639A (en) Cellular network D2D communication power control algorithm based on Q learning
CN115884343A (en) High-mobility ad hoc network dynamic power distribution method based on directional multi-beam antenna
Kou et al. Dynamic uplink/downlink resource allocation for TDD OFDMA access network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant