[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN113891481B - Throughput-oriented cellular network D2D communication dynamic resource allocation method - Google Patents

Throughput-oriented cellular network D2D communication dynamic resource allocation method Download PDF

Info

Publication number
CN113891481B
CN113891481B CN202111140067.1A CN202111140067A CN113891481B CN 113891481 B CN113891481 B CN 113891481B CN 202111140067 A CN202111140067 A CN 202111140067A CN 113891481 B CN113891481 B CN 113891481B
Authority
CN
China
Prior art keywords
user
spectrum resource
resource block
throughput
cellular
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202111140067.1A
Other languages
Chinese (zh)
Other versions
CN113891481A (en
Inventor
郑军
姜书瑞
张源
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Southeast University
Original Assignee
Southeast University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Southeast University filed Critical Southeast University
Priority to CN202111140067.1A priority Critical patent/CN113891481B/en
Publication of CN113891481A publication Critical patent/CN113891481A/en
Application granted granted Critical
Publication of CN113891481B publication Critical patent/CN113891481B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W72/00Local resource management
    • H04W72/50Allocation or scheduling criteria for wireless resources
    • H04W72/54Allocation or scheduling criteria for wireless resources based on quality criteria
    • H04W72/541Allocation or scheduling criteria for wireless resources based on quality criteria using the level of interference
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B17/00Monitoring; Testing
    • H04B17/30Monitoring; Testing of propagation channels
    • H04B17/382Monitoring; Testing of propagation channels for resource allocation, admission control or handover
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W16/00Network planning, e.g. coverage or traffic planning tools; Network deployment, e.g. resource partitioning or cells structures
    • H04W16/02Resource partitioning among network components, e.g. reuse partitioning
    • H04W16/10Dynamic resource partitioning
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W72/00Local resource management
    • H04W72/04Wireless resource allocation
    • H04W72/044Wireless resource allocation based on the type of the allocated resource
    • H04W72/0453Resources in frequency domain, e.g. a carrier in FDMA
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W72/00Local resource management
    • H04W72/50Allocation or scheduling criteria for wireless resources
    • H04W72/54Allocation or scheduling criteria for wireless resources based on quality criteria
    • H04W72/542Allocation or scheduling criteria for wireless resources based on quality criteria using measured or perceived quality

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • Electromagnetism (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

The invention discloses a cellular network D2D communication dynamic resource allocation method facing throughput, which comprises the following steps: step 1, judging the type of a newly arrived user; step 2, according to the user type, calculating the link signal-to-interference-and-noise ratio and the obtained throughput of the user on each frequency spectrum resource block, and distributing the frequency spectrum resource block which can provide the maximum user throughput currently for the newly arrived user; and step 3, after the allocation of the frequency spectrum resource blocks is completed, allocating the transmission power for the newly arrived users. The invention can effectively improve the total throughput of the cellular network on the premise of ensuring the communication quality of the cellular user.

Description

Throughput-oriented cellular network D2D communication dynamic resource allocation method
Technical Field
The invention relates to the technical field of wireless communication, in particular to a cellular network D2D communication dynamic resource allocation method facing throughput.
Background
With the rapid popularity of mobile devices, there is an increasing demand for more and more local applications, such as content sharing, interactive games, etc., to transfer data between nearby users. Meanwhile, with the rapid rise of the application of the internet of things, a large number of mobile terminals need to access the network. This all presents a significant challenge for communication resource management in cellular networks. D2D communication is a new communication method in which users communicate directly without passing through a base station. By introducing D2D communication into the cellular network, the spectrum utilization rate of the network can be effectively improved, and the problem of network resource shortage can be relieved. Meanwhile, since the transmission distance of D2D communication is generally far smaller than that of cellular communication, it also has advantages of improving energy efficiency, reducing transmission delay, and reducing network load. To increase the spectrum utilization of cellular networks, D2D user pairs typically employ a sharing mode. In the sharing mode, the D2D user pair multiplexes the spectrum resource blocks of the cellular user, in which case mutual interference between the two cannot be avoided. Therefore, how to effectively mitigate interference between the D2D link and the cellular link under the premise of guaranteeing the communication quality of the cellular user becomes an important problem in D2D communication. Current solutions to this problem mainly include power control, spectrum allocation, etc. In an actual network scenario, both the cellular user and the D2D user pair dynamically arrive and leave the network, and the interference between links is also in the process of changing constantly, so it is necessary to study and design the interference control algorithm in the dynamic scenario.
Disclosure of Invention
The invention aims to solve the technical problem of providing a throughput-oriented cellular network D2D communication dynamic resource allocation method which is used for solving the problem of interference among a base station, a cellular user and a D2D user caused by introducing D2D communication in a single-cell uplink scene and improving network throughput.
In order to solve the technical problems, the invention provides a method for allocating dynamic resources of a cellular network D2D communication facing throughput, which comprises the following steps:
and step 1, judging the type of the newly arrived user, wherein the user type comprises a cellular user and a D2D user pair.
Step 2, allocating frequency spectrum resource blocks for the newly arrived users; the method specifically comprises the following steps:
step 2.1, respectively calculating the link signal-to-interference-and-noise ratio and the obtained throughput of the user on each frequency spectrum resource block according to the user type;
Step 2.2, ordering the throughput which can be obtained by the user on each frequency spectrum resource block from large to small; allocating a spectrum resource block capable of providing the maximum user throughput currently for a newly arrived cellular user or D2D user pair; for a cellular user, if the obtainable throughput is smaller than the minimum throughput requirement of the cellular user, rejecting the cellular user to access the network;
Step 3, distributing the transmitting power for the newly arrived user;
if the newly arrived user is a cellular user, distributing fixed transmission power for the newly arrived user;
If the newly arrived user is a D2D user pair, a power control algorithm based on Q learning is called to allocate transmission power for the D2D user pair, and the transmission power of other D2D user pairs sharing the same frequency spectrum resource block is dynamically adjusted so as to maximize the total throughput of the network; all the power available for D2D users to select cannot meet the minimum throughput requirement of the cellular users occupying the same spectrum resource block, and the D2D users are denied access to the network.
Further, in step 2.1, according to the user type, calculating the link signal-to-interference-and-noise ratio and the throughput that can be obtained of the user on each spectrum resource block respectively; the method comprises the following steps:
1) If the new arriving user type is a cellular user, the calculation process of the link signal-to-interference-and-noise ratio and the obtained throughput of the cellular user on each frequency spectrum resource block is as follows:
The calculation formula of the link signal-to-interference-and-noise ratio of the cellular user on each frequency spectrum resource block is as follows:
Where C i denotes the i-th cellular user (i=1, 2, …), D j denotes the j-th D2D user pair (j=1, 2, …), r=1, 2, …, K denotes the number of spectrum resource blocks in the network; representing a set of all D2D user pairs sharing an r-th spectrum resource block; Representing the transmit power of the cellular user C i occupying the r-th spectrum resource block, Representing the transmission power of the D j of the D2D user occupying the r-th frequency spectrum resource block; Representing the channel gain between the cellular user C i and the base station occupying the r-th spectrum resource block, The channel gain between the D j transmitting end and the base station of the D2D user pair occupying the r-th spectrum resource block is represented, and σ 2 represents the noise power.
According to shannon's theorem, the throughput calculation formula that can be obtained by the cellular user on each spectrum resource block is:
where W represents the bandwidth of one spectrum resource block.
2) If the newly arrived user type is a D2D user, the steps of the link signal-to-interference-and-noise ratio of the D2D user on each spectrum resource block and the obtained throughput are calculated as follows:
the signal-to-interference-and-noise ratio calculation formula of the D2D link is as follows:
wherein, Represents the channel gain between the transmitting end and the receiving end of the D j of the D2D user pair occupying the r-th spectrum resource block,Representing the channel gain between the cellular user C i and the D2D user pair D j receiver occupying the r-th spectrum resource block,Representing the channel gains between the D j' transmitting end and the D j receiving end of different D2D user pairs sharing the r-th spectrum resource block.
According to shannon's theorem, the throughput calculation formula of the D2D user pair is:
further, in step 3, if the newly arrived user is a D2D user pair, a power control algorithm based on Q learning is invoked to allocate transmission power to the D2D user pair, specifically, the transmission power is allocated to the D2D user pair according to a Q value table output by the power control algorithm based on Q learning.
Further, in step 3, if the newly arrived user is a D2D user pair, a power control algorithm based on Q learning is invoked to allocate transmission power for the D2D user pair, and the transmission power of other D2D user pairs sharing the same spectrum resource block is dynamically adjusted to maximize the total throughput of the network; the method comprises the following specific steps:
Step 3.1, initializing the value of the Q value table output by all the power control algorithms based on Q learning to be 0 for N r D2D user pairs D j,j∈{1,2,…,Nr which share the newly arrived D2D user pair allocated spectrum resource block;
Step 3.2, selecting a j-th D2D user pair sharing the spectrum resource block;
Step 3.3, selecting an action a according to an epsilon-greedy strategy based on the current Q value table; wherein action a is defined as selecting one transmit power p e { p 1,p2,…,pL } for each D2D user pair sharing the spectrum resource block, where p 1,p2,…,pL is the alternative transmit power. Specifically, a random number of 0-1 is generated, and if the random number is smaller than epsilon, the operation is randomly selected, and if the random number is larger than epsilon, the operation with the largest Q value is selected.
Step 3.4, executing the action a and calculating a reward function R;
the bonus function R is defined as follows:
where τ 0 represents the minimum throughput requirement of the cellular user occupying the spectrum resource block;
The above equation indicates that when the throughput of the cellular user is higher than its minimum throughput requirement, the reward function is the total throughput of all users sharing the spectrum resource block, i.e. the optimization objective of the algorithm is to maximize the network total throughput, otherwise, the reward function is-1, indicating a penalty value.
Step 3.5, updating the Q value table according to the following formula:
Wherein Q' (s, a) represents an updated value of the Q value table, Q (s, a) represents a current value of the Q value table, a represents a learning rate, 0.ltoreq.a.ltoreq.1, γ represents an attenuation factor, 0.ltoreq.γ.ltoreq.1, Representing the maximum value in the current Q value table;
Step 3.6, repeating the steps 3.3-3.5 until the Q value table converges;
Step 3.7, repeating steps 3.2-3.6 until all D2D user pairs sharing the spectrum resource block are traversed;
And 3.8, assigning j to be 1, and repeating the steps 3.2-3.7 until the Q value tables of all D2D user pairs sharing the spectrum resource block are converged to the same optimal solution.
The beneficial effects of the invention are as follows: on the premise of ensuring the communication quality of the cellular user, the total throughput of the cellular network can be effectively improved.
Drawings
Fig. 1 is a schematic diagram of a cellular network D2D communication uplink system model according to the present invention.
Fig. 2 is a flowchart illustrating a step of allocating a spectrum resource block for a newly arrived user according to the present invention.
Fig. 3 is a flowchart illustrating a process of allocating transmission power to a newly arrived user according to the present invention.
Fig. 4 is a flowchart illustrating steps of a power control algorithm based on Q learning in the present invention.
FIG. 5 is a schematic flow chart of the method of the present invention.
Detailed Description
The embodiment of the invention discloses a throughput-oriented cellular network D2D communication dynamic resource allocation method which is applied to a single-cell scene. Within the cell there is a base station BS, and both cellular and D2D user pairs dynamically arrive and leave the network. There are K spectrum resource blocks in the system, which are marked asThe D2D user pair multiplexes the frequency spectrum resource blocks of the uplink of the cellular user, the cellular user pair and the D2D user pair are randomly and uniformly distributed in the cell range, and the base station can obtain the channel state information of all links. There are two link modes in a cell: a cellular link mode between the base station and the cellular user; D2D user pair direct link mode between transmitting and receiving ends.
Because the D2D user multiplexes the spectrum resources of the uplink, there are three types of interference in the system at this time, as shown in fig. 1: (1) The signals transmitted to the base station by the cellular users are received by the D2D user receiver, and interference is generated to the D2D user; (2) The signal transmitted to the receiving end of the D2D user pair by the transmitting end of the D2D user pair is received by the base station, and interference is generated to the base station; (3) The signals transmitted by the D2D user pair transmitting end to the D2D user pair receiving end are received by other D2D users in the same cell, and interference is generated to other D2D user pairs.
The method for allocating the dynamic resources of the cellular network D2D communication facing the throughput mainly comprises the following 3 steps: (1) judging the newly arrived user type; (2) allocating a spectrum resource block for the newly arrived user; (3) allocating transmit power for the newly arrived user.
Specifically, as shown in fig. 5, the method for allocating dynamic resources for D2D communication in a throughput-oriented cellular network according to the present invention includes the following steps:
and step 1, judging the type of the newly arrived user, wherein the user type comprises a cellular user and a D2D user pair.
Step 2, allocating frequency spectrum resource blocks for the newly arrived users; as shown in fig. 2, the method specifically comprises the following steps:
step 2.1, respectively calculating the link signal-to-interference-and-noise ratio and the obtained throughput of the user on each frequency spectrum resource block according to the user type;
1) The user type is a cellular user, and the calculation process of the link signal-to-interference-and-noise ratio and the obtained throughput of the cellular user on each frequency spectrum resource block is as follows:
The channel gains between the base station and the cellular user in the cell, between the base station and the D2D user pair receiving end, between the D2D user pair transmitting end and the cellular user, and between the D2D user pair are respectively expressed as:
wherein, Respectively, the path loss between the cellular user C i and the base station, the D2D user pair D j sender and the base station, β represents the gain index, μ represents the path loss index,Representing the distance between cellular user C i and the D2D user pair D j receiving end,Representing the distance between the transmitting end and the receiving end of the D j pair of D2D users,Representing the distance between the transmitting end of the D j' and the receiving end of the D j of different D2D user pairs.
The calculation formula of the link signal-to-interference-and-noise ratio of the cellular user on each frequency spectrum resource block is as follows:
Where C i denotes the i-th cellular user (i=1, 2, …), D j denotes the j-th D2D user pair (j=1, 2, …), r=1, 2, …, K denotes the number of spectrum resource blocks in the network; representing a set of all D2D user pairs sharing an r-th spectrum resource block; Representing the transmit power of the cellular user C i occupying the r-th spectrum resource block, Representing the transmission power of the D j of the D2D user occupying the r-th frequency spectrum resource block; Representing the channel gain between the cellular user C i and the base station occupying the r-th spectrum resource block, The channel gain between the D j transmitting end and the base station of the D2D user pair occupying the r-th spectrum resource block is represented, and σ 2 represents the noise power.
According to shannon's theorem, the throughput calculation formula that can be obtained by the cellular user on each spectrum resource block is:
where W represents the bandwidth of one spectrum resource block.
2) The user type is D2D user, and the steps of the link signal-to-interference-and-noise ratio of the D2D user on each frequency spectrum resource block and the obtained throughput are calculated as follows:
the signal-to-interference-and-noise ratio calculation formula of the D2D link is as follows:
wherein, Represents the channel gain between the transmitting end and the receiving end of the D j of the D2D user pair occupying the r-th spectrum resource block,Representing the channel gain between the cellular user C i and the D2D user pair D j receiver occupying the r-th spectrum resource block,Representing the channel gains between the D j' transmitting end and the D j receiving end of different D2D user pairs sharing the r-th spectrum resource block.
According to shannon's theorem, the throughput calculation formula of the D2D user pair is:
Step 2.2, ordering the throughput which can be obtained by the user on each frequency spectrum resource block from large to small; allocating a spectrum resource block capable of providing the maximum user throughput currently for a newly arrived cellular user or D2D user pair;
for a cellular user, if the obtainable throughput is smaller than the minimum throughput requirement of the cellular user, rejecting the cellular user to access the network;
step 3, distributing transmission power for the newly arrived user, as shown in fig. 3;
if the newly arrived user is a cellular user, distributing fixed transmission power for the newly arrived user;
If the newly arrived user is a D2D user pair, a power control algorithm based on Q learning is called, transmission power is distributed to the D2D user pair according to a Q value table output by the power control algorithm, and the transmission power of other D2D user pairs sharing the same frequency spectrum resource block is dynamically adjusted so as to maximize the total throughput of the network; if the power available for the D2D user to select cannot meet the minimum throughput requirement of the cellular user occupying the same spectrum resource block within a certain constraint range, rejecting the D2D user to access the network.
Specifically, as shown in fig. 4, if the arriving user is a D2D user pair, the specific steps for allocating the transmission power to the arriving user are as follows:
Step 3.1, initializing the value of a Q value table output by all power control algorithms based on Q learning to be 0 and assigning j to be 1 for N r D2D user pairs D j,j∈{1,2,…,Nr which share the newly arrived D2D user pair allocated spectrum resource blocks;
Step 3.2, selecting a j-th D2D user pair sharing the spectrum resource block;
Step 3.3, selecting an action a according to an epsilon-greedy strategy based on the current Q value table; wherein action a is defined as selecting one transmit power p e { p 1,p2,…,pL } for each D2D user pair sharing the spectrum resource block, where p 1,p2,…,pL is the alternative transmit power. Specifically, a random number of 0-1 is generated, and if the random number is smaller than epsilon, the operation is randomly selected, and if the random number is larger than epsilon, the operation with the largest Q value is selected.
Step 3.4, executing the action a and calculating a reward function R;
the bonus function R is defined as follows:
where τ 0 represents the minimum throughput requirement of the cellular user occupying the spectrum resource block;
The above equation indicates that when the throughput of the cellular user is higher than its minimum throughput requirement, the reward function is the total throughput of all users sharing the spectrum resource block, i.e. the optimization objective of the algorithm is to maximize the network total throughput, otherwise, the reward function is-1, indicating a penalty value.
Step 3.5, updating the Q value table according to the following formula:
Wherein Q' (s, a) represents an updated value of the Q value table, Q (s, a) represents a current value of the Q value table, alpha represents a learning rate, 0.ltoreq.a.ltoreq.1, gamma represents an attenuation factor, 0.ltoreq.gamma.ltoreq.1, Representing the maximum value in the current Q value table;
Step 3.6, repeating the steps 3.3-3.5 until the Q value table converges;
Step 3.7, repeating steps 3.2-3.6 until all D2D user pairs sharing the spectrum resource block are traversed;
and 3.8, re-assigning j to be 1, and repeating the steps 3.2-3.7 until the Q value tables of all D2D user pairs sharing the spectrum resource block are converged to the same optimal solution.
While the invention has been shown and described with respect to the embodiments, it will be understood by those skilled in the art that various changes and modifications may be made therein without departing from the scope of the invention as defined in the following claims.

Claims (1)

1. A method for dynamic resource allocation for D2D communication in a throughput-oriented cellular network, comprising the steps of:
step 1, judging the type of a newly arrived user, wherein the user type comprises a cellular user and a D2D user pair;
step 2, allocating frequency spectrum resource blocks for the newly arrived users; the method specifically comprises the following steps:
step 2.1, respectively calculating the link signal-to-interference-and-noise ratio and the obtained throughput of the user on each frequency spectrum resource block according to the user type;
Step 2.2, ordering the throughput which can be obtained by the user on each frequency spectrum resource block from large to small; allocating a spectrum resource block capable of providing the maximum user throughput currently for a newly arrived cellular user or D2D user pair;
for a cellular user, if the obtainable throughput is smaller than the minimum throughput requirement of the cellular user, rejecting the cellular user to access the network;
Step 3, distributing the transmitting power for the newly arrived user;
if the newly arrived user is a cellular user, distributing fixed transmission power for the newly arrived user;
If the newly arrived user is a D2D user pair, a power control algorithm based on Q learning is called to allocate transmission power for the D2D user pair, and the transmission power of other D2D user pairs sharing the same frequency spectrum resource block is dynamically adjusted so as to maximize the total throughput of the network; when all the power available for D2D users to select can not meet the minimum throughput requirement of the cellular users occupying the same frequency spectrum resource block, rejecting the D2D users to access the network;
in step 2.1, according to the user type, calculating the link signal-to-interference-and-noise ratio and the throughput which can be obtained of the user on each frequency spectrum resource block respectively; the method comprises the following steps:
1) If the new arriving user type is a cellular user, the calculation process of the link signal-to-interference-and-noise ratio and the obtained throughput of the cellular user on each frequency spectrum resource block is as follows:
The calculation formula of the link signal-to-interference-and-noise ratio of the cellular user on each frequency spectrum resource block is as follows:
Where C i denotes the time period between the i-th cellular user (i=1, 2, D j represents a j-th D2D user pair (j=1, 2, the terms of r=1, 2, K represents the number of the spectrum resource block in the network; representing a set of all D2D user pairs sharing an r-th spectrum resource block; Representing the transmit power of the cellular user C i occupying the r-th spectrum resource block, Representing the transmission power of the D j of the D2D user occupying the r-th frequency spectrum resource block; Representing the channel gain between the cellular user C i and the base station occupying the r-th spectrum resource block, Representing the channel gain between the D j transmitting end and the base station of the D2D user pair occupying the r-th frequency spectrum resource block, and sigma 2 represents the noise power;
according to shannon's theorem, the throughput calculation formula that can be obtained by the cellular user on each spectrum resource block is:
wherein W represents the bandwidth of one spectrum resource block;
2) If the newly arrived user type is a D2D user, the steps of the link signal-to-interference-and-noise ratio of the D2D user on each spectrum resource block and the obtained throughput are calculated as follows:
the signal-to-interference-and-noise ratio calculation formula of the D2D link is as follows:
wherein, Represents the channel gain between the transmitting end and the receiving end of the D j of the D2D user pair occupying the r-th spectrum resource block,Representing the channel gain between the cellular user C i and the D2D user pair D j receiver occupying the r-th spectrum resource block,Representing channel gains between a D j' transmitting end and a D j receiving end of different D2D user pairs sharing an r-th spectrum resource block;
According to shannon's theorem, the throughput calculation formula of the D2D user pair is:
In step 3, if the newly arrived user is a D2D user pair, a power control algorithm based on Q learning is called to allocate transmission power to the D2D user pair, specifically, the transmission power is allocated to the D2D user pair according to a Q value table output by the power control algorithm based on Q learning;
In step 3, if the newly arrived user is a D2D user pair, a power control algorithm based on Q learning is called to allocate transmission power for the D2D user pair, and the transmission power of other D2D user pairs sharing the same spectrum resource block is dynamically adjusted to maximize the total throughput of the network; the method comprises the following specific steps:
Step 3.1, initializing the value of the Q value table output by all the power control algorithms based on Q learning to be 0 for N r D2D user pairs D j,j∈{1,2,···,Nr which share the newly arrived D2D user pair allocated spectrum resource block; assigning j to 1;
Step 3.2, selecting a j-th D2D user pair sharing the spectrum resource block;
step 3.3, selecting an action a according to an epsilon-greedy strategy based on the current Q value table; wherein action a is defined as selecting a transmit power p e { p 1,p2,···,pL } for each D2D user pair sharing the spectrum resource block, wherein p 1,p2, P L is the alternative transmit power; specifically, a random number of 0-1 is generated, if the random number is smaller than epsilon, the action is randomly selected, and if the random number is larger than epsilon, the action with the largest Q value is selected;
Step 3.4, executing the action a and calculating a reward function R;
the bonus function R is defined as follows:
where τ 0 represents the minimum throughput requirement of the cellular user occupying the spectrum resource block;
The above indicates that when the throughput of the cellular user is higher than its minimum throughput requirement, the reward function is the total throughput of all users sharing the spectrum resource block, i.e. the optimization objective of the algorithm is to maximize the network total throughput; otherwise, the reward function is-1, which represents a penalty value;
Step 3.5, updating the Q value table according to the following formula:
Wherein Q' (s, a) represents an updated value of the Q value table, Q (s, a) represents a current value of the Q value table, alpha represents a learning rate, 0.ltoreq.alpha.ltoreq.1, gamma represents an attenuation factor, 0.ltoreq.gamma.ltoreq.1, Representing the maximum value in the current Q value table;
Step 3.6, repeating the steps 3.3-3.5 until the Q value table converges;
Step 3.7, repeating steps 3.2-3.6 until all D2D user pairs sharing the spectrum resource block are traversed;
and 3.8, re-assigning j to be 1, and repeating the steps 3.2-3.7 until the Q value tables of all D2D user pairs sharing the spectrum resource block are converged to the same optimal solution.
CN202111140067.1A 2021-09-28 2021-09-28 Throughput-oriented cellular network D2D communication dynamic resource allocation method Active CN113891481B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111140067.1A CN113891481B (en) 2021-09-28 2021-09-28 Throughput-oriented cellular network D2D communication dynamic resource allocation method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111140067.1A CN113891481B (en) 2021-09-28 2021-09-28 Throughput-oriented cellular network D2D communication dynamic resource allocation method

Publications (2)

Publication Number Publication Date
CN113891481A CN113891481A (en) 2022-01-04
CN113891481B true CN113891481B (en) 2024-08-27

Family

ID=79007292

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111140067.1A Active CN113891481B (en) 2021-09-28 2021-09-28 Throughput-oriented cellular network D2D communication dynamic resource allocation method

Country Status (1)

Country Link
CN (1) CN113891481B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114513854B (en) * 2022-01-29 2023-05-23 北京邮电大学 Uplink resource sharing method for cellular network and D2D communication and related equipment
CN116828618B (en) * 2023-08-16 2024-02-09 四川轻化工大学 D2D communication resource allocation method based on user energy efficiency control

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104486744A (en) * 2014-12-29 2015-04-01 重庆邮电大学 D2D (device-to-device) mode selection method in small heterogeneous cellular network
CN107484244A (en) * 2017-08-01 2017-12-15 东南大学 The cellular network D2D communication spectrum resource allocation algorithms that a kind of QoS is perceived

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106455083A (en) * 2015-08-13 2017-02-22 中兴通讯股份有限公司 D2D communication resource multiplex selection method, base station and terminal
CN113301638A (en) * 2021-05-20 2021-08-24 东南大学 D2D communication spectrum allocation and power control algorithm based on Q learning

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104486744A (en) * 2014-12-29 2015-04-01 重庆邮电大学 D2D (device-to-device) mode selection method in small heterogeneous cellular network
CN107484244A (en) * 2017-08-01 2017-12-15 东南大学 The cellular network D2D communication spectrum resource allocation algorithms that a kind of QoS is perceived

Also Published As

Publication number Publication date
CN113891481A (en) 2022-01-04

Similar Documents

Publication Publication Date Title
AlQerm et al. Enhanced machine learning scheme for energy efficient resource allocation in 5G heterogeneous cloud radio access networks
CN114867030B (en) Dual-time scale intelligent wireless access network slicing method
CN113411105B (en) AP selection method of non-cell large-scale antenna system
CN113891481B (en) Throughput-oriented cellular network D2D communication dynamic resource allocation method
CN103997740A (en) Cognitive cooperative network joint resource allocation method based on utility optimization
CN107343268B (en) Non-orthogonal multicast and unicast transmission beamforming method and system
Qiu et al. Subchannel assignment and power allocation for time-varying fog radio access network with NOMA
Gupta et al. Energy harvesting aided device-to-device communication in the over-sailing heterogeneous two-tier downlink
CN108449737B (en) D2D-based downlink energy-efficient power distribution method in distributed antenna system
CN110139281A (en) A method of the full duplex D2D sub-clustering resource allocation based on K-means
CN113301637A (en) D2D communication power control algorithm based on Q learning and neural network
CN113301638A (en) D2D communication spectrum allocation and power control algorithm based on Q learning
CN111682915B (en) Self-allocation method for frequency spectrum resources
Guo et al. Downlink interference management for D2D communication underlying cellular networks
Li et al. A general DRL-based optimization framework of user association and power control for HetNet
CN111314938A (en) Optimization method for time-frequency domain resource allocation of cellular network of single cell
CN108540964B (en) Spectrum resource allocation method
CN107148078B (en) User access control method and device for hybrid full-duplex and half-duplex network
CN106301501B (en) A kind of instant data transfer optimization method of combined coding modulation
Jha et al. Joint resource and power allocation for 5G enabled D2D networking with NOMA
Sun et al. A tradeoff between throughput and energy efficiency for D2D underlaying communication systems
CN113301639A (en) Cellular network D2D communication power control algorithm based on Q learning
CN110730025B (en) Incremental relay method suitable for energy-carrying non-orthogonal multiple access communication system
Guan et al. Joint rate and BER scheduling resource allocation for wireless communication systems
Yang et al. [Retracted] Power Control for Full‐Duplex Device‐to‐Device Underlaid Cellular Networks: A Stackelberg Game Approach

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant