research-article

Federated reinforcement learning approach for detecting uncertain deceptive target using autonomous dual UAV system

Authors:

Haythem Bany Salameh,

Mohannad Alhafnawi,

Ala’eddin Masadeh,

Yaser JararwehAuthors Info & Claims

Volume 60, Issue 2

https://doi.org/10.1016/j.ipm.2022.103149

Published: 01 March 2023 Publication History

Abstract

This paper develops a cooperative federated reinforcement learning (RL) strategy that enables two unmanned aerial vehicles (UAVs) to cooperate in learning and predicting the movements of an intelligent deceptive target in a given search area. The proposed strategy allows the UAVs to autonomously cooperate, through information exchange of the gained experience to maximize the target detection performance and accelerate the learning speed while maintaining privacy. Specifically, we consider a monitoring model that includes a search area, a charging station, two cooperative UAVs, an intelligent deceptive uncertain moving target, and a fake (false) target. Each UAV is equipped with a limited-capacity rechargeable battery and a communication unit for exchanging the gained experience. The problem of maximizing the detection probability of the uncertain deceptive target using cooperative UAVs is mathematically modeled as a search-benefit maximization problem, which is then reformulated as a Markov decision process (MDP) due to the uncertainty nature of the problem. Because there is no prior information on the targets’ movement, a cooperative RL, is utilized to tackle the problem. The proposed cooperative RL-based algorithm is a distributed collaborative mechanism that enables the two UAVs, i.e., agents, to individually interact with the operating environment and maximize their cumulative rewards by converging to a shared policy while achieving privacy. Simulation results indicate that a cooperative RL-based dual UAV system can noticeably improve the target detection probability, reduce the detection performance, and accelerate the learning speed.

Highlights

•

UAV cooperative learning for target detection in indoor environments is investigated.

•

The detection problem of deceptive target using 2-UAVs is mathematically formulated.

•

The 2-UAV system optimization is reformulated as MDP and solved using cooperative RL.

References

[1]

Abreha H.G., Hayajneh M., Serhani M.A., Federated learning in edge computing: A systematic survey, Sensors 22 (2) (2022),. URL https://www.mdpi.com/1424-8220/22/2/450.

Abstract

Highlights

References

Cited By

Index Terms

Recommendations

Autonomous UAV-based surveillance system for multi-target detection using reinforcement learning

Federated Reinforcement Learning-Based UAV Swarm System for Aerial Remote Sensing

Reinforcement Learning for UAV Attitude Control

Comments

Information

Published In

Publisher

Publication History

Author Tags

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations