[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Federated reinforcement learning approach for detecting uncertain deceptive target using autonomous dual UAV system

Published: 01 March 2023 Publication History

Abstract

This paper develops a cooperative federated reinforcement learning (RL) strategy that enables two unmanned aerial vehicles (UAVs) to cooperate in learning and predicting the movements of an intelligent deceptive target in a given search area. The proposed strategy allows the UAVs to autonomously cooperate, through information exchange of the gained experience to maximize the target detection performance and accelerate the learning speed while maintaining privacy. Specifically, we consider a monitoring model that includes a search area, a charging station, two cooperative UAVs, an intelligent deceptive uncertain moving target, and a fake (false) target. Each UAV is equipped with a limited-capacity rechargeable battery and a communication unit for exchanging the gained experience. The problem of maximizing the detection probability of the uncertain deceptive target using cooperative UAVs is mathematically modeled as a search-benefit maximization problem, which is then reformulated as a Markov decision process (MDP) due to the uncertainty nature of the problem. Because there is no prior information on the targets’ movement, a cooperative RL, is utilized to tackle the problem. The proposed cooperative RL-based algorithm is a distributed collaborative mechanism that enables the two UAVs, i.e., agents, to individually interact with the operating environment and maximize their cumulative rewards by converging to a shared policy while achieving privacy. Simulation results indicate that a cooperative RL-based dual UAV system can noticeably improve the target detection probability, reduce the detection performance, and accelerate the learning speed.

Highlights

UAV cooperative learning for target detection in indoor environments is investigated.
The detection problem of deceptive target using 2-UAVs is mathematically formulated.
The 2-UAV system optimization is reformulated as MDP and solved using cooperative RL.

References

[1]
Abreha H.G., Hayajneh M., Serhani M.A., Federated learning in edge computing: A systematic survey, Sensors 22 (2) (2022),. URL https://www.mdpi.com/1424-8220/22/2/450.
[2]
Akram M.W., Bashir A.K., Shamshad S., Saleem M.A., AlZubi A.A., Chaudhry S.A., et al., A secure and lightweight drones-access protocol for smart city surveillance, IEEE Transactions on Intelligent Transportation Systems (2021) 1–10.
[3]
Al-Hefnawi M., Search mission for an uncertain target using intelligent unmanned aerial vehicles, (Master thesis) Arabic Digital Library-Yarmouk University, 2021.
[4]
Banabilah S., Aloqaily M., Alsayed E., Malik N., Jararweh Y., Federated learning review: Fundamentals, enabling technologies, and future applications, Information Processing & Management 59 (6) (2022).
[5]
Bhagat S., Sujit P., UAV target tracking in urban environments using deep reinforcement learning, in: 2020 international conference on unmanned aircraft systems, IEEE, 2020, pp. 694–701.
[6]
Blasco P., Gunduz D., Dohler M., A learning theoretic approach to energy harvesting communication system optimization, IEEE Transactions on Wireless Communication 12 (4) (2013) 1872–1882.
[7]
Boonyathanmig N., Gongmanee S., Kayunyeam P., Wutticho P., Prongnuch S., Design and implementation of mini-UAV for indoor surveillance, in: 2021 9th international electrical engineering congress, 2021, pp. 305–308,.
[8]
Capitan J., Merino L., Ollero A., Cooperative decision-making under uncertainties for multi-target surveillance with multiples UAVs, Journal of Intelligent and Robotic Systems 84 (1) (2016) 371–386.
[9]
Chen Y.-J., Chang D.-K., Zhang C., Autonomous tracking using a swarm of UAVs: A constrained multi-agent reinforcement learning approach, IEEE Transactions on Vehicular Technology 69 (11) (2020) 13702–13717.
[10]
Guerra, A., Guidi, F., Dardari, D., & Djuric, P. M. (2020). Reinforcement learning for UAV autonomous navigation, mapping and target detection. In 2020 IEEE/ION position, location and navigation symposium (pp. 1004–1013).
[11]
Guerra A., Guidi F., Dardari D., Djurić P.M., Multi-agent Q-learning in UAV networks for target detection and indoor mapping, in: 2021 international balkan conference on communications and networking (BalkanCom), IEEE, 2021, pp. 80–84.
[12]
Han B., Li Q., Cheng C., Research on UAV indoor path planning algorithm based on global subdivision grids, in: 2021 IEEE international geoscience and remote sensing symposium IGARSS, 2021, pp. 8503–8506,.
[13]
Hassija V., Chamola V., Agrawal A., Goyal A., Luong N.C., Niyato D., et al., Fast, reliable, and secure drone communication: A comprehensive survey, IEEE Communications Surveys & Tutorials 23 (4) (2021) 2802–2832,.
[14]
Hayat, S., Yanmaz, E., Brown, T. X., & Bettstetter, C. (2017a). Multi-objective UAV path planning for search and rescue. In 2017 IEEE international conference on robotics and automation (pp. 5569–5574).
[15]
Hayat S., Yanmaz E., Brown T.X., Bettstetter C., Multi-objective UAV path planning for search and rescue, in: 2017 IEEE international conference on robotics and automation, 2017, pp. 5569–5574,.
[16]
Hayat S., Yanmaz E., Muzaffar R., Survey on unmanned aerial vehicle networks for civil applications: A communications viewpoint, IEEE Communications Surveys & Tutorials 18 (4) (2016) 2624–2661.
[17]
Heidrich-Meisner V., Lauer M., Igel C., Riedmiller M.A., Reinforcement learning in a nutshell, in: ESANN, 2007, pp. 277–288.
[18]
Hinas A., Ragel R., Roberts J., Gonzalez F., A framework for multiple ground target finding and inspection using a multirotor UAS, Sensors 20 (1) (2020) 272.
[19]
Hu C., Zhang Z., Yang N., Shin H.-S., Tsourdos A., Fuzzy multiobjective cooperative surveillance of multiple UAVs based on distributed predictive control for unknown ground moving target in urban environment, Aerospace Science and Technology 84 (2019) 329–338.
[20]
Khan L., Saad W., Han Z., Hossain E., Hong C., Federated learning for internet of things: Recent advances, taxonomy, and open challenges, IEEE Communications Surveys & Tutorials 23 (3) (2021) 1759–1799,.
[21]
Lai Y.-C., Huang Z.-Y., Detection of a moving UAV based on deep learning-based distance estimation, Remote Sensing 12 (18) (2020) 3035.
[22]
Li J., Ye D.H., Chung T., Kolsch M., Wachs J., Bouman C., Multi-target detection and tracking from a single camera in unmanned aerial vehicles (UAVs), in: 2016 IEEE/RSJ international conference on intelligent robots and systems, IEEE, 2016, pp. 4992–4997.
[23]
Lin B., Wu L., Niu Y., Zhou H., Ma Z., A multi-target detection framework for multirotor UAV, in: 2020 Chinese automation congress, IEEE, 2020, pp. 1063–1068.
[24]
Masadeh A., Wang Z., Kamal A.E., Convergence-based exploration algorithm for reinforcement learning, Electrical and Computer Engineering Technical Reports and White Papers 1 (2018).
[25]
Moon J., Papaioannou S., Laoudias C., Kolios P., Kim S., Deep reinforcement learning multi-UAV trajectory control for target tracking, IEEE Internet of Things Journal 8 (20) (2021) 15441–15455.
[26]
Mozaffari M., Saad W., Bennis M., Nam Y.-H., Debbah M., A tutorial on UAVs for wireless networks: Applications, challenges, and open problems, IEEE Communications Surveys & Tutorials 21 (3) (2019) 2334–2360.
[27]
Niu G., Zhang J., Guo S., Pun M.-O., Chen C.S., UAV-enabled 3D indoor positioning and navigation based on VLC, in: ICC 2021 - IEEE international conference on communications, 2021, pp. 1–6,.
[28]
Ortiz A., Al-Shatri H., Li X., Weber T., Klein A., Reinforcement learning for energy harvesting decode-and-forward two-hop communications, IEEE Transactions on Green Communications and Networking 1, no.3 (2017) 309–319.
[29]
Pham H.X., La H.M., Feil-Seifer D., Van Nguyen L., Reinforcement learning for autonomous UAV navigation using function approximation, in: 2018 IEEE international symposium on safety, security, and rescue robotics, IEEE, 2018, pp. 1–6.
[30]
Qi J., Zhou Q., Lei L., Zheng K., Federated reinforcement learning: Techniques, applications, and open challenges, 2021, ArXiv arXiv:2108.11887.
[31]
Shakhatreh H., Sawalmeh A.H., Al-Fuqaha A., Dou Z., Almaita E., Khalil I., et al., Unmanned aerial vehicles (UAVs): A survey on civil applications and key research challenges, Ieee Access 7 (2019) 48572–48634.
[32]
Stasinchuk Y., Vrba M., Petrlík M., Báča T., Spurnỳ V., Hert D., et al., A multi-UAV system for detection and elimination of multiple targets, in: 2021 IEEE international conference on robotics and automation, IEEE, 2021, pp. 555–561.
[33]
Sutton R.S., Barto A.G., Reinforcement learning: An introduction, MIT Press, 2018.
[34]
Wang S., Njau C.E., Jiang Z., Design and implementation of multi-UAV cooperation search experimental platform, in: 2021 5th international conference on robotics and automation sciences, IEEE, 2021, pp. 94–98.
[35]
Wang T., Qin R., Chen Y., Snoussi H., Choi C., A reinforcement learning approach for UAV target searching and tracking, Multimedia Tools and Applications 78 (4) (2019) 4347–4364.
[36]
Wei X.L., Huang X.L., Lu T., Song G.G., An improved method based on deep reinforcement learning for target searching, in: 2019 4th international conference on robotics and automation engineering, IEEE, 2019, pp. 130–134.
[37]
Wu D., Deng Y., Li M., FL-MGVN: Federated learning for anomaly detection using mixed gaussian variational self-encoding network, Information Processing & Management 59 (2) (2022),. URL https://www.sciencedirect.com/science/article/pii/S0306457321003113.
[38]
Yang B., Cao X., Yuen C., Qian L., Offloading optimization in edge computing for deep-learning-enabled target tracking by internet of UAVs, IEEE Internet of Things Journal 8 (12) (2020) 9878–9893.
[39]
Yilmaz K., Kaya O., Uslu E., Indoor UAV localization and 3D mapping using visual odometry, in: 2020 innovations in intelligent systems and applications conference, 2020, pp. 1–5,.
[40]
Yuan H., Xiao C., Zhan W., Wang Y., Shi C., Ye H., et al., Target detection, positioning and tracking using new UAV gas sensor systems: Simulation and analysis, Journal of Intelligent and Robotic Systems 94 (3) (2019) 871–882.
[41]
Yue W., Guan X., Xi Y., Reinforcement learning based approach for multi-UAV cooperative searching in unknown environments, in: 2019 Chinese Automation Congress, IEEE, 2019, pp. 2018–2023.

Cited By

View all
  • (2024)Autonomous UAV-based surveillance system for multi-target detection using reinforcement learningCluster Computing10.1007/s10586-024-04452-027:7(9381-9394)Online publication date: 1-Oct-2024
  • (2023)A Survey on Edge Intelligence and Lightweight Machine Learning Support for Future Applications and ServicesJournal of Data and Information Quality10.1145/358175915:2(1-30)Online publication date: 25-Jan-2023

Index Terms

  1. Federated reinforcement learning approach for detecting uncertain deceptive target using autonomous dual UAV system
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image Information Processing and Management: an International Journal
        Information Processing and Management: an International Journal  Volume 60, Issue 2
        Mar 2023
        1443 pages

        Publisher

        Pergamon Press, Inc.

        United States

        Publication History

        Published: 01 March 2023

        Author Tags

        1. Cooperative learning
        2. Federated learning
        3. Artificial intelligence
        4. Emerging UAV
        5. Indoor environment

        Qualifiers

        • Research-article

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 25 Dec 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)Autonomous UAV-based surveillance system for multi-target detection using reinforcement learningCluster Computing10.1007/s10586-024-04452-027:7(9381-9394)Online publication date: 1-Oct-2024
        • (2023)A Survey on Edge Intelligence and Lightweight Machine Learning Support for Future Applications and ServicesJournal of Data and Information Quality10.1145/358175915:2(1-30)Online publication date: 25-Jan-2023

        View Options

        View options

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media