Search Results (277)

Search Parameters:
Keywords = real-time multi-agents

33 pages, 7735 KiB  
Article
Control and Optimization of Hydrogen Hybrid Electric Vehicles Using GPS-Based Speed Estimation
by Nouha Mansouri, Aymen Mnassri, Sihem Nasri, Majid Ali, Abderezak Lashab, Juan C. Vasquez and Josep M. Guerrero
Electronics 2025, 14(1), 110; https://doi.org/10.3390/electronics14010110 - 30 Dec 2024
Viewed by 525
Abstract
This paper investigates the feasibility of hydrogen-powered hybrid electric vehicles as a solution to transportation-related pollution. It focuses on optimizing energy use to improve efficiency and reduce emissions. The study details the creation and real-time performance assessment of a hydrogen hybrid electric vehicle (HHEV) system using an STM32F407VG board. This system includes a fuel cell (FC) as the main energy source, a battery (Bat) to provide energy during hydrogen supply disruptions, and a supercapacitor (SC) to handle power fluctuations. A multi-agent-based artificial intelligence tool is used to model the system components, and an energy management algorithm (EMA) is applied to optimize energy use and support decision-making. Real Global Positioning System (GPS) data are analyzed to estimate energy consumption based on trip and speed parameters. The EMA, developed and implemented in real time using Matlab/Simulink (2016), identifies the most energy-efficient routes. The results show that the proposed vehicle architecture and management strategy effectively select optimal routes with minimal energy use.
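The source hierarchy described in this abstract (fuel cell as primary source, battery as backup during hydrogen supply disruptions, supercapacitor for fast fluctuations) can be sketched as a rule-based power split. This is an illustrative sketch only, not the paper's EMA: the function name, thresholds, and power limits are all assumptions.

```python
def ema_power_split(p_demand, soc_bat, h2_level, p_fluct,
                    fc_max=5.0, bat_max=3.0):
    """Toy rule-based energy split (kW) across FC, battery, and SC.

    p_demand: total power demand; p_fluct: fast-varying component;
    soc_bat, h2_level: normalized state of charge / hydrogen level.
    Thresholds (0.1, 0.2) and limits are illustrative, not from the paper.
    """
    fc = bat = 0.0
    sc = p_fluct                      # supercapacitor absorbs fluctuations
    steady = p_demand - p_fluct       # steady-state demand
    if h2_level > 0.1:                # fuel cell available: primary source
        fc = min(steady, fc_max)
        bat = steady - fc             # battery covers any remainder
    elif soc_bat > 0.2:               # hydrogen disrupted: battery backup
        bat = min(steady, bat_max)
    return fc, bat, sc
```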
Show Figures

Figure 1: Hydrogen hybrid electric vehicle design.
Figure 2: Speed estimator.
Figure 3: Road recognition by GPS.
Figure 4: System control behavior based on decision-making.
Figure 5: EMA state diagram.
Figure 6: FC/Bat/SC configuration EMA.
Figure 7: Role of supercapacitors in hybrid hydrogen–electric vehicles.
Figure 8: Component sizing scheme.
Figure 9: Scheme of system efficiency.
Figure 10: Possible routes detected by Google Maps.
Figure 11: Reference vehicle speed per road.
Figure 12: Comparison of actual and predicted speed profiles with statistical evaluation across roads.
Figure 13: Road 1 performance simulation.
Figure 14: Road 2 performance simulation.
Figure 15: Road 3 performance simulation.
Figure 16: HEV performance per road.
Figure 17: Interactive real-time MATLAB interface.
24 pages, 1649 KiB  
Article
Heterogeneous Multi-Agent Risk-Aware Graph Encoder with Continuous Parameterized Decoder for Autonomous Driving Trajectory Prediction
by Shaoyu Sun, Chunyang Wang, Bo Xiao, Xuelian Liu, Chunhao Shi, Rongliang Sun and Ruijie Han
Electronics 2025, 14(1), 105; https://doi.org/10.3390/electronics14010105 - 30 Dec 2024
Viewed by 254
Abstract
Trajectory prediction is a critical component of autonomous driving, intelligent transportation systems, and human–robot interaction, particularly in complex environments like intersections, where diverse road constraints and multi-agent interactions significantly increase the risk of collisions. To address these challenges, a Heterogeneous Risk-Aware Graph Encoder with Continuous Parameterized Decoder for Trajectory Prediction (HRGC) is proposed. The architecture integrates a heterogeneous risk-aware local graph attention encoder, a low-rank temporal transformer, a fused lane and global interaction encoder layer, and a continuous parameterized decoder. First, a heterogeneous risk-aware edge-enhanced local attention encoder is proposed: it enhances edge features using risk metrics, constructs graph structures through graph optimization and spectral clustering, maps the enhanced edge features to the corresponding graph structure indices, and enriches node features with local agent-to-agent attention. Risk-aware edge attention is aggregated to update node features, capturing spatial and collision-aware representations and embedding crucial risk information into agents' features. Next, the low-rank temporal transformer is employed to reduce computational complexity while preserving accuracy. By modeling agent-to-lane relationships, the encoder captures critical map context, enhancing the understanding of agent behavior; global interaction further refines node-to-node interactions via attention mechanisms, integrating risk and spatial information for improved trajectory encoding. Finally, a trajectory decoder uses the encoded features to generate control points for continuous parameterized curves. These control points are multiplied by dynamically adjusted basis functions, determined by an adaptive knot vector that adjusts to velocity and curvature. This mechanism ensures precise local control and superior handling of sharp turns and speed variations, resulting in more accurate real-time predictions in complex scenarios. The HRGC network achieves superior performance on the Argoverse 1 benchmark, outperforming state-of-the-art methods in complex urban intersections.
(This article belongs to the Section Artificial Intelligence)
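The decoder described above multiplies control points by B-spline basis functions. A minimal sketch of evaluating such a curve with De Boor's algorithm is shown below, assuming a uniform clamped knot vector rather than the paper's velocity- and curvature-adaptive one; function names are illustrative.

```python
import numpy as np

def de_boor(u, t, ctrl, k):
    """Evaluate a degree-k B-spline with knots t and control points ctrl at u."""
    s = np.searchsorted(t, u, side='right') - 1   # knot span containing u
    s = min(max(s, k), len(ctrl) - 1)
    d = [np.asarray(ctrl[j], dtype=float) for j in range(s - k, s + 1)]
    for r in range(1, k + 1):
        for j in range(k, r - 1, -1):
            i = j + s - k
            denom = t[i + k - r + 1] - t[i]
            alpha = 0.0 if denom == 0.0 else (u - t[i]) / denom
            d[j] = (1.0 - alpha) * d[j - 1] + alpha * d[j]
    return d[k]

def trajectory(ctrl, k=3, n_samples=50):
    """Sample a clamped cubic B-spline through the decoder's control points."""
    n = len(ctrl)
    # Clamped knot vector: k+1 repeated knots at each end, uniform interior.
    interior = np.linspace(0.0, 1.0, n - k + 1)[1:-1]
    t = np.concatenate([np.zeros(k + 1), interior, np.ones(k + 1)])
    return np.array([de_boor(u, t, ctrl, k)
                     for u in np.linspace(0.0, 1.0, n_samples)])
```

With a clamped knot vector the curve interpolates the first and last control points, which is why the decoder can pin the trajectory to the agent's current position.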
Show Figures

Figure 1: Autonomous driving trajectory prediction.
Figure 2: Overall architecture of the HRGC trajectory predictor. It comprises four main blocks, processing heterogeneous multi-agent local feature embeddings through risk-aware edge-enhanced node graph attention. First, node representations are processed with a rotation matrix; a heterogeneous graph is then constructed by designing and aggregating risk-aware edges, and nodes are updated with attention. A low-rank temporal transformer layer extracts temporal features, which are fed into an agent-to-lane layer that fuses agent and lane features for better local scene embedding, followed by a global interaction layer that extracts agent and lane local features. Finally, a continuous parameterized trajectory decoder decodes these features into continuous parameterized trajectories.
Figure 3: Risk-aware edge layer.
Figure 4: Moving direction with velocity risk.
Figure 5: Time to collision.
Figure 6: Edge index via graph optimization and clustering. A Gaussian kernel computes the similarity of every node pair to obtain the adjacency matrix; the Laplacian matrix is computed by Equation (6), and minimum-cut graph optimization on the Laplacian yields the cluster indices of nodes C_i^t and C_j^t.
Figure 7: B-spline with control points.
Figure 8: Ablation study of decoder variants with the continuous trajectory decoder.
Figure 9: Inference speed and parameter count versus minADE, compared with state-of-the-art methods.
Figure 10: The red column represents the successful intersection case; the green column represents the continuous parameterized trajectory prediction performance analysis.
Figure 11: Failure case.
27 pages, 9470 KiB  
Article
Multi-Objective Dynamic Path Planning with Multi-Agent Deep Reinforcement Learning
by Mengxue Tao, Qiang Li and Junxi Yu
J. Mar. Sci. Eng. 2025, 13(1), 20; https://doi.org/10.3390/jmse13010020 - 27 Dec 2024
Viewed by 287
Abstract
Multi-agent reinforcement learning (MARL) is characterized by its simple structure and strong adaptability, which has led to its widespread application in path planning. To address the challenge of optimal path planning for mobile agent clusters in uncertain environments, a multi-objective dynamic path planning model (MODPP) based on multi-agent deep reinforcement learning (MADRL) is proposed. The model is suited to complex, unstable task environments prone to dimensionality explosion, and it offers scalability. The approach consists of two components, an action evaluation module and an action decision module, and uses a centralized training with decentralized execution (CTDE) architecture. During training, agents within the cluster learn cooperative strategies while communicating with one another; consequently, they can later navigate task environments without communication, achieving collision-free paths that jointly optimize multiple sub-objectives by minimizing time, distance, and the overall cost associated with turning. Furthermore, during real task execution, agents acting as mobile entities can perform real-time obstacle avoidance. Finally, environments such as a simple multi-objective environment and a complex multi-objective environment were designed on the OpenAI Gym platform to analyze the rationality and effectiveness of the multi-objective dynamic path planning through minimum-cost and collision-risk assessments. The impact of the reward function configuration on agent strategies is also discussed.
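The multiple sub-objectives named in the abstract (time, distance, turning cost, plus collision avoidance) are typically combined into a single shaped reward per step. A minimal sketch under assumed weights follows; the weights, the proximity penalty, and the function signature are illustrative, not the paper's reward design.

```python
import numpy as np

def step_reward(pos, prev_pos, heading, prev_heading, goal, others,
                w_time=0.1, w_dist=1.0, w_turn=0.5, safe_d=1.0):
    """Toy multi-objective shaping reward for one agent's step.

    - constant time penalty (w_time) encourages short episodes,
    - progress toward the goal is rewarded (w_dist),
    - heading changes are penalized (w_turn) to reduce turning cost,
    - proximity to other agents inside safe_d is penalized.
    """
    progress = np.linalg.norm(goal - prev_pos) - np.linalg.norm(goal - pos)
    turn = abs(heading - prev_heading)
    r = -w_time + w_dist * progress - w_turn * turn
    for o in others:                      # linear penalty inside safety radius
        d = np.linalg.norm(pos - o)
        if d < safe_d:
            r -= (safe_d - d)
    return r
```

Tuning these weights changes the learned strategy, which is the trade-off the paper's discussion of reward configuration examines.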
Show Figures

Figure 1: The framework of the MADRL-based MODPP method.
Figure 2: Multi-agent reinforcement learning task.
Figure 3: State transitions of the environment.
Figure 4: Action space.
Figure 5: Safe distance of the proximity penalty.
Figure 6: The CTDE framework.
Figure 7: Training and decision planning.
Figure 8: Panels (a,c,e,g) show the action paths of the three agents; panels (b,d,f,h) show a simulation of the agents' actions. The stars represent the goal sites.
Figure 9: Panels (a,b) show the speed change of the three agents in the x and y directions.
Figure 10: The rewards given by the environment to the agent over time.
Figure 11: Panels (a,c,e,g) show the action paths of the three agents; panels (b,d,f,h) show a simulation of the agents' actions. The stars represent the goal sites.
Figure 12: Panels (a,b) show the speed change of the three agents in the x and y directions.
Figure 13: The rewards given by the environment to the agent over time.
Figure 14: Panels (a,c,e,g) show the action paths of the three agents; panels (b,d,f,h) show simulations of the agents' actions. The stars represent the goal sites.
Figure 15: Panels (a,b) show the speed change of the three agents in the x and y directions.
Figure 16: The reward given to the agent by the environment over time steps.
Figure 17: (a) Comparison of four algorithms in a simple multi-objective environment; (b–d) the environmental reward received by the agents over the model training episodes.
26 pages, 10534 KiB  
Article
Assessment of the Impact of Multi-Agent Model-Based Traffic Optimization Interventions on Urban Travel Behavior
by Lihu Pan, Nan Yang, Linliang Zhang, Rui Zhang, Binhong Xie and Huimin Yan
Electronics 2025, 14(1), 13; https://doi.org/10.3390/electronics14010013 - 24 Dec 2024
Viewed by 243
Abstract
With the continuous increase in car ownership, alleviating traffic congestion and reducing carbon emissions have become key challenges in urban traffic management. This study constructs a multi-agent model to evaluate the impact of various traffic optimization interventions on citizens' travel behavior and traffic carbon emission levels. Unlike previous mathematical models, this model integrates computer technology and geographic information systems, abstracting travelers as agents with self-control capabilities who make independent decisions based on their own circumstances, thus reflecting individual differences in travel behavior. Using the real geographical and social environment of the high-density travel area in Xiaodian District, Taiyuan City as a case study, this research explores the overall improvement of the urban transportation system through multiple traffic optimization interventions, such as a parking reservation system, promotion of the park-and-ride mode, and optimization of public transportation services. The study demonstrates that, compared to reduced bus fares, travelers exhibit greater sensitivity to waiting times. Reducing bus departure intervals can increase the proportion of park-and-ride trips to 25.79%, surpassing the 19.19% increase observed with fare adjustments. A moderate increase in the proportion of reservable parking spaces can raise the public transport load to 49.85%. The synergistic effect of a combined strategy can further boost the public transport share to 50.62% while increasing the park-and-ride trip proportion to 33.6%, highlighting the comprehensive benefit of implementing multiple strategies in tandem. When the parking reservation system is effectively implemented, carbon dioxide emissions can be reduced from over 800 kg to below 200 kg, and the proportion of vehicle cruising can decrease from over 20% to under 15%. These results underscore the critical role of the parking reservation strategy in optimizing traffic flow and advancing environmental sustainability.
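The finding that travelers respond more to waiting time than to fare is the kind of behavior a discrete mode-choice model captures. A minimal binary-logit sketch is given below; the utility coefficients and alternative-specific constant are assumptions chosen so that |beta_wait| > |beta_fare|, and this is not the agents' actual decision rule from the paper.

```python
import math

def pt_share(fare, wait_min, beta_fare=-0.1, beta_wait=-0.15, asc_car=0.5):
    """Toy binary logit: probability a traveler chooses public transport
    over the car, given fare (currency units) and expected wait (minutes).

    Negative coefficients mean higher fare/wait lowers PT utility; the
    larger |beta_wait| encodes the stronger sensitivity to waiting time.
    """
    u_pt = beta_fare * fare + beta_wait * wait_min   # public transport utility
    u_car = asc_car                                  # car utility (constant)
    return math.exp(u_pt) / (math.exp(u_pt) + math.exp(u_car))
```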
Show Figures

Figure 1: Target study area.
Figure 2: Model framework.
Figure 3: Model hierarchy.
Figure 4: Real urban road network.
Figure 5: System operation process.
Figure 6: The impact of bus fares and departure frequency on travel choices.
Figure 7: The impact of bus frequency on average arrival time.
Figure 8: The impact of the proportion of reservable parking spaces on the uptake of park-and-ride and public transport.
Figure 9: The impact of combination strategies on the choice of park-and-ride travel.
Figure 10: The impact of a combination strategy on the average arrival time.
Figure 11: The impact of combined strategies on public transport load.
Figure 12: The impact of parking reservations on cruising time and cruising ratio.
Figure 13: The impact of parking reservations on cruising distance and cruising speed.
Figure 14: The impact of parking reservations on the average distance traveled by car and the public transport load factor.
Figure 15: Average vehicle speed and average cruising time in reserved and non-reserved scenarios.
Figure 16: Cruising ratio and cruising distance in reserved and non-reserved scenarios.
Figure 17: Simulation of multiple processes in parallel.
16 pages, 2276 KiB  
Article
Adaptive Control of VSG Inertia Damping Based on MADDPG
by Demu Zhang, Jing Zhang, Yu He, Tao Shen and Xingyan Liu
Energies 2024, 17(24), 6421; https://doi.org/10.3390/en17246421 - 20 Dec 2024
Viewed by 277
Abstract
As renewable energy sources become more integrated into the power grid, traditional virtual synchronous generator (VSG) control strategies have become inadequate for today's low-damping, low-inertia power systems. Therefore, this paper proposes a VSG inertia and damping adaptive control method based on the multi-agent deep deterministic policy gradient (MADDPG). The paper first introduces the working principles of virtual synchronous generators and establishes a corresponding VSG model. Based on this model, the influence of variations in the virtual inertia (J) and damping (D) coefficients on fluctuations in active power output is examined, defining the action space for J and D. The proposed method comprises two phases: centralized training and decentralized execution. In the centralized training phase, each agent's critic network shares global observation and action information to guide the actor network in policy optimization. In the decentralized execution phase, agents observe frequency deviations and the rate of change of angular frequency, using reinforcement learning to adjust the virtual inertia J and damping coefficient D in real time. Finally, the effectiveness of the proposed MADDPG control strategy is validated through comparison with adaptive control and DDPG control methods.
(This article belongs to the Special Issue Planning, Operation, and Control of New Power Systems)
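The quantities the agents adjust, J and D, enter the VSG dynamics through the swing equation. A one-step Euler integration of a simplified per-unit form, J·d(Δω)/dt = P_m − P_e − D·Δω, is sketched below; in the paper's scheme the MADDPG agents would set J and D each step, whereas here they are plain inputs, and the numbers are illustrative.

```python
def vsg_step(delta_omega, p_m, p_e, J, D, dt=1e-3):
    """One Euler step of a simplified per-unit VSG swing equation.

    delta_omega: angular-frequency deviation; p_m, p_e: mechanical
    (reference) and electrical power; J, D: virtual inertia and damping.
    Larger D pulls delta_omega back toward zero faster; larger J slows
    the response, which is the trade-off the adaptive control exploits.
    """
    d_omega_dt = (p_m - p_e - D * delta_omega) / J
    return delta_omega + dt * d_omega_dt
```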
Show Figures

Figure 1: Topology diagram of the VSG principle.
Figure 2: Equivalent circuit model.
Figure 3: Active power response curve (J varies).
Figure 4: Active power response curve (D varies).
Figure 5: Active and reactive power control loop.
Figure 6: MADDPG parameter self-adaptation.
Figure 7: Pre-training active power fluctuation chart.
Figure 8: Training results chart.
Figure 9: Case 1 frequency response curves.
Figure 10: Case 1 power response curves.
Figure 11: Case 2 frequency response curves.
Figure 12: J and D variation chart.
25 pages, 6743 KiB  
Article
Online Autonomous Motion Control of Communication-Relay UAV with Channel Prediction in Dynamic Urban Environments
by Cancan Tao and Bowen Liu
Drones 2024, 8(12), 771; https://doi.org/10.3390/drones8120771 - 19 Dec 2024
Viewed by 474
Abstract
To improve the network performance of multi-unmanned ground vehicle (UGV) systems in urban environments, this article proposes a novel online autonomous motion-control method for a relay UAV. The problem is solved by jointly considering unknown RF channel parameters, unknown multi-agent mobility, the impact of the environment on channel characteristics, and the unavailability of angle-of-arrival (AoA) information for the received signal, making the solution more practical and comprehensive. The method consists of two parts: wireless channel parameter estimation and optimal relay position search. Because, in practical applications, the radio frequency (RF) channel parameters in complex urban environments are difficult to obtain in advance and are constantly changing, an estimation algorithm based on Gaussian process learning is proposed for online evaluation of the wireless channel parameters near the current position of the UAV. For the optimal relay position search, to improve the real-time performance of the method, a line search algorithm and a general gradient-based algorithm are proposed for point-to-point and multi-node communication scenarios, respectively, reducing the two-dimensional search to a one-dimensional one; the stability proof and convergence conditions of the algorithms are given. Comparative experiments and simulation results under different scenarios show that the proposed motion-control method can drive the UAV to reach or track the optimal relay position and improve network performance, while demonstrating the benefit of considering the impact of the environment on channel characteristics.
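The Gaussian-process channel estimation referred to above can be sketched as plain GP regression over UAV positions. The squared-exponential kernel and all hyperparameters below are assumptions for illustration, not the paper's learned channel model.

```python
import numpy as np

def gp_predict(X_train, y_train, X_test, length=10.0, sigma_f=1.0, sigma_n=0.1):
    """GP posterior mean of channel gain (e.g., dB) at query positions.

    X_train: (n, 2) measurement positions; y_train: (n,) measured gains;
    X_test: (m, 2) query positions. Squared-exponential kernel with
    illustrative length scale, signal variance, and noise level.
    """
    def k(A, B):
        d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
        return sigma_f ** 2 * np.exp(-0.5 * d2 / length ** 2)
    K = k(X_train, X_train) + sigma_n ** 2 * np.eye(len(X_train))
    return k(X_test, X_train) @ np.linalg.solve(K, y_train)
```

The posterior mean near the UAV's current position is what a gradient-based relay-position search would then climb.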
Show Figures

Figure 1: Illustration of the air-to-ground relay communication scenario in urban environments.
Figure 2: Motion control framework.
Figure 3: Schematic diagram of air-to-ground signal propagation.
Figure 4: Flight trajectories of the UAV supporting communication for two stationary UGVs.
Figure 5: Changes in communication performance when the UAV supports communication for two stationary UGVs.
Figure 6: Flight trajectories of the UAV supporting communication for multiple stationary UGVs.
Figure 7: Changes in communication performance when the UAV supports communication for multiple stationary UGVs.
Figure 8: Flight trajectories of the UAV supporting point-to-point communication for two moving UGVs.
Figure 9: Changes in communication performance when the UAV supports point-to-point communication for two moving UGVs.
Figure 10: Flight trajectories of the UAV supporting multi-node communication for multiple moving UGVs.
Figure 11: Changes in communication performance when the UAV supports multi-node communication for multiple moving UGVs.
Figure 12: Flight trajectories of the UAV supporting point-to-point communication for two moving UGVs with unknown channel parameters.
Figure 13: Changes in communication performance when the UAV supports point-to-point communication for two moving UGVs with unknown channel parameters.
Figure 14: Flight trajectories of the UAV supporting multi-node communication for multiple moving UGVs with unknown channel parameters.
Figure 15: Changes in communication performance when the UAV supports multi-node communication for multiple moving UGVs with unknown channel parameters.
17 pages, 2088 KiB  
Article
Personalized Clustering for Emotion Recognition Improvement
by Laura Gutiérrez-Martín, Celia López-Ongil, Jose M. Lanza-Gutiérrez and Jose A. Miranda Calero
Sensors 2024, 24(24), 8110; https://doi.org/10.3390/s24248110 - 19 Dec 2024
Viewed by 340
Abstract
Emotion recognition through artificial intelligence and smart sensing of physical and physiological signals (affective computing) is achieving very interesting results in terms of accuracy, inference time, and user-independent models. Applications related to the safety and well-being of people (sexual assault, gender-based violence, child and elder abuse, mental health, etc.) nevertheless require further improvements. Emotion detection should be performed by fast, discreet, low-cost systems working in real time and real life (wearable devices, wireless communications, battery-powered). Furthermore, emotional reactions to violence are not the same in all people, so large general models cannot be applied to a multi-user system for protecting people; health and social workers and law-enforcement agents would welcome customized, lightweight AI models. These semi-personalized models apply to clusters of subjects who share similar emotional reactions to external stimuli. This customization requires several steps: creating clusters of subjects with similar behaviors, creating AI models for every cluster, continually updating these models with new data, and enrolling new subjects in clusters when required. This work presents an initial approach to clustering the compiled labeled data (physiological data together with emotional labels), as well as a method to enroll new users with unlabeled data once the AI models are generated. The complete methodology should be exportable to any other expert system where unlabeled data are added during in-field operation and different data profiles exist. Experimental results demonstrate an improvement of 5% in accuracy and 4% in F1 score with respect to our baseline general model, along with 32% and 58% reductions in their variability, respectively.
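The enrollment step described above, assigning a new subject with unlabeled data to an existing profile cluster, can be sketched as nearest-centroid assignment over the subject's aggregate feature vector. This is an illustrative sketch under the assumption of Euclidean distance in feature space, not the paper's assignment method.

```python
import numpy as np

def enroll_user(user_features, centroids):
    """Assign a new (unlabeled) subject to the closest profile cluster.

    user_features: (d,) aggregate physiological feature vector for the
    new subject; centroids: (k, d) cluster centroids learned from the
    labeled subjects. Returns the index of the nearest cluster, whose
    semi-personalized model would then serve this subject.
    """
    dists = np.linalg.norm(centroids - user_features, axis=1)
    return int(np.argmin(dists))
```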
Show Figures

Figure 1: User profile clustering based on labeled observations: training and testing scheme.
Figure 2: Unlabeled observation cluster-assignment scheme.
Figure 3: Scheme of phases 1 and 2 of the experimental procedure for evaluating the impact of the methodologies on fear detection.
Figure 4: Scheme of phase 3 of the experimental procedure for evaluating the impact of the methodologies on fear detection.
Figure 5: Average performance metrics (accuracy and F1 score) per typology cluster with a parameter sweep for the general model contribution threshold (0: only personalized model; 1: only general model).
26 pages, 6416 KiB  
Article
Advanced Monocular Outdoor Pose Estimation in Autonomous Systems: Leveraging Optical Flow, Depth Estimation, and Semantic Segmentation with Dynamic Object Removal
by Alireza Ghasemieh and Rasha Kashef
Sensors 2024, 24(24), 8040; https://doi.org/10.3390/s24248040 - 17 Dec 2024
Viewed by 394
Abstract
Autonomous technologies have revolutionized transportation, military operations, and space exploration, necessitating precise localization in environments where traditional GPS-based systems are unreliable or unavailable. While widespread for outdoor localization, GPS systems face limitations in obstructed environments such as dense urban areas, forests, and indoor spaces. Moreover, GPS reliance introduces vulnerabilities to signal disruptions, which can lead to significant operational failures. Hence, developing alternative localization techniques that do not depend on external signals is essential, showing a critical need for robust, GPS-independent localization solutions adaptable to different applications, ranging from Earth-based autonomous vehicles to robotic missions on Mars. This paper addresses these challenges using Visual odometry (VO) to estimate a camera’s pose by analyzing captured image sequences in GPS-denied areas tailored for autonomous vehicles (AVs), where safety and real-time decision-making are paramount. Extensive research has been dedicated to pose estimation using LiDAR or stereo cameras, which, despite their accuracy, are constrained by weight, cost, and complexity. In contrast, monocular vision is practical and cost-effective, making it a popular choice for drones, cars, and autonomous vehicles. However, robust and reliable monocular pose estimation models remain underexplored. This research aims to fill this gap by developing a novel adaptive framework for outdoor pose estimation and safe navigation using enhanced visual odometry systems with monocular cameras, especially for applications where deploying additional sensors is not feasible due to cost or physical constraints. This framework is designed to be adaptable across different vehicles and platforms, ensuring accurate and reliable pose estimation. 
We integrate advanced control theory to provide safety guarantees for motion control, ensuring that the AV can react safely to the imminent hazards and unknown trajectories of nearby traffic agents. The focus is on creating an AI-driven model(s) that meets the performance standards of multi-sensor systems while leveraging the inherent advantages of monocular vision. This research uses state-of-the-art machine learning techniques to advance visual odometry’s technical capabilities and ensure its adaptability across different platforms, cameras, and environments. By merging cutting-edge visual odometry techniques with robust control theory, our approach enhances both the safety and performance of AVs in complex traffic situations, directly addressing the challenge of safe and adaptive navigation. Experimental results on the KITTI odometry dataset demonstrate a significant improvement in pose estimation accuracy, offering a cost-effective and robust solution for real-world applications. Full article
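The scale ambiguity that makes monocular pose estimation hard can be seen directly in how frame-to-frame estimates are chained. Below is a minimal, planar (2D) sketch of trajectory accumulation from relative poses, purely illustrative and not the paper's model: a monocular pipeline recovers each step's rotation and a unit-norm translation direction, so an external `scale` must be assumed.

```python
import math

def rot(theta):
    """2x2 rotation matrix (planar stand-in for the full SO(3) case)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s], [s, c]]

def matvec(R, v):
    return [R[0][0] * v[0] + R[0][1] * v[1],
            R[1][0] * v[0] + R[1][1] * v[1]]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def accumulate(rel_poses, scale=1.0):
    """Chain frame-to-frame (theta, unit_translation) estimates into a
    global trajectory. Monocular VO only recovers the translation
    direction, so the absolute `scale` must come from an external cue."""
    R = [[1.0, 0.0], [0.0, 1.0]]  # identity: camera starts axis-aligned
    p = [0.0, 0.0]
    trajectory = [p[:]]
    for theta, t in rel_poses:
        step = matvec(R, t)            # rotate the step into the world frame
        p = [p[0] + scale * step[0], p[1] + scale * step[1]]
        R = matmul(R, rot(theta))      # compose the heading
        trajectory.append(p[:])
    return trajectory
```

Four quarter-turns with unit forward motion trace a closed square, which is a quick sanity check that the composition order is right.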
(This article belongs to the Special Issue Sensors for Object Detection, Pose Estimation, and 3D Reconstruction)
Show Figures

Figure 1

Figure 1
<p>Proposed Pipeline Architecture.</p>
Full article ">Figure 2
<p>Optical flow processed output sample for one sequence of frames.</p>
Full article ">Figure 3
<p>Sample output of depth estimation.</p>
Full article ">Figure 4
<p>Sample output of the semantic segmentation.</p>
Full article ">Figure 5
<p>Sample output of dynamic object and sky removal.</p>
Full article ">Figure 6
<p>Step-by-step preprocessing samples.</p>
Full article ">Figure 7
<p>Pose estimator architecture.</p>
Full article ">Figure 8
<p>Train/Loss chart for the KITTI odometry dataset.</p>
Full article ">Figure 9
<p>The validation/Loss chart for the KITTI odometry dataset shows that the pose estimator can learn more rapidly by providing extra scene information, especially semantic segmentation, to add correction weight to each class of objects and remove dynamic ones from the estimations.</p>
Full article ">Figure 10
<p>Train and validation loss for different preprocessing stages, including no preprocessing, OF, OF with depth estimation, and OF with depth and semantic segmentation.</p>
Full article ">Figure 11
<p>Proposed model’s tracking experience output for the KITTI odometry dataset. The <span class="html-italic">X</span> and <span class="html-italic">Y</span>-axis units are in meters.</p>
Full article ">Figure 12
<p>Train/Loss with different learning rates.</p>
Full article ">Figure 13
<p>Validation/Loss with different learning rates.</p>
Full article ">
26 pages, 3702 KiB  
Article
Real-Time Scheduling with Independent Evaluators: Explainable Multi-Agent Approach
by Artem Isakov, Danil Peregorodiev, Ivan Tomilov, Chuyang Ye, Natalia Gusarova, Aleksandra Vatian and Alexander Boukhanovsky
Technologies 2024, 12(12), 259; https://doi.org/10.3390/technologies12120259 - 14 Dec 2024
Viewed by 652
Abstract
This study introduces a multi-agent reinforcement learning approach to address the challenges of real-time scheduling in dynamic environments, with a specific focus on healthcare operations. The proposed system integrates the Human-in-the-Loop (HITL) paradigm, providing continuous feedback from human evaluators, and it employs a sophisticated reward function to attenuate the effects of human-driven events. A novel mapping between reinforcement learning (RL) concepts and the Belief–Desire–Intention (BDI) framework is developed to enhance the explainability of the agent’s decision-making. The system is designed to adapt to changes in patient conditions and preferences while minimizing disruptions to existing schedules. Experimental results show a notable decrease in patient waiting times compared to conventional methods while adhering to operator-induced constraints. This approach offers a robust, explainable, and adaptable solution for the challenging task of scheduling in environments that require human-centered decision-making. Full article
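Two of the mechanisms described in the figures, the cyclic action space over a one-week horizon and the bell-shaped reward centered on a target day, are easy to sketch. The snippet below is illustrative only; the `sigma` choice and function names are assumptions, not the paper's values.

```python
import math

DAYS = 7  # one-week planning horizon, as in the cyclic action space

def shift_bid(day, action):
    """Cyclic action space: -1 = one day earlier, 0 = stay, +1 = one day
    later, wrapping between Monday (0) and Sunday (6) so no action is
    ever prohibited at the horizon boundaries."""
    return (day + action) % DAYS

def reward(day, target, sigma=1.0):
    """Bell-shaped reward peaking at the target day: the Gaussian
    (alpha = 2) member of the Levy alpha-stable family used in the
    paper; sigma here is an illustrative choice."""
    return math.exp(-((day - target) ** 2) / (2.0 * sigma ** 2))
```

The wraparound is what resolves the boundary collisions discussed in the Figure 3 caption: an agent on Monday can still move "earlier" by transitioning to Sunday.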
(This article belongs to the Section Information and Communication Technologies)
Show Figures

Figure 1

Figure 1
<p>State space: (<b>a</b>) special case with uniform target distribution across the planning horizon with fixed <math display="inline"><semantics> <mi>λ</mi> </semantics></math>. (<b>b</b>) General case with different target <math display="inline"><semantics> <mi>λ</mi> </semantics></math> levels across the planning horizon.</p>
Full article ">Figure 2
<p>Time shift in the planning horizon.</p>
Full article ">Figure 3
<p>Cyclic action space for a one-week-long planning horizon. Starting from Tuesday, the agent can move its bid to Monday, to Wednesday, or leave it as-is. The transition between Monday and Sunday provides an illustrative example of how prohibited actions are addressed. Remaining on Monday may result in a collision due to the lack of available space to shift towards the left boundary of the planning horizon. Consequently, we incorporated a transition from Monday to Sunday to address this issue. The same applies to the last day of the planning horizon.</p>
Full article ">Figure 4
<p>Observation space: (<b>a</b>) <span class="html-italic">w</span> is set to 3, which implies that the agent observes the environment state for three days in a row, and it can also see its absolute position in the planning horizon. (<b>b</b>) Some agents may need more information to learn the optimal policy; therefore, we deliberately broaden the agent window size, as shown here. The longer it takes to learn the optimal policy, the wider the required window size. This provides the agents with more information; however, it also results in higher penalties, as shown later.</p>
Full article ">Figure 5
<p>Patient’s health state.</p>
Full article ">Figure 6
<p>Reward function design: (<b>a</b>) the initial <math display="inline"><semantics> <mi>α</mi> </semantics></math> level for the Levy alpha-stable distribution is set to 2, which makes it close to a Gaussian distribution with location <math display="inline"><semantics> <mrow> <mi>μ</mi> <mo>=</mo> <mi>d</mi> </mrow> </semantics></math>, scale <math display="inline"><semantics> <mrow> <mi>σ</mi> <mo>=</mo> <mn>1</mn> </mrow> </semantics></math>, and skewness <math display="inline"><semantics> <mrow> <mi>β</mi> <mo>=</mo> <mn>0</mn> </mrow> </semantics></math>. (<b>b</b>) However, the operator may want to increase the window parameter <span class="html-italic">w</span> while decreasing the <math display="inline"><semantics> <mi>α</mi> </semantics></math> level. A slightly decreasing <math display="inline"><semantics> <mi>α</mi> </semantics></math> level will move the curve towards Cauchy distribution.</p>
Full article ">Figure 7
<p>Successive re-scheduling.</p>
Full article ">Figure 8
<p>Belief–desire–intention terminology mapping.</p>
Full article ">Figure 9
<p>Metric charts: (<b>a</b>) Error metrics of critic networks. (<b>b</b>) Actor networks. (<b>c</b>) Average episodic reward during model training.</p>
Full article ">Figure 10
<p>A four-step game episode.</p>
Full article ">Figure 11
<p>Patient allocation during iterative rescheduling: (<b>a</b>) 1st episode out of an 8-episode game (<b>b</b>) 4th episode out of an 8-episode game (<b>c</b>) 8th episode out of an 8-episode game.</p>
Full article ">Figure 12
<p>Prompt sensitivity analysis.</p>
Full article ">Figure 13
<p>Direct preference optimization.</p>
Full article ">Figure 14
<p>Day and time exploration space.</p>
Full article ">Figure A1
<p>Multi-level explainability in the human–agent systems.</p>
Full article ">Figure A2
<p>Chain of thoughts for interpreting system logs.</p>
Full article ">Figure A3
<p>Chain of thoughts used to analyze prompts sensitivity.</p>
Full article ">
17 pages, 6290 KiB  
Article
Real-Time Detection of IoT Anomalies and Intrusion Data in Smart Cities Using Multi-Agent System
by Maria Viorela Muntean
Sensors 2024, 24(24), 7886; https://doi.org/10.3390/s24247886 - 10 Dec 2024
Viewed by 461
Abstract
Analyzing IoT data is an important challenge in the smart cities domain due to the complexity of network traffic generated by a large number of interconnected devices: smart cameras, light bulbs, motion sensors, voice assistants, and so on. To overcome this issue, a multi-agent system is proposed to deal with all machine learning steps, from preprocessing and labeling data to discovering the most suitable model for the analyzed dataset. This paper shows that dividing the work into different tasks, managed by specialized agents, and evaluating the discovered models by an Expert System Agent leads to better results in the learning process. Full article
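The division of labor described here, classifier agents each producing a candidate model and an Expert System Agent judging them, can be sketched as a simple selection rule. Everything below (field names, the build-time penalty weight, the example numbers) is illustrative, not taken from the paper.

```python
def expert_system_agent(candidates, time_weight=0.01):
    """Rank the models reported by the classifier agents: favor accuracy,
    lightly penalize slow build times (the weight is an assumption)."""
    return max(candidates,
               key=lambda m: m["accuracy"] - time_weight * m["build_time_s"])

# Hypothetical reports gathered from specialized classifier agents.
reports = [
    {"name": "J48",        "accuracy": 0.96, "build_time_s": 12.0},
    {"name": "kNN (k=3)",  "accuracy": 0.97, "build_time_s": 0.5},
    {"name": "NaiveBayes", "accuracy": 0.91, "build_time_s": 1.1},
]
best = expert_system_agent(reports)
```

This mirrors the paper's pipeline at a high level: specialized agents propose, and a single evaluator agent decides which model suits the analyzed dataset.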
(This article belongs to the Special Issue Advanced IoT Systems in Smart Cities: 2nd Edition)
Show Figures

Figure 1

Figure 1
<p>Multi-agent system architecture for IoT data.</p>
Full article ">Figure 2
<p>Data preprocessing system architecture for IoT data.</p>
Full article ">Figure 3
<p>Data classification system architecture for IoT data.</p>
Full article ">Figure 4
<p>The architecture for IoT data generation and collection proposed in the VARIoT project [<a href="#B21-sensors-24-07886" class="html-bibr">21</a>].</p>
Full article ">Figure 5
<p>Sample of initial data.</p>
Full article ">Figure 6
<p>Sample of preprocessed data.</p>
Full article ">Figure 7
<p>IoT Dataset distribution.</p>
Full article ">Figure 8
<p>Classification accuracy for different models.</p>
Full article ">Figure 9
<p>Time taken to build models.</p>
Full article ">Figure 10
<p>Classification accuracy for different k values.</p>
Full article ">Figure 11
<p>True negative rates for different k values.</p>
Full article ">Figure 12
<p>Time taken to build a model for different k values (seconds).</p>
Full article ">Figure 13
<p>Classification accuracy for different distance functions.</p>
Full article ">Figure 14
<p>True negative rates for different distance functions.</p>
Full article ">Figure 15
<p>Time taken to build models for different distance functions.</p>
Full article ">Figure 16
<p>Accuracy and TN rate for different cost matrices.</p>
Full article ">Figure 17
<p>Time taken to build models for different cost matrices.</p>
Full article ">
23 pages, 6025 KiB  
Article
Integrating Vision and Olfaction via Multi-Modal LLM for Robotic Odor Source Localization
by Sunzid Hassan, Lingxiao Wang and Khan Raqib Mahmud
Sensors 2024, 24(24), 7875; https://doi.org/10.3390/s24247875 - 10 Dec 2024
Viewed by 539
Abstract
Odor source localization (OSL) technology allows autonomous agents like mobile robots to localize a target odor source in an unknown environment. This is achieved by an OSL navigation algorithm that processes an agent’s sensor readings to calculate action commands to guide the robot to locate the odor source. Compared to traditional ‘olfaction-only’ OSL algorithms, our proposed OSL algorithm integrates vision and olfaction sensor modalities to localize odor sources even if olfaction sensing is disrupted by non-unidirectional airflow or vision sensing is impaired by environmental complexities. The algorithm leverages the zero-shot multi-modal reasoning capabilities of large language models (LLMs), negating the requirement for manual knowledge encoding or custom-trained supervised learning models. A key feature of the proposed algorithm is the ‘High-level Reasoning’ module, which encodes the olfaction and vision sensor data into a multi-modal prompt and instructs the LLM to employ a hierarchical reasoning process to select an appropriate high-level navigation behavior. Subsequently, the ‘Low-level Action’ module translates the selected high-level navigation behavior into low-level action commands that can be executed by the mobile robot. To validate our algorithm, we implemented it on a mobile robot in a real-world environment with non-unidirectional airflow and obstacles to mimic a complex, practical search environment. We compared the performance of our proposed algorithm to single-sensory-modality-based ‘olfaction-only’ and ‘vision-only’ navigation algorithms, and a supervised learning-based ‘vision and olfaction fusion’ (Fusion) navigation algorithm. The experimental results show that the proposed LLM-based algorithm outperformed the other algorithms in terms of success rates and average search times in both unidirectional and non-unidirectional airflow environments. Full article
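The core engineering step, encoding olfaction readings as text and pairing them with a system prompt that enumerates the high-level behaviors, can be sketched in a few lines. The behavior names, wording, and threshold below are illustrative assumptions; the paper's actual prompt is shown in its Figure 4.

```python
SYSTEM_PROMPT = """You are guiding a mobile robot searching for an odor source.
Reason hierarchically: first decide whether the plume is currently detected,
then pick exactly one behavior and reply with its name only:
- surge: move upwind along the detected plume
- casting: sweep crosswind to reacquire a lost plume
- approach: move toward a likely source visible in the image
- avoid: steer around an obstacle blocking the path"""

def olfactory_description(concentration, threshold, wind_speed, wind_dir_deg):
    """Encode the chemical sensor and anemometer readings as prompt text."""
    plume = "detected" if concentration >= threshold else "not detected"
    return (f"Olfaction: plume {plume} (reading {concentration:.2f}, "
            f"threshold {threshold:.2f}). Wind: {wind_speed:.1f} m/s "
            f"blowing from {wind_dir_deg:.0f} degrees.")

def build_prompt(concentration, threshold, wind_speed, wind_dir_deg):
    """Final text prompt; in the multi-modal call, the current camera
    frame would be attached alongside this text."""
    return SYSTEM_PROMPT + "\n\n" + olfactory_description(
        concentration, threshold, wind_speed, wind_dir_deg)
```

The LLM's one-word reply then selects the high-level behavior, which the 'Low-level Action' module translates into wheel commands.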
Show Figures

Figure 1

Figure 1
<p>Flow diagram of the OSL system. The robot platform is equipped with a camera for vision and a chemical detector and an anemometer for olfactory sensing. The proposed algorithm utilizes a multi-modal LLM for navigation decision making.</p>
Full article ">Figure 2
<p>The framework of the proposed multi-modal LLM-based navigation algorithm. The three main modules are the ‘Environment Sensing’ module, ‘High-level Reasoning’ module, and ‘Low-level Action’ module.</p>
Full article ">Figure 3
<p>Robot notation. Robot position <math display="inline"><semantics> <mrow> <mo>(</mo> <mi>x</mi> <mo>,</mo> <mi>y</mi> <mo>)</mo> </mrow> </semantics></math> and heading <math display="inline"><semantics> <mi>ψ</mi> </semantics></math> are monitored by the built-in localization system. Wind speed <span class="html-italic">u</span> and wind direction are measured from the additional anemometer in the body frame. Wind direction in the inertial frame <math display="inline"><semantics> <msub> <mi>ϕ</mi> <mrow> <mi>I</mi> <mi>n</mi> <mi>e</mi> <mi>r</mi> <mi>t</mi> <mi>i</mi> <mi>a</mi> <mi>l</mi> </mrow> </msub> </semantics></math> is derived from robot heading <math display="inline"><semantics> <mi>ψ</mi> </semantics></math> and wind direction in the body frame.</p>
Full article ">Figure 4
<p>Implementation of the prompt. The system prompt includes the task, actions, hints and output instructions. The final prompt (orange box) includes the system prompt (green box) and the olfactory description (blue box).</p>
Full article ">Figure 5
<p>Querying the LLM with image and prompt. The input of the model is the visual frame and the prompt. The output of the model is the high-level action selection.</p>
Full article ">Figure 6
<p>The flow diagram of the ‘High-level Reasoning’ module. It illustrates how the proposed LLM-based agent integrates visual and olfactory sensory observations to make high-level navigation behavior decisions.</p>
Full article ">Figure 7
<p>(<b>a</b>) Moth mate-seeking behaviors. This figure was retrieved from [<a href="#B73-sensors-24-07875" class="html-bibr">73</a>]. (<b>b</b>) Moth-inspired ‘surge’ and (<b>c</b>) ‘casting’ navigation behaviors.</p>
Full article ">Figure 8
<p>(<b>a</b>) Figure of the search area. The size of the search area is 8.2 m × 3.3 m. The odor source is a humidifier that generates ethanol plumes. An obstacle prevents vision of the plume initially and obstructs navigation. Two perpendicular electric fans are used to create unidirectional or non−unidirectional airflow. There are objects to test the visual reasoning capability of the LLM model. (<b>b</b>) Schematic diagram of the search area. We selected four different robot initial positions in the downwind area in the repeated tests.</p>
Full article ">Figure 9
<p>(<b>a</b>) The robot platform includes a camera for vision sensing and a chemical sensor and an anemometer for olfaction sensing. (<b>b</b>) The computation system consists of the robot platform and a remote PC. The dotted line represents a wireless link and the solid line represents a physical connection.</p>
Full article ">Figure 10
<p>Trajectory graph of a successful sample run with the proposed multi-modal LLM-based OSL algorithm in unidirectional airflow environment. The navigation behaviors are color-separated. The obstacle is indicated by an orange box, and the odor source is represented by a red point with the surrounding circular source declaration region.</p>
Full article ">Figure 11
<p>Examples of ‘environment sensing’ and ‘reasoning output’ by the GPT-4o model.</p>
Full article ">Figure 12
<p>Robot trajectories of repeated tests in unidirectional airflow environment: (<b>a</b>–<b>d</b>) ‘olfaction-only’ (OO); (<b>e</b>–<b>h</b>) ‘vision-only’ (VO); (<b>i</b>–<b>l</b>) ‘vision and olfaction fusion’ (Fusion); and (<b>m</b>–<b>p</b>) ‘LLM-based’ (LLM) navigation algorithms.</p>
Full article ">Figure 13
<p>Robot trajectories of repeated tests in non-unidirectional airflow environment: (<b>a</b>–<b>d</b>) ‘olfaction-only’ (OO); (<b>e</b>–<b>h</b>) ‘vision-only’ (VO); (<b>i</b>–<b>l</b>) ‘vision and olfaction fusion’ (Fusion); and (<b>m</b>–<b>p</b>) ‘LLM-based’ (LLM) navigation algorithms.</p>
Full article ">Figure 14
<p>Mean differences of success rates of the four navigation algorithms. The positive differences are statistically significant at family-wise error rate (FWER) of <math display="inline"><semantics> <mrow> <mn>5</mn> <mo>%</mo> </mrow> </semantics></math>.</p>
Full article ">
28 pages, 5225 KiB  
Article
MAARS: Multiagent Actor–Critic Approach for Resource Allocation and Network Slicing in Multiaccess Edge Computing
by Ducsun Lim and Inwhee Joe
Sensors 2024, 24(23), 7760; https://doi.org/10.3390/s24237760 - 4 Dec 2024
Viewed by 569
Abstract
This paper presents a novel algorithm to address resource allocation and network-slicing challenges in multiaccess edge computing (MEC) networks. Network slicing divides a physical network into virtual slices, each tailored to efficiently allocate resources and meet diverse service requirements. To maximize the completion rate of user-computing tasks within these slices, the problem is decomposed into two subproblems: efficient core-to-edge slicing (ECS) and autonomous resource slicing (ARS). ECS facilitates collaborative resource distribution through cooperation among edge servers, while ARS dynamically manages resources based on real-time network conditions. The proposed solution, a multiagent actor–critic resource scheduling (MAARS) algorithm, employs a reinforcement learning framework. Specifically, MAARS utilizes a multiagent deep deterministic policy gradient (MADDPG) for efficient resource distribution in ECS and a soft actor–critic (SAC) technique for robust real-time resource management in ARS. Simulation results demonstrate that MAARS outperforms benchmark algorithms, including heuristic-based, DQN-based, and A2C-based methods, in terms of task completion rates, resource utilization, and convergence speed. Thus, this study offers a scalable and efficient framework for resource optimization and network slicing in MEC networks, providing practical benefits for real-world deployments and setting a new performance benchmark in dynamic environments. Full article
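To make the slicing objective concrete, here is a static proportional-allocation baseline and the task-completion metric the figures report. This is a generic sketch of the kind of heuristic MAARS is benchmarked against, not the learned MADDPG/SAC policy itself; all names and numbers are illustrative.

```python
def allocate_slices(capacity, demands):
    """Split one resource (e.g., bandwidth or CPU cycles) across virtual
    slices in proportion to demand: a static baseline, not the RL policy."""
    total = sum(demands)
    return [capacity * d / total for d in demands]

def completion_ratio(allocations, requirements):
    """Fraction of slices whose allocation covers their requirement,
    mirroring the task-completion-rate metric in the figures."""
    met = sum(1 for a, r in zip(allocations, requirements) if a >= r)
    return met / len(requirements)
```

A learned scheduler improves on this by shifting resources toward slices whose requirements would otherwise go unmet, rather than allocating blindly by demand share.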
(This article belongs to the Special Issue Sensing and Mobile Edge Computing)
Show Figures

Figure 1

Figure 1
<p>System architecture.</p>
Full article ">Figure 2
<p>Illustration of the MADDPG-based ECS algorithm.</p>
Full article ">Figure 3
<p>Task-completion ratio vs. arrival rate. RAM: resource-allocation management.</p>
Full article ">Figure 4
<p>Task-completion ratio vs. CPU frequency.</p>
Full article ">Figure 5
<p>Task-completion ratio vs. bandwidth.</p>
Full article ">Figure 6
<p>Reward values vs. number of iterations.</p>
Full article ">Figure 7
<p>Task-completion ratio vs. arrival rate.</p>
Full article ">Figure 8
<p>Task-completion ratio vs. CPU frequency.</p>
Full article ">Figure 9
<p>Task-completion ratio vs. bandwidth.</p>
Full article ">Figure 10
<p>Utility-function values vs. weight vectors.</p>
Full article ">Figure 11
<p>Loss ratio vs. number of iterations.</p>
Full article ">
17 pages, 2270 KiB  
Article
Fast Parameter Estimation of Linear Frequency Modulation Signals in Marine Environments Based on Gradient Optimization Strategy
by Jiawei Wen, Zhe Ouyang, Donghu Nie and Cong Ren
J. Mar. Sci. Eng. 2024, 12(12), 2195; https://doi.org/10.3390/jmse12122195 - 1 Dec 2024
Viewed by 485
Abstract
Multi-buoy sonar systems achieve target localization by receiving broadband Linear Frequency Modulation signals emitted from the transmitter. Accurate estimations of the parameters of Linear Frequency Modulation signals significantly enhance the localization accuracy. Linear Frequency Modulation signals can be focused into the fractional domain through the Fractional Fourier Transform, but this increases the computational complexity. In marine environments, the low signal-to-noise ratio and multipath effects further degrade the parameter estimation accuracy. To address these issues, this paper proposes a fast estimation algorithm based on the Fractional Fourier Transform and a Gradient Subtraction-Average-Based Optimizer. This algorithm leverages the impulsive characteristics of Linear Frequency Modulation signals after the Fractional Fourier Transform, using the Fractional Fourier Transform as the fitness function. The Gradient Subtraction-Average-Based Optimizer algorithm includes three enhancement strategies: chaotic mapping initialization, a Golden Sine Algorithm, and an adaptive t-distribution variational operator. The simulation results demonstrate that the Gradient Subtraction-Average-Based Optimizer algorithm mitigates the issues of low diversity in the search agents, imbalanced global and local search capabilities, and susceptibility to local optima. A comparative analysis and statistical testing confirm that under low signal-to-noise ratio and multipath conditions, the Gradient Subtraction-Average-Based Optimizer algorithm not only ensures real-time parameter estimation but also improves the estimation accuracy. The results of the parameter estimation provide reliable data support for subsequent target localization. Full article
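The chaotic-mapping initialization strategy can be illustrated with a generic logistic-tent hybrid map. The map below is one common formulation; the paper's "improved Logistic-tent" variant (Figures 6 and 7) may differ in detail, and the seed and control parameter are assumptions.

```python
def logistic_tent(x, r=3.99):
    """A common logistic-tent hybrid chaotic map; the paper's improved
    variant may differ in detail."""
    if x < 0.5:
        y = r * x * (1.0 - x) + (4.0 - r) * x / 2.0
    else:
        y = r * x * (1.0 - x) + (4.0 - r) * (1.0 - x) / 2.0
    return y % 1.0

def init_agents(n_agents, lo, hi, seed=0.7):
    """Chaotic initialization: iterating the map spreads the initial
    search agents over [lo, hi) instead of clustering them, which is
    the diversity improvement the paper targets."""
    x, agents = seed, []
    for _ in range(n_agents):
        x = logistic_tent(x)
        agents.append(lo + (hi - lo) * x)
    return agents
```

Each agent position is a deterministic but non-repeating sample from the map's orbit, mapped affinely into the search bounds.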
(This article belongs to the Section Ocean Engineering)
Show Figures

Figure 1

Figure 1
<p>Schematic diagram of time-frequency plane rotation.</p>
Full article ">Figure 2
<p>Schematic of the FRFT time-frequency surface of a multipath LFM signal.</p>
Full article ">Figure 3
<p>Schematic of the “<math display="inline"><semantics> <mrow> <mo>⊝</mo> <mo>−</mo> </mrow> </semantics></math>subtraction” detection and development phase.</p>
Full article ">Figure 4
<p>GSABO pretreatment Flowchart.</p>
Full article ">Figure 5
<p>Flowchart of the optimization phase of the GSABO algorithm.</p>
Full article ">Figure 6
<p>Logistic chaotic sequence distribution with Lyapunov exponential map.</p>
Full article ">Figure 7
<p>Improved Logistic-tent chaotic sequence distribution with Lyapunov exponential map.</p>
Full article ">Figure 8
<p>FRFT processes LFM signals.</p>
Full article ">Figure 9
<p>Average fitness convergence curves of different algorithms.</p>
Full article ">Figure 10
<p>Average fitness convergence curves of different algorithms at various SNRs.</p>
Full article ">Figure 11
<p>Average relative error curves for different algorithms.</p>
Full article ">
17 pages, 2075 KiB  
Article
Extending Conflict-Based Search for Optimal and Efficient Quadrotor Swarm Motion Planning
by Zihao Wang, Zhiwei Zhang, Wenying Dou, Guangpeng Hu, Lifu Zhang and Meng Zhang
Drones 2024, 8(12), 719; https://doi.org/10.3390/drones8120719 - 29 Nov 2024
Viewed by 382
Abstract
Multi-agent pathfinding has been extensively studied by the robotics and artificial intelligence communities. The classical algorithm, conflict-based search (CBS), is widely used in various real-world applications due to its ability to compute large-scale conflict-free paths. However, classical CBS assumes discrete time–space planning and overlooks physical constraints in actual scenarios, making it unsuitable for direct application to unmanned aerial vehicle (UAV) swarms. Inspired by the decentralized planning and centralized conflict resolution ideas of CBS, we propose, for the first time, an optimal and efficient UAV swarm motion planner that integrates state lattice with CBS without any of these underlying assumptions, named SL-CBS. SL-CBS is a two-layer search algorithm: (1) The low-level search utilizes an improved state lattice. We design emergency stop motion primitives to ensure complete UAV dynamics and handle spatio-temporal constraints from high-level conflicts. (2) The high-level algorithm defines comprehensive conflict types and proposes a motion primitive conflict detection method with linear time complexity based on Sturm’s theory. Additionally, our modified independence detection (ID) technique is applied to enable parallel conflict processing. We validate the planning capabilities of SL-CBS in classical scenarios and compare these with the latest state-of-the-art (SOTA) algorithms, showing great improvements in success rate, computation time, and flight time. Finally, we conduct large-scale tests to analyze the performance boundaries of SL-CBS+ID. Full article
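The Sturm-based conflict check rests on a simple reduction: two polynomial trajectories conflict on a motion primitive if the squared-distance polynomial minus the safety radius squared has a real root in the primitive's time interval, and Sturm's theorem counts those roots exactly. Below is a minimal, self-contained sketch of Sturm root counting (illustrative only, not the paper's implementation).

```python
EPS = 1e-9

def strip(p):
    """Drop near-zero leading coefficients (coeffs highest-degree first)."""
    i = 0
    while i < len(p) and abs(p[i]) < EPS:
        i += 1
    return p[i:]

def poly_rem(a, b):
    """Remainder of polynomial long division a / b."""
    a = a[:]
    while len(a) >= len(b):
        f = a[0] / b[0]
        for i in range(len(b)):
            a[i] -= f * b[i]
        a.pop(0)  # leading coefficient is now ~0
    return strip(a)

def deriv(p):
    n = len(p) - 1
    return [c * (n - i) for i, c in enumerate(p[:-1])]

def evalp(p, x):
    v = 0.0
    for c in p:  # Horner's scheme
        v = v * x + c
    return v

def sturm_chain(p):
    chain = [strip(p), strip(deriv(p))]
    while len(chain[-1]) > 1:
        r = poly_rem(chain[-2], chain[-1])
        if not r:
            break
        chain.append([-c for c in r])  # negated remainders form the chain
    return chain

def sign_changes(chain, x):
    vals = [v for v in (evalp(q, x) for q in chain) if abs(v) > EPS]
    return sum(1 for u, v in zip(vals, vals[1:]) if u * v < 0)

def count_real_roots(p, t0, t1):
    """Number of distinct real roots of p in (t0, t1], by Sturm's theorem."""
    chain = sturm_chain(p)
    return sign_changes(chain, t0) - sign_changes(chain, t1)
```

For example, two head-on 1D trajectories x_A(t) = t and x_B(t) = 2 - t with safety radius 0.5 give (x_A - x_B)^2 - 0.25 = 4t^2 - 8t + 3.75, whose roots t = 0.75 and t = 1.25 both fall inside [0, 2], so a conflict is reported.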
(This article belongs to the Section Drone Design and Development)
Show Figures

Figure 1

Figure 1
<p>Planning process of the SL-CBS algorithm. The black border represents the map boundary, the light green rectangles denote static obstacles, the blue hollow circles indicate the initial positions of the UAVs, and the cyan hollow circles represent the UAVs’ target regions. The magenta curves depict the planned UAV flight trajectories, with the solid red circles on the curves indicating the UAV states. (<b>a</b>) initial state; (<b>b</b>) the first iteration; and (<b>c</b>) the second iteration.</p>
Full article ">Figure 2
<p>Two types of conflicts. In (<b>a</b>), a state conflict occurs at the extended state at time <math display="inline"><semantics> <mrow> <mi>t</mi> <mo>+</mo> <mi>τ</mi> </mrow> </semantics></math>, while in (<b>b</b>), a motion primitive conflict occurs at some location within <math display="inline"><semantics> <mrow> <mo>(</mo> <mi>t</mi> <mo>,</mo> <mi>t</mi> <mo>+</mo> <mi>τ</mi> <mo>)</mo> </mrow> </semantics></math>. Here, <math display="inline"><semantics> <mi>τ</mi> </semantics></math> is time interval.</p>
Full article ">Figure 3
<p>Emergency stop motion primitive model.</p>
Full article ">Figure 4
<p>UAV swarm conflict graph. The magenta double-headed arrows represent the existence of conflicts between the trajectories of the UAVs.</p>
Full article ">Figure 5
<p>Flight trajectories of UAV swarm in classic scenario test instances. The black border represents the map boundary, while the light green areas indicate static obstacles. The magenta curves depict the flight trajectories of the UAVs. In (<b>a</b>), the green and orange curves also represent UAV flight trajectories. In (<b>b</b>–<b>d</b>), the solid red circles indicate the flight states of the UAV at intermediate times.</p>
Full article ">Figure 6
<p>Flight trajectories planned by K-CBS, db-CBS, and SL-CBS for swarm size <math display="inline"><semantics> <mrow> <mi>K</mi> <mo>=</mo> <mn>8</mn> </mrow> </semantics></math>. The black border represents the map boundaries, while the 20 gray circles indicate obstacle regions. The solid circles in various colors depict the current states of the UAV swarm, and the curves in different colors represent the trajectories already flown by the UAV swarm. (<b>a</b>) K-CBS; (<b>b</b>) db-CBS; and (<b>c</b>) SL-CBS.</p>
Full article ">Figure 7
<p>Flight trajectories planned by SL-CBS for a swarm size of <math display="inline"><semantics> <mrow> <mi>K</mi> <mo>=</mo> <mn>20</mn> </mrow> </semantics></math>. The black border represents the map boundary, while the 25 gray circles indicate obstacle regions. The solid circles in various colors represent the current states of the UAV swarm, with black arrows on the circles indicating the yaw direction of each UAV. The curves in different colors illustrate the trajectories already traveled by the UAV swarm. (<b>a</b>) <math display="inline"><semantics> <mrow> <mi>K</mi> <mo>=</mo> <mn>20</mn> </mrow> </semantics></math> (without yaw); and (<b>b</b>) <math display="inline"><semantics> <mrow> <mi>K</mi> <mo>=</mo> <mn>20</mn> </mrow> </semantics></math> (with yaw).</p>
Full article ">Figure 8
<p>Evaluation metrics results of SL-CBS under different swarm sizes: (<b>a</b>) success rate; (<b>b</b>) computation time; (<b>c</b>) total flight time; and (<b>d</b>) makespan.</p>
Full article ">
20 pages, 1304 KiB  
Article
Robust Reinforcement Learning Strategies with Evolving Curriculum for Efficient Bus Operations in Smart Cities
by Yuhan Tang, Ao Qu, Xuan Jiang, Baichuan Mo, Shangqing Cao, Joseph Rodriguez, Haris N Koutsopoulos, Cathy Wu and Jinhua Zhao
Smart Cities 2024, 7(6), 3658-3677; https://doi.org/10.3390/smartcities7060141 - 29 Nov 2024
Viewed by 711
Abstract
Public transit systems are critical to the quality of urban life, and enhancing their efficiency is essential for building cost-effective and sustainable smart cities. Historically, researchers sought reinforcement learning (RL) applications to mitigate bus bunching issues with holding strategies. Nonetheless, these attempts often led to oversimplifications and misalignment with the goal of reducing the total time passengers spend in the system, resulting in less robust or non-optimal solutions. In this study, we introduce a novel setting where each bus, supervised by an RL agent, can form aggregated policies from three strategies (holding, skipping a station, and turning around to serve the opposite direction). Because these strategies are difficult to learn all at once, we employ domain knowledge and develop a gradually expanding action-space curriculum, enabling agents to learn the strategies incrementally. We incorporate Long Short-Term Memory (LSTM) in our model to capture the temporal interrelation among these actions. To address the inherent uncertainties of real-world traffic systems, we impose Domain Randomization (DR) on variables such as passenger demand and bus schedules. We conduct extensive numerical experiments integrating synthetic and real-world data to evaluate our model. Our methodology proves effective, enhancing bus schedule reliability and reducing total passenger waiting time by over 15%, thereby improving bus operation efficiency and smoothing bus operations in line with sustainability goals. This work highlights the potential of robust RL combined with curriculum learning for optimizing public transport in smart cities, offering a scalable solution for real-world multi-agent systems. Full article
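Domain Randomization as described here amounts to resampling environment parameters at the start of every training episode, so the policy cannot overfit one fixed demand profile. A minimal sketch follows; the parameter names and ranges are illustrative assumptions, not the paper's values.

```python
import random

def randomized_episode_params(rng):
    """Sample a fresh environment configuration for one training episode,
    forcing the policy to stay robust across a family of settings."""
    return {
        "arrival_rate": rng.uniform(0.5, 2.0),      # passengers/min per stop
        "dispatch_jitter_s": rng.gauss(0.0, 30.0),  # schedule perturbation
        "travel_time_scale": rng.uniform(0.9, 1.2), # congestion multiplier
    }

rng = random.Random(42)
episodes = [randomized_episode_params(rng) for _ in range(100)]
```

Each episode then runs the bus-line simulation with its own sampled configuration, and the LSTM policy is trained across all of them.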
(This article belongs to the Special Issue Cost-Effective Transportation Planning for Smart Cities)
Show Figures

Figure 1

Figure 1
<p>Delay Passing Down Caused by Holding.</p>
Full article ">Figure 2
<p>Representation of the Bus Line simulated in SimPy.</p>
Full article ">Figure 3
<p>The Training Environment.</p>
Full article ">Figure 4
<p>The Actor-Critic Framework for PPO Algorithm.</p>
Full article ">Figure 5
<p>The reward plots of different scenarios with three strategies implemented.</p>
Full article ">Figure 6
<p>Bus Arrival Time Before and After Training.</p>
Full article ">Figure 7
<p>The evaluation of DR.</p>
Full article ">
Back to TopTop