Coordinated Longitudinal and Lateral Motions Control of Automated Vehicles Based on Multi-Agent Deep Reinforcement Learning for On-Ramp Merging 2024-01-2560
The on-ramp merging driving scenario is challenging for achieving the highest-level autonomous driving. Current research using reinforcement learning methods to address the on-ramp merging problem of automated vehicles (AVs) is mainly designed for a single AV, treating other vehicles as part of the environment. This paper proposes a control framework for cooperative on-ramp merging of multiple AVs based on multi-agent deep reinforcement learning (MADRL). This framework facilitates AVs on the ramp and adjacent mainline to learn a coordinate control policy for their longitudinal and lateral motions based on the environment observations. Unlike the hierarchical architecture, this paper integrates decision and control into a unified optimal control problem to solve an on-ramp merging strategy through MADRL. Firstly, a partially observable Markov game (POMG) is formulated to characterize the on-ramp merging control problem, where the observation space of each AV (agent) is defined as its states and the relative state between it and other AVs, and the joint action spaces are the longitudinal acceleration and front wheel steering angle of AVs. Then, with safety and traffic efficiency as the objective, the reward function of each AV is designed. Furthermore, the joint action for multi-agent is obtained by solving the POMG problem utilizing the multi-agent deep deterministic policy gradient (MADDPG) method. Finally, a rule-based action guidance strategy is presented to supervise further the joint action for enhancing the safety of AVs. Numerical experiments are performed under different conditions to verify the effectiveness of the proposed merging control framework for a multi-agent system. The proposed scheme is also compared with the method for a single agent, taking the deep deterministic policy gradient (DDPG) method as a benchmark. The results demonstrate superior performance of the proposed method than the DDPG method in terms of safety and traffic efficiency.
Citation: Li, W., Zhao, Z., Liang, K., and Zhao, K., "Coordinated Longitudinal and Lateral Motions Control of Automated Vehicles Based on Multi-Agent Deep Reinforcement Learning for On-Ramp Merging," SAE Technical Paper 2024-01-2560, 2024, https://doi.org/10.4271/2024-01-2560. Download Citation
Author(s):
Wenchang Li, Zhiguo Zhao, Kaichong Liang, Kun Zhao
Affiliated:
Tongji University
Pages: 11
Event:
WCX SAE World Congress Experience
ISSN:
0148-7191
e-ISSN:
2688-3627
Related Topics:
Machine learning
Autonomous vehicles
Automated vehicles
Optimization
Architecture
Vehicle acceleration
Research and development
SAE MOBILUS
Subscribers can view annotate, and download all of SAE's content.
Learn More »