Browse Publications Technical Papers 2024-01-2560
2024-04-09

Coordinated Longitudinal and Lateral Motions Control of Automated Vehicles Based on Multi-Agent Deep Reinforcement Learning for On-Ramp Merging 2024-01-2560

The on-ramp merging driving scenario is challenging for achieving the highest-level autonomous driving. Current research using reinforcement learning methods to address the on-ramp merging problem of automated vehicles (AVs) is mainly designed for a single AV, treating other vehicles as part of the environment. This paper proposes a control framework for cooperative on-ramp merging of multiple AVs based on multi-agent deep reinforcement learning (MADRL). This framework facilitates AVs on the ramp and adjacent mainline to learn a coordinate control policy for their longitudinal and lateral motions based on the environment observations. Unlike the hierarchical architecture, this paper integrates decision and control into a unified optimal control problem to solve an on-ramp merging strategy through MADRL. Firstly, a partially observable Markov game (POMG) is formulated to characterize the on-ramp merging control problem, where the observation space of each AV (agent) is defined as its states and the relative state between it and other AVs, and the joint action spaces are the longitudinal acceleration and front wheel steering angle of AVs. Then, with safety and traffic efficiency as the objective, the reward function of each AV is designed. Furthermore, the joint action for multi-agent is obtained by solving the POMG problem utilizing the multi-agent deep deterministic policy gradient (MADDPG) method. Finally, a rule-based action guidance strategy is presented to supervise further the joint action for enhancing the safety of AVs. Numerical experiments are performed under different conditions to verify the effectiveness of the proposed merging control framework for a multi-agent system. The proposed scheme is also compared with the method for a single agent, taking the deep deterministic policy gradient (DDPG) method as a benchmark. The results demonstrate superior performance of the proposed method than the DDPG method in terms of safety and traffic efficiency.

SAE MOBILUS

Subscribers can view annotate, and download all of SAE's content. Learn More »

Access SAE MOBILUS »

Members save up to 16% off list price.
Login to see discount.
Special Offer: Download multiple Technical Papers each year? TechSelect is a cost-effective subscription option to select and download 12-100 full-text Technical Papers per year. Find more information here.
X