Graph-Based Prediction and Planning Policy Network (GP3Net) for Scalable Self-Driving in Dynamic Environments Using Deep Reinforcement Learning

Chowdhury, J and Shivaraman, V and Sundaram, S and Sujit, PB (2024) Graph-Based Prediction and Planning Policy Network (GP3Net) for Scalable Self-Driving in Dynamic Environments Using Deep Reinforcement Learning. In: 38th AAAI Conference on Artificial Intelligence, AAAI 2024, 20 February 2024through 27 February 2024, Vancouver, pp. 11606-11614.

Preview

PDF
pro_aaa_con_art_int_38_10_2024.pdf - Published Version
Download (4MB) | Preview

Official URL: https://doi.org/10.1609/aaai.v38i10.29043

Abstract

Recent advancements in motion planning for Autonomous Vehicles (AVs) show great promise in using expert driver behaviors in non-stationary driving environments. However, learning only through expert drivers needs more generalizability to recover from domain shifts and near-failure scenarios due to the dynamic behavior of traffic participants and weather conditions. A deep Graph-based Prediction and Planning Policy Network (GP3Net) framework is proposed for non-stationary environments that encodes the interactions between traffic participants with contextual information and provides a decision for safe maneuver for AV. A spatiotemporal graph models the interactions between traffic participants for predicting the future trajectories of those participants. The predicted trajectories are utilized to generate a future occupancy map around the AV with uncertainties embedded to anticipate the evolving non-stationary driving environments. Then the contextual information and future occupancy maps are input to the policy network of the GP3Net framework and trained using Proximal Policy Optimization (PPO) algorithm. The proposed GP3Net performance is evaluated on standard CARLA benchmarking scenarios with domain shifts of traffic patterns (urban, highway, and mixed). The results show that the GP3Net outperforms previous state-of-the-art imitation learning-based planning models for different towns. Further, in unseen new weather conditions, GP3Net completes the desired route with fewer traffic infractions. Finally, the results emphasize the advantage of including the prediction module to enhance safety measures in non-stationary environments. Copyright Â© 2024, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

Item Type:	Conference Paper
Publication:	Proceedings of the AAAI Conference on Artificial Intelligence
Publisher:	Association for the Advancement of Artificial Intelligence
Additional Information:	The copyright for this article belongs to authors.
Keywords:	Automobile drivers; Autonomous vehicles; Behavioral research; Deep learning; Forecasting; Graphic methods; Meteorology; Motion planning; Reinforcement learning, Autonomous Vehicles; Condition; Contextual information; Driving environment; Graph-based; Non-stationary environment; Nonstationary; Occupancy maps; Planning policies; Policy networks, Benchmarking
Department/Centre:	Division of Interdisciplinary Sciences > Robert Bosch Centre for Cyber Physical Systems Division of Mechanical Sciences > Aerospace Engineering(Formerly Aeronautical Engineering)
Date Deposited:	11 Jul 2024 08:10
Last Modified:	11 Jul 2024 08:10
URI:	http://eprints.iisc.ac.in/id/eprint/84812

Actions (login required)

View Item


	Powered by EPrints		A service from The J.R.D. Tata Memorial Library Indian Institute of Science, Bengaluru-560012, India