Deep reinforcement learning based driving strategy for avoidance of chain collisions and its safety efficiency analysis in autonomous vehicles

Vehicle control in autonomous traffic flow is often handled using the best decision-making reinforcement learning methods. However, unexpected critical situations make the collisions more severe and, consequently, the chain collisions. In this work, we first review the leading causes of chain collis...

Full description

Bibliographic Details
Main Authors: Abu Jafar, Md Muzahid, Syafiq Fauzi, Kamarulzaman, Rahman, Md. Arafatur, Alenezi, Ali H.
Format: Article
Language:English
Published: IEEE 2022
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/33891/
http://umpir.ump.edu.my/id/eprint/33891/1/Deep%20reinforcement%20learning%20based%20driving%20strategy%20for%20avoidance_full.pdf
_version_ 1848824372207288320
author Abu Jafar, Md Muzahid
Syafiq Fauzi, Kamarulzaman
Rahman, Md. Arafatur
Alenezi, Ali H.
author_facet Abu Jafar, Md Muzahid
Syafiq Fauzi, Kamarulzaman
Rahman, Md. Arafatur
Alenezi, Ali H.
author_sort Abu Jafar, Md Muzahid
building UMP Institutional Repository
collection Online Access
description Vehicle control in autonomous traffic flow is often handled using the best decision-making reinforcement learning methods. However, unexpected critical situations make the collisions more severe and, consequently, the chain collisions. In this work, we first review the leading causes of chain collisions and their subsequent chain events, which might provide an indication of how to prevent and mitigate the crash severity of chain collisions. Then, we consider the problem of chain collision avoidance as a Markov Decision Process problem in order to propose a reinforcement learning-based decision-making strategy and analyse the safety efficiency of existing methods in driving security. To address this, A reward function is being developed to deal with the challenge of multiple vehicle collision avoidance. A perception network structure based on formation and on actor-critic methodologies is employed to enhance the decision-making process. Finally, in the safety efficiency analysis phase, we investigated the safety efficiency performance of the agent vehicle in both single-agent and multi-agent autonomous driving environments. Three state-of-the-art contemporary actor-critic algorithms are used to create an extensive simulation in Unity3D. Moreover, to demonstrate the accuracy of the safety efficiency analysis, multiple training runs of the neural networks in respect of training performance, speed of training, success rate, and stability of rewards with a trade-off between exploitation and exploration during training are presented. Two aspects (single-agent and multi-agent) have assessed the efficiency of algorithms. Every aspect has been analyzed regarding the traffic flows: (1) the controlling efficiency of unexpected traffic situations by the sudden slowdown, (2) abrupt lane change, and (3) smoothly reaching the destination. All the findings of the analysis are intended to shed insight on the benefits of a greater, more reliable autonomous traffic set-up for academics and policymakers, and also to pave the way for the actual carry-out of a driver-less traffic world.
first_indexed 2025-11-15T03:11:59Z
format Article
id ump-33891
institution Universiti Malaysia Pahang
institution_category Local University
language English
last_indexed 2025-11-15T03:11:59Z
publishDate 2022
publisher IEEE
recordtype eprints
repository_type Digital Repository
spelling ump-338912022-04-26T08:03:17Z http://umpir.ump.edu.my/id/eprint/33891/ Deep reinforcement learning based driving strategy for avoidance of chain collisions and its safety efficiency analysis in autonomous vehicles Abu Jafar, Md Muzahid Syafiq Fauzi, Kamarulzaman Rahman, Md. Arafatur Alenezi, Ali H. QA76 Computer software T Technology (General) Vehicle control in autonomous traffic flow is often handled using the best decision-making reinforcement learning methods. However, unexpected critical situations make the collisions more severe and, consequently, the chain collisions. In this work, we first review the leading causes of chain collisions and their subsequent chain events, which might provide an indication of how to prevent and mitigate the crash severity of chain collisions. Then, we consider the problem of chain collision avoidance as a Markov Decision Process problem in order to propose a reinforcement learning-based decision-making strategy and analyse the safety efficiency of existing methods in driving security. To address this, A reward function is being developed to deal with the challenge of multiple vehicle collision avoidance. A perception network structure based on formation and on actor-critic methodologies is employed to enhance the decision-making process. Finally, in the safety efficiency analysis phase, we investigated the safety efficiency performance of the agent vehicle in both single-agent and multi-agent autonomous driving environments. Three state-of-the-art contemporary actor-critic algorithms are used to create an extensive simulation in Unity3D. Moreover, to demonstrate the accuracy of the safety efficiency analysis, multiple training runs of the neural networks in respect of training performance, speed of training, success rate, and stability of rewards with a trade-off between exploitation and exploration during training are presented. Two aspects (single-agent and multi-agent) have assessed the efficiency of algorithms. Every aspect has been analyzed regarding the traffic flows: (1) the controlling efficiency of unexpected traffic situations by the sudden slowdown, (2) abrupt lane change, and (3) smoothly reaching the destination. All the findings of the analysis are intended to shed insight on the benefits of a greater, more reliable autonomous traffic set-up for academics and policymakers, and also to pave the way for the actual carry-out of a driver-less traffic world. IEEE 2022 Article PeerReviewed pdf en cc_by_4 http://umpir.ump.edu.my/id/eprint/33891/1/Deep%20reinforcement%20learning%20based%20driving%20strategy%20for%20avoidance_full.pdf Abu Jafar, Md Muzahid and Syafiq Fauzi, Kamarulzaman and Rahman, Md. Arafatur and Alenezi, Ali H. (2022) Deep reinforcement learning based driving strategy for avoidance of chain collisions and its safety efficiency analysis in autonomous vehicles. IEEE Access, 4. pp. 1-18. ISSN 2169-3536. (Published) https://doi.org/10.1109/ACCESS.2022.3167812 https://doi.org/10.1109/ACCESS.2022.3167812
spellingShingle QA76 Computer software
T Technology (General)
Abu Jafar, Md Muzahid
Syafiq Fauzi, Kamarulzaman
Rahman, Md. Arafatur
Alenezi, Ali H.
Deep reinforcement learning based driving strategy for avoidance of chain collisions and its safety efficiency analysis in autonomous vehicles
title Deep reinforcement learning based driving strategy for avoidance of chain collisions and its safety efficiency analysis in autonomous vehicles
title_full Deep reinforcement learning based driving strategy for avoidance of chain collisions and its safety efficiency analysis in autonomous vehicles
title_fullStr Deep reinforcement learning based driving strategy for avoidance of chain collisions and its safety efficiency analysis in autonomous vehicles
title_full_unstemmed Deep reinforcement learning based driving strategy for avoidance of chain collisions and its safety efficiency analysis in autonomous vehicles
title_short Deep reinforcement learning based driving strategy for avoidance of chain collisions and its safety efficiency analysis in autonomous vehicles
title_sort deep reinforcement learning based driving strategy for avoidance of chain collisions and its safety efficiency analysis in autonomous vehicles
topic QA76 Computer software
T Technology (General)
url http://umpir.ump.edu.my/id/eprint/33891/
http://umpir.ump.edu.my/id/eprint/33891/
http://umpir.ump.edu.my/id/eprint/33891/
http://umpir.ump.edu.my/id/eprint/33891/1/Deep%20reinforcement%20learning%20based%20driving%20strategy%20for%20avoidance_full.pdf