Tactical Decision-Making for Autonomous Driving Using Dueling Double Deep Q Network With Double Attention
Decision-making is still a significant challenge to realize fully autonomous driving. Using deep reinforcement learning (DRL) to solve autonomous driving decision-making problems is a recent trend. A common method is to encode surrounding vehicles in a grid to describe the state space to help DRL ne...
Guardado en:
Autores principales: | , , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
IEEE
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/f187f625b58e4ec29438dc584887e503 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:f187f625b58e4ec29438dc584887e503 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:f187f625b58e4ec29438dc584887e5032021-11-18T00:00:55ZTactical Decision-Making for Autonomous Driving Using Dueling Double Deep Q Network With Double Attention2169-353610.1109/ACCESS.2021.3127105https://doaj.org/article/f187f625b58e4ec29438dc584887e5032021-01-01T00:00:00Zhttps://ieeexplore.ieee.org/document/9609990/https://doaj.org/toc/2169-3536Decision-making is still a significant challenge to realize fully autonomous driving. Using deep reinforcement learning (DRL) to solve autonomous driving decision-making problems is a recent trend. A common method is to encode surrounding vehicles in a grid to describe the state space to help DRL network extract the scene features. However, in the process of human driving, surrounding vehicles at different positions have different contributions to decision-making. Meanwhile, the network is difficult to fully extract useful features in a sparse state, which can also lead to catastrophic actions. Therefore, this work proposes a spatial attention module to calculate different weights to represent different contributions to decision-making results. And a channel attention module is proposed to fully extract useful features in sparse state features. These two attention modules are integrated into dueling double deep Q network, named D3QN-DA, as a high-level decision-maker of a hierarchical hierarchical control structure based decision-making system. To improve agent performance, an emergency safe checker is introduced in this system. And the agent is trained and test with a designed reward function from safety and efficiency in simulation. The experimental results show that the proposed method increases the safety rate by 54%, and the average explore distance by 30%, which is safer and more intelligent for the decision-making process of automatic driving.Shuwei ZhangYutian WuHarutoshi OgaiHiroshi InujimaShigeyuki TatenoIEEEarticleAutonomous drivingdecision-makingdueling double deep Q networkspatial attentionchannel attentionElectrical engineering. Electronics. Nuclear engineeringTK1-9971ENIEEE Access, Vol 9, Pp 151983-151992 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Autonomous driving decision-making dueling double deep Q network spatial attention channel attention Electrical engineering. Electronics. Nuclear engineering TK1-9971 |
spellingShingle |
Autonomous driving decision-making dueling double deep Q network spatial attention channel attention Electrical engineering. Electronics. Nuclear engineering TK1-9971 Shuwei Zhang Yutian Wu Harutoshi Ogai Hiroshi Inujima Shigeyuki Tateno Tactical Decision-Making for Autonomous Driving Using Dueling Double Deep Q Network With Double Attention |
description |
Decision-making is still a significant challenge to realize fully autonomous driving. Using deep reinforcement learning (DRL) to solve autonomous driving decision-making problems is a recent trend. A common method is to encode surrounding vehicles in a grid to describe the state space to help DRL network extract the scene features. However, in the process of human driving, surrounding vehicles at different positions have different contributions to decision-making. Meanwhile, the network is difficult to fully extract useful features in a sparse state, which can also lead to catastrophic actions. Therefore, this work proposes a spatial attention module to calculate different weights to represent different contributions to decision-making results. And a channel attention module is proposed to fully extract useful features in sparse state features. These two attention modules are integrated into dueling double deep Q network, named D3QN-DA, as a high-level decision-maker of a hierarchical hierarchical control structure based decision-making system. To improve agent performance, an emergency safe checker is introduced in this system. And the agent is trained and test with a designed reward function from safety and efficiency in simulation. The experimental results show that the proposed method increases the safety rate by 54%, and the average explore distance by 30%, which is safer and more intelligent for the decision-making process of automatic driving. |
format |
article |
author |
Shuwei Zhang Yutian Wu Harutoshi Ogai Hiroshi Inujima Shigeyuki Tateno |
author_facet |
Shuwei Zhang Yutian Wu Harutoshi Ogai Hiroshi Inujima Shigeyuki Tateno |
author_sort |
Shuwei Zhang |
title |
Tactical Decision-Making for Autonomous Driving Using Dueling Double Deep Q Network With Double Attention |
title_short |
Tactical Decision-Making for Autonomous Driving Using Dueling Double Deep Q Network With Double Attention |
title_full |
Tactical Decision-Making for Autonomous Driving Using Dueling Double Deep Q Network With Double Attention |
title_fullStr |
Tactical Decision-Making for Autonomous Driving Using Dueling Double Deep Q Network With Double Attention |
title_full_unstemmed |
Tactical Decision-Making for Autonomous Driving Using Dueling Double Deep Q Network With Double Attention |
title_sort |
tactical decision-making for autonomous driving using dueling double deep q network with double attention |
publisher |
IEEE |
publishDate |
2021 |
url |
https://doaj.org/article/f187f625b58e4ec29438dc584887e503 |
work_keys_str_mv |
AT shuweizhang tacticaldecisionmakingforautonomousdrivingusingduelingdoubledeepqnetworkwithdoubleattention AT yutianwu tacticaldecisionmakingforautonomousdrivingusingduelingdoubledeepqnetworkwithdoubleattention AT harutoshiogai tacticaldecisionmakingforautonomousdrivingusingduelingdoubledeepqnetworkwithdoubleattention AT hiroshiinujima tacticaldecisionmakingforautonomousdrivingusingduelingdoubledeepqnetworkwithdoubleattention AT shigeyukitateno tacticaldecisionmakingforautonomousdrivingusingduelingdoubledeepqnetworkwithdoubleattention |
_version_ |
1718425260251938816 |