A deep learning method for video‐based action recognition
Abstract In this paper, a deep learning method for video‐based action recognition is proposed. On the one hand, boundary compensation on the basis of a deep neural network is performed to achieve action proposal. Boundary compensation considering non‐maximum suppression according to sliding window p...
Guardado en:
Autores principales: | , , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Wiley
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/da8a99b21cce4c59b135963dcc5b04de |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:da8a99b21cce4c59b135963dcc5b04de |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:da8a99b21cce4c59b135963dcc5b04de2021-11-29T03:38:16ZA deep learning method for video‐based action recognition1751-96671751-965910.1049/ipr2.12303https://doaj.org/article/da8a99b21cce4c59b135963dcc5b04de2021-12-01T00:00:00Zhttps://doi.org/10.1049/ipr2.12303https://doaj.org/toc/1751-9659https://doaj.org/toc/1751-9667Abstract In this paper, a deep learning method for video‐based action recognition is proposed. On the one hand, boundary compensation on the basis of a deep neural network is performed to achieve action proposal. Boundary compensation considering non‐maximum suppression according to sliding window priority is applied to remove redundant windows. To accurately detect boundaries, a boundary compensation network is established with multiple networks to process different numbers of segments. On the other hand, action recognition based on the resultant action proposals is performed. To further utilise boundary compensation, three methods are introduced for key frame selection. Optical flow and RGB features are combined via a channel fusion to realise feature representation. A two‐stream network with a spatiotemporal structure is adopted for action recognition. The proposed method is evaluated on three public datasets. The experimental results demonstrate that the proposed method achieves a superior performance to that of state‐of‐the‐art methods.Guanwen ZhangYukun RaoChanghao WangWei ZhouXiangyang JiWileyarticlePhotographyTR1-1050Computer softwareQA76.75-76.765ENIET Image Processing, Vol 15, Iss 14, Pp 3498-3511 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Photography TR1-1050 Computer software QA76.75-76.765 |
spellingShingle |
Photography TR1-1050 Computer software QA76.75-76.765 Guanwen Zhang Yukun Rao Changhao Wang Wei Zhou Xiangyang Ji A deep learning method for video‐based action recognition |
description |
Abstract In this paper, a deep learning method for video‐based action recognition is proposed. On the one hand, boundary compensation on the basis of a deep neural network is performed to achieve action proposal. Boundary compensation considering non‐maximum suppression according to sliding window priority is applied to remove redundant windows. To accurately detect boundaries, a boundary compensation network is established with multiple networks to process different numbers of segments. On the other hand, action recognition based on the resultant action proposals is performed. To further utilise boundary compensation, three methods are introduced for key frame selection. Optical flow and RGB features are combined via a channel fusion to realise feature representation. A two‐stream network with a spatiotemporal structure is adopted for action recognition. The proposed method is evaluated on three public datasets. The experimental results demonstrate that the proposed method achieves a superior performance to that of state‐of‐the‐art methods. |
format |
article |
author |
Guanwen Zhang Yukun Rao Changhao Wang Wei Zhou Xiangyang Ji |
author_facet |
Guanwen Zhang Yukun Rao Changhao Wang Wei Zhou Xiangyang Ji |
author_sort |
Guanwen Zhang |
title |
A deep learning method for video‐based action recognition |
title_short |
A deep learning method for video‐based action recognition |
title_full |
A deep learning method for video‐based action recognition |
title_fullStr |
A deep learning method for video‐based action recognition |
title_full_unstemmed |
A deep learning method for video‐based action recognition |
title_sort |
deep learning method for video‐based action recognition |
publisher |
Wiley |
publishDate |
2021 |
url |
https://doaj.org/article/da8a99b21cce4c59b135963dcc5b04de |
work_keys_str_mv |
AT guanwenzhang adeeplearningmethodforvideobasedactionrecognition AT yukunrao adeeplearningmethodforvideobasedactionrecognition AT changhaowang adeeplearningmethodforvideobasedactionrecognition AT weizhou adeeplearningmethodforvideobasedactionrecognition AT xiangyangji adeeplearningmethodforvideobasedactionrecognition AT guanwenzhang deeplearningmethodforvideobasedactionrecognition AT yukunrao deeplearningmethodforvideobasedactionrecognition AT changhaowang deeplearningmethodforvideobasedactionrecognition AT weizhou deeplearningmethodforvideobasedactionrecognition AT xiangyangji deeplearningmethodforvideobasedactionrecognition |
_version_ |
1718407676850864128 |