Human behaviour recognition with mid‐level representations for crowd understanding and analysis

Abstract Crowd understanding and analysis have received increasing attention for couples of decades, and development of human behaviour recognition strongly supports the application of crowd understanding and analysis. Human behaviour recognition usually seeks to automatically analyse ongoing moveme...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Bangyong Sun, Nianzeng Yuan, Shuying Li, Siyuan Wu, Nan Wang
Formato: article
Lenguaje:EN
Publicado: Wiley 2021
Materias:
Acceso en línea:https://doaj.org/article/90e51ef616c14f9a8510351812599a83
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:90e51ef616c14f9a8510351812599a83
record_format dspace
spelling oai:doaj.org-article:90e51ef616c14f9a8510351812599a832021-11-29T03:38:15ZHuman behaviour recognition with mid‐level representations for crowd understanding and analysis1751-96671751-965910.1049/ipr2.12147https://doaj.org/article/90e51ef616c14f9a8510351812599a832021-12-01T00:00:00Zhttps://doi.org/10.1049/ipr2.12147https://doaj.org/toc/1751-9659https://doaj.org/toc/1751-9667Abstract Crowd understanding and analysis have received increasing attention for couples of decades, and development of human behaviour recognition strongly supports the application of crowd understanding and analysis. Human behaviour recognition usually seeks to automatically analyse ongoing movements and actions in different camera views by using various machine learning methodologies in unknown video clips or image sequences. Compared to other data modalities such as documents and images, processing video data demands much higher computational and storage resources. The idea of using middle level semantic concepts to represent human actions from videos is explored and it is argued that these semantic attributes enable the construction of more descriptive methods for human action recognition. The mid‐level attributes, initialized by a cluster processing, are built upon low level features and fully utilize the discrepancies in different action classes, which can capture the importance of each attribute for each action class. In this way, the representation is constructed to be semantically rich and capable of highly discriminative performance even paired with simple linear classifiers. The method is verified on three challenging datasets (KTH, UCF50 and HMDB51), and the experimental results demonstrate that our method achieves better results than the baseline methods on human action recognition.Bangyong SunNianzeng YuanShuying LiSiyuan WuNan WangWileyarticlePhotographyTR1-1050Computer softwareQA76.75-76.765ENIET Image Processing, Vol 15, Iss 14, Pp 3414-3424 (2021)
institution DOAJ
collection DOAJ
language EN
topic Photography
TR1-1050
Computer software
QA76.75-76.765
spellingShingle Photography
TR1-1050
Computer software
QA76.75-76.765
Bangyong Sun
Nianzeng Yuan
Shuying Li
Siyuan Wu
Nan Wang
Human behaviour recognition with mid‐level representations for crowd understanding and analysis
description Abstract Crowd understanding and analysis have received increasing attention for couples of decades, and development of human behaviour recognition strongly supports the application of crowd understanding and analysis. Human behaviour recognition usually seeks to automatically analyse ongoing movements and actions in different camera views by using various machine learning methodologies in unknown video clips or image sequences. Compared to other data modalities such as documents and images, processing video data demands much higher computational and storage resources. The idea of using middle level semantic concepts to represent human actions from videos is explored and it is argued that these semantic attributes enable the construction of more descriptive methods for human action recognition. The mid‐level attributes, initialized by a cluster processing, are built upon low level features and fully utilize the discrepancies in different action classes, which can capture the importance of each attribute for each action class. In this way, the representation is constructed to be semantically rich and capable of highly discriminative performance even paired with simple linear classifiers. The method is verified on three challenging datasets (KTH, UCF50 and HMDB51), and the experimental results demonstrate that our method achieves better results than the baseline methods on human action recognition.
format article
author Bangyong Sun
Nianzeng Yuan
Shuying Li
Siyuan Wu
Nan Wang
author_facet Bangyong Sun
Nianzeng Yuan
Shuying Li
Siyuan Wu
Nan Wang
author_sort Bangyong Sun
title Human behaviour recognition with mid‐level representations for crowd understanding and analysis
title_short Human behaviour recognition with mid‐level representations for crowd understanding and analysis
title_full Human behaviour recognition with mid‐level representations for crowd understanding and analysis
title_fullStr Human behaviour recognition with mid‐level representations for crowd understanding and analysis
title_full_unstemmed Human behaviour recognition with mid‐level representations for crowd understanding and analysis
title_sort human behaviour recognition with mid‐level representations for crowd understanding and analysis
publisher Wiley
publishDate 2021
url https://doaj.org/article/90e51ef616c14f9a8510351812599a83
work_keys_str_mv AT bangyongsun humanbehaviourrecognitionwithmidlevelrepresentationsforcrowdunderstandingandanalysis
AT nianzengyuan humanbehaviourrecognitionwithmidlevelrepresentationsforcrowdunderstandingandanalysis
AT shuyingli humanbehaviourrecognitionwithmidlevelrepresentationsforcrowdunderstandingandanalysis
AT siyuanwu humanbehaviourrecognitionwithmidlevelrepresentationsforcrowdunderstandingandanalysis
AT nanwang humanbehaviourrecognitionwithmidlevelrepresentationsforcrowdunderstandingandanalysis
_version_ 1718407659394170880