Crowd activity recognition in live video streaming via 3D‐ResNet and region graph convolution network
Abstract Since the era of we‐media, live video industry has shown an explosive growth trend. For large‐scale live video streaming, especially those containing crowd events that may cause great social impact, how to identify and supervise the crowd activity in live video streaming effectively is of g...
Guardado en:
Autores principales: | , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Wiley
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/2c1e125c055446439dcfecf525920db4 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:2c1e125c055446439dcfecf525920db4 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:2c1e125c055446439dcfecf525920db42021-11-29T03:38:16ZCrowd activity recognition in live video streaming via 3D‐ResNet and region graph convolution network1751-96671751-965910.1049/ipr2.12239https://doaj.org/article/2c1e125c055446439dcfecf525920db42021-12-01T00:00:00Zhttps://doi.org/10.1049/ipr2.12239https://doaj.org/toc/1751-9659https://doaj.org/toc/1751-9667Abstract Since the era of we‐media, live video industry has shown an explosive growth trend. For large‐scale live video streaming, especially those containing crowd events that may cause great social impact, how to identify and supervise the crowd activity in live video streaming effectively is of great value to push the healthy development of live video industry. The existing crowd activity recognition mainly uses visual information, rarely fully exploiting and utilizing the correlation or external knowledge between crowd content. Therefore, a crowd activity recognition method in live video streaming is proposed by 3D‐ResNet and regional graph convolution network (ReGCN). (1) After extracting deep spatiotemporal features from live video streaming with 3D‐ResNet, the region proposals are generated by region proposal network. (2) A weakly supervised ReGCN is constructed by making region proposals as graph nodes and their correlations as edges. (3) Crowd activity in live video streaming is recognised by combining the output of ReGCN, the deep spatiotemporal features and the crowd motion intensity as external knowledge. Four experiments are conducted on the public collective activity extended dataset and a real‐world dataset BJUT‐CAD. The competitive results demonstrate that our method can effectively recognise crowd activity in live video streaming.Junpeng KangJing ZhangWensheng LiLi ZhuoWileyarticlePhotographyTR1-1050Computer softwareQA76.75-76.765ENIET Image Processing, Vol 15, Iss 14, Pp 3476-3486 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Photography TR1-1050 Computer software QA76.75-76.765 |
spellingShingle |
Photography TR1-1050 Computer software QA76.75-76.765 Junpeng Kang Jing Zhang Wensheng Li Li Zhuo Crowd activity recognition in live video streaming via 3D‐ResNet and region graph convolution network |
description |
Abstract Since the era of we‐media, live video industry has shown an explosive growth trend. For large‐scale live video streaming, especially those containing crowd events that may cause great social impact, how to identify and supervise the crowd activity in live video streaming effectively is of great value to push the healthy development of live video industry. The existing crowd activity recognition mainly uses visual information, rarely fully exploiting and utilizing the correlation or external knowledge between crowd content. Therefore, a crowd activity recognition method in live video streaming is proposed by 3D‐ResNet and regional graph convolution network (ReGCN). (1) After extracting deep spatiotemporal features from live video streaming with 3D‐ResNet, the region proposals are generated by region proposal network. (2) A weakly supervised ReGCN is constructed by making region proposals as graph nodes and their correlations as edges. (3) Crowd activity in live video streaming is recognised by combining the output of ReGCN, the deep spatiotemporal features and the crowd motion intensity as external knowledge. Four experiments are conducted on the public collective activity extended dataset and a real‐world dataset BJUT‐CAD. The competitive results demonstrate that our method can effectively recognise crowd activity in live video streaming. |
format |
article |
author |
Junpeng Kang Jing Zhang Wensheng Li Li Zhuo |
author_facet |
Junpeng Kang Jing Zhang Wensheng Li Li Zhuo |
author_sort |
Junpeng Kang |
title |
Crowd activity recognition in live video streaming via 3D‐ResNet and region graph convolution network |
title_short |
Crowd activity recognition in live video streaming via 3D‐ResNet and region graph convolution network |
title_full |
Crowd activity recognition in live video streaming via 3D‐ResNet and region graph convolution network |
title_fullStr |
Crowd activity recognition in live video streaming via 3D‐ResNet and region graph convolution network |
title_full_unstemmed |
Crowd activity recognition in live video streaming via 3D‐ResNet and region graph convolution network |
title_sort |
crowd activity recognition in live video streaming via 3d‐resnet and region graph convolution network |
publisher |
Wiley |
publishDate |
2021 |
url |
https://doaj.org/article/2c1e125c055446439dcfecf525920db4 |
work_keys_str_mv |
AT junpengkang crowdactivityrecognitioninlivevideostreamingvia3dresnetandregiongraphconvolutionnetwork AT jingzhang crowdactivityrecognitioninlivevideostreamingvia3dresnetandregiongraphconvolutionnetwork AT wenshengli crowdactivityrecognitioninlivevideostreamingvia3dresnetandregiongraphconvolutionnetwork AT lizhuo crowdactivityrecognitioninlivevideostreamingvia3dresnetandregiongraphconvolutionnetwork |
_version_ |
1718407634964447232 |