A System for the Detection of Polyphonic Sound on a University Campus Based on CapsNet-RNN
In recent decades, surveillance and home security systems based on video analysis have been proposed for the automatic detection of abnormal situations. Nevertheless, in several real applications, it may be easier to detect a given event from audio information, and the use of audio surveillance syst...
Guardado en:
Autores principales: | , , , , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
IEEE
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/00625346e21a438491f18f5330021399 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:00625346e21a438491f18f5330021399 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:00625346e21a438491f18f53300213992021-11-18T00:04:45ZA System for the Detection of Polyphonic Sound on a University Campus Based on CapsNet-RNN2169-353610.1109/ACCESS.2021.3123970https://doaj.org/article/00625346e21a438491f18f53300213992021-01-01T00:00:00Zhttps://ieeexplore.ieee.org/document/9592754/https://doaj.org/toc/2169-3536In recent decades, surveillance and home security systems based on video analysis have been proposed for the automatic detection of abnormal situations. Nevertheless, in several real applications, it may be easier to detect a given event from audio information, and the use of audio surveillance systems can greatly improve the robustness and reliability of event detection. In this paper, a novel system for the detection of polyphonic urban noise is proposed for on-campus audio surveillance. The system aggregates different acoustic features to improve the classification accuracy of urban noise. A combination model composed of a capsule neural network (CapsNet) and recurrent neural network (RNN) is employed as the classifier. CapsNet overcomes some limitations of convolutional neural networks (CNNs), such as the loss of position information after max-pooling, and the RNN mainly models the temporal dependency of context information. The combination of these networks further improves the accuracy and robustness of polyphonic sound events detection. Moreover, a monitoring platform is designed to visualize noise maps and acoustic event information. The deployment architecture of the system is used in real environments, and experiments were also conducted on two public datasets. The results demonstrate that the proposed method is superior to existing state-of-art methods for the polyphonic sound event detection task.Liyan LuoLiujun ZhangMei WangZhenghong LiuXin LiuRuibin HeYe JinIEEEarticleDeep learningpolyphonic sound event detectionfeature aggregationmonitoring platformElectrical engineering. Electronics. Nuclear engineeringTK1-9971ENIEEE Access, Vol 9, Pp 147900-147913 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Deep learning polyphonic sound event detection feature aggregation monitoring platform Electrical engineering. Electronics. Nuclear engineering TK1-9971 |
spellingShingle |
Deep learning polyphonic sound event detection feature aggregation monitoring platform Electrical engineering. Electronics. Nuclear engineering TK1-9971 Liyan Luo Liujun Zhang Mei Wang Zhenghong Liu Xin Liu Ruibin He Ye Jin A System for the Detection of Polyphonic Sound on a University Campus Based on CapsNet-RNN |
description |
In recent decades, surveillance and home security systems based on video analysis have been proposed for the automatic detection of abnormal situations. Nevertheless, in several real applications, it may be easier to detect a given event from audio information, and the use of audio surveillance systems can greatly improve the robustness and reliability of event detection. In this paper, a novel system for the detection of polyphonic urban noise is proposed for on-campus audio surveillance. The system aggregates different acoustic features to improve the classification accuracy of urban noise. A combination model composed of a capsule neural network (CapsNet) and recurrent neural network (RNN) is employed as the classifier. CapsNet overcomes some limitations of convolutional neural networks (CNNs), such as the loss of position information after max-pooling, and the RNN mainly models the temporal dependency of context information. The combination of these networks further improves the accuracy and robustness of polyphonic sound events detection. Moreover, a monitoring platform is designed to visualize noise maps and acoustic event information. The deployment architecture of the system is used in real environments, and experiments were also conducted on two public datasets. The results demonstrate that the proposed method is superior to existing state-of-art methods for the polyphonic sound event detection task. |
format |
article |
author |
Liyan Luo Liujun Zhang Mei Wang Zhenghong Liu Xin Liu Ruibin He Ye Jin |
author_facet |
Liyan Luo Liujun Zhang Mei Wang Zhenghong Liu Xin Liu Ruibin He Ye Jin |
author_sort |
Liyan Luo |
title |
A System for the Detection of Polyphonic Sound on a University Campus Based on CapsNet-RNN |
title_short |
A System for the Detection of Polyphonic Sound on a University Campus Based on CapsNet-RNN |
title_full |
A System for the Detection of Polyphonic Sound on a University Campus Based on CapsNet-RNN |
title_fullStr |
A System for the Detection of Polyphonic Sound on a University Campus Based on CapsNet-RNN |
title_full_unstemmed |
A System for the Detection of Polyphonic Sound on a University Campus Based on CapsNet-RNN |
title_sort |
system for the detection of polyphonic sound on a university campus based on capsnet-rnn |
publisher |
IEEE |
publishDate |
2021 |
url |
https://doaj.org/article/00625346e21a438491f18f5330021399 |
work_keys_str_mv |
AT liyanluo asystemforthedetectionofpolyphonicsoundonauniversitycampusbasedoncapsnetrnn AT liujunzhang asystemforthedetectionofpolyphonicsoundonauniversitycampusbasedoncapsnetrnn AT meiwang asystemforthedetectionofpolyphonicsoundonauniversitycampusbasedoncapsnetrnn AT zhenghongliu asystemforthedetectionofpolyphonicsoundonauniversitycampusbasedoncapsnetrnn AT xinliu asystemforthedetectionofpolyphonicsoundonauniversitycampusbasedoncapsnetrnn AT ruibinhe asystemforthedetectionofpolyphonicsoundonauniversitycampusbasedoncapsnetrnn AT yejin asystemforthedetectionofpolyphonicsoundonauniversitycampusbasedoncapsnetrnn AT liyanluo systemforthedetectionofpolyphonicsoundonauniversitycampusbasedoncapsnetrnn AT liujunzhang systemforthedetectionofpolyphonicsoundonauniversitycampusbasedoncapsnetrnn AT meiwang systemforthedetectionofpolyphonicsoundonauniversitycampusbasedoncapsnetrnn AT zhenghongliu systemforthedetectionofpolyphonicsoundonauniversitycampusbasedoncapsnetrnn AT xinliu systemforthedetectionofpolyphonicsoundonauniversitycampusbasedoncapsnetrnn AT ruibinhe systemforthedetectionofpolyphonicsoundonauniversitycampusbasedoncapsnetrnn AT yejin systemforthedetectionofpolyphonicsoundonauniversitycampusbasedoncapsnetrnn |
_version_ |
1718425202559287296 |