ENHANCEMENT OF SPEECH SIGNAL SEGMENTATION USING TEAGER ENERGY OPERATOR

Background. Speech signal segmentation is detection of the boundaries of the beginning and the end of sections of voiced and unvoiced speech, and pauses. Accurate detection of the boundaries both improves the quality of speech signal segmentation, and reduces the number of computational operations....

Descripción completa

Guardado en:

Detalles Bibliográficos
Autor principal:	A. K. Alimuradov
Formato:	article
Lenguaje:	EN RU
Publicado:	Penza State University Publishing House 2021
Materias:	speech processing speech segmentation voiced and unvoiced speech pauses teager energy operator Engineering (General). Civil engineering (General) TA1-2040
Acceso en línea:	https://doaj.org/article/f69780ad369246798f2233c4d0423ead
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

id	oai:doaj.org-article:f69780ad369246798f2233c4d0423ead
record_format	dspace
spelling	oai:doaj.org-article:f69780ad369246798f2233c4d0423ead2021-11-24T10:50:14ZENHANCEMENT OF SPEECH SIGNAL SEGMENTATION USING TEAGER ENERGY OPERATOR10.21685/2307-5538-2021-3-102307-5538https://doaj.org/article/f69780ad369246798f2233c4d0423ead2021-11-01T00:00:00Zhttps://doaj.org/toc/2307-5538Background. Speech signal segmentation is detection of the boundaries of the beginning and the end of sections of voiced and unvoiced speech, and pauses. Accurate detection of the boundaries both improves the quality of speech signal segmentation, and reduces the number of computational operations. The aim of the work is to improve the efficiency of segmentation based on the energy analysis of speech signals using the Teager energy operator. Materials and methods. The second-order differential Teager energy operator, which makes it possible to estimate the energy characteristics of a signal, was used in this work. The Teager operator is simple, efficient, and highly susceptible to changes in signal amplitude and frequency.The software implementation of the method was performed in ©MATLAB (MathWorks) mathematical modeling environment. Results. An improved method for speech signal segmentation, providing an increase in the efficiency of detecting voiced and unvoiced areas, and pauses, has been developed. The nature of the method is the energy analysis of speech signal fragments using the Teager energy operator; analysis of zerocrossing rate and short-term energy of the energy characteristic function. Research to assess the efficiency and noise robustness of the improved method over the known segmentation methods, was carried out. Conclusions. In accordance with the obtained research results, it was revealed that due to the good susceptibility of the Teager energy operator to sharp changes in signal amplitude and frequency, the improved method provides an increase in the segmentation efficiency by 2.97 % and 2.49 % for the 1st and 2nd kind errors, respectively.A. K. AlimuradovPenza State University Publishing Housearticlespeech processingspeech segmentationvoiced and unvoiced speechpausesteager energy operatorEngineering (General). Civil engineering (General)TA1-2040ENRUИзмерение, мониторинг, управление, контроль, Iss 3 (2021)
institution	DOAJ
collection	DOAJ
language	EN RU
topic	speech processing speech segmentation voiced and unvoiced speech pauses teager energy operator Engineering (General). Civil engineering (General) TA1-2040
spellingShingle	speech processing speech segmentation voiced and unvoiced speech pauses teager energy operator Engineering (General). Civil engineering (General) TA1-2040 A. K. Alimuradov ENHANCEMENT OF SPEECH SIGNAL SEGMENTATION USING TEAGER ENERGY OPERATOR
description	Background. Speech signal segmentation is detection of the boundaries of the beginning and the end of sections of voiced and unvoiced speech, and pauses. Accurate detection of the boundaries both improves the quality of speech signal segmentation, and reduces the number of computational operations. The aim of the work is to improve the efficiency of segmentation based on the energy analysis of speech signals using the Teager energy operator. Materials and methods. The second-order differential Teager energy operator, which makes it possible to estimate the energy characteristics of a signal, was used in this work. The Teager operator is simple, efficient, and highly susceptible to changes in signal amplitude and frequency.The software implementation of the method was performed in ©MATLAB (MathWorks) mathematical modeling environment. Results. An improved method for speech signal segmentation, providing an increase in the efficiency of detecting voiced and unvoiced areas, and pauses, has been developed. The nature of the method is the energy analysis of speech signal fragments using the Teager energy operator; analysis of zerocrossing rate and short-term energy of the energy characteristic function. Research to assess the efficiency and noise robustness of the improved method over the known segmentation methods, was carried out. Conclusions. In accordance with the obtained research results, it was revealed that due to the good susceptibility of the Teager energy operator to sharp changes in signal amplitude and frequency, the improved method provides an increase in the segmentation efficiency by 2.97 % and 2.49 % for the 1st and 2nd kind errors, respectively.
format	article
author	A. K. Alimuradov
author_facet	A. K. Alimuradov
author_sort	A. K. Alimuradov
title	ENHANCEMENT OF SPEECH SIGNAL SEGMENTATION USING TEAGER ENERGY OPERATOR
title_short	ENHANCEMENT OF SPEECH SIGNAL SEGMENTATION USING TEAGER ENERGY OPERATOR
title_full	ENHANCEMENT OF SPEECH SIGNAL SEGMENTATION USING TEAGER ENERGY OPERATOR
title_fullStr	ENHANCEMENT OF SPEECH SIGNAL SEGMENTATION USING TEAGER ENERGY OPERATOR
title_full_unstemmed	ENHANCEMENT OF SPEECH SIGNAL SEGMENTATION USING TEAGER ENERGY OPERATOR
title_sort	enhancement of speech signal segmentation using teager energy operator
publisher	Penza State University Publishing House
publishDate	2021
url	https://doaj.org/article/f69780ad369246798f2233c4d0423ead
work_keys_str_mv	AT akalimuradov enhancementofspeechsignalsegmentationusingteagerenergyoperator
_version_	1718415085710344192

ENHANCEMENT OF SPEECH SIGNAL SEGMENTATION USING TEAGER ENERGY OPERATOR

Ejemplares similares