Modeling consonant-vowel coarticulation for articulatory speech synthesis.

A central challenge for articulatory speech synthesis is the simulation of realistic articulatory movements, which is critical for the generation of highly natural and intelligible speech. This includes modeling coarticulation, i.e., the context-dependent variation of the articulatory and acoustic r...

Descripción completa

Guardado en:

Detalles Bibliográficos
Autor principal:	Peter Birkholz
Formato:	article
Lenguaje:	EN
Publicado:	Public Library of Science (PLoS) 2013
Materias:	Medicine R Science Q
Acceso en línea:	https://doaj.org/article/2595abfef4f346e0bc8e5d3f267901fd
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

id	oai:doaj.org-article:2595abfef4f346e0bc8e5d3f267901fd
record_format	dspace
spelling	oai:doaj.org-article:2595abfef4f346e0bc8e5d3f267901fd2021-11-18T07:49:24ZModeling consonant-vowel coarticulation for articulatory speech synthesis.1932-620310.1371/journal.pone.0060603https://doaj.org/article/2595abfef4f346e0bc8e5d3f267901fd2013-01-01T00:00:00Zhttps://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23613734/?tool=EBIhttps://doaj.org/toc/1932-6203A central challenge for articulatory speech synthesis is the simulation of realistic articulatory movements, which is critical for the generation of highly natural and intelligible speech. This includes modeling coarticulation, i.e., the context-dependent variation of the articulatory and acoustic realization of phonemes, especially of consonants. Here we propose a method to simulate the context-sensitive articulation of consonants in consonant-vowel syllables. To achieve this, the vocal tract target shape of a consonant in the context of a given vowel is derived as the weighted average of three measured and acoustically-optimized reference vocal tract shapes for that consonant in the context of the corner vowels /a/, /i/, and /u/. The weights are determined by mapping the target shape of the given context vowel into the vowel subspace spanned by the corner vowels. The model was applied for the synthesis of consonant-vowel syllables with the consonants /b/, /d/, /g/, /l/, /r/, /m/, /n/ in all combinations with the eight long German vowels. In a perception test, the mean recognition rate for the consonants in the isolated syllables was 82.4%. This demonstrates the potential of the approach for highly intelligible articulatory speech synthesis.Peter BirkholzPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 8, Iss 4, p e60603 (2013)
institution	DOAJ
collection	DOAJ
language	EN
topic	Medicine R Science Q
spellingShingle	Medicine R Science Q Peter Birkholz Modeling consonant-vowel coarticulation for articulatory speech synthesis.
description	A central challenge for articulatory speech synthesis is the simulation of realistic articulatory movements, which is critical for the generation of highly natural and intelligible speech. This includes modeling coarticulation, i.e., the context-dependent variation of the articulatory and acoustic realization of phonemes, especially of consonants. Here we propose a method to simulate the context-sensitive articulation of consonants in consonant-vowel syllables. To achieve this, the vocal tract target shape of a consonant in the context of a given vowel is derived as the weighted average of three measured and acoustically-optimized reference vocal tract shapes for that consonant in the context of the corner vowels /a/, /i/, and /u/. The weights are determined by mapping the target shape of the given context vowel into the vowel subspace spanned by the corner vowels. The model was applied for the synthesis of consonant-vowel syllables with the consonants /b/, /d/, /g/, /l/, /r/, /m/, /n/ in all combinations with the eight long German vowels. In a perception test, the mean recognition rate for the consonants in the isolated syllables was 82.4%. This demonstrates the potential of the approach for highly intelligible articulatory speech synthesis.
format	article
author	Peter Birkholz
author_facet	Peter Birkholz
author_sort	Peter Birkholz
title	Modeling consonant-vowel coarticulation for articulatory speech synthesis.
title_short	Modeling consonant-vowel coarticulation for articulatory speech synthesis.
title_full	Modeling consonant-vowel coarticulation for articulatory speech synthesis.
title_fullStr	Modeling consonant-vowel coarticulation for articulatory speech synthesis.
title_full_unstemmed	Modeling consonant-vowel coarticulation for articulatory speech synthesis.
title_sort	modeling consonant-vowel coarticulation for articulatory speech synthesis.
publisher	Public Library of Science (PLoS)
publishDate	2013
url	https://doaj.org/article/2595abfef4f346e0bc8e5d3f267901fd
work_keys_str_mv	AT peterbirkholz modelingconsonantvowelcoarticulationforarticulatoryspeechsynthesis
_version_	1718422900785020928

Modeling consonant-vowel coarticulation for articulatory speech synthesis.

Ejemplares similares