dc.contributorPetrova, Victoria M.
dc.creatorCoto Jiménez, Marvin
dc.creatorGoddard Close, John
dc.date.accessioned2022-03-31T14:24:29Z
dc.date.accessioned2022-10-20T01:18:27Z
dc.date.available2022-03-31T14:24:29Z
dc.date.available2022-10-20T01:18:27Z
dc.date.created2022-03-31T14:24:29Z
dc.date.issued2018
dc.identifier978-1-53613-842-9
dc.identifierhttps://hdl.handle.net/10669/86347
dc.identifier.urihttps://repositorioslatinoamericanos.uchile.cl/handle/2250/4539290
dc.description.abstractIn this chapter, we introduce hybrid postfilters into speech synthesis, with the objective of enhancing the quality of the synthesized speech. Our approach combines a Wiener filter with deep neural networks. Several attempts to enhance synthetic speech have contemplated single-stage deep-learning-based postfilters, which learn to perform a mapping of the synthetic speech parameters to the natural ones. In the synthetic speech produced by statistical methods, we have measured low-level noise components, so the common single-stage postfilters must achieve the reduction of that component, as well as the complex relationship between the parameters of the synthetic and the natural speech. That is why we consider a two-stage approach: In the first stage, the Wiener filter deals with the noise components of the synthetic speech. In the second stage, a set of multi-stream postfilters, which encompass a collection of autoencoders and auto-associative networks, deal with the relationship between the output of the Wiener filter and the natural speech. Results show that the hybrid approach succeeds in enhancing the synthetic speech in most cases compared to a single-stage approach.
dc.languageeng
dc.sourceAdvances in Engineering Research (pp.223-249).Estados Unidos: Nova Science Publishers
dc.subjectDeep learning
dc.subjectHMM
dc.subjectLSTM
dc.subjectSpeech synthesis
dc.titleHMM-Based Speech Synthesis Enhancement with Hybrid Postfilters
dc.typecapítulo de libro


Este ítem pertenece a la siguiente institución