dc.creatorYoma, Néstor Becerra
dc.creatorSánchez, Jorge Silva
dc.date.accessioned2019-01-29T17:51:50Z
dc.date.available2019-01-29T17:51:50Z
dc.date.created2019-01-29T17:51:50Z
dc.date.issued2002
dc.identifierIEEE Transactions on Speech and Audio Processing, Volumen 10, Issue 7, 2018, Pages 443-450
dc.identifier10636676
dc.identifier10.1109/TSA.2002.803441
dc.identifierhttp://repositorio.uchile.cl/handle/2250/163580
dc.description.abstractThis paper presents a framework for maximum a posteriori (MAP) speaker adaptation of state duration distributions in hidden Markov models (HMM). Four key issues of MAP estimation, namely analysis and modeling of state duration distributions, the choice of prior distribution, the specification of the parameters of the prior density and the evaluation of the MAP estimates, are tackled. Moreover, a comparison with an adaptation procedure based on maximum likelihood (ML) estimation is presented, and the problem of truncation of the state duration distribution is addressed from the statistical point of view. The results shown in this paper suggest that the speaker adaptation of temporal restrictions substantially improves the accuracy of speaker-independent (SI) HMM with clean and noisy speech. The method requires a low computational load and a small number of adapting utterances, and can be useful to follow the dynamics of the speaking rate in speech recognition.
dc.languageen
dc.rightshttp://creativecommons.org/licenses/by-nc-nd/3.0/cl/
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Chile
dc.sourceIEEE Transactions on Speech and Audio Processing
dc.subjectSpeaker adaptation
dc.subjectSpeech recognition
dc.subjectState duration modeling
dc.titleMAP speaker adaptation of state duration distributions for speech recognition
dc.typeArtículos de revistas


Este ítem pertenece a la siguiente institución