Disambiguating Conflicting Classification Results in AVSR

Sad, Gonzalo Daniel; Terissi, Lucas Daniel; Gómez, Juan Carlos

dc.contributor	Dey, Nilanjan
dc.creator	Sad, Gonzalo Daniel
dc.creator	Terissi, Lucas Daniel
dc.creator	Gómez, Juan Carlos
dc.date.accessioned	2021-05-19T18:35:45Z
dc.date.accessioned	2022-10-15T15:51:44Z
dc.date.available	2021-05-19T18:35:45Z
dc.date.available	2022-10-15T15:51:44Z
dc.date.created	2021-05-19T18:35:45Z
dc.date.issued	2019
dc.identifier	Sad, Gonzalo Daniel; Terissi, Lucas Daniel; Gómez, Juan Carlos; Disambiguating Conflicting Classification Results in AVSR; Elsevier; 2019; 55-80
dc.identifier	978-0-12-818130-0
dc.identifier	http://hdl.handle.net/11336/132286
dc.identifier	CONICET Digital
dc.identifier	CONICET
dc.identifier.uri	https://repositorioslatinoamericanos.uchile.cl/handle/2250/4405377
dc.description.abstract	A novel scheme for disambiguating conflicting classification results in Audio-Visual Speech Recognition (AVSR) applications is proposed in this paper. The classification scheme can be implemented with both generative and discriminative models and can be used with different input modalities, viz. only audio, only visual, and audio visual information. The proposed scheme consists of the cascade connection of a standard classifier, trained with instances of each particular class, followed by a complementary model which is trained with instances of all the remaining classes. The performance of the proposed recognition system is evaluated on three publicly available audio-visual datasets, and using a generative model, namely a Hidden Markov Model, and three discriminative techniques, viz. Random Forests, Support Vector Machines, and Adaptive Boosting. The experimental results are promising in the sense that for the three datasets, the different models, and the different input modalities, improvements in the recognition rates are achieved in comparison to other methods reported in the literature over the same datasets.
dc.language	eng
dc.publisher	Elsevier
dc.relation	info:eu-repo/semantics/altIdentifier/doi/https://doi.org/10.1016/B978-0-12-818130-0.00004-0
dc.relation	info:eu-repo/semantics/altIdentifier/url/https://www.sciencedirect.com/science/article/pii/B9780128181300000040
dc.rights	https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
dc.rights	info:eu-repo/semantics/restrictedAccess
dc.source	Intelligent Speech Signal Processing
dc.subject	SPEECH CLASSIFICATION
dc.subject	AUDIO-VISUAL SPEECH
dc.subject	COMPLEMENTARY MODELS
dc.subject	CLASSIFIER COMBINATION
dc.title	Disambiguating Conflicting Classification Results in AVSR
dc.type	info:eu-repo/semantics/publishedVersion
dc.type	info:eu-repo/semantics/bookPart
dc.type	info:ar-repo/semantics/parte de libro

Este ítem pertenece a la siguiente institución

Consejo Nacional de Investigaciones Científicas y Tecnológicas (Argentina)