dc.contributorDey, Nilanjan
dc.creatorSad, Gonzalo Daniel
dc.creatorTerissi, Lucas Daniel
dc.creatorGómez, Juan Carlos
dc.date.accessioned2021-05-19T18:35:45Z
dc.date.accessioned2022-10-15T15:51:44Z
dc.date.available2021-05-19T18:35:45Z
dc.date.available2022-10-15T15:51:44Z
dc.date.created2021-05-19T18:35:45Z
dc.date.issued2019
dc.identifierSad, Gonzalo Daniel; Terissi, Lucas Daniel; Gómez, Juan Carlos; Disambiguating Conflicting Classification Results in AVSR; Elsevier; 2019; 55-80
dc.identifier978-0-12-818130-0
dc.identifierhttp://hdl.handle.net/11336/132286
dc.identifierCONICET Digital
dc.identifierCONICET
dc.identifier.urihttps://repositorioslatinoamericanos.uchile.cl/handle/2250/4405377
dc.description.abstractA novel scheme for disambiguating conflicting classification results in Audio-Visual Speech Recognition (AVSR) applications is proposed in this paper. The classification scheme can be implemented with both generative and discriminative models and can be used with different input modalities, viz. only audio, only visual, and audio visual information. The proposed scheme consists of the cascade connection of a standard classifier, trained with instances of each particular class, followed by a complementary model which is trained with instances of all the remaining classes. The performance of the proposed recognition system is evaluated on three publicly available audio-visual datasets, and using a generative model, namely a Hidden Markov Model, and three discriminative techniques, viz. Random Forests, Support Vector Machines, and Adaptive Boosting. The experimental results are promising in the sense that for the three datasets, the different models, and the different input modalities, improvements in the recognition rates are achieved in comparison to other methods reported in the literature over the same datasets.
dc.languageeng
dc.publisherElsevier
dc.relationinfo:eu-repo/semantics/altIdentifier/doi/https://doi.org/10.1016/B978-0-12-818130-0.00004-0
dc.relationinfo:eu-repo/semantics/altIdentifier/url/https://www.sciencedirect.com/science/article/pii/B9780128181300000040
dc.rightshttps://creativecommons.org/licenses/by-nc-sa/2.5/ar/
dc.rightsinfo:eu-repo/semantics/restrictedAccess
dc.sourceIntelligent Speech Signal Processing
dc.subjectSPEECH CLASSIFICATION
dc.subjectAUDIO-VISUAL SPEECH
dc.subjectCOMPLEMENTARY MODELS
dc.subjectCLASSIFIER COMBINATION
dc.titleDisambiguating Conflicting Classification Results in AVSR
dc.typeinfo:eu-repo/semantics/publishedVersion
dc.typeinfo:eu-repo/semantics/bookPart
dc.typeinfo:ar-repo/semantics/parte de libro


Este ítem pertenece a la siguiente institución