Showing items 1-10 of 180
Noisy Speech Recognition Based on Combined Audio-Visual Classifiers
(Springer, 2015-01)
An isolated word speech recognition system based on audio-visual features is proposed in this paper. To enhance the recognition over different noisy conditions, this system combines three classifiers based on audio, visual ...
Audio-Visual Automatic Speech Recognition Using PZM, MFCC and Statistical Analysis
Audio-Visual Automatic Speech Recognition (AV-ASR) has become the most promising research area when the audio signal gets corrupted by noise. The main objective of this paper is to select the important and discriminative ...
Robust front-end for audio, visual and audio–visual speech classification
(Springer, 2018-06)
This paper proposes a robust front-end for speech classification which can be employed with acoustic, visual or audio–visual information, indistinctly. Wavelet multiresolution analysis is employed to represent temporal ...
Disambiguating Conflicting Classification Results in AVSR
(Elsevier, 2019)
A novel scheme for disambiguating conflicting classification results in Audio-Visual Speech Recognition (AVSR) applications is proposed in this paper. The classification scheme can be implemented with both generative and ...
A comprehensive system for facial animation of generic 3D head models driven by speech
(Springer, 2013-02)
A comprehensive system for facial animation of generic 3D head models driven by speech is presented in this article. In the training stage, audio-visual information is extracted from audio-visual training data, and then ...
Audio-Visual Automatic Speech Recognition Towards Education for Disabilities
Education is a fundamental right that enriches everyone’s life. However, physically challenged people are often debarred from the general and advanced education system. Audio-Visual Automatic Speech Recognition (AV-ASR) based ...
Faces and Voices Processing in Human and Primate Brains: Rhythmic and Multimodal Mechanisms Underlying the Evolution and Development of Speech
(2022)
While influential works since the 1970s have widely assumed that imitation is an innate skill in both human and non-human primate neonates, recent empirical studies and meta-analyses have challenged this view, indicating ...
A method for lexical tone classification in audio-visual speech
(Universidade Federal de Minas Gerais, Brasil, FALE - Faculdade de Letras, UFMG, 2020)
Chapter 13 : Origin and evolution of human speech : emergence from a trimodal auditory, visual and vocal network
(2019)
In recent years, there have been important additions to the classical model of speech processing as originally depicted by the Broca–Wernicke model consisting of an anterior, productive region and a posterior, perceptive ...