Search
Now showing items 1-10 of 70
Multi-objective optimisation of wavelet features for phoneme recognition
(Institution of Engineering and Technology, 2016-03)
State-of-the-art speech representations provide acceptable recognition results under optimal conditions, though their performance in adverse conditions still needs to be improved. In this direction, many advances involving ...
Objective quality evaluation in blind source separation for speech recognition in a real room
(Elsevier Science, 2007-12)
The determination of quality of the signals obtained by blind source separation is a very important subject fordevelopment and evaluation of such algorithms. When this approach is used as a pre-processing stage for automatic ...
Robust features in deep-learning-based speech recognition
(Springer Nature Switzerland AG, 2017)
Recent progress in deep learning has revolutionized speech recognition research, with Deep Neural Networks (DNNs) becoming the new state of the art for acoustic modeling. DNNs offer significantly lower speech recognition ...
Disambiguating Conflicting Classification Results in AVSR
(Elsevier, 2019)
A novel scheme for disambiguating conflicting classification results in Audio-Visual Speech Recognition (AVSR) applications is proposed in this paper. The classification scheme can be implemented with both generative and ...
Robust front-end for audio, visual and audio–visual speech classification
(Springer, 2018-06)
This paper proposes a robust front-end for speech classification which can be employed with acoustic, visual or audio–visual information, indistinctly. Wavelet multiresolution analysis is employed to represent temporal ...
Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees
(Elsevier, 2010-04)
Hidden Markov models have been found very useful for a wide range of applications in machine learning and pattern recognition. The wavelet transform has emerged as a new tool for signal and image analysis. Learning models ...
Classification of ASR Word Hypotheses using prosodic information and resampling of training data
(Planta Piloto de Ingeniería Química, 2013-07)
In this work, we propose a novel re-sampling method based on word lattice information and we use prosodic cues with support vector machines for classification. The idea is to consider word recognition as a two-class ...
Hate speech in social networks and recognition of the other: the M. caseDiscurso de ódio em redes sociais e reconhecimento do outro: o caso M.
(Escola de Direito de São Paulo da Fundação Getulio Vargas, 2019)
Generation and Dramatization of Detective Stories
(Brazilian Computing Society (SBC), 2014)
Evolutionary cepstral coefficients
(Elsevier Science, 2011-06)
Evolutionary algorithms provide flexibility and robustness required to find satisfactory solutions in complex search spaces. This is why they are successfully applied for solving real engineering problems. In this work we ...