Search
Now showing items 31-40 of 1192
On the determination of epsilon during discriminative GMM training
(2010-12-01)
Discriminative training of Gaussian Mixture Models (GMMs) for speech or speaker recognition purposes is usually based on the gradient descent method, in which the iteration step-size, ε, uses to be defined experimentally. ...
Robust features in deep-learning-based speech recognition
(Springer Nature Switzerland AG, 2017)
Recent progress in deep learning has revolutionized speech recognition research, with Deep Neural Networks (DNNs) becoming the new state of the art for acoustic modeling. DNNs offer significantly lower speech recognition ...
Desenvolvimento de um sistema de reconhecimento de fala usando modelos ocultos de Markov
(Universidade Tecnológica Federal do ParanáCornelio ProcopioBrasilEngenharia ElétricaUTFPR, 2014-11-26)
In this study, we present the development of a speech recognition system in Matlab software that can recognize words spoken by different speakers. The method proposed is based on three stages: signal pre-processing, Markov ...
Automatic speech-to-text transcription in an ecuadorian radio broadcast context
(SPRINGER VERLAG, 2017-09-19)
A key element to enable the analysis and accessing to radio broadcast content is the development of automatic speech-to-text systems. The building of these systems has been possible given the current available of different ...
An Investigation of Type-1 Adaptive Neural Fuzzy Inference System for Speech Reconigtion
(Universidade Federal do Rio Grande do NorteBrasilUFRNBacharelado em Ciência da Computação, 2018-06-19)
Using voice for user recognition is something that humans do since the beginning and
it a very natural ability. Being able to recognise the user by its voice is very important,
but, in some cases, being able to recognise ...
Tools and technologies for computer-aided speech and language therapy
This paper addresses the problem of Computer-Aided Speech and Language Therapy (CASLT). The goal of the work described in the paper is to develop and evaluate a semi-automated system for providing interactive speech therapy ...
A tutorial on signal energy and its applications
(2016-02-29)
This tutorial, dedicated both to young professionals and students working with digital signal processing and pattern recognition, introduces three feature extraction approaches based on signal energy, characterising ...
Non-flat audiograms in sensorineural hearing loss and speech perception
(Faculdade de Medicina / USP, 2013-06-01)
OBJECTIVE: The audibility thresholds for the sound frequency of 137 upward- and downward-sloping audiograms showing sensorineural hearing loss were selected and analyzed in conjunction with speech recognition thresholds ...
Modeling, estimating, and compensating low-bit rate coding distortion in speech recognition
(2006)
A solution to the problem of speech recognition with signals distorted by low-bit rate coders is presented in this paper. A model for the coding-decoding distortion, a HMM compensation method to include this model, and an ...