Buscar
Mostrando ítems 1-10 de 328
A framework for speaker retrieval and identification through unsupervised learning
(2019-11-01)
Speaker recognition is a task of remarkable relevance, with applications in diversified domains. Recently, mainly due to the facilities in audio-visual content acquisition, the capacity of analyzing growing datasets ...
MAP speaker adaptation of state duration distributions for speech recognition
(2002)
This paper presents a framework for maximum a posteriori (MAP) speaker adaptation of state duration distributions in hidden Markov models (HMM). Four key issues of MAP estimation, namely analysis and modeling of state ...
Effective speaker retrieval and recognition through vector quantization and unsupervised distance learning
(2016-06-06)
The huge amount of multimedia content accumulated daily has demanded the development of effective retrieval approaches. In this context, speaker recognition methods capable of automatically identifying a person through ...
Investigating fuzzy methods for multilingual speaker identification
(Universidade Federal do Rio Grande do NorteBrasilUFRNPROGRAMA DE PÓS-GRADUAÇÃO EM SISTEMAS E COMPUTAÇÃO, 2020-08-27)
Morphological and semantic priming in word recognitionImprimación morfológica y semántica en el reconocimiento de palabras
(Universidad Finis Terrae, Facultad de Educación, Psicología y Familia, 2024)
On the determination of epsilon during discriminative GMM training
(2010-12-01)
Discriminative training of Gaussian Mixture Models (GMMs) for speech or speaker recognition purposes is usually based on the gradient descent method, in which the iteration step-size, ε, uses to be defined experimentally. ...
Desenvolvimento de um protótipo para reconhecimento de voz
(Universidade Federal de Santa MariaBrasilUFSMCentro de Tecnologia, 2009-07-17)
The effect of recognizing a person by his voice, through a machine, is known as automatic
speaker recognition. This technique sets up a complex problem when considering
algorithms that demand a fast processing and the ...
Forensic speaker verification using ordinary least squares
(2019-10-02)
In Brazil, the recognition of speakers for forensic purposes still relies on a subjectivity-based decision-making process through a results analysis of untrustworthy techniques. Owing to the lack of a voice database, speaker ...
The use of Locally Normalized Cepstral Coefficients (LNCC) to improve speaker recognition accuracy in highly reverberant rooms
(2016)
We describe the ability of LNCC features (Locally Normalized Cepstral Coefficients) to improve speaker recognition accuracy in highly reverberant environments. We used a realistic test environment, in which we changed the ...
Vocal caricatures reveal signatures of speaker identity
(Nature Publishing Group, 2013-12)
What are the features that impersonators select to elicit a speaker’s identity? We built a voice database of public figures (targets) and imitations produced by professional impersonators. They produced one imitation based ...