info:eu-repo/semantics/article
Voice Signal Typing Using a Pattern Recognition Approach
Fecha
2020-05Registro en:
Miramont, Juan Manuel; Restrepo Rinckoar, Juan Felipe; Codino, J.; Jackson-Menaldi, C.; Schlotthauer, Gaston; Voice Signal Typing Using a Pattern Recognition Approach; Mosby-Elsevier; Journal Of Voice : Official Journal Of The Voice Foundation.; 5-2020
0892-1997
CONICET Digital
CONICET
Autor
Miramont, Juan Manuel
Restrepo Rinckoar, Juan Felipe
Codino, J.
Jackson-Menaldi, C.
Schlotthauer, Gaston
Resumen
Voice signal classification in three types according to their degree of periodicity, a task known as signal typing, is a relevant preprocessing step before computing any perturbation measures. However, it is a time consuming and subjective activity. This has given rise to interest in automatic systems that use objective measures to distinguish among the different signal types. The purpose of this paper is twofold. First, to propose a pattern recognition approach for automatic voice signal typing based on a multi-class linear Support Vector Machine, and using rather well-known parameters like Jitter, Shimmer, Harmonic-to-Noise Ratio, and Cepstral Prominence Peak in combination with nonlinear dynamics measures. Two novel features are also proposed as objective parameters. Second, to validate this approach using a large amount of signals coming from two well-known corpora using cross-dataset experiments to assess the generalizability of the system. A total amount of 1262 signals labeled by professional voice pathologists were used with this purpose. Statistically significant differences between all types were found for all features. Accuracies over 82.71% were estimated in all intra-datasets and inter-datasets using cross-validation. Finally, the use of posterior probabilities is proposed as a measure of the reliability of the assigned type. This could help clinicians to make a more informed decision about the type assigned to a voice. These outcomes suggest that the proposed approach can successfully discriminate among the three voice types, paving the way to a fully automatic tool for voice signal typing in the future.