dc.creatorDehghan Firoozabadi, Ali
dc.creatorIrarrázaval, Pablo
dc.creatorAdasme, Pablo
dc.creatorZabala-Blanco, David
dc.creatorPalacios Játiva, Pablo
dc.creatorDurney, Hugo
dc.creatorSanhueza Olave, Miguel
dc.date2023-03-08T13:39:47Z
dc.date2022
dc.date.accessioned2024-05-02T20:30:39Z
dc.date.available2024-05-02T20:30:39Z
dc.identifierhttp://repositorio.ucm.cl/handle/ucm/4500
dc.identifier.urihttps://repositorioslatinoamericanos.uchile.cl/handle/2250/9274745
dc.descriptionThis paper proposes a new speaker counting algorithm that combines a novel zig-zag nested array (ZZNA) with an adaptive generalized cross-correlation (GCC) function (using phase transform (PHAT) and maximum likelihood (ML) weighting), the wavelet packet transform (WPT), and an agglomerative classification method with the Elbow decision criterion. The ZZNA is designed to cover the acoustic environment and eliminate spatial aliasing. The WPT, with varying frequency resolution, then produces the frequency subbands. The adaptive GCC function, based on PHAT and ML weighting filters, is computed on the microphone pairs for each subband. Finally, the unsupervised agglomerative classification method with the Elbow criterion classifies the resulting information and counts the speakers. The proposed ZZNA-WAGC method is compared with the Hilbert envelope, the multi-channel correlational recurrent neural network using ambisonics features (AF-CRNN), and the estimation of the number of speakers by density-based classification and clustering decision (ENS-DCCD) algorithms to show its superiority in adverse scenarios.
dc.languageen
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Chile
dc.rightshttp://creativecommons.org/licenses/by-nc-nd/3.0/cl/
dc.source8th International Conference on Signal Processing and Communication (ICSC), Noida, India, 358-363
dc.subjectSignal processing algorithms
dc.subjectAdaptive filters
dc.subjectAdaptive arrays
dc.subjectInformation filters
dc.subjectWavelet packets
dc.subjectMicrophone arrays
dc.subjectClassification algorithms
dc.titleEstimating the number of speakers by novel zig-zag nested microphone array based on wavelet packet and adaptive GCC method
dc.typeArticle

