Mostra el registre d'ítem simple
Jitter and Shimmer measurements for speaker diarization
dc.contributor.author | Zewoudie, Abraham Woubie |
dc.contributor.author | Luque, Jordi |
dc.contributor.author | Hernando Pericás, Francisco Javier |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions |
dc.date.accessioned | 2015-04-17T17:23:19Z |
dc.date.available | 2015-04-17T17:23:19Z |
dc.date.created | 2014 |
dc.date.issued | 2014 |
dc.identifier.citation | Zewoudie, A.; Jordi Luque; Hernando, J. Jitter and Shimmer measurements for speaker diarization. A: Jornadas en Tecnología del Habla and III Iberian SLTech Workshop. "VII Jornadas en Tecnología del Habla and III Iberian SLTech Workshop: proceedings: November 19-21, 2014: Escuela de Ingeniería en Telecomunicación y Electrónica Universidad de Las Palmas de Gran Canaria: Las Palmas de Gran Canaria, Spain". Las Palmas de Gran Canaria: 2014, p. 21-30. |
dc.identifier.isbn | 978-84-617-2862-6 |
dc.identifier.uri | http://hdl.handle.net/2117/27438 |
dc.description.abstract | Jitter and shimmer voice quality features have been successfully used to characterize speaker voice traits and detect voice pathologies. Jitter and shimmer measure variations in the fundamental frequency and amplitude of speaker's voice, respectively. Due to their nature, they can be used to assess differences between speakers. In this paper, we investigate the usefulness of these voice quality features in the task of speaker diarization. The combination of voice quality features with the conventional spectral features, Mel-Frequency Cepstral Coefficients (MFCC), is addressed in the framework of Augmented Multiparty Interaction (AMI) corpus, a multi-party and spontaneous speech set of recordings. Both sets of features are independently modeled using mixture of Gaussians and fused together at the score likelihood level. The experiments carried out on the AMI corpus show that incorporating jitter and shimmer measurements to the baseline spectral features decreases the diarization error rate in most of the recordings. |
dc.format.extent | 10 p. |
dc.language.iso | eng |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Spain |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
dc.subject | Àrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic |
dc.subject.lcsh | Speech processing systems |
dc.subject.other | Speaker diarization |
dc.subject.other | Spectral features |
dc.subject.other | Jitter |
dc.subject.other | Shimmer |
dc.subject.other | Fusion |
dc.title | Jitter and Shimmer measurements for speaker diarization |
dc.type | Conference report |
dc.subject.lemac | Processament de la parla |
dc.contributor.group | Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
dc.description.peerreviewed | Peer Reviewed |
dc.rights.access | Open Access |
local.identifier.drac | 15429550 |
dc.description.version | Postprint (published version) |
local.citation.author | Zewoudie, A.; Jordi Luque; Hernando, J. |
local.citation.contributor | Jornadas en Tecnología del Habla and III Iberian SLTech Workshop |
local.citation.pubplace | Las Palmas de Gran Canaria |
local.citation.publicationName | VII Jornadas en Tecnología del Habla and III Iberian SLTech Workshop: proceedings: November 19-21, 2014: Escuela de Ingeniería en Telecomunicación y Electrónica Universidad de Las Palmas de Gran Canaria: Las Palmas de Gran Canaria, Spain |
local.citation.startingPage | 21 |
local.citation.endingPage | 30 |