Mostra el registre d'ítem simple

dc.contributor.authorZewoudie, Abraham Woubie
dc.contributor.authorLuque, Jordi
dc.contributor.authorHernando Pericás, Francisco Javier
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned2015-04-17T17:23:19Z
dc.date.available2015-04-17T17:23:19Z
dc.date.created2014
dc.date.issued2014
dc.identifier.citationZewoudie, A.; Jordi Luque; Hernando, J. Jitter and Shimmer measurements for speaker diarization. A: Jornadas en Tecnología del Habla and III Iberian SLTech Workshop. "VII Jornadas en Tecnología del Habla and III Iberian SLTech Workshop: proceedings: November 19-21, 2014: Escuela de Ingeniería en Telecomunicación y Electrónica Universidad de Las Palmas de Gran Canaria: Las Palmas de Gran Canaria, Spain". Las Palmas de Gran Canaria: 2014, p. 21-30.
dc.identifier.isbn978-84-617-2862-6
dc.identifier.urihttp://hdl.handle.net/2117/27438
dc.description.abstractJitter and shimmer voice quality features have been successfully used to characterize speaker voice traits and detect voice pathologies. Jitter and shimmer measure variations in the fundamental frequency and amplitude of speaker's voice, respectively. Due to their nature, they can be used to assess differences between speakers. In this paper, we investigate the usefulness of these voice quality features in the task of speaker diarization. The combination of voice quality features with the conventional spectral features, Mel-Frequency Cepstral Coefficients (MFCC), is addressed in the framework of Augmented Multiparty Interaction (AMI) corpus, a multi-party and spontaneous speech set of recordings. Both sets of features are independently modeled using mixture of Gaussians and fused together at the score likelihood level. The experiments carried out on the AMI corpus show that incorporating jitter and shimmer measurements to the baseline spectral features decreases the diarization error rate in most of the recordings.
dc.format.extent10 p.
dc.language.isoeng
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subjectÀrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic
dc.subject.lcshSpeech processing systems
dc.subject.otherSpeaker diarization
dc.subject.otherSpectral features
dc.subject.otherJitter
dc.subject.otherShimmer
dc.subject.otherFusion
dc.titleJitter and Shimmer measurements for speaker diarization
dc.typeConference report
dc.subject.lemacProcessament de la parla
dc.contributor.groupUniversitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.description.peerreviewedPeer Reviewed
dc.rights.accessOpen Access
local.identifier.drac15429550
dc.description.versionPostprint (published version)
local.citation.authorZewoudie, A.; Jordi Luque; Hernando, J.
local.citation.contributorJornadas en Tecnología del Habla and III Iberian SLTech Workshop
local.citation.pubplaceLas Palmas de Gran Canaria
local.citation.publicationNameVII Jornadas en Tecnología del Habla and III Iberian SLTech Workshop: proceedings: November 19-21, 2014: Escuela de Ingeniería en Telecomunicación y Electrónica Universidad de Las Palmas de Gran Canaria: Las Palmas de Gran Canaria, Spain
local.citation.startingPage21
local.citation.endingPage30


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple