Show simple item record

dc.contributor.author: Zelenak, Martin
dc.contributor.author: Hernando Pericás, Francisco Javier
dc.contributor.other: Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned: 2013-03-01T11:47:44Z
dc.date.available: 2013-03-01T11:47:44Z
dc.date.created: 2012
dc.date.issued: 2012
dc.identifier.citation: Zelenák, M.; Hernando, J. Detection and handling of overlapping speech for speaker diarization. In: Iberspeech. "IBERSPEECH 2012". Madrid: 2012, p. 460-469.
dc.identifier.uri: http://hdl.handle.net/2117/18033
dc.description.abstract: This thesis concerns the detection of overlapping speech segments and its further application for the improvement of speaker diarization performance. We propose the use of three spatial cross-correlation-based parameters for overlap detection on distant microphone channel data. Spatial features from different microphone pairs are fused by means of principal component analysis or by an approach involving a multilayer perceptron. In addition, we investigate the possibility of employing long-term prosodic information. The most suitable subset of candidate prosodic features is determined by a two-step mRMR feature selection algorithm. For segments that include detected overlapping speech, the speaker diarization system picks a second speaker label, and such segments are also discarded from the model training. The proposed overlap labeling technique is integrated in the Viterbi-decoding part of the diarization algorithm.
dc.format.extent: 10 p.
dc.language.iso: eng
dc.subject: UPC subject areas::Telecommunications engineering::Signal processing::Speech and acoustic signal processing
dc.subject.lcsh: Automatic speech recognition
dc.title: Detection and handling of overlapping speech for speaker diarization
dc.type: Conference report
dc.subject.lemac: Reconeixement automàtic de la parla (Automatic speech recognition)
dc.contributor.group: Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.description.peerreviewed: Peer Reviewed
dc.relation.publisherversion: http://iberspeech2012.ii.uam.es/IberSPEECH2012_OnlineProceedings.pdf
dc.rights.access: Open Access
local.identifier.drac: 11052685
dc.description.version: Postprint (published version)
local.citation.author: Zelenák, M.; Hernando, J.
local.citation.contributor: Iberspeech
local.citation.pubplace: Madrid
local.citation.publicationName: IBERSPEECH 2012
local.citation.startingPage: 460
local.citation.endingPage: 469


Files in this item


This item appears in the following collections
