Show simple item record

dc.contributor.authorZelenak, Martin
dc.contributor.authorHernando Pericás, Francisco Javier
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned2013-03-01T11:47:44Z
dc.date.available2013-03-01T11:47:44Z
dc.date.created2012
dc.date.issued2012
dc.identifier.citationZelenák, M.; Hernando, J. Detection and handling of overlapping speech for speaker diarization. A: Iberspeech. "IBERSPEECH 2012". Madrid: 2012, p. 460-469.
dc.identifier.urihttp://hdl.handle.net/2117/18033
dc.description.abstractThis thesis concerns the detection of overlapping speech segments and its further application for the improvement of speaker diarization performance. We propose the use of three spatial cross-correlation-based parameters for overlap detection on distant microphone channel data. Spatial features from dierent microphone pairs are fused by means of principal component analysis or by an approach involving a multilayer perceptron. In addition, we investigate the possibility of employing long-term prosodic information. The most suitable subset of candidate prosodic features is determined by a two-step mRMR feature selection algorithm. For segments including detected overlapping speech the speaker diarization system picks a second speaker label, and such segments are also discarded from the model training. The proposed overlap labeling technique is integrated in the Viterbi-decoding part of the diarization algorithm.
dc.format.extent10 p.
dc.language.isoeng
dc.subjectÀrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic
dc.subject.lcshAutomatic speech recognition
dc.titleDetection and handling of overlapping speech for speaker diarization
dc.typeConference report
dc.subject.lemacReconeixement automàtic de la parla
dc.contributor.groupUniversitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://iberspeech2012.ii.uam.es/IberSPEECH2012_OnlineProceedings.pdf
dc.rights.accessOpen Access
drac.iddocument11052685
dc.description.versionPostprint (published version)
upcommons.citation.authorZelenák, M.; Hernando, J.
upcommons.citation.contributorIberspeech
upcommons.citation.pubplaceMadrid
upcommons.citation.publishedtrue
upcommons.citation.publicationNameIBERSPEECH 2012
upcommons.citation.startingPage460
upcommons.citation.endingPage469


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder