Detection and handling of overlapping speech for speaker diarization
dc.contributor.author | Zelenak, Martin |
dc.contributor.author | Hernando Pericás, Francisco Javier |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions |
dc.date.accessioned | 2013-03-01T11:47:44Z |
dc.date.available | 2013-03-01T11:47:44Z |
dc.date.created | 2012 |
dc.date.issued | 2012 |
dc.identifier.citation | Zelenák, M.; Hernando, J. Detection and handling of overlapping speech for speaker diarization. A: Iberspeech. "IBERSPEECH 2012". Madrid: 2012, p. 460-469. |
dc.identifier.uri | http://hdl.handle.net/2117/18033 |
dc.description.abstract | This thesis concerns the detection of overlapping speech segments and its further application for the improvement of speaker diarization performance. We propose the use of three spatial cross-correlation-based parameters for overlap detection on distant microphone channel data. Spatial features from different microphone pairs are fused by means of principal component analysis or by an approach involving a multilayer perceptron. In addition, we investigate the possibility of employing long-term prosodic information. The most suitable subset of candidate prosodic features is determined by a two-step mRMR feature selection algorithm. For segments including detected overlapping speech the speaker diarization system picks a second speaker label, and such segments are also discarded from the model training. The proposed overlap labeling technique is integrated in the Viterbi-decoding part of the diarization algorithm. |
dc.format.extent | 10 p. |
dc.language.iso | eng |
dc.subject | UPC subject areas::Telecommunications engineering::Signal processing::Speech and acoustic signal processing |
dc.subject.lcsh | Automatic speech recognition |
dc.title | Detection and handling of overlapping speech for speaker diarization |
dc.type | Conference report |
dc.subject.lemac | Automatic speech recognition |
dc.contributor.group | Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | http://iberspeech2012.ii.uam.es/IberSPEECH2012_OnlineProceedings.pdf |
dc.rights.access | Open Access |
local.identifier.drac | 11052685 |
dc.description.version | Postprint (published version) |
local.citation.author | Zelenák, M.; Hernando, J. |
local.citation.contributor | Iberspeech |
local.citation.pubplace | Madrid |
local.citation.publicationName | IBERSPEECH 2012 |
local.citation.startingPage | 460 |
local.citation.endingPage | 469 |