The detection of overlapping speech with prosodic features for speaker diarization
Tipo de documentoComunicación de congreso
Fecha de publicación2011
Condiciones de accesoAcceso restringido por política de la editorial
Overlapping speech is responsible for a certain amount of errors produced by tandard speaker diarization systems in meeting environment. We are investigating a set of prosody-based long-term features as a potential complement to our overlap detection system relying on short-term spectral parameters. The most relevant features are selected in a two-step process. They are firstly evaluated and sorted according to mRMR criterion and then the optimal number is determined by iterative wrapper approach. We show that the addition of prosodic features decreased overlap detection error. Detected overlap segments are used in speaker diarization to recover missed speech by assigning multiple speaker labels and to increase the purity of speaker clusters.
CitaciónZelenak, M.; Hernando, J. The detection of overlapping speech with prosodic features for speaker diarization. A: European Conference on Speech Communication and Technology. "Proceedings of Interspeech 2011: spoken language processing for all : 15th August 2011 : Florence, Italy". Florencia: 2011, p. 1041-1044.