The detection of overlapping speech with prosodic features for speaker diarization
Document type: Conference paper
Access conditions: Restricted access per publisher policy
Overlapping speech accounts for a share of the errors produced by standard speaker diarization systems in meeting environments. We investigate a set of prosody-based long-term features as a potential complement to our overlap detection system, which relies on short-term spectral parameters. The most relevant features are selected in a two-step process: they are first evaluated and ranked according to the minimum-redundancy maximum-relevance (mRMR) criterion, and the optimal number of features is then determined by an iterative wrapper approach. We show that adding prosodic features decreases overlap detection error. Detected overlap segments are used in speaker diarization to recover missed speech by assigning multiple speaker labels and to increase the purity of speaker clusters.
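The two-step selection described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the candidate features have already been discretized, uses the difference form of mRMR (relevance minus mean redundancy), and stands in for training and scoring the overlap detector with a hypothetical `evaluate_error` callback. The feature names `a`, `b`, `c`, the toy data, and the error values are invented for illustration only.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Mutual information (in nats) between two discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum(
        (c / n) * math.log(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )


def mrmr_rank(features, labels):
    """Step 1: greedy mRMR ordering of all candidate features.

    `features` maps a feature name to its discretized values. Each round
    picks the feature with the highest relevance to the labels minus its
    mean redundancy with the features already ranked.
    """
    relevance = {
        name: mutual_information(values, labels)
        for name, values in features.items()
    }
    remaining = sorted(features)  # sorted for deterministic tie-breaking
    ranked = []
    while remaining:
        def score(name):
            if not ranked:
                return relevance[name]
            redundancy = sum(
                mutual_information(features[name], features[chosen])
                for chosen in ranked
            ) / len(ranked)
            return relevance[name] - redundancy

        best = max(remaining, key=score)
        ranked.append(best)
        remaining.remove(best)
    return ranked


def wrapper_select(ranked, evaluate_error):
    """Step 2: keep the top-k prefix of the ranking with the lowest error.

    `evaluate_error` is a hypothetical callback that would train and score
    the detector with the given extra features; an empty list is assumed
    to mean the baseline system without them.
    """
    best_k, best_err = 0, evaluate_error([])
    for k in range(1, len(ranked) + 1):
        err = evaluate_error(ranked[:k])
        if err < best_err:
            best_k, best_err = k, err
    return ranked[:best_k], best_err


# Toy data: "b" duplicates "a"; "c" is weaker but far less redundant.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
features = {
    "a": [0, 0, 0, 1, 1, 1, 1, 1],
    "b": [0, 0, 0, 1, 1, 1, 1, 1],
    "c": [0, 1, 0, 0, 1, 1, 0, 1],
}
ranked = mrmr_rank(features, labels)

# Invented error rates indexed by the number of added features.
toy_errors = {0: 0.30, 1: 0.25, 2: 0.22, 3: 0.24}
selected, error = wrapper_select(ranked, lambda subset: toy_errors[len(subset)])
```

In this toy run the duplicate feature `b` is ranked last despite being as relevant as `a`, which is the redundancy penalty at work, and the wrapper step stops at the two-feature prefix because adding a third raises the invented error.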
Citation: Zelenak, M.; Hernando, J. The detection of overlapping speech with prosodic features for speaker diarization. In: European Conference on Speech Communication and Technology. "Proceedings of Interspeech 2011: spoken language processing for all : 15th August 2011 : Florence, Italy". Florence: 2011, p. 1041-1044.