Rights accessRestricted access - publisher's policy
Overlapping speech is responsible for a certain amount of errors produced by tandard speaker diarization systems in meeting environment. We are investigating a set of prosody-based long-term features as a potential complement to our overlap detection
system relying on short-term spectral parameters. The most relevant features are selected in a two-step process. They
are firstly evaluated and sorted according to mRMR criterion and then the optimal number is determined by iterative wrapper
approach. We show that the addition of prosodic features decreased overlap detection error. Detected overlap segments are used in speaker diarization to recover missed speech by assigning multiple speaker labels and to increase the purity of speaker clusters.
CitationZelenak, M.; Hernando, J. The detection of overlapping speech with prosodic features for speaker diarization. A: European Conference on Speech Communication and Technology. "Proceedings of Interspeech 2011: spoken language processing for all : 15th August 2011 : Florence, Italy". Florencia: 2011, p. 1041-1044.
All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder. If you wish to make any use of the work not provided for in the law, please contact: firstname.lastname@example.org