Audio segmentation of broadcast news in the Albayzin-2010 evaluation: overview, results, and discussion

View/Open
Document typeArticle
Defense date2011-06-17
PublisherHINDAWI
Rights accessOpen Access
Abstract
Recently, audio segmentation has attracted research interest because of its usefulness in several applications like audio indexing and retrieval, subtitling, monitoring of acoustic scenes, etc. Moreover, a previous audio segmentation stage may be useful to improve the robustness of speech technologies like automatic speech
recognition and speaker diarization. In this article, we present the evaluation of broadcast news audio segmentation systems carried out in the context of the Albayzín-2010 evaluation campaign. That evaluation consisted of segmenting audio from the 3/24 Catalan TV channel into five acoustic classes: music, speech, speech over music, speech over noise, and the other. The evaluation results displayed the difficulty of this segmentation task. In this article, after presenting the database and metric, as well as the feature extraction methods and
segmentation techniques used by the submitted systems, the experimental results are analyzed and compared, with the aim of gaining an insight into the proposed solutions, and looking for directions which are promising.
CitationButko, T.; Nadeu, C. Audio segmentation of broadcast news in the Albayzin-2010 evaluation: overview, results, and discussion. "EURASIP Journal on Audio, Speech, and Music Processing", 17 Juny 2011, vol. 2011, núm. 1, p. 1-10.
ISSN1687-4714
Publisher versionhttp://asmp.eurasipjournals.com/content/2011/1/1
Files | Description | Size | Format | View |
---|---|---|---|---|
1687-4722-2011-1.pdf | 387,6Kb | View/Open |
All rights reserved. This work is protected by the corresponding intellectual and industrial
property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public
communication or transformation of this work are prohibited without permission of the copyright holder