PublisherACM Press. Association for Computing Machinery
Rights accessRestricted access - publisher's policy
In this paper, we present a multilingual database specifically designed to develop technologies for rich audio-visual scene
description in meeting-room environments. Part of that database includes the already existing CHIL audio-visual recordings, whose annotations have been extended. A relevant objective in the new recorded sessions was to include situations in which the semantic content can not be extracted from a single modality. The presented database, that includes five hours of rather spontaneously generated scientific presentations, was manually annotated using standard or previously reported annotation schemes, and will be publicly available for the research purposes.
CitationButko, T.; Nadeu, C.; Moreno, M. A multilingual corpus for rich audio-visual scene description in a meeting-room environment. A: ICMI Workshop on Multimodal Corpora For Machine Learning. "ICMI Workshop on Multimodal Corpora for Machine Learning : Taking Stock and Roadmapping the Future". Alacant: ACM Press. Association for Computing Machinery, 2011, p. 1-6.
All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder. If you wish to make any use of the work not provided for in the law, please contact: firstname.lastname@example.org