A multilingual corpus for rich audio-visual scene description in a meeting-room environment
Document typeConference report
PublisherACM Press. Association for Computing Machinery
Rights accessRestricted access - publisher's policy
In this paper, we present a multilingual database specifically designed to develop technologies for rich audio-visual scene description in meeting-room environments. Part of that database includes the already existing CHIL audio-visual recordings, whose annotations have been extended. A relevant objective in the new recorded sessions was to include situations in which the semantic content can not be extracted from a single modality. The presented database, that includes five hours of rather spontaneously generated scientific presentations, was manually annotated using standard or previously reported annotation schemes, and will be publicly available for the research purposes.
CitationButko, T.; Nadeu, C.; Moreno, M. A multilingual corpus for rich audio-visual scene description in a meeting-room environment. A: ICMI Workshop on Multimodal Corpora For Machine Learning. "ICMI Workshop on Multimodal Corpora for Machine Learning : Taking Stock and Roadmapping the Future". Alacant: ACM Press. Association for Computing Machinery, 2011, p. 1-6.