Audiovisual event detection towards scene understanding
Cite as:
hdl:2117/23653
Document type: Conference proceedings text
Publication date: 2009
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Access conditions: Restricted access per publisher policy
All rights reserved. This work is protected by intellectual and industrial property rights. Without prejudice to existing legal exemptions, its reproduction, distribution, public communication, or transformation is prohibited without the authorization of the rights holder.
Abstract
Acoustic events produced in meeting environments may contain useful information for perceptually aware interfaces and multimodal behavior analysis. In this paper, a system to detect and recognize these events from a multimodal perspective is presented, combining information from multiple cameras and microphones. First, spectral and temporal features are extracted from a single audio channel, and spatial localization is achieved by exploiting cross-correlation among microphone arrays. Second, several video cues obtained from multiperson tracking, motion analysis, face recognition, and object detection provide the visual counterpart of the acoustic events to be detected. A multimodal data fusion at score level is carried out using two approaches: weighted mean average and fuzzy integral. Finally, a multimodal database containing a rich variety of acoustic events has been recorded, including manual annotations of the data. A set of metrics allows the performance of the presented algorithms to be assessed. This dataset is made publicly available for research purposes.
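The two score-level fusion approaches named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which fuzzy integral is used, so the Choquet integral is assumed here, and the function names, the source labels ("audio", "video"), the scores, the weights, and the fuzzy measure values are all hypothetical.

```python
def weighted_mean(scores, weights):
    """Weighted arithmetic mean of per-source scores.

    scores and weights are dicts keyed by source name (hypothetical labels).
    """
    total_weight = sum(weights[source] for source in scores)
    return sum(scores[source] * weights[source] for source in scores) / total_weight


def choquet_integral(scores, measure):
    """Choquet fuzzy integral of per-source scores (assumed variant).

    measure maps each frozenset of sources to a value in [0, 1]; it must be
    defined for every coalition visited below, with the full set mapped to 1.
    """
    ordered = sorted(scores, key=scores.get)  # sources by ascending score
    result = 0.0
    previous_score = 0.0
    for index, source in enumerate(ordered):
        # Coalition of all sources whose score is at least the current one.
        coalition = frozenset(ordered[index:])
        result += (scores[source] - previous_score) * measure[coalition]
        previous_score = scores[source]
    return result


# Hypothetical per-modality scores for one candidate event class.
example_scores = {"audio": 0.8, "video": 0.4}
example_weights = {"audio": 0.75, "video": 0.25}

# Hypothetical non-additive fuzzy measure over subsets of the sources.
example_measure = {
    frozenset(["audio"]): 0.7,
    frozenset(["video"]): 0.5,
    frozenset(["audio", "video"]): 1.0,
}

# An additive measure built from the weights, for comparison.
additive_measure = {
    frozenset(["audio"]): 0.75,
    frozenset(["video"]): 0.25,
    frozenset(["audio", "video"]): 1.0,
}

fused_mean = weighted_mean(example_scores, example_weights)
fused_choquet = choquet_integral(example_scores, example_measure)
fused_additive = choquet_integral(example_scores, additive_measure)
```

When the fuzzy measure is additive, the Choquet integral reduces to the weighted mean, so `fused_additive` matches `fused_mean` up to rounding. A non-additive measure lets the fusion account for interaction between sources, which a fixed set of weights cannot express.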
Citation: Canton, C. [et al.]. Audiovisual event detection towards scene understanding. In: IEEE Conference on Computer Vision and Pattern Recognition. "2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition workshops: CVPR workshops 2009: Miami Beach, Florida, USA: 20-25 June 2009". Institute of Electrical and Electronics Engineers (IEEE), 2009, p. 840-847.
ISBN: 978-1-4244-3994-2
Publisher version: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=05204264
Files | Description | Size | Format | View |
---|---|---|---|---|
90.pdf | Article | 547.7 KB | PDF | Restricted access |