Audiovisual head orientation estimation with particle filtering in multisensor scenarios

Canton Ferrer, Cristian; Segura Perales, Carlos; Casas Pla, Josep Ramon; Pardàs Feliu, Montse; Hernando Pericás, Francisco Javier

doi:10.1155/2008/276846

Visualitza/Obre

AudiovisualHead.pdf (2,113Mb)

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

Canton Ferrer, Cristian

Segura Perales, Carlos

Casas Pla, Josep Ramon

Pardàs Feliu, Montse

Hernando Pericás, Francisco Javier

Tipus de documentArticle

Data publicació2008-01

Condicions d'accésAccés obert

Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets

Abstract

This article presents a multimodal approach to head pose estimation of individuals in environments equipped with multiple cameras and microphones, such as SmartRooms or automatic video conferencing. Determining the individuals head orientation is the basis for many forms of more sophisticated interactions between humans and technical devices and can also be used for automatic sensor selection (camera, microphone) in communications or video surveillance systems. The use of particle filters as a unified framework for the estimation of the head orientation for both monomodal and multimodal cases is proposed. In video, we estimate head orientation from color information by exploiting spatial redundancy among cameras. Audio information is processed to estimate the direction of the voice produced by a speaker making use of the directivity characteristics of the head radiation pattern. Furthermore, two different particle filter multimodal information fusion schemes for combining the audio and video streams are analyzed in terms of accuracy and robustness. In the first one, fusion is performed at a decision level by combining each monomodal head pose estimation, while the second one uses a joint estimation system combining information at data level. Experimental results conducted over the CLEAR 2006 evaluation database are reported and the comparison of the proposed multimodal head pose estimation algorithms with the reference monomodal approaches proves the effectiveness of the proposed approach.

CitacióCanton, C. [et al.]. Audiovisual head orientation estimation with particle filtering in multisensor scenarios. "Eurasip journal on advances in signal processing", Gener 2008, vol. 2008, p. 1-12.

URIhttp://hdl.handle.net/2117/9466

DOI10.1155/2008/276846

ISSN1687-6172

Versió de l'editorhttp://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/276846

Col·leccions

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
AudiovisualHead.pdf		2,113Mb	PDF	Visualitza/Obre

UPCommons. Portal del coneixement obert de la UPC

Audiovisual head orientation estimation with particle filtering in multisensor scenarios

Visualitza/Obre

Explora