Mostra el registre d'ítem simple

dc.contributor.authorCanton Ferrer, Cristian
dc.contributor.authorButko, Taras
dc.contributor.authorSegura, C.
dc.contributor.authorGiró Nieto, Xavier
dc.contributor.authorNadeu Camprubí, Climent
dc.contributor.authorHernando Pericás, Francisco Javier
dc.contributor.authorCasas Pla, Josep Ramon
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned2014-07-31T08:41:18Z
dc.date.created2009
dc.date.issued2009
dc.identifier.citationCanton, C. [et al.]. Audiovisual event detection towards scene understanding. A: IEEE Conference on Computer Vision and Pattern Recognition. "2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition workshops: CVPR workshops 2009: Miami Beach, Florida, USA: 20-25 June 2009". Institute of Electrical and Electronics Engineers (IEEE), 2009, p. 840-847.
dc.identifier.isbn978-1-4244-3994-2
dc.identifier.urihttp://hdl.handle.net/2117/23653
dc.description.abstractAcoustic events produced in meeting environments may contain useful information for perceptually aware interfaces and multimodal behavior analysis. In this paper, a system to detect and recognize these events from a multimodal perspective is presented combining information from multiple cameras and microphones. First, spectral and temporal features are extracted from a single audio channel and spatial localization is achieved by exploiting cross-correlation among microphone arrays. Second, several video cues obtained from multiperson tracking, motion analysis, face recognition, and object detection provide the visual counterpart of the acoustic events to be detected. A multimodal data fusion at score level is carried out using two approaches: weighted mean average and fuzzy integral. Finally, a multimodal database containing a rich variety of acoustic events has been recorded including manual annotations of the data. A set of metrics allow assessing the performance of the presented algorithms. This dataset is made publicly available for research purposes.
dc.format.extent8 p.
dc.language.isoeng
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.subjectÀrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic
dc.subjectÀrees temàtiques de la UPC::Informàtica
dc.subject.lcshHuman face recognition (Computer science)
dc.subject.otherAudio signal processing
dc.subject.otherFace recognition
dc.subject.otherMotion estimation
dc.subject.otherObject detection
dc.subject.otherSensor fusion
dc.subject.otherTransforms
dc.subject.otherVideo signal processing
dc.titleAudiovisual event detection towards scene understanding
dc.typeConference report
dc.subject.lemacReconeixement facial (Informàtica)
dc.contributor.groupUniversitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo
dc.contributor.groupUniversitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.identifier.doi10.1109/CVPRW.2009.5204264
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=05204264
dc.rights.accessRestricted access - publisher's policy
local.identifier.drac2416071
dc.description.versionPostprint (published version)
dc.date.lift10000-01-01
local.citation.authorCanton, C.; Butko, T.; Segura, C.; Giro, X.; Nadeu, C.; Hernando, J.; Casas, J.
local.citation.contributorIEEE Conference on Computer Vision and Pattern Recognition
local.citation.publicationName2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition workshops: CVPR workshops 2009: Miami Beach, Florida, USA: 20-25 June 2009
local.citation.startingPage840
local.citation.endingPage847


Fitxers d'aquest items

Imatge en miniatura

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple