Mostra el registre d'ítem simple

dc.contributor.authorButko, Taras
dc.contributor.authorCanton Ferrer, Cristian
dc.contributor.authorSegura Perales, Carlos
dc.contributor.authorGiró Nieto, Xavier
dc.contributor.authorNadeu Camprubí, Climent
dc.contributor.authorHernando Pericás, Francisco Javier
dc.contributor.authorCasas Pla, Josep Ramon
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned2011-10-23T09:26:48Z
dc.date.available2011-10-23T09:26:48Z
dc.date.created2011-03-15
dc.date.issued2011-03-15
dc.identifier.citationButko, T. [et al.]. Acoustic event detection based on feature-level fusion of audio and video modalities. "Eurasip journal on advances in signal processing", 15 Març 2011, vol. 2011, p. 1-11.
dc.identifier.issn1687-6172
dc.identifier.urihttp://hdl.handle.net/2117/13630
dc.descriptionResearch article
dc.description.abstractAcoustic event detection (AED) aims at determining the identity of sounds and their temporal position in audio signals. When applied to spontaneously generated acoustic events, AED based only on audio information shows a large amount of errors, which are mostly due to temporal overlaps. Actually, temporal overlaps accounted for more than 70% of errors in the realworld interactive seminar recordings used in CLEAR 2007 evaluations. In this paper, we improve the recognition rate of acoustic events using information from both audio and video modalities. First, the acoustic data are processed to obtain both a set of spectrotemporal features and the 3D localization coordinates of the sound source. Second, a number of features are extracted from video recordings by means of object detection, motion analysis, and multicamera person tracking to represent the visual counterpart of several acoustic events. A feature-level fusion strategy is used, and a parallel structure of binary HMM-based detectors is employed in our work. The experimental results show that information from both the microphone array and video cameras is useful to improve the detection rate of isolated as well as spontaneously generated acoustic events.
dc.format.extent11 p.
dc.language.isoeng
dc.publisherHINDAWI
dc.subjectÀrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic
dc.subjectÀrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la imatge i del senyal vídeo
dc.subject.lcshAcoustic event detection
dc.titleAcoustic event detection based on feature-level fusion of audio and video modalities
dc.typeArticle
dc.subject.lemacSenyal acústic -- Detecció
dc.contributor.groupUniversitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.contributor.groupUniversitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo
dc.identifier.doi10.1155/2011/485738
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://www.hindawi.com/journals/asp/2011/485738/
dc.rights.accessOpen Access
local.identifier.drac5391480
dc.description.versionPostprint (published version)
local.citation.authorButko, T.; Canton-Ferrer, C.; Segura, C.; Giro, X.; Nadeu, C.; Hernando, J.; Casas, J.
local.citation.publicationNameEurasip journal on advances in signal processing
local.citation.volume2011
local.citation.startingPage1
local.citation.endingPage11


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple