Improving detection of acoustic events using audiovisual data and feature level fusion

Butko, Taras; Canton Ferrer, Cristian; Segura, C.; Giró Nieto, Xavier; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier; Casas Pla, Josep Ramon

Document type: Conference proceedings paper

Publication date: 2009

Access conditions: Open access

All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to existing legal exemptions, its reproduction, distribution, public communication or transformation is prohibited without the authorization of the rights holder.

Abstract

The detection of acoustic events (AEs) that occur naturally in a meeting room may help to describe the human and social activity taking place in it. When applied to spontaneous recordings, detection of AEs from audio information alone produces a large number of errors, mostly due to temporal overlapping of sounds. In this paper, a system to detect and recognize AEs using both audio and video information is presented. A feature-level fusion strategy is used, and the structure of the HMM-GMM based system considers each class separately and uses a one-against-all strategy for training. Experimental AED results with a new and rather spontaneous dataset are presented, which show the advantage of the proposed approach.
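As a rough illustration of the two ideas named in the abstract, the sketch below concatenates per-frame audio and video feature vectors (feature-level fusion) and builds one binary target sequence per class (one-against-all). It is not the authors' implementation: the function names, the class names `class_a` and `class_b`, the feature dimensions and the toy values are all assumptions made for this example, and no HMM-GMM training is shown.

```python
from typing import Dict, List, Sequence, Set


def fuse_features(audio: Sequence[Sequence[float]],
                  video: Sequence[Sequence[float]]) -> List[List[float]]:
    """Concatenate audio and video feature vectors frame by frame."""
    if len(audio) != len(video):
        raise ValueError("audio and video must have the same number of frames")
    return [list(a) + list(v) for a, v in zip(audio, video)]


def one_against_all_targets(frame_labels: Sequence[Set[str]]) -> Dict[str, List[int]]:
    """For each class, mark frames containing that class as 1 and all others as 0."""
    classes = sorted(set().union(*frame_labels))
    return {c: [1 if c in labels else 0 for labels in frame_labels]
            for c in classes}


# Toy data (hypothetical): 3 frames, 2 audio features and 1 video feature each.
audio_frames = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]
video_frames = [[1.0], [0.0], [1.0]]
# The middle frame carries two labels to mimic temporally overlapping events.
frame_labels = [{"class_a"}, {"class_a", "class_b"}, {"class_b"}]

fused = fuse_features(audio_frames, video_frames)
targets = one_against_all_targets(frame_labels)
```

Under these assumptions, each class gets its own target sequence over the same fused frames, so a frame with overlapping events can be positive for more than one per-class detector.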

Citation: Butko, T., Canton, C., Segura, C., Giro, X., Nadeu, C., Hernando, J., Casas, J. Improving detection of acoustic events using audiovisual data and feature level fusion. In: Annual Conference of the International Speech Communication Association. "ISCA-INST Speech Communication Association". 2009, p. 1147-1150.

URI: http://hdl.handle.net/2117/85340

ISBN: 978-1-61567-692-7

Publisher version: http://www.isca-speech.org/archive/interspeech_2009/i09_1147.html

Files	Description	Size	Format	View
91.pdf	Article	228.1 KB	PDF	View/Open

UPCommons. Open knowledge portal of the UPC
