|
E-prints UPC >
Altres >
Enviament des de DRAC >
Empreu aquest identificador per citar o enllaçar aquest ítem:
http://hdl.handle.net/2117/12429
|
| Citació: | Butko, T.; Nadeu, C. Detection of overlapped acoustic events using fusion of audio and video modalities. A: Jornadas en Tecnología del Habla and Iberian SLTech Workshop. "VI Jornadas en Tecnología del Habla and II Iberian SLTech Workshop". 2010, p. 165-168. |
| Títol: | Detection of overlapped acoustic events using fusion of audio and video modalities |
| Autor: | Butko, Taras ; Nadeu Camprubí, Climent  |
| Data: | 2010 |
| Tipus de document: | Conference report |
| Resum: | Acoustic event detection (AED) may help to describe acoustic scenes, and also contribute to improve the robustness of
speech technologies. Even if the number of considered events is not large, that detection becomes a difficult task in
scenarios where the AEs are produced rather spontaneously and often overlap in time with speech. In this work, fusion of audio and video information at either feature or decision level is performed, and the results are compared for different levels of signal overlaps. The best improvement with respect to an audio-only baseline system was obtained using the featurelevel fusion technique. Furthermore, a significant recognition rate improvement is observed where the AEs are overlapped with loud speech, mainly due to the fact that the video
modality remains unaffected by the interfering sound. |
| URI: | http://hdl.handle.net/2117/12429 |
| Versió de l'editor: | http://fala2010.uvigo.es/images/proceedings/index.html |
| Apareix a les col·leccions: | Altres. Enviament des de DRAC Departament de Teoria del Senyal i Comunicacions. Ponències/Comunicacions de congressos VEU - Grup de Tractament de la Parla. Ponències/Comunicacions de congressos
|
| Comparteix: |
|
Aquest ítem (excepte textos i imatges no creats per l'autor) està subjecte a una llicència de Creative Commons Llicència Creative Commons
|