Now showing items 1-7 of 7

  • Acoustic event detection based on feature-level fusion of audio and video modalities 

    Butko, Taras; Canton Ferrer, Cristian; Segura Perales, Carlos; Giró Nieto, Xavier; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier; Casas Pla, Josep Ramon (HINDAWI, 2011-03-15)
    Article
    Open Access
    Acoustic event detection (AED) aims at determining the identity of sounds and their temporal position in audio signals. When applied to spontaneously generated acoustic events, AED based only on audio information shows a ...
  • Audiovisual head orientation estimation with particle filtering in multisensor scenarios 

    Canton Ferrer, Cristian; Segura Perales, Carlos; Casas Pla, Josep Ramon; Pardàs Feliu, Montse; Hernando Pericás, Francisco Javier (2008-01)
    Article
    Open Access
    This article presents a multimodal approach to head pose estimation of individuals in environments equipped with multiple cameras and microphones, such as SmartRooms or automatic video conferencing. Determining the individuals ...
  • Multimodal identification and localization of users in a smart environment 

    Salah, Albert Ali; Morros Rubió, Josep Ramon; Luque, Jordi; Segura Perales, Carlos; Hernando Pericás, Francisco Javier; Ambekar, Onkar; Schouten, Ben; Pauwels, Eric (2008-09)
    Article
    Open Access
    Detecting the location and identity of users is a first step in creating contextaware applications for technologically-endowed environments. We propose a system that makes use of motion detection, person tracking, face ...
  • Overlap detection for speaker diarization by fusing spectral and spatial features 

    Zelenak, Martin; Segura Perales, Carlos; Hernando Pericás, Francisco Javier (2010)
    Conference lecture
    Restricted access - publisher's policy
    A substantial portion of errors of the conventional speaker diarization systems on meeting data can be accounted to overlapped speech. This paper proposes the use of several spatial features to improve speech overlap ...
  • Simultaneous speech detection with spatial features for speaker diarization 

    Zelenak, Martin; Segura Perales, Carlos; Luque, Jordi; Hernando Pericás, Francisco Javier (2012-02)
    Article
    Restricted access - publisher's policy
    Simultaneous speech poses a challenging problem for conventional speaker diarization systems. In meeting data, a substantial amount of missed speech error is due to speaker overlaps, since usually only one speaker label ...
  • Speaker orientation estimation based on hybridation of GCC-PHAT and HLBR 

    Segura Perales, Carlos; Abad, Alberto; Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (2008)
    Conference report
    Open Access
    This paper presents a novel approach to speaker orientation estimation in a SmartRoom environment equipped with multiple microphones. The ratio between the high and low band energies (HLBR) received at each microphone ...
  • Two-source acoustic event detection and localization: online implementation in a smart-room 

    Butko, Taras; Gonzalez Pla, Fran; Segura Perales, Carlos; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier (2011)
    Conference lecture
    Open Access
    Real-time processing is a requirement for many practical signal processing applications. In this work we implemented online 2-source acoustic event detection and localization algorithms in a Smart-room, a closed space ...