Now showing items 21-40 of 113

  • Comportamiento de la transformacion bilineal de frecuencias en reconocimiento de habla ruidosa 

    Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent; Riu, D. (. AERFAI, 1992)
    Conference report
    Open Access
  • Corpus selection 

    Adda, Gilles; Barras, Claude; Kernal Ekenel, Hazim; Morros Rubió, Josep Ramon; Hernando Pericás, Francisco Javier (2013-03-31)
    External research report
    Open Access
    Entregable del proyecto Collaborative Annotation of multi-MOdal, MultI-Lingual and multi-mEdia documents. This document describes the different corpora that will be used during the Camomile project
  • Deep belief networks for i-vector based speaker recognition 

    Ghahabi Esfahani, Omid; Hernando Pericás, Francisco Javier (Institute of Electrical and Electronics Engineers (IEEE), 2014)
    Conference report
    Restricted access - publisher's policy
    The use of Deep Belief Networks (DBNs) is proposed in this paper to model discriminatively target and impostor i-vectors in a speaker verification task. The authors propose to adapt the network parameters of each speaker ...
  • Deep learning backend for single and multisession i-vector speaker recognition 

    Ghahabi, Omid; Hernando Pericás, Francisco Javier (2017-04-01)
    Article
    Open Access
    The lack of labeled background data makes a big performance gap between cosine and Probabilistic Linear Discriminant Analysis (PLDA) scoring baseline techniques for i-vectors in speaker recognition. Although there are some ...
  • Deep neural networks for i-vector language identification of short utterances in cars 

    Ghahabi Esfahani, Omid; Bonafonte Cávez, Antonio; Hernando Pericás, Francisco Javier; Moreno Bilbao, M. Asunción (International Speech Communication Association (ISCA), 2016)
    Conference report
    Restricted access - publisher's policy
    This paper is focused on the application of the Language Identification (LID) technology for intelligent vehicles. We cope with short sentences or words spoken in moving cars in four languages: English, Spanish, German, ...
  • Detection and handling of overlapping speech for speaker diarization 

    Zelenak, Martin; Hernando Pericás, Francisco Javier (2012)
    Conference report
    Open Access
    This thesis concerns the detection of overlapping speech segments and its further application for the improvement of speaker diarization performance. We propose the use of three spatial cross-correlation-based parameters ...
  • Discriminación robusta de locutores 

    Hernando Pericás, Francisco Javier (1996)
    Conference report
    Open Access
  • Discriminative weighting of dynamic feautres in continuous-density hidden Markov models for word recognition 

    Hernando Pericás, Francisco Javier (1995)
    Conference report
    Open Access
    Speech dynamic features, which provide smoothed estimates of the derivatives of the spectral parameter trajectories in the current frame, are routinely used in current speech recognition systems in combination with short-term ...
  • Dynamic time warping applied to detection of confusable word pairs in automatic speech recognition 

    Anguita Ortega, Jan; Hernando Pericás, Francisco Javier (Escola Tècnica Superior d'Enginyers de Telecomunicació de Barcelona, 2005)
    Article
    Open Access
    In this paper we present a rnethod to predict if two words are likely to be confused by an Autornatic SpeechRecognition (ASR) systern. This method is based on the c1assical Dynamic Time Warping (DTW) technique. This ...
  • Esquema unificado de parametrización de la señal de voz en reconocimiento del habla 

    Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent; Vallverdú Bayés, Sisco (Universidad de Valladolid, 1995)
    Conference report
    Open Access
    A correct choice of voice signal modeling methods is essential to obtain good results in automatic speech recognition. In this paper, we have proposed a unified view of the speech parametrization stage, in which conventional ...
  • Estudio comparativo y nuevas propuestas de tecnicas de parametrizacion de la señal de voz para el reconocimiento del habla 

    Clot, J; Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (1994)
    Conference report
    Open Access
    A correct choice of voice signal modeling method is essential to obtain good results in automatic speech recogniton. In this paper, a comparative study betwen two speech signal models, Linear Prediction Coeficients and ...
  • Feature classification by means of Deep Belief Networks for speaker recognition 

    Safari, Pooyan; Ghahabi, Omid; Hernando Pericás, Francisco Javier (Institute of Electrical and Electronics Engineers (IEEE), 2015)
    Conference report
    Restricted access - publisher's policy
    In this paper, we propose to discriminatively model target and impostor spectral features using Deep Belief Networks (DBNs) for speaker recognition. In the feature level, the number of impostor samples is considerably ...
  • Frequency and time filtering of filter-bank energies for HMM speech recognition 

    Nadeu Camprubí, Climent; Mariño Acebal, José Bernardo; Hernando Pericás, Francisco Javier; Nogueiras Rodríguez, Albino (H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE, 1996)
    Conference lecture
    Restricted access - publisher's policy
    In speech recognition, a discriminative frequency weighting can be achieved by decorrelating the frequency sequence of log mel-scaled filter-bank energies with a computationally inexpensive filter. We show how the spectral ...
  • From features to speaker vectors by means of restricted Boltzmann machine adaptation 

    Safari, Pooyan; Ghahabi, Omid; Hernando Pericás, Francisco Javier (2016)
    Conference lecture
    Open Access
    Restricted Boltzmann Machines (RBMs) have shown success in different stages of speaker recognition systems. In this paper, we propose a novel framework to produce a vector-based representation for each speaker, which will ...
  • GCC-PHAT based head orientation estimation 

    Segura, Carlos; Hernando Pericás, Francisco Javier (2012)
    Conference report
    Open Access
    This work presents a novel two-step algorithm to estimate the orientation of speakers in a smart-room environment equipped with microphone arrays. First the position of the speaker is estimated by the SRP-PHAT algorithm, ...
  • Global impostor selection for DBNs in multi-session i-vector speaker recognition 

    Ghahabi Esfahani, Omid; Hernando Pericás, Francisco Javier (2014-11-19)
    Article
    Restricted access - publisher's policy
    An effective global impostor selection method is proposed in this paper for discriminative Deep Belief Networks (DBN) in the context of a multi-session i-vector based speaker recognition. The proposed method is an ...
  • Improving detection of acoustic events using audiovisual data and feature level fusion 

    Butko, Taras; Canton Ferrer, Cristian; Segura, C.; Giró Nieto, Xavier; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier; Casas Pla, Josep Ramon (2009)
    Conference report
    Open Access
    The detection of the acoustic events (AEs) that are naturally produced in a meeting room may help to describe the human and social activity that takes place in it. When applied to spontaneous recordings, the detection ...
  • Improving i-Vector and PLDA based speaker clustering with long-term features 

    Woubie, Abraham; Luque, Jordi; Hernando Pericás, Francisco Javier (International Speech Communication Association (ISCA), 2016)
    Conference report
    Restricted access - publisher's policy
    i-vector modeling techniques have been successfully used for speaker clustering task recently. In this work, we propose the extraction of i-vectors from short-and long-term speech features, and the fusion of their PLDA ...
  • Improving the robustness of the usual fbe-based asr front-end 

    Nadeu Camprubí, Climent; Macho, D; Hernando Pericás, Francisco Javier (Mergablum, 2000)
    Conference report
    Open Access
    All speech recognition systems require some form of signal representation that parametrically models the temporal evolution of the spectral envelope. Current parameterizations involve, either explicitly or implicitly, ...
  • Informe proyecto SARAI 

    Hernando Pericás, Francisco Javier (2013-03-05)
    External research report
    Open Access