Now showing items 91-110 of 133

    • Self attention networks in speaker recognition 

      Safari, Pooyan; India Massana, Miquel Àngel; Hernando Pericás, Francisco Javier (Multidisciplinary Digital Publishing Institute, 2023-05-24)
      Article
      Open access
      Recently, there has been a significant surge of interest in Self-Attention Networks (SANs) based on the Transformer architecture. This can be attributed to their notable ability for parallelization and their impressive ...
    • Self multi-head attention for speaker recognition 

      India Massana, Miquel Àngel; Safari, Pooyan; Hernando Pericás, Francisco Javier (International Speech Communication Association (ISCA), 2019)
      Conference lecture
      Open access
      Most state-of-the-art Deep Learning (DL) approaches for speaker recognition work on a short utterance level. Given the speech signal, these algorithms extract a sequence of speaker embeddings from short segments and those are ...
    • Self-attention encoding and pooling for speaker recognition 

      Safari, Pooyan; India Massana, Miquel Àngel; Hernando Pericás, Francisco Javier (International Speech Communication Association (ISCA), 2020)
      Conference report
      Open access
      The computing power of mobile devices limits the end-user applications in terms of storage size, processing, memory and energy consumption. These limitations motivate researchers for the design of more efficient deep models. ...
    • Self-supervised deep learning approaches to speaker recognition: A Ph.D. Thesis overview 

      Khan, Umair; Hernando Pericás, Francisco Javier (International Speech Communication Association (ISCA), 2021)
      Conference lecture
      Open access
      Recent advances in Deep Learning (DL) for speaker recognition have improved the performance but are constrained to the need of labels for the background data, which is difficult in practice. In i-vector based speaker ...
    • Short- and long-term speech features for hybrid HMM-i-vector based speaker diarization system 

      Zewoudie, Abraham Woubie; Luque, Jordi; Hernando Pericás, Francisco Javier (2016)
      Conference lecture
      Open access
      i-vectors have been successfully applied over the last years in speaker recognition tasks. This work aims at assessing the suitability of i-vector modeling within the frame of speaker diarization task. In such context, a ...
    • Simultaneous speech detection with spatial features for speaker diarization 

      Zelenak, Martin; Segura Perales, Carlos; Luque, Jordi; Hernando Pericás, Francisco Javier (2012-02)
      Article
      Restricted access (publisher's policy)
      Simultaneous speech poses a challenging problem for conventional speaker diarization systems. In meeting data, a substantial amount of missed speech error is due to speaker overlaps, since usually only one speaker label ...
    • Some fast higher order AR estimation techniques applied to parametric Wiener filtering 

      Salavedra Molí, Josep; Masgrau Gómez, Enrique José; Moreno Bilbao, M. Asunción; Estarellas, J; Hernando Pericás, Francisco Javier (2004)
      Conference report
      Open access
      Some Speech Enhancement algorithms based on the iterative Wiener filtering method due to Lim-Oppenheim [2] are presented. In the original Lim-Oppenheim algorithm, speech AR estimation is carried out using classic second-order ...
    • Speaker characterization by means of attention pooling 

      Costa, Federico; India Massana, Miquel Àngel; Hernando Pericás, Francisco Javier (International Speech Communication Association (ISCA), 2022)
      Conference lecture
      Open access
      State-of-the-art Deep Learning systems for speaker verification are commonly based on speaker embedding extractors. These architectures are usually composed of a feature extractor front-end together with a pooling layer ...
    • Speaker diarization of broadcast news in Albayzin 2010 Evaluation Campaign 

      Zelenak, Martin; Schulz, Henrik; Hernando Pericás, Francisco Javier (2012-07-31)
      Article
      Open access
      In this article, we present the evaluation results for the task of speaker diarization of broadcast news, which was part of the Albayzin 2010 evaluation campaign of language and speech technologies. The evaluation data ...
    • Speaker identification in noisy conditions using linear prediction of one-sided autocorrelation sequence 

      Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent; Villagrasa, C; Monte Moreno, Enrique (2004)
      Conference report
      Open access
      The OSALPC (One-Sided Autocorrelation Linear Predictive Coding) representation of the speech signal has shown to be attractive for speech recognition because of its simplicity and its high recognition performance with ...
    • Speaker orientation estimation based on hybridation of GCC-PHAT and HLBR 

      Segura Perales, Carlos; Abad, Alberto; Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (2008)
      Conference report
      Open access
      This paper presents a novel approach to speaker orientation estimation in a SmartRoom environment equipped with multiple microphones. The ratio between the high and low band energies (HLBR) received at each microphone ...
    • Speaker recognition by means of restricted Boltzmann machine adaptation 

      Safari, Pooyan; Ghahabi Esfahani, Omid; Hernando Pericás, Francisco Javier (Universidad Autónoma de Madrid, 2016)
      Conference lecture
      Open access
      Restricted Boltzmann Machines (RBMs) have shown success in speaker recognition. In this paper, RBMs are investigated in a framework comprising a universal model training and model adaptation. Taking advantage of RBM ...
    • Speaker recognition using frequency filtered spectral energies 

      Hernando Pericás, Francisco Javier (FONDAZIONE UGO BORDONI, 1999)
      Conference report
      Open access
      The spectral parameters that result from filtering the frequency sequence of log mel-scaled filter-bank energies with a simple first or second order FIR filter have proved to be an efficient speech representation in ...
    • Speaker verification on the Polycost database using frequency filtered spectral energies 

      Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (1998)
      Conference report
      Open access
      The spectral parameters that result from filtering the frequency sequence of log mel-scaled filter-bank energies with a first or second order FIR filter have proved to be competitive for speech recognition. Recently, the ...
    • Speech recognition and enhancement using some robust HOS-based AR estimation techniques 

      Salavedra Molí, Josep; Hernando Pericás, Francisco Javier; Masgrau Gómez, Enrique José; Moreno Bilbao, M. Asunción (Publicacions UPC, 1995)
      Conference report
      Open access
    • Speech recognition in a noisy car environment based on LP of the one-sided autocorrelation sequence and robust similarity measuring techniques 

      Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent; Mariño Acebal, José Bernardo (1997-02)
      Article
      Restricted access (publisher's policy)
      The performance of the existing speech recognition systems degrades rapidly in the presence of background noise. A novel representation of the speech signal, which is based on Linear Prediction of the One-Sided Autocorrelation ...
    • Speech recognition in noisy car environment based on OSALPC representation and robust similarity measuring techniques 

      Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (1994)
      Conference report
      Open access
      The performance of the existing speech recognition systems degrades rapidly in the presence of background noise. The OSALPC (one-sided autocorrelation linear predictive coding) representation of the speech signal has shown ...
    • Tecnicas de modelado AR robusto de la señal de voz para reconocimiento del habla en ambientes ruidosos 

      Hernando Pericás, Francisco Javier; Riu, D.; Nadeu Camprubí, Climent (URSI, 1992)
      Conference report
      Open access
    • Text independent speaker identification on noisy environments by means of self organizing maps 

      Monte Moreno, Enrique; Hernando Pericás, Francisco Javier (Institute of Electrical and Electronics Engineers (IEEE), 1996)
      Conference report
      Open access
      We propose an architecture for speaker recognition. This architecture is independent of the text, robust with the presence of noise, and is based on self organizing maps (SOM) (T. Kohonen, 1984). We compare the performance ...
    • Text independent speaker identification on noisy environments by means of self organizing maps 

      Enric, Monte; Monte Moreno, Enrique; Adolf, A; Hernando Pericás, Francisco Javier (H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE, 1996)
      Conference report
      Open access
      In this paper we propose a new architecture for speaker recognition. This architecture is independent of the text, robust with the presence of noise, and is based on the Self Organizing Maps (SOM) [1]. We compare the ...