Exploració per autor "Hernando Pericás, Francisco Javier"
Ara es mostren els items 24-43 de 133
-
Deep belief networks for i-vector based speaker recognition
Ghahabi Esfahani, Omid; Hernando Pericás, Francisco Javier (Institute of Electrical and Electronics Engineers (IEEE), 2014)
Text en actes de congrés
Accés restringit per política de l'editorialThe use of Deep Belief Networks (DBNs) is proposed in this paper to model discriminatively target and impostor i-vectors in a speaker verification task. The authors propose to adapt the network parameters of each speaker ... -
Deep learning backend for single and multisession i-vector speaker recognition
Ghahabi Esfahani, Omid; Hernando Pericás, Francisco Javier (2017-04-01)
Article
Accés obertThe lack of labeled background data makes a big performance gap between cosine and Probabilistic Linear Discriminant Analysis (PLDA) scoring baseline techniques for i-vectors in speaker recognition. Although there are some ... -
Deep neural networks for i-vector language identification of short utterances in cars
Ghahabi Esfahani, Omid; Bonafonte Cávez, Antonio; Hernando Pericás, Francisco Javier; Moreno Bilbao, M. Asunción (International Speech Communication Association (ISCA), 2016)
Text en actes de congrés
Accés restringit per política de l'editorialThis paper is focused on the application of the Language Identification (LID) technology for intelligent vehicles. We cope with short sentences or words spoken in moving cars in four languages: English, Spanish, German, ... -
Detection and handling of overlapping speech for speaker diarization
Zelenak, Martin; Hernando Pericás, Francisco Javier (2012)
Text en actes de congrés
Accés obertThis thesis concerns the detection of overlapping speech segments and its further application for the improvement of speaker diarization performance. We propose the use of three spatial cross-correlation-based parameters ... -
Discriminación robusta de locutores
Hernando Pericás, Francisco Javier (1996)
Text en actes de congrés
Accés obert -
Discriminative weighting of dynamic feautres in continuous-density hidden Markov models for word recognition
Hernando Pericás, Francisco Javier (1995)
Text en actes de congrés
Accés obertSpeech dynamic features, which provide smoothed estimates of the derivatives of the spectral parameter trajectories in the current frame, are routinely used in current speech recognition systems in combination with short-term ... -
DNN speaker embeddings using autoencoder pre-training
Khan, Umair; Hernando Pericás, Francisco Javier (Institute of Electrical and Electronics Engineers (IEEE), 2019)
Comunicació de congrés
Accés restringit per política de l'editorialOver the last years, i-vectors have been the state-of-the-art approach in speaker recognition. Recent improvements in deep learning have increased the discriminative quality of i-vectors. However, deep learning architectures ... -
Double multi-head attention for speaker verification
India Massana, Miquel Àngel; Safari, Pooyan; Hernando Pericás, Francisco Javier (Institute of Electrical and Electronics Engineers (IEEE), 2021)
Text en actes de congrés
Accés obertMost state-of-the-art Deep Learning systems for text-independent speaker verification are based on speaker embedding extractors. These architectures are commonly composed of a feature extractor front-end together with a ... -
Dynamic time warping applied to detection of confusable word pairs in automatic speech recognition
Anguita Ortega, Jan; Hernando Pericás, Francisco Javier (Escola Tècnica Superior d'Enginyers de Telecomunicació de Barcelona, 2005)
Article
Accés obertIn this paper we present a rnethod to predict if two words are likely to be confused by an Autornatic SpeechRecognition (ASR) systern. This method is based on the c1assical Dynamic Time Warping (DTW) technique. This ... -
End-to-end transparent user identification using touchscreen biometrics
Krzeminski, Michal; Hernando Pericás, Francisco Javier (Universidad de Málaga, 2020)
Text en actes de congrés
Accés obertWe study the touchscreen data as behavioral biometrics. The goal was to create an end-to-end system that can transparently identify users using raw data from mobile devices. The touchscreen biometrics was researched only ... -
Esquema unificado de parametrización de la señal de voz en reconocimiento del habla
Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent; Vallverdú Bayés, Sisco (Universidad de Valladolid, 1995)
Text en actes de congrés
Accés obertA correct choice of voice signal modeling methods is essential to obtain good results in automatic speech recognition. In this paper, we have proposed a unified view of the speech parametrization stage, in which conventional ... -
Estudio comparativo y nuevas propuestas de tecnicas de parametrizacion de la señal de voz para el reconocimiento del habla
Clot, J; Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (1994)
Text en actes de congrés
Accés obertA correct choice of voice signal modeling method is essential to obtain good results in automatic speech recogniton. In this paper, a comparative study betwen two speech signal models, Linear Prediction Coeficients and ... -
Examen Final
Oliveras Vergés, Albert; Hernando Pericás, Francisco Javier (Universitat Politècnica de Catalunya, 2013-01-17)
Examen
Accés restringit a la comunitat UPC -
Examen Final
Oliveras Vergés, Albert; Mariño Acebal, José Bernardo; Hernando Pericás, Francisco Javier; Villares Piera, Nemesio Javier (Universitat Politècnica de Catalunya, 2014-06-10)
Examen
Accés restringit a la comunitat UPC -
Feature classification by means of Deep Belief Networks for speaker recognition
Safari, Pooyan; Ghahabi Esfahani, Omid; Hernando Pericás, Francisco Javier (Institute of Electrical and Electronics Engineers (IEEE), 2015)
Text en actes de congrés
Accés restringit per política de l'editorialIn this paper, we propose to discriminatively model target and impostor spectral features using Deep Belief Networks (DBNs) for speaker recognition. In the feature level, the number of impostor samples is considerably ... -
Frequency and time filtering of filter-bank energies for HMM speech recognition
Nadeu Camprubí, Climent; Mariño Acebal, José Bernardo; Hernando Pericás, Francisco Javier; Nogueiras Rodríguez, Albino (H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE, 1996)
Comunicació de congrés
Accés restringit per política de l'editorialIn speech recognition, a discriminative frequency weighting can be achieved by decorrelating the frequency sequence of log mel-scaled filter-bank energies with a computationally inexpensive filter. We show how the spectral ... -
From features to speaker vectors by means of restricted Boltzmann machine adaptation
Safari, Pooyan; Ghahabi Esfahani, Omid; Hernando Pericás, Francisco Javier (2016)
Comunicació de congrés
Accés obertRestricted Boltzmann Machines (RBMs) have shown success in different stages of speaker recognition systems. In this paper, we propose a novel framework to produce a vector-based representation for each speaker, which will ... -
GCC-PHAT based head orientation estimation
Segura, Carlos; Hernando Pericás, Francisco Javier (2012)
Text en actes de congrés
Accés obertThis work presents a novel two-step algorithm to estimate the orientation of speakers in a smart-room environment equipped with microphone arrays. First the position of the speaker is estimated by the SRP-PHAT algorithm, ... -
Global impostor selection for DBNs in multi-session i-vector speaker recognition
Ghahabi Esfahani, Omid; Hernando Pericás, Francisco Javier (2014-11-19)
Article
Accés restringit per política de l'editorialAn effective global impostor selection method is proposed in this paper for discriminative Deep Belief Networks (DBN) in the context of a multi-session i-vector based speaker recognition. The proposed method is an ... -
i-Vector modeling with deep belief networks for multi-session speaker recognition
Ghahabi Esfahani, Omid; Hernando Pericás, Francisco Javier (2014)
Text en actes de congrés
Accés restringit per política de l'editorialIn this paper we propose an impostor selection method for a Deep Belief Network (DBN) based system which models i-vectors in a multi-session speaker verification task. In the proposed method, instead of choosing a ...