Exploració per autor "Hernando Pericás, Francisco Javier"

Tecnicas de modelado AR robusto de la señal de voz para reconocimiento del habla en ambientes ruidosos

Hernando Pericás, Francisco Javier; Riu, D.; Nadeu Camprubí, Climent (. URSI, 1992)
Text en actes de congrés
Accés obert

Text independent speaker identification on noisy environments by means of self organizing maps

Monte Moreno, Enrique; Hernando Pericás, Francisco Javier (Institute of Electrical and Electronics Engineers (IEEE), 1996)
Text en actes de congrés
Accés obert

We propose an architecture for speaker recognition. This architecture is independent of the text, robust with the presence of noise, and is based on self organizing maps (SOM) (T. Kohonen, 1984). We compare the performance ...

Text independent speaker identification on noisy envisorments by means of self organizing maps

Enric, Monte; Monte Moreno, Enrique; Adolf, A; Hernando Pericás, Francisco Javier (H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE, 1996)
Text en actes de congrés
Accés obert

In this paper we propose a new architecture for speaker recognition. This architecture is independent of the text, robust with the presence of noise, and is based on the Self Organizing Maps (SOM) [1]. We compare the ...

Técnicas de modelado AR robusto de la señal de voz para el reconocimiento del habla en ambientes ruidosos

Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (1992)
Text en actes de congrés
Accés obert

Speech recognition in noisy environments remains an unsolved problem, even in the case of isolated word recognition with small vocabularies. Recently, several techniques have been proposed to alleviate this problem. In ...

Técnicas de procesado y representación de la señal de voz para el reconocimiento del habla en ambientes ruidosos

Hernando Pericás, Francisco Javier (Universitat Politècnica de Catalunya, 1993-05-07)
Tesi
Accés obert

El comportamiento de los sistemas actuales de reconocimiento del habla se degrada rápidamente en presencia de ruido de fondo cuando las etapas de entrenamiento y de test no pueden llevarse a cabo en las mismas condiciones ...

Técnicas robustas de reconocimiento del habla en ambientes adversos

Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent; Mariño Acebal, José Bernardo (1997-06)
Article
Accés obert

El comportamiento de los sistemas actuales de reconocimiento del habla se degrada rápidamente en presencia de ruido de fondo. Recientemente, se ha propuesto una técnica de representación de la señal de voz basada en la ...

Técnicas robustas para la discriminación de locutores

Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (1996)
Text en actes de congrés
Accés obert

Recently, a new filtering technique based on the decorrelation of filter bank energies has shown to be attractive for speech recognition because of its simplicity and its lower computational cost than standard representations ...

The AXIOM project: IoT on heterogeneous embedded platforms

Filgueras Izquierdo, Antonio; Vidal, Miquel; Mateu, Marc; Jiménez González, Daniel; Álvarez Martínez, Carlos; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Theodoropoulos, Dimitris; Pnevmatikatos, Dionisis; Gai, Paolo; Garzarella, Stefano; Oro de Herta, David; Hernando Pericás, Francisco Javier; Bettin, Nicola; Pomella, Alberto; Giorgi, Roberto (Institute of Electrical and Electronics Engineers (IEEE), 2019-11-11)
Article
Accés obert

The AXIOM project aims at providing an environment for Cyber-Physical Systems. Smart Video Surveillance targets public environments, involving real-time face detection in crowds. Smart Home Living targets home environments ...

The AXIOM software layers

Álvarez, Carlos; Ayguadé Parra, Eduard; Bosch Pons, Jaume; Bueno Hedo, Javier; Cherkashin, Artem; Filgueras Izquierdo, Antonio; Jiménez González, Daniel; Martorell Bofill, Xavier; Navarro, Nacho; Vidal, Miquel; Theodoropoulos, Dimitris; Pnevmatikatos, Dionisis; Catani, Davide; Oro Garcia, David; Fernandez Prades, Carles; Segura, Carlos; Rodriguez Saeta, Javier; Hernando Pericás, Francisco Javier; Scordino, Claudio; Gai, Paolo; Passera, Pierluigi; Pomella, Alberto; Bettin, Nicola; Rizzo, Antonio; Giorgi, Roberto (2016-11-01)
Article
Accés obert

AXIOM project aims at developing a heterogeneous computing board (SMP-FPGA).The Software Layers developed at the AXIOM project are explained.OmpSs provides an easy way to execute heterogeneous codes in multiple cores. ...

The CAMOMILE collaborative annotation platform for multi-modal, multi-lingual and multi-media documents

Poignant, Johann; Budnik, Mateusz; Bredin, Herve; Barras, Claude; Adda, Gilles; Hernando Pericás, Francisco Javier; Mariani, Joseph; Morros Rubió, Josep Ramon (European Language Resources Association, 2016)
Comunicació de congrés
Accés obert

In this paper, we describe the organization and the implementation of the CAMOMILE collaborative annotation framework for multimodal, multimedia, multilingual (3M) data. Given the versatile nature of the analysis which ...

The detection of overlapping speech with prosodic features for speaker diarization

Zelenak, Martin; Hernando Pericás, Francisco Javier (2011)
Comunicació de congrés
Accés restringit per política de l'editorial

Overlapping speech is responsible for a certain amount of errors produced by tandard speaker diarization systems in meeting environment. We are investigating a set of prosody-based long-term features as a potential ...

The L2F - UPC Speaker Recognition System for NIST SRE 2010

Abad, Alberto; Luque, Jordi; Trancoso, Isabel; Hernando Pericás, Francisco Javier (2011)
Comunicació de congrés
Accés obert

This document describes the joint submission of the INESC-ID’s Spoken Language Systems Laboratory (L 2 F) and the TALP Research Center from the Technical University of Catalonia (UPC) to the 2010 NIST Speaker Recognition ...

The UPC speaker verification system submitted to VoxCeleb Speaker Recognition Challenge 2020 (VoxSRC-20)

Khan, Umair; Hernando Pericás, Francisco Javier (2020-10-27)
Report de recerca
Accés obert

This report describes the submission from Technical University of Catalonia (UPC) to the VoxCeleb Speaker Recognition Challenge (VoxSRC-20) at Interspeech 2020. The final submission is a combination of three systems. ...

The use of long-term features for GMM- and i-vector-based speaker diarization systems

Zewoudie, Abraham Woubie; Luque, Jordi; Hernando Pericás, Francisco Javier (2018-09-26)
Article
Accés obert

Several factors contribute to the performance of speaker diarization systems. For instance, the appropriate selection of speech features is one of the key aspects that affect speaker diarization systems. The other factors ...

Third-order cumulant-based wiener filtering algorithm applied to robust speech recognition

Salavedra Molí, Josep; Hernando Pericás, Francisco Javier (1996)
Text en actes de congrés
Accés obert

In previous works [5], [6], we studied some speech enhancement algorithms based on the iterative Wiener filtering method due to Lim-Oppenheim [2], where the AR spectral estimation of the speech is carried out using a ...

Time and frequency filtering for speech recognition with real noises

Macho, D; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier (., 1999)
Text en actes de congrés
Accés obert

very speech recognition system requires a signal representation that parametrically models the temporal evolution of the speech spectral envelope. Current parameterizations involve, either explicitly or implicitly, a set ...

Towards large scale multimedia indexing: a case study on person discovery in broadcast news

Le, Nam; Bredin, Herve; Sergent, Gabriel; India Massana, Miquel Àngel; López-Otero, Paula; Barras, Claude; Guinaudeau, Camille; Gravier, Guillaume; Barbosa da Fonseca, Gabriel; Lyon Freire, Izabela; Patrocinio Jr., Zenilton; Jamil F. Guimarães, Silvio; Martí Juan, Gerard; Morros Rubió, Josep Ramon; Hernando Pericás, Francisco Javier; Docio-Fernández, Laura; García-Mateo, Carmen; Meignier, Sylvain; Odobez, Jean-Marc (Association for Computing Machinery (ACM), 2017)
Text en actes de congrés
Accés restringit per política de l'editorial

The rapid growth of multimedia databases and the human interest in their peers make indices representing the location and identity of people in audio-visual documents essential for searching archives. Person discovery ...

Two-source acoustic event detection and localization: online implementation in a smart-room

Butko, Taras; Gonzalez Pla, Fran; Segura Perales, Carlos; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier (2011)
Comunicació de congrés
Accés obert

Real-time processing is a requirement for many practical signal processing applications. In this work we implemented online 2-source acoustic event detection and localization algorithms in a Smart-room, a closed space ...

Unsupervised training of siamese networks for speaker verification

Khan, Umair; Hernando Pericás, Francisco Javier (International Speech Communication Association (ISCA), 2020)
Text en actes de congrés
Accés obert

Speaker labeled background data is an essential requirement for most state-of-the-art approaches in speaker recognition, e.g., xvectors and i-vector/PLDA. However, in reality it is difficult to access large amount of labeled ...

UPC multimodal speaker diarization system for the 2018 Albayzin challenge

India Massana, Miquel Àngel; Sagastiberri, Itziar; Palau Puigdevall, Ponç; Sayrol Clols, Elisa; Morros Rubió, Josep Ramon; Hernando Pericás, Francisco Javier (International Speech Communication Association (ISCA), 2018)
Text en actes de congrés
Accés obert

This paper presents the UPC system proposed for the Multimodal Speaker Diarization task of the 2018 Albayzin Challenge. This approach works by processing individually the speech and the image signal. In the speech domain, ...