Browsing by author "Hernando Pericás, Francisco Javier"
Now showing items 108-127 of 133
-
Técnicas de modelado AR robusto de la señal de voz para reconocimiento del habla en ambientes ruidosos
Hernando Pericás, Francisco Javier; Riu, D.; Nadeu Camprubí, Climent (URSI, 1992)
Conference proceedings paper
Open access
-
Text independent speaker identification on noisy environments by means of self organizing maps
Monte Moreno, Enrique; Hernando Pericás, Francisco Javier (Institute of Electrical and Electronics Engineers (IEEE), 1996)
Conference proceedings paper
Open access
We propose an architecture for speaker recognition. This architecture is independent of the text, robust with the presence of noise, and is based on self organizing maps (SOM) (T. Kohonen, 1984). We compare the performance ...
-
Text independent speaker identification on noisy environments by means of self organizing maps
Enric, Monte; Monte Moreno, Enrique; Adolf, A; Hernando Pericás, Francisco Javier (H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE, 1996)
Conference proceedings paper
Open access
In this paper we propose a new architecture for speaker recognition. This architecture is independent of the text, robust with the presence of noise, and is based on the Self Organizing Maps (SOM) [1]. We compare the ...
-
Técnicas de modelado AR robusto de la señal de voz para el reconocimiento del habla en ambientes ruidosos
Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (1992)
Conference proceedings paper
Open access
Speech recognition in noisy environments remains an unsolved problem, even in the case of isolated word recognition with small vocabularies. Recently, several techniques have been proposed to alleviate this problem. In ...
-
Técnicas de procesado y representación de la señal de voz para el reconocimiento del habla en ambientes ruidosos
Hernando Pericás, Francisco Javier (Universitat Politècnica de Catalunya, 1993-05-07)
Thesis
Open access
The performance of current speech recognition systems degrades rapidly in the presence of background noise when the training and testing stages cannot be carried out under the same conditions ...
-
Técnicas robustas de reconocimiento del habla en ambientes adversos
Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent; Mariño Acebal, José Bernardo (1997-06)
Article
Open access
The performance of current speech recognition systems degrades rapidly in the presence of background noise. Recently, a speech signal representation technique has been proposed based on the ...
-
Técnicas robustas para la discriminación de locutores
Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (1996)
Conference proceedings paper
Open access
Recently, a new filtering technique based on the decorrelation of filter bank energies has been shown to be attractive for speech recognition because of its simplicity and its lower computational cost than standard representations ...
-
The AXIOM project: IoT on heterogeneous embedded platforms
Filgueras Izquierdo, Antonio; Vidal, Miquel; Mateu, Marc; Jiménez González, Daniel; Álvarez Martínez, Carlos; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Theodoropoulos, Dimitris; Pnevmatikatos, Dionisis; Gai, Paolo; Garzarella, Stefano; Oro de Herta, David; Hernando Pericás, Francisco Javier; Bettin, Nicola; Pomella, Alberto; Giorgi, Roberto (Institute of Electrical and Electronics Engineers (IEEE), 2019-11-11)
Article
Open access
The AXIOM project aims at providing an environment for Cyber-Physical Systems. Smart Video Surveillance targets public environments, involving real-time face detection in crowds. Smart Home Living targets home environments ...
-
The AXIOM software layers
Álvarez, Carlos; Ayguadé Parra, Eduard; Bosch Pons, Jaume; Bueno Hedo, Javier; Cherkashin, Artem; Filgueras Izquierdo, Antonio; Jiménez González, Daniel; Martorell Bofill, Xavier; Navarro, Nacho; Vidal, Miquel; Theodoropoulos, Dimitris; Pnevmatikatos, Dionisis; Catani, Davide; Oro Garcia, David; Fernandez Prades, Carles; Segura, Carlos; Rodriguez Saeta, Javier; Hernando Pericás, Francisco Javier; Scordino, Claudio; Gai, Paolo; Passera, Pierluigi; Pomella, Alberto; Bettin, Nicola; Rizzo, Antonio; Giorgi, Roberto (2016-11-01)
Article
Open access
AXIOM project aims at developing a heterogeneous computing board (SMP-FPGA). The Software Layers developed at the AXIOM project are explained. OmpSs provides an easy way to execute heterogeneous codes in multiple cores. ...
-
The CAMOMILE collaborative annotation platform for multi-modal, multi-lingual and multi-media documents
Poignant, Johann; Budnik, Mateusz; Bredin, Herve; Barras, Claude; Adda, Gilles; Hernando Pericás, Francisco Javier; Mariani, Joseph; Morros Rubió, Josep Ramon (European Language Resources Association, 2016)
Conference presentation
Open access
In this paper, we describe the organization and the implementation of the CAMOMILE collaborative annotation framework for multimodal, multimedia, multilingual (3M) data. Given the versatile nature of the analysis which ...
-
The detection of overlapping speech with prosodic features for speaker diarization
Zelenak, Martin; Hernando Pericás, Francisco Javier (2011)
Conference presentation
Restricted access (publisher's policy)
Overlapping speech is responsible for a certain amount of errors produced by standard speaker diarization systems in meeting environments. We are investigating a set of prosody-based long-term features as a potential ...
-
The L2F - UPC Speaker Recognition System for NIST SRE 2010
Abad, Alberto; Luque, Jordi; Trancoso, Isabel; Hernando Pericás, Francisco Javier (2011)
Conference presentation
Open access
This document describes the joint submission of the INESC-ID’s Spoken Language Systems Laboratory (L2F) and the TALP Research Center from the Technical University of Catalonia (UPC) to the 2010 NIST Speaker Recognition ...
-
The UPC speaker verification system submitted to VoxCeleb Speaker Recognition Challenge 2020 (VoxSRC-20)
Khan, Umair; Hernando Pericás, Francisco Javier (2020-10-27)
Research report
Open access
This report describes the submission from Technical University of Catalonia (UPC) to the VoxCeleb Speaker Recognition Challenge (VoxSRC-20) at Interspeech 2020. The final submission is a combination of three systems. ...
-
The use of long-term features for GMM- and i-vector-based speaker diarization systems
Zewoudie, Abraham Woubie; Luque, Jordi; Hernando Pericás, Francisco Javier (2018-09-26)
Article
Open access
Several factors contribute to the performance of speaker diarization systems. For instance, the appropriate selection of speech features is one of the key aspects that affect speaker diarization systems. The other factors ...
-
Third-order cumulant-based wiener filtering algorithm applied to robust speech recognition
Salavedra Molí, Josep; Hernando Pericás, Francisco Javier (1996)
Conference proceedings paper
Open access
In previous works [5], [6], we studied some speech enhancement algorithms based on the iterative Wiener filtering method due to Lim-Oppenheim [2], where the AR spectral estimation of the speech is carried out using a ...
-
Time and frequency filtering for speech recognition with real noises
Macho, D; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier (1999)
Conference proceedings paper
Open access
Every speech recognition system requires a signal representation that parametrically models the temporal evolution of the speech spectral envelope. Current parameterizations involve, either explicitly or implicitly, a set ...
-
Towards large scale multimedia indexing: a case study on person discovery in broadcast news
Le, Nam; Bredin, Herve; Sergent, Gabriel; India Massana, Miquel Àngel; López-Otero, Paula; Barras, Claude; Guinaudeau, Camille; Gravier, Guillaume; Barbosa da Fonseca, Gabriel; Lyon Freire, Izabela; Patrocinio Jr., Zenilton; Jamil F. Guimarães, Silvio; Martí Juan, Gerard; Morros Rubió, Josep Ramon; Hernando Pericás, Francisco Javier; Docio-Fernández, Laura; García-Mateo, Carmen; Meignier, Sylvain; Odobez, Jean-Marc (Association for Computing Machinery (ACM), 2017)
Conference proceedings paper
Restricted access (publisher's policy)
The rapid growth of multimedia databases and the human interest in their peers make indices representing the location and identity of people in audio-visual documents essential for searching archives. Person discovery ...
-
Two-source acoustic event detection and localization: online implementation in a smart-room
Butko, Taras; Gonzalez Pla, Fran; Segura Perales, Carlos; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier (2011)
Conference presentation
Open access
Real-time processing is a requirement for many practical signal processing applications. In this work we implemented online 2-source acoustic event detection and localization algorithms in a Smart-room, a closed space ...
-
Unsupervised training of siamese networks for speaker verification
Khan, Umair; Hernando Pericás, Francisco Javier (International Speech Communication Association (ISCA), 2020)
Conference proceedings paper
Open access
Speaker-labeled background data is an essential requirement for most state-of-the-art approaches in speaker recognition, e.g., x-vectors and i-vector/PLDA. However, in reality it is difficult to access large amounts of labeled ...
-
UPC multimodal speaker diarization system for the 2018 Albayzin challenge
India Massana, Miquel Àngel; Sagastiberri, Itziar; Palau Puigdevall, Ponç; Sayrol Clols, Elisa; Morros Rubió, Josep Ramon; Hernando Pericás, Francisco Javier (International Speech Communication Association (ISCA), 2018)
Conference proceedings paper
Open access
This paper presents the UPC system proposed for the Multimodal Speaker Diarization task of the 2018 Albayzin Challenge. This approach works by processing the speech and image signals individually. In the speech domain, ...