Ara es mostren els items 108-127 de 133

    • Tecnicas de modelado AR robusto de la señal de voz para reconocimiento del habla en ambientes ruidosos 

      Hernando Pericás, Francisco Javier; Riu, D.; Nadeu Camprubí, Climent (. URSI, 1992)
      Text en actes de congrés
      Accés obert
    • Text independent speaker identification on noisy environments by means of self organizing maps 

      Monte Moreno, Enrique; Hernando Pericás, Francisco Javier (Institute of Electrical and Electronics Engineers (IEEE), 1996)
      Text en actes de congrés
      Accés obert
      We propose an architecture for speaker recognition. This architecture is independent of the text, robust with the presence of noise, and is based on self organizing maps (SOM) (T. Kohonen, 1984). We compare the performance ...
    • Text independent speaker identification on noisy envisorments by means of self organizing maps 

      Enric, Monte; Monte Moreno, Enrique; Adolf, A; Hernando Pericás, Francisco Javier (H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE, 1996)
      Text en actes de congrés
      Accés obert
      In this paper we propose a new architecture for speaker recognition. This architecture is independent of the text, robust with the presence of noise, and is based on the Self Organizing Maps (SOM) [1]. We compare the ...
    • Técnicas de modelado AR robusto de la señal de voz para el reconocimiento del habla en ambientes ruidosos 

      Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (1992)
      Text en actes de congrés
      Accés obert
      Speech recognition in noisy environments remains an unsolved problem, even in the case of isolated word recognition with small vocabularies. Recently, several techniques have been proposed to alleviate this problem. In ...
    • Técnicas de procesado y representación de la señal de voz para el reconocimiento del habla en ambientes ruidosos 

      Hernando Pericás, Francisco Javier (Universitat Politècnica de Catalunya, 1993-05-07)
      Tesi
      Accés obert
      El comportamiento de los sistemas actuales de reconocimiento del habla se degrada rápidamente en presencia de ruido de fondo cuando las etapas de entrenamiento y de test no pueden llevarse a cabo en las mismas condiciones ...
    • Técnicas robustas de reconocimiento del habla en ambientes adversos 

      Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent; Mariño Acebal, José Bernardo (1997-06)
      Article
      Accés obert
      El comportamiento de los sistemas actuales de reconocimiento del habla se degrada rápidamente en presencia de ruido de fondo. Recientemente, se ha propuesto una técnica de representación de la señal de voz basada en la ...
    • Técnicas robustas para la discriminación de locutores 

      Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (1996)
      Text en actes de congrés
      Accés obert
      Recently, a new filtering technique based on the decorrelation of filter bank energies has shown to be attractive for speech recognition because of its simplicity and its lower computational cost than standard representations ...
    • The AXIOM project: IoT on heterogeneous embedded platforms 

      Filgueras Izquierdo, Antonio; Vidal, Miquel; Mateu, Marc; Jiménez González, Daniel; Álvarez Martínez, Carlos; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Theodoropoulos, Dimitris; Pnevmatikatos, Dionisis; Gai, Paolo; Garzarella, Stefano; Oro de Herta, David; Hernando Pericás, Francisco Javier; Bettin, Nicola; Pomella, Alberto; Giorgi, Roberto (Institute of Electrical and Electronics Engineers (IEEE), 2019-11-11)
      Article
      Accés obert
      The AXIOM project aims at providing an environment for Cyber-Physical Systems. Smart Video Surveillance targets public environments, involving real-time face detection in crowds. Smart Home Living targets home environments ...
    • The AXIOM software layers 

      Álvarez, Carlos; Ayguadé Parra, Eduard; Bosch Pons, Jaume; Bueno Hedo, Javier; Cherkashin, Artem; Filgueras Izquierdo, Antonio; Jiménez González, Daniel; Martorell Bofill, Xavier; Navarro, Nacho; Vidal, Miquel; Theodoropoulos, Dimitris; Pnevmatikatos, Dionisis; Catani, Davide; Oro Garcia, David; Fernandez Prades, Carles; Segura, Carlos; Rodriguez Saeta, Javier; Hernando Pericás, Francisco Javier; Scordino, Claudio; Gai, Paolo; Passera, Pierluigi; Pomella, Alberto; Bettin, Nicola; Rizzo, Antonio; Giorgi, Roberto (2016-11-01)
      Article
      Accés obert
      AXIOM project aims at developing a heterogeneous computing board (SMP-FPGA).The Software Layers developed at the AXIOM project are explained.OmpSs provides an easy way to execute heterogeneous codes in multiple cores. ...
    • The CAMOMILE collaborative annotation platform for multi-modal, multi-lingual and multi-media documents 

      Poignant, Johann; Budnik, Mateusz; Bredin, Herve; Barras, Claude; Adda, Gilles; Hernando Pericás, Francisco Javier; Mariani, Joseph; Morros Rubió, Josep Ramon (European Language Resources Association, 2016)
      Comunicació de congrés
      Accés obert
      In this paper, we describe the organization and the implementation of the CAMOMILE collaborative annotation framework for multimodal, multimedia, multilingual (3M) data. Given the versatile nature of the analysis which ...
    • The detection of overlapping speech with prosodic features for speaker diarization 

      Zelenak, Martin; Hernando Pericás, Francisco Javier (2011)
      Comunicació de congrés
      Accés restringit per política de l'editorial
      Overlapping speech is responsible for a certain amount of errors produced by tandard speaker diarization systems in meeting environment. We are investigating a set of prosody-based long-term features as a potential ...
    • The L2F - UPC Speaker Recognition System for NIST SRE 2010 

      Abad, Alberto; Luque, Jordi; Trancoso, Isabel; Hernando Pericás, Francisco Javier (2011)
      Comunicació de congrés
      Accés obert
      This document describes the joint submission of the INESC-ID’s Spoken Language Systems Laboratory (L 2 F) and the TALP Research Center from the Technical University of Catalonia (UPC) to the 2010 NIST Speaker Recognition ...
    • The UPC speaker verification system submitted to VoxCeleb Speaker Recognition Challenge 2020 (VoxSRC-20) 

      Khan, Umair; Hernando Pericás, Francisco Javier (2020-10-27)
      Report de recerca
      Accés obert
      This report describes the submission from Technical University of Catalonia (UPC) to the VoxCeleb Speaker Recognition Challenge (VoxSRC-20) at Interspeech 2020. The final submission is a combination of three systems. ...
    • The use of long-term features for GMM- and i-vector-based speaker diarization systems 

      Zewoudie, Abraham Woubie; Luque, Jordi; Hernando Pericás, Francisco Javier (2018-09-26)
      Article
      Accés obert
      Several factors contribute to the performance of speaker diarization systems. For instance, the appropriate selection of speech features is one of the key aspects that affect speaker diarization systems. The other factors ...
    • Third-order cumulant-based wiener filtering algorithm applied to robust speech recognition 

      Salavedra Molí, Josep; Hernando Pericás, Francisco Javier (1996)
      Text en actes de congrés
      Accés obert
      In previous works [5], [6], we studied some speech enhancement algorithms based on the iterative Wiener filtering method due to Lim-Oppenheim [2], where the AR spectral estimation of the speech is carried out using a ...
    • Time and frequency filtering for speech recognition with real noises 

      Macho, D; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier (., 1999)
      Text en actes de congrés
      Accés obert
      very speech recognition system requires a signal representation that parametrically models the temporal evolution of the speech spectral envelope. Current parameterizations involve, either explicitly or implicitly, a set ...
    • Towards large scale multimedia indexing: a case study on person discovery in broadcast news 

      Le, Nam; Bredin, Herve; Sergent, Gabriel; India Massana, Miquel Àngel; López-Otero, Paula; Barras, Claude; Guinaudeau, Camille; Gravier, Guillaume; Barbosa da Fonseca, Gabriel; Lyon Freire, Izabela; Patrocinio Jr., Zenilton; Jamil F. Guimarães, Silvio; Martí Juan, Gerard; Morros Rubió, Josep Ramon; Hernando Pericás, Francisco Javier; Docio-Fernández, Laura; García-Mateo, Carmen; Meignier, Sylvain; Odobez, Jean-Marc (Association for Computing Machinery (ACM), 2017)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      The rapid growth of multimedia databases and the human interest in their peers make indices representing the location and identity of people in audio-visual documents essential for searching archives. Person discovery ...
    • Two-source acoustic event detection and localization: online implementation in a smart-room 

      Butko, Taras; Gonzalez Pla, Fran; Segura Perales, Carlos; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier (2011)
      Comunicació de congrés
      Accés obert
      Real-time processing is a requirement for many practical signal processing applications. In this work we implemented online 2-source acoustic event detection and localization algorithms in a Smart-room, a closed space ...
    • Unsupervised training of siamese networks for speaker verification 

      Khan, Umair; Hernando Pericás, Francisco Javier (International Speech Communication Association (ISCA), 2020)
      Text en actes de congrés
      Accés obert
      Speaker labeled background data is an essential requirement for most state-of-the-art approaches in speaker recognition, e.g., xvectors and i-vector/PLDA. However, in reality it is difficult to access large amount of labeled ...
    • UPC multimodal speaker diarization system for the 2018 Albayzin challenge 

      India Massana, Miquel Àngel; Sagastiberri, Itziar; Palau Puigdevall, Ponç; Sayrol Clols, Elisa; Morros Rubió, Josep Ramon; Hernando Pericás, Francisco Javier (International Speech Communication Association (ISCA), 2018)
      Text en actes de congrés
      Accés obert
      This paper presents the UPC system proposed for the Multimodal Speaker Diarization task of the 2018 Albayzin Challenge. This approach works by processing individually the speech and the image signal. In the speech domain, ...