Now showing items 21-40 of 177

    • Auto-encoding nearest neighbor i-vectors for speaker verification 

      Khan, Umair; India Massana, Miquel Àngel; Hernando Pericás, Francisco Javier (International Speech Communication Association (ISCA), 2019)
      Conference lecture
      Open Access
      In the last years, i-vectors followed by cosine or PLDA scoringtechniques were the state-of-the-art approach in speaker veri-fication. PLDA requires labeled background data, and thereexists a significant performance gap ...
    • Bandwidth extension of narrowband speech 

      Expósito Pérez, Miquel; Salavedra Molí, Josep (Universidad Politécnica de Valencia, 2014)
      Conference report
      Open Access
      Recently, 4G mobile phone systems have been designed to process wideband speech signals whose sampling frequency is 16 kHz. However, most part of mobile and classical phone network, and current 3G mobile phones, still ...
    • Bit-slice implementation of a linear predictive vocoder 

      Vázquez Grau, Gregorio; Gasull Llampallas, Antoni (1985)
      Conference report
      Open Access
      A digital 16-bit high-speed general-purpose signal-processor is shown. The main objective has been the implementation of a linear predictive vocoder for obtaining real-time speech compression. For real-time digital speech ...
    • Building synthetic voices in the META-NET framework 

      Garcia Casademont, Emília; Bonafonte Cávez, Antonio; Moreno Bilbao, M. Asunción (2012)
      Conference report
      Restricted access - publisher's policy
      METANET 4 U is a European project aiming at supporting language technology for European languages and multilingualism. It is a project in the META-NET Network of Excellence, a cluster of projects aiming at fostering the ...
    • CDHMM speaker recognition by means of frequency filtering of filter-bank energies 

      Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (1997)
      Conference report
      Open Access
      Recently, the set of spectral parameters of every speech frame that result from filtering the frequency sequence of mel-scaled filter-bank energies with a simple first-order high-pass FIR filter have proved to be an efficient ...
    • Codificación APVQ de voz en banda ancha para velocidades entre 16 y 32 KBPS 

      Salavedra Molí, Josep; Masgrau Gómez, Enrique José (1996)
      Conference report
      Open Access
      This paper describes a coding scheme for broadband speech (sampling frequency 16KHz). We present a wideband speech encoder called APVQ (Adaptive Predictive Vector Quantization). It combines Subband Coding, Vector Quantization ...
    • Codificación APVQ de voz en banda ancha usando asignación dinámica de bits 

      Salavedra Molí, Josep (Universidad de Valladolid, 1995)
      Conference report
      Open Access
      This paper describes a coding scheme for broadband speech. It can be seen as a vectorial extension of a conventional ADPCM encoder. In this scheme, signal vector is formed with one sample of the normalized prediction error ...
    • Codificación APVQ-extendida de voz de banda ancha 

      Masgrau Gómez, Enrique José; Salavedra Molí, Josep (1994)
      Conference report
      Open Access
      This paper describes a coding scheme for broadband speech. It can be seen as a vectorial extension of an conventional ADPCM encoder. In this scheme, the vector signal is formed with one sample of the normalizaed prediction ...
    • Comparison of time & frequency filtering and cepstral-time matrix approaches in ASR 

      Macho, D; Nadeu Camprubí, Climent; Jancovic, P; Rozinaj, G; Hernando Pericás, Francisco Javier (1999)
      Conference report
      Open Access
      In current speech recognition systems, speech is represented by a 2-D sequence of parameters that model the temporal evolution of the spectral envelope of speech. Linear transformation or filtering along both time and ...
    • Comportamiento de la transformación bilineal de frecuencias en reconocimiento de habla ruidosa 

      Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (1992)
      Conference report
      Open Access
    • Configuración e instalación de una PBX de VoIP basada en Asterisk 

      Castro Alonso, Sergio (Universitat Politècnica de Catalunya, 2013-05-06)
      Bachelor thesis
      Open Access
      El proyecto trata de la configuración de una centralita Asterisk y de su integración con diferentes aplicaciones para dar servicios de valor añadido. No hay ninguna duda de que VoIP es la telefonía del futuro por las ...
    • Corpus lingüístic pel desenvolupament d'una veu sintètica en català per a Festival 

      Gallego Gonzàlez, Silvia (Universitat Politècnica de Catalunya, 2010-06)
      Master thesis (pre-Bologna period)
      Open Access
    • Creating a VoIP platform for virtual containers based on JITSI 

      Morales Duarte, Carles (Universitat Politècnica de Catalunya, 2017-06)
      Bachelor thesis
      Restricted access - author's decision
    • Deep neural networks in acoustic model 

      Camacho Tejedor, Oriol (Universitat Politècnica de Catalunya, 2016-05-25)
      Bachelor thesis
      Open Access
      Covenantee:   Akademia Górniczo-Hutnicza im. S. Staszica w Krakowie
      Do implementation of a training of a deep neural network acoustic model for speech recognition
    • Defining analogy for non-native inclusions in Spanish utterances 

      Polyakova, Tatyana; Bonafonte Cávez, Antonio (2010)
      Conference report
      Open Access
      Mass media globalization introduces the challenge of multilingualism into most popular speech applications such as text-tospeech synthesis and automatic speech recognition. In Spain as well as in the other countries, the ...
    • Desarrollo de un módulo de dictados en una plataforma web educativa 

      Castillo Malaver, Italo (Universitat Politècnica de Catalunya, 2018-01)
      Bachelor thesis
      Restricted access - confidentiality agreement
      Covenantee:   Semidynamics Technology Services
    • Desarrollos futuros en aplicaciones comandadas por voz 

      Pérez González, Xavier (Escola Tècnica Superior d'Enginyers de Telecomunicació de Barcelona, 1996)
      Article
      Open Access
    • Design and evaluation of an ultra low-power human-quality speech recognition system 

      Pinto Rivero, Daniel; Arnau Montañés, José María; González Colás, Antonio María (2020-11)
      Article
      Open Access
      Automatic Speech Recognition (ASR) has experienced a dramatic evolution since pioneer development of Bell Lab’s single-digit recognizer more than 50 years ago. Current ASR systems have taken advantage of the tremendous ...
    • Development of the Feature Extractor for Speech Recognition 

      Añorga Irigoien, Eneko (Universitat Politècnica de Catalunya, 2009-10)
      Master thesis (pre-Bologna period)
      Open Access
      With this diploma work we have attempted to give continuity to the previous work done by other researchers called, Voice Operating Intelligent Wheelchair – VOIC [1]. A development of a wheelchair controlled by voice is ...
    • Direct expressive voice training based on semantic selection 

      Jauk, Igor; Bonafonte Cávez, Antonio (International Speech Communication Association (ISCA), 2016)
      Conference report
      Restricted access - publisher's policy
      This work aims at creating expressive voices from audiobooks using semantic selection. First, for each utterance of the audiobook an acoustic feature vector is extracted, including iVectors built on MFCC and on F0 ...