Recent submissions

  • Extraction of syllabically rich and balanced sentences for Tigrigna language 

    Abera, Hafte; Nadeu Camprubí, Climent; Mariam, Sebsibe H. (Institute of Electrical and Electronics Engineers (IEEE), 2016)
    Conference lecture
    Access restricted by publisher policy
    The Tigrigna language lacks text and speech corpora for developing speech technologies. In this work, after considering the phonetic nature of Tigrigna, we have gathered and pre-processed an initial and relatively large ...
  • Work-efficient parallel non-maximum suppression for embedded GPU architectures 

    Oro Garcia, David; Fernandez Tena, Carles; Martorell Bofill, Xavier; Hernando Pericás, Francisco Javier (Institute of Electrical and Electronics Engineers (IEEE), 2016)
    Conference report
    Access restricted by publisher policy
    With the emergence of GPU computing, deep neural networks have become a widely used technique for advancing research in the field of image and speech processing. In the context of object and event detection, sliding-window ...
  • Subband splitting, adaptive scalar prediction and vector quantization for speech coding 

    Masgrau Gómez, Enrique José; Rodríguez Fonollosa, José Adrián; Mariño Acebal, José Bernardo (1988)
    Conference report
    Open access
    This paper describes a new coding structure based on the combination of Vector Quantization, Linear Prediction and Subband Splitting that achieves high quality speech at rates below 10 Kbit/sec. In this scheme, a vector ...
  • Wideband-speech APVQ coding from 16 to 32 KBPS 

    Salavedra Molí, Josep (1997)
    Conference report
    Open access
    This paper describes a coding scheme for wideband speech (sampling frequency 16 kHz). We present a wideband speech encoder called APVQ (Adaptive Predictive Vector Quantization). It combines Subband Coding, Vector Quantization ...
  • Robust HOS-based techniques applied to speech recognition and enhancement

    Salavedra Molí, Josep; Hernando Pericás, Francisco Javier; Masgrau Gómez, Enrique José; Moreno Bilbao, M. Asunción (1995)
    Conference report
    Open access
    We study some speech enhancement algorithms based on the iterative Wiener filtering method due to Lim-Oppenheim [2], where the AR spectral estimation of the speech is carried out using a second-order analysis. But in our ...
  • A speech enhancement system using higher order AR estimation in real environments

    Salavedra Molí, Josep; Masgrau Gómez, Enrique José; Moreno Bilbao, M. Asunción (1993)
    Conference report
    Open access
    We study some speech enhancement algorithms based on the iterative Wiener filtering method due to Lim-Oppenheim [2], where the AR spectral estimation of the speech is carried out using a second-order analysis. But in our ...
  • Speaker verification on the POLYCOST database using frequency filtered spectral energies

    Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (1998)
    Conference report
    Open access
    The spectral parameters that result from filtering the frequency sequence of log mel-scaled filter-bank energies with a first or second order FIR filter have proved to be competitive for speech recognition. Recently, the ...
  • Reconocimiento del habla en ambientes ruidosos mediante modelos ocultos de Markov discretos 

    Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (1992)
    Conference report
    Open access
    Speech recognition in noisy environments remains an unsolved problem, even in the case of isolated word recognition with small vocabularies. Recently, several techniques have been proposed to alleviate this problem. ...
  • Comportamiento de la transformación bilineal de frecuencias en reconocimiento de habla ruidosa 

    Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (1992)
    Conference report
    Open access
  • Discriminative weighting of dynamic features in continuous-density hidden Markov models for word recognition

    Hernando Pericás, Francisco Javier (1995)
    Conference report
    Open access
    Speech dynamic features, which provide smoothed estimates of the derivatives of the spectral parameter trajectories in the current frame, are routinely used in current speech recognition systems in combination with short-term ...
