Browsing by Subject "Processament de la parla"
Now showing items 21-40 of 177
-
Auto-encoding nearest neighbor i-vectors for speaker verification
(International Speech Communication Association (ISCA), 2019)
Conference lecture
Open AccessIn the last years, i-vectors followed by cosine or PLDA scoringtechniques were the state-of-the-art approach in speaker veri-fication. PLDA requires labeled background data, and thereexists a significant performance gap ... -
Bandwidth extension of narrowband speech
(Universidad Politécnica de Valencia, 2014)
Conference report
Open AccessRecently, 4G mobile phone systems have been designed to process wideband speech signals whose sampling frequency is 16 kHz. However, most part of mobile and classical phone network, and current 3G mobile phones, still ... -
Bit-slice implementation of a linear predictive vocoder
(1985)
Conference report
Open AccessA digital 16-bit high-speed general-purpose signal-processor is shown. The main objective has been the implementation of a linear predictive vocoder for obtaining real-time speech compression. For real-time digital speech ... -
Building synthetic voices in the META-NET framework
(2012)
Conference report
Restricted access - publisher's policyMETANET 4 U is a European project aiming at supporting language technology for European languages and multilingualism. It is a project in the META-NET Network of Excellence, a cluster of projects aiming at fostering the ... -
CDHMM speaker recognition by means of frequency filtering of filter-bank energies
(1997)
Conference report
Open AccessRecently, the set of spectral parameters of every speech frame that result from filtering the frequency sequence of mel-scaled filter-bank energies with a simple first-order high-pass FIR filter have proved to be an efficient ... -
Codificación APVQ de voz en banda ancha para velocidades entre 16 y 32 KBPS
(1996)
Conference report
Open AccessThis paper describes a coding scheme for broadband speech (sampling frequency 16KHz). We present a wideband speech encoder called APVQ (Adaptive Predictive Vector Quantization). It combines Subband Coding, Vector Quantization ... -
Codificación APVQ de voz en banda ancha usando asignación dinámica de bits
(Universidad de Valladolid, 1995)
Conference report
Open AccessThis paper describes a coding scheme for broadband speech. It can be seen as a vectorial extension of a conventional ADPCM encoder. In this scheme, signal vector is formed with one sample of the normalized prediction error ... -
Codificación APVQ-extendida de voz de banda ancha
(1994)
Conference report
Open AccessThis paper describes a coding scheme for broadband speech. It can be seen as a vectorial extension of an conventional ADPCM encoder. In this scheme, the vector signal is formed with one sample of the normalizaed prediction ... -
Comparison of time & frequency filtering and cepstral-time matrix approaches in ASR
(1999)
Conference report
Open AccessIn current speech recognition systems, speech is represented by a 2-D sequence of parameters that model the temporal evolution of the spectral envelope of speech. Linear transformation or filtering along both time and ... -
Comportamiento de la transformación bilineal de frecuencias en reconocimiento de habla ruidosa
(1992)
Conference report
Open Access -
Configuración e instalación de una PBX de VoIP basada en Asterisk
(Universitat Politècnica de Catalunya, 2013-05-06)
Bachelor thesis
Open AccessEl proyecto trata de la configuración de una centralita Asterisk y de su integración con diferentes aplicaciones para dar servicios de valor añadido. No hay ninguna duda de que VoIP es la telefonía del futuro por las ... -
Corpus lingüístic pel desenvolupament d'una veu sintètica en català per a Festival
(Universitat Politècnica de Catalunya, 2010-06)
Master thesis (pre-Bologna period)
Open Access -
Creating a VoIP platform for virtual containers based on JITSI
(Universitat Politècnica de Catalunya, 2017-06)
Bachelor thesis
Restricted access - author's decision -
Deep neural networks in acoustic model
(Universitat Politècnica de Catalunya, 2016-05-25)
Bachelor thesis
Open Access
Covenantee: Akademia Górniczo-Hutnicza im. S. Staszica w KrakowieDo implementation of a training of a deep neural network acoustic model for speech recognition -
Defining analogy for non-native inclusions in Spanish utterances
(2010)
Conference report
Open AccessMass media globalization introduces the challenge of multilingualism into most popular speech applications such as text-tospeech synthesis and automatic speech recognition. In Spain as well as in the other countries, the ... -
Desarrollo de un módulo de dictados en una plataforma web educativa
(Universitat Politècnica de Catalunya, 2018-01)
Bachelor thesis
Restricted access - confidentiality agreement
Covenantee: Semidynamics Technology Services -
Desarrollos futuros en aplicaciones comandadas por voz
(Escola Tècnica Superior d'Enginyers de Telecomunicació de Barcelona, 1996)
Article
Open Access -
Design and evaluation of an ultra low-power human-quality speech recognition system
(2020-11)
Article
Open AccessAutomatic Speech Recognition (ASR) has experienced a dramatic evolution since pioneer development of Bell Lab’s single-digit recognizer more than 50 years ago. Current ASR systems have taken advantage of the tremendous ... -
Development of the Feature Extractor for Speech Recognition
(Universitat Politècnica de Catalunya, 2009-10)
Master thesis (pre-Bologna period)
Open AccessWith this diploma work we have attempted to give continuity to the previous work done by other researchers called, Voice Operating Intelligent Wheelchair – VOIC [1]. A development of a wheelchair controlled by voice is ... -
Direct expressive voice training based on semantic selection
(International Speech Communication Association (ISCA), 2016)
Conference report
Restricted access - publisher's policyThis work aims at creating expressive voices from audiobooks using semantic selection. First, for each utterance of the audiobook an acoustic feature vector is extracted, including iVectors built on MFCC and on F0 ...