Browse by subject "Processament de la parla" (Speech processing)
Showing items 21-40 of 214
-
Aplicación Android de movilidad de invidentes
(Universitat Politècnica de Catalunya, 2011-05-09)
Bachelor's thesis
Open access
In this project, part of an Android mobility application for blind users was developed. The destination is entered by voice, and from there the user is guided using various tools. A ...
-
APVQ encoder applied to wideband speech coding
(Institute of Electrical and Electronics Engineers (IEEE), 1996)
Conference proceedings paper
Open access
The paper describes a coding scheme for broadband speech (sampling frequency 16 kHz). The authors present a wideband speech encoder called APVQ (adaptive predictive vector quantization). It combines subband coding, vector ...
-
AR modeling of the speech autocorrelation to improve noisy speech recognition
(1992)
Conference proceedings paper
Open access
Speech recognition in noisy environments remains an unsolved problem even in the case of isolated word recognition with small vocabularies. Recently, several techniques have been proposed to alleviate this problem. Concretely, ...
-
Asignación secuencial de canales para tráfico de voz y datos en entornos móviles celulares
(Universitat Politècnica de Catalunya, 2007-11-22)
Final degree project/thesis
Open access
This document presents a study of different models for allocating time slots (channels) in radio networks, with the aim of obtaining the largest number of consecutive free time slots, taking into account ...
-
Audio classification experiments in a neonatal intensive care unit
(Universitat Politècnica de Catalunya, 2014-06-25)
Final degree project/thesis
Open access
[ENGLISH] Newborns delivered at a gestational age of 24-32 weeks commonly have health problems. The use of a Neonatal Intensive Care Unit (NICU) is, in most cases, crucial for their survival. Nowadays, it is known ...
-
Augment de dades de veu per a sistemes de processament de la parla
(Universitat Politècnica de Catalunya, 2023-01-31)
Bachelor's thesis
Open access
We live in an era where intelligent systems are becoming more and more part of our lives. These systems require a large amount of data to learn different tasks and, in many cases, not enough content is available to train ...
-
Auto-encoding nearest neighbor i-vectors for speaker verification
(International Speech Communication Association (ISCA), 2019)
Conference paper
Open access
In recent years, i-vectors followed by cosine or PLDA scoring techniques were the state-of-the-art approach in speaker verification. PLDA requires labeled background data, and there exists a significant performance gap ...
-
Bandwidth extension of narrowband speech
(Universidad Politécnica de Valencia, 2014)
Conference proceedings paper
Open access
Recently, 4G mobile phone systems have been designed to process wideband speech signals whose sampling frequency is 16 kHz. However, most of the mobile and classical phone network, and current 3G mobile phones, still ...
-
Bit-slice implementation of a linear predictive vocoder
(1985)
Conference proceedings paper
Open access
A digital 16-bit high-speed general-purpose signal processor is shown. The main objective has been the implementation of a linear predictive vocoder for obtaining real-time speech compression. For real-time digital speech ...
-
Building synthetic voices in the META-NET framework
(2012)
Conference proceedings paper
Restricted access due to publisher policy
METANET 4 U is a European project aiming at supporting language technology for European languages and multilingualism. It is a project in the META-NET Network of Excellence, a cluster of projects aiming at fostering the ...
-
CDHMM speaker recognition by means of frequency filtering of filter-bank energies
(1997)
Conference proceedings paper
Open access
Recently, the set of spectral parameters of every speech frame that results from filtering the frequency sequence of mel-scaled filter-bank energies with a simple first-order high-pass FIR filter has proved to be an efficient ...
-
Channel selection and reverberation-robust automatic speech recognition
(Universitat Politècnica de Catalunya, 2013-11-11)
Thesis
Open access
If speech is acquired by a close-talking microphone in a controlled and noise-free environment, current state-of-the-art recognition systems often show an acceptable error rate. The use of close-talking microphones, however, ...
-
Codificación APVQ de voz en banda ancha para velocidades entre 16 y 32 KBPS
(1996)
Conference proceedings paper
Open access
This paper describes a coding scheme for broadband speech (sampling frequency 16 kHz). We present a wideband speech encoder called APVQ (Adaptive Predictive Vector Quantization). It combines Subband Coding, Vector Quantization ...
-
Codificación APVQ de voz en banda ancha usando asignación dinámica de bits
(Universidad de Valladolid, 1995)
Conference proceedings paper
Open access
This paper describes a coding scheme for broadband speech. It can be seen as a vectorial extension of a conventional ADPCM encoder. In this scheme, the signal vector is formed with one sample of the normalized prediction error ...
-
Codificación APVQ-extendida de voz de banda ancha
(1994)
Conference proceedings paper
Open access
This paper describes a coding scheme for broadband speech. It can be seen as a vectorial extension of a conventional ADPCM encoder. In this scheme, the vector signal is formed with one sample of the normalized prediction ...
-
Comparative analysis of methods for the adaptation of Speech Emotion Recognition (SER) systems
(Universitat Politècnica de Catalunya, 2023-07-06)
Bachelor's thesis
Open access
Carried out at/with: University of New South Wales
The aim of this work is to analyse how the adaptation to certain speakers of a Speech Emotion Recognition (SER) system improves its performance by contrasting several variations of the adaptation procedure. The initial ...
-
Comparison of time & frequency filtering and cepstral-time matrix approaches in ASR
(1999)
Conference proceedings paper
Open access
In current speech recognition systems, speech is represented by a 2-D sequence of parameters that model the temporal evolution of the spectral envelope of speech. Linear transformation or filtering along both time and ...
-
Comportamiento de la transformación bilineal de frecuencias en reconocimiento de habla ruidosa
(1992)
Conference proceedings paper
Open access
-
Configuración e instalación de una PBX de VoIP basada en Asterisk
(Universitat Politècnica de Catalunya, 2013-05-06)
Bachelor's thesis
Open access
The project covers the configuration of an Asterisk PBX and its integration with various applications to provide value-added services. There is no doubt that VoIP is the telephony of the future because of the ...
-
Conversió de veu a text per a reunions virtuals: un estudi de transcripció automatitzada
(Universitat Politècnica de Catalunya, 2023-07-07)
Bachelor's thesis
Open access
In the last few years, the use of Deep Learning has increased in virtual assistance and speech recognition applications, improving its performance with supervised learning techniques. However, it is an area that continues ...