Browsing by Contributor "Hernando Pericás, Francisco Javier"
-
A multi-microphone approach to speech processing in a smart-room environment
(Universitat Politècnica de Catalunya, 2007-06-29)
Doctoral thesis
Open Access
Recent advances in computer technology and in speech and language processing, among others, have made it possible for new ways of communication between people and machines to begin to seem feasible. ... -
A system for robust iris recognition
(Universitat Politècnica de Catalunya, 2012-02-13)
Master thesis (pre-Bologna period)
Restricted access - author's decision
Biometric technologies are becoming important due to the increasing demand for flexible and robust systems for identification and verification of individuals. In recent years, several biometrics have been proposed ... -
Age prediction by voice using deep learning
(Universitat Politècnica de Catalunya, 2023-01-30)
Master thesis
Open Access
One of the main topics in artificial intelligence is speech characterization. Moreover, it is a field of study with minimal scope where the Catalan language is involved. In this project, we try to perform an age ... -
Análisis y diseño de Dataset para el reconocimiento de COVID-19 en señales de tos con Redes Neuronales Profundas
(Universitat Politècnica de Catalunya, 2022-05-23)
Bachelor thesis
Open Access
This project tries to improve, as much as possible, the detection of COVID-19 from acoustic cough signals using a convolutional neural network. To accomplish these objectives, we started with ... -
Augment de dades de veu per a sistemes de processament de la parla
(Universitat Politècnica de Catalunya, 2023-01-31)
Bachelor thesis
Open Access
We live in an era where intelligent systems are becoming more and more part of our lives. These systems require a large amount of data to learn different tasks and, in many cases, not enough content is available to train ... -
Caracterització de locutors usant deep learning
(Universitat Politècnica de Catalunya, 2018-06-26)
Bachelor thesis
Open Access
Development of a computational system capable of comparing two audio files, in each of which a person speaks, and determining whether they share the same speaker, with an optimal error of 29.77%; and of a graphical interface that uses it. -
Deep learning bottle-neck features for speaker recognition
(Universitat Politècnica de Catalunya, 2018-10)
Bachelor thesis
Open Access
Speaker recognition is a very useful and powerful technology that has many interesting security applications, which makes it a research field worth considerable effort. Recognizing whether a person is who he/she claims ... -
Deep Learning for Demographic Classification by Speech
(Universitat Politècnica de Catalunya, 2022-10-26)
Master thesis
Open Access
Speech characterization is a challenging task and one of the most relevant challenges in AI. Moreover, it is a field of study with minimal scope in the Catalan language. In this work, we try to perform a demographic ... -
Deep learning for i-vector speaker and language recognition
(Universitat Politècnica de Catalunya, 2018-05-29)
Doctoral thesis
Open Access
Over the last few years, i-vectors have been the state-of-the-art technique in speaker and language recognition. Recent advances in Deep Learning (DL) technology have improved the quality of i-vectors but the DL techniques ... -
Deep learning for speaker characterization
(Universitat Politècnica de Catalunya, 2022-07-05)
Bachelor thesis
Open Access
Speaker characterization is one of the most relevant tasks in many artificial intelligence applications. Just as these technologies improve and the amount of available data increases, it is also ... -
Deep Neural Networks for Channel Compensated i-Vectors in Speaker Recognition
(Universitat Politècnica de Catalunya, 2014-06)
Bachelor thesis
Open Access
This thesis explores the application of channel-compensation techniques in speaker verification and their subsequent combination with deep learning technologies. The idea is to reduce the degradation of the performance due ... -
Design and evaluation of a conversational agent to support focused knowledge work
(Universitat Politècnica de Catalunya, 2022-09-15)
Master thesis
Open Access
The rise of social networks and the increase in notifications we receive on our phones have had a negative impact on the way people work. Distractions have become a more serious problem at times when productivity is ... -
Detection and handling of overlapping speech for speaker diarization
(Universitat Politècnica de Catalunya, 2012-01-31)
Doctoral thesis
Open Access
For the last several years, speaker diarization has been attracting substantial research attention as one of the spoken language technologies applied for the improvement, or enrichment, of recording transcriptions. ... -
Discriminative features for GMM and i-vector based speaker diarization
(Universitat Politècnica de Catalunya, 2017-09-20)
Doctoral thesis
Open Access
Speaker diarization has received considerable research attention over the last decade. Among the different domains of speaker diarization, diarization in the meeting domain is the most challenging one. It usually contains spontaneous ... -
Diseño e implementación de un sistema de deep learning para la detección de covid por la tos con aumento de datos
(Universitat Politècnica de Catalunya, 2022-05-23)
Bachelor thesis
Open Access
In recent years, COVID-19 has had a major impact on today's society and has not gone unnoticed in the world of deep learning. Cases can now be studied using image analysis or audio analysis. This report ... -
Emociones en señales de voz: reconocimiento con redes neuronales profundas.
(Universitat Politècnica de Catalunya, 2021-10-27)
Bachelor thesis
Open Access
In recent years, research effort on many tasks has relied on neural network techniques. This work continues along this line and explores their possibilities in the task of speech emotion recognition (SER). In this work, ... -
Feature extraction for speaker diarization
(Universitat Politècnica de Catalunya, 2016-04-15)
Master thesis (pre-Bologna period)
Restricted access - confidentiality agreement
Feature extraction for speaker diarization using different databases -
Fusing prosodic and acoustic information for speaker recognition
(Universitat Politècnica de Catalunya, 2008-10-29)
Doctoral thesis
Open Access
Automatic speaker recognition is the use of a machine to identify an individual from a spoken message. Recently, this technology has seen an increase in use across various applications ... -
Identificació de veu mitjançant xarxes neuronals profundes implementades sobre FPGA
(Universitat Politècnica de Catalunya, 2016-04-29)
Bachelor thesis
Open Access
The project presents a system for identifying the speaker's gender from the voice, based on a deep neural network implemented on an FPGA device. The object of study is the performance, the latency and the ... -
Integración de biometría de voz en un sistema de pago por teléfono: verificación del locutor dependiente del texto
(Universitat Politècnica de Catalunya, 2016-09)
Bachelor thesis
Open Access
The aim of this project is to obtain a text-dependent speaker verification system using speech biometrics. We recover software created by the Department of Signal and Communications Theory, which had been modified. It has ...