Now showing items 1-20 of 38

    • A multi-microphone approach to speech processing in a smart-room environment 

      Abad Gareta, Alberto (Universitat Politècnica de Catalunya, 2007-06-29)
      Doctoral thesis
      Open Access
      Els avenços recents en tecnologia informàtica i processament de la parla i del llenguatge, entre altres, han fet possible que noves maneres de comunicació entre les persones i les màquines comencin a semblar factibles. ...
    • A system for robust iris recognition 

      Pérez Parera, Dídac (Universitat Politècnica de Catalunya, 2012-02-13)
      Master thesis (pre-Bologna period)
      Restricted access - author's decision
      [ANGLÈS] Biometric technologies are becoming important due to the increasing demand for flexible and robust systems for identification and verification of individuals. In recent years, some biometrics has been proposed ...
    • Age prediction by voice using deep learning 

      Linde Martínez, David (Universitat Politècnica de Catalunya, 2023-01-30)
      Master thesis
      Open Access
      One of the main topics in artificial intelligence is the speech characterization. Moreover, it is a field of study with the minimal scope when the Catalan language is involved in. In this project, we try to perform an age ...
    • Análisis y diseño de Dataset para el reconocimiento de COVID-19 en señales de tos con Redes Neuronales Profundas 

      Sánchez Bergés, Marc (Universitat Politècnica de Catalunya, 2022-05-23)
      Bachelor thesis
      Open Access
      This project will try to improve the detection results of the COVID-19 disease as much as possible from acoustic signals of coughing through a convolutional neural network. To accomplish these objectives, we started with ...
    • Augment de dades de veu per a sistemes de processament de la parla 

      Falceto Piñol, Anna (Universitat Politècnica de Catalunya, 2023-01-31)
      Bachelor thesis
      Open Access
      We live in an era where intelligent systems are becoming more and more part of our lives. These systems require a large amount of data to learn different tasks and, in many cases, not enough content is available to train ...
    • Caracterització de locutors usant deep learning 

      Alcón Doganoc, Miguel (Universitat Politècnica de Catalunya, 2018-06-26)
      Bachelor thesis
      Open Access
      Desenvolupament d'un sistema computacional capaç de comparar dos fitxers d'àudio, on parla una persona, i determinar si comparteixen locutor, amb un error òptim del 29,77%; i d'una interfície gràfica que l'utilitza.
    • Deep learning bottle-neck features for speaker recognition 

      Cumalat Puig, Eudald (Universitat Politècnica de Catalunya, 2018-10)
      Bachelor thesis
      Open Access
      Speaker recognition is a very useful and powerful technology that has many interesting security applications, which makes it an investigation field to pour on many efforts. Recognizing whether a person is who he/she claims ...
    • Deep Learning for Demographic Classification by Speech 

      Navarrete Jiménez, Daniel (Universitat Politècnica de Catalunya, 2022-10-26)
      Master thesis
      Open Access
      Speech characterization is a challenging task and one of the most relevant challenges in AI. Moreover, it is a field of study with minimal scope in the Catalan language. In this work, we try to perform a demographic ...
    • Deep learning for i-vector speaker and language recognition 

      Ghahabi Esfahani, Omid (Universitat Politècnica de Catalunya, 2018-05-29)
      Doctoral thesis
      Open Access
      Over the last few years, i-vectors have been the state-of-the-art technique in speaker and language recognition. Recent advances in Deep Learning (DL) technology have improved the quality of i-vectors but the DL techniques ...
    • Deep learning for speaker characterization 

      Garriga Artieda, Daniel (Universitat Politècnica de Catalunya, 2022-07-05)
      Bachelor thesis
      Open Access
      La caracterización de un locutor es una de las tareas más relevantes en muchas aplicaciones de inteligencia artificial. Del mismo modo que estas tecnologías mejoran y se aumenta la cantidad de datos disponibles, es también ...
    • Deep Neural Networks for Channel Compensated i-Vectors in Speaker Recognition 

      Jiménez Sanfiz, Albert (Universitat Politècnica de Catalunya, 2014-06)
      Bachelor thesis
      Open Access
      This thesis explores the application of channel-compensation techniques in speaker verification and the posterior combination with deep learning technologies. The idea is to reduce the degradation of the performance due ...
    • Design and evaluation of a conversational agent to support focused knowledge work 

      Llaneras Gayà, Miquel (Universitat Politècnica de Catalunya, 2022-09-15)
      Master thesis
      Open Access
      The rise of social networks and the increase of notifications we receive on our phones has had a negative impact on the way people work. Distractions have become a more serious problem on such times when productivity is ...
    • Detection and handling of overlapping speech for speaker diarization 

      Zelenák, Martin (Universitat Politècnica de Catalunya, 2012-01-31)
      Doctoral thesis
      Open Access
      For the last several years, speaker diarization has been attracting substantial research attention as one of the spoken language technologies applied for the improvement, or enrichment, of recording transcriptions. ...
    • Discriminative features for GMM and i-vector based speaker diarization 

      Zewoudie, Abraham Woubie (Universitat Politècnica de Catalunya, 2017-09-20)
      Doctoral thesis
      Open Access
      Speaker diarization has received several research attentions over the last decade. Among the different domains of speaker diarization, diarization in meeting domain is the most challenging one. It usually contains spontaneous ...
    • Diseño e implementación de un sistema de deep learning para la detección de covid por la tos con aumento de datos 

      Marchan Del Pino, David (Universitat Politècnica de Catalunya, 2022-05-23)
      Bachelor thesis
      Open Access
      In recent years, COVID-19 has had a major impact on today's society and has not gone unnoticed in the world of deep learning. Cases can now be studied using image analysis or in the world of audio analysis. This report ...
    • Emociones en señales de voz: reconocimiento con redes neuronales profundas. 

      Hernández Leal, Victor Emilio (Universitat Politècnica de Catalunya, 2021-10-27)
      Bachelor thesis
      Open Access
      In recent years the research effort of different tasks is through neural network techniques. This work continues along this line and explores its possibilities in the task of speech emotion recognition (SER). In this work, ...
    • Feature extraction for speaker diarization 

      Negre Rabassa, Enric (Universitat Politècnica de Catalunya, 2016-04-15)
      Master thesis (pre-Bologna period)
      Restricted access - confidentiality agreement
      Feature extraction for speaker diarization using different databases
    • Fusing prosodic and acoustic information for speaker recognition 

      Farrús Cabeceran, Mireia (Universitat Politècnica de Catalunya, 2008-10-29)
      Doctoral thesis
      Open Access
      El reconeixement automàtic del locutor és la utilització d’una màquina per identificar un individu a partir de d’un missatge parlat. Recentment, aquesta tecnologia ha experimentat un increment en l’ús de diverses aplicacions ...
    • Identificació de veu mitjançant xarxes neuronals profundes implementades sobre FPGA 

      Palet Gual, Marc (Universitat Politècnica de Catalunya, 2016-04-29)
      Bachelor thesis
      Open Access
      El projecte presenta un sistema d'identificació del gènere de l'interlocutor a partir de la veu basat en una xarxa neuronal profunda implementada en un dispositiu FPGA. L'objecte d'estudi és el rendiment la latència i la ...
    • Integración de biometría de voz en un sistema de pago por teléfono: verificación del locutor dependiente del texto 

      Masana de Bouffard, Judit (Universitat Politècnica de Catalunya, 2016-09)
      Bachelor thesis
      Open Access
      The aim of this project is obtain a text-dependent speaker verification system using speech biometrics. We recover software created by the Department of Signal and Communications Theory, which had been modified. It has ...