    • A multi-microphone approach to speech processing in a smart-room environment 

      Abad Gareta, Alberto (Universitat Politècnica de Catalunya, 2007-06-29)
      Doctoral thesis
      Open Access
      Recent advances in computer technology and in speech and language processing, among others, have made new ways of communication between people and machines begin to seem feasible. ...
    • A system for robust iris recognition 

      Pérez Parera, Dídac (Universitat Politècnica de Catalunya, 2012-02-13)
      Master thesis (pre-Bologna period)
      Restricted access - author's decision
      Biometric technologies are becoming important due to the increasing demand for flexible and robust systems for identification and verification of individuals. In recent years, some biometrics have been proposed ...
    • Acoustic classification of marine mammals by deep learning 

      Cavero Carreras, Pol (Universitat Politècnica de Catalunya, 2023-09-14)
      Master thesis
      Open Access
      The classification of marine mammals has been an important topic for monitoring their population numbers. Many species of whales are considered endangered, making this task even more imperative. In this thesis two species ...
    • Acoustic detection of marine mammals by deep learning 

      Casals I Salvador, Marc (Universitat Politècnica de Catalunya, 2023-10-18)
      Master thesis
      Open Access
      In oceanography, specifically in the study of whales, the classification of the different species is an essential task for passive acoustic observation. It is fundamental for the tracking of some species that have become ...
    • Age prediction by voice using deep learning 

      Linde Martínez, David (Universitat Politècnica de Catalunya, 2023-01-30)
      Master thesis
      Open Access
      Speech characterization is one of the main topics in artificial intelligence. Moreover, it is a field that has received minimal study where the Catalan language is involved. In this project, we try to perform an age ...
    • Análisis y diseño de Dataset para el reconocimiento de COVID-19 en señales de tos con Redes Neuronales Profundas 

      Sánchez Bergés, Marc (Universitat Politècnica de Catalunya, 2022-05-23)
      Bachelor thesis
      Open Access
      This project will try to improve the detection results of the COVID-19 disease as much as possible from acoustic signals of coughing through a convolutional neural network. To accomplish these objectives, we started with ...
    • Augment de dades de veu per a sistemes de processament de la parla 

      Falceto Piñol, Anna (Universitat Politècnica de Catalunya, 2023-01-31)
      Bachelor thesis
      Open Access
      We live in an era where intelligent systems are becoming more and more part of our lives. These systems require a large amount of data to learn different tasks and, in many cases, not enough content is available to train ...
    • Caracterització de locutors usant deep learning 

      Alcón Doganoc, Miguel (Universitat Politècnica de Catalunya, 2018-06-26)
      Bachelor thesis
      Open Access
      Development of a computational system capable of comparing two audio files, each containing one person speaking, and determining whether they share the same speaker, with an optimal error of 29.77%; and of a graphical interface that uses it.
    • Catalan Accent Classification by Voice using Deep Learning 

      Felip I Díaz, Bernat (Universitat Politècnica de Catalunya, 2023-05-25)
      Master thesis
      Open Access
      Speech characterization is a vital field in artificial intelligence, yet accent classification is often overlooked, particularly for the Catalan language. This project is centered on the classification of Catalan accents ...
    • Data Augmentation for Speech Processing 

      Sánchez Shiromizu, Lucas Takanori (Universitat Politècnica de Catalunya, 2023-07-07)
      Bachelor thesis
      Open Access
      Development of a data augmentation program for speech processing systems
    • Deep learning bottle-neck features for speaker recognition 

      Cumalat Puig, Eudald (Universitat Politècnica de Catalunya, 2018-10)
      Bachelor thesis
      Open Access
      Speaker recognition is a very useful and powerful technology with many interesting security applications, which makes it a research field that merits considerable effort. Recognizing whether a person is who he/she claims ...
    • Deep Learning for Demographic Classification by Speech 

      Navarrete Jiménez, Daniel (Universitat Politècnica de Catalunya, 2022-10-26)
      Master thesis
      Open Access
      Speech characterization is a challenging task and one of the most relevant challenges in AI. Moreover, it is a field of study with minimal scope in the Catalan language. In this work, we try to perform a demographic ...
    • Deep learning for i-vector speaker and language recognition 

      Ghahabi Esfahani, Omid (Universitat Politècnica de Catalunya, 2018-05-29)
      Doctoral thesis
      Open Access
      Over the last few years, i-vectors have been the state-of-the-art technique in speaker and language recognition. Recent advances in Deep Learning (DL) technology have improved the quality of i-vectors but the DL techniques ...
    • Deep learning for speaker characterization 

      Garriga Artieda, Daniel (Universitat Politècnica de Catalunya, 2022-07-05)
      Bachelor thesis
      Open Access
      Speaker characterization is one of the most relevant tasks in many artificial intelligence applications. As these technologies improve and the amount of available data increases, it is also ...
    • Deep Neural Networks for Channel Compensated i-Vectors in Speaker Recognition 

      Jiménez Sanfiz, Albert (Universitat Politècnica de Catalunya, 2014-06)
      Bachelor thesis
      Open Access
      This thesis explores the application of channel-compensation techniques in speaker verification and the posterior combination with deep learning technologies. The idea is to reduce the degradation of the performance due ...
    • Design and evaluation of a conversational agent to support focused knowledge work 

      Llaneras Gayà, Miquel (Universitat Politècnica de Catalunya, 2022-09-15)
      Master thesis
      Open Access
      The rise of social networks and the increase in notifications we receive on our phones have had a negative impact on the way people work. Distractions have become a more serious problem at times when productivity is ...
    • Detection and handling of overlapping speech for speaker diarization 

      Zelenák, Martin (Universitat Politècnica de Catalunya, 2012-01-31)
      Doctoral thesis
      Open Access
      For the last several years, speaker diarization has been attracting substantial research attention as one of the spoken language technologies applied for the improvement, or enrichment, of recording transcriptions. ...
    • Discriminative features for GMM and i-vector based speaker diarization 

      Zewoudie, Abraham Woubie (Universitat Politècnica de Catalunya, 2017-09-20)
      Doctoral thesis
      Open Access
      Speaker diarization has received considerable research attention over the last decade. Among the different domains of speaker diarization, diarization in the meeting domain is the most challenging one. It usually contains spontaneous ...
    • Diseño e implementación de un sistema de deep learning para la detección de covid por la tos con aumento de datos 

      Marchan Del Pino, David (Universitat Politècnica de Catalunya, 2022-05-23)
      Bachelor thesis
      Open Access
      In recent years, COVID-19 has had a major impact on today's society and has not gone unnoticed in the world of deep learning. Cases can now be studied using image analysis or audio analysis. This report ...
    • Emociones en señales de voz: reconocimiento con redes neuronales profundas. 

      Hernández Leal, Victor Emilio (Universitat Politècnica de Catalunya, 2021-10-27)
      Bachelor thesis
      Open Access
      In recent years, research effort on many different tasks has centered on neural network techniques. This work continues along this line and explores their possibilities in the task of speech emotion recognition (SER). In this work, ...