
    • Identificació de veu mitjançant xarxes neuronals profundes implementades sobre FPGA 

      Palet Gual, Marc (Universitat Politècnica de Catalunya, 2016-04-29)
      Bachelor thesis
      Open Access
      The project presents a system that identifies a speaker's gender from their voice, based on a deep neural network implemented on an FPGA device. The object of study is the performance, the latency and the ...
    • Integración de biometría de voz en un sistema de pago por teléfono: verificación del locutor dependiente del texto 

      Masana de Bouffard, Judit (Universitat Politècnica de Catalunya, 2016-09)
      Bachelor thesis
      Open Access
      The aim of this project is to obtain a text-dependent speaker verification system using speech biometrics. We recover software created by the Department of Signal and Communications Theory, which had been modified. It has ...
    • Integración de tecnologías de audio en entornos inteligentes 

      Lendoiro Rodríguez, Diego (Universitat Politècnica de Catalunya, 2013-12-16)
      Master thesis (pre-Bologna period)
      Open Access
      (English) Audio technologies are a key part of the development of smart environments. As for human beings, audio technologies are a great information resource from which large amounts of data can be obtained. This data ...
    • Integration of speech biometrics in a phone payment system: text-independent speaker verification 

      Barón Garcia, Anna (Universitat Politècnica de Catalunya, 2016-09)
      Bachelor thesis
      Open Access
      Nowadays, the integration of biometrics in security systems is a prominent research and application field. Also, it is clear that speech is the most common form of communication, which makes it a good candidate. While using ...
    • Llibre blanc sobre la Intel·ligència Artificial aplicada a l’Educació i també a la Llengua : la IA per donar resposta als reptes del sector l’Educació i als reptes de la Llengua a Catalunya 

      Gibert, Karina; Catala Roig, Neus; Hernando Pericás, Francisco Javier; Padró, Lluís; Rodríguez Fonollosa, José Adrián (Centre of Innovation for Data Tech and Artificial Intelligence, 2023-08)
      Book
      Open Access
      The aim of this white paper is to help drive and promote the adoption and incorporation of AI in the education sector, as well as to analyse the importance of its applications to the Catalan language, ...
    • Multimodal emotion recognition via face and voice 

      Griera i Jiménez, Oriol (Universitat Politècnica de Catalunya, 2022-07-14)
      Master thesis
      Open Access
      Recent advances in technology have allowed humans to interact with computers in ways previously unimaginable. Despite significant progress, a necessary element for natural interaction is still lacking: emotions. Emotions ...
    • Normalización estadística para fusión biométrica multimodal 

      Ejarque Monserrate, Pascual (Universitat Politècnica de Catalunya, 2011-03-17)
      Doctoral thesis
      Open Access
      Biometric recognition systems use certain human characteristics such as the voice, facial features, fingerprints, the iris or hand geometry to identify an individual or verify their ...
    • Noves característiques de veu per la diarització de parlants 

      Casanovas Duch, Artemi (Universitat Politècnica de Catalunya, 2015-10)
      Bachelor thesis
      Open Access
      Voice feature fusion has been successfully used both to identify and verify speakers and to detect voice pathologies. Nowadays, it is also being tested in the speaker diarization task. The following thesis shows a study on ...
    • Online mail classifier using Deep Learning 

      Gil Garcia, Enric (Universitat Politècnica de Catalunya, 2022-02-03)
      Master thesis
      Restricted access - confidentiality agreement
      This thesis has been developed at NTTData for an external client. The client requires an autonomous mail classifier since they currently classify mails manually. The thesis consists of the development of a Deep Learning ...
    • Parallel scalability of face detection in heterogeneous multithreaded architectures 

      Oro García, David (Universitat Politècnica de Catalunya, 2020-11-17)
      Doctoral thesis
      Open Access
      Recently, facial recognition systems have become extremely popular and deployments of this technology are now ubiquitous. Applications ranging from access control to automated surveillance of video feeds rely on facial ...
    • Predicting emotion in speech: a Deep Learning approach using Attention mechanisms 

      Aromí Leaverton, Daniel (Universitat Politècnica de Catalunya, 2021-06)
      Bachelor thesis
      Open Access
      Speech Emotion Recognition (SER) has recently become a popular field of research because of its implications in human-computer interaction. In this study, the emotional state of the speaker is successfully predicted by ...
    • Robust speaker diarization for meetings 

      Anguera Miró, Xavier (Universitat Politècnica de Catalunya, 2006-12-21)
      Doctoral thesis
      Open Access
      This doctoral thesis presents the research carried out in the area of speaker diarization for meeting rooms. It studies the algorithms and the implementation of an offline system for the segmentation and clustering of ...
    • Self-supervised deep learning approaches to speaker recognition 

      Khan, Umair (Universitat Politècnica de Catalunya, 2021-01-11)
      Doctoral thesis
      Open Access
      In speaker recognition, i-vectors have been the state-of-the-art unsupervised technique over the last few years, whereas x-vectors are now becoming the state-of-the-art supervised technique. Recent advances in Deep ...
    • Sistema multimodal para el reconocimiento de personas en grabaciones de TV 

      Cortillas Liesa, Carla (Universitat Politècnica de Catalunya, 2016-09)
      Bachelor thesis
      Open Access
      The project described in this document falls within the topic of person recognition in TV recordings by means of multimodal systems. It has been developed as a collaboration with image and audio processing groups in the signal ...
    • Speaker diarization and tracking in multiple-sensor environments 

      Luque Serrano, Jordi (Universitat Politècnica de Catalunya, 2012-12-21)
      Doctoral thesis
      Open Access
      This thesis covers the research conducted on speaker recognition in real conditions such as meeting rooms, telephone-quality speech, and radio and TV broadcast news. The main objective concerns the ...
    • Speaker tracking system using speaker boundary detection 

      Khan, Umair (Universitat Politècnica de Catalunya, 2016-11)
      Master thesis
      Open Access
      This thesis reports research conducted in the area of speaker recognition. The application concerns the automatic detection and tracking of target speakers in meetings, conferences, telephone conversations and ...
    • Speech emotion recognition: Un sistema de reconocimiento de emociones por voz basado en Ivectors 

      Pérez Pascual, Francesc (Universitat Politècnica de Catalunya, 2017-05)
      Bachelor thesis
      Open Access
      The Speech Emotion Recognition project covers the design and implementation of an emotion recognition system that analyzes the characteristics of the voice signal, based on the use of the i-vector technique, which is ...
    • Study and implementation of manifold regularization techniques in neural networks 

      Tubau Pires, Miquel (Universitat Politècnica de Catalunya, 2017-01)
      Bachelor thesis
      Open Access
      Covenantee: Università degli studi di Roma "La Sapienza"
      In recent years, semi-supervised learning has become one of the most important research topics in machine learning. Dealing with the situation where few labeled training points are available, but a large number ...
    • Towards blind extraction of guitar effects 

      González Comalada, Laura (Universitat Politècnica de Catalunya, 2022-09-05)
      Bachelor thesis
      Open Access
      Covenantee: Leibniz Universität Hannover
      In this work, a study has been carried out on the behaviour of MFCCs when distortion is applied to a signal. Subsequently, the conclusions drawn from the study have been used to customise an LSTM network to allow the ...
    • Unsupervised and attention approaches for deep learning speaker recognition 

      Safari, Pooyan (Universitat Politècnica de Catalunya, 2023-05-19)
      Doctoral thesis
      Open Access
      (English) The thesis presents various contributions to the field of speaker recognition. In the first part of the thesis, the focus is on unsupervised methods for speaker recognition. This includes a method that uses Deep ...