Exploració per altres contribucions "Hernando Pericás, Francisco Javier"

Feature extraction for speaker diarization

Negre Rabassa, Enric (Universitat Politècnica de Catalunya, 2016-04-15)
Projecte/Treball Final de Carrera
Accés restringit per acord de confidencialitat

Feature extraction for speaker diarization using different databases

Fusing prosodic and acoustic information for speaker recognition

Farrús Cabeceran, Mireia (Universitat Politècnica de Catalunya, 2008-10-29)
Tesi
Accés obert

El reconeixement automàtic del locutor és la utilització d’una màquina per identificar un individu a partir de d’un missatge parlat. Recentment, aquesta tecnologia ha experimentat un increment en l’ús de diverses aplicacions ...

Identificació de veu mitjançant xarxes neuronals profundes implementades sobre FPGA

Palet Gual, Marc (Universitat Politècnica de Catalunya, 2016-04-29)
Treball Final de Grau
Accés obert

El projecte presenta un sistema d'identificació del gènere de l'interlocutor a partir de la veu basat en una xarxa neuronal profunda implementada en un dispositiu FPGA. L'objecte d'estudi és el rendiment la latència i la ...

Integración de biometría de voz en un sistema de pago por teléfono: verificación del locutor dependiente del texto

Masana de Bouffard, Judit (Universitat Politècnica de Catalunya, 2016-09)
Treball Final de Grau
Accés obert

The aim of this project is obtain a text-dependent speaker verification system using speech biometrics. We recover software created by the Department of Signal and Communications Theory, which had been modified. It has ...

Integración de tecnologías de audio en entornos inteligentes

Lendoiro Rodríguez, Diego (Universitat Politècnica de Catalunya, 2013-12-16)
Projecte/Treball Final de Carrera
Accés obert

Anglès: Audio technologies are a key part in the development of smart environments. As for the human beings, audio tecnologies, are a great information resource from which big amounts of data can be obtained. This data ...

Integration of speech biometrics in a phone payment system: text-independent speaker verification

Barón Garcia, Anna (Universitat Politècnica de Catalunya, 2016-09)
Treball Final de Grau
Accés obert

Nowadays, the integration of biometrics in security systems is a prominent research and application field. Also, it is clear that speech is the most common form of communication, which makes a swell candidate. While using ...

Llibre blanc sobre la Intel·ligència Artificial aplicada a l’Educació i també a la Llengua : la IA per donar resposta als reptes del sector l’Educació i als reptes de la Llengua a Catalunya

Gibert, Karina; Catala Roig, Neus; Hernando Pericás, Francisco Javier; Padró, Lluís; Rodríguez Fonollosa, José Adrián (Centre of Innovation for Data Tech and Artificial Intelligence, 2023-08)
Llibre
Accés obert

L'objectiu d'aquest llibre blanc és contribuir a l'impuls i el foment de l'adopció i la incorporació de la IA en el sector de l'educació com també analitzar la importància de les seves aplicacions a la llengua catalana, ...

Multimodal emotion recognition via face and voice

Griera i Jiménez, Oriol (Universitat Politècnica de Catalunya, 2022-07-14)
Projecte Final de Màster Oficial
Accés obert

Recent advances in technology have allowed humans to interact with computers in ways previously unimaginable. Despite significant progress, a necessary element for natural interaction is still lacking: emotions. Emotions ...

Normalización estadística para fusión biométrica multimodal

Ejarque Monserrate, Pascual (Universitat Politècnica de Catalunya, 2011-03-17)
Tesi
Accés obert

Los sistemas de reconocimiento biométrico utilizan ciertas características humanas como la voz, los rasgos faciales, la huella dactilar, el iris o la geometría de la mano para identificar a un individuo o verificar su ...

Noves característiques de veu per la diarització de parlants

Casanovas Duch, Artemi (Universitat Politècnica de Catalunya, 2015-10)
Treball Final de Grau
Accés obert

Voice features fusion has been successfully used to both identify and verify speakers and to detect voice pathologies. Nowadays, it is also being tested in speaker diarization task. The following thesis shows a study on ...

Online mail classifier using Deep Learning

Gil Garcia, Enric (Universitat Politècnica de Catalunya, 2022-02-03)
Projecte Final de Màster Oficial
Accés restringit per acord de confidencialitat

This thesis has been developed at NTTData for an external client. The client requires an autonomous mail classifier since they currently classify mails manually. The thesis consists in the development of a Deep Learning ...

Parallel scalability of face detection in heterogeneous multithreaded architectures

Oro García, David (Universitat Politècnica de Catalunya, 2020-11-17)
Tesi
Accés obert

Recently, facial recognition systems have become extremely popular and deployments of this technology are now ubiquitous. Applications ranging from access control to automated surveillance of video feeds rely on facial ...

Predicting emotion in speech: a Deep Learning approach using Attention mechanisms

Aromí Leaverton, Daniel (Universitat Politècnica de Catalunya, 2021-06)
Treball Final de Grau
Accés obert

Speech Emotion Recognition (SER) has recently become a popular field of research because of its implications in human-computer interaction. In this study, the emotional state of the speaker is successfully predicted by ...

Robust speaker diarization for meetings

Anguera Miró, Xavier (Universitat Politècnica de Catalunya, 2006-12-21)
Tesi
Accés obert

Aquesta tesi doctoral mostra la recerca feta en l'àrea de la diarització de locutor per a sales de reunions. En la present s'estudien els algorismes i la implementació d'un sistema en diferit de segmentació i aglomerat de ...

Self-supervised deep learning approaches to speaker recognition

Khan, Umair (Universitat Politècnica de Catalunya, 2021-01-11)
Tesi
Accés obert

In speaker recognition, i-vectors have been the state-of-the-art unsupervised technique over the last few years, whereas x-vectors is becoming the state-of-the-art supervised technique, these days. Recent advances in Deep ...

Sistema multimodal para el reconocimiento de personas en grabaciones de TV

Cortillas Liesa, Carla (Universitat Politècnica de Catalunya, 2016-09)
Treball Final de Grau
Accés obert

The Project described in this document falls within the topic of person recognition in TV recordings by mean of multimodal systems. It has been developed as collaboration with image and audio processing groups in the signal ...

Speaker diarization and tracking in multiple-sensor environments

Luque Serrano, Jordi (Universitat Politècnica de Catalunya, 2012-12-21)
Tesi
Accés obert

This thesis verses about the research conducted in the topic of speaker recognition in real conditions like as meeting rooms, telephone quality speech and radio and TV broadcast news. The main objective is concerned to the ...

Speaker tracking system using speaker boundary detection

Khan, Umair (Universitat Politècnica de Catalunya, 2016-11)
Projecte Final de Màster Oficial
Accés obert

This thesis is about a research conducted in the area of Speaker Recognition. The application is concerned to the automatic detection and tracking of target speakers in meetings, conferences, telephone conversations and ...

Speech emotion recognition: Un sistema de reconocimiento de emociones por voz basado en Ivectors

Pérez Pascual, Francesc (Universitat Politècnica de Catalunya, 2017-05)
Treball Final de Grau
Accés obert

The Speech Emotion Recognition project verses about the design and implementation of an emotion recognition system by analyzing the characteristics of the voice signal, based on the use of the Ivectors technique, which is ...

Study and implementation of manifold regularization techniques in neural networks

Tubau Pires, Miquel (Universitat Politècnica de Catalunya, 2017-01)
Treball Final de Grau
Accés obert
Realitzat a/amb: Università degli studi di Roma "La Sapienza"

During the last years, semi-supervised learning has become one of the most important topics for research in machine learning. Dealing with the situation where few labeled training points are available, but a large number ...