Multimodal emotion recognition via face and voice

Document type: Master thesis
Date: 2022-07-14
Rights access: Open Access
All rights reserved. This work is protected by the corresponding intellectual and industrial
property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public
communication or transformation of this work are prohibited without permission of the copyright holder
Abstract
Recent advances in technology have allowed humans to interact with computers in ways previously unimaginable. Despite significant progress, a necessary element for natural interaction is still lacking: emotions. Emotions play an important role in human communication and interaction, allowing people to express themselves beyond the language domain. The purpose of this project is to develop a multimodal system that classifies emotions using facial expressions and voice extracted from videos. For facial emotion recognition, face images and optical flow frames are used to exploit the spatial and temporal information in the videos. For the voice, the model predicts the emotion from speech features extracted from chunked audio signals. The combination of the two biometrics through score-level fusion achieves excellent performance on the RAVDESS and BAUM-1 datasets. However, the results highlight the importance of further investigating the preprocessing techniques applied in this work to "normalize" the datasets to a unified format in order to improve cross-dataset performance.
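The score-level fusion mentioned in the abstract can be sketched as follows. This is a minimal illustration only: the emotion class set, the example scores, and the equal weighting are assumptions for demonstration, not details taken from the thesis.

```python
import numpy as np

# Assumed set of emotion classes (for illustration only).
EMOTIONS = ["angry", "happy", "neutral", "sad"]

# Hypothetical per-class probabilities produced by each modality
# for a single video clip.
face_scores = np.array([0.10, 0.60, 0.20, 0.10])   # face branch
voice_scores = np.array([0.20, 0.50, 0.10, 0.20])  # voice branch

def score_level_fusion(face, voice, w_face=0.5):
    """Weighted average of the two branches' class scores, one common
    score-level fusion scheme; the 50/50 weighting is an assumption."""
    fused = w_face * face + (1.0 - w_face) * voice
    return fused / fused.sum()  # renormalize to a probability vector

fused = score_level_fusion(face_scores, voice_scores)
print(EMOTIONS[int(np.argmax(fused))])  # → happy
```

The fused vector is a proper probability distribution, so the same argmax decision rule used for a single modality applies unchanged after fusion.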
Degree: MÀSTER UNIVERSITARI EN ENGINYERIA DE TELECOMUNICACIÓ (Pla 2013)
Files | Description | Size | Format | View
---|---|---|---|---
GrieraJimenezOriol_TFM.pdf | | 6,986Mb | PDF | View/Open