• Open-ended visual question answering 

    Masuda Mora, Issey (Universitat Politècnica de Catalunya, 2016-07-15)
    Treball Final de Grau
    Accés obert
    This thesis studies methods to solve Visual Question-Answering (VQA) tasks with a Deep Learning framework. As a preliminary step, we explore Long Short-Term Memory (LSTM) networks used in Natural Language Processing (NLP) ...
  • Voice conversion using Deep Learning 

    Aparicio Isarn, Albert (Universitat Politècnica de Catalunya, 2017-05-15)
    Treball Final de Grau
    Accés obert
    In this project we present a first attempt at a Voice Conversion system based on Deep Learning in which the alignment between the training data is intrinsic to the model. Our system is structured in three main blocks. The ...
  • Voice generation using deep learning 

    Gómez Sánchez, Gonzalo (Universitat Politècnica de Catalunya, 2016-09-28)
    Treball Final de Grau
    Accés obert
    Voice generation, also known as Speech Synthesis, is the artificial production of human speech. In the last decade, the Speech Synthesis research has been focused on a technique called Statistical Parametric Speech Synthesis. ...