• Open-ended visual question answering 

      Masuda Mora, Issey (Universitat Politècnica de Catalunya, 2016-07-15)
      Treball Final de Grau
      Accés obert
      This thesis studies methods to solve Visual Question-Answering (VQA) tasks with a Deep Learning framework. As a preliminary step, we explore Long Short-Term Memory (LSTM) networks used in Natural Language Processing (NLP) ...
    • Voice conversion using Deep Learning 

      Aparicio Isarn, Albert (Universitat Politècnica de Catalunya, 2017-05-15)
      Treball Final de Grau
      Accés obert
      In this project we present a first attempt at a Voice Conversion system based on Deep Learning in which the alignment between the training data is intrinsic to the model. Our system is structured in three main blocks. The ...
    • Voice generation using deep learning 

      Gómez Sánchez, Gonzalo (Universitat Politècnica de Catalunya, 2016-09-28)
      Treball Final de Grau
      Accés obert
      Voice generation, also known as Speech Synthesis, is the artificial production of human speech. In the last decade, the Speech Synthesis research has been focused on a technique called Statistical Parametric Speech Synthesis. ...