  • Open-ended visual question answering 

    Masuda Mora, Issey (Universitat Politècnica de Catalunya, 2016-07-15)
    Bachelor thesis
    Open Access
    This thesis studies methods to solve Visual Question-Answering (VQA) tasks with a Deep Learning framework. As a preliminary step, we explore Long Short-Term Memory (LSTM) networks used in Natural Language Processing (NLP) ...
  • Voice conversion using Deep Learning 

    Aparicio Isarn, Albert (Universitat Politècnica de Catalunya, 2017-05-15)
    Bachelor thesis
    Open Access
    In this project we present a first attempt at a Voice Conversion system based on Deep Learning in which the alignment between the training data is intrinsic to the model. Our system is structured in three main blocks. The ...
  • Voice generation using deep learning 

    Gómez Sánchez, Gonzalo (Universitat Politècnica de Catalunya, 2016-09-28)
    Bachelor thesis
    Open Access
    Voice generation, also known as Speech Synthesis, is the artificial production of human speech. Over the last decade, Speech Synthesis research has focused on a technique called Statistical Parametric Speech Synthesis. ...