L'àmbit de recerca del grup 'VEU' és el tractament de la parla. Investiguem tecnologies que permeten l'extracció d'informació que la veu conté: reconeixement del que es diu, l'idioma o el dialecte, característiques del parlant -qui és, la seva edat, el sexe, l'estat emocional-, la direcció del so. També treballem en la caracterització general de l'àudio, per determinar quan hi ha veu i quan hi ha altres esdeveniments acústics com música o sorolls diversos. Les tecnologies de la parla possibiliten generar veu -síntesis de veu- o modificar les seves

Recent Submissions

  • DNN speaker embeddings using autoencoder pre-training 

    Khan, Umair; Hernando Pericás, Francisco Javier (Institute of Electrical and Electronics Engineers (IEEE), 2019)
    Conference lecture
    Restricted access - publisher's policy
    Over the last years, i-vectors have been the state-of-the-art approach in speaker recognition. Recent improvements in deep learning have increased the discriminative quality of i-vectors. However, deep learning architectures ...
  • Conditional distribution variability measures for causality detection 

    Rodríguez Fonollosa, José Adrián (Springer, 2019)
    Part of book or chapter of book
    Restricted access - publisher's policy
    In this paper we derive variability measures for the conditional probability distributions of a pair of random variables, and we study its application in the inference of causal-effect relationships. We also study the ...
  • Electron density retrieval from truncated Radio Occultation GNSS data 

    Lyu, Haixia; Hernández Pajares, Manuel; Monte Moreno, Enrique; Cardellach Galí, Estel (2019-06-01)
    Article
    Open Access
    This paper summarizes the definition and validation of two complementary new strategies, to invert incomplete Global Navigation Satellite System Radio-Occultation (RO) ionospheric measurements, such as the ones to be ...
  • Chatbol, a chatbot for the Spanish “La Liga” 

    Segura, Carlos; Palau, Alex; Luque, Jordi; Ruiz Costa-Jussà, Marta; Banchs, Rafael E. (Springer, 2019-09-25)
    Article
    Restricted access - publisher's policy
    This work describes the development of a social chatbot for the football domain. The chatbot, named chatbol, aims at answering a wide variety of questions related to the Spanish football league “La Liga”. Chatbol is deployed ...
  • Dades, màquina i ètica 

    Ruiz Costa-Jussà, Marta (2019-02-01)
    Article
    Open Access
  • The TALP-UPC machine translation systems for WMT19 news translation task: pivoting techniques for low resource MT 

    Casas Manzanares, Noé; Rodríguez Fonollosa, José Adrián; Escolano Peinado, Carlos; Basta, Christine Raouf Saad; Ruiz Costa-Jussà, Marta (Association for Computational Linguistics, 2019)
    Conference report
    Restricted access - publisher's policy
    In this article, we describe the TALP-UPC research group participation in the WMT19 news translation shared task for Kazakh-English. Given the low amount of parallel training data, we resort to using Russian as pivot ...
  • Terminology-aware segmentation and domain feature for the WMT19 biomedical translation task 

    Carrino, Casimiro Pio; Rafieian, Bardia; Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián (Association for Computational Linguistics, 2019)
    Conference report
    Restricted access - publisher's policy
    In this work, we give a description of the TALP-UPC systems submitted for the WMT19 Biomedical Translation Task. Our proposed strategy is NMT model-independent and relies only on one ingredient, a biomedical terminology ...
  • BERT masked language modeling for co-reference resolution 

    Alfaro, Felipe; Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián (2019)
    Conference report
    Open Access
    This paper explains the TALP-UPC participation for the Gendered Pronoun Resolution shared-task of the 1st ACL Workshop on Gender Bias for Natural Language Processing. We have implemented two models for mask language modeling ...
  • ADDTID: An alternative tool for studying earthquake/tsunami signatures in the ionosphere. Case of the 2011 Tohoku earthquake 

    Yang, Heng; Monte Moreno, Enrique; Hernández Pajares, Manuel (Multidisciplinary Digital Publishing Institute (MDPI), 2019-09-13)
    Article
    Open Access
    Traveling Ionospheric Disturbances (ADDTID) algorithm. This algorithm automatically detects and characterizes Traveling Ionospheric Disturbances (TIDs) from Global Navigation Satellite System (GNSS) measurements. Applying ...
  • Neural networks principal component analysis for estimating the generative multifactor model of returns under a statistical approach to the arbitrage pricing theory: Evidence from the mexican stock exchange 

    Ladrón de Guevara Cortés, Rogelio; Torra Porras, Salvador; Monte Moreno, Enrique (2019-01-01)
    Article
    Open Access
    A nonlinear principal component analysis (NLPCA) represents an extension of the standard principal component analysis (PCA) that overcomes the limitation of the PCA’s assumption about the linearity of the model. The NLPCA ...
  • Wav2Pix: speech-conditioned face generation using generative adversarial networks 

    Cardoso Duarte, Amanda; Roldan, Francisco; Tubau, Miquel; Escur, Janna; Pascual de la Puente, Santiago; Salvador Aguilera, Amaia; Mohedano, Eva; McGuinness, Kevin; Torres Viñals, Jordi; Giró Nieto, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2019)
    Conference lecture
    Restricted access - publisher's policy
    Speech is a rich biometric signal that contains information about the identity, gender and emotional state of the speaker. In this work, we explore its potential to generate face images of a speaker by conditioning a ...
  • Chinese-Catalan: A neural machine translation approach based on pivoting and attention mechanisms 

    Ruiz Costa-Jussà, Marta; Casas Manzanares, Noé; Escolano Peinado, Carlos; Rodríguez Fonollosa, José Adrián (2019-01-01)
    Article
    Open Access
    This article innovatively addresses machine translation from Chinese to Catalan using neural pivot strategies trained without any direct parallel data. The Catalan language is very similar to Spanish from a linguistic point ...

View more