L'àmbit de recerca del grup 'VEU' és el tractament de la parla. Investiguem tecnologies que permeten l'extracció d'informació que la veu conté: reconeixement del que es diu, l'idioma o el dialecte, característiques del parlant -qui és, la seva edat, el sexe, l'estat emocional-, la direcció del so. També treballem en la caracterització general de l'àudio, per determinar quan hi ha veu i quan hi ha altres esdeveniments acústics com música o sorolls diversos. Les tecnologies de la parla possibiliten generar veu -síntesis de veu- o modificar les seves

http://futur.upc.edu/VEU

Enviaments recents

  • Neural machine translation with the transformer and multi-source romance languages for the biomedical WMT 2018 task 

    Tubay, Brian; Ruiz Costa-Jussà, Marta (2018)
    Comunicació de congrés
    Accés restringit per política de l'editorial
  • A neural approach to language variety translation 

    Ruiz Costa-Jussà, Marta; Zampieri, Marcos; Pal, Santanu (Association for Computational Linguistics, 2018)
    Comunicació de congrés
    Accés restringit per política de l'editorial
    In this paper we present the first neural-based machine translation system trained to translate between standard national varieties of the same language. We take the pair Brazilian - European Portuguese as an example and ...
  • From feature to paradigm: Deep learning in machine translation (Extended Abstract) 

    Ruiz Costa-Jussà, Marta (2018)
    Comunicació de congrés
    Accés restringit per política de l'editorial
    n the last years, deep learning algorithms have highly revolutionized several areas including speech, image and natural language processing. The specific field of Machine Translation (MT) has not remained invariant. ...
  • Synthesis using speaker adaptation from speech recognition DB 

    Oller Moreno, Sergio; Moreno Bilbao, M. Asunción; Bonafonte Cávez, Antonio (Universidad de Vigo, 2010)
    Comunicació de congrés
    Accés obert
    This paper deals with the creation of multiple voices from a Hidden Markov Model based speech synthesis system (HTS). More than 150 Catalan synthetic voices were built using Hidden Markov Models (HMM) and speaker adaptation ...
  • Visualizing punctuation restoration in speech transcripts with prosograph 

    Oktem, A.; Farrús, M.; Bonafonte Cávez, Antonio (International Speech Communication Association (ISCA), 2018)
    Text en actes de congrés
    Accés obert
    We have developed a neural architecture that tests the effect of lexical, morphosyntactic and prosodic features in restoring punctuation in speech transcriptions. Having outperformed a baseline model in terms of precision ...
  • Expressive speech synthesis using sentiment embeddings 

    Jauk, Igor; Lorenzo Trueba, J.; Yamagishi, J.; Bonafonte Cávez, Antonio (International Speech Communication Association (ISCA), 2018)
    Text en actes de congrés
    Accés obert
    In this paper we present a DNN based speech synthesis system trained on an audiobook including sentiment features predicted by the Stanford sentiment parser. The baseline system uses DNN to predict acoustic parameters based ...
  • Spanish statistical parametric speech synthesis using a neural vocoder 

    Bonafonte Cávez, Antonio; Pascual de la Puente, Santiago; Dorca, G. (International Speech Communication Association (ISCA), 2018)
    Text en actes de congrés
    Accés obert
    During the 2000s decade, unit-selection based text-to-speech was the dominant commercial technology. Meanwhile, the TTS research community has made a big effort to push statistical-parametric speech synthesis to get similar ...
  • The use of long-term features for GMM- and i-vector-based speaker diarization systems 

    Zewoudie, Abraham Woubie; Luque, Jordi; Hernando Pericás, Francisco Javier (2018-09-26)
    Article
    Accés obert
    Several factors contribute to the performance of speaker diarization systems. For instance, the appropriate selection of speech features is one of the key aspects that affect speaker diarization systems. The other factors ...
  • Detection and description of the different ionospheric disturbances that appeared during the solar eclipse of 21 August 2017 

    Yang, Heng; Monte Moreno, Enrique; Hernández Pajares, Manuel (Multidisciplinary Digital Publishing Institute (MDPI), 2018-10-30)
    Article
    Accés obert
    This work will provide a detailed characterization of the travelling ionospheric disturbances (TIDs) created by the solar eclipse of 21 August 2017, the shadow of which crossed the United States from the Pacific to the ...
  • A conversation analysis framework using speech recognition and naïve bayes classification for construction process monitoring 

    Zhang, T.; Lee, Y. C.; Zhu, Y.; Hernando Pericás, Francisco Javier (American Society of Civil Engineers (ASCE), 2018)
    Text en actes de congrés
    Accés restringit per política de l'editorial
    At a dynamic construction site, conversations convey vital information including construction activities, operation status, and task performance. Even though because of information security, recording the entire conversations ...
  • Language and noise transfer in speech enhancement generative adversarial network 

    Pascual de la Puente, Santiago; Park, Maruchan; Serra, Joan; Bonafonte Cávez, Antonio; Ahn, Kang-hun (Institute of Electrical and Electronics Engineers (IEEE), 2018)
    Text en actes de congrés
    Accés restringit per política de l'editorial
    Speech enhancement deep learning systems usually require large amounts of training data to operate in broad conditions or real applications. This makes the adaptability of those systems into new, low resource environments ...
  • Tracking economic growth by evolving expectations via genetic programming: a two-step approach 

    Claveria González, Oscar; Monte Moreno, Enrique; Torra Porras, Salvador (2018-10-09)
    Report de recerca
    Accés obert
    The main objective of this study is to present a two-step approach to generate estimates of economic growth based on agents’ expectations from tendency surveys. First, we design a genetic programming experiment to derive ...

Mostra'n més