L'àmbit de recerca del grup 'VEU' és el tractament de la parla. Investiguem tecnologies que permeten l'extracció d'informació que la veu conté: reconeixement del que es diu, l'idioma o el dialecte, característiques del parlant -qui és, la seva edat, el sexe, l'estat emocional-, la direcció del so. També treballem en la caracterització general de l'àudio, per determinar quan hi ha veu i quan hi ha altres esdeveniments acústics com música o sorolls diversos. Les tecnologies de la parla possibiliten generar veu -síntesis de veu- o modificar les seves

http://futur.upc.edu/VEU

Enviaments recents

  • Recent activities of IAG working group “Ionosphere Prediction” 

    Erdogan, Eren; Hoque, Mainul; García Rigo, Alberto; Cueto, M.; Schmidt, Michael; Jakowski, Norbert; Berdermann, Jens; Monte Moreno, Enrique; Hernández Pajares, Manuel (2018)
    Comunicació de congrés
    Accés obert
    Ionospheric disturbances pose, for instance, an increasing risk on economy, national security, satellite and airline operations, communications networks and the navigation systems. Constructing ...
  • Experimental research on encoder-decoder architectures with attention for chatbots 

    Ruiz Costa-Jussà, Marta; Nuez, Álvaro; Segura, Carlos (2018-03-18)
    Article
    Accés restringit per política de l'editorial
    Chatbots aim at automatically offering a conversation be- tween a human and a computer. While there is a long track of re- search in rule-based and retrieval-based approaches, the generation-based approaches are promisingly ...
  • Bridging deep and kernel methods 

    Belanche Muñoz, Luis Antonio; Ruiz Costa-Jussà, Marta (2017)
    Text en actes de congrés
    Accés restringit per política de l'editorial
    There has been some exciting major progress in recent years in data analysis methods, including a variety of deep learning architectures, as well as further advances in kernel-based learning methods, which have demonstrated ...
  • Byte-based neural machine translation 

    Ruiz Costa-Jussà, Marta; Escolano Peinado, Carlos; Rodríguez Fonollosa, José Adrián (Association for Computational Linguistics, 2017)
    Text en actes de congrés
    Accés obert
    This paper presents experiments compar- ing character-based and byte-based neural machine translation systems. The main motivation of the byte-based neural ma- chine translation system is to build multi- lingual neural ...
  • A novel approach to real-time range estimation of underwater acoustic sources using supervised machine learning 

    Houégnigan, Ludwig; Safari, Pooyan; Nadeu Camprubí, Climent; Van der Schaar, Mike Connor Roger Malcolm; André, Michel (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    Comunicació de congrés
    Accés obert
    The proposed paper introduces a novel method for range estimation of acoustic sources, both cetaceans and industrial sources, in deep sea environments using supervised learning with neural networks in the contex of a single ...
  • Automatic detection of alarm sounds in a noisy hospital environment using model and non-model based approaches 

    Raboshchuk, Ganna; Gómez Quintana, Sergi; Peiró Lilja, Alexandre; Nadeu Camprubí, Climent (2017-11-12)
    Report de recerca
    Accés obert
    In the noisy acoustic environment of a Neonatal Intensive Care Unit (NICU) there is a variety of alarms, which are frequently triggered by the biomedical equipment. In this paper different approaches for automatic detection ...
  • Joint model-based recognition and localization of overlapped acoustic events using a set of distributed small microphone arrays 

    Chakraborty, Rupayan; Nadeu Camprubí, Climent (2017-12-19)
    Report de recerca
    Accés obert
    In the analysis of acoustic scenes, often the occurring sounds have to be detected in time, recognized, and localized in space. Usually, each of these tasks is done separately. In this paper, a model-based approach to ...
  • Rudiments of spatial audio synthesis 

    Girbau Xalabarder, Andreu; Nadeu Camprubí, Climent (2017-02-15)
    Report de recerca
    Accés obert
    For many application areas the binaural synthesis has become a field of interest. In this paper, we present the basics of bin- aural synthesis for 2 channels -left and right- providing exam- ples and figures, using ...
  • A neural network approach for automatic detection of acoustic alarms 

    Peiró Lilja, Alexandre; Raboshchuk, Ganna; Nadeu Camprubí, Climent (Scitepress, 2017)
    Comunicació de congrés
    Accés restringit per política de l'editorial
    Acoustic alarms generated by biomedical equipment are relevant sounds in the noisy Neonatal Intensive Care Unit (NICU) environment both because of their high frequency of occurrence and their possible negative effects on ...
  • Non parametric coding of speech by means of a MLP with hints 

    Hernández, G; Monte Moreno, Enrique; Mariño Acebal, José Bernardo (Springer, 1997)
    Text en actes de congrés
    Accés restringit per política de l'editorial
    This paper presents a non parametric compression system which makes use of the fact that a MLP has an internal representation of the data in the hidden layer. The system that we present makes a compression by using 4 or 8 ...
  • A data-driven approach to construct survey-based indicators by means of evolutionary algorithms 

    Claveria, Oscar; Monte Moreno, Enrique; Torra Porras, Salvador (2018-01-09)
    Article
    Accés obert
    In this paper we propose a data-driven approach for the construction of survey-based indicators using large data sets. We make use of agents’ expectations about a wide range of economic variables contained in the World ...

Mostra'n més