VEU - Speech Processing Group (Grup de Tractament de la Parla)
The research area of the 'VEU' group is speech processing. We investigate technologies that extract the information contained in the voice: recognition of what is said, the language or dialect, speaker characteristics (who the speaker is, their age, sex, and emotional state), and the direction of the sound. We also work on general audio characterization, to determine when speech is present and when other acoustic events occur, such as music or various noises. Speech technologies make it possible to generate speech (speech synthesis) or to modify its ...
Collections
- Journal articles [124]
- Books [5]
- Presentations [2]
- Research reports [25]
Recent submissions
Recent activities of IAG working group “Ionosphere Prediction”
(2018)
Conference lecture
Open access
Ionospheric disturbances pose, for instance, an increasing risk on economy, national security, satellite and airline operations, communications networks and the navigation systems. Constructing ...
Experimental research on encoder-decoder architectures with attention for chatbots
(2018-03-18)
Article
Restricted access (publisher's policy)
Chatbots aim at automatically offering a conversation between a human and a computer. While there is a long track of research in rule-based and retrieval-based approaches, the generation-based approaches are promisingly ...
Bridging deep and kernel methods
(2017)
Conference report
Restricted access (publisher's policy)
There has been some exciting major progress in recent years in data analysis methods, including a variety of deep learning architectures, as well as further advances in kernel-based learning methods, which have demonstrated ...
Byte-based neural machine translation
(Association for Computational Linguistics, 2017)
Conference report
Open access
This paper presents experiments comparing character-based and byte-based neural machine translation systems. The main motivation of the byte-based neural machine translation system is to build multilingual neural ...
A novel approach to real-time range estimation of underwater acoustic sources using supervised machine learning
(Institute of Electrical and Electronics Engineers (IEEE), 2017)
Conference lecture
Open access
The proposed paper introduces a novel method for range estimation of acoustic sources, both cetaceans and industrial sources, in deep sea environments using supervised learning with neural networks in the context of a single ...
Automatic detection of alarm sounds in a noisy hospital environment using model and non-model based approaches
(2017-11-12)
Research report
Open access
In the noisy acoustic environment of a Neonatal Intensive Care Unit (NICU) there is a variety of alarms, which are frequently triggered by the biomedical equipment. In this paper different approaches for automatic detection ...
Joint model-based recognition and localization of overlapped acoustic events using a set of distributed small microphone arrays
(2017-12-19)
Research report
Open access
In the analysis of acoustic scenes, often the occurring sounds have to be detected in time, recognized, and localized in space. Usually, each of these tasks is done separately. In this paper, a model-based approach to ...
Guidelines for producing a database of continuous acoustic environment recordings and child physiological variables in a neonatal intensive care unit
(2017-01-15)
Research report
Open access
Rudiments of spatial audio synthesis
(2017-02-15)
Research report
Open access
For many application areas the binaural synthesis has become a field of interest. In this paper, we present the basics of binaural synthesis for 2 channels (left and right), providing examples and figures, using ...
A neural network approach for automatic detection of acoustic alarms
(Scitepress, 2017)
Conference lecture
Restricted access (publisher's policy)
Acoustic alarms generated by biomedical equipment are relevant sounds in the noisy Neonatal Intensive Care Unit (NICU) environment both because of their high frequency of occurrence and their possible negative effects on ...
Non parametric coding of speech by means of a MLP with hints
(Springer, 1997)
Conference report
Restricted access (publisher's policy)
This paper presents a non parametric compression system which makes use of the fact that a MLP has an internal representation of the data in the hidden layer. The system that we present makes a compression by using 4 or 8 ...
A data-driven approach to construct survey-based indicators by means of evolutionary algorithms
(2018-01-09)
Article
Open access
In this paper we propose a data-driven approach for the construction of survey-based indicators using large data sets. We make use of agents' expectations about a wide range of economic variables contained in the World ...