  • A bilingual Spanish-Catalan database of units for concatenative synthesis 

    Esquerra Llucià, Ignasi; Bonafonte Cávez, Antonio; Vallverdú Bayés, Sisco; Febrer Godayol, Albert (1998)
    Conference report
    Open Access
    Different databases of phonetic units are required in multilingual Text-to-Speech systems based on concatenative synthesis. We are currently developing a TTS system able to convert text in either Catalan or Spanish, with ...
  • A Catalan broadcast conversational speech database 

    Schulz, Henrik; Rodríguez Fonollosa, José Adrián (2009-09)
    Conference lecture
    Restricted access - publisher's policy
    Data-driven methods in speech and linguistic research, and system development, require appropriate speech databases. A new Catalan speech database has been developed with a particular emphasis on broadcast conversational ...
  • A conversation analysis framework using speech recognition and naïve bayes classification for construction process monitoring 

    Zhang, T.; Lee, Y. C.; Zhu, Y.; Hernando Pericás, Francisco Javier (American Society of Civil Engineers (ASCE), 2018)
    Conference report
    Restricted access - publisher's policy
    At a dynamic construction site, conversations convey vital information including construction activities, operation status, and task performance. Even though because of information security, recording the entire conversations ...
  • Action plan for dissemination 

    Cristea, Dan; Trandabăț, Diana; Branco, Antonio; Mendes, Amalia; Pellegrini, Thomas; Thompson, Paul; Irimia, Elena; Tufis, Dan; Gilmenau, Georgiana; Rosner, Mike; Moreno Bilbao, M. Asunción; Bel, Núria (2012-07-29)
    External research report
    Open Access
    The central objective of the Metanet4u project is to contribute to the establishment of a pan-European digital platform that makes available language resources and services, encompassing both datasets and software tools, ...
  • Action plan for dissemination updated 

    Cristea, Dan; Trandabăț, Diana; Branco, Antonio; Mendes, Amalia; Pellegrini, Thomas; Thompson, Paul; Tufis, Dan; Gilmenau, Georgiana; Rosner, Mike; Moreno Bilbao, M. Asunción; Bel, Núria (2012-07-04)
    External research report
    Open Access
    Deliverable D5.3 of the METANET4U project (Project CIP #270893)
  • Adaptación del sistema texto a voz "Festival" al catalán 

    Jaén Gómez, Alejandro (Universitat Politècnica de Catalunya, 2007-01-08)
    Master thesis (pre-Bologna period)
    Open Access
  • Alzheimer disease diagnosis based on automatic spontaneous speech analysis 

    Lopez de Ipiña Peña, Karmele; Alonso Hernandez, Jesus Bernardino; Sole Casals, Jordi; Barroso Moreno, Nora; Faúndez Zanuy, Marcos; Ecay Torres, Mirian; Travieso Gonzalez, Carlos Manuel; Ezeiza Ramos, Aitzol; Estanga Alustiza, Ainara (SciTePress, 2012)
    Conference report
    Restricted access - publisher's policy
    Alzheimer's disease (AD) is the most prevalent form of progressive degenerative dementia and has a high socio-economic impact in Western countries; it is therefore one of the most active research areas today. Its diagnosis ...
  • Análisis de los servicios y sistemas de comunicaciones de voz y datos para la implantación de un sistema de telefonía IP, en un entorno editorial 

    Giné Figueras, Francesc (Universitat Politècnica de Catalunya, 2011-04-27)
    Master thesis (pre-Bologna period)
    Restricted access - author's decision
    The main objective of this document is to analyze the current voice communications system and, in particular, the quality of the service offered, and to study the requirements of voice communications services ...
  • Analisis estadistico de orden superior de la voz 

    Rodríguez Fonollosa, José Adrián; Moreno Bilbao, M. Asunción (1991)
    Conference report
    Open Access
    Most of the speech analysis methods developed to date have been based on the autocorrelation function or power spectrum, i.e., the second order statistics of the signal. In this paper it is shown that higher order ...
  • An information-theoretic string matching approach for spoken utterance verification and keyword spotting 

    Quer Romeo, Guillem (Universitat Politècnica de Catalunya, 2016)
    Master thesis (pre-Bologna period)
    Restricted access - author's decision
    The goal of this project is to develop an information-theoretic acoustic-phonetic approach to detect the presence of words or phrases in an utterance. Specifically, the project focuses on two types of detection tasks in ...
  • An ultra low-power hardware accelerator for acoustic scoring in speech recognition 

    Tabani, Hamid; Arnau Montañés, José María; Tubella Murgadas, Jordi; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    Conference report
    Restricted access - publisher's policy
    Accurate, real-time Automatic Speech Recognition (ASR) comes at a high energy cost, so accuracy often has to be sacrificed in order to fit the strict power constraints of mobile systems. However, accuracy is extremely ...
  • A Speech-based Dialogue System for Household Robots 

    Pons Rueda, Susana (Universitat Politècnica de Catalunya / Technische Universiteit Delft, 2011)
    Master thesis (pre-Bologna period)
    Restricted access - author's decision
    This thesis studies mechanisms to improve human-robot-interaction through a spoken dialogue for household robots. Therefore, a full dialogue system, in which the semantics of the words play an important role, is ...
  • Automatic speech recognition with deep neural networks for impaired speech 

    España-i-Bonet, Cristina; Rodríguez Fonollosa, José Adrián (Springer, 2016)
    Conference report
    Open Access
    Automatic Speech Recognition has reached almost human performance in some controlled scenarios. However, recognition of impaired speech is a difficult task for two main reasons: data is (i) scarce and (ii) heterogeneous. ...
  • Automatic speech recognition with Kaldi toolkit 

    Rosillo Gil, Victor (Universitat Politècnica de Catalunya, 2016-02-08)
    Bachelor thesis
    Open Access
    Covenantee: Akademia Górniczo-Hutnicza im. S. Staszica w Krakowie
    The topic of this thesis is to build an accurate automatic speech recognition system to be able to recognize speech using Kaldi, an open-source toolkit for speech recognition written in C++ and with free data. First of ...
  • Awareness, mobilisation and dissemination actions 

    Trandabăț, Diana; Cristea, Dan; Branco, Antonio; Mendes, Amalia; Pellegrini, Thomas; Ananiadou, Sophia; Thompson, Paul; Irimia, Elena; Tufis, Dan; Gilmenau, Georgiana; Rosner, Mike; Moreno Bilbao, M. Asunción; Bel, Núria (2012-01-31)
    External research report
    Open Access
    The central objective of the Metanet4u project is to contribute to the establishment of a pan-European digital platform that makes available language resources and services, encompassing both datasets and software tools, ...
  • BaNa: a noise resilient fundamental frequency detection algorithm for speech and music 

    Yang, Na; Ba, He; Cai, Weiyang; Seyfettin Demirkol, Ilker; Heinzelman, Wendi (2014-08-27)
    Article
    Open Access
    Fundamental frequency (F0) is one of the essential features in many acoustic related applications. Although numerous F0 detection algorithms have been developed, the detection accuracy in noisy environments still needs ...
  • Blind channel equalization using weighted subspace methods 

    Ruiz Feliu, Rafael; Cabrera-Bean, Margarita (Institute of Electrical and Electronics Engineers (IEEE), 1999)
    Conference report
    Open Access
    This paper addresses the problems of blind channel estimation and symbol detection with second order statistics methods from the received data. It can be shown that this problem is similar to direction of arrival (DOA) ...
  • Block-based Speech-to-Speech Translation 

    Roca, Sandra (Universitat Politècnica de Catalunya, 2018-10)
    Bachelor thesis
    Open Access
    This thesis explores different ways of implementing a block-based Speech Translation system with the aim of generating large amounts of data to build a large parallel speech corpus. The first task consists ...
  • BUCEADOR, a multi-language search engine for digital libraries 

    Adell Mercado, Jordi; Bonafonte Cávez, Antonio; Cardenal, Antonio; Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián; Moreno Bilbao, M. Asunción; Navas, Eva; Rodríguez Banga, Eduardo (2012)
    Conference lecture
    Open Access
    This paper presents a web-based multimedia search engine built within the Buceador (www.buceador.org) research project. A proof-of-concept tool has been implemented which is able to retrieve information from a digital ...
  • BUCEADOR hybrid TTS for blizzard challenge 2011 

    Sainz, Iñaki; Erro Eslava, Daniel; Navas, Eva; Adell Mercado, Jordi; Bonafonte Cávez, Antonio (2011)
    Conference report
    Open Access
    This paper describes the Text-to-Speech (TTS) systems presented by the Buceador Consortium in the Blizzard Challenge 2011 evaluation campaign. The main system is a concatenative hybrid one that tries to combine the strong ...