Now showing items 1-20 of 43

  • A bilingual Spanish-Catalan database of units for concatenative synthesis 

    Esquerra Llucià, Ignasi; Bonafonte Cávez, Antonio; Vallverdú Bayés, Sisco; Febrer Godayol, Albert (1998)
    Text in conference proceedings
    Open access
    Different databases of phonetic units are required in multilingual Text-to-Speech systems based on concatenative synthesis. We are currently developing a TTS system able to convert text in either Catalan or Spanish, with ...
  • A bilingual text-to-speech system in Spanish and Catalan 

    Bonafonte Cávez, Antonio; Esquerra Llucià, Ignasi; Febrer Godayol, Albert; Vallverdú Bayés, Sisco (Editors: G. Kokkinakis, N. Fakotakis, E. Dermatas; Editorial: WCL, University of Patras, Greece, 1997)
    Text in conference proceedings
    Open access
    This paper summarises the text-to-speech system that has been developed during the last years in the Speech Group of the Universitat Politècnica de Catalunya (UPC). The paper emphasises the parts of the system which are ...
  • Acoustic feature prediction from semantic features for expressive speech using deep neural networks 

    Jauk, Igor; Bonafonte Cávez, Antonio; Pascual, Santiago (Institute of Electrical and Electronics Engineers (IEEE), 2016)
    Text in conference proceedings
    Restricted access due to publisher policy
    The goal of the study is to predict acoustic features of expressive speech from semantic vector space representations. Though a lot of successful work was invested in expressiveness analysis and prediction, the results ...
  • Albayzin speech database: design of the phonetic corpus 

    Moreno Bilbao, M. Asunción; Poig, D; Bonafonte Cávez, Antonio; Lleida, E; Llisterri, J; Mariño Acebal, José Bernardo; Nadeu Camprubí, Climent (EUROSPEECH, 1993)
    Text in conference proceedings
    Open access
    This paper describes the phonetic content of Albayzin, a spoken database for Spanish designed for speech recognition purposes. A statistical study of a large sample of spontaneous speech is presented, and the phonetic and ...
  • An efficient algorithm to find the best state sequence in HSMM 

    Bonafonte Cávez, Antonio; Ros Majó, Xavier; Mariño, José B. (1993)
    Text in conference proceedings
    Open access
    Hidden Markov Modeling (HMM) techniques have been applied successfully to speech analysis. However, it has been claimed [1-7] that a major weakness of HMM is that the state duration probability density functions (SDPDF) ...
  • Aneto: a tool for prosody analysis of speech 

    Febrer, M; Febrer, A; Bonafonte Cávez, Antonio; Esquerra Llucià, Ignasi (Institut de l'Audiovisual, Universitat Pompeu Fabra, 1998)
    Text in conference proceedings
    Open access
    The developed tool provides utilities for prosody analysis and labeling of voice signals. It works under Windows 95 and Windows NT environments and uses the Microsoft Win32 application programming interface (API) for audio ...
  • BUCEADOR, a multi-language search engine for digital libraries 

    Adell Mercado, Jordi; Bonafonte Cávez, Antonio; Cardenal, Antonio; Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián; Moreno Bilbao, M. Asunción; Navas, Eva; Rodríguez Banga, Eduardo (2012)
    Conference presentation
    Open access
    This paper presents a web-based multimedia search engine built within the Buceador (www.buceador.org) research project. A proof-of-concept tool has been implemented which is able to retrieve information from a digital ...
  • BUCEADOR hybrid TTS for Blizzard Challenge 2011 

    Sainz, Iñaki; Erro Eslava, Daniel; Navas, Eva; Adell Mercado, Jordi; Bonafonte Cávez, Antonio (2011)
    Text in conference proceedings
    Open access
    This paper describes the Text-to-Speech (TTS) systems presented by the Buceador Consortium in the Blizzard Challenge 2011 evaluation campaign. The main system is a concatenative hybrid one that tries to combine the strong ...
  • Building synthetic voices in the META-NET framework 

    Garcia Casademont, Emília; Bonafonte Cávez, Antonio; Moreno Bilbao, M. Asunción (2012)
    Text in conference proceedings
    Restricted access due to publisher policy
    METANET4U is a European project aiming at supporting language technology for European languages and multilingualism. It is a project in the META-NET Network of Excellence, a cluster of projects aiming at fostering the ...
  • Building synthetic voices in the METANET framework 

    Garcia Casademont, Emília; Bonafonte Cávez, Antonio; Moreno Bilbao, M. Asunción (2012)
    Conference presentation
    Open access
    METANET4U is a European project aiming at supporting language technology for European languages and multilingualism. It is a project in the META-NET Network of Excellence, a cluster of projects aiming at fostering the ...
  • Creating expressive synthetic voices by unsupervised clustering of audiobooks 

    Jauk, Igor; Bonafonte Cávez, Antonio; López Otero, Paula; Docio Fernández, Laura (International Speech Communication Association (ISCA), 2015)
    Conference presentation
    Restricted access due to publisher policy
    In this work we design an approach for automatic feature selection and voice creation for expressive synthesis. Our approach is guided by two main goals: (1) increasing the flexibility of expressive voice creation and (2) ...
  • Deep neural networks for i-vector language identification of short utterances in cars 

    Ghahabi Esfahani, Omid; Bonafonte Cávez, Antonio; Hernando Pericás, Francisco Javier; Moreno Bilbao, M. Asunción (International Speech Communication Association (ISCA), 2016)
    Text in conference proceedings
    Restricted access due to publisher policy
    This paper is focused on the application of the Language Identification (LID) technology for intelligent vehicles. We cope with short sentences or words spoken in moving cars in four languages: English, Spanish, German, ...
  • Defining analogy for non-native inclusions in Spanish utterances 

    Polyakova, Tatyana; Bonafonte Cávez, Antonio (2010)
    Text in conference proceedings
    Open access
    Mass media globalization introduces the challenge of multilingualism into most popular speech applications such as text-to-speech synthesis and automatic speech recognition. In Spain as well as in other countries, the ...
  • Direct expressive voice training based on semantic selection 

    Jauk, Igor; Bonafonte Cávez, Antonio (International Speech Communication Association (ISCA), 2016)
    Text in conference proceedings
    Restricted access due to publisher policy
    This work aims at creating expressive voices from audiobooks using semantic selection. First, for each utterance of the audiobook an acoustic feature vector is extracted, including iVectors built on MFCC and on F0 ...
  • Duration modeling with expanded HMM applied to speech recognition 

    Bonafonte Cávez, Antonio; Vidal Manzano, José; Nogueiras Rodríguez, Albino (H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE, 1996)
    Conference presentation
    Open access
    The occupancy of the HMM states is modeled by means of a Markov chain. A linear estimator is introduced to compute the probabilities of the Markov chain. The distribution function (DF) represents accurately the observed ...
  • Explicit segmentation of speech using Gaussian models 

    Bonafonte Cávez, Antonio; Nogueiras Rodríguez, Albino; Rodriguez-Garrido, A (H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE, 1996)
    Text in conference proceedings
    Restricted access due to publisher policy
    The authors investigate an automatic method to segment labeled speech. The method needs an initial estimation of the segmentation which is provided by an alignment based on HMM. Afterwards, the boundaries are refined moving ...
  • Glissando: a corpus for multidisciplinary prosodic studies in Spanish and Catalan 

    Garrido, Juan Maria; Escudero, David; Aguilar, Lourdes; Cardeñoso Payo, V.; Rodero, Emma; de-la-Mota, Carme; González, César; Vivaracho, C. E.; Rustullet, Sílvia; Larrea, Olatz; Laplaza, Yesika; Vizcaíno, Francisco; Estebas, Eva; Cabrera, Mercedes; Bonafonte Cávez, Antonio (2013-12)
    Article
    Restricted access due to publisher policy
    Literature review on prosody reveals the lack of corpora for prosodic studies in Catalan and Spanish. In this paper, we present a corpus intended to fill this gap. The corpus comprises two distinct data-sets, a news subcorpus ...
  • Introducing nativization to Spanish TTS systems 

    Polyakova, Tatyana; Bonafonte Cávez, Antonio (2011-06)
    Article
    Restricted access due to publisher policy
    In the modern world, speech technologies must be flexible and adaptable to any framework. Mass media globalization introduces multilingualism as a challenge for the most popular speech applications such as text-to-speech ...
  • Language and noise transfer in speech enhancement generative adversarial network 

    Pascual de la Puente, Santiago; Park, Maruchan; Serra, Joan; Bonafonte Cávez, Antonio; Ahn, Kang-hun (Institute of Electrical and Electronics Engineers (IEEE), 2018)
    Text in conference proceedings
    Restricted access due to publisher policy
    Speech enhancement deep learning systems usually require large amounts of training data to operate in broad conditions or real applications. This makes the adaptability of those systems into new, low resource environments ...
  • Language modeling using X-grams 

    Bonafonte Cávez, Antonio; Mariño Acebal, José Bernardo (H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE, 1996)
    Text in conference proceedings
    Open access
    In this paper, an extension of n-grams, called x-grams, is proposed. In this extension, the memory of the model (n) is not fixed a priori. Instead, large memories are accepted first, and merging criteria are then applied ...