dc.contributor.author: Esquerra Llucià, Ignasi
dc.contributor.author: Bonafonte Cávez, Antonio
dc.contributor.author: Vallverdú Bayés, Sisco
dc.contributor.author: Febrer Godayol, Albert
dc.contributor.other: Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.identifier.citation: Esquerra, I., Bonafonte, A., Vallverdu, F., Febrer, A. A bilingual Spanish-Catalan database of units for concatenative synthesis. A: International Conference on Language Resource & Evaluation. "LREC 1998: proceedings of the 1st International Conference on Language Resource & Evaluation: Granada, Spain: 28-30 May 1998". 1998.
dc.description.abstract: Different databases of phonetic units are required in multilingual Text-to-Speech systems based on concatenative synthesis. We are currently developing a TTS system able to convert text in either Catalan or Spanish, with some modules shared by the two languages and others specific to each language. In order to reduce the total number of units, a bilingual database containing all possible units for both languages has been obtained from two monolingual databases recorded by the same speaker. Common units have been selected according to their phonetic representation. The bilingual database has 1099 units, including diphones and some long units, whereas the two monolingual databases together would total 1545 units. An analysis of Catalan unit frequencies has been carried out to select which units should be included in the database. The experiments showed that the synthetic speech has a strong Catalan accent, probably due to the speaker's accent. Some common units, even if they are represented with the same symbol, should be considered separately in a bilingual database in order to cope with acoustically different allophones.
dc.format.extent: 1 p.
dc.subject: UPC subject areas::Telecommunications engineering::Signal processing::Speech and acoustic signal processing
dc.subject.lcsh: Automatic speech recognition
dc.subject.other: Concatenative synthesis
dc.subject.other: Speech synthesis
dc.subject.other: Synthetic data
dc.subject.other: Programming languages
dc.title: A bilingual Spanish-Catalan database of units for concatenative synthesis
dc.type: Conference report
dc.subject.lemac: Automatic speech recognition
dc.contributor.group: Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.description.peerreviewed: Peer Reviewed
dc.rights.access: Open Access
dc.description.version: Postprint (published version)
local.citation.author: Esquerra, I.; Bonafonte, A.; Vallverdu, F.; Febrer, A.
local.citation.contributor: International Conference on Language Resource & Evaluation
local.citation.publicationName: LREC 1998: proceedings of the 1st International Conference on Language Resource & Evaluation: Granada, Spain: 28-30 May 1998

All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder.