Mostra el registre d'ítem simple

dc.contributor.authorSainz, Iñaki
dc.contributor.authorNavas, Eva
dc.contributor.authorHernáez, Inma
dc.contributor.authorBonafonte Cávez, Antonio
dc.contributor.authorCampillo, Francisco
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned2010-10-11T07:58:56Z
dc.date.available2010-10-11T07:58:56Z
dc.date.created2010
dc.date.issued2010
dc.identifier.citationSainz, I. [et al.]. TTS evaluation campaign with a common spanish database. A: International Conference on Language Resources and Evaluation. "Seventh Int. Conf. on Language Resources and Evaluation (LREC)". Valleta: 2010, p. 2155-2160.
dc.identifier.isbn2-9517408-6-7
dc.identifier.urihttp://hdl.handle.net/2117/9608
dc.description.abstractThis paper describes the first TTS evaluation campaign designed for Spanish. Seven research institutions took part in the evaluation campaign and developed a voice from a common speech database provided by the organisation. Each participating team had a period of seven weeks to generate a voice. Next, a set of sentences were released and each team had to synthesise them within a week period. Finally, some of the synthesised test audio files were subjectively evaluated via an online test according to the following criteria: similarity to the original voice, naturalness and intelligibility. Box-plots, Wilcoxon tests and WER have been generated in order to analyse the results. Two main conclusions can be drawn: On the one hand, there is considerable margin for improvement to reach the quality level of the natural voice. On the other hand, two systems get significantly better results than the rest: one is based on statistical parametric synthesis and the other one is a concatenative system that makes use of a sinusoidal model to modify both prosody and smooth spectral joints. Therefore, it seems that some kind of spectral control is needed when building voices with a medium size database for unrestricted domains.
dc.format.extent6 p.
dc.language.isoeng
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subjectÀrees temàtiques de la UPC::Enginyeria de la telecomunicació
dc.subject.lcshText-to-speech software
dc.subject.lcshSignal theory (Telecommunication)
dc.titleTTS evaluation campaign with a common spanish database
dc.typeConference report
dc.subject.lemacSenyal, Teoria del (Telecomunicació)
dc.contributor.groupUniversitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.relation.publisherversionhttp://www.lrec-conf.org/proceedings/lrec2010/pdf/456_Paper.pdf
dc.rights.accessOpen Access
local.identifier.drac3265447
dc.description.versionPostprint (published version)
local.citation.authorSainz, I.; Navas, E.; Hernáez, I.; Bonafonte, A.; Campillo, F.
local.citation.contributorInternational Conference on Language Resources and Evaluation
local.citation.pubplaceValleta
local.citation.publicationNameSeventh Int. Conf. on Language Resources and Evaluation (LREC)
local.citation.startingPage2155
local.citation.endingPage2160


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple