Mostra el registre d'ítem simple

dc.contributor.authorFormiga Fanals, Lluís
dc.contributor.authorÁlías, Francesc
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned2013-03-15T15:56:13Z
dc.date.available2013-03-15T15:56:13Z
dc.date.created2012
dc.date.issued2012
dc.identifier.citationFormiga, L.; Álias, F. Perceptual optimization of unit-selection text-to-speech synthesis systems by means of active interactive genetic algorithms. A: Jornadas en Tecnología del Habla and Iberian SLTech Workshop. "Proceedings IberSPEECH 2012: VII Jornadas en Tecnología del Habla and III Iberian SLTech Workshop: November 21 - 23, 2012, Madrid, Spain". Madrid: 2012, p. 500-509.
dc.identifier.isbn84-616-1535-2
dc.identifier.urihttp://hdl.handle.net/2117/18355
dc.description.abstractThe tuning process of Unit Selection TTS (US-TTS) system is usually performed by an expert that typically conducts the task of weighting the cost function by hand. However, hand tuning is costly in terms of the required training time and inaccurate and ambiguous in terms of methodology. With the purpose of easing the task of properly tuning the weights of the cost function, this thesis make its contribution from a perceptual-based approach using of active interactive Genetic Algorithms (aiGAs). The thesis pursues four major guidelines: i) accuracy when tuning the weights, ii) robustness of the obtained weights, iii)real world applicability of the methodology to any cost function design, and iv)finding consensus of the different users when tuning the weights. The experimentation is carried out through a small and medium sized corpus (1.9h) applied to different configurations (type of features) of the US-TTS cost function. The thesis concludes that aiGAs are highly competitive in comparison to other weight tuning techniques from the state-of-the-art
dc.format.extent10 p.
dc.language.isoeng
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subjectÀrees temàtiques de la UPC::Informàtica::Sistemes d'informació::Interacció home-màquina
dc.subjectÀrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic
dc.subject.lcshHuman-computer interaction
dc.subject.lcshAutomatic speech recognition
dc.titlePerceptual optimization of unit-selection text-to-speech synthesis systems by means of active interactive genetic algorithms
dc.typeConference report
dc.subject.lemacInteracció persona-ordinador
dc.subject.lemacReconeixement automàtic de la parla
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://iberspeech2012.ii.uam.es/IberSPEECH2012_OnlineProceedings.pdf
dc.rights.accessOpen Access
local.identifier.drac11072348
dc.description.versionPostprint (published version)
local.citation.authorFormiga, L.; Álias, F
local.citation.contributorJornadas en Tecnología del Habla and Iberian SLTech Workshop
local.citation.pubplaceMadrid
local.citation.publicationNameProceedings IberSPEECH 2012: VII Jornadas en Tecnología del Habla and III Iberian SLTech Workshop: November 21 - 23, 2012, Madrid, Spain
local.citation.startingPage500
local.citation.endingPage509


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple