Mass media globalization introduces the challenge of multilingualism into most popular speech applications such as text-tospeech synthesis and automatic speech recognition. In Spain as well as in the other countries, the usage of English words is
rapidly growing, however due to the linguistic diversity of the languages spoken across the country, Spanish is not less inﬂuenced by inclusions from the four ofﬁcial languages. This work is focused on the pronunciation of Catalan inclusions in Spanish utterances. Our goal was to approach the nativization phenomenon by data-driven methods, making it easily transferable
to other languages without loss in performance. For this particular task, training and test nativization corpora were manually crafted and the task itself was approached using pronunciation by analogy. The results were encouraging and showed that even small corpus of 1000 words allows to capture the analogy in the nativization process. The resulting pronunciations allowed signiﬁcant improvements in the ntelligibility of Catalan inclusions
in Spanish utterances.
CitationPolyákova, T.; Bonafonte, A. Defining analogy for non-native inclusions in Spanish utterances. A: Jornadas en Tecnología del Habla and Iberian SLTech Workshop. "Actas de las VI jornadas en tecnologías del habla". Vigo: 2010, p. 21-24.
All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder. If you wish to make any use of the work not provided for in the law, please contact: email@example.com