Enhancing scarce-resource language translation through pivot combinations
Tipus de documentText en actes de congrés
Condicions d'accésAccés obert
Chinese and Spanish are the most spoken languages in the world. However, there is not much research done in machine translation for this language pair. We experiment with the parallel Chinese-Spanish corpus (United Nations) to explore alternatives of SMT strategies which consist on using a pivot language. Particularly, two well-known alternatives are shown for pivoting: the cascade system and the pseudo-corpus. As Pivot language we use English, Arabic and French. Results show that English is the best pivot language between Chinese and Spanish. As a new strategy, we propose to perform a combination of the pivot strategies which is capable to highly outperform the direct translation strategy.
CitacióRuiz, M., Henríquez, C., Banchs, R. Enhancing scarce-resource language translation through pivot combinations. A: International Joint Conference on Natural Language Processing. "Proceedings of the 5th International Joint Conference on Natural Language Processing". 2011, p. 1361-1365.