Description of the Chinese-to-Spanish rule-based machine translation system developed with a hybrid combination of human annotation and statistical techniques
Cite as:
hdl:2117/104736
Document type: Article
Publication date: 2015-11-01
Access conditions: Open access
All rights reserved. This work is protected by the corresponding intellectual and industrial
property rights. Without prejudice to existing legal exemptions, its reproduction, distribution,
public communication or transformation is prohibited without the authorization of the rights holder.
Abstract
Two of the most popular Machine Translation (MT) paradigms are rule-based (RBMT) and corpus-based, the latter including statistical systems (SMT). When only scarce parallel corpora are available, RBMT becomes particularly attractive. This is the case for the Chinese-Spanish language pair.
This article presents the first RBMT system for Chinese to Spanish. We describe a hybrid method for constructing this system that takes advantage of available resources, such as parallel corpora, which are used to extract dictionaries and lexical and structural transfer rules.
The final system is freely available online and open source. Although performance lags behind standard SMT systems on an in-domain test set, the results show that the RBMT system's coverage is competitive and that it outperforms the SMT system on an out-of-domain test set. This RBMT system is available to the general public, can be further enhanced, and opens up the possibility of creating future hybrid MT systems.
Citation: Ruiz, M., Centelles, J. Description of the Chinese-to-Spanish rule-based machine translation system developed with a hybrid combination of human annotation and statistical techniques. "ACM transactions on asian language information processing", 1 November 2015, vol. 15, no. 1, p. 1-13.
ISSN: 1530-0226
Publisher's version: http://dl.acm.org/citation.cfm?id=2738045
Files | Description | Size | Format |
---|---|---|---|
chisparulebased-acm.pdf | | 213.5 KB | PDF |