Automatic and human evaluation study of a rule-based and a statistical Catalan-Spanish machine translation systems

Ruiz Costa-Jussà, Marta; Farrús Cabeceran, Mireia; Mariño Acebal, José Bernardo; Rodríguez Fonollosa, José Adrián

Document type: Text in conference proceedings

Publication date: 2011

Access conditions: Open access

All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to existing legal exemptions, its reproduction, distribution, public communication or transformation is prohibited without the authorization of the rights holder.

Abstract

Machine translation systems can be classified into rule-based and corpus-based approaches according to their core technology. Since both paradigms have been widely used in recent years, one aim of the research community is to understand how these systems differ in translation quality. To this end, this paper reports a study and comparison of two Catalan-Spanish machine translation systems, one rule-based and one corpus-based (specifically, statistical), both freely available on the web. Translation quality is analysed in two domains: journalistic and medical. The systems are evaluated using standard automatic measures as well as by native human evaluators. Automatic results show that the statistical system performs better than the rule-based system. Human judgements show that in the Spanish-to-Catalan direction the statistical system also performs better than the rule-based system, while in the Catalan-to-Spanish direction it is the other way round. Although the statistical system obtains the best automatic scores, its errors tend to be penalized more heavily by human judges than those of the rule-based system. This can be explained by the fact that statistical errors are usually unexpected and do not follow any pattern.

Citation: Costa-Jussà, M. R. [et al.]. Automatic and human evaluation study of a rule-based and a statistical Catalan-Spanish machine translation systems. In: International Conference on Language Resources and Evaluation. "Seventh Conference on International Language Resources and Evaluation". Valletta: 2011, p. 1707-1711.

URI: http://hdl.handle.net/2117/11279

ISBN: 2-9517408-6-7

Publisher's version: http://www.lrec-conf.org/proceedings/lrec2010/pdf/47_Paper.pdf

Files	Description	Size	Format
LREC2010_Marta_Automatic.pdf		457.1 KB	PDF
