Automatic and human evaluation study of a rule-based and a statistical Catalan-Spanish machine translation systems

Ruiz Costa-Jussà, Marta; Farrús Cabeceran, Mireia; Mariño Acebal, José Bernardo; Rodríguez Fonollosa, José Adrián

dc.contributor.author	Ruiz Costa-Jussà, Marta
dc.contributor.author	Farrús Cabeceran, Mireia
dc.contributor.author	Mariño Acebal, José Bernardo
dc.contributor.author	Rodríguez Fonollosa, José Adrián
dc.contributor.other	Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned	2011-02-04T13:52:09Z
dc.date.available	2011-02-04T13:52:09Z
dc.date.created	2011
dc.date.issued	2011
dc.identifier.citation	Costa-Jussà, M. R. [et al.]. Automatic and human evaluation study of a rule-based and a statistical Catalan-Spanish machine translation systems. A: International Conference on Language Resources and Evaluation. "Seventh Conference on International Language Resources and Evaluation". Valletta: 2011, p. 1707-1711.
dc.identifier.isbn	2-9517408-6-7
dc.identifier.uri	http://hdl.handle.net/2117/11279
dc.description.abstract	Machine translation systems can be classified into rule-based and corpus-based approaches, in terms of their core technology. Since both paradigms have largely been used during the last years, one of the aims in the research community is to know how these systems differ in terms of translation quality. To this end, this paper reports a study and comparison of a rule-based and a corpus-based (particularly, statistical) Catalan-Spanish machine translation systems, both of them freely available in the web. The translation quality analysis is performed under two different domains: journalistic and medical. The systems are evaluated by using standard automatic measures, as well as by native human evaluators. Automatic results show that the statistical system performs better than the rule-based system. Human judgements show that in the Spanishto- Catalan direction the statistical system also performs better than the rule-based system, while in the Catalan-to-Spanish direction is the other way round. Although the statistical system obtains the best automatic scores, its errors tend to be more penalized by human judgements than the errors of the rule-based system. This can be explained because statistical errors are usually unexpected and they do not follow any pattern.
dc.format.extent	5 p.
dc.language.iso	cat
dc.subject	Àrees temàtiques de la UPC::Enginyeria de la telecomunicació
dc.subject.lcsh	Machine translation
dc.subject.lcsh	Rule-based machine translation
dc.subject.lcsh	Statistical machine translation
dc.subject.lcsh	Signal theory (Telecommunication)
dc.title	Automatic and human evaluation study of a rule-based and a statistical Catalan-Spanish machine translation systems
dc.type	Conference report
dc.contributor.group	Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.relation.publisherversion	http://www.lrec-conf.org/proceedings/lrec2010/pdf/47_Paper.pdf
dc.rights.access	Open Access
local.identifier.drac	4928155
dc.description.version	Postprint (published version)
local.citation.author	Costa-Jussà, M. R.; Farrus, M.; Mariño, J.; Fonollosa, José A. R.
local.citation.contributor	International Conference on Language Resources and Evaluation
local.citation.pubplace	Valletta
local.citation.publicationName	Seventh Conference on International Language Resources and Evaluation
local.citation.startingPage	1707
local.citation.endingPage	1711

Fitxers d'aquest items

Nom:: LREC2010_Marta_Automatic.pdf
Mida:: 457,1Kb
Format:: PDF

Visualitza/Obre

Aquest ítem apareix a les col·leccions següents

Ponències/Comunicacions de congressos [437]
Ponències/Comunicacions de congressos [3.328]

Mostra el registre d'ítem simple

UPCommons. Portal del coneixement obert de la UPC

Automatic and human evaluation study of a rule-based and a statistical Catalan-Spanish machine translation systems

Fitxers d'aquest items

Aquest ítem apareix a les col·leccions següents

Explora