dc.contributor.author: Ruiz Costa-Jussà, Marta
dc.contributor.author: Farrús Cabeceran, Mireia
dc.contributor.author: Mariño Acebal, José Bernardo
dc.contributor.author: Rodríguez Fonollosa, José Adrián
dc.contributor.other: Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned: 2011-02-04T13:52:09Z
dc.date.available: 2011-02-04T13:52:09Z
dc.date.created: 2011
dc.date.issued: 2011
dc.identifier.citation: Costa-Jussà, M. R. [et al.]. Automatic and human evaluation study of a rule-based and a statistical Catalan-Spanish machine translation systems. A: International Conference on Language Resources and Evaluation. "Seventh Conference on International Language Resources and Evaluation". Valletta: 2011, p. 1707-1711.
dc.identifier.isbn: 2-9517408-6-7
dc.identifier.uri: http://hdl.handle.net/2117/11279
dc.description.abstract: Machine translation systems can be classified into rule-based and corpus-based approaches, in terms of their core technology. Since both paradigms have been widely used in recent years, one of the aims of the research community is to know how these systems differ in terms of translation quality. To this end, this paper reports a study and comparison of a rule-based and a corpus-based (specifically, statistical) Catalan-Spanish machine translation system, both of them freely available on the web. The translation quality analysis is performed in two different domains: journalistic and medical. The systems are evaluated using standard automatic measures, as well as by native human evaluators. Automatic results show that the statistical system performs better than the rule-based system. Human judgements show that in the Spanish-to-Catalan direction the statistical system also performs better than the rule-based system, while in the Catalan-to-Spanish direction it is the other way round. Although the statistical system obtains the best automatic scores, its errors tend to be penalized more heavily by human judgements than the errors of the rule-based system. A possible explanation is that statistical errors are usually unexpected and do not follow any pattern.
dc.format.extent: 5 p.
dc.language.iso: cat
dc.subject: Àrees temàtiques de la UPC::Enginyeria de la telecomunicació
dc.subject.lcsh: Machine translation
dc.subject.lcsh: Rule-based machine translation
dc.subject.lcsh: Statistical machine translation
dc.subject.lcsh: Signal theory (Telecommunication)
dc.title: Automatic and human evaluation study of a rule-based and a statistical Catalan-Spanish machine translation systems
dc.type: Conference report
dc.contributor.group: Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.relation.publisherversion: http://www.lrec-conf.org/proceedings/lrec2010/pdf/47_Paper.pdf
dc.rights.access: Open Access
drac.iddocument: 4928155
dc.description.version: Postprint (published version)
upcommons.citation.author: Costa-Jussà, M. R.; Farrus, M.; Mariño, J.; Fonollosa, José A. R.
upcommons.citation.contributor: International Conference on Language Resources and Evaluation
upcommons.citation.pubplace: Valletta
upcommons.citation.published: true
upcommons.citation.publicationName: Seventh Conference on International Language Resources and Evaluation
upcommons.citation.startingPage: 1707
upcommons.citation.endingPage: 1711



All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication, or transformation of this work is prohibited without the permission of the copyright holder.