Mostra el registre d'ítem simple
Linguistic measures for automatic machine translation evaluation
dc.contributor.author | Giménez Linares, Jesús Ángel |
dc.contributor.author | Màrquez Villodre, Lluís |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics |
dc.date.accessioned | 2011-08-23T09:13:54Z |
dc.date.available | 2011-08-23T09:13:54Z |
dc.date.created | 2010-12 |
dc.date.issued | 2010-12 |
dc.identifier.citation | Gimenez, J.; Màrquez, L. Linguistic measures for automatic machine translation evaluation. "Machine translation", Desembre 2010, vol. 24, núm. 3-4, p. 209-240. |
dc.identifier.issn | 0922-6567 |
dc.identifier.uri | http://hdl.handle.net/2117/13092 |
dc.description.abstract | Assessing the quality of candidate translations involves diverse linguistic facets. However, most automatic evaluation methods in use today rely on limited quality assumptions, such as lexical similarity. This introduces a bias in the development cycle which in some cases has been reported to carry very negative consequences. In order to tackle this methodological problem, we explore a novel path towards heterogeneous automatic Machine Translation evaluation. We have compiled a rich set of specialized similarity measures operating at different linguistic dimensions and analyzed their individual and collective behaviour over a wide range of evaluation scenarios. Results show that measures based on syntactic and semantic information are able to provide more reliable system rankings than lexical measures, especially when the systems under evaluation are based on different paradigms. At the sentence level, while some linguistic measures perform better than most lexical measures, some others perform substantially worse, mainly due to parsing problems. Their scores are, however, suitable for combination, yielding a substantially improved evaluation quality. |
dc.format.extent | 32 p. |
dc.language.iso | eng |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Llenguatge natural |
dc.subject.lcsh | Automatic evaluation methods |
dc.subject.lcsh | Linguistic analysis |
dc.subject.lcsh | Machine translation |
dc.title | Linguistic measures for automatic machine translation evaluation |
dc.type | Article |
dc.subject.lemac | Traducció automàtica |
dc.contributor.group | Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural |
dc.identifier.doi | 10.1007/s10590-011-9088-7 |
dc.description.peerreviewed | Peer Reviewed |
dc.rights.access | Restricted access - publisher's policy |
local.identifier.drac | 5797095 |
dc.description.version | Postprint (published version) |
dc.relation.projectid | info:eu-repo/grantAgreement/EC/FP7/247762/EU/Feedback Analysis for User adaptive Statistical Translation/FAUST |
local.citation.author | Gimenez, J.; Màrquez, L. |
local.citation.publicationName | Machine translation |
local.citation.volume | 24 |
local.citation.number | 3-4 |
local.citation.startingPage | 209 |
local.citation.endingPage | 240 |
Fitxers d'aquest items
Aquest ítem apareix a les col·leccions següents
-
Articles de revista [1.049]
-
Articles de revista [97]