Robust Estimation of Feature Weights in Statistical Machine Translation

España Bonet, Cristina; Màrquez Villodre, Lluís

Visualitza/Obre

EAMT-2010-Espana-Bonet.pdf (617,1Kb)

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

España Bonet, Cristina

Màrquez Villodre, Lluís

Tipus de documentComunicació de congrés

Data publicació2010

Condicions d'accésAccés obert

Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets

Abstract

Weights of the various components in a standard Statistical Machine Translation model are usually estimated via Minimum Error Rate Training. With this, one finds their optimum value on a development set with the expectation that these optimal weights generalise well to other test sets. However, this is not always the case when domains differ. This work uses a perceptron algorithm to learn more robust weights to be used on out-of-domain corpora without the need for specialised data. For an Arabic-to-English translation system, the generalisation of weights represents an improvement of more than 2 points of BLEU with respect to the MERT baseline using the same information.

CitacióEspaña-Bonet, C.; Màrquez, L. Robust estimation of feature weights in statistical machine translation. A: Annual Conference of the European Association for Machine Translation. "14th Annual Conference of the European Association for Machine Translation". Saint-Raphaël: 2010, p. 190-197.

URIhttp://hdl.handle.net/2117/10449

Col·leccions

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
EAMT-2010-Espana-Bonet.pdf		617,1Kb	PDF	Visualitza/Obre

UPCommons. Portal del coneixement obert de la UPC

Robust Estimation of Feature Weights in Statistical Machine Translation

Visualitza/Obre

Explora