Document-level machine translation with word vector models

Martínez Garcia, Eva; España Bonet, Cristina; Márquez Villodre, Luís

Visualitza/Obre

EAMT15Martinezetal.pdf (576,8Kb)

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

Martínez Garcia, Eva

España Bonet, Cristina

Márquez Villodre, Luís

Tipus de documentText en actes de congrés

Data publicació2015

Condicions d'accésAccés obert

Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets

Abstract

In this paper we apply distributional semantic information to document-level machine translation. We train monolingual and bilingual word vector models on large corpora and we evaluate them first in a cross-lingual lexical substitution task and then on the final translation task. For translation, we incorporate the semantic information in a statistical document-level decoder (Docent), by enforcing translation choices that are semantically similar to the context. As expected, the bilingual word vector models are more appropriate for the purpose of translation. The final document-level translator incorporating the semantic model outperforms the basic Docent (without semantics) and also performs slightly over a standard sentence level SMT system in terms of ULC (the average of a set of standard automatic evaluation metrics for MT). Finally, we also present some manual analysis of the translations of some concrete documents

CitacióMartinez, E.; España-Bonet, C.; Márquez , L. Document-level machine translation with word vector models. A: Annual Conference of the European Association for Machine Translation. "Proceedings of the 18th Annual Conference of the European Association for Machine Translation". Antalya: 2015, p. 59-66.

URIhttp://hdl.handle.net/2117/28516

Versió de l'editorhttp://www.eamt2015.org/files/downloads/EAMT2015_Proceedings.pdf

Col·leccions

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
EAMT15Martinezetal.pdf		576,8Kb	PDF	Visualitza/Obre

UPCommons. Portal del coneixement obert de la UPC

Document-level machine translation with word vector models

Visualitza/Obre

Explora