Coverage for character based neural machine translation

Kazimi, Bashir; Ruiz Costa-Jussà, Marta

Document type: Article

Publication date: 2017-09-22

Access conditions: Open access

Attribution-NonCommercial-NoDerivs 3.0 Spain

Except where otherwise noted, the content of this work is licensed under a Creative Commons license: Attribution-NonCommercial-NoDerivs 3.0 Spain

Abstract

In recent years, Neural Machine Translation (NMT) has achieved state-of-the-art performance in translating from one language (the source language) to another (the target language). However, many of the proposed methods use word embeddings to represent a sentence in the source or target language. Character embeddings have been suggested as a better way to represent the words in a sentence. Moreover, recent NMT models use an attention mechanism, in which the most relevant words in the source sentence are used to generate each target word. The problem with this approach is that some words are translated multiple times while others are not translated at all. To address this problem, a coverage model has been integrated into NMT to keep track of already-translated words and focus on the untranslated ones. In this research, we present a new architecture that uses character embeddings to represent the source and target languages and a coverage model to ensure that all words are translated. We ran experiments comparing our model with a coverage model and a character model, and the results show that our model performs better than both.
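
The coverage idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' architecture: it is a simplified stand-in in which a coverage vector accumulates the attention weights from earlier decoding steps and discounts the scores of source positions that have already received attention. The function names, the fixed `raw_scores`, and the `penalty` parameter are all hypothetical and chosen only for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend_with_coverage(raw_scores, coverage, penalty=2.0):
    """Return (attention weights, updated coverage) for one decoding step.

    raw_scores: hypothetical relevance score of each source position.
    coverage:   attention mass each source position has received so far.
    penalty:    hypothetical strength of the discount on covered positions.
    """
    # Positions that were attended to before get their scores lowered.
    adjusted = [s - penalty * c for s, c in zip(raw_scores, coverage)]
    weights = softmax(adjusted)
    # Coverage is the running sum of attention weights per source position.
    new_coverage = [c + w for c, w in zip(coverage, weights)]
    return weights, new_coverage


def decode_focus(raw_scores, steps, penalty=2.0):
    """Run several steps with fixed raw scores.

    Returns the most-attended source position at each step and the
    final coverage vector.
    """
    coverage = [0.0] * len(raw_scores)
    focus = []
    for _ in range(steps):
        weights, coverage = attend_with_coverage(raw_scores, coverage, penalty)
        focus.append(max(range(len(weights)), key=weights.__getitem__))
    return focus, coverage
```

Calling `decode_focus([2.0, 1.0, 0.0], steps=3, penalty=0.0)` keeps the focus on the same source position at every step, which mirrors the repeated-translation problem the abstract describes. With a positive `penalty`, the focus moves to other positions as coverage builds up. In a real NMT system the scores would change at each step and the coverage signal would typically be learned rather than applied as a fixed discount.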

Citation: Kazimi, B., Ruiz, M. Coverage for character based neural machine translation. "Procesamiento del lenguaje natural (SEPLN)", 22 September 2017, vol. 59, p. 99-106.

URI: http://hdl.handle.net/2117/107805

ISSN: 1989-7553

Publisher version: http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/5498
