Coverage model for character-based neural machine translation

Kazimi, Mohammad Bashir

Visualitza/Obre

122533.pdf (1,452Mb)

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

Kazimi, Mohammad Bashir

Tutor / directorPadró, Lluís

; Ruiz Costa-Jussà, Marta

Tipus de documentProjecte Final de Màster Oficial

Data2017-05

Condicions d'accésAccés obert

Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets

Abstract

In recent years, Neural Machine Translation (NMT) has achieved state-of-the art performance in translating from a language; source language, to another; target language. However, many of the proposed methods use word embedding techniques to represent a sentence in the source or target language. Character embedding techniques for this task has been suggested to represent the words in a sentence better. Moreover, recent NMT models use attention mechanism where the most relevant words in a source sentence are used to generate a target word. The problem with this approach is that while some words are translated multiple times, some other words are not translated. To address this problem, coverage model has been integrated into NMT to keep track of already-translated words and focus on the untranslated ones. In this research, we present a new architecture in which we use character embedding for representing the source and target words, and also use coverage model to make certain that all words are translated. We compared our model with the previous models and our model shows comparable improvements. Our model achieves an improvement of 2.87 BLEU (BiLingual Evaluation Understudy) score over the baseline; attention model, for German-English translation, and 0.34 BLEU score improvement for Catalan-Spanish translation.

Descripció

En col·laboració amb la Universitat de Barcelona (UB) i la Universitat Rovira i Virgili (URV)

MatèriesNatural language processing (Computer science), Machine translating, Machine learning, Tractament del llenguatge natural (Informàtica), Traducció Automàtica, Aprenentatge automàtic

TitulacióMÀSTER UNIVERSITARI EN INTEL·LIGÈNCIA ARTIFICIAL (Pla 2012)

URIhttp://hdl.handle.net/2117/105513

Col·leccions

Màsters oficials - Master in Artificial Intelligence - MAI [278]

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
122533.pdf		1,452Mb	PDF	Visualitza/Obre

UPCommons. Portal del coneixement obert de la UPC

Coverage model for character-based neural machine translation

Visualitza/Obre

Explora