Mostra el registre d'ítem simple
Coverage model for character-based neural machine translation
dc.contributor | Padró, Lluís |
dc.contributor | Ruiz Costa-Jussà, Marta |
dc.contributor.author | Kazimi, Mohammad Bashir |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Ciències de la Computació |
dc.date.accessioned | 2017-06-16T11:13:46Z |
dc.date.available | 2017-06-16T11:13:46Z |
dc.date.issued | 2017-05 |
dc.identifier.uri | http://hdl.handle.net/2117/105513 |
dc.description | En col·laboració amb la Universitat de Barcelona (UB) i la Universitat Rovira i Virgili (URV) |
dc.description.abstract | In recent years, Neural Machine Translation (NMT) has achieved state-of-the art performance in translating from a language; source language, to another; target language. However, many of the proposed methods use word embedding techniques to represent a sentence in the source or target language. Character embedding techniques for this task has been suggested to represent the words in a sentence better. Moreover, recent NMT models use attention mechanism where the most relevant words in a source sentence are used to generate a target word. The problem with this approach is that while some words are translated multiple times, some other words are not translated. To address this problem, coverage model has been integrated into NMT to keep track of already-translated words and focus on the untranslated ones. In this research, we present a new architecture in which we use character embedding for representing the source and target words, and also use coverage model to make certain that all words are translated. We compared our model with the previous models and our model shows comparable improvements. Our model achieves an improvement of 2.87 BLEU (BiLingual Evaluation Understudy) score over the baseline; attention model, for German-English translation, and 0.34 BLEU score improvement for Catalan-Spanish translation. |
dc.language.iso | eng |
dc.publisher | Universitat Politècnica de Catalunya |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial |
dc.subject.lcsh | Natural language processing (Computer science) |
dc.subject.lcsh | Machine translating |
dc.subject.lcsh | Machine learning |
dc.subject.other | Deep Learning |
dc.subject.other | Natural Language Processing |
dc.subject.other | Neural Machine Translation |
dc.subject.other | Aprenentatge profund |
dc.subject.other | Traducció Automàtica |
dc.title | Coverage model for character-based neural machine translation |
dc.type | Master thesis |
dc.subject.lemac | Tractament del llenguatge natural (Informàtica) |
dc.subject.lemac | Traducció Automàtica |
dc.subject.lemac | Aprenentatge automàtic |
dc.identifier.slug | 122533 |
dc.rights.access | Open Access |
dc.date.updated | 2017-05-12T04:00:14Z |
dc.audience.educationlevel | Màster |
dc.audience.mediator | Facultat d'Informàtica de Barcelona |
dc.audience.degree | MÀSTER UNIVERSITARI EN INTEL·LIGÈNCIA ARTIFICIAL (Pla 2012) |