A hybrid machine translation architecture guided by syntax

Labaka, Gorka; España Bonet, Cristina; Màrquez Villodre, Lluís; Sarasola, Kepa

doi:10.1007/s10590-014-9153-0

Document type: Article

Publication date: 2014-09-16

Access conditions: Restricted access per publisher policy

All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to existing legal exemptions, its reproduction, distribution, public communication, or transformation is prohibited without the authorization of the rights holder.

Abstract

This article presents a hybrid architecture which combines rule-based machine translation (RBMT) with phrase-based statistical machine translation (SMT). The hybrid translation system is guided by the rule-based engine. Before the transfer step, a varied set of partial candidate translations is calculated with the SMT system and used to enrich the tree-based representation with more translation alternatives. The final translation is constructed by choosing the most probable combination among the available fragments using monotone statistical decoding following the order provided by the rule-based system. We apply the hybrid model to a pair of distantly related languages, Spanish and Basque, and perform extensive experimentation on two different corpora. According to our empirical evaluation, the hybrid approach outperforms the best individual system across a varied set of automatic translation evaluation metrics. Following some output analysis to better understand the behaviour of the hybrid system, we explore the possibility of adding alternative parse trees and extra features to the hybrid decoder. Finally, we present a twofold manual evaluation of the translation systems studied in this paper, consisting of (i) a pairwise output comparison and (ii) an individual task-oriented evaluation using HTER. Interestingly, the manual evaluation shows some contradictory results with respect to the automatic evaluation; humans tend to prefer the translations from the RBMT system over the statistical and hybrid translations.
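The selection step described in the abstract (choosing the most probable combination of fragments while keeping the order fixed by the rule-based system) can be illustrated with a small sketch. This is not the authors' decoder: the names `monotone_decode`, `Candidate`, and `join_score` are hypothetical, the per-fragment log-scores are assumed to be given, and `join_score` merely stands in for whatever model scores adjacent fragments (for example a language model). Under those assumptions, monotone decoding reduces to a Viterbi search over one candidate per chunk.

```python
from typing import Callable, List, Tuple

# Hypothetical type: (target fragment, log-score assigned to that fragment).
Candidate = Tuple[str, float]


def monotone_decode(
    chunks: List[List[Candidate]],
    join_score: Callable[[str, str], float],
) -> Tuple[List[str], float]:
    """Pick one candidate per chunk without reordering the chunks.

    `chunks` is assumed to be already ordered (in the paper's setting, the
    order would come from the rule-based system). `join_score(prev, nxt)` is
    an assumed log-score for placing `nxt` right after `prev`.
    Returns the best fragment sequence and its total log-score.
    """
    if not chunks or any(not c for c in chunks):
        raise ValueError("every chunk needs at least one candidate")

    # best[i][j]: best total score of a path ending in candidate j of chunk i.
    best = [[score for _, score in chunks[0]]]
    # back[i][j]: index of the candidate chosen in chunk i-1 on that path.
    back = [[-1] * len(chunks[0])]

    for i in range(1, len(chunks)):
        row, ptr = [], []
        for text, score in chunks[i]:
            options = [
                best[i - 1][k] + join_score(chunks[i - 1][k][0], text)
                for k in range(len(chunks[i - 1]))
            ]
            k_best = max(range(len(options)), key=options.__getitem__)
            row.append(options[k_best] + score)
            ptr.append(k_best)
        best.append(row)
        back.append(ptr)

    # Follow the back-pointers from the best final candidate.
    j = max(range(len(best[-1])), key=best[-1].__getitem__)
    total = best[-1][j]
    path = []
    for i in range(len(chunks) - 1, -1, -1):
        path.append(chunks[i][j][0])
        j = back[i][j]
    path.reverse()
    return path, total
```

Because the chunk order never changes, the search is linear in the number of chunks and quadratic in the number of candidates per chunk; a decoder with reordering would need a considerably larger search space.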

Citation: Labaka, G. [et al.]. A hybrid machine translation architecture guided by syntax. "Machine translation", 16 September 2014, vol. 28, p. 1-35.

URI: http://hdl.handle.net/2117/24404

DOI: 10.1007/s10590-014-9153-0

ISSN: 0922-6567

Publisher version: http://link.springer.com/article/10.1007/s10590-014-9153-0

Files	Description	Size	Format	View
MT_labakaetal14.pdf	Main article	339.9 KB	PDF	Restricted access

