
dc.contributor.author: Khalilov, Maxim
dc.contributor.author: Rodríguez Fonollosa, José Adrián
dc.contributor.other: Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned: 2011-07-13T16:57:01Z
dc.date.available: 2011-07-13T16:57:01Z
dc.date.created: 2011-10
dc.date.issued: 2011-10
dc.identifier.citation: Khalilov, M.; Fonollosa, José A. R. Syntax-based reordering for statistical machine translation. "Computer speech and language", October 2011, vol. 25, no. 4, p. 761-788.
dc.identifier.issn: 0885-2308
dc.identifier.uri: http://hdl.handle.net/2117/12964
dc.description.abstract: In this paper, we develop an approach called syntax-based reordering (SBR) to handle the fundamental problem of word ordering in statistical machine translation (SMT). We propose to alleviate the word order challenge by including morpho-syntactical and statistical information in a pre-translation reordering framework aimed at capturing short- and long-distance word distortion dependencies. We examine the proposed approach from theoretical and experimental points of view, discussing and analyzing its advantages and limitations in comparison with some state-of-the-art reordering methods. In the final part of the paper, we describe the results of applying the syntax-based model to translation tasks with a great need for reordering (Chinese-to-English and Arabic-to-English). The experiments are carried out on standard phrase-based and alternative N-gram-based SMT systems. We first investigate sparse training data scenarios, in which the translation and reordering models are trained on sparse bilingual data, and then scale the method to a large training set, demonstrating that the improvement in translation quality is maintained.
dc.format.extent: 28 p.
dc.language.iso: eng
dc.publisher: Elsevier
dc.rights: Attribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subject: Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Llenguatge natural
dc.subject.lcsh: Natural language processing
dc.subject.lcsh: Computational linguistics
dc.title: Syntax-based reordering for statistical machine translation
dc.type: Article
dc.subject.lemac: Lingüística computacional
dc.subject.lemac: Tractament del llenguatge natural (Informàtica)
dc.contributor.group: Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.identifier.doi: 10.1016/j.csl.2011.01.001
dc.description.peerreviewed: Peer Reviewed
dc.relation.publisherversion: http://www.sciencedirect.com/science/article/B6WCW-525YP4S-1/2/5ac1785c72af82c125d6d716f37f4fbf
dc.rights.access: Restricted access - publisher's policy
local.identifier.drac: 5799381
dc.description.version: Postprint (published version)
local.citation.author: Khalilov, M.; Fonollosa, José A. R.
local.citation.publicationName: Computer speech and language
local.citation.volume: 25
local.citation.number: 4
local.citation.startingPage: 761
local.citation.endingPage: 788

