Mostra el registre d'ítem simple
The TALP-UPC phrase-based translation systems for WMT12: morphology simplification and domain adaptation
dc.contributor.author | Formiga Fanals, Lluís |
dc.contributor.author | Henríquez Quintana, Carlos Alberto |
dc.contributor.author | Hernández Huerta, Adolfo |
dc.contributor.author | Mariño Acebal, José Bernardo |
dc.contributor.author | Monte Moreno, Enrique |
dc.contributor.author | Rodríguez Fonollosa, José Adrián |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions |
dc.date.accessioned | 2013-03-14T14:52:57Z |
dc.date.available | 2013-03-14T14:52:57Z |
dc.date.created | 2012 |
dc.date.issued | 2012 |
dc.identifier.citation | Formiga, L. [et al.]. The TALP-UPC phrase-based translation systems for WMT12: morphology simplification and domain adaptation. A: Workshop on Statistical Machine Translation. "Proceedings of the Seventh Workshop on Statistical Machine Translation : Montréal, Canada, June 7-8, 2012". Montreal, Quebec: 2012, p. 275-282. |
dc.identifier.uri | http://hdl.handle.net/2117/18301 |
dc.description.abstract | This paper describes the UPC participation in the WMT 12 evaluation campaign. All sys- tems presented are based on standard phrase- based Moses systems. Variations adopted sev- eral improvement techniques such as mor- phology simplification and generation and do- main adaptation. The morphology simpli- fication overcomes the data sparsity prob- lem when translating into morphologically- rich languages such as Spanish by translat- ing first to a morphology-simplified language and secondly leave the morphology gener- ation to an independent classification task. The domain adaptation approach improves the SMT system by adding new translation units learned from MT-output and reference align- ment. Results depict an improvement on TER, METEOR, NIST and BLEU scores compared to our baseline system, obtaining on the of- ficial test set more benefits from the domain adaptation approach than from the morpho- logical generalization method. |
dc.format.extent | 8 p. |
dc.language.iso | eng |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Spain |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
dc.subject | Àrees temàtiques de la UPC::Ensenyament i aprenentatge::Aprenentatge de llengües |
dc.subject | Àrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic |
dc.subject.lcsh | Natural language processing (Computer science) |
dc.subject.lcsh | Signal theory (Telecommunication) |
dc.title | The TALP-UPC phrase-based translation systems for WMT12: morphology simplification and domain adaptation |
dc.type | Conference lecture |
dc.subject.lemac | Tractament del llenguatge natural (Informàtica) |
dc.subject.lemac | Senyal, Teoria del (Telecomunicació) |
dc.contributor.group | Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | http://aclweb.org/anthology-new/W/W12/W12-3133.pdf |
dc.rights.access | Open Access |
local.identifier.drac | 11071249 |
dc.description.version | Postprint (published version) |
local.citation.author | Formiga, L.; Henriquez, C.; Hernandez, A.; Mariño, J.; Monte, E.; Fonollosa, José A. R. |
local.citation.contributor | Workshop on Statistical Machine Translation |
local.citation.pubplace | Montreal, Quebec |
local.citation.publicationName | Proceedings of the Seventh Workshop on Statistical Machine Translation : Montréal, Canada, June 7-8, 2012 |
local.citation.startingPage | 275 |
local.citation.endingPage | 282 |