Show simple item record

dc.contributor.authorHenriquez, Carlos A
dc.contributor.authorRuiz Costa-Jussà, Marta
dc.contributor.authorDaudaravicius, Vidas
dc.contributor.authorBanchs Martínez, Rafael Enrique
dc.contributor.authorMariño, José B.
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.identifier.citationHenriquez, C., Ruiz, M., Daudaravicius, V., Banchs, R., Mariño, J. Using collocation segmentation to augment the phrase table. A: Workshop on Statistical Machine Translation and MetricsMATR. "ACL 2010 Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR (WMT '10) : Uppsala, Sweden, July 15 - 16, 2010". Association for Computational Linguistics, 2010, p. 98-102.
dc.description.abstractThis paper describes the 2010 phrase-based statistical machine translation system developed at the TALP Research Center of the UPC1 in cooperation with BMIC2 and VMU3. In phrase-based SMT, the phrase table is the main tool in translation. It is created extracting phrases from an aligned parallel corpus and then computing translation model scores with them. Performing a collocation segmentation over the source and target corpus before the alignment causes that di erent and larger phrases are extracted from the same original documents. We performed this segmentation and used the union of this phrase set with the phrase set extracted from the nonsegmented corpus to compute the phrase table. We present the con gurations considered and also report results obtained with internal and o cial test sets.
dc.format.extent5 p.
dc.publisherAssociation for Computational Linguistics
dc.subjectÀrees temàtiques de la UPC::Informàtica
dc.subject.lcshMachine translating
dc.subject.otherMachine translation
dc.titleUsing collocation segmentation to augment the phrase table
dc.typeConference report
dc.subject.lemacTraducció automàtica
dc.contributor.groupUniversitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.rights.accessOpen Access
dc.description.versionPostprint (published version)
local.citation.authorHenriquez, C.; Ruiz, M.; Daudaravicius, V.; Banchs, R.; Mariño, J.
local.citation.contributorWorkshop on Statistical Machine Translation and MetricsMATR
local.citation.publicationNameACL 2010 Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR (WMT '10) : Uppsala, Sweden, July 15 - 16, 2010

Files in this item


This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Spain
Except where otherwise noted, content on this work is licensed under a Creative Commons license : Attribution-NonCommercial-NoDerivs 3.0 Spain