Browsing by Author "Màrquez Villodre, Lluís"
Now showing items 1-20 of 46
-
A graph-based strategy to streamline translation quality assessments
Pighin, Daniele; Formiga Fanals, Lluís; Màrquez Villodre, Lluís (2012)
Conference report
Open AccessWe present a detailed analysis of a graph- based annotation strategy that we employed to annotate a corpus of 11,292 real-world En- glish to Spanish automatic translations with relative (ranking) and absolute ... -
A graphical interface for MT evaluation and error analysis
González Bermúdez, Meritxell; Giménez, J.; Màrquez Villodre, Lluís (Association for Computational Linguistics, 2012)
Conference lecture
Open AccessError analysis in machine translation is a necessary step in order to investigate the strengths and weaknesses of the MT systems under development and allow fair comparisons among them. This work presents an application ... -
A hybrid machine translation architecture guided by syntax
Labaka, Gorka; España Bonet, Cristina; Màrquez Villodre, Lluís; Sarasola, Kepa (2014-09-16)
Article
Restricted access - publisher's policyThis article presents a hybrid architecture which combines rule-based machine translation (RBMT) with phrase-based statistical machine translation (SMT). The hybrid translation system is guided by the rule-based engine. ... -
A hybrid system for patent translation
Enache, Ramona; España Bonet, Cristina; Ranta, Aarne; Màrquez Villodre, Lluís (2012)
Conference lecture
Open AccessThis work presents a HMT system for patent translation. The system exploits the high coverage of SMT and the high precision of an RBMT system based on GF to deal with specific issues of the language. The translator is ... -
A joint model for parsing syntactic and semantic dependencies
Lluis Martorell, Xavier; Màrquez Villodre, Lluís (Coling 2008 Organizing Committee, 2008)
Conference report
Open AccessThis paper describes a system that jointly parses syntactic and semantic dependencies, presented at the CoNLL-2008 shared task (Surdeanu et al., 2008). It combines online Peceptron learning (Collins, 2002) with a parsing ... -
A Machine learning approach to POS tagging
Màrquez Villodre, Lluís; Padró, Lluís; Rodríguez Hontoria, Horacio (1997-12)
Research report
Open AccessWe have applied inductive learning of statistical decision trees and relaxation labelling to the Natural Language Processing (NLP) task of morphosyntactic disambiguation (Part Of Speech Tagging). The learning process ... -
A Proposal for wide-coverage Spanish named entity recognition
Arévalo, M.; Carreras Pérez, Xavier; Màrquez Villodre, Lluís; Martí Antonin, Maria Antònia; Padró, Lluís; Simon, Maria José (2002-04)
Research report
Open AccessThis paper presents a proposal for wide--coverage Named Entity Recognition for Spanish. First, a linguistic description of the typology of Named Entities is proposed. Following this definition an architecture of sequential ... -
A second-order joint Eisner model for syntactic and semantic dependency parsing
Lluis Martorell, Xavier; Bott, Stefan Markus; Màrquez Villodre, Lluís (2009)
Conference lecture
Restricted access - publisher's policyWe present a system developed for the CoNLL-2009 Shared Task (Hajic et al., 2009). We extend the Carreras (2007) parser to jointly annotate syntactic and semantic dependencies. This state-of-the-art parser factorizes the ... -
Boosting applied to word sense disambiguation
Escudero Bakx, Gerard; Màrquez Villodre, Lluís; Rigau Claramunt, German (2000-01)
Research report
Open AccessIn this paper we apply Schapire and Singer's AdaBoost.MH boosting algorithm to the Word Sense Disambiguation (WSD) problem. Initial experiments on a set of 15 selected polysemous words show that the boosting approach ... -
Boosting trees for anti-spam email filtering (Extended version)
Carreras Pérez, Xavier; Màrquez Villodre, Lluís (2001-10)
Research report
Open AccessIn this work, a set of comparative experiments for the problem of automatically filtering unwanted electronic mail messages are performed on two public corpora: PU1 and LingSpam. Several variants of the AdaBoost algorithm ... -
Context-aware machine translation for software localization
Muntés Mulero, Víctor; Paladini Adell, Patricia; España Bonet, Cristina; Màrquez Villodre, Lluís (2012)
Conference lecture
Open AccessSoftware localization requires translating short text strings appearing in user interfaces (UI) into several languages. These strings are usually unrelated to the other strings in the UI. Due to the lack of semantic ... -
Deep evaluation of hybrid architectures: simple metrics correlated with human judgments
Labaka, Gorka; Díaz de Ilarraza Sánchez, Arantza; Sarasola Gabiola, Kepa; España Bonet, Cristina; Màrquez Villodre, Lluís (2011)
Conference lecture
Open AccessThe process of developing hybrid MT systems is guided by the evaluation method used to compare different combinations of basic subsystems. This work presents a deep evaluation experiment of a hybrid architecture ... -
Deep evaluation of hybrid architectures: Use of different metrics in MERT weight optimization
España Bonet, Cristina; Labaka, Gorka; Díaz de Ilarraza Sánchez, Arantza; Màrquez Villodre, Lluís; Sarasola, Kepa (2013)
Conference report
Open AccessThe process of developing hybrid MT systems is usually guided by an evaluation method used to compare different combinations of basic subsystems. This work presents a deep evaluation experiment of a hybrid architecture, ... -
Discriminative learning within Arabic statistical machine translation
España Bonet, Cristina; Giménez, Jesús; Màrquez Villodre, Lluís (2009-01)
Research report
Open AccessWritten Arabic is a especially ambiguous due to the lack of diacritisation of texts, and this makes the translation harder for automatic systems that do not take into account the context of phrases. Here, we use a standard ... -
Document-level machine translation as a re-translation process
Martínez Garcia, Eva; España Bonet, Cristina; Màrquez Villodre, Lluís (2014-09-22)
Article
Open AccessMost of the current Machine Translation systems are designed to translate a document sentence by sentence ignoring discourse information and producing incoherencies in the final translations. In this paper we present some ... -
Exploiting diversity of margin-based classifiers
Romero Merino, Enrique; Carreras Pérez, Xavier; Màrquez Villodre, Lluís (2003-12)
Research report
Open AccessAn experimental comparison among Support Vector Machines, AdaBoost and a recently proposed model for maximizing the margin with Feed-forward Neural Networks has been made on a real-world classification problem, namely ... -
Hybrid machine translation guided by a rule-based system
España Bonet, Cristina; Màrquez Villodre, Lluís; Labaka, Gorka; Díaz de Ilarraza Sánchez, Arantza; Sarasola Gabiola, Kepa (2011)
Conference lecture
Open AccessThis paper presents a machine translation architecture which hybridizes Matxin, a rulebased system, with regular phrase-based Statistical Machine Translation. In short, the hybrid translation process is guided by the ... -
Identifying useful human correction feedback from an on-line machine translation service
Barrón-Cedeño, Alberto; Màrquez Villodre, Lluís; Henríquez Quintana, Carlos Alberto; Formiga Fanals, Lluís; Romero Merino, Enrique; May, Jonathan (2013)
Conference report
Open AccessPost-editing feedback provided by users of on-line translation services offers an excellent opportunity for automatic improvement of statistical machine translation (SMT) systems. However, feedback provided by casual users ... -
Identifying useful human feedback from an on-line translation service
Barrón-Cedeño, Alberto; Màrquez Villodre, Lluís; Henríquez Quintana, Carlos Alberto; Formiga Fanals, Lluís; Romero Merino, Enrique; May, Jonathan (2013)
Conference lecture
Open AccessPost-editing feedback provided by users of on-line translation services offers an excellent opportunity for automatic improvement of statistical machine translation (SMT) systems. However, feedback provided by casual ... -
IPA and STOUT: leveraging linguistic and source-based features for machine translation evaluation
González Bermúdez, Meritxell; Barrón-Cedeño, Alberto; Màrquez Villodre, Lluís (Association for Computational Linguistics, 2014)
Conference lecture
Restricted access - publisher's policyThis paper describes the UPC submissions to the WMT14 Metrics Shared Task : UPC-IPA and UPC-STOUT. These metrics use a collection of evaluation measures integrated in ASIYA, a toolkit for machine translation evaluation. ...