Exploració per autor "España Bonet, Cristina"
Ara es mostren els items 6-25 de 27
-
Deep evaluation of hybrid architectures: simple metrics correlated with human judgments
Labaka, Gorka; Díaz de Ilarraza Sánchez, Arantza; Sarasola Gabiola, Kepa; España Bonet, Cristina; Màrquez Villodre, Lluís (2011)
Comunicació de congrés
Accés obertThe process of developing hybrid MT systems is guided by the evaluation method used to compare different combinations of basic subsystems. This work presents a deep evaluation experiment of a hybrid architecture ... -
Deep evaluation of hybrid architectures: Use of different metrics in MERT weight optimization
España Bonet, Cristina; Labaka, Gorka; Díaz de Ilarraza Sánchez, Arantza; Màrquez Villodre, Lluís; Sarasola, Kepa (2013)
Text en actes de congrés
Accés obertThe process of developing hybrid MT systems is usually guided by an evaluation method used to compare different combinations of basic subsystems. This work presents a deep evaluation experiment of a hybrid architecture, ... -
Discriminative learning within Arabic statistical machine translation
España Bonet, Cristina; Giménez, Jesús; Màrquez Villodre, Lluís (2009-01)
Report de recerca
Accés obertWritten Arabic is a especially ambiguous due to the lack of diacritisation of texts, and this makes the translation harder for automatic systems that do not take into account the context of phrases. Here, we use a standard ... -
Document-level machine translation as a re-translation process
Martínez Garcia, Eva; España Bonet, Cristina; Màrquez Villodre, Lluís (2014-09-22)
Article
Accés obertMost of the current Machine Translation systems are designed to translate a document sentence by sentence ignoring discourse information and producing incoherencies in the final translations. In this paper we present some ... -
Document-level machine translation with word vector models
Martínez Garcia, Eva; España Bonet, Cristina; Márquez Villodre, Luís (2015)
Text en actes de congrés
Accés obertIn this paper we apply distributional semantic information to document-level machine translation. We train monolingual and bilingual word vector models on large corpora and we evaluate them first in a cross-lingual lexical ... -
El català i les tecnologies de la llengua
Boleda Torrent, Gemma; Cuadros Oller, Montserrat; España Bonet, Cristina; Melero, Maite; Padró, Lluís; Quixal, Martí; Rodríguez, Carlos (2009)
Article
Accés restringit per política de l'editorialEl processament computacional de la llengua abraça qualsevol activitat relacionada amb la creació, gestió i utilització de tecnologia i recursos lingüístics. En el pla científic, aquesta activitat és central en disciplines ... -
Experiments on document level machine translation
Martínez Garcia, Eva; España Bonet, Cristina; Márquez Villodre, Luís (2014-03-03)
Report de recerca
Accés obertMost of the current SMT systems work at sentence level. They translate a text assuming that sentences are independent, but, when one looks at a well formed document, it is clear that there exist many inter sentence relations. ... -
Full machine translation for factoid question answering
España Bonet, Cristina; Comas Umbert, Pere Ramon (Association for Computational Linguistics, 2012)
Comunicació de congrés
Accés obertIn this paper we present an SMT-based approach to Question Answering (QA). QA is the task of extracting exact answers in response to natural language questions. In our approach, the answer is a translation of the question ... -
GeBioToolkit: automatic extraction of gender-balanced multilingual corpus of Wikipedia biographies
Ruiz Costa-Jussà, Marta; Li Lin, Pau; España Bonet, Cristina (European Language Resources Association (ELRA), 2020)
Comunicació de congrés
Accés obertWe introduce GeBioToolkit, a tool for extracting multilingual parallel corpora at sentence level, with document and gender information from Wikipedia biographies. Despite the gender inequalities present in Wikipedia, the ... -
Hybrid machine translation guided by a rule-based system
España Bonet, Cristina; Màrquez Villodre, Lluís; Labaka, Gorka; Díaz de Ilarraza Sánchez, Arantza; Sarasola Gabiola, Kepa (2011)
Comunicació de congrés
Accés obertThis paper presents a machine translation architecture which hybridizes Matxin, a rulebased system, with regular phrase-based Statistical Machine Translation. In short, the hybrid translation process is guided by the ... -
Language technology challenges of a "small" language (Catalan)
Melero, Maite; Boleda Torrent, Gemma; Cuadros Oller, Montserrat; España Bonet, Cristina; Padró, Lluís; Quixal, Martí; Rodríguez, Carlos; Saurí, Roser (2010-05)
Text en actes de congrés
Accés restringit per política de l'editorialIn this paper, we present a brief snapshot of the state of affairs in computational processing of Catalan and the initiatives that are starting to take place in an effort to bring the field a step forward, by making a ... -
MT techniques in a retrieval system of semantically enriched patents
González Bermúdez, Meritxell; Mateva, Maria; Enache, Ramona; España Bonet, Cristina; Màrquez Villodre, Lluís; Popov, Borislav; Ranta, Aarne (2013)
Comunicació de congrés
Accés obertThis paper focuses on how automatic translation techniques integrated in a patent retrieval system increase its capabilities and make possible extended features and functionalities. We describe 1) a novel methodology ... -
Overview of TweetMT : a shared task on machine translation of tweets at SEPLN 2015
Alegria, Iñaki; Aranberri, Nora; España Bonet, Cristina; Gamallo, Pablo; Gonçalo Oliveira, Hugo; Martínez Garcia, Eva; San Vicente Roncal, Iñaki; Toral, Antonio; Zubiaga, Arkaitz (2015)
Text en actes de congrés
Accés obertThis article presents an overview of the shared task that took place as part of the TweetMT workshop held at SEPLN 2015. The task consisted in translating collections of tweets from and to several ... -
Patent translation within the MOLTO project
España Bonet, Cristina; Enache, Ramona; Slaski, Adam; Ranta, Aarne; Màrquez Villodre, Lluís; González Bermúdez, Meritxell (2011)
Comunicació de congrés
Accés obertMOLTO is an FP7 European project whose goal is to translate texts between multiple languages in real time with high quality. Patents translation is a case of study where research is focused on simultaneously obtaining a ... -
Primera Jornada del Processament Computacional del Català
Boleda Torrent, Gemma; Cuadros Oller, Montserrat; España Bonet, Cristina; Melero, Maite; Padró, Lluís; Quixal, Martí; Rodríguez, Carlos (2009-09)
Article
Accés restringit per política de l'editorialPresentamos las conclusiones de la primera Jornada del Processament Computacional del Català, celebrado en Barcelona en marzo del 2009. We present the conclusions of the first Jornada del Processament Computacional del ... -
Robust Estimation of Feature Weights in Statistical Machine Translation
España Bonet, Cristina; Màrquez Villodre, Lluís (2010)
Comunicació de congrés
Accés obertWeights of the various components in a standard Statistical Machine Translation model are usually estimated via Minimum Error Rate Training. With this, one finds their optimum value on a development set with the ... -
Sobre la I Jornada del Processament Computacional del Català
Boleda Torrent, Gemma; Cuadros Oller, Montserrat; España Bonet, Cristina; Melero, Maite; Padró, Lluís; Quixal, Martí; Rodríguez, Carlos (2009)
Article
Accés restringit per política de l'editorialEl processament computacional de la llengua abraça qualsevol activitat relacionada amb la creació, gestió i utilització de tecnologia i recursos lingüístics. En el pla científic, aquesta activitat és central en disciplines ... -
The patents retrieval prototype in the MOLTO project
Chechev, Milen; González Bermúdez, Meritxell; Màrquez Villodre, Lluís; España Bonet, Cristina (ACM Press. Association for Computing Machinery, 2012)
Text en actes de congrés
Accés restringit per política de l'editorialThis paper describes the patents retrieval prototype developed within the MOLTO project. The prototype aims to provide a multilingual natural language interface for querying the content of patent documents. The developed ... -
The UPC TweetMT participation : translating formal tweets using context information
Martínez Garcia, Eva; España Bonet, Cristina; Márquez Villodre, Luís (2015)
Text en actes de congrés
Accés obertIn this paper, we describe the UPC systems that participated in the TweetMT shared task. We developed two main systems that were applied to the Spanish-Catalan language pair: a state-of-the-art phrase-based ... -
Wikicardi : hacia la extracción de oraciones paralelas de Wikipedia
Boldoba Trapote, Josu; Barrón-Cedeño, Alberto; España Bonet, Cristina (2014-03-01)
Report de recerca
Accés obertUno de los objetivos del proyecto Tacardi (TIN2012-38523-C02-00) consiste en extraer oraciones paralelas de corpus comparables para enriquecer y adaptar traductores automáticos. En esta investigación usamos un subconjunto ...