Now showing items 1-20 of 27

    • A comparison of approaches for measuring cross-lingual similarity of wikipedia articles 

      Barrón-Cedeño, Alberto; Lestari Paramita, Monica; Clough, Paul; Rosso, Paolo (Springer, 2014)
      Conference report
      Open Access
      Wikipedia has been used as a source of comparable texts for a range of tasks, such as Statistical Machine Translation and Cross-Language Information Retrieval. Articles written in different languages on the same topic are ...
    • A factory of comparable corpora from Wikipedia 

      Barrón-Cedeño, Alberto; España Bonet, Cristina; Boldoba Trapote, Josu; Márquez Villodre, Luís (Association for Computational Linguistics, 2015)
      Conference report
      Open Access
      Multiple approaches to grab comparable data from the Web have been developed up to date. Nevertheless, coming out with a high-quality comparable corpus of a specific topic is not straightforward. We present a model ...
    • Boosting terminology extraction through crosslingual resources 

      Cajal Mariñosa, Sergio; Rodríguez Hontoria, Horacio (2014-09)
      Article
      Open Access
      Terminology Extraction is an important Natural Language Processing task with multiple applications in many areas. The task has been approached from different points of view using different techniques. Language and domain ...
    • Combining Wikipedia and WordNet for improving domain terms compilation 

      Vivaldi, Jorge; Rodríguez Hontoria, Horacio; Rigau Claramunt, German (2013)
      External research report
      Open Access
      Domain terms are a useful mean for tuning both resources and NLP processors to domain specific tasks. This paper proposes an improved method for obtaining terms from potentially any domain using the Wikipedia graph structure ...
    • Context-aware knowledge-based recommender system for events 

      Horowitz, Daniel (Universitat Politècnica de Catalunya, 2015-04)
      Master thesis
      Restricted access - confidentiality agreement
    • Creación de artículos en Wikipedia como herramienta de introducción al concepto de web 2.0 para estudiantes de Comunicación Audiovisual 

      Bustillo Iglesias, Andrés; Martin Alonso, David (2009-06-19T09:26:10Z)
      Conference report
      Open Access
      La comprensión de los conceptos básicos que gobiernan la llamada Web 2.0 es fundamental para los estudiantes de la titulación de Comunicación Audiovisual por lo que comporta de cara a su futuro profesional. La ...
    • Elaboración de material para la Wiquipedia: Instrumentación de medida y observación de la retina 

      Amat Marfà, Pedro (Universitat Politècnica de Catalunya, 2021-02-05)
      Bachelor thesis
      Open Access
      Actualmente se utilizan diferentes instrumentos para la observación y medida de la retina y cada uno de ellos presenta diferentes características y funciones. Con el fin de compartir información sobre estos instrumentos ...
    • Enciclopedias participativas 

      Barceló Garcia, Miquel (2004-03)
      Article
      Open Access
    • Enrich Data: millorant les cerques a la web 

      Tamayo Domènech, David (Universitat Politècnica de Catalunya, 2016-06-30)
      Bachelor thesis
      Open Access
      Enrich Data té com objectiu principal dissenyar i desenvolupar un sistema que contribueixi en millorar les webs i alhora l'experiència dels usuaris. Per fer-ho ens proposem un sistema que expandeixi les consultes dels ...
    • Entorn per a la inducció de patrons d’extracció, orientat a la Wikipedia 

      Martí Farriol, Jaume (Universitat Politècnica de Catalunya, 2009-11)
      Master thesis (pre-Bologna period)
      Open Access
      Aquesta memòria descriu un sistema que aprofita l’estructura de la Wikipedia, que combina dades estructurades amb dades no estructurades, per induïr de forma completament automàtica, patrons per a l’extracció d’entitats ...
    • Explotación de wikipedia para el enriquecimiento de un traductor automático 

      Boldoba Trapote, Josu (Universitat Politècnica de Catalunya, 2014-06-22)
      Master thesis (pre-Bologna period)
      Open Access
      Este trabajo aprovecha la naturaleza multilingüe de Wikipedia para construir sistemas de traducción especializados en diferentes áreas de conocimiento. En él se describen los procedimientos seguidos para extraer corpus ...
    • Extracción de una terminología multilingüe de Wikipedia 

      Cajal Mariñosa, Sergio (Universitat Politècnica de Catalunya, 2014-05-03)
      Master thesis (pre-Bologna period)
      Open Access
      Disseny i avaluació d'un algorisme que extrau una terminologia multilingüe fent servir com a font d'informació Wikipedia, i ordena els termes per termhood fent servir una versió modificada de l'algorisme de PageRank de Google.
    • Fine-tuning neural machine translation on gender-balanced datasets 

      Ruiz Costa-Jussà, Marta; de Jorge Sánchez, Adrián (Association for Computational Linguistics, 2020)
      Conference lecture
      Open Access
      Misrepresentation of certain communities in datasets is causing big disruptions in artificial intelligence applications. In this paper, we propose using an automatically extracted gender-balanced dataset parallel corpus ...
    • GeBioToolkit: automatic extraction of gender-balanced multilingual corpus of Wikipedia biographies 

      Ruiz Costa-Jussà, Marta; Li Lin, Pau; España Bonet, Cristina (European Language Resources Association (ELRA), 2020)
      Conference lecture
      Open Access
      We introduce GeBioToolkit, a tool for extracting multilingual parallel corpora at sentence level, with document and gender information from Wikipedia biographies. Despite the gender inequalities present in Wikipedia, the ...
    • Massive query expansion by exploiting graph knowledge bases for image retrieval 

      Guisado Gámez, Joan; Domínguez Sal, David; Larriba Pey, Josep (Association for Computing Machinery (ACM), 2014)
      Conference report
      Restricted access - publisher's policy
      Annotation-based techniques for image retrieval suffer from sparse and short image textual descriptions. Moreover, users are often not able to describe their needs with the most appropriate keywords. This situation is a ...
    • Nuevos monopolios 

      Barceló Garcia, Miquel (2013-11)
      Article
      Open Access
    • Semantic tagging and normalization of French medical entities 

      Cotik, Viviana; Rodríguez Hontoria, Horacio; Vivaldi, Jorge (CEUR-WS.org, 2016)
      Conference report
      Open Access
      In this paper we present two tools for facing task 2 in CLEF eHealth 2016. The first one is a semantic tagger aiming to detect relevant entities in French medical documents, tagging them with their appropriate ...
    • Semantic tagging of French medical entities using distant learning 

      Cotik, Viviana; Rodríguez Hontoria, Horacio; Vivaldi, Jorge (CEUR-WS.org, 2015)
      Conference lecture
      Open Access
      In this paper we present a semantic tagger aiming to detect relevant entities in French medical documents and tagging them with their appropriate semantic class. These experiments has been carried out in the framework ...
    • Semiautomatic completion of Wikipedia contents with domain-specific MT and CLIR 

      Cosma, Adriana Elena (Universitat Politècnica de Catalunya, 2015-04-28)
      Master thesis
      Open Access
      The aim of this master thesis is to develop a system which is able to help these potential users with the enrichment of Wikipedia articles in one language with the information resent in another language. The main contribution ...
    • TALP at GikiCLEF 2009 

      Ferrés Domènech, Daniel; Rodríguez Hontoria, Horacio (2009)
      Conference report
      Open Access
      This paper describes our experiments in Geographical Information Retrieval with the Wikipedia collection in the context of our participation in the GikiCLEF 2009 Multilingual task in English and Spanish. Our system, called ...