• A comparison of approaches for measuring cross-lingual similarity of Wikipedia articles

      Barrón-Cedeño, Alberto; Lestari Paramita, Monica; Clough, Paul; Rosso, Paolo (Springer, 2014)
      Conference paper
      Open access
      Wikipedia has been used as a source of comparable texts for a range of tasks, such as Statistical Machine Translation and Cross-Language Information Retrieval. Articles written in different languages on the same topic are ...
    • A deep source-context feature for lexical selection in statistical machine translation

      Gupta, Parth; Ruiz Costa-Jussà, Marta; Rosso, Paolo; Banchs Martínez, Rafael Enrique (Elsevier, 2016-05-01)
      Article
      Open access
      This paper presents a methodology to address lexical disambiguation in a standard phrase-based statistical machine translation system. Similarity among source contexts is used to select appropriate translation units. The ...
    • GeoTextMESS: result fusion with fuzzy Borda ranking in geographical information retrieval 

      Buscaldi, Davide; Perea Ortega, Jose Manuel; Rosso, Paolo; Ureña López, L. Alfonso; Ferrés Domènech, Daniel; Rodríguez Hontoria, Horacio (2009)
      Article
      Open access
      In this paper we discuss the integration of different GIR systems by means of a fuzzy Borda method for result fusion. Two of the systems, the one by the Universidad Politécnica de Valencia and the one of the Universidad ...
    • Methods for cross-language plagiarism detection 

      Barrón-Cedeño, Alberto; Gupta, P.; Rosso, Paolo (2013-09)
      Article
      Access restricted by publisher policy
      Three reasons explain why plagiarism across languages is on the rise: (i) speakers of under-resourced languages often consult documentation in a foreign language, (ii) people immersed in a foreign country can still consult ...
    • PAN@FIRE: overview of the cross-language Indian text re-use detection competition 

      Barrón-Cedeño, Alberto; Rosso, Paolo; Lalitha Devi, Sobha; Clough, Paul; Stevenson, Mark (2010)
      Conference paper
      Access restricted by publisher policy
      The development of models for automatic detection of text re-use and plagiarism across languages has received increasing attention in recent years. However, the lack of an evaluation framework composed of annotated datasets ...
    • Plagiarism meets paraphrasing: insights for the next generation in automatic plagiarism detection 

      Barrón-Cedeño, Alberto; Vila, Marta; Martí, Maria Antonia; Rosso, Paolo (2013)
      Article
      Open access
      Although paraphrasing is the linguistic mechanism underlying many plagiarism cases, little attention has been paid to its analysis in the framework of automatic plagiarism detection. Therefore, state-of-the-art plagiarism ...