• ADN-classifier: automatically assigning denotation types to nominalizations 

      Peris, Aina; Taulé, Mariona; Boleda Torrent, Gemma; Rodríguez Hontoria, Horacio (2010)
      Text en actes de congrés
      Accés obert
      This paper presents the ADN-Classifier, an Automatic classification system of Spanish Deverbal Nominalizations aimed at identifying its semantic denotation (i.e. event, result, underspecified, or lexicalized). The classifier ...
    • Annotation and representation of a diachronic corpus of Spanish 

      Sánchez-Marco, Cristina; Boleda Torrent, Gemma; Fontana, Josep Maria; Domingo, Judith (2010)
      Text en actes de congrés
      Accés obert
      In this article we describe two different strategies for the automatic tagging of a Spanish diachronic corpus involving the adaptation of existing NLP tools developed for modern Spanish. In the initial approach we follow ...
    • El català i les tecnologies de la llengua 

      Boleda Torrent, Gemma; Cuadros Oller, Montserrat; España Bonet, Cristina; Melero, Maite; Padró, Lluís; Quixal, Martí; Rodríguez, Carlos (2009)
      Article
      Accés restringit per política de l'editorial
      El processament computacional de la llengua abraça qualsevol activitat relacionada amb la creació, gestió i utilització de tecnologia i recursos lingüístics. En el pla científic, aquesta activitat és central en disciplines ...
    • Ethnic adjectives are proper adjectives 

      Arsenijevic, Boban; Boleda Torrent, Gemma; Gehrke, Berit; McNally, Louise (2010)
      Text en actes de congrés
      Accés obert
      In this paper, we challenge the view that ethnic adjectives (EAs) are nouns in disguise. Instead, we propose a unified semantics of the thematic and classificatory uses of EAs that treats them as proper adjectives, but ...
    • Language technology challenges of a "small" language (Catalan) 

      Melero, Maite; Boleda Torrent, Gemma; Cuadros Oller, Montserrat; España Bonet, Cristina; Padró, Lluís; Quixal, Martí; Rodríguez, Carlos; Saurí, Roser (2010-05)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      In this paper, we present a brief snapshot of the state of affairs in computational processing of Catalan and the initiatives that are starting to take place in an effort to bring the field a step forward, by making a ...
    • Primera Jornada del Processament Computacional del Català 

      Boleda Torrent, Gemma; Cuadros Oller, Montserrat; España Bonet, Cristina; Melero, Maite; Padró, Lluís; Quixal, Martí; Rodríguez, Carlos (2009-09)
      Article
      Accés restringit per política de l'editorial
      Presentamos las conclusiones de la primera Jornada del Processament Computacional del Català, celebrado en Barcelona en marzo del 2009. We present the conclusions of the first Jornada del Processament Computacional del ...
    • Sobre la I Jornada del Processament Computacional del Català 

      Boleda Torrent, Gemma; Cuadros Oller, Montserrat; España Bonet, Cristina; Melero, Maite; Padró, Lluís; Quixal, Martí; Rodríguez, Carlos (2009)
      Article
      Accés restringit per política de l'editorial
      El processament computacional de la llengua abraça qualsevol activitat relacionada amb la creació, gestió i utilització de tecnologia i recursos lingüístics. En el pla científic, aquesta activitat és central en disciplines ...
    • Word-sense disambiguated multilingual Wikipedia corpus 

      Reese, Samuel; Boleda Torrent, Gemma; Cuadros Oller, Montserrat; Padró, Lluís; Rigau Claramunt, German (2010-05)
      Text en actes de congrés
      Accés obert
      This article presents a new freely available trilingual corpus (Catalan, Spanish, English) that contains large portions of the Wikipedia and has been automatically enriched with linguistic information. To our knowledge, ...
    • Zipf's law for word frequencies: Word forms versus lemmas in long texts 

      Corral, Alvaro; Boleda Torrent, Gemma; Ferrer Cancho, Ramon (2015-07-09)
      Article
      Accés obert
      Zipf's law is a fundamental paradigm in the statistics of written and spoken natural language as well as in other communication systems. We raise the question of the elementary units for which Zipf's law should hold in the ...