• Language technology challenges of a "small" language (Catalan) 

      Melero, Maite; Boleda Torrent, Gemma; Cuadros Oller, Montserrat; España Bonet, Cristina; Padró, Lluís; Quixal, Martí; Rodríguez, Carlos; Saurí, Roser (2010-05)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      In this paper, we present a brief snapshot of the state of affairs in computational processing of Catalan and the initiatives that are starting to take place in an effort to bring the field a step forward, by making a ...
    • Primera Jornada del Processament Computacional del Català 

      Boleda Torrent, Gemma; Cuadros Oller, Montserrat; España Bonet, Cristina; Melero, Maite; Padró, Lluís; Quixal, Martí; Rodríguez, Carlos (2009-09)
      Article
      Accés restringit per política de l'editorial
      Presentamos las conclusiones de la primera Jornada del Processament Computacional del Català, celebrado en Barcelona en marzo del 2009. We present the conclusions of the first Jornada del Processament Computacional del ...
    • Sobre la I Jornada del Processament Computacional del Català 

      Boleda Torrent, Gemma; Cuadros Oller, Montserrat; España Bonet, Cristina; Melero, Maite; Padró, Lluís; Quixal, Martí; Rodríguez, Carlos (2009)
      Article
      Accés restringit per política de l'editorial
      El processament computacional de la llengua abraça qualsevol activitat relacionada amb la creació, gestió i utilització de tecnologia i recursos lingüístics. En el pla científic, aquesta activitat és central en disciplines ...
    • Word-sense disambiguated multilingual Wikipedia corpus 

      Reese, Samuel; Boleda Torrent, Gemma; Cuadros Oller, Montserrat; Padró, Lluís; Rigau Claramunt, German (2010-05)
      Text en actes de congrés
      Accés obert
      This article presents a new freely available trilingual corpus (Catalan, Spanish, English) that contains large portions of the Wikipedia and has been automatically enriched with linguistic information. To our knowledge, ...
    • Zipf's law for word frequencies: Word forms versus lemmas in long texts 

      Corral, Alvaro; Boleda Torrent, Gemma; Ferrer Cancho, Ramon (2015-07-09)
      Article
      Accés obert
      Zipf's law is a fundamental paradigm in the statistics of written and spoken natural language as well as in other communication systems. We raise the question of the elementary units for which Zipf's law should hold in the ...