The Natural Language Processing Group began its activity in 1988. From its origins it has been an interdisciplinary group (computer science and linguistics). It has carried out prolific work in several research areas within natural language processing and artificial intelligence. In basic language processing, its main fields include morphosyntactic disambiguation, syntactic and semantic analysis, semantic disambiguation, etc., with special emphasis on applying statistical and machine learning methods to these tasks. Beyond the basic technology, the group also addresses higher-level applications such as machine translation, information extraction, question answering, automatic summarisation, and the processing of dictionaries, text corpora, and linguistic resources in general.

The Natural Language Processing Group (GPLN) conducts research into the automatic processing of natural language, particularly the construction and exploitation of multilingual lexical resources, the extraction of information from natural language documents, and summarisation. Within these research lines, the GPLN focuses on developing language processing tools, such as morphological and syntactic analysers, grammatical taggers, corpus processing systems, and machine learning algorithms specific to natural language processing, and on compiling linguistic resources, such as grammars, dictionaries, and lexical and conceptual databases. The GPLN has also worked on natural language software specifications, with a view to developing tools for software quality control. The GPLN collaborates with the Speech Processing Group on research into speech translation and the development of dialogue systems.

Recent Submissions

  • An environment for management and extraction of taxonomies from on-line dictionaries 

    Ageno Pulido, Alicia; Castellón Masalles, Irene; Martí Antonin, Maria Antònia; Rigau Claramunt, German; Rodríguez Hontoria, Horacio; Taulé, Mariona; Verdejo Maillo, Maria Felisa (1991-06)
    Research report
    Open Access
  • Information retrieval from scientific abstract and citation databases: A query-by-documents approach based on Monte-Carlo sampling 

    Lechtenberg, Fabian; Farreres de la Morena, Xavier; Galvan Cara, Aldwin Lois; Somoza Tornos, Ana; Espuña Camarasa, Antonio; Graells Sobré, Moisès (2022-08-01)
    Article
    Open Access
    The rapidly increasing amount of information and entries in abstract and citation databases steadily complicates the information retrieval task. In this study, a novel query-by-document approach using Monte-Carlo sampling ...
  • Women and computer science in Catalonia 

    Castell Ariño, Núria; Hernández Gómez, M. Angeles (1994-09)
    Research report
    Open Access
    This paper presents the evolution of the role of women in Computer Science in Catalonia. As these graduate studies are quite new, they started with an important participation of women but year after year Computer Science ...
  • Comprobació de restriccions d'integritat en bases de dades deductives bitemporals 

    Martín Escofet, Carme; Sistac Planas, Jaume (1994-03)
    Research report
    Open Access
  • TGE: tlinks generation environment 

    Ageno Pulido, Alicia; Ribas Framis, Francesc; Rigau Claramunt, German; Rodríguez Hontoria, Horacio; Verdejo Maillo, Maria Felisa (1993-07)
    Research report
    Open Access
    This document describes the enhancements made to the lexical knowledge base system in order to support linking entries to their translation equivalents in other languages.
  • Composición de esquemas paralelos 

    Abad Soriano, María Teresa (1994-01)
    Research report
    Open Access
    The cooperation and competition schemes that we use in concurrent problem solving are not powerful enough to handle all possible situations.
  • Evaluation and symmetrisation of alignments obtained with the Giza++ software 

    Lambert, Patrik; Castell Ariño, Núria (2004-03)
    Research report
    Open Access
    This paper presents a simple way of producing symmetric, phrase-based alignments, combining two single-word based alignments. Our algorithm exploits the asymmetries in the superposition of the two word alignments to detect ...
  • Converting the Enciclopedia Catalana bilingual MRD to an MTD 

    Benítez García, Laura; Escudero Bakx, Gerard; Farreres de la Morena, Xavier; Rigau Claramunt, German (1998-02)
    Research report
    Open Access
  • PRE+: a production rule environment 

    Turmo Borras, Jorge (1999-03)
    Research report
    Open Access
    PRE+ is an improved version of the PRE rule-oriented general purpose interpreter. Many rule-guided applications can be built by using a rule-oriented interpreter (e.g. information extraction systems, systems for lexical ...
  • TACAT: tagged corpus analyser tool 

    Atserias Batalla, Jordi; Rodríguez Hontoria, Horacio (1998-01)
    Research report
    Open Access
    This document describes the structure and functional capabilities of TACAT (Tagged Corpus Analyser Tool), a parser and syntactic editor for tagged text, and includes its user manual. TACAT has been developed as part of ITEM ...
  • Applying automatic methods for the construction of multilingual WordNets 

    Benítez García, Laura; Escudero Bakx, Gerard; Farreres de la Morena, Xavier; Rigau Claramunt, German (1998-02)
    Research report
    Open Access
  • Lexicographers' interface prototype 

    Atserias Batalla, Jordi; Farreres de la Morena, Xavier; Rodríguez Hontoria, Horacio; Turmo Borras, Jorge; Verdejo Maillo, Maria Felisa (1996-07)
    Research report
    Open Access
    In this paper we present an interface prototype for lexicographers based on the LDB/LKB integration. The aim of this interface is to allow lexicographers to use the LDB/LKB capabilities in an easy way. The paper describe ...
