El grup de processament del llenguatge natural inicia la seva activitat al 1988. Des dels seus orígens ha sigut un grup interdisciplinari (informàtica i llingüistica). Ha portat a terme una prolífica activitat en diverses àrees d'investigació dins de processament del llenguatge natural i la intel•ligència artificial. En el processament bàsic de la llengua destaquen els camps de desambiguació morfosintàctica, anàlisis sintàctica i semàntica, desambiguació semàntica, etc., amb especial èmfasi en l'aplicació de mètodes estadístics i d'aprenentatge automàtic per resoldre aquestes tasques. Així mateix, apart de la tecnologia bàsica, també s'aborden aplicacions de més alt nivell, com ara traducció automàtica, extracció d'informació, donar resposta a preguntes, resum automàtic, processament de diccionaris, de corpus textuals, i de recursos lingüístics en general.

http://futur.upc.edu/GPLN

The Natural Language Processing Group (GPLN) conducts research into the automatic processing of natural language, particularly those aspects related to the construction and exploitation of multilingual lexical resources and the extraction of information from documents written in natural language and summarisation. Within these research lines, the GPLN focuses its attention on developing language processing tools, such as morphological and syntactical analysers, grammatical labelers, corpus processing systems, and specific machine learning algorithms for natural language processing and on compiling linguistic resources, such as grammars, dictionaries and lexical and conceptual databases. The GPLN has also been working on natural language software specifications for the development of tools for the quality control of software. The GPLN collaborates with the Speech Processing Group in the research of oral translation and the development of dialogue systems.

http://futur.upc.edu/GPLN

Recent Submissions

  • Support vector machines for query-focused summarization trained and evaluated on pyramid data 

    Fuentes Fort, Maria; Alfonseca, Enrique; Rodríguez Hontoria, Horacio (2007-01)
    External research report
    Open Access
    This paper presents the use of Support Vector Machines (SVM) to detect relevant information to be included in a queryfocused summary. Several classifiers are trained using pyramids of summary content units information. ...
  • FEMsum: A flexible eclectic multitask summarizer architecture evaluated in multidocument tasks 

    Fuentes Fort, Maria; Rodríguez Hontoria, Horacio; Turmo Borras, Jorge (2007-01)
    External research report
    Open Access
    This article describes two types of summarization approaches integrated in a flexible architecture for multitask summarization. The first type is based on the use of lexical features, while the second one is grounded on ...
  • A graph partitioning approach to coreference resolution 

    Sapena Masip, Emilio; Padró, Lluís; Turmo Borras, Jorge (2009-01)
    External research report
    Open Access
    This report presents a graph partitioning approach given a set of constraints to resolve coreferences. Coreference resolution is the task of determining which referring expressions in a discourse refer to the same entity. ...
  • Discriminative learning within Arabic statistical machine translation 

    España Bonet, Cristina; Giménez, Jesús; Màrquez Villodre, Lluís (2009-01)
    External research report
    Open Access
    Written Arabic is a especially ambiguous due to the lack of diacritisation of texts, and this makes the translation harder for automatic systems that do not take into account the context of phrases. Here, we use a standard ...
  • Coreference resolution survey 

    Sapena Masip, Emilio; Padró, Lluís; Turmo Borras, Jorge (2008-12)
    External research report
    Open Access
    This survey is an extended summarization of state of the art of coreference resolution. The key concepts related to coreference and anaphora are presented, the most relevant approaches to coreference resolution are discussed, ...
  • Recomendador t-incluye para un uso inclusivo del lenguaje 

    Fuentes Fort, Maria; Padró, Lluís; Turmo Borras, Jorge (2008-11)
    External research report
    Open Access
    Sistema que procesa un texto escrito en castellano detectando usos del lenguaje no inclusivos. Para cada sintagma nominal sospechoso el sistema propone una serie de alternativas. El sistema permite también la adquisición ...
  • A question-driven information system adaptable via information extraction techniques 

    Sapena Masip, Emilio; González Pérez, Manuel; Padró, Lluís; Turmo Borras, Jorge (2008-06)
    External research report
    Open Access
    This paper presents a system is composed by two main parts: The Information Extraction Process, which crawls the web and extracts all relevant knowledge that is then stored in the database, and the Question Processor which ...
  • Aprendizaje y asistencia virtual en red : la prueba Piloto : Cátedra Telefófica UPC : Análisis de le evolución y tendencias futuras de la sociedad de la información 

    Fuentes Fort, Maria; González Bermúdez, Meritxell; Guardiola Garcia, Marta; Jofre Roca, Lluís; Romeu Robert, Jordi; Vallverdú Bayés, Francesc (2011-07-29)
    External research report
    Open Access
    Hemos llevado a cabo una prueba piloto, demostradora del uso de las tecnologías del lenguaje y el habla aplicadas al aprendizaje de inglés. En general, se espera que la actividad acerque al alumno el máximo posible a la ...
  • On the process of building a process systems engineering ontology using a semi-automatic construction approach 

    Dombayci, Canan; Farreres de la Morena, Xavier; Rodríguez Hontoria, Horacio; Muñoz Mata, Edrisi; Capon Garcia, Elisabeth; Espuña Camarasa, Antonio; Graells Sobré, Moisès (Elsevier, 2015)
    Conference lecture
    Open Access
    This work presents a novel systematic approach for the construction of domain ontolog ies . The s ugge sted approach uses a semi - automatic construction methodology . F or this study , parent - child concept pairs are ...
  • Knowledge-based and data-driven approaches for georeferencing of informal documents 

    Ferrés Domènech, Daniel; Rodríguez Hontoria, Horacio (Springer, 2015)
    Conference lecture
    Restricted access - publisher's policy
    This paper describes Knowledge-Based and Data-Driven approaches we have followed for generic Textual Georeferencing of Informal Documents. Textual georeferencing consists in assigning a set of geographical coordinates to ...

View more