El GPI fa recerca en Processament d'Imatge i Vídeo per representació, codificació, indexació i anàlisi del contingut visual. L'expertesa del grup en Morfologia i segmentació ha estat la base de contribucions als estàndards ISO MPEG-4 i MPEG-7. La recerca en anàlisi d'imatge l'ha permès participar en projectes europeus des de 1992 als programes RACE (Morpheco, coord.), ACTS (MAVT, MoMuSys, Vidas), IST (Diceman, Hypermedia, INTERFACE, ADViSOR, MASCOT, FAETHON), xarxes d'excel·lència (SCHEMA, SIMILAR, MUSCLE), i projectes integrats FP6 (CHIL) i FP7 (FASCINATE).

http://futur.upc.edu/GPI

El Grupo de Procesado de Imagen y Vídeo investiga técnicas de tratamiento de imagen y vídeo en los campos de compresión, análisis, indexación, representación e interfaces multimodales. El Grupo se ha especializado en herramientas básicas de filtrado no lineal y morfología matemática, segmentación, seguimiento de objetos, detección y reconocimiento de caras, análisis de emociones y modelitzación de la actividad humana, que han sido la base de aplicaciones de codificación basada en el contenido, representación de vídeo mediante índices y tablas de contenido, y contribuciones a estándares internacionales como MPEG4 y MPEG7. El Grupo también desarrolla aplicaciones biomédicas, de teledetección y marcado.

http://futur.upc.edu/GPI

The research of the Image and Video Processing Group focuses on the areas of compression, analysis, indexing, representation and multimodal interfaces. The group specialises in basic tools for nonlinear filtering, mathematical morphology, segmentation, object tracking, face detection and recognition, emotion analysis and modelling of human activity, which have been the basis of applications related to content-based video coding, video indexing and the creation of tables of content and contributions to international standardisation processes such as MPEG-4 and MPEG-7. The group also develops biomedical, remote-sensing and watermarking applications.

http://futur.upc.edu/GPI

The research of the Image and Video Processing Group focuses on the areas of compression, analysis, indexing, representation and multimodal interfaces. The group specialises in basic tools for nonlinear filtering, mathematical morphology, segmentation, object tracking, face detection and recognition, emotion analysis and modelling of human activity, which have been the basis of applications related to content-based video coding, video indexing and the creation of tables of content and contributions to international standardisation processes such as MPEG-4 and MPEG-7. The group also develops biomedical, remote-sensing and watermarking applications.

http://futur.upc.edu/GPI

Enviaments recents

  • Magnetic resonance imaging and machine learning make a valuable combined tool for the screening of preclinical AD 

    Petrone, Paula; Vilaplana Besler, Verónica; Casamitjana Díaz, Adrià; Domingo Gispert, Juan; Molinuevo, Jose Luis; Sánchez Escobedo, Dalila (2017-07)
    Article
    Accés restringit per política de l'editorial
  • Hierarchical stack filtering: a bitplane-based algorithm for massively parallel processors 

    Frías Velazquez, Andres; Morros Rubió, Josep Ramon; García Molina, Mario; Philips, Wilfried (2017-03-15)
    Article
    Accés obert
    With the development of novel parallel architectures for image processing, the implementation of well-known image operators needs to be reformulated to take advantage of the so-called massive parallelism. In this work, we ...
  • Cost-effective active learning for melanoma segmentation 

    Górriz, Marc; Giró Nieto, Xavier; Carlier, Axel; Faure, Emmanuel (2017)
    Comunicació de congrés
    Accés obert
    We propose a novel Active Learning framework capable to train effectively a convolutional neural network for semantic segmentation of medical imaging, with a limited amount of training labeled data. Our contribution is a ...
  • 3D hierarchical optimization for multi-view depth map coding 

    Maceira, Marc; Varas, David; Morros Rubió, Josep Ramon; Ruiz Hidalgo, Javier; Marqués Acosta, Fernando (2017-11-28)
    Article
    Accés restringit per política de l'editorial
    Depth data has a widespread use since the popularity of high resolution 3D sensors. In multi-view sequences, depth information is used to supplement the color data of each view. This article proposes a joint encoding of ...
  • Hierarchical object detection with deep reinforcement learning 

    Bellver, Míriam; Giró Nieto, Xavier; Marqués Acosta, Fernando; Torres Viñals, Jordi (IOS Press, 2017-11-23)
    Capítol de llibre
    Accés restringit per política de l'editorial
    Deep learning and image processing are two areas of great interest to academics and industry professionals alike. The areas of application of these two disciplines range widely, encompassing fields such as medicine, robotics, ...
  • Object retrieval with deep convolutional features 

    Mohedano, Eva; Salvador Aguilera, Amaia; McGuinness, Kevin; Giró Nieto, Xavier; O'Connor, Noel; Marqués Acosta, Fernando (IOS Press, 2017-11-23)
    Capítol de llibre
    Accés restringit per política de l'editorial
    Deep learning and image processing are two areas of great interest to academics and industry professionals alike. The areas of application of these two disciplines range widely, encompassing fields such as medicine, robotics, ...
  • Towards large scale multimedia indexing: a case study on person discovery in broadcast news 

    Le, Nam; Bredin, Herve; Sergent, Gabriel; India Massana, Miquel Àngel; López-Otero, Paula; Barras, Claude; Guinaudeau, Camille; Gravier, Guillaume; Barbosa da Fonseca, Gabriel; Lyon Freire, Izabela; Patrocinio Jr., Zenilton; Jamil F. Guimarães, Silvio; Martí Juan, Gerard; Morros Rubió, Josep Ramon; Hernando Pericás, Francisco Javier; Docio-Fernández, Laura; García-Mateo, Carmen; Meignier, Sylvain; Odobez, Jean-Marc (Association for Computing Machinery (ACM), 2017)
    Text en actes de congrés
    Accés restringit per política de l'editorial
    The rapid growth of multimedia databases and the human interest in their peers make indices representing the location and identity of people in audio-visual documents essential for searching archives. Person discovery ...
  • Two level continuous speech recognition using demisyllable-based HMM word spotting 

    Lleida Solano, Eduardo; Mariño Acebal, José Bernardo; Nadeu Camprubí, Climent; Oliveras Vergés, Albert (1991)
    Text en actes de congrés
    Accés obert
    This paper describes a two level Spanish Continuous Speech Recognition System based on Demisyllable HMM modelling, word-spotting and finite-state lexical and syntactic knowledge. The first level, the word level, is based ...
  • Image sequence analysis and merging 

    Salembier Clairon, Philippe Jean; Garrido Ostermann, Luis; Garcia, D (LINKÖPING UNIVERSITY, 1997)
    Text en actes de congrés
    Accés obert
  • Registration of images to unorganized 3D point clouds using contour cues 

    Pujol Miró, Alba; Ruiz Hidalgo, Javier; Casas Pla, Josep Ramon (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    Text en actes de congrés
    Accés obert
    Low resolution commercial 3D sensors contribute to computer vision tasks even better when the analysis is carried out in a combination with higher resolution image data. This requires registration of 2D images to ...
  • ViTS: Video tagging system from massive web multimedia collections 

    Fernàndez, Dèlia; Varas, David; Espadaler, Joan; Masuda, Issey; Ferreira, Jordi; Woodward, Alejandro; Rodríguez, David; Giró Nieto, Xavier; Riveiro, Juan Carlos; Bou Balust, Elisenda (IEEE Press, 2017)
    Text en actes de congrés
    Accés obert
    The popularization of multimedia content on the Web has arised the need to automatically understand, index and retrieve it. In this paper we present ViTS, an automatic Video Tagging System which learns from videos, their ...
  • More cat than cute?: interpretable prediction of adjective-noun pairs 

    Fernàndez, Dèlia; Woodward, Alejandro; Campos Camunez, Victor; Giró Nieto, Xavier; Jou, Brendan; Chang, Shih-Fu (2017)
    Text en actes de congrés
    Accés restringit per política de l'editorial
    The increasing availability of affect-rich multimedia resources has bolstered interest in understanding sentiment and emotions in and from visual content. Adjective-noun pairs (ANP) are a popular midlevel semantic ...

Mostra'n més