El GPI fa recerca en Processament d'Imatge i Vídeo per representació, codificació, indexació i anàlisi del contingut visual. L'expertesa del grup en Morfologia i segmentació ha estat la base de contribucions als estàndards ISO MPEG-4 i MPEG-7. La recerca en anàlisi d'imatge l'ha permès participar en projectes europeus des de 1992 als programes RACE (Morpheco, coord.), ACTS (MAVT, MoMuSys, Vidas), IST (Diceman, Hypermedia, INTERFACE, ADViSOR, MASCOT, FAETHON), xarxes d'excel·lència (SCHEMA, SIMILAR, MUSCLE), i projectes integrats FP6 (CHIL) i FP7 (FASCINATE).

http://futur.upc.edu/GPI

El Grupo de Procesado de Imagen y Vídeo investiga técnicas de tratamiento de imagen y vídeo en los campos de compresión, análisis, indexación, representación e interfaces multimodales. El Grupo se ha especializado en herramientas básicas de filtrado no lineal y morfología matemática, segmentación, seguimiento de objetos, detección y reconocimiento de caras, análisis de emociones y modelitzación de la actividad humana, que han sido la base de aplicaciones de codificación basada en el contenido, representación de vídeo mediante índices y tablas de contenido, y contribuciones a estándares internacionales como MPEG4 y MPEG7. El Grupo también desarrolla aplicaciones biomédicas, de teledetección y marcado.

http://futur.upc.edu/GPI

The research of the Image and Video Processing Group focuses on the areas of compression, analysis, indexing, representation and multimodal interfaces. The group specialises in basic tools for nonlinear filtering, mathematical morphology, segmentation, object tracking, face detection and recognition, emotion analysis and modelling of human activity, which have been the basis of applications related to content-based video coding, video indexing and the creation of tables of content and contributions to international standardisation processes such as MPEG-4 and MPEG-7. The group also develops biomedical, remote-sensing and watermarking applications.

http://futur.upc.edu/GPI

The research of the Image and Video Processing Group focuses on the areas of compression, analysis, indexing, representation and multimodal interfaces. The group specialises in basic tools for nonlinear filtering, mathematical morphology, segmentation, object tracking, face detection and recognition, emotion analysis and modelling of human activity, which have been the basis of applications related to content-based video coding, video indexing and the creation of tables of content and contributions to international standardisation processes such as MPEG-4 and MPEG-7. The group also develops biomedical, remote-sensing and watermarking applications.

http://futur.upc.edu/GPI

Enviaments recents

  • Optimum graph cuts for pruning binary partition trees of polarimetric SAR images 

    Salembier Clairon, Philippe Jean; Foucher, Samuel (Institute of Electrical and Electronics Engineers (IEEE), 2016-09-01)
    Article
    Accés obert
    This paper investigates several optimum graph-cut techniques for pruning binary partition trees (BPTs) and their usefulness for the low-level processing of polarimetric synthetic aperture radar (PolSAR) images. BPTs group ...
  • Hierarchical object detection with deep reinforcement learning 

    Bellver, Míriam; Giró Nieto, Xavier; Marqués Acosta, Fernando; Torres, Jordi (2016)
    Comunicació de congrés
    Accés obert
    We present a method for performing hierarchical object detection in images guided by a deep reinforcement learning agent. The key idea is to focus on those parts of the image that contain richer information and zoom on ...
  • Is a “happy dog” more “happy” than “dog”? - Adjective and Noun Contributions for Adjective-Noun Pair prediction 

    Fernàndez, Dèlia; Campos Camúñez, Victor; Jou, Brendan; Giró Nieto, Xavier; Chang, Shih-Fu (2016)
    Comunicació de congrés
    Accés obert
  • Hierarchical visual description schemes for still images and video sequences 

    Salembier Clairon, Philippe Jean; O'Connor, N; Correia Fernandez-Pereira, Paulo; Pereira, F (Institute of Electrical and Electronics Engineers (IEEE), 1999)
    Text en actes de congrés
    Accés obert
    This paper proposes two description schemes (DSs) to describe the visual information of an audio-visual (AV) document. The first one, is devoted to still images. It describes the image visual appearance and its structure ...
  • Optimum watermark detection in color images 

    Sayrol Clols, Elisa; Vidal Manzano, José; Cabanillas, Silvia; Santamaria, Sonia (Institute of Electrical and Electronics Engineers (IEEE), 1999)
    Text en actes de congrés
    Accés obert
    This work concentrates on the problem of watermarking embedding and optimum detection in color images through the use of spread spectrum techniques both in space (Direct Sequence Spread Spectrum or DSSS) and frequency ...
  • Human face segmentation and tracking using connected components and partition projection 

    Marqués Acosta, Fernando; Vilaplana Besler, Verónica; Buxes, A (Institute of Electrical and Electronics Engineers (IEEE), 1999)
    Text en actes de congrés
    Accés obert
    A new technique for segmenting and tracking human faces in video sequences is presented. The algorithm uses a connected operator to extract the connected component that more likely belongs to a face. Such a connected ...
  • Accurate and automatic NOAA-AVHRR image navigation using a global contour matching approach 

    Eugenio, F; Marqués Acosta, Fernando; Gómez, L; Suarez, E; Rovaris, E (2000)
    Text en actes de congrés
    Accés obert
    The problem of precise and automatic AVHRR image navigation is tractable in theory, but has proved to be somewhat difficult in practice. The authors' work has been motivated by the need for a fully automatic and operational ...
  • Description schemes for video program, user and devices 

    Salembier Clairon, Philippe Jean; Richard, Qian; O'Connors, N Et Al; Correia, P. (2000-09)
    Article
    Accés restringit per política de l'editorial
    This paper presents a set of description schemes (DS) dealing with video programs, users and devices. Following MPEG-7 terminology, a description of an AV document includes descriptors (termed Ds), which specify the syntax ...
  • Perceptual masks in the wavelet domain for color image watermarking 

    Sayrol Clols, Elisa; Martínez Serra, Elena (Institute of Electrical and Electronics Engineers (IEEE), 2015)
    Text en actes de congrés
    Accés restringit per política de l'editorial
    In this paper, three different perceptual masks in the wavelet domain are investigated for image watermarking systems. Along with an already existing method, we present two alternative perceptual masks based on visual ...
  • Shallow and deep convolutional networks for saliency prediction 

    Pan, Junting; Sayrol Clols, Elisa; Giró Nieto, Xavier; McGuinness, Kevin; O'Connor, Noel (Institute of Electrical and Electronics Engineers (IEEE), 2016)
    Comunicació de congrés
    Accés restringit per política de l'editorial
    The prediction of salient areas in images has been traditionally addressed with hand-crafted features based on neuroscience principles. This paper, however, addresses the problem with a completely data-driven approach by ...

Mostra'n més