<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0">
  <channel>
    <title>DSpace Collection:</title>
    <link>http://hdl.handle.net/2117/3335</link>
    <description />
    <pubDate>Sat, 25 May 2013 04:49:32 GMT</pubDate>
    <dc:date>2013-05-25T04:49:32Z</dc:date>
    <itunes:owner>
      <itunes:email>webmaster.bupc@upc.edu</itunes:email>
      <itunes:name>Universitat Politècnica de Catalunya. Servei de Biblioteques i Documentació</itunes:name>
    </itunes:owner>
    <itunes:explicit>no</itunes:explicit>
    <itunes:keywords />
    <item>
      <title>Depth estimation of frames in image sequences using motion occlusions</title>
      <link>http://hdl.handle.net/2117/19261</link>
      <description>Title: Depth estimation of frames in image sequences using motion occlusions
Authors: Palou Visa, Guillem; Salembier Clairon, Philippe Jean
Abstract: This paper proposes a system to depth order regions of a&#xD;
frame belonging to a monocular image sequence. For a given frame, re-&#xD;
gions are ordered according to their relative depth using the previous&#xD;
and following frames. The algorithm estimates occluded and disoccluded&#xD;
pixels belonging to the central frame. Afterwards, a Binary Partition&#xD;
Tree (BPT) is constructed to obtain a hierarchical, region based repre-&#xD;
sentation of the image. The  nal depth partition is obtained by means&#xD;
of energy minimization on the BPT. To achieve a global depth ordering&#xD;
from local occlusion cues, a depth order graph is constructed and used to&#xD;
eliminate contradictory local cues. Results of the system are evaluated&#xD;
and compared with state of the art  gure/ground labeling systems on&#xD;
several datasets, showing promising results.</description>
      <pubDate>Wed, 15 May 2013 15:11:26 GMT</pubDate>
      <guid isPermaLink="false">http://hdl.handle.net/2117/19261</guid>
      <dc:date>2013-05-15T15:11:26Z</dc:date>
      <itunes:author>Palou Visa, Guillem; Salembier Clairon, Philippe Jean</itunes:author>
      <itunes:explicit>no</itunes:explicit>
      <itunes:keywords>Binary partition Tree, Data sets, Depth Estimation, Energy minimization, Image sequence, Monocular image sequence, Order graph, Region-based, State of the art</itunes:keywords>
      <itunes:summary>This paper proposes a system to depth order regions of a&#xD;
frame belonging to a monocular image sequence. For a given frame, re-&#xD;
gions are ordered according to their relative depth using the previous&#xD;
and following frames. The algorithm estimates occluded and disoccluded&#xD;
pixels belonging to the central frame. Afterwards, a Binary Partition&#xD;
Tree (BPT) is constructed to obtain a hierarchical, region based repre-&#xD;
sentation of the image. The  nal depth partition is obtained by means&#xD;
of energy minimization on the BPT. To achieve a global depth ordering&#xD;
from local occlusion cues, a depth order graph is constructed and used to&#xD;
eliminate contradictory local cues. Results of the system are evaluated&#xD;
and compared with state of the art  gure/ground labeling systems on&#xD;
several datasets, showing promising results.</itunes:summary>
    </item>
    <item>
      <title>Can our TV robustly understand human gestures? Real-Time Gesture Localization in Range Data</title>
      <link>http://hdl.handle.net/2117/18542</link>
      <description>Title: Can our TV robustly understand human gestures? Real-Time Gesture Localization in Range Data
Authors: López Méndez, Adolfo; Casas Pla, Josep Ramon
Abstract: The 'old' remote falls short of requirements when confronted with digital convergence for living room displays. Enriched options to watch, manage and interact with content on large displays demand improved means of interaction. Concurrently, gesture recognition is increasingly present in human-computer interaction for gaming applications. In this paper we propose a gesture localization framework for interactive display of audio-visual content. The proposed framework works with range data captured from a single consumer depth camera. We focus on still gestures because they are generally user friendly (users do not have to make complex and tiring movements) and allow formulating the problem in terms of object localization. Our method is based on random forests, which have shown an excellent performance on classification and regression tasks. In this work, however, we aim at a specific class of localization problems involving highly unbalanced data: positive examples appear during a small fraction of space and time. We study the impact of this natural unbalance on the random forest learning and we propose a framework to robustly detect gestures on range images in real applications. Our experiments with offline data show the effectiveness of our approach. We also present a real-time application where users can control the TV display with a reduced set of still gestures.</description>
      <pubDate>Tue, 02 Apr 2013 12:43:38 GMT</pubDate>
      <guid isPermaLink="false">http://hdl.handle.net/2117/18542</guid>
      <dc:date>2013-04-02T12:43:38Z</dc:date>
      <itunes:author>López Méndez, Adolfo; Casas Pla, Josep Ramon</itunes:author>
      <itunes:explicit>no</itunes:explicit>
      <itunes:keywords />
      <itunes:summary>The 'old' remote falls short of requirements when confronted with digital convergence for living room displays. Enriched options to watch, manage and interact with content on large displays demand improved means of interaction. Concurrently, gesture recognition is increasingly present in human-computer interaction for gaming applications. In this paper we propose a gesture localization framework for interactive display of audio-visual content. The proposed framework works with range data captured from a single consumer depth camera. We focus on still gestures because they are generally user friendly (users do not have to make complex and tiring movements) and allow formulating the problem in terms of object localization. Our method is based on random forests, which have shown an excellent performance on classification and regression tasks. In this work, however, we aim at a specific class of localization problems involving highly unbalanced data: positive examples appear during a small fraction of space and time. We study the impact of this natural unbalance on the random forest learning and we propose a framework to robustly detect gestures on range images in real applications. Our experiments with offline data show the effectiveness of our approach. We also present a real-time application where users can control the TV display with a reduced set of still gestures.</itunes:summary>
    </item>
    <item>
      <title>Metric learning from poses for temporal clustering of human motion</title>
      <link>http://hdl.handle.net/2117/18496</link>
      <description>Title: Metric learning from poses for temporal clustering of human motion
Authors: López Méndez, Adolfo; Gall, Juergen; Casas Pla, Josep Ramon; van Gool, Luc
Abstract: Temporal clustering of human motion into semantically meaningful behaviors is a challenging task. While unsupervised methods do well to some extent, the obtained clusters often lack a semantic interpretation. In this paper, we propose to learn what makes a sequence of human poses different from others such that it should be annotated as an action. To this end, we formulate the problem as weakly supervised temporal clustering for an unknown number of clusters. Weak supervision is attained by learning a metric from the implicit semantic distances derived from already annotated databases. Such a metric contains some low-level semantic information that can be used to effectively segment a human motion sequence into distinct actions or behaviors. The main advantage of our approach is that metrics can be successfully used across datasets, making our method a compelling alternative to unsupervised methods. Experiments on publicly available mocap datasets show the effectiveness of our approach.</description>
      <pubDate>Fri, 22 Mar 2013 13:04:35 GMT</pubDate>
      <guid isPermaLink="false">http://hdl.handle.net/2117/18496</guid>
      <dc:date>2013-03-22T13:04:35Z</dc:date>
      <itunes:author>López Méndez, Adolfo; Gall, Juergen; Casas Pla, Josep Ramon; van Gool, Luc</itunes:author>
      <itunes:explicit>no</itunes:explicit>
      <itunes:keywords />
      <itunes:summary>Temporal clustering of human motion into semantically meaningful behaviors is a challenging task. While unsupervised methods do well to some extent, the obtained clusters often lack a semantic interpretation. In this paper, we propose to learn what makes a sequence of human poses different from others such that it should be annotated as an action. To this end, we formulate the problem as weakly supervised temporal clustering for an unknown number of clusters. Weak supervision is attained by learning a metric from the implicit semantic distances derived from already annotated databases. Such a metric contains some low-level semantic information that can be used to effectively segment a human motion sequence into distinct actions or behaviors. The main advantage of our approach is that metrics can be successfully used across datasets, making our method a compelling alternative to unsupervised methods. Experiments on publicly available mocap datasets show the effectiveness of our approach.</itunes:summary>
    </item>
    <item>
      <title>Depth ordering on image sequences using motion occlusions</title>
      <link>http://hdl.handle.net/2117/18356</link>
      <description>Title: Depth ordering on image sequences using motion occlusions
Authors: Palou Visa, Guillem; Salembier Clairon, Philippe Jean
Abstract: This paper proposes a system to obtain the depth order of frames in image sequences using motion occlusion cues. The system first computes the forward and backward flows with the previous and next frames and estimates the occluded points. To obtain a region representation of the image, a Binary Partition Tree (BPT) is created for each frame. To estimate occlusion relations in the image, projective flow models are fitted to all regions in the image. The depth order solution is obtained by minimizing over the tree structure a cost function based on occlusion relations and the number of regions. Results show that optical flow algorithms can be used directly to estimate occlusion points. Promising results are obtained combining motion occlusions and region information by means of a BPT. Evaluation is performed comparing current state-of-the-art algorithms on figure/ground assignments, showing that the performance of the proposed system is comparable to current algorithms.</description>
      <pubDate>Fri, 15 Mar 2013 16:02:14 GMT</pubDate>
      <guid isPermaLink="false">http://hdl.handle.net/2117/18356</guid>
      <dc:date>2013-03-15T16:02:14Z</dc:date>
      <itunes:author>Palou Visa, Guillem; Salembier Clairon, Philippe Jean</itunes:author>
      <itunes:explicit>no</itunes:explicit>
      <itunes:keywords />
      <itunes:summary>This paper proposes a system to obtain the depth order of frames in image sequences using motion occlusion cues. The system first computes the forward and backward flows with the previous and next frames and estimates the occluded points. To obtain a region representation of the image, a Binary Partition Tree (BPT) is created for each frame. To estimate occlusion relations in the image, projective flow models are fitted to all regions in the image. The depth order solution is obtained by minimizing over the tree structure a cost function based on occlusion relations and the number of regions. Results show that optical flow algorithms can be used directly to estimate occlusion points. Promising results are obtained combining motion occlusions and region information by means of a BPT. Evaluation is performed comparing current state-of-the-art algorithms on figure/ground assignments, showing that the performance of the proposed system is comparable to current algorithms.</itunes:summary>
    </item>
    <item>
      <title>Depth map coding based on a optimal hierarchical region representation</title>
      <link>http://hdl.handle.net/2117/18320</link>
      <description>Title: Depth map coding based on a optimal hierarchical region representation
Authors: Maceira Duch, Marc; Ruiz Hidalgo, Javier; Morros Rubió, Josep Ramon
Abstract: Multiview color information used jointly with depth maps is a&#xD;
widespread technique for 3D video. Using this depth information,&#xD;
3D functionalities such as free view point video can be provided&#xD;
by means of depth-image-based rendering techniques. In this pa-&#xD;
per, a new technique to encode depth maps is proposed. Based on&#xD;
the usually smooth structure and the sharp edges of depth map, our&#xD;
proposal segments the depth map into homogeneous regions of ar-&#xD;
bitrary shape and encodes the contents of these regions using dif-&#xD;
ferent texture coding strategies. An optimal lagrangian approach&#xD;
is applied to the hierarchical region representation provided by our&#xD;
segmentation technique. This approach automatically selects the&#xD;
best encoding strategy for each region and the optimal partition to&#xD;
encode the depth map. To avoid the high coding costs of coding&#xD;
the resulting partition, a prediction is made using the associated&#xD;
decoded color image</description>
      <pubDate>Thu, 14 Mar 2013 17:59:25 GMT</pubDate>
      <guid isPermaLink="false">http://hdl.handle.net/2117/18320</guid>
      <dc:date>2013-03-14T17:59:25Z</dc:date>
      <itunes:author>Maceira Duch, Marc; Ruiz Hidalgo, Javier; Morros Rubió, Josep Ramon</itunes:author>
      <itunes:explicit>no</itunes:explicit>
      <itunes:keywords />
      <itunes:summary>Multiview color information used jointly with depth maps is a&#xD;
widespread technique for 3D video. Using this depth information,&#xD;
3D functionalities such as free view point video can be provided&#xD;
by means of depth-image-based rendering techniques. In this pa-&#xD;
per, a new technique to encode depth maps is proposed. Based on&#xD;
the usually smooth structure and the sharp edges of depth map, our&#xD;
proposal segments the depth map into homogeneous regions of ar-&#xD;
bitrary shape and encodes the contents of these regions using dif-&#xD;
ferent texture coding strategies. An optimal lagrangian approach&#xD;
is applied to the hierarchical region representation provided by our&#xD;
segmentation technique. This approach automatically selects the&#xD;
best encoding strategy for each region and the optimal partition to&#xD;
encode the depth map. To avoid the high coding costs of coding&#xD;
the resulting partition, a prediction is made using the associated&#xD;
decoded color image</itunes:summary>
    </item>
    <item>
      <title>From local occlusion cues to global monocular depth estimation</title>
      <link>http://hdl.handle.net/2117/18317</link>
      <description>Title: From local occlusion cues to global monocular depth estimation
Authors: Palou Visa, Guillem; Salembier Clairon, Philippe Jean
Abstract: In this paper, we propose a system to obtain a depth ordered seg-&#xD;
mentation of a single image based on low level cues. The algorithm&#xD;
first constructs a hierarchical, region-based image representation of&#xD;
the image using a Binary Partition Tree (BPT). During the building&#xD;
process, T-junction depth cues are detected, along with high convex&#xD;
boundaries. When the BPT is built, a suitable segmentation is found&#xD;
and a global depth ordering is found using a probabilistic framework.&#xD;
Results are compared with state of the art depth ordering and&#xD;
figure/ground labeling systems. The advantage of the proposed ap-&#xD;
proach compared to systems based on a training procedure is the&#xD;
lack of assumptions about the scene content. Moreover, it is shown&#xD;
that the system outperforms previously low-level cue based systems,&#xD;
while offering similar results to a priori trained figure/ground label-&#xD;
ing algorithms</description>
      <pubDate>Thu, 14 Mar 2013 17:28:13 GMT</pubDate>
      <guid isPermaLink="false">http://hdl.handle.net/2117/18317</guid>
      <dc:date>2013-03-14T17:28:13Z</dc:date>
      <itunes:author>Palou Visa, Guillem; Salembier Clairon, Philippe Jean</itunes:author>
      <itunes:explicit>no</itunes:explicit>
      <itunes:keywords />
      <itunes:summary>In this paper, we propose a system to obtain a depth ordered seg-&#xD;
mentation of a single image based on low level cues. The algorithm&#xD;
first constructs a hierarchical, region-based image representation of&#xD;
the image using a Binary Partition Tree (BPT). During the building&#xD;
process, T-junction depth cues are detected, along with high convex&#xD;
boundaries. When the BPT is built, a suitable segmentation is found&#xD;
and a global depth ordering is found using a probabilistic framework.&#xD;
Results are compared with state of the art depth ordering and&#xD;
figure/ground labeling systems. The advantage of the proposed ap-&#xD;
proach compared to systems based on a training procedure is the&#xD;
lack of assumptions about the scene content. Moreover, it is shown&#xD;
that the system outperforms previously low-level cue based systems,&#xD;
while offering similar results to a priori trained figure/ground label-&#xD;
ing algorithms</itunes:summary>
    </item>
    <item>
      <title>Microarray classification with hierarchical data representation and novel feature selection criteria</title>
      <link>http://hdl.handle.net/2117/18291</link>
      <description>Title: Microarray classification with hierarchical data representation and novel feature selection criteria
Authors: Bosio, Mattia; Bellot Pujalte, Pau; Salembier Clairon, Philippe Jean; Oliveras Vergés, Albert
Abstract: Microarray data classification is a challenging prob-&#xD;
lem due to the high number of variables compared to the&#xD;
small number of available samples. An effective methodology&#xD;
to output a precise and reliable classifier is proposed in this&#xD;
work as an improvement of the algorithm in [1]. It considers the&#xD;
sample scarcity problem and the lack of data structure typical of&#xD;
microarrays. Both problem are assessed by a two-step approach&#xD;
applying hierarchical clustering to create new features called&#xD;
metagenes and introducing a novel feature ranking criterion,&#xD;
inside the wrapper feature selection task. The classification ability&#xD;
has been evaluated on 4 publicly available datasets from&#xD;
Micro&#xD;
Array Quality Control study phase II&#xD;
(MAQC) classified by 7&#xD;
different endpoints. The global results have showed how the&#xD;
proposed approach obtains better prediction accuracy than a&#xD;
wide variety of state of the art alternatives</description>
      <pubDate>Wed, 13 Mar 2013 19:48:02 GMT</pubDate>
      <guid isPermaLink="false">http://hdl.handle.net/2117/18291</guid>
      <dc:date>2013-03-13T19:48:02Z</dc:date>
      <itunes:author>Bosio, Mattia; Bellot Pujalte, Pau; Salembier Clairon, Philippe Jean; Oliveras Vergés, Albert</itunes:author>
      <itunes:explicit>no</itunes:explicit>
      <itunes:keywords />
      <itunes:summary>Microarray data classification is a challenging prob-&#xD;
lem due to the high number of variables compared to the&#xD;
small number of available samples. An effective methodology&#xD;
to output a precise and reliable classifier is proposed in this&#xD;
work as an improvement of the algorithm in [1]. It considers the&#xD;
sample scarcity problem and the lack of data structure typical of&#xD;
microarrays. Both problem are assessed by a two-step approach&#xD;
applying hierarchical clustering to create new features called&#xD;
metagenes and introducing a novel feature ranking criterion,&#xD;
inside the wrapper feature selection task. The classification ability&#xD;
has been evaluated on 4 publicly available datasets from&#xD;
Micro&#xD;
Array Quality Control study phase II&#xD;
(MAQC) classified by 7&#xD;
different endpoints. The global results have showed how the&#xD;
proposed approach obtains better prediction accuracy than a&#xD;
wide variety of state of the art alternatives</itunes:summary>
    </item>
    <item>
      <title>INTAIRACT: Joint hand gesture and fingertip classification for touchless interaction</title>
      <link>http://hdl.handle.net/2117/18278</link>
      <description>Title: INTAIRACT: Joint hand gesture and fingertip classification for touchless interaction
Authors: Suau Cuadros, Xavier; Alcoverro Vidal, Marcel; López Méndez, Adolfo; Ruiz Hidalgo, Javier; Casas Pla, Josep Ramon
Abstract: In this demo we present intAIRact, an online hand-based&#xD;
touchless interaction system. Interactions are based on easy-to-learn hand&#xD;
gestures, that combined with translations and rotations render a user&#xD;
friendly and highly configurable system. The main advantage with respect&#xD;
to existing approaches is that we are able to robustly locate and&#xD;
identify fingertips. Hence, we are able to employ a simple but powerful alphabet&#xD;
of gestures not only by determining the number of visible fingers&#xD;
in a gesture, but also which fingers are being observed. To achieve such a&#xD;
system we propose a novel method that jointly infers hand gestures and&#xD;
fingertip locations using a single depth image from a consumer depth&#xD;
camera. Our approach is based on a novel descriptor for depth data, the&#xD;
Oriented Radial Distribution (ORD) [1]. On the one hand, we exploit the&#xD;
ORD for robust classification of hand gestures by means of efficient k-NN&#xD;
retrieval. On the other hand, maxima of the ORD are used to perform&#xD;
structured inference of fingertip locations. The proposed method outperforms&#xD;
other state-of-the-art approaches both in gesture recognition and&#xD;
fingertip localization. An implementation of the ORD extraction on a&#xD;
GPU yields a real-time demo running at approximately 17fps on a single&#xD;
laptop</description>
      <pubDate>Wed, 13 Mar 2013 16:13:16 GMT</pubDate>
      <guid isPermaLink="false">http://hdl.handle.net/2117/18278</guid>
      <dc:date>2013-03-13T16:13:16Z</dc:date>
      <itunes:author>Suau Cuadros, Xavier; Alcoverro Vidal, Marcel; López Méndez, Adolfo; Ruiz Hidalgo, Javier; Casas Pla, Josep Ramon</itunes:author>
      <itunes:explicit>no</itunes:explicit>
      <itunes:keywords />
      <itunes:summary>In this demo we present intAIRact, an online hand-based&#xD;
touchless interaction system. Interactions are based on easy-to-learn hand&#xD;
gestures, that combined with translations and rotations render a user&#xD;
friendly and highly configurable system. The main advantage with respect&#xD;
to existing approaches is that we are able to robustly locate and&#xD;
identify fingertips. Hence, we are able to employ a simple but powerful alphabet&#xD;
of gestures not only by determining the number of visible fingers&#xD;
in a gesture, but also which fingers are being observed. To achieve such a&#xD;
system we propose a novel method that jointly infers hand gestures and&#xD;
fingertip locations using a single depth image from a consumer depth&#xD;
camera. Our approach is based on a novel descriptor for depth data, the&#xD;
Oriented Radial Distribution (ORD) [1]. On the one hand, we exploit the&#xD;
ORD for robust classification of hand gestures by means of efficient k-NN&#xD;
retrieval. On the other hand, maxima of the ORD are used to perform&#xD;
structured inference of fingertip locations. The proposed method outperforms&#xD;
other state-of-the-art approaches both in gesture recognition and&#xD;
fingertip localization. An implementation of the ORD extraction on a&#xD;
GPU yields a real-time demo running at approximately 17fps on a single&#xD;
laptop</itunes:summary>
    </item>
    <item>
      <title>Rich internet application for semi-automatic annotation of semantic shots on keyframes</title>
      <link>http://hdl.handle.net/2117/18063</link>
      <description>Title: Rich internet application for semi-automatic annotation of semantic shots on keyframes
Authors: Carcel, Elisabet; Martos Asensio, Manel; Giró Nieto, Xavier; Marqués Acosta, Fernando
Abstract: This paper describes a system developed for the semi-automatic annotation of keyframes in a broadcasting company. The tool aims at assisting archivists who traditionally label every keyframe manually by suggesting them an automatic annotation that they can intuitively edit and validate. The system is valid for any domain as it uses generic MPEG-7 visual descriptors and binary SVM classifiers. The classification engine has been tested on the multiclass problem of semantic shot detection, a type of metadata used in the company to index new content ingested in the system. The detection performance has been tested in two different domains: soccer and parliament. The core engine is accessed by a Rich Internet Application via a web service. The graphical user interface allows the edition of the suggested labels with an intuitive drag and drop mechanism between rows of thumbnails, each row representing a different semantic shot class. The system has been described as complete and easy to use by the professional archivists at the company</description>
      <pubDate>Tue, 05 Mar 2013 11:49:33 GMT</pubDate>
      <guid isPermaLink="false">http://hdl.handle.net/2117/18063</guid>
      <dc:date>2013-03-05T11:49:33Z</dc:date>
      <itunes:author>Carcel, Elisabet; Martos Asensio, Manel; Giró Nieto, Xavier; Marqués Acosta, Fernando</itunes:author>
      <itunes:explicit>no</itunes:explicit>
      <itunes:keywords>annotation, classification, MPEG-7 visual descriptors, RIA, semantic shot</itunes:keywords>
      <itunes:summary>This paper describes a system developed for the semi-automatic annotation of keyframes in a broadcasting company. The tool aims at assisting archivists who traditionally label every keyframe manually by suggesting them an automatic annotation that they can intuitively edit and validate. The system is valid for any domain as it uses generic MPEG-7 visual descriptors and binary SVM classifiers. The classification engine has been tested on the multiclass problem of semantic shot detection, a type of metadata used in the company to index new content ingested in the system. The detection performance has been tested in two different domains: soccer and parliament. The core engine is accessed by a Rich Internet Application via a web service. The graphical user interface allows the edition of the suggested labels with an intuitive drag and drop mechanism between rows of thumbnails, each row representing a different semantic shot class. The system has been described as complete and easy to use by the professional archivists at the company</itunes:summary>
    </item>
    <item>
      <title>Temporal PolSAR image series exploitation with binary partition trees</title>
      <link>http://hdl.handle.net/2117/17826</link>
      <description>Title: Temporal PolSAR image series exploitation with binary partition trees
Authors: Alonso González, Alberto; López Martínez, Carlos; Salembier Clairon, Philippe Jean
Abstract: In this paper, the processing of temporal PolSAR image&#xD;
series is addressed through a region-based and multi-scale&#xD;
data representation, the Binary Partition Tree (BPT). This&#xD;
structure contains useful information related to the data&#xD;
structure at different detail levels that may be employed for&#xD;
different applications. The construction of this structure ans&#xD;
its exploitation is addressed in this work in the context of the&#xD;
speckle filtering and data segmentation applications. A new&#xD;
region model and processing strategy are defined to tackle&#xD;
with the temporal dimension of the data. Finally, to illustrate&#xD;
the capabilities of the proposed technique, results are shown&#xD;
with a real RADARSAT-2 dataset.</description>
      <pubDate>Mon, 18 Feb 2013 12:44:39 GMT</pubDate>
      <guid isPermaLink="false">http://hdl.handle.net/2117/17826</guid>
      <dc:date>2013-02-18T12:44:39Z</dc:date>
      <itunes:author>Alonso González, Alberto; López Martínez, Carlos; Salembier Clairon, Philippe Jean</itunes:author>
      <itunes:explicit>no</itunes:explicit>
      <itunes:keywords />
      <itunes:summary>In this paper, the processing of temporal PolSAR image&#xD;
series is addressed through a region-based and multi-scale&#xD;
data representation, the Binary Partition Tree (BPT). This&#xD;
structure contains useful information related to the data&#xD;
structure at different detail levels that may be employed for&#xD;
different applications. The construction of this structure ans&#xD;
its exploitation is addressed in this work in the context of the&#xD;
speckle filtering and data segmentation applications. A new&#xD;
region model and processing strategy are defined to tackle&#xD;
with the temporal dimension of the data. Finally, to illustrate&#xD;
the capabilities of the proposed technique, results are shown&#xD;
with a real RADARSAT-2 dataset.</itunes:summary>
    </item>
    <item>
      <title>Variable local weight filtering for PolSAR data speckle noise reduction</title>
      <link>http://hdl.handle.net/2117/17818</link>
      <description>Title: Variable local weight filtering for PolSAR data speckle noise reduction
Authors: Alonso González, Alberto; López Martínez, Carlos; Salembier Clairon, Philippe Jean
Abstract: This paper presents a Polarimetric SAR data speckle filtering&#xD;
technique, based on a combined filtering in the spatial and polarimetric&#xD;
domains. It is based on a bilateral filtering employing&#xD;
distance measures over these domains. These measures&#xD;
concentrate all the information related to the domain structure&#xD;
that is needed for an adaptation to the scene morphology.&#xD;
A weighted average is performed over a given window favoring&#xD;
closer and similar pixels. As a consequence, an adaptive&#xD;
filtering is achieved, attaining higher filtering over homogeneous&#xD;
areas whereas point scatters remain almost unchanged.&#xD;
Results will be shown over a real RADARSAT-2 data.</description>
      <pubDate>Mon, 18 Feb 2013 10:53:28 GMT</pubDate>
      <guid isPermaLink="false">http://hdl.handle.net/2117/17818</guid>
      <dc:date>2013-02-18T10:53:28Z</dc:date>
      <itunes:author>Alonso González, Alberto; López Martínez, Carlos; Salembier Clairon, Philippe Jean</itunes:author>
      <itunes:explicit>no</itunes:explicit>
      <itunes:keywords />
      <itunes:summary>This paper presents a Polarimetric SAR data speckle filtering&#xD;
technique, based on a combined filtering in the spatial and polarimetric&#xD;
domains. It is based on a bilateral filtering employing&#xD;
distance measures over these domains. These measures&#xD;
concentrate all the information related to the domain structure&#xD;
that is needed for an adaptation to the scene morphology.&#xD;
A weighted average is performed over a given window favoring&#xD;
closer and similar pixels. As a consequence, an adaptive&#xD;
filtering is achieved, attaining higher filtering over homogeneous&#xD;
areas whereas point scatters remain almost unchanged.&#xD;
Results will be shown over a real RADARSAT-2 data.</itunes:summary>
    </item>
    <item>
      <title>Multi-view body tracking with a detector-driven hierarchical particle filter</title>
      <link>http://hdl.handle.net/2117/17456</link>
      <description>Title: Multi-view body tracking with a detector-driven hierarchical particle filter
Authors: Navarro, S.; López Méndez, Adolfo; Alcoverro Vidal, Marcel; Casas Pla, Josep Ramon
Abstract: In this paper we present a novel approach to markerless human&#xD;
motion capture that robustly integrates body part detections in&#xD;
multiple views. The proposed method fuses cues from multiple views&#xD;
to enhance the propagation and observation model of particle filtering&#xD;
methods aiming at human motion capture. We particularize our method&#xD;
to improve arm tracking in the publicly available IXMAS dataset. Our&#xD;
experiments show that the proposed method outperforms other state-ofthe-&#xD;
art approaches.</description>
      <pubDate>Mon, 21 Jan 2013 17:47:42 GMT</pubDate>
      <guid isPermaLink="false">http://hdl.handle.net/2117/17456</guid>
      <dc:date>2013-01-21T17:47:42Z</dc:date>
      <itunes:author>Navarro, S.; López Méndez, Adolfo; Alcoverro Vidal, Marcel; Casas Pla, Josep Ramon</itunes:author>
      <itunes:explicit>no</itunes:explicit>
      <itunes:keywords>3D reconstruction, Body part detection, Human motion capture, Inverse kinematics, Multi-view</itunes:keywords>
      <itunes:summary>In this paper we present a novel approach to markerless human&#xD;
motion capture that robustly integrates body part detections in&#xD;
multiple views. The proposed method fuses cues from multiple views&#xD;
to enhance the propagation and observation model of particle filtering&#xD;
methods aiming at human motion capture. We particularize our method&#xD;
to improve arm tracking in the publicly available IXMAS dataset. Our&#xD;
experiments show that the proposed method outperforms other state-ofthe-&#xD;
art approaches.</itunes:summary>
    </item>
    <item>
      <title>Promeds: an adaptive robust fundamental matrix estimation approach</title>
      <link>http://hdl.handle.net/2117/17275</link>
      <description>Title: Promeds: an adaptive robust fundamental matrix estimation approach
Authors: Irurueta Carro, Alberto; Morros Rubió, Josep Ramon
Abstract: Accurate fundamental matrix estimation from computed correspondences is hard to achieve depending on the constraints on computational time and available data (i.e. correspondences and quality scores). Several algorithms exist for this task, like the 8-points, the 7-points algorithm [1] or robust methods such as RANSAC [2], MSAC [3] or LMedS [4]. Robust methods are capable of discriminating correspondence outliers, thus, obtaining better results. Additionally, some variations of the previous methods have been proposed. For instance PROSAC [5] is an improvement of RANSAC which takes into account additional information of the quality of the matches to largely reduce the computational cost of the fundamental matrix estimation process. This work proposes a new robust method for fundamental matrix estimation that combines the benefits of PROSAC and LMedS algorithms, namely improved quality, reduced computational time and less parameters to adjust</description>
      <pubDate>Fri, 11 Jan 2013 11:49:11 GMT</pubDate>
      <guid isPermaLink="false">http://hdl.handle.net/2117/17275</guid>
      <dc:date>2013-01-11T11:49:11Z</dc:date>
      <itunes:author>Irurueta Carro, Alberto; Morros Rubió, Josep Ramon</itunes:author>
      <itunes:explicit>no</itunes:explicit>
      <itunes:keywords />
      <itunes:summary>Accurate fundamental matrix estimation from computed correspondences is hard to achieve depending on the constraints on computational time and available data (i.e. correspondences and quality scores). Several algorithms exist for this task, like the 8-points, the 7-points algorithm [1] or robust methods such as RANSAC [2], MSAC [3] or LMedS [4]. Robust methods are capable of discriminating correspondence outliers, thus, obtaining better results. Additionally, some variations of the previous methods have been proposed. For instance PROSAC [5] is an improvement of RANSAC which takes into account additional information of the quality of the matches to largely reduce the computational cost of the fundamental matrix estimation process. This work proposes a new robust method for fundamental matrix estimation that combines the benefits of PROSAC and LMedS algorithms, namely improved quality, reduced computational time and less parameters to adjust</itunes:summary>
    </item>
    <item>
      <title>Oriented radial distribution on depth data: application to the detection of end-effectors</title>
      <link>http://hdl.handle.net/2117/16580</link>
      <description>Title: Oriented radial distribution on depth data: application to the detection of end-effectors
Authors: Suau Cuadros, Xavier; Ruiz Hidalgo, Javier; Casas Pla, Josep Ramon
Description: End-effectors are considered to be the main topological extremities&#xD;
of a given 3D body. Even if the nature of such body is not restricted,&#xD;
this paper focuses on the human body case. Detection of human&#xD;
extremities is a key issue in the human motion capture domain, being&#xD;
needed to initialize and update the tracker. Therefore, the effectiveness&#xD;
of human motion capture systems usually depends on the&#xD;
reliability of the obtained end-effectors. The increasing accuracy,&#xD;
low cost and easy installation of depth cameras has opened the door&#xD;
to new strategies to overcome the body pose estimation problem.&#xD;
With the objective of detecting the head, hands and feet of a human&#xD;
body, we propose a new local feature computed from depth data,&#xD;
which gives an idea of its curvature and prominence. Such feature is&#xD;
weighted depending on recent detections, providing also a temporal&#xD;
dimension. Based on this feature, some end-effector candidate blobs&#xD;
are obtained and classified into head, hands and feet according to&#xD;
three probabilistic descriptors.</description>
      <pubDate>Wed, 26 Sep 2012 10:23:04 GMT</pubDate>
      <guid isPermaLink="false">http://hdl.handle.net/2117/16580</guid>
      <dc:date>2012-09-26T10:23:04Z</dc:date>
      <itunes:author>Suau Cuadros, Xavier; Ruiz Hidalgo, Javier; Casas Pla, Josep Ramon</itunes:author>
      <itunes:explicit>no</itunes:explicit>
      <itunes:keywords />
    </item>
    <item>
      <title>Upper-bound assessment of the spatial accuracy of hierarchical region-based image representations</title>
      <link>http://hdl.handle.net/2117/16487</link>
      <description>Title: Upper-bound assessment of the spatial accuracy of hierarchical region-based image representations
Authors: Pont Tuset, Jordi; Marqués Acosta, Fernando
Abstract: Hierarchical region-based image representations are versatile tools for segmentation, filtering, object detection, etc. The evaluation of their spatial accuracy has been usually performed assessing the final result of an algorithm based on this representation. Given its wide applicability, however, a direct supervised assessment, independent of any application, would be desirable and fair. A brute-force assessment of all the partitions represented in the hierarchical structure would be a correct approach, but as we prove formally, it is computationally unfeasible. This paper presents an efficient algorithm to find the upper-bound performance of the representation and we show that the previous approximations in the literature can fail at finding this bound.</description>
      <pubDate>Fri, 14 Sep 2012 09:22:51 GMT</pubDate>
      <guid isPermaLink="false">http://hdl.handle.net/2117/16487</guid>
      <dc:date>2012-09-14T09:22:51Z</dc:date>
      <itunes:author>Pont Tuset, Jordi; Marqués Acosta, Fernando</itunes:author>
      <itunes:explicit>no</itunes:explicit>
      <itunes:keywords />
      <itunes:summary>Hierarchical region-based image representations are versatile tools for segmentation, filtering, object detection, etc. The evaluation of their spatial accuracy has been usually performed assessing the final result of an algorithm based on this representation. Given its wide applicability, however, a direct supervised assessment, independent of any application, would be desirable and fair. A brute-force assessment of all the partitions represented in the hierarchical structure would be a correct approach, but as we prove formally, it is computationally unfeasible. This paper presents an efficient algorithm to find the upper-bound performance of the representation and we show that the previous approximations in the literature can fail at finding this bound.</itunes:summary>
    </item>
  </channel>
</rss>

