3D semantic representation of actions from efficient stereo-image-sequence segmentation on GPUs

Alexey, Abramov; Aksoy, Eren Erdal; Dörr, Johannes; Wörgötter, Florentin; Pauwels, Karl; Dellen, Babette

dc.contributor.author	Alexey, Abramov
dc.contributor.author	Aksoy, Eren Erdal
dc.contributor.author	Dörr, Johannes
dc.contributor.author	Wörgötter, Florentin
dc.contributor.author	Pauwels, Karl
dc.contributor.author	Dellen, Babette
dc.contributor.other	Institut de Robòtica i Informàtica Industrial
dc.date.accessioned	2010-08-23T09:11:59Z
dc.date.available	2010-08-23T09:11:59Z
dc.date.created	2010
dc.date.issued	2010
dc.identifier.citation	Alexey, A. [et al.]. 3D semantic representation of actions from efficient stereo-image-sequence segmentation on GPUs. In: International Symposium 3D Data Processing, Visualization and Transmission. "International Symposium 3D Data Processing, Visualization and Transmission (3DPVT) Edition 5th". Paris: 2010, p. 1-8.
dc.identifier.uri	http://hdl.handle.net/2117/8683
dc.description.abstract	A novel real-time framework for model-free stereo-video segmentation and stereo-segment tracking is presented, combining real-time optical flow and stereo with image segmentation running separately on two GPUs. The stereo-segment tracking algorithm achieves a frame rate of 23 Hz for regular videos with a frame size of 256 x 320 pixels and runs in nearly real time for stereo videos. The computed stereo segments are used to construct 3D segment graphs, from which main graphs, each representing a relevant change in the scene, are extracted. This allows a movie of, e.g., 396 original frames to be represented by only 12 graphs, each containing only a small number of nodes, providing a condensed description of the scene while preserving data-intrinsic semantics. Using this method, human activities, e.g., handling of objects, can be encoded in an efficient way. The method has potential applications for manipulation action recognition and learning, and provides a vision front end for applications in cognitive robotics.
dc.format.extent	8 p.
dc.language.iso	eng
dc.subject	UPC subject areas::Telecommunications engineering::Signal processing::Pattern recognition
dc.subject.lcsh	Pattern recognition systems
dc.title	3D semantic representation of actions from efficient stereo-image-sequence segmentation on GPUs
dc.type	Conference report
dc.subject.lemac	Pattern recognition (Computer science)
dc.subject.inspec	INSPEC classification::Pattern recognition
dc.relation.publisherversion	http://campwww.informatik.tu-muenchen.de/3DPVT2010/doku.php?id=acceptedpapers
dc.rights.access	Open Access
local.identifier.drac	2634162
dc.description.version	Postprint (published version)
local.citation.author	Alexey, A.; Aksoy, E.; Dörr, J.; Wörgötter, F.; Pauwels, K.; Dellen, B.
local.citation.contributor	International Symposium 3D Data Processing, Visualization and Transmission
local.citation.pubplace	Paris
local.citation.publicationName	International Symposium 3D Data Processing, Visualization and Transmission (3DPVT) Edition 5th
local.citation.startingPage	1
local.citation.endingPage	8

Files in this item

Name: dellen.pdf
Size: 2.238 MB
Format: PDF

This item appears in the following collections

Conference papers/presentations [576]
