Mostra el registre d'ítem simple
3d semantic representation of actions from effcient stereo-image-sequence segmentation on GPUs
dc.contributor.author | Alexey, Abramov |
dc.contributor.author | Aksoy, Eren Erdal |
dc.contributor.author | Dörr, Johannes |
dc.contributor.author | Wörgötter, Florentin |
dc.contributor.author | Pauwels, Karl |
dc.contributor.author | Dellen, Babette |
dc.contributor.other | Institut de Robòtica i Informàtica Industrial |
dc.date.accessioned | 2010-08-23T09:11:59Z |
dc.date.available | 2010-08-23T09:11:59Z |
dc.date.created | 2010 |
dc.date.issued | 2010 |
dc.identifier.citation | Alexey, A. [et al.]. 3d semantic representation of actions from effcient stereo-image-sequence segmentation on GPUs. A: International Symposium 3D Data Processing, Visualization and Transmission. "International Symposium 3D Data Processing, Visualization and Transmission (3DPVT) Edition 5th". Paris: 2010, p. 1-8. |
dc.identifier.uri | http://hdl.handle.net/2117/8683 |
dc.description.abstract | A novel real-time framework for model-free stereo-video segmentation and stereo-segment tracking is presented, combining real-time optical flow and stereo with image segmentation running separately on two GPUs. The stereosegment tracking algorithm achieves a frame rate of 23 Hz for regular videos with a frame size of 256 x 320 pixels and nearly real time for stereo videos. The computed stereo segments are used to construct 3D segment graphs, from which main graphs, representing a relevant change in the scene, are extracted, which allow us to represent a movie of e.g. 396 original frames by only 12 graphs, each containing only a small number of nodes, providing a condensed description of the scene while preserving data-intrinsic semantics. Using this method, human activities, e.g., handling of objects, can be encoded in an efficient way. The method has potential applications for manipulation action recognition and learning, and provides a vision-front end for applications in cognitive robotics. |
dc.format.extent | 8 p. |
dc.language.iso | eng |
dc.subject | Àrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Reconeixement de formes |
dc.subject.lcsh | Pattern recognition systems |
dc.title | 3d semantic representation of actions from effcient stereo-image-sequence segmentation on GPUs |
dc.type | Conference report |
dc.subject.lemac | Reconeixement de formes (Informàtica) |
dc.subject.inspec | Classificació INSPEC::Pattern recognition |
dc.relation.publisherversion | http://campwww.informatik.tu-muenchen.de/3DPVT2010/doku.php?id=acceptedpapers |
dc.rights.access | Open Access |
local.identifier.drac | 2634162 |
dc.description.version | Postprint (published version) |
local.citation.author | Alexey, A.; Aksoy , E.; Dörr, J.; Wörgötter , F.; Pauwels, K.; Dellen, B. |
local.citation.contributor | International Symposium 3D Data Processing, Visualization and Transmission |
local.citation.pubplace | Paris |
local.citation.publicationName | International Symposium 3D Data Processing, Visualization and Transmission (3DPVT) Edition 5th |
local.citation.startingPage | 1 |
local.citation.endingPage | 8 |