Universitat Politècnica de Catalunya

UPCommons. Global access to UPC knowledge

Modeling long-term interactions to enhance action recognition

File: 2375-Modeling-long-term-interactions-to-enhance-action-recognition.pdf (6.692 MB)
DOI: 10.1109/ICPR48806.2021.9412148
 
Cite as: hdl:2117/351241

Cartas Ayala, Alejandro
Radeva, Petia
Dimiccoli, Mariella
Document type: Conference report
Defense date: 2021
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Rights access: Open Access
Except where otherwise noted, content on this work is licensed under a Creative Commons license: Attribution-NonCommercial-NoDerivs 3.0 Spain
Abstract
In this paper, we propose a new approach to understanding actions in egocentric videos that exploits the semantics of object interactions at both the frame and temporal levels. At the frame level, we use a region-based approach that takes as input a primary region, roughly corresponding to the user's hands, and a set of secondary regions, potentially corresponding to the interacting objects, and calculates the action score through a CNN formulation. This information is then fed to a Hierarchical Long Short-Term Memory Network (HLSTM) that captures temporal dependencies between actions within and across shots. Ablation studies thoroughly validate the proposed approach, showing in particular that both levels of the HLSTM architecture contribute to the performance improvement. Furthermore, quantitative comparisons show that the proposed approach outperforms the state of the art in action recognition on standard benchmarks without relying on motion information.
Description
© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Citation: Cartas, A.; Radeva, P.; Dimiccoli, M. Modeling long-term interactions to enhance action recognition. In: International Conference on Pattern Recognition. "Proceedings of ICPR 2020: 25th International Conference on Pattern Recognition: Milan, 10–15 January 2021". Institute of Electrical and Electronics Engineers (IEEE), 2021, p. 10351-10358. ISBN 978-1-7281-8808-9. DOI 10.1109/ICPR48806.2021.9412148.
URI: http://hdl.handle.net/2117/351241
DOI: 10.1109/ICPR48806.2021.9412148
ISBN: 978-1-7281-8808-9
Publisher version: https://ieeexplore.ieee.org/document/9412148/
Collections
  • ROBiri - Grup de Robòtica de l'IRI - Ponències/Comunicacions de congressos [219]