Recent Submissions

  • Uncalibrated, unified and unsupervised specular-aware photometric stereo 

    Estévez Graña, Pablo; Agudo Martínez, Antonio (Springer, 2023)
    Conference report
    Open Access
    In this paper we present a variational approach to simultaneously recover the 3D reconstruction, reflectance, lighting and specularities of an object, all of them, from a set of RGB images. The approach works in an ...
  • Context attention: human motion prediction using context information and deep learning attention models 

    Laplaza Galindo, Javier; Moreno-Noguer, Francesc; Sanfeliu Cortés, Alberto (Springer, 2022)
    Conference lecture
    Open Access
    This work proposes a human motion prediction model for handover operations. The model uses a multi-headed attention architecture to process the human skeleton data together with contextual data from the operation. This ...
  • Towards transferring tactile-based continuous force control policies from simulation to robot 

    Lach, Luca Michael; Haschke, Robert; Tateo, Davide; Peters, Jan; Ritter, Helge; Borràs Sol, Júlia; Torras, Carme (2023)
    Conference report
    Open Access
    The advent of tactile sensors in robotics has sparked many ideas on how robots can leverage direct contact measurements of their environment interactions to improve manipulation tasks. An important line of research in this ...
  • Back to MLP: a simple baseline for human motion prediction 

    Guo, Wen; Du, Yuming; Shen, Xi; Lepetit, Vincent; Alameda-Pineda, Xavier; Moreno-Noguer, Francesc (Institute of Electrical and Electronics Engineers (IEEE), 2023)
    Conference report
    Open Access
    This paper tackles the problem of human motion prediction, consisting in forecasting future body poses from historically observed sequences. State-of-the-art approaches provide good results, however, they rely on deep ...
  • Visual semantic relatedness dataset for image captioning 

    Sabir, Ahmed; Moreno-Noguer, Francesc; Padró, Lluís (2023)
    Conference report
    Open Access
    Modern image captioning system relies heavily on extracting knowledge from images to capture the concept of a static story. In this paper, we propose a textual visual context dataset for captioning, in which the publicly ...
  • NeRFLight: Fast and Light Neural Radiance Fields using a shared feature grid 

    Rivas-Manzaneque, Fernando; Sierra Acosta, Jorge; Peñate Sánchez, Adrián; Moreno-Noguer, Francesc; Ribeiro Seijas, Angela (Institute of Electrical and Electronics Engineers (IEEE), 2023)
    Conference report
    Open Access
    While original Neural Radiance Fields (NeRF) have shown impressive results in modeling the appearance of a scene with compact MLP architectures, they are not able to achieve real-time rendering. This has been recently ...
  • User interactions and negative examples to improve the learning of semantic rules in a cognitive exercise scenario 

    Suárez Hernández, Alejandro; Andriella, Antonio; Torras, Carme; Alenyà Ribas, Guillem (2023)
    Conference report
    Open Access
    Enabling a robot to perform new tasks is a complex endeavor, usually beyond the reach of non-technical users. For this reason, research efforts that aim at empowering end-users to teach robots new abilities using intuitive ...
  • Photovoltaic power forecasting using sky images and sun motion 

    Berresheim, Arne; Agudo Martínez, Antonio (Institute of Electrical and Electronics Engineers (IEEE), 2023)
    Conference report
    Open Access
    Solar energy adoption is moving at a rapid pace. The variability in solar energy production causes grid stability issues and hinders mass adoption. To solve these issues, more accurate photovoltaic power forecasting systems ...
  • NYAM: the role of configurable engagement strategies in robotic-assisted feeding 

    Barrué Subirana, Cristian; Suárez Hernández, Alejandro; Inzitari, Marco; Ribera Solé, Aida; Alenyà Ribas, Guillem (2024)
    Conference report
    Restricted access - publisher's policy
    In some contexts, like geriatric hospitals, the number of patients requiring assistance with feeding is very high and robots may be an effective tool for caregivers to provide better assistance. This article introduces ...
  • Robust multimodal and multi-object tracking for autonomous driving applications 

    Pérez Quintana, Marc; Agudo Martínez, Antonio (Institute of Electrical and Electronics Engineers (IEEE), 2023)
    Conference report
    Open Access
    In this work, we present a method for Multi-Object Tracking (MOT) that uses unsynchronized multimodal detections from a configurable set of sensors such as cameras, radars and lidars. All the information is processed from ...
  • Detail-aware uncalibrated photometric stereo 

    Agudo Martínez, Antonio (2023)
    Conference report
    Open Access
    Photometric stereo is the problem of jointly inferring the 3D reconstruction, reflectance, lighting and specularities of an object from a set of visual signals. Recently, some variational, uncalibrated, unsupervised and ...
  • Grasp-oriented fine-grained cloth segmentation without real supervision 

    Ruijie, Ren; Rajesh, Mohit Gurnani; Sánchez Riera, Jordi; López Rodríguez, Andrian; Zhang, Fan; Tian, Yurun; Alenyà Ribas, Guillem; Agudo Martínez, Antonio; Demiris, Yiannis; Mikolajczyk, Krystian; Moreno-Noguer, Francesc (Association for Computing Machinery (ACM), 2023)
    Conference report
    Open Access
    Automatically detecting graspable regions from a single depth image is a key ingredient in cloth manipulation. The large variability of cloth deformations has motivated most of the current approaches to focus on identifying ...

View more