• 3D pose estimation using convolutional neural networks 

      Rubio Romano, Antonio (Universitat Politècnica de Catalunya, 2015-10)
      Projecte Final de Màster Oficial
      Accés obert
      The present Master Thesis describes a new Pose Estimation method based on Convolutional Neural Networks (CNN). This method divides the three-dimensional space in several regions and, given an input image, returns the ...
    • An automotive case study on the limits of approximation for object detection 

      Caro Roca, Martí; Tabani, Hamid; Abella Ferrer, Jaume; Moll Echeto, Francisco de Borja; Morancho Llena, Enrique; Canal Corretger, Ramon; Altet Sanahujes, Josep; Calomarde Palomino, Antonio; Cazorla Almeida, Francisco Javier; Rubio Romano, Antonio; Fontova Muste, Pau; Fornt Mas, Jordi (2023-05)
      Article
      Accés restringit per política de l'editorial
      The accuracy of camera-based object detection (CBOD) built upon deep learning is often evaluated against the real objects in frames only. However, such simplistic evaluation ignores the fact that many unimportant objects ...
    • BASS: boundary-aware superpixel segmentation 

      Rubio Romano, Antonio; Yu, Longlong; Simó Serra, Edgar; Moreno-Noguer, Francesc (IEEE Press, 2016)
      Text en actes de congrés
      Accés obert
      We propose a new superpixel algorithm based on exploiting the boundary information of an image, as objects in images can generally be described by their boundaries. Our proposed approach initially estimates the boundaries ...
    • Efficient monocular pose estimation for complex 3D models 

      Rubio Romano, Antonio; Villamizar Vergel, Michael Alejandro; Ferraz Colomina, Luis; Peñate Sánchez, Adrián; Ramisa Ayats, Arnau; Simó Serra, Edgar; Sanfeliu Cortés, Alberto; Moreno-Noguer, Francesc (Institute of Electrical and Electronics Engineers (IEEE), 2015)
      Text en actes de congrés
      Accés obert
      We propose a robust and efficient method to estimate the pose of a camera with respect to complex 3D textured models of the environment that can potentially contain more than 100, 000 points. To tackle this problem we ...
    • Estimación monocular y eficiente de la pose usando modelos 3D complejos 

      Rubio Romano, Antonio; Villamizar Vergel, Michael Alejandro; Ferraz Colomina, Luis; Peñate Sánchez, Adrián; Sanfeliu Cortés, Alberto; Moreno-Noguer, Francesc (2014)
      Text en actes de congrés
      Accés obert
      El siguiente documento presenta un método robusto y eficiente para estimar la pose de una cámara. El método propuesto asume el conocimiento previo de un modelo 3D del entorno, y compara una nueva imagen de entrada únicamente ...
    • Fashion discovery : a computer vision approach 

      Rubio Romano, Antonio (2021-07-23)
      Tesi
      Accés obert
      Performing semantic interpretation of fashion images is undeniably one of the most challenging domains for computer vision. Subtle variations in color and shape might confer different meanings or interpretations to an ...
    • Multi-modal embedding for main product detection in fashion 

      Rubio Romano, Antonio; LongLong, Yu; Simó Serra, Edgar; Moreno-Noguer, Francesc (2017)
      Text en actes de congrés
      Accés obert
      We present an approach to detect the main product in fashion images by exploiting the textual metadata associated with each image. Our approach is based on a Convolutional Neural Network and learns a joint embedding of ...
    • Multi-modal fashion product retrieval 

      Rubio Romano, Antonio; Yu, Longlong; Simó Serra, Edgar; Moreno-Noguer, Francesc (2017)
      Text en actes de congrés
      Accés obert
      Finding a product in the fashion world can be a daunting task. Everyday, e-commerce sites are updating with thousands of images and their associated metadata (textual information), deepening the problem. In this paper, we ...
    • Multi-modal joint embedding for fashion product retrieval 

      Rubio Romano, Antonio; Yu, Longlong; Simó Serra, Edgar; Moreno-Noguer, Francesc (Institute of Electrical and Electronics Engineers (IEEE), 2017)
      Text en actes de congrés
      Accés obert
      Finding a product in the fashion world can be a daunting task. Everyday, e-commerce sites are updating with thousands of images and their associated metadata (textual information), deepening the problem, akin to finding a ...