Contextless Object Recognition with Shape-enriched SIFT and Bags of Features

Tella Amo, Marcel

Visualitza/Obre

Marcel_Tella_Thesis.pdf (4,819Mb)

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

Tella Amo, Marcel

Tutor / directorZeppelzauer, Matthias

Tipus de documentProjecte/Treball Final de Carrera

Data2014-08-28

Condicions d'accésAccés obert

Attribution-NonCommercial-NoDerivs 3.0 Spain

Llevat que s'hi indiqui el contrari, els continguts d'aquesta obra estan subjectes a la llicència de Creative Commons : Reconeixement-NoComercial-SenseObraDerivada 3.0 Espanya

Abstract

Currently, there are highly competitive results in the field of object recognition based on the aggregation of point-based features [4, 26, 5, 6]. The aggregation process, typically with an average or max-pooling of the features generates a single vector that represents the image or region that contains the object [7]. The aggregated point-based features typically describe the texture around the points with descriptors such as SIFT. These descriptors present limitations for wired and textureless objects. A possible solution is the addition of shape-based information. [9, 6, 2, 12]. Shape descriptors have been previously used to encode shape information and thus, recognise those types of objects. But generally an alignment step is required in order to match every point from one shape to other ones. The computational cost of the similarity assessment is high. We purpose to enrich location and texture-based features with shape-based ones. Two main architectures are explored: On the one side, to enrich the SIFT descriptors with shape information before they are aggregated. On the other side, to create the standard Bag of Words [7] histogram and concatenate a shape histogram, classifying them as a single vector. We evaluate the proposed techniques and the novel features on the Caltech-101 dataset. Results show that shape features increase the final performance. Our extension of the Bag of Words with a shape-based histogram(BoW+S) results in better performance. However, for a high number of shape features, BoW+S and enriched SIFT architectures tend to converge.

MatèriesRobot vision, Computer vision, Visió artificial (Robòtica), Visió per ordinador

TitulacióENGINYERIA DE TELECOMUNICACIÓ (Pla 1992)

URIhttp://hdl.handle.net/2099.1/22390

Col·leccions

Escola Tècnica Superior d'Enginyeria de Telecomunicació de Barcelona - Enginyeria de Telecomunicació (Pla 1992) [1.590]

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
Marcel_Tella_Thesis.pdf		4,819Mb	PDF	Visualitza/Obre

UPCommons. Portal del coneixement obert de la UPC

Contextless Object Recognition with Shape-enriched SIFT and Bags of Features

Visualitza/Obre

Explora