Contextless Object Recognition with Shape-enriched SIFT and Bags of Features

Tella Amo, Marcel

dc.contributor	Zeppelzauer, Matthias
dc.contributor.author	Tella Amo, Marcel
dc.contributor.other	Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned	2014-09-17T13:23:47Z
dc.date.available	2014-09-17T13:23:47Z
dc.date.issued	2014-08-28
dc.identifier.uri	http://hdl.handle.net/2099.1/22390
dc.description.abstract	Currently, there are highly competitive results in object recognition based on the aggregation of point-based features [4, 26, 5, 6]. The aggregation process, typically average or max pooling of the features, generates a single vector that represents the image or region that contains the object [7]. The aggregated point-based features typically describe the texture around the points with descriptors such as SIFT. These descriptors present limitations for wired and textureless objects. A possible solution is the addition of shape-based information [9, 6, 2, 12]. Shape descriptors have previously been used to encode shape information and thus recognise those types of objects, but they generally require an alignment step to match every point of one shape to the points of another, so the computational cost of the similarity assessment is high. We propose to enrich location- and texture-based features with shape-based ones. Two main architectures are explored: on the one hand, enriching the SIFT descriptors with shape information before they are aggregated; on the other hand, creating the standard Bag of Words [7] histogram and concatenating a shape histogram, then classifying them as a single vector. We evaluate the proposed techniques and the novel features on the Caltech-101 dataset. Results show that shape features increase the final performance. Our extension of the Bag of Words with a shape-based histogram (BoW+S) results in better performance. However, for a high number of shape features, the BoW+S and enriched SIFT architectures tend to converge.
dc.language.iso	eng
dc.publisher	Universitat Politècnica de Catalunya
dc.rights	Distribution of this work is authorized under the Creative Commons licence 'Attribution-NonCommercial-NoDerivs' or similar
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subject	UPC subject areas::Telecommunications engineering
dc.subject.lcsh	Robot vision
dc.subject.lcsh	Computer vision
dc.subject.other	SIFT
dc.subject.other	interest points
dc.subject.other	object candidates
dc.subject.other	segmentation
dc.subject.other	Bag of Words
dc.subject.other	shape coding
dc.subject.other	object detection
dc.subject.other	textureless objects
dc.subject.other	wired objects
dc.subject.other	computer vision
dc.title	Contextless Object Recognition with Shape-enriched SIFT and Bags of Features
dc.title.alternative	Reconocimiento de objetos sin contexto con SIFT enriquecidos con forma y BoF
dc.title.alternative	Reconeixement d'objectes sense context amb eSIFT i BoW+S
dc.type	Master thesis (pre-Bologna period)
dc.subject.lemac	Artificial vision (Robotics)
dc.subject.lemac	Computer vision
dc.identifier.slug	ETSETB-230.103014
dc.rights.access	Open Access
dc.date.updated	2014-09-16T05:51:00Z
dc.audience.educationlevel	First/second cycle studies
dc.audience.mediator	Escola Tècnica Superior d'Enginyeria de Telecomunicació de Barcelona
dc.audience.degree	ENGINYERIA DE TELECOMUNICACIÓ (Pla 1992)
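
The two architectures named in the abstract can be illustrated with a minimal NumPy sketch. This is not the thesis implementation: the function names (bow_histogram, enrich_descriptors, bow_plus_shape), the random stand-in data, the 4-dimensional per-point shape features, the 8-bin shape histogram and the codebook sizes are all assumptions made for illustration only. The abstract does not specify which shape features are used or how the codebooks are built.

```python
import numpy as np


def bow_histogram(descriptors, codebook):
    """L1-normalised histogram of nearest-codeword assignments."""
    # Squared Euclidean distance from every descriptor to every codeword.
    d2 = ((descriptors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    words = d2.argmin(axis=1)
    hist = np.bincount(words, minlength=len(codebook)).astype(float)
    return hist / max(hist.sum(), 1.0)


def enrich_descriptors(descriptors, shape_features, weight=1.0):
    """Enriched-SIFT idea: append per-point shape features to each
    local descriptor before aggregation (weight is an assumed knob)."""
    return np.hstack([descriptors, weight * shape_features])


def bow_plus_shape(descriptors, codebook, shape_hist):
    """BoW+S idea: concatenate the BoW histogram with a region-level
    shape histogram so a classifier sees them as a single vector."""
    return np.concatenate([bow_histogram(descriptors, codebook), shape_hist])


# Random stand-ins; real inputs would come from a SIFT extractor
# and from whatever shape coding is chosen (both hypothetical here).
rng = np.random.default_rng(0)
sift = rng.random((50, 128))         # 50 local descriptors, 128-D like SIFT
shape_pts = rng.random((50, 4))      # hypothetical per-point shape features
shape_hist = np.full(8, 1.0 / 8.0)   # hypothetical region-level shape histogram

codebook = rng.random((10, 128))           # assumed 10-word codebook
codebook_enriched = rng.random((10, 132))  # codebook over enriched descriptors

late = bow_plus_shape(sift, codebook, shape_hist)
early = bow_histogram(enrich_descriptors(sift, shape_pts), codebook_enriched)
print(late.shape, early.shape)
```

In this sketch the BoW+S vector grows with the number of shape bins, while the enriched-SIFT vector keeps the codebook length because the shape information is absorbed before quantisation.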

Files in this item

Name: Marcel_Tella_Thesis.pdf
Size: 4.819 MB
Format: PDF

This item appears in the following collections

Enginyeria de Telecomunicació (Pla 1992) [1.590]

UPCommons. The UPC open knowledge portal