Mostra el registre d'ítem simple

dc.contributorZeppelzauer, Matthias
dc.contributor.authorTella Amo, Marcel
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned2014-09-17T13:23:47Z
dc.date.available2014-09-17T13:23:47Z
dc.date.issued2014-08-28
dc.identifier.urihttp://hdl.handle.net/2099.1/22390
dc.description.abstractCurrently, there are highly competitive results in the field of object recognition based on the aggregation of point-based features [4, 26, 5, 6]. The aggregation process, typically with an average or max-pooling of the features generates a single vector that represents the image or region that contains the object [7]. The aggregated point-based features typically describe the texture around the points with descriptors such as SIFT. These descriptors present limitations for wired and textureless objects. A possible solution is the addition of shape-based information. [9, 6, 2, 12]. Shape descriptors have been previously used to encode shape information and thus, recognise those types of objects. But generally an alignment step is required in order to match every point from one shape to other ones. The computational cost of the similarity assessment is high. We purpose to enrich location and texture-based features with shape-based ones. Two main architectures are explored: On the one side, to enrich the SIFT descriptors with shape information before they are aggregated. On the other side, to create the standard Bag of Words [7] histogram and concatenate a shape histogram, classifying them as a single vector. We evaluate the proposed techniques and the novel features on the Caltech-101 dataset. Results show that shape features increase the final performance. Our extension of the Bag of Words with a shape-based histogram(BoW+S) results in better performance. However, for a high number of shape features, BoW+S and enriched SIFT architectures tend to converge.
dc.language.isoeng
dc.publisherUniversitat Politècnica de Catalunya
dc.rightsS'autoritza la difusió de l'obra mitjançant la llicència Creative Commons o similar 'Reconeixement-NoComercial- SenseObraDerivada'
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subjectÀrees temàtiques de la UPC::Enginyeria de la telecomunicació
dc.subject.lcshRobot vision
dc.subject.lcshComputer vision
dc.subject.otherSIFT
dc.subject.otherinterest points
dc.subject.otherobject candidates
dc.subject.othersegmentation
dc.subject.otherBag of Words
dc.subject.othershape coding
dc.subject.otherobject detection
dc.subject.othertextureless objects
dc.subject.otherwired objects.
dc.subject.othervisión por computador
dc.titleContextless Object Recognition with Shape-enriched SIFT and Bags of Features
dc.title.alternativeReconocimineto de objetos sin contexto con SIFT enriquecidos con forma y BoF
dc.title.alternativeReconeixement d'objectes sense context amb eSIFT i BoW+S
dc.typeMaster thesis (pre-Bologna period)
dc.subject.lemacVisió artificial (Robòtica)
dc.subject.lemacVisió per ordinador
dc.identifier.slugETSETB-230.103014
dc.rights.accessOpen Access
dc.date.updated2014-09-16T05:51:00Z
dc.audience.educationlevelEstudis de primer/segon cicle
dc.audience.mediatorEscola Tècnica Superior d'Enginyeria de Telecomunicació de Barcelona
dc.audience.degreeENGINYERIA DE TELECOMUNICACIÓ (Pla 1992)


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple