Clustering for unsupervised image segmentation
| dc.audience.degree | GRAU EN ENGINYERIA INFORMÀTICA (Pla 2010) |
| dc.audience.educationlevel | Grau |
| dc.audience.mediator | Facultat d'Informàtica de Barcelona |
| dc.contributor | Blum, Hermann |
| dc.contributor | Ott, Lionel |
| dc.contributor.author | Bercowsky Rama, Andrés Eduardo |
| dc.contributor.covenantee | Eidgenössische Technische Hochschule Zürich |
| dc.contributor.other | Universitat Politècnica de Catalunya |
| dc.date.accessioned | 2024-01-17T13:17:04Z |
| dc.date.available | 2024-01-17T13:17:04Z |
| dc.date.issued | 2023-09-01 |
| dc.date.updated | 2023-10-02T04:00:28Z |
| dc.description.abstract | Unsupervised and self-supervised semantic segmentation has made significant advances in recent years in identifying and labeling regions of an image based on their appearance and context without the need for manual annotations or supervision. However, despite these advancements, the current state of the art still relies on prior knowledge, such as the number of classes present in the image. With research focusing mainly on improving the learned representations, arranging these representations into groups is left to robust but limited methods such as K-means. In this work, we propose FOCL, a computationally efficient clustering algorithm which does not require the number of clusters to be specified a priori. The idea behind FOCL is to move each point to a denser region by iteratively relocating them to the mean of their neighbors. We combine our method with a deep feature extractor to discover the underlying patterns and correlations in the representation. Our approach leverages the proven power of self-supervised learning, where the neural network learns to extract informative representations from the input images without the need for explicit human annotations. FOCL is capable of automatically discovering the number of clusters in the data, while scaling much better than Mean-shift or DBSCAN. |
| dc.identifier.slug | 178051 |
| dc.identifier.uri | https://hdl.handle.net/2117/399691 |
| dc.language.iso | eng |
| dc.publisher | Universitat Politècnica de Catalunya |
| dc.rights.access | Open Access |
| dc.subject | Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial |
| dc.subject.lcsh | Computer vision |
| dc.subject.lcsh | Machine learning |
| dc.subject.lcsh | Deep learning (Machine learning) |
| dc.subject.lcsh | Image segmentation |
| dc.subject.lemac | Visió per ordinador |
| dc.subject.lemac | Aprenentatge automàtic |
| dc.subject.lemac | Aprenentatge profund |
| dc.subject.lemac | Imatges--Segmentació |
| dc.subject.other | Clustering |
| dc.subject.other | Visió per Computador |
| dc.subject.other | Aprenentatge Automàtic |
| dc.subject.other | Aprenentatge profund |
| dc.subject.other | Aprenentatge sense supervisió |
| dc.subject.other | Segmentació d'Imatges |
| dc.subject.other | Segmentació Semàntica |
| dc.subject.other | Computer Vision |
| dc.subject.other | Machine Learning |
| dc.subject.other | Deep Learning |
| dc.subject.other | Unsupervised Learning |
| dc.subject.other | Image Segmentation |
| dc.subject.other | Semantic Segmentation |
| dc.title | Clustering for unsupervised image segmentation |
| dc.type | Bachelor thesis |
| dspace.entity.type | Publication |
Fitxers
Paquet original
1 - 1 de 1



