dc.contributor: Ivanova Radeva, Petia
dc.contributor: Kamal Sarker, Mostafa
dc.contributor.author: Dueñas Gaviria, David
dc.contributor.other: Universitat Politècnica de Catalunya. Universitat de Barcelona
dc.date.accessioned: 2022-11-21T19:44:34Z
dc.date.available: 2022-11-21T19:44:34Z
dc.date.issued: 2022-10-18
dc.identifier.uri: http://hdl.handle.net/2117/376883
dc.description.abstract: For years, Convolutional Neural Networks (CNNs) have dominated computer vision in the medical field. However, various other Deep Learning (DL) techniques have become very popular in this space. Vision Transformers (ViTs) are one such technique that has been gaining popularity in recent years. In this work, we study the performance of ViTs and CNNs on skin lesion classification tasks, specifically melanoma diagnosis. We compare the performance of ViTs to that of CNNs and show that, regardless of the individual performance of the two architectures, an ensemble of both can improve generalization. We also present an adaptation of the Gram-OOD* method (detecting out-of-distribution (OOD) samples using Gram matrices) for skin lesion images. A rescaling method was also used to address the imbalanced dataset problem, which is generally inherent in medical images. The phenomenon of super-convergence was critical to our success in building models under computing and training time constraints. Finally, we train and evaluate an ensemble of ViTs and CNNs, demonstrating that generalization is enhanced by placing first in the 2019 and third in the 2022 ISIC Challenge Live Leaderboard (available at https://challenge.isic-archive.com/leaderboards/live/).
dc.language.iso: eng
dc.publisher: Universitat Politècnica de Catalunya
dc.subject: UPC subject areas::Computer science::Artificial intelligence
dc.subject.lcsh: Deep Learning
dc.subject.lcsh: Neural networks (Computer science)
dc.subject.other: Skin lesion classification
dc.subject.other: Convolutional Neural Networks
dc.subject.other: Vision Transformers
dc.subject.other: ISIC Challenge. Out-of-distribution
dc.subject.other: Ensemble
dc.title: Application of deep learning general-purpose neural architectures based on vision transformers for ISIC melanoma classification
dc.type: Master thesis
dc.subject.lemac: Deep learning (Aprenentatge profund)
dc.subject.lemac: Neural networks (Computer science) (Xarxes neuronals (Informàtica))
dc.identifier.slug: 170557
dc.rights.access: Open Access
dc.date.updated: 2022-11-18T05:00:33Z
dc.audience.educationlevel: Master's
dc.audience.mediator: Facultat d'Informàtica de Barcelona
dc.audience.degree: Master's Degree in Artificial Intelligence (2017 curriculum)
