Mostra el registre d'ítem simple

dc.contributor.authorMahdavi, Kaveh
dc.contributor.authorLabarta Mancho, Jesús José
dc.contributor.authorGiménez, Judit
dc.contributor.otherUniversitat Politècnica de Catalunya. Doctorat en Arquitectura de Computadors
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.contributor.otherBarcelona Supercomputing Center
dc.date.accessioned2020-05-04T12:15:40Z
dc.date.available2020-05-04T12:15:40Z
dc.date.issued2019
dc.identifier.citationMahdavi, K.; Labarta, J.; Giménez, J. Unsupervised feature selection for noisy data. A: International Conference on Advanced Data Mining and Applications. "Advanced Data Mining and Applications, 15th International Conference, ADMA 2019: Dalian, China, November 21–23, 2019: proceedings". Berlín: Springer, 2019, p. 79-94.
dc.identifier.isbn978-3-030-35231-8
dc.identifier.urihttp://hdl.handle.net/2117/186164
dc.description.abstractFeature selection techniques are enormously applied in a variety of data analysis tasks in order to reduce the dimensionality. According to the type of learning, feature selection algorithms are categorized to: supervised or unsupervised. In unsupervised learning scenarios, selecting features is a much harder problem, due to the lack of class labels that would facilitate the search for relevant features. The selecting feature difficulty is amplified when the data is corrupted by different noises. Almost all traditional unsupervised feature selection methods are not robust against the noise in samples. These approaches do not have any explicit mechanism for detaching and isolating the noise thus they can not produce an optimal feature subset. In this article, we propose an unsupervised approach for feature selection on noisy data, called Robust Independent Feature Selection (RIFS). Specifically, we choose feature subset that contains most of the underlying information, using the same criteria as the Independent component analysis (ICA). Simultaneously, the noise is separated as an independent component. The isolation of representative noise samples is achieved using factor oblique rotation whereas noise identification is performed using factor pattern loadings. Extensive experimental results over divers real-life data sets have showed the efficiency and advantage of the proposed algorithm.
dc.description.sponsorshipWe thankfully acknowledge the support of the Comision Interministerial de Ciencia y Tecnologa (CICYT) under contract No. TIN2015-65316-P which has partially funded this work.
dc.format.extent16 p.
dc.language.isoeng
dc.publisherSpringer
dc.subjectÀrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Aprenentatge automàtic
dc.subject.lcshMachine learning
dc.subject.lcshData mining
dc.subject.otherFeature selection
dc.subject.otherIndependent component analysis
dc.subject.otherOblique rotation
dc.subject.otherNoise separation
dc.titleUnsupervised feature selection for noisy data
dc.typeConference report
dc.subject.lemacAprenentatge automàtic
dc.subject.lemacMineria de dades
dc.contributor.groupUniversitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi10.1007/978-3-030-35231-8_6
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttps://link.springer.com/chapter/10.1007/978-3-030-35231-8_6
dc.rights.accessOpen Access
local.identifier.drac28080223
dc.description.versionPostprint (author's final draft)
dc.relation.projectidinfo:eu-repo/grantAgreement/MINECO//TIN2015-65316-P/ES/COMPUTACION DE ALTAS PRESTACIONES VII/
local.citation.authorMahdavi, K.; Labarta, J.; Giménez, J.
local.citation.contributorInternational Conference on Advanced Data Mining and Applications
local.citation.pubplaceBerlín
local.citation.publicationNameAdvanced Data Mining and Applications, 15th International Conference, ADMA 2019: Dalian, China, November 21–23, 2019: proceedings
local.citation.startingPage79
local.citation.endingPage94


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple