Mostra el registre d'ítem simple

dc.contributor.authorGibert, Karina
dc.contributor.authorValls Mateu, Aïda
dc.contributor.authorBatet Sanromà, Montserrat
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Estadística i Investigació Operativa
dc.date.accessioned2015-06-30T11:31:34Z
dc.date.available2015-09-30T00:31:02Z
dc.date.created2014-09-01
dc.date.issued2014-09-01
dc.identifier.citationGibert, Karina; Valls, A.; Batet, M. Introducing semantic variables in mixed distance measures: Impact on hierarchical clustering. "Knowledge and information systems", 01 Setembre 2014, vol. 40, núm. 3, p. 559-593.
dc.identifier.issn0219-1377
dc.identifier.urihttp://hdl.handle.net/2117/28467
dc.description.abstractToday, it is well known that taking into account the semantic information available for categorical variables sensibly improves the meaningfulness of the final results of any analysis. The paper presents a generalization of mixed Gibert's metrics, which originally handled numerical and categorical variables, to include also semantic variables. Semantic variables are defined as categorical variables related to a reference ontology (ontologies are formal structures to model semantic relationships between the concepts of a certain domain). The superconcept-based distance (SCD) is introduced to compare semantic variables taking into account the information provided by the reference ontology. A benchmark shows the good performance of SCD with respect to other proposals, taken from the literature, to compare semantic features. Mixed Gibert's metrics is generalized incorporating SCD. Finally, two real applications based on touristic data show the impact of the generalized Gibert's metrics in clustering procedures and, in consequence, the impact of taking into account the reference ontology in clustering. The main conclusion is that the reference ontology, when available, can sensibly improve the meaningfulness of the final clusters.
dc.format.extent35 p.
dc.language.isoeng
dc.subjectÀrees temàtiques de la UPC::Matemàtiques i estadística::Investigació operativa::Programació matemàtica
dc.subject.lcshOperations research
dc.subject.otherClustering
dc.subject.otherMetrics
dc.subject.otherNumerical and Categorical variables
dc.subject.otherSemantic data
dc.subject.otherOntology
dc.subject.otherBACKGROUND KNOWLEDGE
dc.subject.otherGENE ONTOLOGY
dc.subject.otherSIMILARITY
dc.subject.otherWEB
dc.subject.otherRECOMMENDATIONS
dc.subject.otherPROFILES
dc.subject.otherTOURISM
dc.subject.otherSYSTEMS
dc.subject.otherMETRICS
dc.subject.otherDOMAIN
dc.titleIntroducing semantic variables in mixed distance measures: Impact on hierarchical clustering
dc.typeArticle
dc.subject.lemacOptimització i investigació operativa
dc.contributor.groupUniversitat Politècnica de Catalunya. KEMLG - Grup d'Enginyeria del Coneixement i Aprenentatge Automàtic
dc.identifier.doi10.1007/s10115-013-0663-5
dc.description.peerreviewedPeer Reviewed
dc.subject.ams90B Operations research and management science
dc.relation.publisherversionhttp://link.springer.com/article/10.1007%2Fs10115-013-0663-5
dc.rights.accessRestricted access - publisher's policy
local.identifier.drac12985413
dc.description.versionPostprint (published version)
local.citation.authorGibert, Karina; Valls, A.; Batet, M.
local.citation.publicationNameKnowledge and information systems
local.citation.volume40
local.citation.number3
local.citation.startingPage559
local.citation.endingPage593


Fitxers d'aquest items

Imatge en miniatura

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple