A weighted Cramer's V index for the assessment of stability in the fuzzy clustering of class C G protein-coupled receptors
Document typePart of book or chapter of book
Rights accessOpen Access
After decades of intensive use, K-Means is still a common choice for crisp data clustering in real-world applications, particularly in biomedicine and bioinformatics. It is well-known that different initializations of the algorithm can lead to different solutions, precluding replicability. It has also been reported that even solutions with very similar errors may widely differ. A criterion for the choice of clustering solutions according to a combination of error and stability measures has recently been suggested. It is based on the use of Cramér’s V index, calculated from contingency tables, which is valid only for crisp clustering. Here, this criterion is extended to fuzzy and probabilistic clustering by first defining weighted contingency tables and a corresponding weighted Cramér’s V index. The proposed method is illustrated using Fuzzy C-Means in a proteomics problem.
CitationVellido, A., Halka, C., Nebot, M. A weighted Cramer's V index for the assessment of stability in the fuzzy clustering of class C G protein-coupled receptors. A: "Bioinformatics and Biomedical Engineering: Third International Conference, IWBBIO 2015: Granada, Spain, April 15-17, 2015: proceedings, part I". Berlín: Springer, 2015, p. 536-547.