Performance of Rand's C statistics in clustering analysis: an application to clustering the regions of Turkey

dc.contributor.authorSaracli, Sinan
dc.date.accessioned2026-01-12T21:04:26Z
dc.date.issued2013
dc.departmentAfyon Kocatepe Üniversitesi
dc.description.abstractPurpose: When a clustering problem is encountered, the researcher must be aware that choosing an incorrect clustering method and distance measure may significantly affect the results of the analysis. The purpose of this study is to determine the best clustering method and distance measure in cluster analysis and to cluster the regions of Turkey on the basis of this result. Methods: In hierarchical clustering, there are several clustering methods and distance measures. For comparison of the clustering methods and distance measures, Rand's C statistic is one of the best methods. Rand's comparative statistic C takes on values from 0.0 to 1.0 inclusive that may be used to compare two resultant clusterings produced by applying clustering methods to a data set with unknown structure or to assess the performance of a clustering method on a data set with known structure. Results: In this study, the seven regions of Turkey are clustered by all the clustering methods and distance measures. Related with the social and economic indicators, the final cluster number is taken as three. Then, according to Rand's C statistics, all possible pairs of distance measures for all clustering methods in hierarchical clustering are compared, and the results are given in the related tables. Conclusions: According to the results of all possible comparisons, Ward's method is found to be the best among others, and final clustering of the regions is applied according to Ward's clustering measure.
dc.identifier.doi10.1186/1029-242X-2013-142
dc.identifier.issn1029-242X
dc.identifier.orcid0000-0003-4662-8031
dc.identifier.scopus2-s2.0-84894631080
dc.identifier.scopusqualityQ2
dc.identifier.urihttps://doi.org/10.1186/1029-242X-2013-142
dc.identifier.urihttps://hdl.handle.net/11630/26958
dc.identifier.wosWOS:000323371400001
dc.identifier.wosqualityQ2
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherSpringer International Publishing Ag
dc.relation.ispartofJournal of Inequalities and Applications
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/openAccess
dc.snmzKA_WoS_20260101
dc.subjectRand's C statistics
dc.subjecthierarchical clustering methods
dc.subjectdistance measures
dc.titlePerformance of Rand's C statistics in clustering analysis: an application to clustering the regions of Turkey
dc.typeArticle

Dosyalar