Supervised by: Ministry of Industry and Information Technology of the People's Republic of China
Sponsored by: Harbin Institute of Technology
Editor-in-chief: Yu Zhou
ISSN: 1005-9113
CN: 23-1378/T

Related citation: Dan Simovici, Joshua Yee. Inertial Entropy and External Validation of Clusterings[J]. Journal of Harbin Institute of Technology (New Series), 2024, 31(5): 41-54. DOI: 10.11916/j.issn.1005-9113.2023115.
Inertial Entropy and External Validation of Clusterings
Authors:
Dan Simovici, Computer Science Department, University of Massachusetts Boston, Boston 02125, USA
Joshua Yee, Computer Science Department, University of Massachusetts Boston, Boston 02125, USA
Abstract:
The axiomatization of Shannon entropy has received considerable attention in the information theory literature. While Shannon entropy is defined on probability distributions, we define a new type of entropy on the set of partitions of finite subsets of metric spaces, a set that has a rich algebraic structure as a partially ordered set. We propose an axiomatization of an entropy-like measure of partitions of sets of objects located in metric spaces, and we derive an analytic expression for this new type of entropy, referred to as inertial entropy. The approach starts with the notion of the inertia of a partition and includes a study of the behavior of the sum of squared errors of a partition. In this context, we characterize the chain of partitions produced by the Ward hierarchical clustering method. Starting from inertial entropies of partitions, we introduce conditional entropies which, in turn, generate metrics on partitions of finite sets. These metrics are used as external validation tools for clusterings of labeled data sets. This type of validation aims to determine to what extent the labeling of the data coincides with the clustering obtained algorithmically, and we obtain a high degree of consistency between the data labeling and the results of several hierarchical clusterings.
Key words: partition; inertia; hierarchical clustering; generalized entropy
DOI:10.11916/j.issn.1005-9113.2023115
CLC Number: TP311
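
The abstract refers to the sum of squared errors of a partition and to the Ward hierarchical clustering method. As a rough illustration only, the sketch below computes the sum of squared errors of a partition of points in Euclidean space and checks the standard fact that fusing two blocks raises it by the Ward merge cost, |A||B| / (|A| + |B|) times the squared distance between the block centroids. It does not implement the paper's inertial entropy, whose analytic expression is not given on this page; the function names and the toy data are hypothetical.

```python
def centroid(block):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(block)
    return [sum(p[i] for p in block) / n for i in range(len(block[0]))]


def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def sse(partition):
    """Sum of squared errors: squared distances of points to their block centroid."""
    total = 0.0
    for block in partition:
        c = centroid(block)
        total += sum(sq_dist(p, c) for p in block)
    return total


def ward_merge_cost(block_a, block_b):
    """Increase in SSE caused by fusing two blocks (the Ward criterion)."""
    na, nb = len(block_a), len(block_b)
    return na * nb / (na + nb) * sq_dist(centroid(block_a), centroid(block_b))


# Hypothetical toy data: three blocks of points in the plane.
a = [(0.0, 0.0), (0.0, 2.0)]
b = [(4.0, 0.0), (4.0, 2.0)]
c = [(10.0, 1.0)]

fine = [a, b, c]        # finer partition
coarse = [a + b, c]     # same partition after fusing blocks a and b

increase = sse(coarse) - sse(fine)
predicted = ward_merge_cost(a, b)
print(increase, predicted)
```

In this toy case both quantities agree, which is why Ward's method can pick the pair of blocks to fuse by comparing merge costs instead of recomputing the sum of squared errors for every candidate partition.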