TMsDP: two-stage density peak clustering based on multi-strategy optimization
Purpose The density peak clustering algorithm (DP) is proposed to identify cluster centers by two parameters, i.e. ρ value (local density) and δ value (the distance between a point and another point with a higher ρ value). According to the center-identifying principle of the DP, the potential cluster centers should have a higher ρ value and a higher δ value than other points. However, this principle may limit the DP from identifying some categories with multi-centers or the centers in lower-density regions. In addition, the improper assignment strategy of the DP could cause a wrong assignment result for the non-center points. This paper aims to address the aforementioned issues and improve the clustering performance of the DP. Design/methodology/approach First, to identify as many potential cluster centers as possible, the authors construct a point-domain by introducing the pinhole imaging strategy to extend the searching range of the potential cluster centers. Second, they design different novel calculation methods for calculating the domain distance, point-domain density and domain similarity. Third, they adopt domain similarity to achieve the domain merging process and optimize the final clustering results. Findings The experimental results on analyzing 12 synthetic data sets and 12 real-world data sets show that two-stage density peak clustering based on multi-strategy optimization (TMsDP) outperforms the DP and other state-of-the-art algorithms. Originality/value The authors propose a novel DP-based clustering method, i.e. TMsDP, and transform the relationship between points into that between domains to ultimately further optimize the clustering performance of the DP.
Year of publication: |
2022
|
---|---|
Authors: | MaJie, Jie ; Hao, Zhiyuan ; Hu, Mo |
Published in: |
Data Technologies and Applications. - Emerald Publishing Limited, ISSN 2514-9318, ZDB-ID 2935212-5. - Vol. 58.2022, 3, p. 380-406
|
Publisher: |
Emerald Publishing Limited |
Subject: | Data clustering | Density peak clustering algorithm | Merging strategy | Pinhole imaging strategy | Point-domain | Point-domain similarity |
Saved in:
Saved in favorites
Similar items by subject
-
Sener, Ipek N., (2010)
-
Study on Secure Dynamic Covering Algorithm for E-Logistics Information in a Cloud Computing Platform
He, Yan, (2017)
-
Selection of information sources for identifying technology trends: A comparative analysis
Mikova, Nadezhda, (2014)
- More ...
Similar items by person