A New Topology-Preserving Distance Metric with Applications to Multi-dimensional Data Clustering - Archive ouverte HAL Access content directly
Conference Papers Year : 2019

A New Topology-Preserving Distance Metric with Applications to Multi-dimensional Data Clustering

(1)
1
Konstantinos K. Delibasis
  • Function : Author
  • PersonId : 991052

Abstract

In many cases of high dimensional data analysis, data points may lie on manifolds of very complex shapes/geometries. Thus, the usual Euclidean distance may lead to suboptimal results when utilized in clustering or visualization operations. In this work, we introduce a new distance definition in multi-dimensional spaces that preserves the topology of the data point manifold. The parameters of the proposed distance are discussed and their physical meaning is explored through 2 and 3-dimensional synthetic datasets. A robust method for the parameterization of the algorithm is suggested. Finally, a modification of the well-known k-means clustering algorithm is introduced, to exploit the benefits of the proposed distance metric for data clustering. Comparative results including other established clustering algorithms are presented in terms of cluster purity and V-measure, for a number of well-known datasets.
Fichier principal
Vignette du fichier
483292_1_En_12_Chapter.pdf (3.49 Mo) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-02331302 , version 1 (24-10-2019)

Licence

Attribution - CC BY 4.0

Identifiers

Cite

Konstantinos K. Delibasis. A New Topology-Preserving Distance Metric with Applications to Multi-dimensional Data Clustering. 15th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI), May 2019, Hersonissos, Greece. pp.155-166, ⟨10.1007/978-3-030-19823-7_12⟩. ⟨hal-02331302⟩
19 View
3 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More