Skip to Main content Skip to Navigation
Conference papers

A New Topology-Preserving Distance Metric with Applications to Multi-dimensional Data Clustering

Abstract : In many cases of high dimensional data analysis, data points may lie on manifolds of very complex shapes/geometries. Thus, the usual Euclidean distance may lead to suboptimal results when utilized in clustering or visualization operations. In this work, we introduce a new distance definition in multi-dimensional spaces that preserves the topology of the data point manifold. The parameters of the proposed distance are discussed and their physical meaning is explored through 2 and 3-dimensional synthetic datasets. A robust method for the parameterization of the algorithm is suggested. Finally, a modification of the well-known k-means clustering algorithm is introduced, to exploit the benefits of the proposed distance metric for data clustering. Comparative results including other established clustering algorithms are presented in terms of cluster purity and V-measure, for a number of well-known datasets.
Document type :
Conference papers
Complete list of metadata

Cited literature [11 references]  Display  Hide  Download
Contributor : Hal Ifip <>
Submitted on : Thursday, October 24, 2019 - 12:50:14 PM
Last modification on : Thursday, October 24, 2019 - 12:54:44 PM
Long-term archiving on: : Saturday, January 25, 2020 - 3:24:01 PM


 Restricted access
To satisfy the distribution rights of the publisher, the document is embargoed until : 2022-01-01

Please log in to resquest access to the document


Distributed under a Creative Commons Attribution 4.0 International License



Konstantinos Delibasis. A New Topology-Preserving Distance Metric with Applications to Multi-dimensional Data Clustering. 15th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI), May 2019, Hersonissos, Greece. pp.155-166, ⟨10.1007/978-3-030-19823-7_12⟩. ⟨hal-02331302⟩



Record views