Skip to Main content Skip to Navigation
Journal articles

A Fuzzy Clustering Algorithm for the Mode-Seeking Framework

Thomas Bonis 1 Steve Oudot 1
1 DATASHAPE - Understanding the Shape of Data
CRISAM - Inria Sophia Antipolis - Méditerranée , Inria Saclay - Ile de France
Abstract : In this paper, we propose a new fuzzy clustering algorithm based on the modeseeking framework. Given a dataset in Rd, we define regions of high density that we call cluster cores. We then consider a random walk on a neighborhood graph built on top of our data points which is designed to be attracted by high density regions. The strength of this attraction is controlled by a temperature parameter beta > 0. The membership of a point to a given cluster is then the probability for the random walk to hit the corresponding cluster core before any other. While many properties of random walks (such as hitting times, commute distances, etc. . . ) have been shown to enventually encode purely local information when the number of data points grows, we show that the regularization introduced by the use of cluster cores solves this issue. Empirically, we show how the choice of beta influences the behavior of our algorithm: for small values of beta the result is close to hard modeseeking whereas when beta is close to 1 the result is similar to the output of a (fuzzy) spectral clustering. Finally, we demonstrate the scalability of our approach by providing the fuzzy clustering of a protein configuration dataset containing a million data points in 30 dimensions.
Document type :
Journal articles
Complete list of metadata

Cited literature [17 references]  Display  Hide  Download

https://hal.inria.fr/hal-01111854
Contributor : Steve Oudot <>
Submitted on : Tuesday, June 21, 2016 - 11:03:34 AM
Last modification on : Monday, December 14, 2020 - 5:02:54 PM
Long-term archiving on: : Thursday, September 22, 2016 - 12:31:21 PM

File

Density-Based_Soft_Clustering....
Files produced by the author(s)

Identifiers

Collections

Citation

Thomas Bonis, Steve Oudot. A Fuzzy Clustering Algorithm for the Mode-Seeking Framework. Pattern Recognition Letters, Elsevier, 2018, ⟨10.1016/j.patrec.2017.11.019⟩. ⟨hal-01111854v2⟩

Share

Metrics

Record views

314

Files downloads

529