Automated Determination of the Input Parameter of DBSCAN Based on Outlier Detection

Abstract : Over the last two decades, DBSCAN (Density-Based Spatial Clustering of Applications with Noise) has been one of the most widely used clustering algorithms and is highly cited in the scientific literature. Despite its strengths, DBSCAN has a shortcoming in parameter determination: the input parameter is chosen interactively by the user, guided by a graphical representation of the data. This paper introduces a simple and effective method for determining the input parameter of DBSCAN automatically. The idea is based on a statistical technique for outlier detection, namely the empirical rule. This work also suggests a more accurate method for detecting clusters that lie close to each other. Experimental results in comparison with the old method, together with a time complexity equal to that of the old algorithm, indicate that the proposed method determines the input parameter of DBSCAN automatically, quite reliably and efficiently.
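To illustrate the general idea, the following is a minimal sketch and not the authors' algorithm, since the abstract does not say which quantity the empirical rule is applied to. It assumes the rule (for roughly normal data, about 99.7% of values lie within three standard deviations of the mean) is applied to each point's distance to its k-th nearest neighbour, and that the largest distance not flagged as an outlier is taken as the DBSCAN radius. The names k_distances and empirical_rule_eps, the default k=4 and the three-sigma cutoff are all assumptions made for illustration.

```python
import math
import statistics


def k_distances(points, k):
    """Distance from each point to its k-th nearest neighbour (brute force, O(n^2 log n))."""
    result = []
    for i, p in enumerate(points):
        # Sorted distances from p to every other point.
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        result.append(dists[k - 1])
    return result


def empirical_rule_eps(points, k=4, n_sigma=3.0):
    """Hypothetical helper: choose eps as the largest k-distance that the
    empirical rule does not flag as an outlier (assumed reading of the abstract)."""
    kd = k_distances(points, k)
    mean = statistics.fmean(kd)
    std = statistics.pstdev(kd)
    # Values above mean + n_sigma * std are treated as noise-like outliers.
    cutoff = mean + n_sigma * std
    inliers = [d for d in kd if d <= cutoff]
    return max(inliers)
```

Under these assumptions, a dense cluster with a single distant point yields a radius set by the cluster's own spacing, because the distant point's k-distance falls above the cutoff and is ignored. The returned value could then be passed as the radius parameter of any DBSCAN implementation.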
Document type :
Conference papers
https://hal.inria.fr/hal-01557638
Contributor : Hal Ifip
Submitted on : Thursday, July 6, 2017 - 1:55:34 PM
Last modification on : Tuesday, March 20, 2018 - 2:48:32 PM
Long-term archiving on : Wednesday, January 24, 2018 - 3:02:19 AM

File

430537_1_En_24_Chapter.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Citation

Zohreh Akbari, Rainer Unland. Automated Determination of the Input Parameter of DBSCAN Based on Outlier Detection. 12th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI), Sep 2016, Thessaloniki, Greece. pp.280-291, ⟨10.1007/978-3-319-44944-9_24⟩. ⟨hal-01557638⟩
