Improved Cuckoo Search Algorithm for Document Clustering

Abstract : Efficient document clustering plays an important role in organizing and browsing the information in the World Wide Web. K-means is the most popular clustering algorithms, due to its simplicity and efficiency. However, it may be trapped in local minimum which leads to poor results. Recently, cuckoo search based clustering has proved to reach interesting results. By against, the number of iterations can increase dramatically due to its slowness convergence. In this paper, we propose an improved cuckoo search clustering algorithm in order to overcome the weakness of the conventional cuckoo search clustering. In this algorithm, the global search procedure is enhanced by a local search method. The experiments tests on four text document datasets and one standard dataset extracted from well known collections show the effectiveness and the robustness of the proposed algorithm to improve significantly the clustering quality in term of fitness function, f-measure and purity.
Document type :
Conference papers
Complete list of metadatas

Cited literature [21 references]  Display  Hide  Download

https://hal.inria.fr/hal-01789952
Contributor : Hal Ifip <>
Submitted on : Friday, May 11, 2018 - 3:10:49 PM
Last modification on : Friday, May 11, 2018 - 3:13:33 PM
Long-term archiving on : Monday, September 24, 2018 - 3:50:33 PM

File

339159_1_En_18_Chapter.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

Saida Boushaki, Nadjet Kamel, Omar Bendjeghaba. Improved Cuckoo Search Algorithm for Document Clustering. 5th International Conference on Computer Science and Its Applications (CIIA), May 2015, Saida, Algeria. pp.217-228, ⟨10.1007/978-3-319-19578-0_18⟩. ⟨hal-01789952⟩

Share

Metrics

Record views

641

Files downloads

203