Multithreading approach to process real-time updates in KNN algorithms - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2016

Multithreading approach to process real-time updates in KNN algorithms

Résumé

K-Nearest Neighbors algorithm is the core of a considerable amount of online services and applications, like recommendation engines, content-classifiers, information retrieval systems, etc. The users of these services change their preferences and evolve with time, aggravating the computational challenges of KNN more with the ever evolving data to process. In this work, we present UpKNN: an efficient thread-based approach to take the updates of users preferences into account while it computes the KNN efficiently, keeping a check on the wall-time. By using an efficient thread-based approach, UpKNN processes millions of updates online, on a single commodity PC. Our experiments confirm the scalability of UpKNN, both in terms of the number of updates processed and the threads used. UpKNN achieves speedups ranging from 13.64X to 49.5X in the processing of millions of updates, with respect to the performance of a non-partitioned baseline. These results are a direct consequence of reducing the number of disk operations, roughly speaking, only 1% disk operations are performed as compared to the baseline.
Fichier non déposé

Dates et versions

hal-01415495 , version 1 (13-12-2016)

Identifiants

  • HAL Id : hal-01415495 , version 1

Citer

Anne-Marie Kermarrec, Nupur Mittal, Javier Olivares. Multithreading approach to process real-time updates in KNN algorithms. 2016. ⟨hal-01415495⟩
506 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More