Classification of Content and Users in BitTorrent by Semi-supervised Learning Methods

Konstantin Avrachenkov 1 Paulo Gonçalves 2 Arnaud Legout 3 Marina Sokol 1
1 MAESTRO - Models for the performance analysis and the control of networks
CRISAM - Inria Sophia Antipolis - Méditerranée
2 RESO - Protocols and softwares for very high-performance network
Inria Grenoble - Rhône-Alpes, LIP - Laboratoire de l'Informatique du Parallélisme
3 PLANETE - Protocols and applications for the Internet
Inria Grenoble - Rhône-Alpes, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : P2P downloads still represent a large portion of today's Internet traffic. More than 100 million users operate BitTorrent and generate more than 30% of the total Internet traffic. Recently, a significant research effort has been done to develop tools for automatic classification of Internet traffic by application. The purpose of the present work is to provide a framework for subclassification of P2P traffic generated by the BitTorrent protocol. The general intuition is that the users with similar interests download similar contents. This intuition can be rigorously formalized with the help of graph based semi-supervised learning approach. We have chosen to work with a PageRank based semi-supervised learning method, which scales well with very large volumes of data. We provide recommendations for the choice of parameters in the PageRank based semi-supervised learning method. In particular, we show that it is advantageous to choose labelled points with large PageRank score.
Type de document :
Communication dans un congrès
8th International Wireless Communications and Mobile Computing Conference (3rd International Workshop on Traffic Analysis and Classification), Date-Added = 2012-04-16 17:03:44 +0200, Date-Modified = 2012-08-30 11:13:51 +0200, Aug 2012, Cyprus, Cyprus. 2012, 〈10.1109/IWCMC.2012.6314276〉
Liste complète des métadonnées

https://hal.inria.fr/hal-00747641
Contributeur : Paulo Gonçalves <>
Soumis le : mercredi 31 octobre 2012 - 18:08:33
Dernière modification le : vendredi 20 avril 2018 - 15:44:26

Lien texte intégral

Identifiants

Collections

Citation

Konstantin Avrachenkov, Paulo Gonçalves, Arnaud Legout, Marina Sokol. Classification of Content and Users in BitTorrent by Semi-supervised Learning Methods. 8th International Wireless Communications and Mobile Computing Conference (3rd International Workshop on Traffic Analysis and Classification), Date-Added = 2012-04-16 17:03:44 +0200, Date-Modified = 2012-08-30 11:13:51 +0200, Aug 2012, Cyprus, Cyprus. 2012, 〈10.1109/IWCMC.2012.6314276〉. 〈hal-00747641〉

Partager

Métriques

Consultations de la notice

249