Topical tags vs. non-topical tags: towards a bipartite classification? - Inria - Institut national de recherche en sciences et technologies du numérique
Journal article, Journal of Information Science, Year: 2014

Topical tags vs. non-topical tags: towards a bipartite classification?

Abstract

In this paper we investigate whether it is possible to create a computational approach that allows us to distinguish topical tags (i.e., talking about the topic of a resource) and non-topical tags (i.e., describing aspects of a resource that are not related to its topic) in folksonomies, in a way that correlates with humans. Towards this goal, we collected 21M tags (1.2M unique terms) from Delicious and we developed an unsupervised statistical algorithm that classifies such tags by applying a word space model adapted to the folksonomy space. Our algorithm analyses the co-occurrence network of tags to a target tag and exploits graph-based metrics for their classification. We validated its outcomes against a reference classification made by humans on a limited number of terms in three separate tests. The analysis of the outcomes of our algorithm shows, in some cases, a consistent disagreement among humans and between humans and our algorithm about what constitutes a topical tag, and suggests the rise of a new category of overly generic tags (i.e., umbrella tags).
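The abstract mentions analysing the co-occurrence network around a target tag and using graph-based metrics, but it does not say which metrics are used. The following is only a minimal sketch of the general idea, not the authors' algorithm: the function names `build_cooccurrence` and `neighbourhood_density`, the choice of metric (how densely a tag's neighbours are linked to each other), and the toy posts are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import combinations


def build_cooccurrence(posts):
    """Map each tag to the set of tags it co-occurs with in at least one post."""
    graph = defaultdict(set)
    for tags in posts:
        for a, b in combinations(sorted(set(tags)), 2):
            graph[a].add(b)
            graph[b].add(a)
    return graph


def neighbourhood_density(graph, target):
    """Fraction of possible links among the target's neighbours that exist.

    This is an illustrative metric chosen for the sketch; the abstract
    does not name the metrics the paper actually uses.
    """
    neighbours = graph.get(target, set())
    k = len(neighbours)
    if k < 2:
        return 0.0
    links = sum(
        1 for a, b in combinations(sorted(neighbours), 2) if b in graph[a]
    )
    return links / (k * (k - 1) / 2)


# Toy, made-up posts (not data from the paper): each set is one bookmark's tags.
posts = [
    {"python", "programming", "toread"},
    {"python", "tutorial", "programming"},
    {"cooking", "recipes", "toread"},
    {"recipes", "cooking", "vegetarian"},
]
graph = build_cooccurrence(posts)
density_python = neighbourhood_density(graph, "python")
density_toread = neighbourhood_density(graph, "toread")
```

In this toy data, "toread" co-occurs with tags from two unrelated groups, so its neighbours are less interconnected than those of "python". A real classifier would still need to choose its metrics and decision thresholds, which the abstract leaves unspecified.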
Main file
55a4f9eb08ae81aec91327f8.pdf (997.94 KB)
Origin: files produced by the author(s)

Dates and versions

hal-01228923, version 1 (15-11-2015)

Cite

Valerio Basile, Silvio Peroni, Fabio Tamburini, Fabio Vitali. Topical tags vs. non-topical tags: towards a bipartite classification? Journal of Information Science, 2014, ⟨10.1177/0165551515585283⟩. ⟨hal-01228923⟩
