Usage based indexing of web resources with natural language processing - Inria - Institut national de recherche en sciences et technologies du numérique
Conference paper, Year: 2007

Usage based indexing of web resources with natural language processing

Armelle Brun
Anne Boyer

Abstract

Because of the huge amount of information available on the Internet, identifying reliable and interesting items is becoming increasingly difficult and time-consuming. This position paper describes our intended work on multimedia information retrieval using browsing techniques within web navigation. It relies on usage-based indexing of resources: we ignore the nature, content, and structure of the resources. We describe a new approach that takes advantage of the similarity between statistical language modeling and document retrieval systems. A syntax of usage is computed, which defines a Statistical Grammar of Usage (SGU). An SGU enables the classification of resources in support of a personalized navigation assistant tool. It relies both on collaborative filtering, to compute virtual communities of users, and on a new distance-dependent trigger model. The resulting SGU is community dependent.
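To illustrate what a distance-dependent trigger model over navigation data might look like, here is a minimal Python sketch. It is not the authors' method: the abstract does not specify how the model is estimated, so the sketch assumes a simple relative-frequency estimate of P(target | trigger, distance) computed from sessions, where each session is the ordered list of resources one user visited. The function names `train_distance_triggers` and `suggest`, the `max_distance` parameter, and the sample sessions are all hypothetical. The sessions are assumed to come from a single community of users already grouped by collaborative filtering, a step not shown here.

```python
from collections import Counter, defaultdict


def train_distance_triggers(sessions, max_distance=3):
    """Estimate P(target | trigger, distance) by relative frequency.

    Assumption: a resource seen d steps earlier in a session "triggers"
    the resource seen now, for d = 1..max_distance.
    """
    pair_counts = Counter()     # (trigger, distance, target) -> count
    context_counts = Counter()  # (trigger, distance) -> count
    for session in sessions:
        for i, trigger in enumerate(session):
            for d in range(1, max_distance + 1):
                if i + d >= len(session):
                    break
                target = session[i + d]
                pair_counts[(trigger, d, target)] += 1
                context_counts[(trigger, d)] += 1

    model = defaultdict(dict)
    for (trigger, d, target), n in pair_counts.items():
        model[(trigger, d)][target] = n / context_counts[(trigger, d)]
    return dict(model)


def suggest(model, history, max_distance=3):
    """Rank candidate next resources by summing trigger probabilities
    over the last max_distance resources in the navigation history."""
    scores = Counter()
    for d in range(1, max_distance + 1):
        if d > len(history):
            break
        trigger = history[-d]
        for target, p in model.get((trigger, d), {}).items():
            scores[target] += p
    return [resource for resource, _ in scores.most_common()]


# Hypothetical sessions from one community of users.
sessions = [
    ["home", "news", "sports"],
    ["home", "news", "weather"],
    ["home", "news", "sports"],
]
model = train_distance_triggers(sessions)
ranking = suggest(model, ["home", "news"])
```

Only resource identifiers and their order are used, which matches the usage-based premise that the nature, content, and structure of resources are ignored. A real system would likely need smoothing and a principled way to select trigger pairs, both of which are left out of this sketch.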
Main file
WebistBrunBoyer.pdf (67.65 KB)
Origin: Files produced by the author(s)

Dates and versions

inria-00172234 , version 1 (14-09-2007)

Identifiers

  • HAL Id : inria-00172234 , version 1

Cite

Armelle Brun, Anne Boyer. Usage based indexing of web resources with natural language processing. 3rd International Conference on Web Information Systems and Technologies - Webist 07, INSTICC - Institute for Systems and Technologies of Information, Control and Communication ; Open University of Catalonia, Mar 2007, Barcelona, Spain. ⟨inria-00172234⟩
110 views
190 downloads
