Usage based indexing of web resources with natural language processing

Armelle Brun 1 Anne Boyer 2
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
2 MAIA - Autonomous intelligent machine
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : Due to the huge amount of available information via Internet, the identification of reliable and interesting items becomes more and more difficult and time consuming. This paper is a position paper describing our intended work in the framework of multimedia information retrieval by browsing techniques within web navigation. It relies on a usage-based indexing of resources: we ignore the nature, the content and the structure of resources. We describe a new approach taking advantage of the similarity between statistical modeling of language and document retrieval systems. A syntax of usage is computed that designs a Statistical Grammar of Usage (SGU). A SGU enables resources classification to perform a personalized navigation assistant tool. It relies both on collaborative filtering to compute virtual communities of users and a new distance dependent trigger model. The resulting SGU is a community dependent SGU.
Type de document :
Communication dans un congrès
3rd International Conference on Web Information Systems and Technologies - Webist 07, Mar 2007, Barcelone, Spain. 2007
Liste complète des métadonnées

Littérature citée [12 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00172234
Contributeur : Armelle Brun <>
Soumis le : vendredi 14 septembre 2007 - 15:14:38
Dernière modification le : jeudi 11 janvier 2018 - 06:19:56
Document(s) archivé(s) le : jeudi 8 avril 2010 - 21:54:45

Fichier

WebistBrunBoyer.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00172234, version 1

Collections

Citation

Armelle Brun, Anne Boyer. Usage based indexing of web resources with natural language processing. 3rd International Conference on Web Information Systems and Technologies - Webist 07, Mar 2007, Barcelone, Spain. 2007. 〈inria-00172234〉

Partager

Métriques

Consultations de la notice

263

Téléchargements de fichiers

152