B. Adelberg and M. Denny, NoDoSE version 2.0, SIGMOD '99: Proc. 1999 ACM SIGMOD international conference on Management of data, pp.559-561, 1999.
DOI : 10.1145/304181.304576

D. A. Iskandar, J. Pehcevski, J. A. Thom, and S. M. Tahaghoghi, Social Media Retrieval Using Image Features and Structured Text, Lecture Notes in Computer Science, vol.4518, pp.358-372, 2006.
DOI : 10.1007/978-3-540-73888-6_35

E. Blanchard, M. Harzallah, H. Briand, and P. Kuntz, A typology of ontology-based semantic measures, EMOI-INTEROP'05, Proc. Open Interop Workshop on Enterprise Modelling and Ontologies for Interoperability, 2005.

E. Blanchard, P. Kuntz, M. Harzallah, and H. Briand, A Tree-Based Similarity for Evaluating Concept Proximities in an Ontology, Proc. 10th conference of the International Federation of Classification Societies, pp.3-11, 2006.
DOI : 10.1007/3-540-34416-0_1

URL : https://hal.archives-ouvertes.fr/hal-00421387

S. Brin and L. Page, The anatomy of a large-scale hypertextual Web search engine, Proc. 7th International Conference on World Wide Web (WWW7), pp.107-117, 1998.
DOI : 10.1016/S0169-7552(98)00110-X

J. Callan and T. Mitamura, Knowledge-based extraction of named entities, Proceedings of the eleventh international conference on Information and knowledge management, CIKM '02, pp.532-537, 2002.
DOI : 10.1145/584792.584880

W. W. Cohen and S. Sarawagi, Exploiting dictionaries in named entity extraction, Proceedings of the 2004 ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '04, pp.89-98, 2004.
DOI : 10.1145/1014052.1014065

S. Cucerzan, Large-scale named entity disambiguation based on Wikipedia data, Proc. 2007 Joint Conference on EMNLP and CoNLL, pp.708-716, 2007.

S. Cucerzan and D. Yarowsky, Language independent named entity recognition combining morphological and contextual evidence, Proc. 1999 Joint SIGDAT Conference on EMNLP and VLC, pp.90-99, 1999.

H. Cunningham, D. Maynard, K. Bontcheva, and V. Tablan, GATE: A framework and graphical development environment for robust NLP tools and applications, Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, ACL '02, pp.168-175, 2002.
DOI : 10.3115/1073083.1073112

A. P. de Vries and N. Craswell, Entity ranking - guidelines, INEX 2006 Workshop Pre-Proceedings, pp.413-414, 2006.

A. P. de Vries, J. A. Thom, A. Vercoustre, N. Craswell, and M. Lalmas, INEX 2007 Entity ranking track guidelines, INEX 2007 Workshop Pre-Proceedings, 2007.

L. Denoyer and P. Gallinari, The Wikipedia XML corpus, ACM SIGIR Forum, vol.40, issue.1, pp.64-69, 2006.
DOI : 10.1145/1147197.1147210

URL : https://hal.archives-ouvertes.fr/hal-01172244

T. Despeyroux, E. Fraschini, and A. Vercoustre, Extraction d'entités dans des collections évolutives, 7èmes Journées francophones Extraction et Gestion des Connaissances, Revue des Nouvelles Technologies de l'Information (RNTI-E-3), pp.533-538, 2007.

J. Hassell, B. Aleman-Meza, and I. B. Arpinar, Ontology-Driven Automatic Entity Disambiguation in Unstructured Text, Proc. 5th International Semantic Web Conference (ISWC), pp.44-57, 2006.
DOI : 10.1007/11926078_4

J. M. Kleinberg, Authoritative sources in a hyperlinked environment, Journal of the ACM, vol.46, issue.5, pp.604-632, 1999.
DOI : 10.1145/324133.324140

N. Kushmerick, Wrapper induction: Efficiency and expressiveness, Artificial Intelligence, vol.118, issue.1-2, pp.15-68, 2000.
DOI : 10.1016/S0004-3702(99)00100-9

K. Lerman, S. N. Minton, and C. A. Knoblock, Wrapper maintenance: A machine learning approach, Journal of Artificial Intelligence Research, vol.18, pp.149-181, 2003.

B. Liu, R. Grossman, and Y. Zhai, Mining data records in Web pages, Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '03, pp.601-606, 2003.
DOI : 10.1145/956750.956826

S. Malik, A. Trotman, and M. Lalmas, Overview of INEX 2006, Lecture Notes in Computer Science, vol.4518, pp.1-11, 2006.
DOI : 10.1007/978-3-540-73888-6_1

P. McNamee and J. Mayfield, Entity extraction without language-specific resources, COLING-02: proceedings of the 6th conference on Natural language learning, pp.1-4, 2002.

NIST Speech Group, The ACE 2006 evaluation plan: Evaluation of the detection and recognition of ACE entities, values, temporal expressions, relations, and events.

J. Pehcevski, J. A. Thom, and A. Vercoustre, Hybrid XML Retrieval: Combining Information Retrieval and a Native XML Database, Information Retrieval, vol.8, issue.4, pp.571-600, 2005.
DOI : 10.1007/s10791-005-0748-1

URL : https://hal.archives-ouvertes.fr/inria-00000183

B. Popov, A. Kiryakov, D. Manov, A. Kirilov, D. Ognyanoff et al., Towards semantic web information extraction, 2nd International Semantic Web Conference: Workshop on Human Language Technology for the Semantic Web and Web Services, 2003.

A. Sahuguet and F. Azavant, Building light-weight wrappers for legacy web data-sources using W4F, Proc. 25th International Conference on Very Large Data Bases, pp.738-741, 1999.

S. Sekine, Named entity: History and future.

S. Tenier, A. Napoli, X. Polanco, and Y. Toussaint, Annotation sémantique de pages web, 6èmes journées francophones "Extraction et Gestion de Connaissances", 2006.
URL : https://hal.archives-ouvertes.fr/inria-00079378

A. Vercoustre and F. Paradis, A Descriptive Language for Information Object Reuse through Virtual Documents, 4th International Conference on Object-Oriented Information Systems (OOIS'97), pp.299-311, 1997.
DOI : 10.1007/978-1-4471-1525-0_25

E. M. Voorhees and D. K. Harman, TREC, Communications of the ACM, vol.50, issue.11, 2005.
DOI : 10.1145/1297797.1297822

J. Yu, J. A. Thom, and A. Tam, Ontology evaluation using Wikipedia categories for browsing, Proceedings of the sixteenth ACM conference on information and knowledge management, CIKM '07, 2007.
DOI : 10.1145/1321440.1321474
