Information Fusion for Entity Matching in Unstructured Data

Abstract : Every day the global media system produces an abundance of news stories, all containing many references to people. An important task is to automatically generate reliable lists of people by analysing news content. We describe a system that leverages large amounts of data for this purpose. Lack of structure in this data gives rise to a large number of ways to refer to any particular person. Entity matching attempts to connect references that refer to the same person, usually employing some measure of similarity between references. We use information from multiple sources in order to produce a set of similarity measures with differing strengths and weaknesses. We show how their combination can improve precision without decreasing recall.
Type de document :
Communication dans un congrès
Harris Papadopoulos; Andreas S. Andreou; Max Bramer. 6th IFIP WG 12.5 International Conference on Artificial Intelligence Applications and Innovations (AIAI), Oct 2010, Larnaca, Cyprus. Springer, IFIP Advances in Information and Communication Technology, AICT-339, pp.162-169, 2010, Artificial Intelligence Applications and Innovations. 〈10.1007/978-3-642-16239-8_23〉
Liste complète des métadonnées

Littérature citée [13 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01060664
Contributeur : Hal Ifip <>
Soumis le : vendredi 17 novembre 2017 - 15:59:28
Dernière modification le : lundi 18 décembre 2017 - 01:11:00
Document(s) archivé(s) le : dimanche 18 février 2018 - 16:20:36

Fichier

AliC10.pdf
Fichiers produits par l'(les) auteur(s)

Licence


Distributed under a Creative Commons Paternité 4.0 International License

Identifiants

Citation

Omar Ali, Nello Cristianini. Information Fusion for Entity Matching in Unstructured Data. Harris Papadopoulos; Andreas S. Andreou; Max Bramer. 6th IFIP WG 12.5 International Conference on Artificial Intelligence Applications and Innovations (AIAI), Oct 2010, Larnaca, Cyprus. Springer, IFIP Advances in Information and Communication Technology, AICT-339, pp.162-169, 2010, Artificial Intelligence Applications and Innovations. 〈10.1007/978-3-642-16239-8_23〉. 〈hal-01060664〉

Partager

Métriques

Consultations de la notice

68

Téléchargements de fichiers

13