Conference papers

MetaExtractor: A System for Metadata Extraction from Structured Data Sources

Abstract : The extraction of metadata used during the planning phase in mediation systems assumes the existence of a metadata repository that, in most cases, must be created with substantial human involvement. This dependency raises the complexity of maintaining the system and therefore affects the reliability of the metadata itself. This article presents MetaExtractor, a system that extracts structure, quality, capability and content metadata from the structured data sources available in a mediation system. MetaExtractor is designed as a Multi-Agent System (MAS) in which each agent specializes in extracting a particular type of metadata. The cooperation capability of the MAS allows the metadata repository to be created and maintained. MetaExtractor helps reduce the number of data sources selected during query planning in large-scale mediation systems, because it can prioritize the data sources that contribute most to answering a query. This paper presents the general architecture of MetaExtractor and emphasizes the extraction logic for content metadata and the strategy used to prioritize data sources according to a given query.

Contributor : Hal Ifip
Submitted on : Wednesday, April 12, 2017 - 11:19:21 AM
Last modification on : Monday, November 22, 2021 - 8:42:01 AM
Long-term archiving on : Thursday, July 13, 2017 - 12:36:30 PM


Files produced by the author(s)


Distributed under a Creative Commons Attribution 4.0 International License


  • HAL Id : hal-01506797, version 1


Alexandra Pomares-Quimbaya, Miguel Eduardo Torres-Moreno, Fabián Roldán. MetaExtractor: A System for Metadata Extraction from Structured Data Sources. 1st Cross-Domain Conference and Workshop on Availability, Reliability, and Security in Information Systems (CD-ARES), Sep 2013, Regensburg, Germany. pp.84-99. ⟨hal-01506797⟩


