
Mining Governmental Collaboration Through Semantic Profiling of Open Data Catalogues and Publishers

Abstract : With the increasing adoption of open data by governments worldwide, especially in the European Union, deeper analysis of newly published data is becoming a necessity. Rather than analyzing only the published datasets themselves, we aim to analyze the published dataset catalogues. A dataset catalogue, or dataset metadata, contains features that describe in textual form what the data is about. We first acquire data from open data portals, select descriptive dataset catalogue features, and construct an aggregated textual representation of each dataset. We then enrich those textual representations using Natural Language Processing (NLP) methods to create a new, comparable data feature, “Named Entities”. By mining this new feature we are able to produce relatedness networks of datasets and of publishers. These networks are used to point out similarities between the data published across multiple open data portals. Identifying all possible collaborations for integrating and standardizing data features and types would increase the value of the data and ease its analysis.
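The pipeline outlined in the abstract (aggregate catalogue text, extract named entities, link datasets and publishers that share entities) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the record fields, the sample catalogue, the small `GAZETTEER` lookup that stands in for a real NLP named-entity recognizer, and the use of Jaccard similarity with a 0.5 threshold as the relatedness measure.

```python
from itertools import combinations

# Hypothetical catalogue records; the field names are illustrative assumptions.
CATALOGUE = [
    {"id": "ds1", "publisher": "Agency A", "title": "Dublin Bus stops",
     "description": "Stop locations operated by Dublin Bus in Dublin",
     "tags": ["transport"]},
    {"id": "ds2", "publisher": "Agency B", "title": "Dublin Bus timetables",
     "description": "Timetables for Dublin Bus routes connecting to Irish Rail",
     "tags": ["transport"]},
    {"id": "ds3", "publisher": "Agency B", "title": "Galway air quality",
     "description": "Air quality readings in Galway",
     "tags": ["environment"]},
]

# Stand-in for a named-entity recognizer: a fixed list of known entity names.
GAZETTEER = ("dublin", "dublin bus", "galway", "irish rail")


def aggregate_text(record):
    """Join the chosen descriptive features into one textual representation."""
    return " ".join([record["title"], record["description"], " ".join(record["tags"])])


def extract_entities(text):
    """Return the gazetteer names that occur in the text (toy NER substitute)."""
    lowered = text.lower()
    return {name for name in GAZETTEER if name in lowered}


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def relatedness_edges(entity_sets, threshold=0.5):
    """Return (node, node, score) edges for pairs whose similarity meets the threshold."""
    edges = []
    for x, y in combinations(sorted(entity_sets), 2):
        score = jaccard(entity_sets[x], entity_sets[y])
        if score >= threshold:
            edges.append((x, y, score))
    return edges


# Entity sets per dataset, then pooled per publisher.
dataset_entities = {r["id"]: extract_entities(aggregate_text(r)) for r in CATALOGUE}
publisher_entities = {}
for r in CATALOGUE:
    publisher_entities.setdefault(r["publisher"], set()).update(dataset_entities[r["id"]])

dataset_edges = relatedness_edges(dataset_entities)
publisher_edges = relatedness_edges(publisher_entities)
```

In this toy catalogue the two bus datasets end up linked while the air-quality dataset stays isolated, and the two hypothetical publishers are linked through the entities they share. A real setup would replace the gazetteer with an actual NER component and might use a different similarity measure.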
Document type : Conference papers

Cited literature [23 references]
Contributor : Hal Ifip
Submitted on : Wednesday, January 3, 2018 - 5:19:46 PM
Last modification on : Thursday, January 4, 2018 - 9:20:07 AM
Long-term archiving on : Thursday, May 3, 2018 - 4:28:59 AM


Files produced by the author(s)


Distributed under a Creative Commons Attribution 4.0 International License



Mohamed Adel Rezk, Adegboyega Ojo, Islam A. Hassan. Mining Governmental Collaboration Through Semantic Profiling of Open Data Catalogues and Publishers. 18th Working Conference on Virtual Enterprises (PROVE), Sep 2017, Vicenza, Italy. pp.253-264, ⟨10.1007/978-3-319-65151-4_24⟩. ⟨hal-01674854⟩


