Data Governance as Success Factor for Data Science - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Data Governance as Success Factor for Data Science

Paul Brous
  • Fonction : Auteur
  • PersonId : 995645
Marijn Janssen
  • Fonction : Auteur
  • PersonId : 985668
Rutger Krans
  • Fonction : Auteur
  • PersonId : 1098390

Résumé

More and more, asset management organizations are introducing data science initiatives to support predictive maintenance and anomaly detection. Asset management organizations are by nature data intensive to manage their assets like bridges, dykes, railways and roads. For this, they often implement data lakes using a variety of architectures and technologies to store big data and facilitate data science initiatives. However, the decision-outcomes of data science models are often highly reliant on the quality of the data. The data in the data lake therefore has to be of sufficient quality to develop trust by decision-makers. Not surprisingly, organizations are increasingly adopting data governance as a means to ensure that the quality of data entering the data lake is and remains of sufficient quality, and to ensure the organization remains legally compliant. The objective of the case study is to understand the role of data governance as success factor for data science. For this, a case study regarding the governance of data in a data lake in the asset management domain is analyzed to test three propositions contributing to the success of using data science. The results show that unambiguous ownership of the data, monitoring the quality of the data entering the data lake, and a controlled overview of standard and specific compliance requirements are important factors for maintaining data quality and compliance and building trust in data science products.
Fichier principal
Vignette du fichier
492453_1_En_36_Chapter.pdf (532.15 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03222837 , version 1 (10-05-2021)

Licence

Paternité

Identifiants

Citer

Paul Brous, Marijn Janssen, Rutger Krans. Data Governance as Success Factor for Data Science. 19th Conference on e-Business, e-Services and e-Society (I3E), Apr 2020, Skukuza, South Africa. pp.431-442, ⟨10.1007/978-3-030-44999-5_36⟩. ⟨hal-03222837⟩
57 Consultations
79 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More