Skip to Main content Skip to Navigation
Conference papers

Quality-Aware Integration and Warehousing of Genomic Data

Laure Berti-Équille 1 Fouzia Moussouni 2
1 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : In human health and life sciences, researchers extensively collaborate with each other, sharing biomedical and genomic data and their experimental results. This necessitates dynamically integrating different databases or warehousing them into a single repository. Based on our past experience of building a data warehouse called GEDAW (Gene Expression Data Warehouse) that stores data on genes expressed in the liver during iron overload and liver pathologies, and also relevant information from public databanks (mostly in XML format), DNA chips home experiments and medical records, we present the lessons learned, the data quality issues in this context and the current solutions we propose for integrating and warehousing biomedical data. This paper provides a functional and modular architecture for data quality enhancement and awareness in the complex processes of integration and warehousing of biomedical data.
Complete list of metadatas

Cited literature [50 references]  Display  Hide  Download
Contributor : Laure Berti-Equille <>
Submitted on : Wednesday, August 8, 2018 - 5:40:34 PM
Last modification on : Monday, October 19, 2020 - 10:55:08 AM
Long-term archiving on: : Friday, November 9, 2018 - 3:53:04 PM


Files produced by the author(s)


  • HAL Id : hal-01855920, version 1


Laure Berti-Équille, Fouzia Moussouni. Quality-Aware Integration and Warehousing of Genomic Data. ICIQ’05 - 10th International Conference on Information Quality, Nov 2005, Massachusetts Institute of Technology, Cambridge, MA, United States. pp.1-15. ⟨hal-01855920⟩



Record views


Files downloads