Quality-Aware Integration and Warehousing of Genomic Data

Laure Berti-Équille 1 Fouzia Moussouni 2
1 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : In human health and life sciences, researchers extensively collaborate with each other, sharing biomedical and genomic data and their experimental results. This necessitates dynamically integrating different databases or warehousing them into a single repository. Based on our past experience of building a data warehouse called GEDAW (Gene Expression Data Warehouse) that stores data on genes expressed in the liver during iron overload and liver pathologies, and also relevant information from public databanks (mostly in XML format), DNA chips home experiments and medical records, we present the lessons learned, the data quality issues in this context and the current solutions we propose for integrating and warehousing biomedical data. This paper provides a functional and modular architecture for data quality enhancement and awareness in the complex processes of integration and warehousing of biomedical data.
Complete list of metadatas

Cited literature [50 references]  Display  Hide  Download

https://hal.inria.fr/hal-01855920
Contributor : Laure Berti-Equille <>
Submitted on : Wednesday, August 8, 2018 - 5:40:34 PM
Last modification on : Friday, November 16, 2018 - 1:31:10 AM
Long-term archiving on : Friday, November 9, 2018 - 3:53:04 PM

File

ICIQ05.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01855920, version 1

Citation

Laure Berti-Équille, Fouzia Moussouni. Quality-Aware Integration and Warehousing of Genomic Data. ICIQ’05 - 10th International Conference on Information Quality, Nov 2005, Massachusetts Institute of Technology, Cambridge, MA, United States. pp.1-15. ⟨hal-01855920⟩

Share

Metrics

Record views

542

Files downloads

24