Skip to Main content Skip to Navigation
New interface
Conference papers

A scalable biclustering method for heterogeneous medical data

Abstract : We define the problem of biclustering on heterogeneous data, that is, data of various types (binary, numeric, etc.). This problem has not yet been investigated in the biclustering literature.We propose a new method, HBC (Heterogeneous BiClustering), designed to extract biclus- ters from heterogeneous, large-scale, sparse data matrices. The goal of this method is to handle medical data gathered by hospitals (on patients, stays, acts, diagnoses, prescriptions, etc.) and to provide valuable insight on such data. HBC takes advantage of the data sparsity and uses a con- structive greedy heuristic to build a large number of possibly overlapping biclusters. The proposed method is successfully compared with a stan- dard biclustering algorithm on small-size numeric data. Experiments on real-life data sets further assert its scalability and efficiency.
Complete list of metadata
Contributor : Maxence Vandromme Connect in order to contact the contributor
Submitted on : Wednesday, December 21, 2016 - 11:56:34 AM
Last modification on : Tuesday, November 22, 2022 - 2:26:16 PM


  • HAL Id : hal-01420947, version 1



Maxence Vandromme, Julie Jacques, Julien Taillard, Laetitia Jourdan, Clarisse Dhaenens. A scalable biclustering method for heterogeneous medical data. International Workshop on Machine Learning, Optimization and Big Data, Aug 2016, Volterra, Italy. pp.12. ⟨hal-01420947⟩



Record views