Skip to Main content Skip to Navigation
Conference papers

A scalable biclustering method for heterogeneous medical data

Abstract : We define the problem of biclustering on heterogeneous data, that is, data of various types (binary, numeric, etc.). This problem has not yet been investigated in the biclustering literature.We propose a new method, HBC (Heterogeneous BiClustering), designed to extract biclus- ters from heterogeneous, large-scale, sparse data matrices. The goal of this method is to handle medical data gathered by hospitals (on patients, stays, acts, diagnoses, prescriptions, etc.) and to provide valuable insight on such data. HBC takes advantage of the data sparsity and uses a con- structive greedy heuristic to build a large number of possibly overlapping biclusters. The proposed method is successfully compared with a stan- dard biclustering algorithm on small-size numeric data. Experiments on real-life data sets further assert its scalability and efficiency.
Complete list of metadatas

https://hal.inria.fr/hal-01420947
Contributor : Maxence Vandromme <>
Submitted on : Wednesday, December 21, 2016 - 11:56:34 AM
Last modification on : Friday, March 22, 2019 - 1:34:00 AM

Identifiers

  • HAL Id : hal-01420947, version 1

Collections

Citation

Maxence Vandromme, Julie Jacques, Julien Taillard, Laetitia Jourdan, Clarisse Dhaenens. A scalable biclustering method for heterogeneous medical data. International Workshop on Machine Learning, Optimization and Big Data, Aug 2016, Volterra, Italy. pp.12. ⟨hal-01420947⟩

Share

Metrics

Record views

270