HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

Deep Learning for Proteomics Data for Feature Selection and Classification

Abstract : Todays high-throughput molecular profiling technologies allow to routinely create large datasets providing detailed information about a given biological sample, e.g. about the concentrations of thousands contained proteins. A standard task in the context of precision medicine is to identify a set of biomarkers (e.g. proteins) from these datasets that can be used for disease diagnosis, prognosis or to monitor treatment response. However, finding good biomarker sets is still a challenging task due to the high dimensionality and complexity of the data and the often quite high noise level.In this work, we present an approach to this problem based on Deep Neural Networks (DNN) and a transfer learning strategy using simulation data. To allow interpretation of the results, we compare different approaches to analyze the learned DNN. Based on these interpretation approaches, we describe how to extract biomarker sets.Comparison of our method to a state-of-the-art L1-SVM approach shows that the new approach is able to find better biomarker sets for classification when small sets are desired. Compared to a state-of-the-art $$\ell _1$$-support vector machine ($$\ell _1$$-SVM) approach, our method achieves better results for the classification task when a small number of features are needed.
Document type :
Conference papers
Complete list of metadata

Cited literature [39 references]  Display  Hide  Download

Contributor : Hal Ifip Connect in order to contact the contributor
Submitted on : Thursday, March 26, 2020 - 1:52:32 PM
Last modification on : Tuesday, March 31, 2020 - 3:50:21 PM
Long-term archiving on: : Saturday, June 27, 2020 - 2:41:55 PM


Files produced by the author(s)


Distributed under a Creative Commons Attribution 4.0 International License



Sahar Iravani, Tim Conrad. Deep Learning for Proteomics Data for Feature Selection and Classification. 3rd International Cross-Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE), Aug 2019, Canterbury, United Kingdom. pp.301-316, ⟨10.1007/978-3-030-29726-8_19⟩. ⟨hal-02520063⟩



Record views


Files downloads