Skip to Main content Skip to Navigation
Conference papers

Deep Learning for Proteomics Data for Feature Selection and Classification

Abstract : Todays high-throughput molecular profiling technologies allow to routinely create large datasets providing detailed information about a given biological sample, e.g. about the concentrations of thousands contained proteins. A standard task in the context of precision medicine is to identify a set of biomarkers (e.g. proteins) from these datasets that can be used for disease diagnosis, prognosis or to monitor treatment response. However, finding good biomarker sets is still a challenging task due to the high dimensionality and complexity of the data and the often quite high noise level.In this work, we present an approach to this problem based on Deep Neural Networks (DNN) and a transfer learning strategy using simulation data. To allow interpretation of the results, we compare different approaches to analyze the learned DNN. Based on these interpretation approaches, we describe how to extract biomarker sets.Comparison of our method to a state-of-the-art L1-SVM approach shows that the new approach is able to find better biomarker sets for classification when small sets are desired. Compared to a state-of-the-art $$\ell _1$$-support vector machine ($$\ell _1$$-SVM) approach, our method achieves better results for the classification task when a small number of features are needed.
Document type :
Conference papers
Complete list of metadata

Cited literature [39 references]  Display  Hide  Download

https://hal.inria.fr/hal-02520063
Contributor : Hal Ifip <>
Submitted on : Thursday, March 26, 2020 - 1:52:32 PM
Last modification on : Tuesday, March 31, 2020 - 3:50:21 PM
Long-term archiving on: : Saturday, June 27, 2020 - 2:41:55 PM

File

 Restricted access
To satisfy the distribution rights of the publisher, the document is embargoed until : 2022-01-01

Please log in to resquest access to the document

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

Sahar Iravani, Tim Conrad. Deep Learning for Proteomics Data for Feature Selection and Classification. 3rd International Cross-Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE), Aug 2019, Canterbury, United Kingdom. pp.301-316, ⟨10.1007/978-3-030-29726-8_19⟩. ⟨hal-02520063⟩

Share

Metrics

Record views

50