Skip to Main content Skip to Navigation
Conference papers

Non-coding RNA Sequences Identification and Classification Using a Multi-class and Multi-label Ensemble Technique

Abstract : High throughput sequencing RNA-sequencing technologies and modern in silico techniques have expanded our knowledge on short non-coding RNAs. These sequences were initially split into various categories based on their cellular functionality and their sequential, thermodynamic and structural properties believing that their sequence can be used as an identifier to distinguish them. However, recent evidence has indicated that the same sequences can act and function as more than one type of non-coding RNAs with a striking example of mature microRNA sequences which can also be transfer RNA fragments. Most of the existing computational methods for the prediction of non-coding RNA sequences have emphasized on the prediction of only one type of noncoding RNAs and even the ones designed for multiclassification do not support multiple labeling and are thus not able to assign a sequence to more than one non-coding RNA type. In the present paper, we introduce a new multilabel- multiclass method based on the combination of multiobjective evolutionary algorithms and multi-label implementations of Random Forests to optimize the feature selection process and assign short RNA sequences to one or more non-coding RNA types. The overall methodology clearly outperformed other machine learning techniques which were used for the same purpose and it is applicable to data coming from RNA-sequencing experiments.
Document type :
Conference papers
Complete list of metadata

Cited literature [24 references]  Display  Hide  Download

https://hal.inria.fr/hal-01821313
Contributor : Hal Ifip <>
Submitted on : Friday, June 22, 2018 - 2:13:33 PM
Last modification on : Friday, June 22, 2018 - 2:24:11 PM
Long-term archiving on: : Monday, September 24, 2018 - 10:33:23 PM

File

468652_1_En_17_Chapter.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

Michalis Stavridis, Aigli Korfiati, Georgios Sakellaropoulos, Seferina Mavroudi, Konstantinos Theofilatos. Non-coding RNA Sequences Identification and Classification Using a Multi-class and Multi-label Ensemble Technique. 14th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI), May 2018, Rhodes, Greece. pp.179-188, ⟨10.1007/978-3-319-92016-0_17⟩. ⟨hal-01821313⟩

Share

Metrics

Record views

362

Files downloads

6