Skip to Main content Skip to Navigation
Conference papers

Arguments Against Using the 1998 DARPA Dataset for Cloud IDS Design and Evaluation and Some Alternative

Abstract : Due to the lack of adequate public datasets, the proponents of many existing cloud intrusion detection systems (IDS) have relied on the DARPA dataset to design and evaluate their models. In the current paper, we show empirically that the DARPA dataset by failing to meet important statistical characteristics of real world cloud traffic data center is inadequate for evaluating cloud IDS. We present, as alternative, a new public dataset collected through a cooperation between our lab and a non-profit cloud service provider, which contains benign data and a wide variety of attack data. We present a new hypervisor-based cloud IDS using instance-oriented feature model and supervised machine learning techniques. We investigate 3 different classifiers: Logistic Regression (LR), Random Forest (RF), and Support Vector Machine (SVM) algorithms. Experimental evaluation on a diversified dataset yields a detection rate of 92.08% and a false positive rate of 1.49% for random forest, the best performing of the three classifiers.
Complete list of metadata

https://hal.inria.fr/hal-03266464
Contributor : Hal Ifip Connect in order to contact the contributor
Submitted on : Monday, June 21, 2021 - 5:31:55 PM
Last modification on : Friday, July 30, 2021 - 3:58:44 PM
Long-term archiving on: : Wednesday, September 22, 2021 - 7:02:47 PM

File

 Restricted access
To satisfy the distribution rights of the publisher, the document is embargoed until : 2023-01-01

Please log in to resquest access to the document

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

Paulo Faria Quinan, Issa Traore, Isaac Woungang, Abdulaziz Aldribi, Onyekachi Nwamuo. Arguments Against Using the 1998 DARPA Dataset for Cloud IDS Design and Evaluation and Some Alternative. 2nd International Conference on Machine Learning for Networking (MLN), Dec 2019, Paris, France. pp.315-332, ⟨10.1007/978-3-030-45778-5_21⟩. ⟨hal-03266464⟩

Share

Metrics

Les métriques sont temporairement indisponibles