Skip to Main content Skip to Navigation
Conference papers

Data Type Classification: Hierarchical Class-to-Type Modeling

Abstract : Data and file type classification research conducted over the past ten to fifteen years has been dominated by competing experiments that only vary the number of classes, types of classes, machine learning technique and input vector. There has been surprisingly little innovation on fundamental approaches to data and file type classification. This chapter focuses on the empirical testing of a hypothesized, two-level hierarchical classification model and the empirical derivation and testing of several alternative classification models. Comparative evaluations are conducted on ten classification models to identify a final winning, two-level classification model consisting of five classes and 52 lower-level data and file types. Experimental results demonstrate that the approach leads to very good class-level classification performance, improved classification performance for data and file types without high entropy (e.g., compressed and encrypted data) and reasonably-equivalent classification performance for high-entropy data and file types.
Document type :
Conference papers
Complete list of metadata

Cited literature [26 references]  Display  Hide  Download

https://hal.inria.fr/hal-01758687
Contributor : Hal Ifip <>
Submitted on : Wednesday, April 4, 2018 - 4:48:18 PM
Last modification on : Wednesday, April 4, 2018 - 4:55:46 PM

File

431606_1_En_17_Chapter.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

Nicole Beebe, Lishu Liu, Minghe Sun. Data Type Classification: Hierarchical Class-to-Type Modeling. 12th IFIP International Conference on Digital Forensics (DF), Jan 2016, New Delhi, India. pp.325-343, ⟨10.1007/978-3-319-46279-0_17⟩. ⟨hal-01758687⟩

Share

Metrics

Record views

142

Files downloads

439