Skip to Main content Skip to Navigation
Conference papers

Data Type Classification: Hierarchical Class-to-Type Modeling

Abstract : Data and file type classification research conducted over the past ten to fifteen years has been dominated by competing experiments that only vary the number of classes, types of classes, machine learning technique and input vector. There has been surprisingly little innovation on fundamental approaches to data and file type classification. This chapter focuses on the empirical testing of a hypothesized, two-level hierarchical classification model and the empirical derivation and testing of several alternative classification models. Comparative evaluations are conducted on ten classification models to identify a final winning, two-level classification model consisting of five classes and 52 lower-level data and file types. Experimental results demonstrate that the approach leads to very good class-level classification performance, improved classification performance for data and file types without high entropy (e.g., compressed and encrypted data) and reasonably-equivalent classification performance for high-entropy data and file types.
Document type :
Conference papers
Complete list of metadata

Cited literature [26 references]  Display  Hide  Download
Contributor : Hal Ifip Connect in order to contact the contributor
Submitted on : Wednesday, April 4, 2018 - 4:48:18 PM
Last modification on : Wednesday, April 4, 2018 - 4:55:46 PM


Files produced by the author(s)


Distributed under a Creative Commons Attribution 4.0 International License



Nicole Beebe, Lishu Liu, Minghe Sun. Data Type Classification: Hierarchical Class-to-Type Modeling. 12th IFIP International Conference on Digital Forensics (DF), Jan 2016, New Delhi, India. pp.325-343, ⟨10.1007/978-3-319-46279-0_17⟩. ⟨hal-01758687⟩



Record views


Files downloads