Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

Traffic Refinery: Cost-Aware Traffic Representation for Machine Learning in Networks

Abstract : Ever more frequently network management tasks apply machine learning on network traffic. Both the accuracy of a machine learning model and its effectiveness in practice ultimately depend on the representation of raw network traffic as features. Often, the representation of the traffic is as important as the choice of the model itself; furthermore, the features that the model relies on will ultimately determine where (and even whether) the model can be deployed in practice. This paper develops a new framework and system that enables a joint evaluation of both the conventional notions of machine learning performance (e.g., model accuracy) and the systems-level costs of different representations of network traffic. We highlight these two dimensions for a practical network management task, video streaming quality inference, and show that the appropriate operating point for these two dimensions depends on the deployment scenario. We demonstrate the benefit of exploring a range of representations of network traffic and present Traffic Refinery, a proof-of-concept reference implementation that both monitors network traffic at 10 Gbps and transforms the traffic in real time to produce a variety of feature representations for machine learning models. Traffic Refinery both highlights this design space and makes it possible for network operators to easily explore different representations for learning, balancing systems costs related to feature extraction and model training against the resulting model performance.
Document type :
Preprints, Working Papers, ...
Complete list of metadata
Contributor : Renata Teixeira Connect in order to contact the contributor
Submitted on : Wednesday, February 17, 2021 - 1:54:32 AM
Last modification on : Tuesday, January 11, 2022 - 11:16:04 AM

Links full text



Francesco Bronzino, Paul Schmitt, Sara Ayoubi, Hyojoon Kim, Renata Teixeira, et al.. Traffic Refinery: Cost-Aware Traffic Representation for Machine Learning in Networks. 2021. ⟨hal-03143736⟩



Les métriques sont temporairement indisponibles