Latency-Aware Strategies for Placing Data Stream Analytics onto Edge Computing

Alexandre da Silva Veith; Marcos Dias de Assuncao; Laurent Lefèvre

Poster Année : 2018

Latency-Aware Strategies for Placing Data Stream Analytics onto Edge Computing

(1, 2) , (3) , (1)

1
2
3

Alexandre da Silva Veith

Fonction : Auteur
PersonId : 11554
IdHAL : alexandre-da-silva-veith
ORCID : 0000-0001-5130-1792

Algorithms and Software Architectures for Distributed and HPC Platforms

Laboratoire de l'Informatique du Parallélisme

Marcos Dias de Assuncao

Fonction : Auteur
PersonId : 5091
IdHAL : marcos-dias-de-assuncao
ORCID : 0000-0002-4218-0260

Department of Computer Science and Software Engineering

Laurent Lefèvre

Fonction : Auteur

Algorithms and Software Architectures for Distributed and HPC Platforms

Résumé

Much of the "big data" generated today is received in near real-time and requires quick analysis. In Internet of Things (IoT) [1, 9], for instance, continuous data streams produced by multiple sources must be handled under very short delays. As a result, several stream processing engines have been proposed. Under several engines, a stream processing application is a directed graph or dataflow whose vertices are operators that execute a function over the incoming data and edges that define how data flows between them. A dataflow has one or multiple sources (i.e., sensors, gateways or actuators), operators that perform transformations on the data (e.g., filtering, mapping, and aggregation) and sinks (i.e., queries that consume or store the data). In a traditional cloud deployment, the whole application is placed in the cloud computing to benefit from virtually unlimited resources. However, processing all the data in the cloud can introduce latency due to data transfer, which makes near real-time processing difficult to achieve. In contrast, edge computing has become an attractive solution for performing certain stream processing operators, as many edge devices have non-trivial compute capacity. The deployment of data stream processing applications onto heterogeneous infrastructure has been proved to be NP-hard [2]. Moving operators from cloud to edge devices is challenging due to limitations of edge devices [5]. Existing work often proposes placements strategies considering user intervention [8]. Many models do not support memory and communication constraints [6, 4] while others consider all data sinks to be located in the cloud, with no feedback loop to actua-tors located at the edge of the network [3, 7]. There is a lack of solutions covering scenarios involving smart cities, precision agriculture, and smart homes comprising various heterogeneous sensors and actuators, as well as, time-constraint applications. We model the data stream processing placement problem considering heterogeneous computational and network resources, and computing and communication as M/M/1 queues (i.e., Poisson arrival distribution, exponential service time and single server). Events are handled in a First-Come, First-Served fashion both by the computation and communication services, guaranteeing the time order of events; an important requirement in many data stream processing applications. The model allows us to calculate the waiting and service times for each message in computation and communication queues allowing for estimating the response time. We then propose two strategies to minimize the application response time by splitting the dataflow graph dynamically and distributing the operators across cloud and edge infrastructure. We focus on real-time analytics applications with multiple sources and sinks distributed across resources. In particular, we first decompose the application graph by considering behaviors such as forks and joins (i.e., split points), and by identifying the operator dependencies recursively. The Response Time Rate (RTR) strategy takes the decomposed graph and organizes the deployment sequence and consecutively calculates the response time for each operator by considering the previous mappings, resource capabilities, and operator requirements. RTR with Region Patterns (RTR+RP) strategy extends RTR by exploiting the split points to first find candidate operators for edge or cloud and then estimates the response time for the edge operators. Comprehensive simulations considering multiple application configurations demonstrate that our approach can improve the response time up to 50%. For future work, we will investigate further techniques to deal with CPU-intensive operators and their energy consumption.

Domaines

Calcul parallèle, distribué et partagé [cs.DC] Performance et fiabilité [cs.PF]

Fichier principal

poster.pdf (1.22 Mo)

hotedge_2018.pdf (51.36 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Alexandre da Silva Veith : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01875999

Soumis le : mardi 18 septembre 2018-13:32:25

Dernière modification le : jeudi 11 mai 2023-11:56:10

Archivage à long terme le : mercredi 19 décembre 2018-13:14:46

Dates et versions

hal-01875999 , version 1 (18-09-2018)

Identifiants

HAL Id : hal-01875999 , version 1

Citer

Alexandre da Silva Veith, Marcos Dias de Assuncao, Laurent Lefèvre. Latency-Aware Strategies for Placing Data Stream Analytics onto Edge Computing. HotEdge'18 - USENIX Workshop on Hot Topics in Edge Computing, Jul 2018, Boston, United States. pp.1, 2018. ⟨hal-01875999⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-LYON CNRS INRIA UNIV-LYON1 INRIA-MECSCI INRIA2 UDL

160 Consultations

71 Téléchargements

Latency-Aware Strategies for Placing Data Stream Analytics onto Edge Computing

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager