Scaling Smart Appliances for Spatial Data Synthesis

Abstract : With the rapidly growing number of dynamic data streams produced by sensing and experimental devices as well as social networks, scientists are given an unprecedented opportunity to explore a variety of environmental and social phenomena ranging from understanding of weather and climate to population dynamics. One of the main challenges is that dynamic data streams and their computation requirements are volatile: sensors or social networks may generate data at highly variable rates, processing time in an application may significantly change from one stage to the next one, or different phenomena may simply generate different levels of interest. Cloud computing is a promising platform allowing us to cope with such volatility because it enables us to allocate computational resources on demand, for short periods of time, and at an acceptable cost. At the same time using clouds for this purpose is challenging because an application may yield a very different performance depending on the hosting infrastructure, requiring us to pay special attention to how and where we schedule resources. In this poster, we describe our experiences using an application relying on input from social networks, notably geo-located tweets, to discover correlation between users’ work and home locations, with focus in the Illinois area. Our overall intent is to assess the impact of running the same application in offerings from different providers; to this end, we execute data filtering and per-user classification applications in two flavors of Chameleon cloud instances, namely bare-metal and KVM. Also, we analyze specific configuration parameters, such as data block size, replication factor and parallel processing, towards statistically modeling the application performance in a given infrastructure. We then identify and discuss the key parameters that influence the execution time. Finally, we look into the gains brought by accounting for data proximity when scheduling a resource in a multi-site environment.
Type de document :
Poster
SC15 - ACM/IEEE International Conference in Supercomputing, Nov 2015, Austin, United States. 2015, 〈http://sc15.supercomputing.org/〉
Liste complète des métadonnées

https://hal.inria.fr/hal-01241718
Contributeur : Luis Pineda-Morales <>
Soumis le : jeudi 10 décembre 2015 - 21:26:03
Dernière modification le : mardi 21 novembre 2017 - 15:22:41
Document(s) archivé(s) le : vendredi 11 mars 2016 - 23:05:07

Fichier

Pineda-Morales_SC.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01241718, version 1

Citation

Luis Pineda-Morales, Balaji Subramaniam, Kate Keahey, Gabriel Antoniu, Alexandru Costan, et al.. Scaling Smart Appliances for Spatial Data Synthesis. SC15 - ACM/IEEE International Conference in Supercomputing, Nov 2015, Austin, United States. 2015, 〈http://sc15.supercomputing.org/〉. 〈hal-01241718〉

Partager

Métriques

Consultations de la notice

436

Téléchargements de fichiers

89