A Flexible Framework for Asynchronous In Situ and In Transit Analytics for Scientific Simulations

Matthieu Dreher; Bruno Raffin

Conference Papers Year : 2014

A Flexible Framework for Asynchronous In Situ and In Transit Analytics for Scientific Simulations

(1) , (1)

Matthieu Dreher

Function : Correspondent author
PersonId : 938975

Connectez-vous pour contacter l'auteur

PrograMming and scheduling design fOr Applications in Interactive Simulation

Bruno Raffin

Function : Author
PersonId : 4842
IdHAL : bruno-raffin
ORCID : 0000-0002-7980-4946
IdRef : 091616999

PrograMming and scheduling design fOr Applications in Interactive Simulation

Abstract

High performance computing systems are today composed of tens of thousands of processors and deep memory hierarchies. The next generation of machines will further increase the unbalance between I/O capabilities and processing power. To reduce the pressure on I/Os, the in situ analytics paradigm proposes to process the data as closely as possible to where and when the data are produced. Processing can be embedded in the simulation code, executed asynchronously on helper cores on the same nodes, or performed in transit on staging nodes dedicated to analytics. Today, software environ- nements as well as usage scenarios still need to be investigated before in situ analytics become a standard practice. In this paper we introduce a framework for designing, deploying and executing in situ scenarios. Based on a com- ponent model, the scientist designs analytics workflows by first developing processing components that are next assembled in a dataflow graph through a Python script. At runtime the graph is instantiated according to the execution context, the framework taking care of deploying the application on the target architecture and coordinating the analytics workflows with the simulation execution. Component coordination, zero- copy intra-node communications or inter-nodes data transfers rely on per-node distributed daemons. We evaluate various scenarios performing in situ and in transit analytics on large molecular dynamics systems sim- ulated with Gromacs using up to 1664 cores. We show in particular that analytics processing can be performed on the fraction of resources the simulation does not use well, resulting in a limited impact on the simulation performance (less than 6%). Our more advanced scenario combines in situ and in transit processing to compute a molecular surface based on the Quicksurf algorithm.

Domains

Distributed, Parallel, and Cluster Computing [cs.DC]

Fichier principal

ccgrid_final_version_march_2014.pdf (421.77 Ko)

screenMesh.png (167.74 Ko)

Origin : Files produced by the author(s)

Format : Figure, Image

Matthieu Dreher : Connect in order to contact the contributor

https://inria.hal.science/hal-00941413

Submitted on : Friday, May 23, 2014-10:15:04 AM

Last modification on : Thursday, April 4, 2024-8:54:55 PM

Long-term archiving on: Saturday, August 23, 2014-10:41:00 AM

Dates and versions

hal-00941413 , version 1 (23-05-2014)

Identifiers

HAL Id : hal-00941413 , version 1

Cite

Matthieu Dreher, Bruno Raffin. A Flexible Framework for Asynchronous In Situ and In Transit Analytics for Scientific Simulations. CCGrid 2014 - 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, May 2014, Chicago, United States. ⟨hal-00941413⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 UGA CNRS INRIA IRISA LIG LIG_SRCPR LIG_SRCPR_MOAIS INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM LIG_SIDCH

838 View

963 Download

A Flexible Framework for Asynchronous In Situ and In Transit Analytics for Scientific Simulations

Abstract

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Share