, Argonne Leadership Computing Facility. Mira log traces
Shelf algorithms for two-dimensional packing problems, SIAM Journal on Computing, vol.12, issue.3, pp.508-525, 1983. ,
Orthogonal packings in two dimensions, SIAM Journal on Computing, vol.9, issue.4, pp.846-855, 1980. ,
Detecting silent data corruption through data dynamic monitoring for scientific applications, PPoPP, 2014. ,
Sequencing tasks with exponential service times to minimize the expected flow time or makespan, J. ACM, vol.28, issue.1, pp.100-113, 1981. ,
Scheduling partially ordered tasks with probabilistic execution times, SIGOPS Oper. Syst. Rev, vol.9, issue.5, pp.169-177, 1975. ,
Scheduling on identical machines: How good is LPT in an on-line setting, Operations Research Letters, vol.21, issue.4, pp.165-169, 1997. ,
LADR: Low-cost application-level detector for reducing silent output corruptions, HPDC, pp.156-167, 2018. ,
Online-ABFT: An online algorithm based fault tolerance scheme for soft error detection in iterative methods, SIGPLAN Not, vol.48, issue.8, pp.167-176, 2013. ,
Performance bounds for level-oriented two-dimensional packing algorithms, SIAM J. Comput, vol.9, issue.4, pp.808-826, 1980. ,
Shelf algorithms for on-line strip packing, Information Processing Letters, vol.63, issue.4, pp.171-175, 1997. ,
On-line packing and covering problems, Online Algorithms: The State of the Art, pp.147-177, 1998. ,
Theory and practice in parallel job scheduling, JSSPP, pp.1-34, 1997. ,
Optimal on-line scheduling of parallel jobs with dependencies, Journal of Combinatorial Optimization, vol.1, issue.4, pp.393-411, 1998. ,
Dynamic scheduling on parallel machines, Theoretical Computer Science, vol.130, issue.1, pp.49-72, 1994. ,
Bounds for multiprocessor scheduling with resource constraints, SIAM J. Comput, vol.4, issue.2, pp.187-200, 1975. ,
Computers and Intractability, a Guide to the Theory of NP-Completeness, 1979. ,
Online tuning of EASY-backfilling using queue reordering policies, IEEE Transactions on Parallel and Distributed Systems, vol.29, issue.10, pp.2304-2316, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01963216
Stochastic load balancing and related problems, Proceedings of the 40th Annual Symposium on Foundations of Computer Science (FOCS), 1999. ,
Lightweight and accurate silent data corruption detection in ordinary differential equation solvers, Euro-Par, 2016. ,
Strip packing vs. bin packing, Algorithmic Aspects in Information and Management, pp.358-367, 2007. ,
Fault-Tolerance Techniques for High-Performance Computing, Computer Communications and Networks, 2015. ,
Algorithm-based fault tolerance for matrix operations, IEEE Trans. Comput, vol.33, issue.6, pp.518-528, 1984. ,
Online algorithm for parallel job scheduling and strip packing, Approximation and Online Algorithms, pp.67-74, 2008. ,
Core Algorithms of the Maui Scheduler, JSSPP, pp.87-102, 2001. ,
A (3/2+ ) approximation algorithm for scheduling moldable and non-moldable parallel tasks, SPAA, pp.224-235, 2012. ,
Scheduling parallel jobs to minimize the makespan, J. of Scheduling, vol.9, issue.5, pp.433-452, 2006. ,
Allocating bandwidth for bursty connections, Proceedings of the 29th Annual ACM Symposium on Theory of Computing (STOC), pp.664-673, 1997. ,
Analysis of the list scheduling algorithm for precedence constrained parallel tasks, Journal of Combinatorial Optimization, vol.3, issue.1, pp.73-88, 1999. ,
The ANL/IBM SP Scheduling System, JSSPP, pp.295-303, 1995. ,
Two-dimensional packing problems: A survey, European Journal of Operational Research, vol.141, issue.2, pp.241-252, 2002. ,
Addressing failures in exascale computing, Int. J. High Perform. Comput. Appl, vol.28, issue.2, pp.129-173, 2014. ,
Utilization, Predictability, Workloads, and User Runtime Estimates in Scheduling the IBM SP2 with Backfilling, IEEE Trans. Parallel Distrib. Syst, vol.12, issue.6, pp.529-543, 2001. ,
On an on-line scheduling problem for parallel jobs, Inf. Process. Lett, vol.81, issue.6, pp.297-304, 2002. ,
, Stochastic scheduling. Encyclopedia of Optimization, pp.3818-3824, 2009.
The effect of cosmic rays on the soft error rate of a DRAM at ground level, IEEE Trans. Electron Devices, vol.41, issue.4, pp.553-557, 1994. ,
Scheduling: Theory, Algorithms, and Systems, 2008. ,
Scheduling parallel machines on-line, SIAM J. Comput, vol.24, issue.6, pp.1313-1331, 1995. ,
The EASY -LoadLeveler API Project, JSSPP, pp.41-47, 1996. ,
Characterization of backfilling strategies for parallel job scheduling, International Conference on Parallel Processing Workshop, 2002. ,
TORQUE resource manager, Proceedings of the ACM/IEEE Conference on Supercomputing, 2006. ,
Approximate algorithms scheduling parallelizable tasks, SPAA, 1992. ,
Scheduling jobs with stochastic processing requirements on parallel machines to minimize makespan or flowtime, J Appl Probab, vol.19, issue.1, pp.167-182, 1982. ,
Scheduling tasks with exponential service times on non-identical processors to minimize various cost functions, J Appl Probab, vol.17, issue.1, pp.187-202, 1980. ,
Evaluating the EASY-backfill job scheduling of static workloads on clusters, CLUSTER, 2007. ,
Fault tolerant matrix-matrix multiplication: Correcting soft errors online, ScalA'11, pp.25-28, 2011. ,
A note on online strip packing, Journal of Combinatorial Optimization, vol.17, issue.4, pp.417-423, 2009. ,
SLURM: Simple Linux Utility for Resource Management, JSSPP, pp.44-60, 2003. ,
Cosmic ray soft error rates of 16-Mb DRAM memory chips, IEEE Journal of Solid-State Circuits, vol.33, issue.2, pp.246-252, 1998. ,