Skip to Main content Skip to Navigation
Journal articles

Runtime, Speculative On-Stack Parallelization of For-Loops in Binary Programs

Abstract : Nowadays almost every device has parallel architecture, hence parallelization is almost always desirable. However, parallelizing legacy running programs is very challenging. That is due to the fact that usually source code is not available, and runtime parallelization, without program restarting is challenging. Also, detecting parallelizable code is difficult, due to possible dependencies and different execution paths that undecidable statically. Therefore, speculation is a typical approach whereby wrongly parallelized code is detected and rolled back at runtime. This paper considers utilizing processes to implement speculative parallelization using on-stack replacement, allowing for generally simple and portable design where forking a new process enters the speculative state, and killing a faulty process simply performs the roll back operation. While the cost of such operations are high, the approach is promising for cases where the parallel section is long and dependency issues are rare. Also, our proposed system performs speculative parallelization on binary code at runtime, without the need for source code, restarting the program or special hardware support. Initial experiments show about 2\times to 3\times speedup for speculative execution over serial one, when three fourth of loop iterations are parallelizable. Also, maximum measured speculation overhead over pure parallel execution is 5.8%.
Document type :
Journal articles
Complete list of metadata

Cited literature [9 references]  Display  Hide  Download
Contributor : Erven Rohou Connect in order to contact the contributor
Submitted on : Tuesday, October 9, 2018 - 5:29:24 PM
Last modification on : Wednesday, October 27, 2021 - 12:30:56 PM


Files produced by the author(s)




Marwa Yusuf, Ahmed El-Mahdy, Erven Rohou. Runtime, Speculative On-Stack Parallelization of For-Loops in Binary Programs. IEEE Letters of the Computer Society, IEEE, 2018, pp.1-4. ⟨10.1109/LOCS.2018.2872454⟩. ⟨hal-01890719⟩



Record views


Files downloads