Application-Level Optimizations on NUMA Multicore Architectures: the Apache Case Study - Archive ouverte HAL Access content directly
Reports (Research Report) Year : 2011

Application-Level Optimizations on NUMA Multicore Architectures: the Apache Case Study

(1) , (2) , (3) , (4) , (2)
1
2
3
4

Abstract

Multicore machines with Non-Uniform Memory Accesses (NUMA) are becoming commonplace. It is thus becoming crucial to understand how the resources they provide can be efficiently exploited. Most current research works are tackling the problem at the Operating System (OS) level. They focus on improving existing OS primitives, or on proposing novel OS designs with the aim of reducing OS bottlenecks and improving the scalability of applications running on such machines. In this paper, we adopt a complementary perspective: we examine how to optimize the scalability of a parallel application running on top of an unmodified, currently available operating system. The chosen application is the popular Apache-PHP stack. We highlight three performance issues at different levels of the system due to: (i) excessive remote memory accesses, (ii) inefficient load dispatching among cores, and (iii) contention on kernel data structures. We propose and implement solutions at the application-level for each issue. Our optimized Apache-PHP software stack achieves a 33% higher throughput than the base configuration on a 16-core setup. We conclude the paper with lessons learned on optimizing server applications for multicore computers
Not file

Dates and versions

hal-00950933 , version 1 (23-02-2014)

Identifiers

  • HAL Id : hal-00950933 , version 1

Cite

Fabien Gaud, Renaud Lachaize, Baptiste Lepers, Gilles Muller, Vivien Quéma. Application-Level Optimizations on NUMA Multicore Architectures: the Apache Case Study. [Research Report] RR-LIG-011, LIG. 2011. ⟨hal-00950933⟩
183 View
0 Download

Share

Gmail Facebook Twitter LinkedIn More