Weeratunga. 1991. The NAS Parallel Benchmarks, The International Journal of Supercomputer Applications ,
The PARSEC Benchmark Suite: Characterization and Architectural Implications, Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques (PACT '08, pp.72-81, 2008. ,
Pattern recognition and machine learning, pp.4-11, 2006. ,
Pattern recognition and machine learning, pp.179-207, 2006. ,
Contentionaware scheduling on multicore systems, ACM Transactions on Computer Systems (TOCS), vol.28, 2010. ,
Random forests, Machine learning, vol.45, pp.5-32, 2001. ,
, , 2014.
hwloc: a Generic Framework for Managing Hardware A nities in HPC Applications, PDP 2010 -The 18th Euromicro International Conference on Parallel, Distributed and Network-Based Computing, 2010. ,
ForestGOMP: an e cient OpenMP environment for NUMA architectures, International Journal of Parallel Programming, 2010. ,
A machine learning-based approach for thread mapping on transactional memory applications, 18th International Conference on High Performance Computing, pp.1-10, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00788791
LIBSVM: a library for support vector machines, ACM transactions on intelligent systems and technology (TIST), vol.2, p.27, 2011. ,
, The Coral Benchmarks Codes, 2018.
Dynamic thread mapping of shared memory applications by exploiting cache coherence protocols, J. Parallel and Distrib. Comput, vol.74, pp.2215-2228, 2014. ,
EagerMap: a task mapping algorithm to improve communication and load balancing in clusters of multicore systems, ACM Transactions on Parallel Computing (TOPC), vol.5, p.17, 2019. ,
Tra c Management: A Holistic Approach to Memory Placement on NUMA Systems, Proceedings of the Eighteenth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '13, pp.381-394, 2013. ,
Addressing characterization methods for memory contention aware co-scheduling, The Journal of Supercomputing, pp.1-33, 2015. ,
Locality vs. Balance: Exploring data mapping policies on NUMA systems, 23rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, pp.9-16, 2015. ,
Communication-aware Process and Thread Mapping Using Online Communication Detection, Parallel Comput, vol.43, pp.43-63, 2015. ,
Characterizing communication and page usage of parallel applications for thread and data mapping. Performance Evaluation, pp.18-36, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01146859
A nity-Based Thread and Data Mapping in Shared Memory Systems, ACM Computing Survey, vol.49, p.38, 2016. ,
The singular value decomposition: Its computation and some applications, IEEE Trans. Automat. Control, vol.25, pp.164-176, 1980. ,
autopinautomated optimization of thread-to-core pinning on multicore systems, Transactions on high-performance embedded architectures and compilers III, pp.219-235, 2011. ,
Pin: building customized program analysis tools with dynamic instrumentation, Acm sigplan notices, vol.40, pp.190-200, 2005. ,
Memory Management in NUMA Multicore Systems: Trapped Between Cache Contention and Interconnect Overhead. SIGPLAN Not, vol.46, pp.11-20, 2011. ,
Hardware Pro le-guided Automatic Page Placement for ccNUMA Systems, Proceedings of the Eleventh ACM SIG-PLAN Symposium on Principles and Practice of Parallel Programming (PPoPP '06, pp.90-99, 2006. ,
Feedback-directed Page Placement for ccNUMA via Hardware-generated Memory Traces, J. Parallel and Distrib. Comput, vol.70, pp.1204-1219, 2010. ,
, Methodology Implementation for Reproducibility of the Paper Experiments, Methodology Implementation, 2019.
PAPI: A portable interface to hardware performance counters, Proceedings of the department of defense HPCMP users group conference, vol.710, 1999. ,
Thread assignment of multithreaded network applications in multicore/multithreaded processors, IEEE Transactions on Parallel and Distributed Systems, vol.24, pp.2513-2525, 2013. ,
Mapping Parallelism to Multi-cores: A Machine Learning Based Approach, Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP '09, pp.75-84, 2009. ,
Addressing Shared Resource Contention in Multicore Processors via Scheduling, Proceedings of the Fifteenth Edition of ASPLOS on Architectural Support for Programming Languages and Operating Systems (ASPLOS XV, pp.129-142, 2010. ,
Survey of scheduling techniques for addressing shared resources in multicore processors, ACM Computing Surveys (CSUR), vol.45, p.4, 2012. ,