A GPU-based Branch-and-Bound algorithm using Integer-Vector-Matrix data structure

Jan Gmys; Mohand Mezmaz; Nouredine Melab; Daniel Tuyttens

doi:10.1016/j.parco.2016.01.008

Article Dans Une Revue Parallel Computing Année : 2016

A GPU-based Branch-and-Bound algorithm using Integer-Vector-Matrix data structure

(1, 2, 3) , (2) , (1) , (2)

1
2
3

Jan Gmys

Fonction : Auteur
PersonId : 178112
IdHAL : jan-gmys
ORCID : 0000-0001-9635-4396
IdRef : 224900633

Parallel Cooperative Multi-criteria Optimization

Institut de Mathématiques [Mons]

Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189

Mohand Mezmaz

Fonction : Auteur

Institut de Mathématiques [Mons]

Nouredine Melab

Fonction : Auteur

Parallel Cooperative Multi-criteria Optimization

Daniel Tuyttens

Fonction : Auteur

Institut de Mathématiques [Mons]

Résumé

Branch-and-Bound (B&B) algorithms are tree-based exploratory methods for solving combinatorial optimization problems exactly to optimality. These problems are often large in size and known to be NP-hard to solve. The construction and exploration of the B&B-tree are performed using four operators: branching, bounding, selection and pruning. Such algorithms are irregular which makes their parallel design and implementation on GPU challenging. Existing GPU-accelerated B&B algorithms perform only a part of the algorithm on the GPU and rely on the transfer of pools of subproblems across the PCI Express bus to the device. To the best of our knowledge, the algorithm presented in this paper is the first GPU-based B&B algorithm that performs all four operators on the device and subsequently avoids the data transfer bottleneck between CPU and GPU. The implementation on GPU is based on the Integer-Vector-Matrix (IVM) data structure which is used instead of a conventional linked-list to store and manage the pool of subproblems. This paper revisits the IVM-based B&B algorithm on the GPU, addressing the irregularity of the algorithm in terms of workload, memory access patterns and control flow. In particular, the focus is put on reducing thread divergence by making a judicious choice for the mapping of threads onto the data. Compared to a GPU-accelerated B&B based on a linked-list, the algorithm presented in this paper solves a set of standard flowshop instances on average 3.3 times faster.

Mots clés

GPU computing Irregular applications Branch-and-Bound Combinatorial optimization

Domaines

Calcul parallèle, distribué et partagé [cs.DC] Algorithme et structure de données [cs.DS] Recherche opérationnelle [math.OC]

Fichier principal

Gmys_et_al_revised_Manuscript.pdf (1.87 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Jan Gmys : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01389471

Soumis le : vendredi 28 octobre 2016-14:42:59

Dernière modification le : mercredi 24 janvier 2024-09:54:23

Dates et versions

hal-01389471 , version 1 (28-10-2016)

Identifiants

HAL Id : hal-01389471 , version 1
DOI : 10.1016/j.parco.2016.01.008

Citer

Jan Gmys, Mohand Mezmaz, Nouredine Melab, Daniel Tuyttens. A GPU-based Branch-and-Bound algorithm using Integer-Vector-Matrix data structure. Parallel Computing, 2016, Parallel Computing, 59, pp.119-139. ⟨10.1016/j.parco.2016.01.008⟩. ⟨hal-01389471⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA CRISTAL INRIA2 UNIV-LILLE

172 Consultations

620 Téléchargements

A GPU-based Branch-and-Bound algorithm using Integer-Vector-Matrix data structure

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager