High Performance in the Cloud with FPGA Groups

Abstract : Field-programmable gate arrays (FPGAs) can offer invaluable computational performance for many compute-intensive algorithms. However, to justify their purchase and administration costs it is necessary to maximize resource utilization over their expected lifetime. Making FPGAs available in a cloud environment would make them attractive to new types of users and applications and help democratize this increasingly popular technology. However, there currently exists no satisfactory technique for offering FPGAs as cloud resources and sharing them between multiple tenants. We propose FPGA groups, which are seen by their clients as a single virtual FPGA, and which aggregate the computational power of multiple physical FPGAs. FPGA groups are elastic, and they may be shared among multiple tenants. We present an autoscaling algorithm to maximize FPGA groups' resource utilization and reduce user-perceived computation latencies. FPGA groups incur a low overhead in the order of 0.09ms per submitted task. When faced with a challenging workload, the autoscaling algorithm increases resource utilization from 52% to 61% compared to a static resource allocation, while reducing task execution latencies by 61%.
Complete list of metadatas

Cited literature [32 references]  Display  Hide  Download

https://hal.inria.fr/hal-01356998
Contributor : Guillaume Pierre <>
Submitted on : Monday, September 19, 2016 - 4:33:10 PM
Last modification on : Thursday, October 3, 2019 - 10:28:04 AM

File

paper.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01356998, version 1

Citation

Anca Iordache, Guillaume Pierre, Peter Sanders, Jose Gabriel de F. Coutinho, Mark Stillwell. High Performance in the Cloud with FPGA Groups. 9th IEEE/ACM International Conference on Utility and Cloud Computing (UCC 2016), Dec 2016, Shanghai, China. ⟨hal-01356998⟩

Share

Metrics

Record views

641

Files downloads

572