Skip to Main content Skip to Navigation
Journal articles

Efficient Out-of-core and Out-of-place Rectangular Matrix Transposition and Rotation

Paul Godard 1, 2, 3 Vincent Loechner 1, 2 Cédric Bastoul 1, 2
1 CAMUS - Compilation pour les Architectures MUlti-coeurS
Inria Nancy - Grand Est, ICube - Laboratoire des sciences de l'ingénieur, de l'informatique et de l'imagerie
Abstract : Modern computers keep following the traditional model of addressing memory linearly for their main memory and out-of-core storage. While this model allows efficient row access to row-major 2D matrices, it introduces complexity to perform efficient column access. A common strategy to improve these accesses is to transpose or rotate the matrix beforehand, thus the accessing complexity is centralized in one transformation operation. Further column accesses are performed as row accesses to the transposed matrix therefore they are optimized to the memory model. In this paper, we propose an efficient solution to perform in-memory or out-of-core rectangular matrix transposition and rotation by using an out-of-place strategy, reading a matrix from an input file and writing the transformed matrix to another (output) file. An originality of our processing algorithm is to rely on an optimized use of the page cache mechanism. It is parallel, optimized by several levels of tiling and independent of any disk block size. We evaluate our approach on five common storage configurations: HDD, hybrid HDD-SSD, SSD, software RAID 0 of several SSDs and NVMe. We show that it brings significant performance improvement over a hand-tuned optimized reference implementation developed by the Caldera company and we confront it against the baseline speed of a straight file copy.
Complete list of metadatas

Cited literature [11 references]  Display  Hide  Download
Contributor : Vincent Loechner <>
Submitted on : Thursday, October 8, 2020 - 11:46:45 AM
Last modification on : Monday, October 26, 2020 - 3:37:47 PM


Files produced by the author(s)



Paul Godard, Vincent Loechner, Cédric Bastoul. Efficient Out-of-core and Out-of-place Rectangular Matrix Transposition and Rotation. IEEE Transactions on Computers, Institute of Electrical and Electronics Engineers, 2020, ⟨10.1109/TC.2020.3030592⟩. ⟨hal-02960539⟩



Record views


Files downloads