Skip to Main content Skip to Navigation
Journal articles

Union of MDCT Bases for Audio Coding

Abstract : This paper investigates the use of sparse overcomplete decompositions for audio coding. Audio signals are decomposed over a redundant union of modified discrete cosine transform (MDCT) bases having eight different scales. This approach produces a sparser decomposition than the traditional MDCT-based orthogonal transform and allows better coding efficiency at low bitrates. Contrary to state-of-the-art low bitrate coders, which are based on pure parametric or hybrid representations, our approach is able to provide transparency. Moreover, we use a bitplane encoding approach, which provides a fine-grain scalable coder that can seamlessly operate from very low bitrates up to transparency. Objective evaluation, as well as listening tests, show that the performance of our coder is significantly better than a state-of-the-art transform coder at very low bitrates and has similar performance at high bitrates. We provide a link to test soundfiles and source code to allow better evaluation and reproducibility of the results.
Document type :
Journal articles
Complete list of metadata

https://hal.inria.fr/hal-02652697
Contributor : Gaël Richard <>
Submitted on : Friday, May 29, 2020 - 8:00:41 PM
Last modification on : Wednesday, June 2, 2021 - 4:26:29 PM

Links full text

Identifiers

Citation

Emmanuel Ravelli, Gael Richard, Laurent Daudet. Union of MDCT Bases for Audio Coding. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2008, 16, ⟨10.1109/TASL.2008.2004290⟩. ⟨hal-02652697⟩

Share

Metrics

Record views

61