Audio Inpainting

Amir Adler 1, Valentin Emiya 2, Maria Jafari 3, Michael Elad 1, Rémi Gribonval 2, Mark D. Plumbley 3
2 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract: We propose the Audio Inpainting framework that recovers audio intervals distorted due to impairments such as impulsive noise, clipping, and packet loss. In this framework, the distorted samples are treated as missing, and the signal is decomposed into overlapping time-domain frames. The restoration problem is then formulated as an inverse problem per audio frame. Sparse representation modeling is employed per frame, and each inverse problem is solved using the Orthogonal Matching Pursuit algorithm together with a discrete cosine or a Gabor dictionary. The performance of this algorithm is shown to be comparable to or better than that of state-of-the-art methods when blocks of samples of variable durations are missing. We also demonstrate that the size of the block of missing samples, rather than the overall number of missing samples, is a crucial parameter for high-quality signal restoration. We further introduce a constrained Matching Pursuit approach for the special case of audio declipping that exploits the sign pattern of clipped audio samples and their maximal absolute value, and that also allows the user to specify the maximum amplitude of the signal. This approach is shown to outperform state-of-the-art and commercially available methods for audio declipping.
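The per-frame step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`dct_dictionary`, `omp`, `inpaint_frame`), the frame length, the stopping tolerance, and the synthetic test frame are all assumptions made for illustration. It handles a single frame only, so the overlapping-frame decomposition and recombination are left out, and it uses a plain discrete cosine dictionary rather than a Gabor one. The idea shown is that the dictionary rows at the missing positions are dropped, Orthogonal Matching Pursuit is run on the reliable samples alone, and the missing samples are then read off the full-length synthesis.

```python
import numpy as np

def dct_dictionary(frame_len, n_atoms):
    """Cosine dictionary with one unit-norm atom per column (DCT-II when n_atoms == frame_len)."""
    n = np.arange(frame_len)[:, None]
    k = np.arange(n_atoms)[None, :]
    D = np.cos(np.pi * (n + 0.5) * k / n_atoms)
    return D / np.linalg.norm(D, axis=0)

def omp(D, y, max_atoms, tol=1e-6):
    """Plain Orthogonal Matching Pursuit: pick the best-correlated atom, then refit by least squares."""
    norms = np.linalg.norm(D, axis=0)
    norms[norms == 0] = 1.0                    # guard against all-zero columns
    coeffs = np.zeros(D.shape[1])
    support = []
    residual = y.astype(float)
    for _ in range(max_atoms):
        if np.linalg.norm(residual) <= tol:
            break
        corr = np.abs(D.T @ residual) / norms  # normalised because masked atoms lose unit norm
        if support:
            corr[support] = 0.0                # never select the same atom twice
        support.append(int(np.argmax(corr)))
        sol, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ sol
        coeffs[:] = 0.0
        coeffs[support] = sol
    return coeffs

def inpaint_frame(frame, reliable, D, max_atoms):
    """Estimate the samples where `reliable` is False; reliable samples are returned unchanged."""
    coeffs = omp(D[reliable, :], frame[reliable], max_atoms)
    restored = frame.astype(float)
    restored[~reliable] = (D @ coeffs)[~reliable]
    return restored

# Hypothetical example: a frame built from two atoms, with an 8-sample gap.
frame_len = 64
D = dct_dictionary(frame_len, frame_len)
clean = 1.0 * D[:, 3] + 0.5 * D[:, 7]
reliable = np.ones(frame_len, dtype=bool)
reliable[20:28] = False                        # samples 20..27 are treated as missing
observed = np.where(reliable, clean, 0.0)
restored = inpaint_frame(observed, reliable, D, max_atoms=10)
print("max abs error:", np.max(np.abs(restored - clean)))
```

The synthetic frame here is exactly sparse in the dictionary, which is a far easier setting than real audio; on recorded signals the achievable quality depends strongly on the gap length, as the abstract notes.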
Document type:
Journal article
IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2012, 20 (3), pp.922 - 932. 〈10.1109/TASL.2011.2168211〉
https://hal.inria.fr/inria-00577079
Contributor: Valentin Emiya
Submitted on: Wednesday, 16 March 2011 - 11:42:09
Last modified on: Wednesday, 16 May 2018 - 11:23:03
Document(s) archived on: Friday, 17 June 2011 - 02:36:37

File

RR-7571.pdf
Files produced by the author(s)

Citation

Amir Adler, Valentin Emiya, Maria Jafari, Michael Elad, Rémi Gribonval, et al. Audio Inpainting. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2012, 20 (3), pp.922 - 932. 〈10.1109/TASL.2011.2168211〉. 〈inria-00577079〉
