An analysis of psychoacoustically-inspired matching pursuit decompositions of speech signals

Abstract : Matching pursuit (MP), particularly using the Gammatones dictionary , has become a popular tool in sparse representations of speech/audio signals. The classical MP algorithm does not however take into account psychoacoustical aspects of the auditory system. Recently two algorithms, called PAMP and PMP have been introduced in order to select only perceptually relevant atoms during MP decomposition. In this paper we compare this two algorithms on few speech sentences. The results suggest that PMP, which also has the strong advantage of including an implicit stop criterion, always outperforms PAMP as well as classical MP. We then raise the question of whether the Gam-matones dictionary is the best choice when using PMP. We thus compare it to the popular Gabor and damped-Sinusoids dictionaries. The results suggest that Gammatones always outperform damped-Sinusoids, and that Gabor yield better reconstruction quality but with higher atoms rate.
Type de document :
Communication dans un congrès
International Conference on Natural Language, Signal and Speech Processing, Dec 2017, Casablanca, Morocco
Liste complète des métadonnées

Littérature citée [17 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01627106
Contributeur : Khalid Daoudi <>
Soumis le : mardi 31 octobre 2017 - 17:55:00
Dernière modification le : jeudi 11 janvier 2018 - 06:25:44
Document(s) archivé(s) le : jeudi 1 février 2018 - 14:10:57

Fichier

Daoudi-Vinuesa.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01627106, version 1

Collections

Citation

Khalid Daoudi, Nicolas Vinuesa. An analysis of psychoacoustically-inspired matching pursuit decompositions of speech signals. International Conference on Natural Language, Signal and Speech Processing, Dec 2017, Casablanca, Morocco. 〈hal-01627106〉

Partager

Métriques

Consultations de la notice

70

Téléchargements de fichiers

22