An analysis of psychoacoustically-inspired matching pursuit decompositions of speech signals - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2017

An analysis of psychoacoustically-inspired matching pursuit decompositions of speech signals

Résumé

Matching pursuit (MP), particularly using the Gammatones dictionary , has become a popular tool in sparse representations of speech/audio signals. The classical MP algorithm does not however take into account psychoacoustical aspects of the auditory system. Recently two algorithms, called PAMP and PMP have been introduced in order to select only perceptually relevant atoms during MP decomposition. In this paper we compare this two algorithms on few speech sentences. The results suggest that PMP, which also has the strong advantage of including an implicit stop criterion, always outperforms PAMP as well as classical MP. We then raise the question of whether the Gam-matones dictionary is the best choice when using PMP. We thus compare it to the popular Gabor and damped-Sinusoids dictionaries. The results suggest that Gammatones always outperform damped-Sinusoids, and that Gabor yield better reconstruction quality but with higher atoms rate.
Fichier principal
Vignette du fichier
Daoudi-Vinuesa.pdf (224.4 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01627106 , version 1 (31-10-2017)

Identifiants

  • HAL Id : hal-01627106 , version 1

Citer

Khalid Daoudi, Nicolas Vinuesa. An analysis of psychoacoustically-inspired matching pursuit decompositions of speech signals. International Conference on Natural Language, Signal and Speech Processing, Dec 2017, Casablanca, Morocco. ⟨hal-01627106⟩
133 Consultations
148 Téléchargements

Partager

Gmail Facebook X LinkedIn More