Solving global permutation ambiguity of time domain BSS using speaker specific features of speech signals

Abstract : Multidimensional localization of multiple sources using BSS based TDOA estimators, requires the solution of global permutation ambiguity before fusing several TDOA estimations. Since the separation quality of BSS isn't always perfect, it is not easy to decide which TDOA belongs to which source. Here we study the possibility of using several speaker specific features of speech signal in order to recognize perceptually dominant sources in each one of moderately separated outputs of BSS algorithm. We compare the feasibility of different features in terms of validity rate of decisions and computational complexity.
Type de document :
Communication dans un congrès
2009 IEEE Symposium on Industrial Electronics and Applications (ISIEA 2009), Oct 2009, Kuala Lumpur, Malaysia. 2009
Liste complète des métadonnées

Littérature citée [12 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00938356
Contributeur : H. Yahia <>
Soumis le : mercredi 29 janvier 2014 - 12:43:51
Dernière modification le : mercredi 29 novembre 2017 - 09:22:57
Document(s) archivé(s) le : lundi 5 mai 2014 - 11:06:35

Fichier

vk2.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00938356, version 1

Citation

Vahid Khanagha, Ali Khanagha. Solving global permutation ambiguity of time domain BSS using speaker specific features of speech signals. 2009 IEEE Symposium on Industrial Electronics and Applications (ISIEA 2009), Oct 2009, Kuala Lumpur, Malaysia. 2009. 〈hal-00938356〉

Partager

Métriques

Consultations de la notice

110

Téléchargements de fichiers

145