Solving global permutation ambiguity of time domain BSS using speaker specific features of speech signals

Abstract: Multidimensional localization of competing speakers using BSS-based TDOA estimates requires resolving the global permutation ambiguity before several TDOA estimates are fused. Since the separation quality of BSS is not perfect, it is not easy to decide which TDOA belongs to which source (especially as the number of speakers grows). We study the robustness of several speaker-specific features of speech against dereverberation filtering by evaluating their capability to recognize perceptually dominant sources in each of the moderately enhanced outputs of the BSS algorithm. We compare the performance of several features in terms of Average Decision Statistic and computational complexity.
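
The permutation problem described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: it assumes each BSS output has already been summarized by some speaker-specific feature vector, and it uses plain Euclidean distance with a brute-force search over permutations as a stand-in matching rule. The function name `align_outputs`, the feature values, and the TDOA values are all hypothetical.

```python
from itertools import permutations
from math import dist


def align_outputs(ref_features, other_features):
    """Return perm such that other_features[perm[i]] best matches ref_features[i].

    Brute-force search over all orderings; only practical for a few speakers.
    Euclidean distance is an assumed similarity measure, not one from the paper.
    """
    n = len(ref_features)
    best_perm, best_cost = None, float("inf")
    for perm in permutations(range(n)):
        cost = sum(dist(ref_features[i], other_features[perm[i]]) for i in range(n))
        if cost < best_cost:
            best_perm, best_cost = perm, cost
    return best_perm


# Hypothetical feature vectors for the outputs of two BSS units
# (for example, two microphone pairs observing the same two speakers).
ref = [(1.0, 0.2), (0.1, 0.9)]
other = [(0.15, 0.85), (0.95, 0.25)]

# Hypothetical TDOA estimates (seconds), in the output order of the second unit.
other_tdoas = [-3.0e-4, 2.0e-4]

perm = align_outputs(ref, other)
# Reorder the second unit's TDOAs so index i refers to the same speaker as ref[i].
aligned_tdoas = [other_tdoas[j] for j in perm]
```

Once the outputs of every BSS unit share a common speaker ordering, the TDOA estimates for a given speaker can be fused. For more than a handful of speakers, an assignment solver would be a more reasonable choice than exhaustive search.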
Document type:
Conference paper
2009 IEEE Symposium on Industrial Electronics and Applications, Oct 2009, Kuala Lumpur, Malaysia. 2009

https://hal.inria.fr/inria-00438658
Contributor: Vahid Khanagha
Submitted on: Friday, 4 December 2009 - 11:54:20
Last modified on: Wednesday, 29 November 2017 - 15:10:31

Identifiers

  • HAL Id : inria-00438658, version 1

Citation

Vahid Khanagha, Ali Khanagha. Solving global permutation ambiguity of time domain BSS using speaker specific features of speech signals. 2009 IEEE Symposium on Industrial Electronics and Applications, Oct 2009, Kuala Lumpur, Malaysia. 2009. 〈inria-00438658〉
