Progressive Perceptual Audio Rendering of Complex Scenes

Abstract : Despite recent advances, including sound source clustering and perceptual auditory masking, high quality rendering of complex virtual scenes with thousands of sound sources remains a challenge. Two major bottlenecks appear as the scene complexity increases: the cost of clustering itself, and the cost of pre-mixing source signals within each cluster. In this paper, we first propose an improved hierarchical clustering algorithm that remains efficient for large numbers of sources and clusters while providing progressive refinement capabilities. We then present a lossy pre-mixing method based on a progressive representation of the input audio signals and the perceptual importance of each sound source. Our quality evaluation user tests indicate that the recently introduced audio saliency map is inappropriate for this task. Consequently we propose a "pinnacle", loudness-based metric, which gives the best results for a variety of target computing budgets. We also performed a perceptual pilot study which indicates that in audio-visual environments, it is better to allocate more clusters to visible sound sources. We propose a new clustering metric using this result. As a result of these three solutions, our system can provide high quality rendering of thousands of 3D-sound sources on a "gamer-style" PC.
Type de document :
Communication dans un congrès
Symposium on Interactive 3D graphics and games (I3D 2007), Apr 2007, Seattle, United States. ACM, pp.189-196, 2007, I3D '07 Proceedings of the 2007 symposium on Interactive 3D graphics and games. 〈10.1145/1230100.1230133〉
Liste complète des métadonnées

Littérature citée [26 références]  Voir  Masquer  Télécharger


https://hal.inria.fr/inria-00606801
Contributeur : Team Reves <>
Soumis le : mardi 19 juillet 2011 - 10:58:37
Dernière modification le : mercredi 21 mars 2018 - 18:57:08
Document(s) archivé(s) le : lundi 7 novembre 2011 - 11:25:19

Fichiers

Identifiants

Collections

Citation

Thomas Moeck, Nicolas Bonneel, Nicolas Tsingos, George Drettakis, Isabelle Viaud-Delmon, et al.. Progressive Perceptual Audio Rendering of Complex Scenes. Symposium on Interactive 3D graphics and games (I3D 2007), Apr 2007, Seattle, United States. ACM, pp.189-196, 2007, I3D '07 Proceedings of the 2007 symposium on Interactive 3D graphics and games. 〈10.1145/1230100.1230133〉. 〈inria-00606801〉

Partager

Métriques

Consultations de la notice

211

Téléchargements de fichiers

418