Fast Modal Sounds with Scalable Frequency-Domain Synthesis - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue ACM Transactions on Graphics Année : 2008

Fast Modal Sounds with Scalable Frequency-Domain Synthesis

Résumé

Audio rendering of impact sounds, such as those caused by falling objects or explosion debris, adds realism to interactive 3D audiovisual applications, and can be convincingly achieved using modal sound synthesis. Unfortunately, mode-based computations can become prohibitively expensive when many objects, each with many modes, are impacted simultaneously. We introduce a fast sound synthesis approach, based on short-time Fourier Tranforms, that exploits the inherent sparsity of modal sounds in the frequency domain. For our test scenes, this "fast mode summation" can give speedups of 5-8 times compared to a time-domain solution, with slight degradation in quality. We discuss different reconstruction windows, affecting the quality of impact sound "attacks". Our Fourier-domain processing method allows us to introduce a scalable, real-time, audio processing pipeline for both recorded and modal sounds, with auditory masking and sound source clustering. To avoid abrupt computation peaks, such as during the simultaneous impacts of an explosion, we use crossmodal perception results on audiovisual synchrony to effect temporal scheduling. We also conducted a pilot perceptual user evaluation of our method. Our implementation results show that we can treat complex audiovisual scenes in real time with high quality.
Fichier principal
Vignette du fichier
FastModalSounds.pdf (1.89 Mo) Télécharger le fichier
Vignette du fichier
cubesnew.jpg (239.63 Ko) Télécharger le fichier
FastModalSounds.avi (133.54 Mo) Télécharger le fichier
FastModalSounds_Additional.pdf (79.78 Ko) Télécharger le fichier
Vignette du fichier
a.Oriental.jpg (182.62 Ko) Télécharger le fichier
Vignette du fichier
magnet.jpg (110.48 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Format : Figure, Image
Format : Vidéo
Format : Autre
Format : Figure, Image
Format : Figure, Image

Dates et versions

inria-00607249 , version 1 (08-07-2011)

Identifiants

Citer

Nicolas Bonneel, George Drettakis, Nicolas Tsingos, Isabelle Viaud-Delmon, Doug James. Fast Modal Sounds with Scalable Frequency-Domain Synthesis. ACM Transactions on Graphics, 2008, SIGGRAPH Conference Proceedings, 27 (3), ⟨10.1145/1399504.1360623⟩. ⟨inria-00607249⟩
176 Consultations
315 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More