DREGON: Dataset and Methods for UAV-Embedded Sound Source Localization

Martin Strauss 1 Pol Mordel 2 Victor Miguet 3 Antoine Deleforge 4, 5
2 RAINBOW - Sensor-based and interactive robotics
Inria Rennes – Bretagne Atlantique , IRISA_D5 - SIGNAUX ET IMAGES NUMÉRIQUES, ROBOTIQUE
4 PANAMA - Parcimonie et Nouveaux Algorithmes pour le Signal et la Modélisation Audio
Inria Rennes – Bretagne Atlantique , IRISA_D5 - SIGNAUX ET IMAGES NUMÉRIQUES, ROBOTIQUE
5 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper introduces DREGON, a novel publicly-available dataset that aims at pushing research in sound source localization using a microphone array embedded in an unmanned aerial vehicle (UAV). The dataset contains both clean and noisy in-flight audio recordings continuously annotated with the 3D position of the target sound source using an accurate motion capture system. In addition, various signals of interests are available such as the rotational speed of individual rotors and inertial measurements at all time. Besides introducing the dataset, this paper sheds light on the specific properties, challenges and opportunities brought by the emerging task of UAV-embedded sound source localization. Several baseline methods are evaluated and compared on the dataset, with real-time applicability in mind. Very promising results are obtained for the localization of a broad-band source in loud noise conditions, while speech localization remains a challenge under extreme noise levels.
Document type :
Conference papers
Complete list of metadatas

Cited literature [24 references]  Display  Hide  Download

https://hal.inria.fr/hal-01854878
Contributor : Eric Marchand <>
Submitted on : Tuesday, August 7, 2018 - 11:31:41 AM
Last modification on : Thursday, February 7, 2019 - 5:08:01 PM
Long-term archiving on : Thursday, November 8, 2018 - 1:20:48 PM

File

2018_iros_strauss.pdf
Files produced by the author(s)

Identifiers

Citation

Martin Strauss, Pol Mordel, Victor Miguet, Antoine Deleforge. DREGON: Dataset and Methods for UAV-Embedded Sound Source Localization. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2018), Oct 2018, Madrid, Spain. pp.5735-5742, ⟨10.1109/IROS.2018.8593581⟩. ⟨hal-01854878⟩

Share

Metrics

Record views

941

Files downloads

1210