CRNN-based joint azimuth and elevation localization with the Ambisonics intensity vector

Lauréline Perotin (1, 2), Romain Serizel (2), Emmanuel Vincent (2), Alexandre Guérin (1)
2 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : We present a source localization system for first-order Ambisonics (FOA) content based on a stacked convolutional and recurrent neural network (CRNN). As input to the CRNN, we propose the FOA acoustic intensity vector, which is easy to compute and closely linked to the sound direction of arrival (DoA). The system estimates the DoA of a point source in both azimuth and elevation. We conduct an experimental evaluation in configurations that include reverberation, noise, and various orientations of the speaker with respect to the microphone. The results show that the proposed architecture and input allow the network to return accurate location estimates in realistic conditions, and that it compares favorably with another recent CRNN-based system.
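To illustrate why the intensity vector is described as easy to compute and closely linked to the DoA, the following is a minimal sketch, not the authors' feature pipeline. It assumes the FOA channels W, X, Y, Z are already available as complex time-frequency arrays (for example from an STFT), and it assumes an encoding convention in which X, Y, Z carry the source direction cosines, so that the real part of conj(W) times [X, Y, Z] points toward the source. The function names and the plane-wave encoder are hypothetical helpers introduced only for this example.

```python
import numpy as np

def foa_intensity_vector(W, X, Y, Z):
    """Active intensity per time-frequency bin: Re(conj(W) * [X, Y, Z]).

    Inputs are complex arrays of identical shape; the output has one
    extra trailing axis of size 3 (x, y, z components).
    """
    return np.stack([np.real(np.conj(W) * C) for C in (X, Y, Z)], axis=-1)

def doa_from_intensity(I):
    """Azimuth and elevation (radians) of the intensity summed over all bins."""
    ix, iy, iz = I.reshape(-1, 3).sum(axis=0)
    azimuth = np.arctan2(iy, ix)
    elevation = np.arctan2(iz, np.hypot(ix, iy))
    return azimuth, elevation

def encode_plane_wave(s, azimuth, elevation):
    """Encode s as an ideal FOA plane wave (assumed gain convention, unit W gain)."""
    W = s
    X = s * np.cos(azimuth) * np.cos(elevation)
    Y = s * np.sin(azimuth) * np.cos(elevation)
    Z = s * np.sin(elevation)
    return W, X, Y, Z

# Synthetic check: a random complex "spectrogram" encoded from a known direction.
rng = np.random.default_rng(0)
s = rng.standard_normal((50, 257)) + 1j * rng.standard_normal((50, 257))
W, X, Y, Z = encode_plane_wave(s, azimuth=0.7, elevation=-0.3)
az, el = doa_from_intensity(foa_intensity_vector(W, X, Y, Z))
print(np.degrees(az), np.degrees(el))
```

For this noiseless, anechoic plane wave the estimate matches the encoded angles up to floating-point error. With reverberation and noise, individual bins would no longer point consistently toward the source, which is presumably where a learned model operating on these features becomes useful.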
Document type :
Conference papers

https://hal.inria.fr/hal-01840453
Contributor : Lauréline Perotin
Submitted on : Monday, July 16, 2018 - 2:20:05 PM
Last modification on : Wednesday, April 3, 2019 - 1:23:01 AM
Long-term archiving on : Wednesday, October 17, 2018 - 2:56:17 PM

Identifiers

  • HAL Id : hal-01840453, version 1

Citation

Lauréline Perotin, Romain Serizel, Emmanuel Vincent, Alexandre Guérin. CRNN-based joint azimuth and elevation localization with the Ambisonics intensity vector. IWAENC 2018 - 16th International Workshop on Acoustic Signal Enhancement, Sep 2018, Tokyo, Japan. ⟨hal-01840453⟩
