Audio-Motor Integration for Robot Audition

Antoine Deleforge 1 Alexander Schmidt 2 Walter Kellermann 2
1 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : In the context of robotics, audio signal processing in the wild amounts to dealing with sounds recorded by a system that moves and whose actuators produce noise. This creates additional challenges in sound source localization, signal enhancement and recognition. But the speci-ficity of such platforms also brings interesting opportunities: can information about the robot actuators' states be meaningfully integrated in the audio processing pipeline to improve performance and efficiency? While robot audition grew to become an established field, methods that explicitly use motor-state information as a complementary modality to audio are scarcer. This chapter proposes a unified view of this endeavour, referred to as audio-motor integration. A literature review and two learning-based methods for audio-motor integration in robot audition are presented, with application to single-microphone sound source localization and ego-noise reduction on real data.
Complete list of metadatas

Cited literature [61 references]  Display  Hide  Download

https://hal.inria.fr/hal-01929388
Contributor : Antoine Deleforge <>
Submitted on : Wednesday, November 21, 2018 - 10:46:29 AM
Last modification on : Tuesday, December 18, 2018 - 4:38:02 PM
Long-term archiving on : Friday, February 22, 2019 - 1:29:53 PM

File

Book.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01929388, version 1

Collections

Citation

Antoine Deleforge, Alexander Schmidt, Walter Kellermann. Audio-Motor Integration for Robot Audition. Multimodal Behavior Analysis in the Wild, Academic Press, pp.1-27, 2018. ⟨hal-01929388⟩

Share

Metrics

Record views

101

Files downloads

278