Audio-Motor Integration for Robot Audition

Antoine Deleforge 1 Alexander Schmidt 2 Walter Kellermann 2
1 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : In the context of robotics, audio signal processing in the wild amounts to dealing with sounds recorded by a system that moves and whose actuators produce noise. This creates additional challenges in sound source localization, signal enhancement and recognition. But the speci-ficity of such platforms also brings interesting opportunities: can information about the robot actuators' states be meaningfully integrated in the audio processing pipeline to improve performance and efficiency? While robot audition grew to become an established field, methods that explicitly use motor-state information as a complementary modality to audio are scarcer. This chapter proposes a unified view of this endeavour, referred to as audio-motor integration. A literature review and two learning-based methods for audio-motor integration in robot audition are presented, with application to single-microphone sound source localization and ego-noise reduction on real data.
Type de document :
Chapitre d'ouvrage
Multimodal Behavior Analysis in the Wild, Academic Press, pp.1-27, 2018
Liste complète des métadonnées

Littérature citée [8 références]  Voir  Masquer  Télécharger
Contributeur : Antoine Deleforge <>
Soumis le : mercredi 21 novembre 2018 - 10:46:29
Dernière modification le : mardi 18 décembre 2018 - 16:38:02


Fichiers produits par l'(les) auteur(s)


  • HAL Id : hal-01929388, version 1



Antoine Deleforge, Alexander Schmidt, Walter Kellermann. Audio-Motor Integration for Robot Audition. Multimodal Behavior Analysis in the Wild, Academic Press, pp.1-27, 2018. 〈hal-01929388〉



Consultations de la notice


Téléchargements de fichiers