Perspectives

Abstract : Source separation and speech enhancement research has made dramatic progress in the last 30 years. It is now a mainstream topic in speech and audio processing, with hundreds of papers published every year. Separation and enhancement performance have greatly improved and successful commercial applications are increasingly being deployed. This chapter provides an overview of research and development perspectives in the field. We do not attempt to cover all perspectives currently under discussion in the community. Instead, we focus on five directions in which we believe major progress is still possible: getting the most out of deep learning, exploiting phase relationships across time-frequency bins, improving the estimation accuracy of multichannel parameters, addressing scenarios involving multiple microphone arrays or other sensors, and accelerating industry transfer. These five directions are covered in Sections 19.1, 19.2, 19.3, 19.4, and 19.5, respectively.
Document type :
Book sections
Liste complète des métadonnées

https://hal.inria.fr/hal-01881424
Contributor : Emmanuel Vincent <>
Submitted on : Tuesday, September 25, 2018 - 9:25:45 PM
Last modification on : Wednesday, April 3, 2019 - 1:23:14 AM
Document(s) archivé(s) le : Wednesday, December 26, 2018 - 5:17:48 PM

File

vincent_book18_chap19.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01881424, version 1

Collections

Citation

Emmanuel Vincent, Tuomas Virtanen, Sharon Gannot. Perspectives. Emmanuel Vincent; Tuomas Virtanen; Sharon Gannot. Audio source separation and speech enhancement, Wiley, 2018, 978-1-119-27989-1. ⟨https://www.wiley.com/en-us/Audio+Source+Separation+and+Speech+Enhancement-p-9781119279891⟩. ⟨hal-01881424⟩

Share

Metrics

Record views

70

Files downloads

35