Perspectives

Emmanuel Vincent; Tuomas Virtanen; Sharon Gannot

Book chapter, year: 2018

Emmanuel Vincent

Role: Author
PersonId : 1256
IdHAL : emmanuelv
ORCID : 0000-0002-0183-7289
IdRef : 089360176

Speech Modeling for Facilitating Oral-Based Communication

Tuomas Virtanen

Role: Author

Tampere University of Technology [Tampere]

Sharon Gannot

Role: Author

Bar-Ilan University [Israel]

Abstract

Source separation and speech enhancement research has made dramatic progress in the last 30 years. It is now a mainstream topic in speech and audio processing, with hundreds of papers published every year. Separation and enhancement performance have greatly improved and successful commercial applications are increasingly being deployed. This chapter provides an overview of research and development perspectives in the field. We do not attempt to cover all perspectives currently under discussion in the community. Instead, we focus on five directions in which we believe major progress is still possible: getting the most out of deep learning, exploiting phase relationships across time-frequency bins, improving the estimation accuracy of multichannel parameters, addressing scenarios involving multiple microphone arrays or other sensors, and accelerating industry transfer. These five directions are covered in Sections 19.1, 19.2, 19.3, 19.4, and 19.5, respectively.

Domains

Signal and Image Processing [eess.SP]

Main file

vincent_book18_chap19.pdf (897.76 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-01881424

Submitted on: Tuesday, 25 September 2018, 21:25:45

Last modified on: Thursday, 1 February 2024, 10:05:50

Long-term archiving on: Wednesday, 26 December 2018, 17:17:48

Dates and versions

hal-01881424, version 1 (25-09-2018)

Identifiers

HAL Id: hal-01881424, version 1

Cite

Emmanuel Vincent, Tuomas Virtanen, Sharon Gannot. Perspectives. Emmanuel Vincent; Tuomas Virtanen; Sharon Gannot. Audio source separation and speech enhancement, Wiley, 2018, 978-1-119-27989-1. ⟨hal-01881424⟩

Collections

UNIV-RENNES1 CNRS INRIA IRISA UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM
