Text-informed speech inpainting via voice conversion

Abstract: Speech inpainting consists in recovering parts of a speech signal that are missing. To the best of our knowledge, no existing method achieves satisfactory inpainting of long missing parts, of one second or more. In this work we address this challenging scenario. Since entire words can be lost when the missing parts are this long, we assume that the full text uttered in the speech signal is known. This leads to a new concept of text-informed speech inpainting. To solve this problem we propose a method that synthesizes the missing speech with a speech synthesizer, modifies its vocal characteristics via a voice conversion method, and fills in the missing part with the resulting converted speech sample. We carried out subjective listening tests to compare the proposed approach with two baseline methods.
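The three steps named in the abstract (synthesis, voice conversion, filling in) can be sketched as follows. This is only an illustrative outline, not the authors' implementation: the function name `inpaint_with_text` and the callables `synthesize` and `convert_voice` are hypothetical placeholders for whatever synthesizer and voice conversion method is used, and two choices are assumptions not stated in the abstract: using the observed part of the signal as the reference for conversion, and fitting the patch to the gap by naive truncation or zero-padding.

```python
from typing import Callable, List, Sequence


def inpaint_with_text(
    signal: Sequence[float],
    gap_start: int,
    gap_end: int,
    missing_text: str,
    synthesize: Callable[[str], List[float]],
    convert_voice: Callable[[List[float], Sequence[float]], List[float]],
) -> List[float]:
    """Fill signal[gap_start:gap_end] with converted synthetic speech.

    `synthesize` and `convert_voice` are hypothetical stand-ins supplied
    by the caller; they are not part of any real library.
    """
    gap_len = gap_end - gap_start

    # Step 1: synthesize the missing words from the known text.
    synthetic = synthesize(missing_text)

    # Step 2: convert the synthetic voice toward the original speaker.
    # Assumption: the observed samples around the gap serve as reference.
    reference = list(signal[:gap_start]) + list(signal[gap_end:])
    converted = convert_voice(synthetic, reference)

    # Step 3: fit the converted sample to the gap and insert it.
    # Assumption: naive truncation or zero-padding; a practical system
    # would time-align the sample with the gap instead.
    patch = (converted + [0.0] * gap_len)[:gap_len]
    return list(signal[:gap_start]) + patch + list(signal[gap_end:])
```

The output always has the same length as the input, so the observed samples on both sides of the gap stay in place and only the missing span is replaced.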
Document type:
Conference paper
24th European Signal Processing Conference (EUSIPCO 2016), Aug 2016, Budapest, Hungary. 2016

https://hal.inria.fr/hal-01271257
Contributor: Alexey Ozerov
Submitted on: Tuesday, 22 November 2016 - 16:49:33
Last modified on: Wednesday, 31 January 2018 - 15:08:01
Document(s) archived on: Monday, 20 March 2017 - 23:54:34

File

eusipco16a.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-01271257, version 2

Citation

Pierre Prablanc, Alexey Ozerov, Ngoc Q. K. Duong, Patrick Pérez. Text-informed speech inpainting via voice conversion. 24th European Signal Processing Conference (EUSIPCO 2016), Aug 2016, Budapest, Hungary. 2016. 〈hal-01271257v2〉
