Text-informed speech inpainting via voice conversion

Abstract : The problem of speech inpainting consists in recovering some parts in a speech signal that are missing for some reasons. To our best knowledge none of the existing methods allows satisfactory inpainting of missing parts of large size such as one second and longer. In this work we address this challenging scenario. Since in the case of such long missing parts entire words can be lost, we assume that the full text uttered in the speech signal is known. This leads to a new concept of text-informed speech inpainting. To solve this problem we propose a method that is based on synthesizing the missing speech by a speech synthesizer, on modifying its vocal characteristics via a voice conversion method, and on filling in the missing part with the resulting converted speech sample. We carried subjective listening tests to compare the proposed approach with two baseline methods.
Document type :
Conference papers
Complete list of metadatas

https://hal.inria.fr/hal-01271257
Contributor : Alexey Ozerov <>
Submitted on : Monday, February 8, 2016 - 9:50:40 PM
Last modification on : Wednesday, January 31, 2018 - 3:08:01 PM
Long-term archiving on : Saturday, November 12, 2016 - 2:23:28 PM

File

eusipco16a.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01271257, version 1

Citation

Pierre Prablanc, Alexey Ozerov, Ngoc Q. K. Duong, Patrick Pérez. Text-informed speech inpainting via voice conversion. 24th European Signal Processing Conference (EUSIPCO 2016), Aug 2016, Budapest, Hungary. ⟨hal-01271257v1⟩

Share

Metrics

Record views

71

Files downloads

95