Suppression of Phasiness for Time-Scale Modifications of Speech Signals Based on a Shape Invariance Property

Joseph Di Martino 1 Yves Laprie 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : Time-scale modifications of speech signals, based on frequency-domain techniques, are hampered by two important artifacts which are ''phasiness'' and ''transient smearing''. They correspond to the destruction of the shape of the original signal, i.e. the de-synchronization between the phases of frequency components. This paper describes an algorithm that preserves the shape invariance of speech signals in the context of a phase vocoder. Phases are corrected at the onset of each voiced region. Modified signals, even for large expansion factors, are of high quality and free from transient smearing or phasiness. A demonstration is proposed in the web page: http://www.loria.fr/~jdm/PhaseVocoder/index.html where some audio files can be down-loaded.
Document type :
Conference papers
Complete list of metadatas

https://hal.inria.fr/inria-00108073
Contributor : Joseph Di Martino <>
Submitted on : Thursday, October 19, 2006 - 3:40:46 PM
Last modification on : Thursday, January 11, 2018 - 6:19:56 AM

Identifiers

  • HAL Id : inria-00108073, version 1

Collections

Citation

Joseph Di Martino, Yves Laprie. Suppression of Phasiness for Time-Scale Modifications of Speech Signals Based on a Shape Invariance Property. International Conference on Acoustics, Speech, and Signal Processing - ICASSP 2001, IEEE, 2001, Salt Lake City, United States. pp.853-856. ⟨inria-00108073⟩

Share

Metrics

Record views

175