Suppression of Phasiness for Time-Scale Modifications of Speech Signals Based on a Shape Invariance Property

Joseph Di Martino 1 Yves Laprie 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : Time-scale modifications of speech signals, based on frequency-domain techniques, are hampered by two important artifacts which are ''phasiness'' and ''transient smearing''. They correspond to the destruction of the shape of the original signal, i.e. the de-synchronization between the phases of frequency components. This paper describes an algorithm that preserves the shape invariance of speech signals in the context of a phase vocoder. Phases are corrected at the onset of each voiced region. Modified signals, even for large expansion factors, are of high quality and free from transient smearing or phasiness. A demonstration is proposed in the web page: http://www.loria.fr/~jdm/PhaseVocoder/index.html where some audio files can be down-loaded.
Type de document :
Communication dans un congrès
International Conference on Acoustics, Speech, and Signal Processing - ICASSP 2001, 2001, Salt Lake City, United States. 2, pp.853-856, 2001
Liste complète des métadonnées

https://hal.inria.fr/inria-00108073
Contributeur : Joseph Di Martino <>
Soumis le : jeudi 19 octobre 2006 - 15:40:46
Dernière modification le : jeudi 11 janvier 2018 - 06:19:56

Identifiants

  • HAL Id : inria-00108073, version 1

Collections

Citation

Joseph Di Martino, Yves Laprie. Suppression of Phasiness for Time-Scale Modifications of Speech Signals Based on a Shape Invariance Property. International Conference on Acoustics, Speech, and Signal Processing - ICASSP 2001, 2001, Salt Lake City, United States. 2, pp.853-856, 2001. 〈inria-00108073〉

Partager

Métriques

Consultations de la notice

149