Higher precision pitch marking for TD-PSOLA

Vincent Colotte 1 Yves Laprie 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : The paper describes techniques to improve the precision of prosodic modifications with TD-PSOLA. TD-PSOLA relies on the pitch synchronous decomposition of the signal into overlapping frames synchronised with pitch period. The main objective is thus to preserve the consistency of marks between neighbouring frames with respect to the temporal structure of pitch periods. First, we improve pitch marking by eliminating mismatch errors which appear during rapid formant transitions. This is achieved by pruning pitch mark candidates whose distance with other candidates is clearly not consistent with the current pitch period. From the synthesis point of view we exploit a fast re-sampling method which allows signal frames to be shifted finely where they should appear given both the initial pitch mark and the location of pitch mark for synthesis. Together with the pitch marking improvement, this fast re-sampling method enables very high quality transformations characterised by the absence of noise between harmonics.
Type de document :
Communication dans un congrès
XI European Signal Processing Conference- EUSIPCO 2002, 2002, Toulouse, France, 2002
Liste complète des métadonnées

https://hal.inria.fr/inria-00107610
Contributeur : Publications Loria <>
Soumis le : jeudi 19 octobre 2006 - 09:03:00
Dernière modification le : jeudi 11 janvier 2018 - 06:19:55
Document(s) archivé(s) le : mercredi 29 mars 2017 - 13:02:35

Identifiants

  • HAL Id : inria-00107610, version 1

Collections

Citation

Vincent Colotte, Yves Laprie. Higher precision pitch marking for TD-PSOLA. XI European Signal Processing Conference- EUSIPCO 2002, 2002, Toulouse, France, 2002. 〈inria-00107610〉

Partager

Métriques

Consultations de la notice

404

Téléchargements de fichiers

511