Weight Optimization for Bimodal Unit-Selection Talking Head Synthesis

Asterios Toutios 1 Utpala Musti 1 Slim Ouni 1 Vincent Colotte 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : This paper addresses talking head synthesis based on the concatenation of units comprising of both acoustic and visual information. Selection of appropriate diphone units to synthesize a given text string is based on the minimization of a weighted linear combination of four costs that reflect linguistic, acoustic, and visual considerations. We present initial work toward a method to determine automatically the weights applied to each cost, using a series of metrics that assess quantitatively the performance of synthesis.
Type de document :
Communication dans un congrès
ISCA. 12thAnnual Conference of the International Speech Communication Association - Interspeech 2011, Aug 2011, Florence, Italy. 2011
Liste complète des métadonnées

https://hal.inria.fr/inria-00602407
Contributeur : Slim Ouni <>
Soumis le : mercredi 22 juin 2011 - 14:03:27
Dernière modification le : jeudi 11 janvier 2018 - 06:19:55

Identifiants

  • HAL Id : inria-00602407, version 1

Collections

Citation

Asterios Toutios, Utpala Musti, Slim Ouni, Vincent Colotte. Weight Optimization for Bimodal Unit-Selection Talking Head Synthesis. ISCA. 12thAnnual Conference of the International Speech Communication Association - Interspeech 2011, Aug 2011, Florence, Italy. 2011. 〈inria-00602407〉

Partager

Métriques

Consultations de la notice

214