Conference papers

Weight Optimization for Bimodal Unit-Selection Talking Head Synthesis

Asterios Toutios 1 Utpala Musti 1 Slim Ouni 1 Vincent Colotte 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract: This paper addresses talking head synthesis based on the concatenation of units comprising both acoustic and visual information. Selection of appropriate diphone units to synthesize a given text string is based on the minimization of a weighted linear combination of four costs that reflect linguistic, acoustic, and visual considerations. We present initial work toward a method to automatically determine the weight applied to each cost, using a series of metrics that quantitatively assess synthesis performance.
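As an illustration of the weighted linear combination mentioned in the abstract, the following is a minimal sketch and not the authors' implementation. The function names, the candidate labels, the cost values, and the weights are all hypothetical. The abstract does not name the four individual costs, so they appear here only as an unnamed 4-element vector, and the sketch picks a single unit in isolation rather than searching over a whole sequence.

```python
from typing import Mapping, Sequence


def combined_cost(costs: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted linear combination of per-unit costs (hypothetical helper)."""
    if len(costs) != len(weights):
        raise ValueError("costs and weights must have the same length")
    return sum(w * c for w, c in zip(weights, costs))


def select_unit(candidates: Mapping[str, Sequence[float]],
                weights: Sequence[float]) -> str:
    """Return the candidate whose combined cost is smallest."""
    return min(candidates, key=lambda name: combined_cost(candidates[name], weights))


# Assumed values for illustration only: four weights, one per cost.
weights = [0.4, 0.3, 0.2, 0.1]

# Two hypothetical candidate diphone units, each with four cost values.
candidates = {
    "unit_a": [1.0, 0.5, 0.2, 0.8],
    "unit_b": [0.3, 0.9, 0.4, 0.1],
}

best = select_unit(candidates, weights)
print(best)
```

Changing the weights changes which candidate wins, which is why choosing them automatically, as the paper proposes, matters for synthesis quality.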
Contributor : Slim Ouni
Submitted on : Wednesday, June 22, 2011 - 2:03:27 PM
Last modification on : Thursday, January 20, 2022 - 5:28:26 PM


  • HAL Id : inria-00602407, version 1



Asterios Toutios, Utpala Musti, Slim Ouni, Vincent Colotte. Weight Optimization for Bimodal Unit-Selection Talking Head Synthesis. 12th Annual Conference of the International Speech Communication Association - Interspeech 2011, Aug 2011, Florence, Italy. ⟨inria-00602407⟩


