Skip to Main content Skip to Navigation
Conference papers

Weight Optimization for Bimodal Unit-Selection Talking Head Synthesis

Asterios Toutios 1 Utpala Musti 1 Slim Ouni 1 Vincent Colotte 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : This paper addresses talking head synthesis based on the concatenation of units comprising of both acoustic and visual information. Selection of appropriate diphone units to synthesize a given text string is based on the minimization of a weighted linear combination of four costs that reflect linguistic, acoustic, and visual considerations. We present initial work toward a method to determine automatically the weights applied to each cost, using a series of metrics that assess quantitatively the performance of synthesis.
Complete list of metadatas
Contributor : Slim Ouni <>
Submitted on : Wednesday, June 22, 2011 - 2:03:27 PM
Last modification on : Saturday, November 28, 2020 - 10:24:03 AM


  • HAL Id : inria-00602407, version 1



Asterios Toutios, Utpala Musti, Slim Ouni, Vincent Colotte. Weight Optimization for Bimodal Unit-Selection Talking Head Synthesis. 12thAnnual Conference of the International Speech Communication Association - Interspeech 2011, Aug 2011, Florence, Italy. ⟨inria-00602407⟩



Record views