Unit Selection Cost Function Exploration Using an A* based Text-to-Speech System

David Guennec 1, Damien Lolive 1
1 EXPRESSION - Expressiveness in Human Centered Data/Media
UBS - Université de Bretagne Sud, IRISA-D6 - MEDIA ET INTERACTIONS
Abstract: Speech synthesis systems usually rely on the Viterbi algorithm for unit selection, although it is not the only possible choice. In this paper, we study a speech synthesis system that relies on the A* algorithm, a general pathfinding strategy that develops a graph rather than a lattice. Using state-of-the-art techniques, we propose and analyze different selection strategies and assess them through a subjective evaluation of the N-best paths returned. The best strategy achieves a mean opinion score (MOS) of 3.29 (±0.18). More interestingly, the proposed system enables an in-depth analysis of unit selection.
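To make the idea of an A* search returning N-best paths concrete, here is a minimal sketch. It is not the authors' implementation: the function name `n_best_unit_paths`, the dictionary-based candidate lists, the `concat_cost` callback, and the heuristic (the sum of the cheapest remaining target costs) are all assumptions chosen for illustration. The sketch also assumes that every position has at least one candidate unit and that concatenation costs are nonnegative, which keeps the heuristic admissible so that complete paths come off the queue in increasing order of total cost.

```python
import heapq
from itertools import count


def n_best_unit_paths(target_costs, concat_cost, n_best=3):
    """Return up to n_best unit sequences, cheapest first.

    target_costs: list with one dict per target position, mapping a
        (hypothetical) unit id to its target cost.
    concat_cost: function (prev_unit, next_unit) -> nonnegative float.
    """
    length = len(target_costs)

    # heuristic[i] is a lower bound on the cost of filling positions i..end:
    # the cheapest target cost at each position, ignoring joins (assumed >= 0).
    heuristic = [0.0] * (length + 1)
    for i in range(length - 1, -1, -1):
        heuristic[i] = heuristic[i + 1] + min(target_costs[i].values())

    tie = count()  # tie-breaker so the heap never compares paths directly
    # Each entry: (f = g + h, tie-breaker, g = cost so far, partial path)
    frontier = [(heuristic[0], next(tie), 0.0, ())]
    results = []

    while frontier and len(results) < n_best:
        _, _, g, path = heapq.heappop(frontier)
        pos = len(path)
        if pos == length:
            # A complete path popped from the queue is the next best one.
            results.append((g, list(path)))
            continue
        # Expand the partial path with every candidate for the next position.
        for unit, t_cost in target_costs[pos].items():
            join = concat_cost(path[-1], unit) if path else 0.0
            new_g = g + t_cost + join
            new_f = new_g + heuristic[pos + 1]
            heapq.heappush(frontier, (new_f, next(tie), new_g, path + (unit,)))

    return results
```

Because no closed set is used, the search keeps distinct partial paths that end in the same unit, which is what allows it to enumerate several complete paths rather than only the best one. This is one simple way to obtain an N-best list; the paper's system may proceed differently.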
Document type:
Conference paper
International Conference on Text, Speech and Dialogue (TSD), Sep 2014, Brno, Czech Republic. 2014, 〈http://www.tsdconference.org/tsd2014/〉

https://hal.inria.fr/hal-01133321
Contributor: Expression Irisa
Submitted on: Thursday, March 19, 2015 - 09:23:44
Last modified on: Tuesday, January 16, 2018 - 15:54:23

Identifiers

  • HAL Id: hal-01133321, version 1

Citation

David Guennec, Damien Lolive. Unit Selection Cost Function Exploration Using an A* based Text-to-Speech System. International Conference on Text, Speech and Dialogue (TSD), Sep 2014, Brno, Czech Republic. 2014, 〈http://www.tsdconference.org/tsd2014/〉. 〈hal-01133321〉