Experiments on the Construction of a Phonetically Balanced Corpus from the Web - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2004

Experiments on the Construction of a Phonetically Balanced Corpus from the Web

Résumé

The construction of a speech recognition system requires a recorded set of phrases to compute the pertinent acoustic models. This set of phrases must be phonetically rich and balanced in order to obtain a robust recognizer. By tradition, this set is defined manually implicating a great human effort. In this paper we propose an automated method for assembling a phonetically balanced corpus (set of phrases) from the Web. The proposed method was used to construct a phonetically balanced corpus for the Mexican Spanish language.
Fichier principal
Vignette du fichier
Villasenor04.pdf (19.58 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00326519 , version 1 (03-10-2008)

Identifiants

  • HAL Id : inria-00326519 , version 1

Citer

Luis Villaseñor-Pineda, Manuel Montes-Y-Gómez, Dominique Vaufreydaz, Jean-François Serignat. Experiments on the Construction of a Phonetically Balanced Corpus from the Web. Conference on Intelligent Text Processing and Computational Linguistics CICLing-2004, Feb 2004, Seoul, South Korea. 4 p. ⟨inria-00326519⟩
127 Consultations
263 Téléchargements

Partager

Gmail Facebook X LinkedIn More