Skip to Main content Skip to Navigation
Conference papers

Semi-automatic phonetic labelling of large corpora

Abstract : The aim of the present paper is to present a methodology to semi-automatically label large corpora. This methodology is based on three main points: using several concurrent automatic stochastic labellers, decomposing the labelling of the whole corpus into an iterative refining process and building a labelling comparison procedure which takes into account phonologic and acoustic-phonetic rules to evaluate the similarity of the various labelling of one sentence. After having detailed these three points, we describe our HMM-based labelling tool and we describe the application of that methodology to the Swiss French POLYPHON database.
Keywords : Automatic labelling
Complete list of metadatas

Cited literature [2 references]  Display  Hide  Download

https://hal.inria.fr/hal-01727539
Contributor : Odile Mella <>
Submitted on : Friday, March 9, 2018 - 11:42:00 AM
Last modification on : Tuesday, April 24, 2018 - 1:33:56 PM
Document(s) archivé(s) le : Sunday, June 10, 2018 - 1:57:39 PM

File

euro97.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01727539, version 1

Collections

Citation

Odile Mella, Dominique Fohr. Semi-automatic phonetic labelling of large corpora . EUROPSPEECH'97 - Fifth European conference on speech communication and technology, Sep 1997, Rhodes, Greece. ⟨hal-01727539⟩

Share

Metrics

Record views

165

Files downloads

40