Impact of frame rate on automatic speech-text alignment for corpus-based phonetic studies

Katarina Bartkova; Denis Jouvet

Communication Dans Un Congrès Année : 2015

Impact of frame rate on automatic speech-text alignment for corpus-based phonetic studies

(1) , (2)

1
2

Katarina Bartkova

Fonction : Auteur correspondant
PersonId : 968792

Connectez-vous pour contacter l'auteur

Analyse et Traitement Informatique de la Langue Française

Denis Jouvet

Fonction : Auteur
PersonId : 15904
IdHAL : denis-jouvet
IdRef : 029418666

Speech Modeling for Facilitating Oral-Based Communication

Résumé

Phonetic segmentation is the basis for many phonetic and linguistic studies. As manual segmentation is a lengthy and tedious task, automatic procedures have been developed over the years. They rely on acoustic Hidden Markov Models. Many studies have been conducted, and refinements developed for corpus based speech synthesis, where the technology is mainly used in a speaker-dependent context and applied on good quality speech signals. In a different research direction, automatic speech-text alignment is also used for phonetic and linguistic studies on large speech corpora. In this case, speaker independent acoustic models are mandatory, and the speech quality may not be so good. The speech models rely on 10 ms shift between acoustic frames, and their topology leads to strong minimum duration constraints. This paper focuses on the acoustic analysis frame rate, and gives a first insight on the impact of the frame rate on corpus-based phonetic studies.

Mots clés

Automatic speech-text alignment frame rate pronunciation variants

Domaines

Traitement du signal et de l'image [eess.SP]

Fichier principal

FrameRateAndSpeechTextALignment-V1.2.pdf (424.13 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Denis Jouvet : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01183637

Soumis le : lundi 10 août 2015-15:01:42

Dernière modification le : lundi 11 septembre 2023-18:22:03

Archivage à long terme le : mercredi 11 novembre 2015-10:24:53

Dates et versions

hal-01183637 , version 1 (10-08-2015)

Identifiants

HAL Id : hal-01183637 , version 1

Citer

Katarina Bartkova, Denis Jouvet. Impact of frame rate on automatic speech-text alignment for corpus-based phonetic studies. ICPhS'2015 - 18th International Congress of Phonetic Sciences, Aug 2015, Glasgow, United Kingdom. ⟨hal-01183637⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA ATILF UNIV-LORRAINE INRIA2 CAMPUS-AAR AAI LORIA LORIA-NLPKD

263 Consultations

346 Téléchargements

Impact of frame rate on automatic speech-text alignment for corpus-based phonetic studies

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager