Automatic Detection of the Prosodic Structures of Speech Utterances

Katarina Bartkova 1 Denis Jouvet 2
2 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper presents an automatic approach for the detection of the prosodic structures of speech utterances. The algorithm relies on a hierarchical representation of the prosodic organization of the speech utterances. The approach is applied on a corpus of radio French broadcast news and also on radio and TV shows which are more spontaneous speech data. The algorithm detects prosodic boundaries whether they are followed or not by pause. The detection of the prosodic boundaries and of the prosodic structures is based on an approach that integrates little linguistic knowledge and mainly uses the amplitude of the F0 slopes and the inversion of the slopes as described in [1], as well as phone durations. The automatic prosodic segmentation results are then compared to a manual prosodic segmentation made by an expert phonetician. Finally, the results obtained by this automatic approach provide an insight into the most frequently used prosodic structures in the broadcasting speech style as well as in a more spontaneous speech style.
Type de document :
Communication dans un congrès
Miloš Železný and Ivan Habernal and Andrey Ronzhin. SPECOM - 15th International Conference on Speech and Computer - 2013, Sep 2013, Pilsen, Czech Republic. Springer Verlag, 8113, pp.1-8, 2013, Lecture Notes in Artificial Intelligence. 〈http://link.springer.com/chapter/10.1007%2F978-3-319-01931-4_1〉
Liste complète des métadonnées

https://hal.inria.fr/hal-00834318
Contributeur : Denis Jouvet <>
Soumis le : vendredi 14 juin 2013 - 16:37:02
Dernière modification le : jeudi 11 janvier 2018 - 06:25:24

Identifiants

  • HAL Id : hal-00834318, version 1

Collections

Citation

Katarina Bartkova, Denis Jouvet. Automatic Detection of the Prosodic Structures of Speech Utterances. Miloš Železný and Ivan Habernal and Andrey Ronzhin. SPECOM - 15th International Conference on Speech and Computer - 2013, Sep 2013, Pilsen, Czech Republic. Springer Verlag, 8113, pp.1-8, 2013, Lecture Notes in Artificial Intelligence. 〈http://link.springer.com/chapter/10.1007%2F978-3-319-01931-4_1〉. 〈hal-00834318〉

Partager

Métriques

Consultations de la notice

203