Speaker-Dependent Emotion Recognition For Audio Document Indexing

Xuan Hung Le; Georges Quénot; Eric Castelli

Communication Dans Un Congrès Année : 2004

Speaker-Dependent Emotion Recognition For Audio Document Indexing

(1) , (1) , (2)

1
2

Xuan Hung Le

Fonction : Auteur

Modélisation et Recherche d’Information Multimédia [Grenoble]

Georges Quénot

Fonction : Auteur
PersonId : 3114
IdHAL : georges-quenot
ORCID : 0000-0003-2117-247X
IdRef : 034104518

Modélisation et Recherche d’Information Multimédia [Grenoble]

Eric Castelli

Fonction : Auteur
PersonId : 750232
IdHAL : eric-castelli
ORCID : 0000-0003-2978-2619
IdRef : 068256256

International Research Institute MICA

Résumé

The researches of the emotions are currently great interest in speech processing as well as in human-machine interaction domain. In the recent years, more and more of researches relating to emotion synthesis or emotion recognition are developed for the different purposes. Each approach uses its methods and its various parameters measured on the speech signal. In this paper, we proposed using a short-time parameter: MFCC coefficients (Mel-Frequency Cepstrum Coefficients) and a simple but efficient classifying method: Vector Quantification (VQ) for speaker-dependent emotion recognition. Many other features: energy, pitch, zero crossing, phonetic rate, LPCï¿½ and their derivatives are also tested and combined with MFCC coefficients in order to find the best combination. The other models: GMM and HMM (Discrete and Continuous Hidden Markov Model) are studied as well in the hope that the usage of continuous distribution and the temporal behaviour of this set of features will improve the quality of emotion recognition. The maximum accuracy recognizing five different emotions exceeds 88% by using only MFCC coefficients with VQ model. This is a simple but efficient approach, the result is even much better than those obtained with the same database in human evaluation by listening and judging without returning permission nor comparison between sentences [8]; And this result is positively comparable with the other approaches.

Mots clés

Emotion Recognition

Domaines

Recherche d'information [cs.IR]

Marie-Christine Fauvet : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00953924

Soumis le : vendredi 28 février 2014-16:06:42

Dernière modification le : jeudi 4 avril 2024-18:26:54

Dates et versions

hal-00953924 , version 1 (28-02-2014)

Identifiants

HAL Id : hal-00953924 , version 1

Citer

Xuan Hung Le, Georges Quénot, Eric Castelli. Speaker-Dependent Emotion Recognition For Audio Document Indexing. International Conference on Electronics, Information, 2004, Unknown. ⟨hal-00953924⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS INRIA LIG LIG_TDCGE LIG_TDCGE_MRIM LIG_SIDCH

112 Consultations

1 Téléchargements

Speaker-Dependent Emotion Recognition For Audio Document Indexing

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager