Skip to Main content Skip to Navigation
Journal articles

The production and recognition of emotions in speech: features and algorithms

Pierre-Yves Oudeyer 1, 2, 3
1 Flowers - Flowing Epigenetic Robots and Systems
Inria Bordeaux - Sud-Ouest, U2IS - Unité d'Informatique et d'Ingénierie des Systèmes
Abstract : This paper presents algorithms that allow a robot to express its emotions by modulating the intonation of its voice. They are very simple and efficiently provide life-like speech thanks to the use of concatenative speech synthesis. We describe a technique which allows to continuously control both the age of a synthetic voice and the quantity of emotions that are expressed. Also, we present the first large-scale data mining experiment about the automatic recognition of basic emotions in informal everyday short utterances. We focus on the speaker-dependent problem. We compare a large set of machine learning algorithms, ranging from neural networks, Support Vector Machines or decision trees, together with 200 features, using a large database of several thousands examples. We show that the difference of performance among learning schemes can be substantial, and that some features which were previously unexplored are of crucial importance. An optimal feature set is derived through the use of a genetic algorithm. Finally, we explain how this study can be applied to real world situations in which very few examples are available. Furthermore, we describe a game to play with a personal robot which facilitates teaching of examples of emotional utterances in a natural and rather unconstrained manner. Keywords: Emotions; Speech; Robots; Emotion production; Emotion recognition
Complete list of metadata
Contributor : Pierre-Yves Oudeyer <>
Submitted on : Friday, April 26, 2013 - 11:41:32 AM
Last modification on : Wednesday, July 3, 2019 - 10:48:05 AM

Links full text




Pierre-Yves Oudeyer. The production and recognition of emotions in speech: features and algorithms. International Journal of Human-Computer Studies, Elsevier, 2003, 59, pp.157-183. ⟨10.1016/S1071-5819(02)00141-6⟩. ⟨hal-00818196⟩



Record views