L. Sproull, M. Subramani, S. Kiesler, J. H. Walker, and K. Waters, When the interface is a face, Human-Computer Interaction, vol.11, issue.2, pp.97-124, 1996.

I. S. Pandzic, J. Ostermann, and D. Millen, User evaluation: Synthetic talking faces for interactive services, The visual computer, vol.15, issue.7-8, pp.330-340, 1999.

D. M. Dehn and S. van Mulken, The impact of animated interface agents: a review of empirical research, International journal of human-computer studies, vol.52, issue.1, pp.1-22, 2000.

J. Ostermann and D. Millen, Talking heads and synthetic speech: An architecture for supporting electronic commerce, 2000 IEEE International Conference on Multimedia and Expo (ICME 2000), vol.1, pp.71-74, 2000.

F. Eyben, S. Buchholz, N. Braunschweiler, J. Latorre, V. Wan et al., Unsupervised clustering of emotion and voice styles for expressive TTS, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4009-4012, 2012.

M. Charfuelan and I. Steiner, Expressive speech synthesis in MARY TTS using audiobook data and EmotionML, INTERSPEECH, pp.1564-1568, 2013.

X. Li, Z. Wu, H. M. Meng, J. Jia, X. Lou et al., Expressive speech driven talking avatar synthesis with DBLSTM using limited amount of emotional bimodal data, INTERSPEECH, pp.1477-1481, 2016.

S. An, Z. Ling, and L. Dai, Emotional statistical parametric speech synthesis using LSTM-RNNs, 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp.1613-1616, 2017.

Y. Zhang, Y. Liu, F. Weninger, and B. Schuller, Multi-task deep neural network with shared hidden layers: Breaking down the wall between emotion representations, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, pp.4990-4994, 2017.

P. Ekman, An argument for basic emotions, Cognition & emotion, vol.6, issue.3-4, pp.169-200, 1992.

J. A. Russell, A circumplex model of affect, Journal of personality and social psychology, vol.39, issue.6, p.1161, 1980.
URL : https://hal.archives-ouvertes.fr/hal-01086372

R. Plutchik, Emotions: A general psychoevolutionary theory, Approaches to emotion, pp.197-219, 1984.

R. J. Larsen and E. Diener, Promises and problems with the circumplex model of emotion, 1992.

J. Posner, J. A. Russell, and B. S. Peterson, The circumplex model of affect: An integrative approach to affective neuroscience, cognitive development, and psychopathology, Development and psychopathology, vol.17, issue.3, pp.715-734, 2005.

J. A. Russell and B. Fehr, Fuzzy concepts in a fuzzy hierarchy: Varieties of anger, Journal of personality and social psychology, vol.67, issue.2, p.186, 1994.

G. O. Hofer, K. Richmond, and R. A. Clark, Informed blending of databases for emotional speech synthesis, 2005.

Y. Xue, Y. Hamada, and M. Akagi, Voice conversion for emotional speech: Rule-based synthesis with degree of emotion controllable in dimensional space, Speech Communication, vol.102, pp.54-67, 2018.

G. E. Henter, J. Lorenzo-Trueba, X. Wang, and J. Yamagishi, Principles for learning controllable TTS from annotated and latent variation, pp.3956-3960, 2017.

J. Chorowski, R. J. Weiss, S. Bengio, and A. van den Oord, Unsupervised speech representation learning using WaveNet autoencoders, 2019.

K. Akuzawa, Y. Iwasawa, and Y. Matsuo, Expressive speech synthesis via modeling expressions with variational autoencoder, 2018.

S. Latif, R. Rana, J. Qadir, and J. Epps, Variational autoencoders for learning latent representations of speech emotion: A preliminary study, 2017.

E. Çakır and T. Virtanen, Musical instrument synthesis and morphing in multidimensional latent space using variational, convolutional recurrent autoencoders, Audio Engineering Society Convention 145, 2018.

S. Moore, The Stanislavski system: The professional training of an actor. Penguin, 1984.

K. S. Stanislavski and J. Vilar, La formation de l'acteur, 1963.

T. Bänziger, M. Mortillaro, and K. R. Scherer, Introducing the Geneva multimodal expression corpus for experimental research on emotion perception, Emotion, vol.12, issue.5, p.1161, 2012.

G. Gravier, J. Bonastre, E. Geoffrois, S. Galliano, K. McTait et al., The ESTER evaluation campaign for the rich transcription of French broadcast news, LREC, 2004.

Y. Bengio, A. Courville, and P. Vincent, Representation learning: A review and new perspectives, IEEE transactions on pattern analysis and machine intelligence, vol.35, pp.1798-1828, 2013.

D. P. Kingma and M. Welling, Auto-encoding variational bayes, 2013.

I. Higgins, L. Matthey, A. Pal, C. Burgess, X. Glorot et al., beta-VAE: Learning basic visual concepts with a constrained variational framework, International Conference on Learning Representations, 2017.

F. Roche, T. Hueber, S. Limier, and L. Girin, Autoencoders for music sound synthesis: a comparison of linear, shallow, deep and variational models, 2018.

Z. Wu, O. Watts, and S. King, Merlin: An open source neural network speech synthesis system, pp.202-207, 2016.

L. van der Maaten and G. Hinton, Visualizing data using t-SNE, Journal of machine learning research, vol.9, pp.2579-2605, 2008.

M. Morise, F. Yokomori, and K. Ozawa, WORLD: a vocoder-based high-quality speech synthesis system for real-time applications, IEICE Transactions on Information and Systems, vol.99, issue.7, pp.1877-1884, 2016.

J. N. Bassili, Emotion recognition: the role of facial movement and the relative importance of upper and lower areas of the face, Journal of personality and social psychology, vol.37, issue.11, p.2049, 1979.

E. Costantini, F. Pianesi, and M. Prete, Recognising emotions in human and synthetic faces: the role of the upper and lower parts of the face, Proceedings of the 10th international conference on Intelligent user interfaces, pp.20-27, 2005.

R. Plutchik, The nature of emotions: Human emotions have deep evolutionary roots, a fact that may explain their complexity and provide tools for clinical practice, American scientist, vol.89, issue.4, pp.344-350, 2001.

D. Balouek, A. C. Amarie, G. Charrier, F. Desprez, E. Jeannot et al., Adding virtualization capabilities to the Grid'5000 testbed, International Conference on Cloud Computing and Services Science, pp.3-20, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00946971