M. A. Arbib, From monkey-like action recognition to human language: An evolutionary framework for neurolinguistics, Behavioral and Brain Sciences, vol.28, issue.02, pp.105-124, 2005.
DOI : 10.1017/S0140525X05000038

URL : http://www.ecs.soton.ac.uk/~harnad/Temp/arbib-bbs.pdf

K. Arnold and K. Zuberbühler, Meaningful call combinations in a non-human primate, Current Biology, vol.18, issue.5, pp.202-203, 2008.
DOI : 10.1016/j.cub.2008.01.040

URL : https://doi.org/10.1016/j.cub.2008.01.040

K. Arnold and K. Zuberbühler, Call combinations in monkeys: compositional or idiomatic expressions? Brain Lang, pp.303-309, 2012.
DOI : 10.1016/j.bandl.2011.10.001

URL : http://doc.rero.ch/record/278480/files/Arnold_K.-Call_combination_20170131090359-OV.pdf

J. M. Baker, L. Deng, J. Glass, S. Khudanpur, C. Lee et al., Developments and directions in speech recognition and understanding, Part 1 [DSP Education], IEEE Signal Processing Magazine, vol.26, issue.3, pp.75-80, 2009.
DOI : 10.1109/MSP.2009.932166

J. M. Baker, L. Deng, S. Khudanpur, C. Lee, J. R. Glass et al., Updated MINDS report on speech recognition and understanding, Part 2 [DSP Education], IEEE Signal Processing Magazine, vol.26, issue.4, pp.78-85, 2009.
DOI : 10.1109/MSP.2009.932707

B. Balentine, It's Better to Be a Good Machine Than a Bad Person: Speech Recognition and Other Exotic User Interfaces at the Twilight of the Jetsonian Age, 2007.

L. Barsalou, A. Santos, W. Simmons, W. , and C. , Language and simulation in conceptual processing, Symbols, Embodiment, and Meaning, pp.245-283, 2008.
DOI : 10.1093/acprof:oso/9780199217274.003.0013

J. R. Bellegarda and C. Monz, State of the art in statistical methods for language and speech processing, Computer Speech & Language, vol.35, 2015.
DOI : 10.1016/j.csl.2015.07.001

T. Belpaeme, S. J. Cowley, J. I. Benichov, S. E. Benezra, D. Vallentin et al., Foreword, Interact. Stud. Curr. Biol, vol.8, issue.26, pp.309-318, 2007.
DOI : 10.1075/bct.21.01bel

N. O. Bernsen, H. Dybkjaer, and L. Dybkjaer, Designing Interactive Speech Systems: From First Ideas to User Testing, 1998.
DOI : 10.1007/978-1-4471-0897-9

R. C. Berwick, A. D. Friederici, N. Chomsky, and J. J. Bolhuis, Evolution, brain, and the nature of language, Trends in Cognitive Sciences, vol.17, issue.2, pp.89-98, 2013.
DOI : 10.1016/j.tics.2012.12.002

R. C. Berwick, K. Okanoya, G. J. Beckers, and J. J. Bolhuis, Songs to syntax: the linguistics of birdsong, Trends in Cognitive Sciences, vol.15, issue.3, pp.113-121, 2011.
DOI : 10.1016/j.tics.2011.01.002

M. H. Bickhard, Language as an interaction system, New Ideas in Psychology, vol.25, issue.2, pp.171-187, 2007.
DOI : 10.1016/j.newideapsych.2007.02.006

URL : http://www.lehigh.edu/~mhb0/LanguageInteractSys.pdf

D. T. Blumstein, ALARM CALLING IN THREE SPECIES OF MARMOTS, Behaviour, vol.136, issue.6, pp.731-757, 1999.
DOI : 10.1163/156853999501540

URL : https://www.eeb.ucla.edu/Faculty/Blumstein/pdf reprints/Blumstein1999_Behaviour.pdf

D. T. Blumstein and K. Armitage, Alarm calling in yellow-bellied marmots: I. The meaning of situationally variable alarm calls, Animal Behaviour, vol.53, issue.1, pp.143-171, 1996.
DOI : 10.1006/anbe.1996.0285

J. N. Bohannon and A. L. Marquis, Children's control of adult speech. Child Dev, pp.1002-1008, 1977.

E. Bolund, H. Schielzeth, and W. Forstmeier, Singing activity stimulates partner reproductive investment rather than increasing paternity success in zebra finches, Behavioral Ecology and Sociobiology, vol.53, issue.6, pp.975-984, 2012.
DOI : 10.1016/0022-5193(75)90111-3

H. P. Branigan, M. J. Pickering, J. Pearson, J. F. Mclean, and A. Brown, The role of beliefs in lexical alignment: Evidence from dialogs with humans and computers, Cognition, vol.121, issue.1, pp.41-57, 2011.
DOI : 10.1016/j.cognition.2011.05.011

C. Breazeal, Emotion and sociable humanoid robots, International Journal of Human-Computer Studies, vol.59, issue.1-2, pp.119-15510, 2003.
DOI : 10.1016/S1071-5819(03)00018-1

URL : http://robotic.media.mit.edu/Papers/Breazeal-ijhcs03.pdf

H. Brumm and P. J. Slater, Animals can vary signal amplitude with receiver distance: evidence from zebra finch song, Animal Behaviour, vol.72, issue.3, pp.699-705, 2006.
DOI : 10.1016/j.anbehav.2006.01.020

J. Brzoska, Vocal response of male European water frogs (Rana Esculenta complex) to mating and territorial calls, Behavioural Processes, vol.7, issue.1, pp.37-4710, 1982.
DOI : 10.1016/0376-6357(82)90051-1

T. Bugnyar, S. A. Reber, and C. Buckner, Ravens attribute visual access to unseen competitors Differentiation, dynamical integration and functional emotional development, Nat. Commun. Emot. Rev, vol.3, pp.138-14610, 1177.

A. Candiotti, K. Zuberbühler, and A. Lemasson, Context-related call combinations in female Diana monkeys, Animal Cognition, vol.113, issue.3, pp.327-339, 2012.
DOI : 10.1037/0735-7036.113.1.33

URL : https://hal.archives-ouvertes.fr/hal-01021857

A. Candiotti, K. Zuberbühler, and A. Lemasson, Voice discrimination in four primates, Behavioural Processes, vol.99, pp.67-72, 2013.
DOI : 10.1016/j.beproc.2013.06.010

URL : https://hal.archives-ouvertes.fr/hal-01019945

A. Cangelosi, The grounding and sharing of symbols, Pragmat. Cogn, vol.14, pp.275-285, 2006.
DOI : 10.1075/bct.16.07can

A. Cangelosi, R. , and T. , An Embodied Model for Sensorimotor Grounding and Grounding Transfer: Experiments With Epigenetic Robots, Cognitive Science, vol.28, issue.4, pp.673-689, 2006.
DOI : 10.1080/09540090500281554

URL : http://onlinelibrary.wiley.com/doi/10.1207/s15516709cog0000_72/pdf

B. D. Charlton, W. A. Ellis, J. Brumm, K. Nilsson, and W. T. Fitch, Female koalas prefer bellows in which lower formants indicate larger males, Animal Behaviour, vol.84, issue.6, 2012.
DOI : 10.1016/j.anbehav.2012.09.034

Y. Chen, L. E. Matheson, and J. T. Sakata, Mechanisms underlying the social enhancement of vocal learning in songbirds, Proceedings of the National Academy of Sciences, vol.38, issue.24, pp.6641-6646, 2016.
DOI : 10.1523/JNEUROSCI.4445-14.2015

URL : http://www.pnas.org/content/113/24/6641.full.pdf

D. L. Cheney and R. M. Seyfarth, Vervet Monkey Alarm Calls: Manipulation Through Shared Information?, Behaviour, vol.94, issue.1, pp.150-166, 1985.
DOI : 10.1163/156853985X00316

F. Chersi, S. Thill, T. Ziemke, and A. M. Borghi, Sentence processing: linking language to motor chains, Frontiers in Neurorobotics, vol.4, 2010.
DOI : 10.3389/fnbot.2010.00004

URL : https://www.frontiersin.org/articles/10.3389/fnbot.2010.00004/pdf

E. Clara, L. Tommasi, R. , and L. J. , Social mobbing calls in common marmosets (Callithrix jacchus): effects of experience and associated cortisol levels, Animal Cognition, vol.105, issue.7, pp.349-358, 2008.
DOI : 10.1046/j.1439-0310.1999.00396.x

E. Clarke, U. H. Reichard, and K. Zuberbühler, The Syntax and Meaning of Wild Gibbon Songs, PLoS ONE, vol.47, issue.1, 2006.
DOI : 10.1371/journal.pone.0000073.t002

URL : https://doi.org/10.1371/journal.pone.0000073

Z. Clay and K. Zuberbühler, Bonobos Extract Meaning from Call Sequences, PLoS ONE, vol.17, issue.4, 2011.
DOI : 10.1371/journal.pone.0018786.s009

URL : https://doi.org/10.1371/journal.pone.0018786

S. Coradeschi, A. Loutfi, and B. Wrede, A Short Review of Symbol Grounding in Robotic and Intelligent Systems, KI - K??nstliche Intelligenz, vol.16, issue.4, pp.129-136, 2013.
DOI : 10.1162/artl_a_00007

S. J. Cowley, Distributed Language, 2011.

C. Crockford, R. M. Wittig, and K. Zuberbühler, An intentional vocalization draws others??? attention: A playback experiment with wild chimpanzees, Animal Cognition, vol.40, issue.1444, 2014.
DOI : 10.1017/CBO9780511921643.013

C. R. Crowell, M. Scheutz, P. Schermerhorn, and M. Villano, Gendered voice and robot entities: Perceptions and reactions of male and female subjects, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp.3735-3741, 2009.
DOI : 10.1109/IROS.2009.5354204

J. Crumpton and C. L. Bethel, A Survey of Using Vocal Prosody to Convey Emotion in Robot Speech, International Journal of Social Robotics, vol.32, issue.3, pp.271-285, 2016.
DOI : 10.1109/MSP.2014.2359987

F. Cummins, Voice, (inter-)subjectivity, and real time recurrent interaction, Frontiers in Psychology, vol.5, 2014.
DOI : 10.3389/fpsyg.2014.00760

URL : http://journal.frontiersin.org/article/10.3389/fpsyg.2014.00760/pdf

J. Cynx, R. Lewis, B. Tavel, and H. Tse, Amplitude regulation of vocalizations in noise by a songbird,Taeniopygia guttata, Animal Behaviour, vol.56, issue.1, pp.107-113, 1998.
DOI : 10.1006/anbe.1998.0746

C. Darwin, The Expression of the Emotions in Man and Animals, 1872.

R. Dawkins, The Blind Watchmaker, 1991.

J. De-greeff and T. Belpaeme, Why Robots Should Be Social: Enhancing Machine Learning through Social Human-Robot Interaction, PLOS ONE, vol.9, issue.2, 2015.
DOI : 10.1371/journal.pone.0138061.s002

C. De-looze, S. Scherer, B. Vaughan, and N. Campbell, Investigating automatic measurements of prosodic accommodation and its dynamics in social interaction, Speech Communication, vol.58, 2014.
DOI : 10.1016/j.specom.2013.10.002

A. J. Doupe and P. K. Kuhl, BIRDSONG AND HUMAN SPEECH: Common Themes and Mechanisms, Annual Review of Neuroscience, vol.22, issue.1, 1999.
DOI : 10.1146/annurev.neuro.22.1.567

G. Dove, On the need for Embodied and Dis-Embodied Cognition, Frontiers in Psychology, vol.1, 2011.
DOI : 10.3389/fpsyg.2010.00242

URL : http://journal.frontiersin.org/article/10.3389/fpsyg.2010.00242/pdf

J. Dowling and M. S. Webster, ) with moderate cuckoldry rates, Behavioral Ecology, vol.63, issue.1, pp.228-236, 2016.
DOI : 10.1093/beheco/arn015

P. Ekman, Basic emotions, " in Handbook of Cognition and Emotion, pp.301-320, 1999.

J. E. Elie, M. M. Mariette, H. A. Soula, S. C. Griffith, N. Mathevon et al., Vocal communication at the nest between mates in wild zebra finches: a private vocal duet?, Animal Behaviour, vol.80, issue.4, pp.597-605, 2010.
DOI : 10.1016/j.anbehav.2010.06.003

URL : https://hal.archives-ouvertes.fr/hal-00794060

S. Engesser, J. M. Crane, J. L. Savage, A. F. Russell, T. et al., Experimental Evidence for Phonemic Contrasts in a Nonhuman Vocal System, PLOS Biology, vol.1, issue.6, 2015.
DOI : 10.1371/journal.pbio.1002171.s005

URL : https://doi.org/10.1371/journal.pbio.1002171

H. C. Eskelinen, K. A. Winship, B. L. Jones, A. E. Ames, and S. A. Kuczaj, Acoustic behavior associated with cooperative task success in bottlenose dolphins (Tursiops truncatus), Animal Cognition, vol.123, issue.4, pp.789-797, 2016.
DOI : 10.1037/a0015838

A. Esposito and A. Esposito, On Speech and Gestures Synchrony, Lecture Notes in Computer Science, vol.84, issue.2, pp.252-272, 2011.
DOI : 10.1037/0033-2909.84.5.963

F. Eyssel, D. Kuchenbrandt, S. Bobinger, L. De-ruiter, and F. Hegel, 'If you sound like me, you must be more human', Proceedings of the seventh annual ACM/IEEE international conference on Human-Robot Interaction, HRI '12, p.125, 2012.
DOI : 10.1145/2157689.2157717

G. Fang, F. Jiang, P. Yang, J. Cui, S. E. Brauth et al., Male vocal competition is dynamic and strongly affected by social contexts in music frogs, Animal Cognition, vol.30, issue.3, 2014.
DOI : 10.1007/978-1-4612-4738-8_4

URL : http://210.75.237.14/bitstream/351003/26382/1/2014e0181h.pdf

J. A. Feldman, From Molecules to Metaphor: A Neural Theory of Language, 2008.

J. Fellous and M. Arbib, Who Needs Emotions? The Brain Meets the Robot, 2005.

A. Fernald, Four-month-old infants prefer to listen to motherese, Infant Behavior and Development, vol.8, issue.2, pp.181-19510, 1985.
DOI : 10.1016/S0163-6383(85)80005-9

C. Fichtel and C. P. Van-schaik, Semantic Differences in Sifaka (Propithecus verreauxi) Alarm Calls: A Reflection of Genetic or Cultural Variants?, Ethology, vol.51, issue.9, 2006.
DOI : 10.1023/A:1005594625841

M. S. Ficken and J. W. Popp, A Comparative Analysis of Passerine Mobbing Calls, The Auk, vol.113, issue.2, pp.370-380, 1996.
DOI : 10.2307/4088904

W. Fitch, The evolution of speech: a comparative review, Trends in Cognitive Sciences, vol.4, issue.7, pp.258-26710, 2000.
DOI : 10.1016/S1364-6613(00)01494-7

URL : http://www.wjh.harvard.edu/~tec/Fitch2000TICS.pdf

W. T. Fitch, The Evolution of Language, 2010.
DOI : 10.1017/CBO9780511817779

W. T. Fitch, Rhythmic cognition in humans and animals: distinguishing meter and pulse perception, Frontiers in Systems Neuroscience, vol.7, 2013.
DOI : 10.3389/fnsys.2013.00068

URL : https://doi.org/10.3389/fnsys.2013.00068

W. T. Fitch, R. , and D. , The descended larynx is not uniquely human, Proc. Biol. Sci, vol.268, 2001.

K. Friston and C. Frith, A Duet for one, Consciousness and Cognition, vol.36, pp.390-405, 2015.
DOI : 10.1016/j.concog.2014.12.003

URL : https://doi.org/10.1016/j.concog.2014.12.003

K. Friston and S. Kiebel, Predictive coding under the free-energy principle, Philosophical Transactions of the Royal Society B: Biological Sciences, vol.335, issue.6188, pp.1211-12210300, 2008.
DOI : 10.1038/335311a0

URL : http://rstb.royalsocietypublishing.org/content/royptb/364/1521/1211.full.pdf

R. Fusaroli, J. Raczaszek-leonardi, and K. Tylén, Dialog as interpersonal synergy, New Ideas in Psychology, vol.32, 2014.
DOI : 10.1016/j.newideapsych.2013.03.005

M. Gales, Y. , and S. J. , The application of hidden Markov models in speech recognition. Found. Trends Signal Process, pp.195-304, 2007.

J. Ganger and M. R. Brent, Reexamining the Vocabulary Spurt., Developmental Psychology, vol.40, issue.4, pp.621-632, 2004.
DOI : 10.1037/0012-1649.40.4.621

URL : http://www.pitt.edu/~jganger/GangerBrent2004.pdf

S. Garrod, C. Gambi, and M. J. Pickering, Prediction at all levels: forward model predictions can enhance comprehension, Language, Cognition and Neuroscience, vol.29, issue.1, pp.46-48, 2013.
DOI : 10.1016/S1364-6613(97)01070-X

D. Gil and M. Gahr, The honesty of bird song: multiple constraints for multiple traits, Trends in Ecology & Evolution, vol.17, issue.3, pp.133-141, 2002.
DOI : 10.1016/S0169-5347(02)02410-2

K. Gillespie-lynch, P. M. Greenfield, Y. Feng, S. Savage-rumbaugh, L. et al., A Cross-Species Study of Gesture and Its Role in Symbolic Development: Implications for the Gestural Theory of Language Evolution, Frontiers in Psychology, vol.4, 2013.
DOI : 10.3389/fpsyg.2013.00160

R. Gisiner and R. J. Schusterman, California sea lion pups play an active role in reunions with their mothers, Animal Behaviour, vol.41, issue.2, pp.364-366, 1991.
DOI : 10.1016/S0003-3472(05)80488-9

B. A. Goldfield and J. S. Reznick, Early lexical acquisition: rate, content, and the vocabulary spurt, Journal of Child Language, vol.38, issue.01, pp.171-183, 1990.
DOI : 10.2307/412864

A. Gopnik, A. N. Meltzoff, and P. K. Kuhl, The Scientist in the Crib, 2001.

T. U. Grafe, J. H. Bitz, and M. Wink, Song repertoire and duetting behaviour of the tropical boubou, Laniarius aethiopicus, Animal Behaviour, vol.68, issue.1, pp.181-191, 2004.
DOI : 10.1016/j.anbehav.2003.11.004

E. Greene and T. Meagher, Red squirrels,Tamiasciurus hudsonicus, produce predator-class specific alarm calls, Animal Behaviour, vol.55, issue.3, pp.511-518, 1997.
DOI : 10.1006/anbe.1997.0620

M. Gridi-papp, A. S. Rand, R. , and M. J. , Animal communication: complex call production in the túngara frog Mobbing calls signal predator category in a kin group-living bird species, Nature Proc. Biol. Sci, vol.441, issue.276, pp.2887-28920551, 2006.

P. E. Griffiths and A. Scarantino, Emotions in the Wild, Cambridge Handbook of Situated Cognition, pp.437-453, 2005.
DOI : 10.1017/CBO9780511816826.023

S. R. Hage, T. Jiang, S. W. Berquist, J. Feng, and W. Metzner, Ambient noise induces independent shifts in call frequency and amplitude within the Lombard effect in echolocating bats, Proceedings of the National Academy of Sciences, vol.112, issue.1, pp.4063-4068, 2013.
DOI : 10.1037/0033-2909.112.1.155

URL : http://www.pnas.org/content/110/10/4063.full.pdf

W. Halfwerk, A. Lea, M. Guerra, R. Page, R. et al., Vocal responses to noise reveal the presence of the Lombard effect in a frog A review of hypotheses for the functions of avian duetting, Behav. Ecol. Behav. Ecol. Sociobiol, vol.27, issue.55, pp.669-676, 2004.

M. L. Hall, S. A. Kingma, and A. Peters, Male Songbird Indicates Body Size with Low-Pitched Advertising Songs, PLoS ONE, vol.22, issue.2, 2013.
DOI : 10.1371/journal.pone.0056717.s005

URL : https://doi.org/10.1371/journal.pone.0056717

E. B. Hanggi and R. J. Schusterman, Kin recognition in captive California sea lions (Zalophus californianus)., Journal of Comparative Psychology, vol.104, issue.4, 1990.
DOI : 10.1037/0735-7036.104.4.368

M. Haring, N. Bee, A. , and E. , Creation and Evaluation of emotion expression with body movement, sound and eye color for humanoid robots, 2011 RO-MAN, pp.204-209, 2011.
DOI : 10.1109/ROMAN.2011.6005263

S. Harnad, The symbol grounding problem, Physica D: Nonlinear Phenomena, vol.42, issue.1-3, pp.335-346, 1990.
DOI : 10.1016/0167-2789(90)90087-6

URL : http://arxiv.org/pdf/cs/9906002

F. H. Harrington and L. D. Mech, Wolf pack spacing: Howling as a territory-independent spacing mechanism in a territorial population, Behavioral Ecology and Sociobiology, vol.6, issue.2, pp.161-168, 1983.
DOI : 10.1007/BF00343208

M. D. Hauser, N. Chomsky, and W. T. Fitch, The Faculty of Language: What Is It, Who Has It, and How Did It Evolve?, Science, vol.298, issue.5598, pp.1569-1579, 2002.
DOI : 10.1126/science.298.5598.1569

G. Hinton, L. Deng, D. Yu, G. E. Dahl, A. Mohamed et al., Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups, IEEE Signal Processing Magazine, vol.29, issue.6, pp.82-97, 2012.
DOI : 10.1109/MSP.2012.2205597

J. Holler, L. Schubotz, S. Kelly, P. Hagoort, M. Schuetze et al., Social eye gaze modulates processing of speech and co-speech gesture, Cognition, vol.133, issue.3, pp.692-697, 2014.
DOI : 10.1016/j.cognition.2014.08.008

URL : http://pubman.mpdl.mpg.de/pubman/item/escidoc:2051347/component/escidoc:2058507/Holler et al_2014_social gaze.pdf

S. Hooper, D. Reiss, M. Carter, and B. Mccowan, Importance of contextual saliency on vocal imitation by bottlenose dolphins, Int. J. Comp. Psychol, vol.19, pp.116-128, 2006.

S. L. Hopp and C. S. Evans, Acoustic Communication in Animals, 1998.
DOI : 10.1007/978-3-642-76220-8

A. Horowitz and J. Hecht, Examining dog???human play: the characteristics, affect, and vocalizations of a unique interspecific interaction, Animal Cognition, vol.261, issue.4, pp.779-788, 2016.
DOI : 10.1038/scientificamerican0789-78

C. F. Hotchkin, S. E. Parks, and D. J. Weiss, Vocal modifications in primates: effects of noise and behavioral context on vocalization structure, Proc. Meet. Acoust, 2013.
DOI : 10.1121/1.4799257

URL : http://asa.scitation.org/doi/pdf/10.1121/1.4799257

I. S. Howard and P. Messum, Learning to Pronounce First Words in Three Languages: An Investigation of Caregiver and Infant Behavior Using a Computational Model of an Infant, PLoS ONE, vol.60, issue.4, 2014.
DOI : 10.1371/journal.pone.0110334.s003

C. R. Hurd, Interspecific attraction to the mobbing calls of black-capped chickadees ( Parus atricapillus ), Behavioral Ecology and Sociobiology, vol.38, issue.4, pp.287-292, 1996.
DOI : 10.1007/s002650050244

S. J. Insley, Mother???Offspring vocal recognition in northern fur seals is mutual but asymmetrical, Animal Behaviour, vol.61, issue.1, pp.129-137, 2000.
DOI : 10.1006/anbe.2000.1569

H. Ishihara, Y. Yoshikawa, K. Miura, and M. Asada, How Caregiver's Anticipation Shapes Infant's Vowel Through Mutual Imitation, IEEE Transactions on Autonomous Mental Development, vol.1, issue.4, pp.217-225, 2009.
DOI : 10.1109/TAMD.2009.2038988

E. D. Jarvis, Learned Birdsong and the Neurobiology of Human Language, Annals of the New York Academy of Sciences, vol.11, issue.1, pp.749-777, 2004.
DOI : 10.1212/WNL.55.8.1151

E. D. Jarvis, The Evolution of Vocal Learning Systems in Birds and Humans, Evolution of Nervous Systems, pp.213-228, 2006.
DOI : 10.1016/B0-12-370878-8/00136-1

E. D. Jarvis, Selection for and against vocal learning in birds and mammals, Ornithological Science, vol.5, issue.1, 2006.
DOI : 10.2326/osj.5.5

T. Jones, S. Lawson, and D. Mills, Interaction with a zoomorphic robot that exhibits canid mechanisms of behaviour, 2008 IEEE International Conference on Robotics and Automation, pp.2128-2133, 2008.
DOI : 10.1109/ROBOT.2008.4543521

P. W. Joslin, Movements and homesites of timber wolves in Algonquin Park, Am. Zool, vol.7, 1967.

J. Kaminski, J. Call, and J. Fischer, Word Learning in a Domestic Dog: Evidence for "Fast Mapping", Science, vol.304, issue.5677, pp.1682-1683, 2004.
DOI : 10.1126/science.1097859

A. Kershenbaum, D. T. Blumstein, M. A. Roch, Ç. Akçay, G. Backus et al., Acoustic sequences in non-human animals: a tutorial review and prospectus, Biological Reviews, vol.8, issue.1, pp.13-52, 2016.
DOI : 10.1201/9781420010893

URL : http://europepmc.org/articles/pmc4444413?pdf=render

A. Kershenbaum, A. Ilany, L. Blaustein, and E. Geffen, Syntactic structure and geographical dialects in the songs of male rock hyraxes, Proceedings of the Royal Society B: Biological Sciences, vol.60, issue.1, pp.2974-2981, 2012.
DOI : 10.1006/anbe.2000.1410

URL : http://rspb.royalsocietypublishing.org/content/royprsb/279/1740/2974.full.pdf

A. Kershenbaum, L. S. Sayigh, and V. M. Janik, The Encoding of Individual Identity in Dolphin Signature Whistles: How Much Information Is Needed?, PLoS ONE, vol.13, issue.10, 2013.
DOI : 10.1371/journal.pone.0077671.g006

S. L. King, H. E. Harley, and V. M. Janik, The role of signature whistle matching in bottlenose dolphins, Tursiops truncatus, Animal Behaviour, vol.96, pp.79-86, 2014.
DOI : 10.1016/j.anbehav.2014.07.019

S. L. King and V. M. Janik, Bottlenose dolphins can use learned vocal labels to address each other, Proceedings of the National Academy of Sciences, vol.156, issue.1, pp.13216-13221, 2013.
DOI : 10.1006/anbe.1998.0923

URL : http://www.pnas.org/content/110/32/13216.full.pdf

J. A. Kirsch, O. Gntrkn, R. , and J. , Insight without cortex: Lessons from the avian brain, Consciousness and Cognition, vol.17, issue.2, pp.475-483, 2008.
DOI : 10.1016/j.concog.2008.03.018

C. Knight, M. Studdert-kennedy, and J. R. Hurford, The Evolutionary Emergence of Language, 2000.
DOI : 10.1017/CBO9780511606441

K. I. Kobayasi and K. Okanoya, Context-dependent song amplitude control in Bengalese finches, NeuroReport, vol.14, 2003.
DOI : 10.1097/00001756-200303030-00045

S. Kopp, Social resonance and embodied coordination in face-to-face conversation with artificial interlocutors, Speech Communication, vol.52, issue.6, pp.587-597, 2010.
DOI : 10.1016/j.specom.2010.02.007

URL : http://www.techfak.uni-bielefeld.de/ags/soa/publications/doc/Kopp-SocialResonance.pdf

P. K. Kuhl, Discrimination of speech by nonhuman animals: Basic auditory sensitivities conducive to the perception of speech???sound categories, The Journal of the Acoustical Society of America, vol.70, issue.2, 1981.
DOI : 10.1121/1.386782

P. K. Kuhl, A new view of language acquisition, Proceedings of the National Academy of Sciences, vol.388, issue.4, pp.11850-11857, 2000.
DOI : 10.3758/BF03206698

P. K. Kuhl, Early language acquisition: cracking the speech code, Nature Reviews Neuroscience, vol.298, issue.11, pp.831-843, 1533.
DOI : 10.1126/science.298.5598.1569

G. Lakoff, J. , and M. , Metaphors We Live By, 1980.
DOI : 10.7208/chicago/9780226470993.001.0001

M. L. Leonard and A. G. Horn, Ambient noise and the design of begging signals, Proceedings of the Royal Society B: Biological Sciences, vol.57, issue.3, pp.651-6563021, 2004.
DOI : 10.1006/anbe.1998.1013

URL : http://europepmc.org/articles/pmc1564071?pdf=render

S. C. Levinson, Pragmatics. Cambridge, 1983.

S. C. Levinson, On the human " interaction engine, Roots of Human Sociality: Culture, Cognition and Interaction, pp.39-69, 2006.

S. C. Levinson, Turn-taking in Human Communication ??? Origins and Implications for Language Processing, Trends in Cognitive Sciences, vol.20, issue.1, 2015.
DOI : 10.1016/j.tics.2015.10.010

URL : http://hdl.handle.net/11858/00-001M-0000-0029-404D-6

K. Liebal, B. M. Waller, A. M. Burrows, and K. E. Slocombe, Primate Communication: A Multimodal Approach, 2013.
DOI : 10.1017/CBO9781139018111

P. Lieberman, The Biology and Evolution of Language, 1984.

A. Lim and H. G. Okuno, The MEI Robot: Towards Using Motherese to Develop Multimodal Emotional Intelligence, IEEE Transactions on Autonomous Mental Development, vol.6, issue.2, pp.126-138, 2014.
DOI : 10.1109/TAMD.2014.2317513

URL : http://winnie.kuis.kyoto-u.ac.jp/members/angelica/papers/angelicalim-tamd-2014.pdf

H. Lind, T. Dabelsteen, G. , and P. K. , Female great tits can identify mates by song, Animal Behaviour, vol.52, issue.4, pp.667-671, 1996.
DOI : 10.1006/anbe.1996.0211

B. Lindblom, Explaining Phonetic Variation: A Sketch of the H&H Theory, Speech Production and Speech Modelling, pp.403-439, 1990.
DOI : 10.1007/978-94-009-2037-8_16

D. Lipkind, G. F. Marcus, D. K. Bemis, K. Sasahara, N. Jacoby et al., Stepwise acquisition of vocal combinatorial capacity in songbirds and human infants, Nature, vol.432, issue.7452, pp.104-10810, 1038.
DOI : 10.1038/nature02992

URL : http://europepmc.org/articles/pmc3676428?pdf=render

E. Lombard, Le sign de l' élévation de la voix, Ann. Maladies Oreille Larynx Nez Pharynx, vol.37, pp.101-119, 1911.

R. Lopez-cozar-delgado and M. Araki, Spoken, Multilingual and Multimodal Dialogue Systems: Development and Assessment, 2005.

C. Lyon, C. L. Nehaniv, and A. Cangelosi, Emergence of Communication and Language, 2007.
DOI : 10.1007/978-1-84628-779-4

Z. S. Ma, Towards computational models of animal communications, an introduction for computer scientists, Cognitive Systems Research, vol.33, pp.70-99, 2015.
DOI : 10.1016/j.cogsys.2014.08.002

P. Macneilage, The frame/content theory of evolution of speech production, Behavioral and Brain Sciences, vol.21, issue.04, pp.499-546, 1998.
DOI : 10.1017/S0140525X98001265

P. F. Macneilage, The Origin of Speech, 2008.

K. Manabe, E. I. Sadr, and R. J. Dooling, ): Differential reinforcement of vocal intensity and the Lombard effect, The Journal of the Acoustical Society of America, vol.103, issue.2, pp.1190-1198, 1998.
DOI : 10.1121/1.421227

M. B. Manser, The acoustic structure of suricates' alarm calls varies with predator type and the level of response urgency, Proceedings of the Royal Society B: Biological Sciences, vol.268, issue.1483, pp.2315-2324, 2001.
DOI : 10.1098/rspb.2001.1773

A. H. Maslow, A theory of human motivation., Psychological Review, vol.50, issue.4, pp.370-396, 1943.
DOI : 10.1037/h0054346

H. R. Maturana and F. J. Varela, The Tree of Knowledge: The Biological Roots of Human Understanding, 1987.

N. Mavridis, A review of verbal and non-verbal human???robot interactive communication, Robotics and Autonomous Systems, vol.63, pp.22-35, 2014.
DOI : 10.1016/j.robot.2014.09.031

URL : https://doi.org/10.1016/j.robot.2014.09.031

D. Mccarthy, Language development in children., Manual of Child Psychology, pp.492-630, 1954.
DOI : 10.1037/10756-010

K. Mccomb, G. Shannon, K. N. Sayialel, M. , and C. , Elephants can determine ethnicity, gender, and age from acoustic cues in human voices, Proceedings of the National Academy of Sciences, vol.106, issue.3 Pt 1, pp.5433-5438, 2014.
DOI : 10.1121/1.427148

B. Mccowan, S. F. Hanser, D. , and L. R. , Quantitative tools for comparing animal communication systems: information theory applied to bottlenose dolphin whistle repertoires, Animal Behaviour, vol.57, issue.2, pp.409-419, 1999.
DOI : 10.1006/anbe.1998.1000

P. K. Mcgregor, Playback and Studies of Animal Communication, 1992.
DOI : 10.1007/978-1-4757-6203-7

M. F. Mctear, Spoken Dialogue Technology: Towards the Conversational User Interface, 2004.
DOI : 10.1007/978-0-85729-414-2

A. Mehrabian and D. J. Mennill, Pleasure-arousal-dominance: A general framework for describing and measuring individual differences in Temperament, Current Psychology, vol.13, issue.1, pp.261-292, 1996.
DOI : 10.1080/03610738408258553

D. J. Mennill, P. T. Boag, and L. M. Ratcliffe, The reproductive choices of eavesdropping female black-capped chickadees, Poecile atricapillus, Naturwissenschaften, vol.90, issue.12, pp.577-582, 2003.
DOI : 10.1007/s00114-003-0479-3

P. Messum, H. , and I. S. , Creating the cognitive form of phonological units: the speech sound correspondence problem in infancy could be solved Frontiers in Robotics and AI | www.frontiersin, p.61, 2015.

W. J. Mitchell, K. A. Szerszen, . Sr, A. S. Lu, P. W. Schermerhorn et al., A mismatch in the human realism of face and voice produces an uncanny valley Unconscious anchoring in maternal imitation that helps find the correspondence of a caregiver's vowel categories, Adv. Robot, vol.2, issue.21, pp.1583-160010, 2011.

S. Miyagawa, S. Ojima, R. C. Berwick, and K. Okanoya, The integration hypothesis of human language evolution and the nature of contemporary languages, Frontiers in Psychology, vol.38, issue.7, 2014.
DOI : 10.1017/CBO9780511486371.021

R. K. Moore, PRESENCE: A Human-Inspired Architecture for Speech-Based Human-Machine Interaction, IEEE Transactions on Computers, vol.56, issue.9, pp.1176-1188, 1080.
DOI : 10.1109/TC.2007.1080

URL : http://eprints.whiterose.ac.uk/42799/2/Moore_42799.pdf

R. K. Moore, Spoken language processing: Piecing together the puzzle, Speech Communication, vol.49, issue.5, pp.418-435, 2007.
DOI : 10.1016/j.specom.2007.01.011

URL : https://hal.archives-ouvertes.fr/hal-00499174

R. K. Moore, Cognitive Approaches to Spoken Language Technology, Speech Technology: Theory and Applications, pp.89-103, 2010.
DOI : 10.1007/978-0-387-73819-2_6

R. K. Moore, A Bayesian explanation of the ???Uncanny Valley??? effect and related psychological phenomena, Scientific Reports, vol.26, issue.1, 2012.
DOI : 10.1016/j.cub.2009.11.034

URL : http://www.nature.com/articles/srep00864.pdf

R. K. Moore, Spoken Language Processing: Where Do We Go from Here?, Your Virtual Butler, LNAI, pp.111-125, 2013.
DOI : 10.1007/978-3-642-37346-6_10

R. K. Moore, From talking and listening robots to intelligent communicative machines, Robots That Talk and Listen, pp.317-335, 2015.

R. K. Moore, A Real-Time Parametric General-Purpose Mammalian Vocal Synthesiser, Interspeech 2016, 2016.
DOI : 10.21437/Interspeech.2016-841

R. K. Moore, Is Spoken Language All-or-Nothing? Implications for Future Speech-Based Human-Machine Interaction, Dialogues with Social Robots Enablements, Analyses, and Evaluation Lecture Notes in Electrical Engineering (LNEE)). Available at, 2016.
DOI : 10.1109/ICARA.2011.6144906

R. Moore and A. Morris, Experiences collecting genuine spoken enquiries using WOZ techniques, Proceedings of the workshop on Speech and Natural Language , HLT '91, pp.61-63, 1992.
DOI : 10.3115/1075527.1075540

URL : http://dl.acm.org/ft_gateway.cfm?id=1075540&type=pdf

R. K. Moore and L. Bosch, Modelling vocabulary growth from birth to young adulthood, INTERSPEECH (Brighton), pp.1727-1730, 2009.

M. Mori, Bukimi no tani (the uncanny valley, pp.33-35, 1970.

A. F. Morse, V. L. Benitez, T. Belpaeme, A. Cangelosi, and L. B. Smith, Posture Affects How Robots and Infants Map Words to Objects, PLOS ONE, vol.106, issue.1, 2015.
DOI : 10.1371/journal.pone.0116012.s019

URL : https://doi.org/10.1371/journal.pone.0116012

A. F. Morse, C. Herrera, R. Clowes, A. Montebelli, and T. Ziemke, The role of robotic modelling in cognitive science, New Ideas in Psychology, vol.29, issue.3, pp.312-324, 2011.
DOI : 10.1016/j.newideapsych.2011.02.001

C. Moulin-frier, S. M. Nguyen, and P. Oudeyer, Self-organization of early vocal development in infants and machines: the role of intrinsic motivation, Frontiers in Psychology, vol.4, 2013.
DOI : 10.3389/fpsyg.2013.01006

URL : https://hal.archives-ouvertes.fr/hal-00927940

C. Nass and S. Brave, Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship, 2005.

T. Nazzi and J. Bertoncini, Before and after the vocabulary spurt: two modes of word acquisition?, Developmental Science, vol.57, issue.149, pp.136-142, 2003.
DOI : 10.1037//0012-1649.30.4.553

N. Nguyen and V. Delvaux, Role of imitation in the emergence of phonological systems, Journal of Phonetics, vol.53, pp.46-54, 2015.
DOI : 10.1016/j.wocn.2015.08.004

URL : https://hal.archives-ouvertes.fr/hal-01394207

A. Niculescu, B. Van-dijk, A. Nijholt, S. , and S. L. , The influence of voice pitch on the evaluation of a social robot receptionist, 2011 International Conference on User Science and Engineering (i-USEr ), pp.18-23, 2011.
DOI : 10.1109/iUSEr.2011.6150529

S. Nolfi and M. Mirolli, Evolution of Communication and Language in Embodied Agents, 2010.
DOI : 10.1007/978-3-642-01250-1

S. Nonaka, R. Takahashi, K. Enomoto, A. Katada, and T. Unno, Lombard reflex during PAG-induced vocalization in decerebrate cats, Neuroscience Research, vol.29, issue.4, pp.283-289, 1997.
DOI : 10.1016/S0168-0102(97)00097-7

D. K. Oller, Evolution of Communication Systems: A Comparative Approach, 2004.

M. S. Osmanski and R. J. Dooling, The effect of altered auditory feedback on control of vocal production in budgerigars, 2009.

K. Ouattara, A. Lemasson, and K. Zuberbühler, Campbell's monkeys concatenate vocalizations into context-specific call sequences, Proc. Natl. Acad, 2009.
DOI : 10.1037/a0014280

URL : http://www.pnas.org/content/106/51/22026.full.pdf

E. Oztop, M. Kawato, and M. Arbib, Mirror neurons and imitation: A computationally guided review, Neural Networks, vol.19, issue.3, pp.254-271, 2006.
DOI : 10.1016/j.neunet.2006.02.002

A. Pentland, Honest signals, Proceedings of the 19th ACM international conference on Multimedia, MM '11, 2008.
DOI : 10.1145/2072298.2072374

I. M. Pepperberg, Vocal learning in Grey parrots: A brief review of perception, production, and cross-species comparisons, Brain and Language, vol.115, issue.1, pp.81-91, 2010.
DOI : 10.1016/j.bandl.2009.11.002

E. C. Perez, J. E. Elie, C. O. Soulage, H. A. Soula, N. Mathevon et al., The acoustic expression of stress in a songbird: Does corticosterone drive isolation-induced modifications of zebra finch calls?, Hormones and Behavior, vol.61, issue.4, pp.573-581, 2012.
DOI : 10.1016/j.yhbeh.2012.02.004

URL : https://hal.archives-ouvertes.fr/hal-00759517

R. S. Peterson and G. A. Bartholomew, Airborne vocal communication in the california sea lion, Zalophus californianus, Animal Behaviour, vol.17, issue.69, pp.17-24, 1969.
DOI : 10.1016/0003-3472(69)90108-0

J. A. Pfaff, L. Zanette, S. A. Macdougall-shackleton, and E. A. Macdougall-shackleton, Song repertoire size varies with HVC volume and is indicative of male quality in song sparrows (Melospiza melodia), Proceedings of the Royal Society B: Biological Sciences, vol.159, issue.4, pp.2035-20400170, 2007.
DOI : 10.1098/rspb.1996.0091

M. Phillips and M. Philips, APPLICATIONS OF SPOKEN LANGUAGE TECHNOLOGY AND SYSTEMS, 2006 IEEE Spoken Language Technology Workshop, 2006.
DOI : 10.1109/SLT.2006.326784

R. W. Picard, Affective Computing, 1997.
DOI : 10.1037/e526112012-054

M. J. Pickering and S. Garrod, Do people use language production to make predictions during comprehension?, Trends in Cognitive Sciences, vol.11, issue.3, 2007.
DOI : 10.1016/j.tics.2006.12.002

R. Pieraccini, The Voice in the Machine, 2012.

S. Pinker and R. Jackendoff, The faculty of language: what's special about it?, Cognition, vol.95, issue.2, 2005.
DOI : 10.1016/j.cognition.2004.08.004

URL : http://mapageweb.umontreal.ca/tuitekj/cours/chomsky/pinker-jackendoff.pdf

K. Pisanski, V. Cartei, C. Mcgettigan, J. Raine, R. et al., Voice modulation: a window into the origins of human vocal control? Trends Cogn, 2016.
DOI : 10.1016/j.tics.2016.01.002

R. Plutchik, A general psychoevolutionary theory of emotion, " in Emotion: Theory, Research and Experience, Theories of Emotion, pp.3-33, 1980.

P. Pongrácz, C. Molnár, and Á. Miklósi, Acoustic parameters of dog barks carry emotional information for humans, Applied Animal Behaviour Science, vol.100, issue.3-4, 2006.
DOI : 10.1016/j.applanim.2005.12.004

J. H. Poole, P. L. Tyack, A. S. Stoeger-horwath, and S. Watwood, Elephants are capable of vocal learning, Nature, vol.125, issue.7032, pp.455-456, 1972.
DOI : 10.5479/si.00810282.125

D. Premack, G. L. Woodruff, K. Mccomb, R. , and D. , Does the chimpanzee have a theory of mind? Cross-modal individual recognition in domestic horses (Equus caballus), Behav. Brain Sci. Proc. Natl. Acad. Sci. U.S.A, vol.1, issue.106, pp.515-526, 1978.

H. J. Rainey, K. Zuberbühler, and P. J. Slater, Hornbills can distinguish between primate alarm calls, Proceedings of the Royal Society B: Biological Sciences, vol.271, issue.1540, pp.755-759, 2003.
DOI : 10.1098/rspb.2003.2619

URL : http://europepmc.org/articles/pmc1691652?pdf=render

R. Ranganath, D. Jurafsky, and D. A. Mcfarland, Detecting friendly, flirtatious, awkward, and assertive speech in speed-dates. Comput. Speech Lang, 2013.
DOI : 10.1016/j.csl.2012.01.005

A. Ravignani, D. Bowling, and W. T. Fitch, Chorusing, synchrony, and the evolutionary functions of rhythm, Frontiers in Psychology, vol.5, 2014.
DOI : 10.3389/fpsyg.2014.01118

URL : https://www.frontiersin.org/articles/10.3389/fpsyg.2014.01118/pdf

A. Ravignani, W. T. Fitch, F. D. Hanke, T. Heinrich, B. Hurgitsch et al., What Pinnipeds Have to Say about Human Speech, Music, and the Evolution of Rhythm, Frontiers in Neuroscience, vol.102, issue.730, p.274, 2016.
DOI : 10.1152/jn.00066.2009

URL : https://www.frontiersin.org/articles/10.3389/fnins.2016.00274/pdf

D. Reiss and B. Mccowan, Spontaneous vocal mimicry and production by bottlenose dolphins (Tursiops truncatus): Evidence for vocal learning., Journal of Comparative Psychology, vol.107, issue.3, 1993.
DOI : 10.1037/0735-7036.107.3.301

A. R. Ridley, M. F. Child, and M. B. Bell, Interspecific audience effects on the alarm-calling behaviour of a kleptoparasitic bird, Biology Letters, vol.67, issue.3, pp.589-5910325, 2007.
DOI : 10.1016/0022-5193(77)90061-3

URL : http://rsbl.royalsocietypublishing.org/content/roybiolett/3/6/589.full.pdf

G. Rizzolatti, C. , and L. , THE MIRROR-NEURON SYSTEM, Annual Review of Neuroscience, vol.27, issue.1, pp.169-192, 2004.
DOI : 10.1146/annurev.neuro.27.070203.144230

B. C. Roy, M. C. Frank, P. Decamp, M. Miller, R. et al., Predicting the birth of a spoken word, Proceedings of the National Academy of Sciences, vol.21, issue.1, 2015.
DOI : 10.1007/978-3-642-04898-2_594

URL : http://www.pnas.org/content/112/41/12663.full.pdf

J. A. Russell, A circumplex model of affect., Journal of Personality and Social Psychology, vol.39, issue.6, pp.1161-1178, 1980.
DOI : 10.1037/h0077714

URL : https://hal.archives-ouvertes.fr/hal-01086372

J. R. Saffran, Statistical Language Learning, Current Directions in Psychological Science, vol.12, issue.4, pp.110-114, 1243.
DOI : 10.1207/S15327078IN0402_07

J. R. Saffran, R. N. Aslin, and E. Newport, Statistical Learning by 8-Month-Old Infants, Science, vol.274, issue.5294, 1926.
DOI : 10.1126/science.274.5294.1926

K. Sasahara, M. L. Cody, D. Cohen, T. , and C. E. , Structural Design Principles of Complex Bird Songs: A Network-Based Approach, PLoS ONE, vol.7, issue.9, 2012.
DOI : 10.1371/journal.pone.0044436.s006

URL : https://doi.org/10.1371/journal.pone.0044436

A. M. Schel, A. Candiotti, and K. Zuberbühler, Predator-deterring alarm call sequences in Guereza colobus monkeys are meaningful to conspecifics, Animal Behaviour, vol.80, issue.5, 2010.
DOI : 10.1016/j.anbehav.2010.07.012

URL : http://doc.rero.ch/record/232397/files/Schel_A._M._-_Predator-deterring_alarm_call_sequences_20141007.pdf

A. M. Schel, S. W. Townsend, Z. Machanda, K. Zuberbühler, and K. E. Slocombe, Chimpanzee Alarm Call Production Meets Key Criteria for Intentionality, PLoS ONE, vol.84, issue.10, 2013.
DOI : 10.1371/journal.pone.0076674.s006

URL : https://doi.org/10.1371/journal.pone.0076674

K. R. Scherer, Vocal communication of emotion: A review of research paradigms, Speech Communication, vol.40, issue.1-2, pp.227-25610, 2003.
DOI : 10.1016/S0167-6393(02)00084-5

R. J. Schusterman, Temporal patterning in sea lion barking (Zalophus californianus), Behavioral Biology, vol.20, issue.3, pp.404-40810, 1977.
DOI : 10.1016/S0091-6773(77)90964-6

M. Schwenk and K. O. Arras, R2-D2 Reloaded: A flexible sound synthesis system for sonic human-robot interaction design, The 23rd IEEE International Symposium on Robot and Human Interactive Communication, pp.161-167, 2014.
DOI : 10.1109/ROMAN.2014.6926247

T. Scott-phillips, Speaking Our Minds: Why Human Communication Is Different, and How Language Evolved to Make It Special, 2015.
DOI : 10.1007/978-1-137-31273-0

W. A. Searcy, Y. , and K. , Song and female choice, Ecology and Evolution of Acoustic Communication in Birds, pp.454-473, 1996.

R. M. Seyfarth and D. L. Cheney, Meaning and Emotion in Animal Vocalizations, Annals of the New York Academy of Sciences, vol.61, issue.1, pp.32-55, 2003.
DOI : 10.1006/anbe.2000.1518

R. M. Seyfarth, D. L. Cheney, and P. Marler, Monkey responses to three different alarm calls: evidence of predator classification and semantic communication, Science, vol.210, issue.4471, pp.801-803, 1980.
DOI : 10.1126/science.7433999

R. V. Shannon, Is Birdsong More Like Speech or Music?, Trends in Cognitive Sciences, vol.20, issue.4, pp.245-247, 2016.
DOI : 10.1016/j.tics.2016.02.004

L. B. Smith, Y. , and C. , Infants rapidly learn word-referent mappings via cross-situational statistics, Cognition, vol.106, issue.3, 2008.
DOI : 10.1016/j.cognition.2007.06.010

URL : http://www.indiana.edu/~dll/papers/smith_cognition08.pdf

J. Soltis, K. A. Leighty, C. M. Wesolek, and A. Savage, The expression of affect in African elephant (Loxodonta africana) rumble vocalizations., Journal of Comparative Psychology, vol.123, issue.2, pp.222-22510, 1037.
DOI : 10.1037/a0015223

R. Stark, Stages of speech development in the first year of life, pp.113-142, 1980.

L. Steels, Language games for autonomous robots, IEEE Intell. Syst, vol.16, pp.16-22, 2001.
DOI : 10.1109/mis.2001.956077

L. Steels, Evolving grounded communication for robots, Trends in Cognitive Sciences, vol.7, issue.7, pp.308-312, 2003.
DOI : 10.1016/S1364-6613(03)00129-3

URL : http://www.csl.sony.fr/downloads/papers/2003/steels-03c.pdf

C. Stephan and K. Zuberbühler, Predation increases acoustic complexity in primate alarm calls, Biology Letters, vol.53, issue.6, pp.641-644, 2008.
DOI : 10.1006/anbe.1996.0334

URL : http://rsbl.royalsocietypublishing.org/content/roybiolett/4/6/641.full.pdf

F. Stramandinoli, D. Marocco, and A. Cangelosi, The grounding of higher order concepts in action and language: A cognitive robotics model, Neural Networks, vol.32, pp.165-173, 2012.
DOI : 10.1016/j.neunet.2012.02.012

URL : http://www.tech.plym.ac.uk/socce/robotdoc/publications/Stramandinoli-et-al_TheGroundingOfHigherOrderConcepts-NN.pdf

D. Y. Takahashi, D. Z. Narayanan, and A. A. Ghazanfar, Coupled Oscillator Dynamics of Vocal Turn-Taking in Monkeys, Current Biology, vol.23, issue.21, 2013.
DOI : 10.1016/j.cub.2013.09.005

URL : https://doi.org/10.1016/j.cub.2013.09.005

W. J. Talkington, K. M. Rapuano, L. A. Hitt, C. A. Frum, L. et al., Humans Mimicking Animals: A Cortical Hierarchy for Human Vocal Communication Sounds, Journal of Neuroscience, vol.32, issue.23, pp.8084-8093, 2012.
DOI : 10.1523/JNEUROSCI.1118-12.2012

URL : http://www.jneurosci.org/content/jneuro/32/23/8084.full.pdf

O. Tchernichovski, P. P. Mitra, T. Lints, and F. Nottebohm, Dynamics of the Vocal Imitation Process: How a Zebra Finch Learns Its Song, Science, vol.291, issue.5513, pp.2564-2569, 2001.
DOI : 10.1126/science.1058522

C. N. Templeton and E. Greene, Nuthatches eavesdrop on variations in heterospecific chickadee mobbing alarm calls, Proceedings of the National Academy of Sciences, vol.37, issue.2, pp.5479-5482, 2007.
DOI : 10.1016/0003-3472(89)90039-0

URL : http://www.pnas.org/content/104/13/5479.full.pdf

C. N. Templeton, E. Greene, D. , and K. , Allometry of Alarm Calls: Black-Capped Chickadees Encode Information About Predator Size, Science, vol.308, issue.5730, 1934.
DOI : 10.1126/science.1108841

C. N. Templeton, N. I. Mann, A. A. Ríos-chelén, E. Quiros-guerrero, M. Garcia et al., An experimental study of duet integration in the happy wren, Pheugopedius felix, Animal Behaviour, vol.86, issue.4, pp.821-827, 2013.
DOI : 10.1016/j.anbehav.2013.07.022

L. Ten-bosch, L. Boves, H. Van-hamme, M. , and R. K. , A computational model of language acquisition: the emergence of words, Fundam. Inform, vol.90, pp.229-249, 2009.

C. Ten-cate, On the phonetic and syntactic processing abilities of birds: From songs to speech and artificial grammars, Current Opinion in Neurobiology, vol.28, pp.157-164, 2014.
DOI : 10.1016/j.conb.2014.07.019

C. Ten-cate and K. Okanoya, Revisiting the syntactic abilities of nonhuman animals: natural vocalizations and artificial grammar learning, Philos. Trans. R. Soc. Lond. B Biol. Sci, vol.367, 1984.

S. Thill, D. Caligiore, A. M. Borghi, T. Ziemke, and G. Baldassarre, Theories and computational models of affordance and mirror systems: An integrative review, Neuroscience & Biobehavioral Reviews, vol.37, issue.3, pp.491-521, 2013.
DOI : 10.1016/j.neubiorev.2013.01.012

URL : https://doi.org/10.1016/j.neubiorev.2013.01.012

S. Thill and R. Lowe, On the Functional Contributions of Emotion Mechanisms to (Artificial) Cognition and Intelligence, Proceedings of the Fifth Conference on Artificial General Intelligence, LNAI 7716, pp.322-331, 2012.
DOI : 10.1007/978-3-642-35506-6_33

S. Thill, S. Padó, and T. Ziemke, On the Importance of a Rich Embodiment in the Grounding of Concepts: Perspectives From Embodied Cognitive Science and Computational Linguistics, Topics in Cognitive Science, vol.1, issue.1, pp.545-558, 2014.
DOI : 10.1007/s12559-009-9012-0

S. Thill and K. E. Twomey, What's on the Inside Counts: A Grounded Account of Concept Acquisition and Development, Frontiers in Psychology, vol.1, issue.506, 2016.
DOI : 10.3758/s13423-015-0864-x

URL : http://journal.frontiersin.org/article/10.3389/fpsyg.2016.00402/pdf

M. Tomasello, Origins of Human Communication, 2008.

M. Tomasello, M. Carpenter, J. Call, T. Behne, M. et al., Understanding and sharing intentions: The origins of cultural cognition, Behavioral and Brain Sciences, vol.28, issue.05, pp.675-735, 2005.
DOI : 10.1017/S0140525X05000129

URL : https://www.cambridge.org/core/services/aop-cambridge-core/content/view/F9C40BF73A68B30B8EB713F2F947F7E2/S0140525X05000129a.pdf/div-class-title-understanding-and-sharing-intentions-the-origins-of-cultural-cognition-div.pdf

S. W. Townsend, S. E. Koski, R. W. Byrne, K. E. Slocombe, B. Bickel et al., Exorcising Grice's ghost: an empirical approach to studying intentional communication in animals, Biological Reviews, vol.18, issue.3, 2016.
DOI : 10.1016/j.cub.2007.12.041

F. Trillmich, Mutual Mother-Pup Recognition in Gal??pagos Fur Seals and Sea Lions: Cues Used and Functional Significance, Behaviour, vol.78, issue.1, pp.21-42, 1981.
DOI : 10.1163/156853981X00248

URL : https://pub.uni-bielefeld.de/download/1781845/2313548

E. Vallet, I. Beme, and M. Kreutzer, Two-note syllables in canary songs elicit high levels of sexual display, Animal Behaviour, vol.55, issue.2, pp.291-297, 1997.
DOI : 10.1006/anbe.1997.0631

S. C. Vernes, What bats have to say about speech and language, Psychonomic Bulletin & Review, vol.197, issue.1, pp.13423-13439, 2016.
DOI : 10.1126/science.1230835

URL : https://link.springer.com/content/pdf/10.3758%2Fs13423-016-1060-3.pdf

A. Vinciarelli, M. Pantic, and H. Bourlard, Social signal processing: Survey of an emerging domain, Image and Vision Computing, vol.27, issue.12, 2009.
DOI : 10.1016/j.imavis.2008.11.007

URL : http://www.doc.ic.ac.uk/~maja/IVCJ-SSPsurvey-FINAL.pdf

A. Vollmer, B. Wrede, K. J. Rohlfing, and A. Cangelosi, Do beliefs about a robot's capabilities influence alignment to its actions?, 2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics (ICDL), pp.2013-2014, 2013.
DOI : 10.1109/DevLrn.2013.6652521

I. A. Volodin, E. V. Volodina, E. N. Lapshina, K. O. Efremova, and N. V. Soldatova, Vocal group signatures in the goitred gazelle Gazella subgutturosa, Animal Cognition, vol.119, issue.10, 2014.
DOI : 10.1121/1.2130934

W. Von-humboldt, Uber die verschiedenheit des menschlichen sprachbaues und ihren einfuss auf die geistige entwickelung des menschengeschlechts, Royal Academy of Science, 1836.

P. Wagner, Z. Malisz, and S. Kopp, Gesture and speech in interaction: An overview, Speech Communication, vol.57, 2014.
DOI : 10.1016/j.specom.2013.09.008

S. Waiblinger, X. Boivin, V. Pedersen, M. V. Tosi, A. M. Janczak et al., Assessing the human???animal relationship in farmed species: A critical review, Applied Animal Behaviour Science, vol.101, issue.3-4, pp.185-242, 2006.
DOI : 10.1016/j.applanim.2006.02.001

URL : https://doi.org/10.1016/j.applanim.2006.02.001

M. Walters, D. Syrdal, K. Koay, K. Dautenhahn, and R. Boekhorst, Human approach distances to a mechanical-looking robot with different robot voice styles, RO-MAN 2008, The 17th IEEE International Symposium on Robot and Human Interactive Communication, pp.707-712, 2008.
DOI : 10.1109/ROMAN.2008.4600750

S. K. Watson, S. W. Townsend, A. M. Schel, C. Wilke, E. K. Wallace et al., Vocal Learning in the Functionally Referential Food Grunts of Chimpanzees, Current Biology, vol.25, issue.4, pp.495-499, 2015.
DOI : 10.1016/j.cub.2014.12.032

URL : https://doi.org/10.1016/j.cub.2014.12.032

D. M. Weary and J. R. Krebs, Great tits classify songs by individual voice characteristics, Animal Behaviour, vol.43, issue.2, pp.283-287, 1992.
DOI : 10.1016/S0003-3472(05)80223-4

B. Webb, Using robots to model animals: a cricket test, Robotics and Autonomous Systems, vol.16, issue.2-4, pp.117-13410, 1995.
DOI : 10.1016/0921-8890(95)00044-5

B. Webb, Using robots to understand animal behavior, Adv. Study Behav, vol.38, issue.08, pp.1-58, 2008.

M. Weiss, H. Hultsch, I. Adam, C. Scharff, and S. Kipper, The use of network analysis to study complex animal communication systems: a study on nightingale song, Proceedings of the Royal Society B: Biological Sciences, vol.80, issue.3, 2014.
DOI : 10.1103/PhysRevE.80.051902

S. Wermter, M. Page, M. Knowles, V. Gallese, F. Pulvermüller et al., Multimodal communication in animals, humans and robots: An introduction to perspectives in brain-inspired informatics, Neural Networks, vol.22, issue.2, pp.111-115, 2009.
DOI : 10.1016/j.neunet.2009.01.004

A. D. Wilson and S. Golonka, Embodied Cognition is Not What you Think it is, Frontiers in Psychology, vol.4, 2013.
DOI : 10.3389/fpsyg.2013.00058

URL : http://journal.frontiersin.org/article/10.3389/fpsyg.2013.00058/pdf

M. Wilson and G. Knoblich, The Case for Motor Involvement in Perceiving Conspecifics., Psychological Bulletin, vol.131, issue.3, pp.460-473, 2005.
DOI : 10.1037/0033-2909.131.3.460

URL : http://somby.info/page3/assets/2005_WilsonKnoblich.pdf

J. L. Yorzinski and S. L. Vehrencamp, The Effect of Predator Type and Danger Level on the Mob Calls of the American Crow, The Condor, vol.111, issue.1, pp.159-168, 2009.
DOI : 10.1525/cond.2009.080057

Y. Yoshikawa, M. Asada, K. Hosoda, and J. Koga, A constructivist approach to infants' vowel acquisition through mother???infant interaction, Connection Science, vol.15, issue.4, pp.245-258, 2003.
DOI : 10.1121/1.1906694

K. Zuberbühler, Referential labelling in Diana monkeys, Animal Behaviour, vol.59, issue.5, pp.917-927, 2000.
DOI : 10.1006/anbe.1999.1317

K. K. Zuberbühler, Predator-specific alarm calls in Campbell's monkeys, Cercopithecus campbelli A syntactic rule in forest monkey communication, Behav. Ecol. Sociobiol. Anim. Behav, vol.50, issue.63, pp.414-422, 1914.

K. Zuberbühler, D. Jenny, and R. Bshary, The Predator Deterrence Function of Primate Alarm Calls, Ethology, vol.53, issue.6, pp.477-490, 1999.
DOI : 10.1046/j.1439-0310.1999.00396.x