R. Zhang, D. Yuan, and Y. Wang, A health monitoring system for wireless sensor networks, 2nd IEEE Conference on Industrial Electronics and Applications, 2007.

J. Maerien, P. Agten, C. Huygens, and W. Joosen, FAMoS: a flexible active monitoring service for wireless sensor networks, IFIP International Conference on Distributed Applications and Interoperable Systems, 2012.
URL: https://hal.archives-ouvertes.fr/hal-01527640

A. Meier, M. Motani, H. Siquan, and S. Künzli, DiMo: Distributed node monitoring in wireless sensor networks, 11th International Symposium on Modeling, 2008.

F. Wuhib, R. Stadler, and M. Dam, Gossiping for threshold detection, IFIP/IEEE International Symposium on Integrated Network Management, 2009.

R. Mijumbi, A. Asthana, M. Koivunen, F. Haiyong, and Z. Norman, DARN: Dynamic baselines for real-time network monitoring, 4th IEEE Conference on Network Softwarization, 2018.

M. Dilman and D. Raz, Efficient reactive monitoring, IEEE Journal on Selected Areas in Communications, vol.20, issue.4, pp.668-676, 2002.

Y. Rioual, J. Laurent, E. Senn, and J. Diguet, Reinforcement learning strategies for energy management in low power IoT, International Conference on Computational Science and Computational Intelligence (CSCI), 2017.
URL: https://hal.archives-ouvertes.fr/hal-01654931

G. Oddi, A. Pietrabissa, and F. Liberati, Energy balancing in multi-hop wireless sensor networks: an approach based on reinforcement learning, NASA/ESA Conference on Adaptive Hardware and Systems (AHS), IEEE, 2014.

G. Liu, X. Wang, X. Li, J. Hao, and Z. Feng, ESRQ: An efficient secure routing method in wireless sensor networks based on Q-learning, 17th IEEE International Conference On Trust, Security And Privacy In Computing And Communications, 2018.

R. S. Sutton and A. G. Barto, Reinforcement learning: An introduction, 2017.

A. Lahmadi, A. Boeglin, and O. Festor, Efficient distributed monitoring in 6LoWPAN networks, Proceedings of the 9th International Conference on Network and Service Management (CNSM), 2013.
URL: https://hal.archives-ouvertes.fr/hal-00879550

R. S. Sutton, Learning to predict by the methods of temporal differences, Machine Learning, vol.3, issue.1, pp.9-44, 1988.

G. A. Rummery and M. Niranjan, On-line Q-learning using connectionist systems, vol.37, 1994.

C. J. Watkins and P. Dayan, Q-learning, Machine Learning, vol.8, issue.3-4, 1992.

H. van Seijen, H. van Hasselt, S. Whiteson, and M. Wiering, A theoretical and empirical analysis of Expected Sarsa, IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2009.

Gym: A toolkit for developing and comparing reinforcement learning algorithms, 2019.

F. Österlind, A. Dunkels, J. Eriksson, N. Finne, and T. Voigt, Cross-level sensor network simulation with COOJA, IEEE Workshop on Practical Issues in Building Sensor Network Applications, 2006.

Zolertia Z1 motes - Contiki wiki, 2019.