WINNER Project, Interference avoidance concept, Deliverable D4, 2007.

P. Y. Glorennec, Reinforcement Learning: an overview, European Sym. on Intelligent Techniques, 2000.

A. L. Stolyar and H. Viswanathan, Self-Organizing Dynamic Fractional Frequency Reuse for Best-Effort Traffic through Distributed Inter-Cell Coordination, IEEE INFOCOM 2009, The 28th Conference on Computer Communications, 2009.
DOI : 10.1109/INFCOM.2009.5062043

J. Gross, Comparison of heuristic and optimal subcarrier assignment algorithms, Proc. ICWN'03, 2003.

R. Combes, Z. Altman, and E. Altman, On the use of packet scheduling in self-optimization processes: application to coverage-capacity optimization, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00498378

J. L. van den Berg, Self-organization in future mobile communication networks, ICT-Mobile Summit, 2008.

L. Jouffe, Fuzzy inference system learning by reinforcement methods, IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews), vol.28, issue.3, pp.338-355, 1998.
DOI : 10.1109/5326.704563

L. Matignon, G. J. Laurent, and N. Le Fort-Piat, Hysteretic Q-learning: an algorithm for decentralized reinforcement learning in cooperative multi-agent teams, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2007.
DOI : 10.1109/IROS.2007.4399095

URL : https://hal.archives-ouvertes.fr/hal-00187279

J. Mo and J. Walrand, Fair end-to-end window-based congestion control, IEEE/ACM Transactions on Networking, vol.8, issue.5, pp.556-567, 2000.
DOI : 10.1109/90.879343

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.143.3766

A. Samhat, Z. Altman, M. Francisco, and B. Fourestié, Semi-dynamic simulator for large-scale heterogeneous wireless networks, International Journal of Mobile Network Design and Innovation, vol.1, issue.3/4, pp.3-4, 2006.
DOI : 10.1504/IJMNDI.2006.012097