I. F. Akyildiz, A. Lee, P. Wang, M. Luo, and W. Chou, A roadmap for traffic engineering in sdn-openflow networks, Computer Networks, vol.71, pp.1-30, 2014.

. Fig, 11: Empirical CDF of ? QF

S. Layeghy, F. Pakzad, and M. Portmann, SCOR: software-defined constrained optimal routing platform for SDN, 2016.
DOI : 10.14264/uql.2018.820

D. D. Clark, C. Partridge, J. C. Ramming, and J. T. Wroclawski, A knowledge plane for the internet, Proceedings of the 2003 conference on Applications, technologies, architectures, and protocols for computer communications, pp.3-10, 2003.
DOI : 10.1145/863956.863957

M. Wang, Y. Cui, X. Wang, S. Xiao, and J. Jiang, Machine learning for networking: Workflow, advances and opportunities, IEEE Network, vol.32, issue.2, pp.92-99, 2018.
DOI : 10.1109/mnet.2017.1700200
URL : http://arxiv.org/pdf/1709.08339

A. Valadarsky, M. Schapira, D. Shahaf, and A. Tamar, A machine learning approach to routing, 2017.

J. A. Boyan and M. L. Littman, Packet routing in dynamically changing networks: A reinforcement learning approach, Advances in neural information processing systems, pp.671-678, 1994.

S. Lin, I. F. Akyildiz, P. Wang, and M. Luo, Qos-aware adaptive routing in multilayer hierarchical software defined networks: A reinforcement learning approach, 2016 IEEE International Conference on Services Computing (SCC), pp.25-33, 2016.

Y. Li, Deep reinforcement learning: An overview, 2017.

K. Arulkumaran, M. P. Deisenroth, M. Brundage, and A. A. Bharath, A brief survey of deep reinforcement learning, 2017.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classification with deep convolutional neural networks, Advances in neural information processing systems, pp.1097-1105, 2012.
DOI : 10.1145/3065386
URL : http://dl.acm.org/ft_gateway.cfm?id=3065386&type=pdf

V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness et al., Human-level control through deep reinforcement learning, Nature, vol.518, issue.7540, p.529, 2015.
DOI : 10.1038/nature14236

T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez et al., Continuous control with deep reinforcement learning, 2015.

G. Stampa, M. Arias, D. Sanchez-charles, V. Muntés-mulero, and A. Cabellos, A deep-reinforcement learning approach for software-defined networking routing optimization, 2017.

D. Silver, G. Lever, N. Heess, T. Degris, D. Wierstra et al., Deterministic policy gradient algorithms, Proceedings of the 31st International Conference on International Conference on Machine Learning, vol.32, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00938992

R. S. Sutton and A. G. Barto, Reinforcement learning: An introduction, 1998.

, The internet topology zoo

A. Varga, Discrete event simulation system, Proc. of the European Simulation Multiconference, 2011.

M. Roughan, Simplifying the synthesis of internet traffic matrices, SIGCOMM Comput. Commun. Rev, vol.35, issue.5, pp.93-96, 2005.

D. P. Kingma and J. Ba, Adam: A method for stochastic optimization, 2014.