L. Gupta, R. Jain, and G. Vaszkun, Survey of important issues in UAV communication networks, IEEE Communications Surveys & Tutorials, vol.18, issue.2, pp.1123-1152, 2015.

Y. Zeng, Q. Wu, and R. Zhang, Accessing from the sky: A tutorial on UAV communications for 5G and beyond, 2019.

E. Vinogradov, H. Sallouha, S. De Bast, M. M. Azari, and S. Pollin, Tutorial on UAV: A blue sky view on wireless communication, 2019.

T. S. Rappaport, Y. Xing, G. R. MacCartney, A. F. Molisch, E. Mellios et al., Overview of millimeter wave communications for fifth-generation (5G) wireless networks-with a focus on propagation models, IEEE Transactions on Antennas and Propagation, vol.65, issue.12, pp.6213-6230, 2017.

A. Fotouhi, H. Qiang, M. Ding, M. Hassan, L. G. Giordano et al., Survey on UAV cellular communications: Practical aspects, standardization advancements, regulation, and security challenges, IEEE Communications Surveys & Tutorials, vol.21, issue.4, pp.3417-3442, 2019.

R. Ghanavi, E. Kalantari, M. Sabbaghian, H. Yanikomeroglu, and A. Yongacoglu, Efficient 3D aerial base station placement considering users mobility by reinforcement learning, 2018 IEEE Wireless Communications and Networking Conference (WCNC), pp.1-6, 2018.

J. Chen and D. Gesbert, Optimal positioning of flying relays for wireless networks: A LoS map approach, 2017 IEEE International Conference on Communications (ICC), pp.1-6, 2017.

V. Saxena, J. Jaldén, and H. Klessig, Optimal UAV base station trajectories using flow-level models for reinforcement learning, IEEE Transactions on Cognitive Communications and Networking, vol.5, issue.4, pp.1101-1112, 2019.

M. Alzenad, A. El-Keyi, and H. Yanikomeroglu, 3-D placement of an unmanned aerial vehicle base station for maximum coverage of users with different QoS requirements, IEEE Wireless Communications Letters, vol.7, issue.1, pp.38-41, 2017.

N. C. Luong, D. T. Hoang, S. Gong, D. Niyato, P. Wang et al., Applications of deep reinforcement learning in communications and networking: A survey, IEEE Communications Surveys & Tutorials, vol.21, issue.4, pp.3133-3174, 2019.

K. Arulkumaran, M. P. Deisenroth, M. Brundage, and A. A. Bharath, A brief survey of deep reinforcement learning, IEEE Signal Processing Magazine, 2017.

C. H. Liu, Z. Chen, J. Tang, J. Xu, and C. Piao, Energy-efficient UAV control for effective and fair communication coverage: A deep reinforcement learning approach, IEEE Journal on Selected Areas in Communications, vol.36, issue.9, pp.2059-2070, 2018.

J. Mo and J. Walrand, Fair end-to-end window-based congestion control, IEEE/ACM Transactions on Networking, vol.8, issue.5, pp.556-567, 2000.

J. Lyu, Y. Zeng, R. Zhang, and T. J. Lim, Placement optimization of UAV-mounted mobile base stations, IEEE Communications Letters, vol.21, issue.3, pp.604-607, 2016.

M. Mozaffari, W. Saad, M. Bennis, and M. Debbah, Efficient deployment of multiple unmanned aerial vehicles for optimal wireless coverage, IEEE Communications Letters, vol.20, issue.8, pp.1647-1650, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01777917

P. S. Bithas, E. T. Michailidis, N. Nomikos, D. Vouyioukas, and A. G. Kanatas, A survey on machine-learning techniques for UAV-based communications, Sensors, vol.19, issue.23, p.5170, 2019.

T. Yuan, W. B. da Rocha Neto, C. Rothenberg, K. Obraczka, C. Barakat et al., Harnessing machine learning for next-generation intelligent transportation systems: A survey, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02284820

M. S. Shokry, D. Ebrahimi, C. Assi, S. Sharafeddine, and A. Ghrayeb, Leveraging UAVs for coverage in cell-free vehicular networks: A deep reinforcement learning approach, IEEE Transactions on Mobile Computing, 2020.

M. Chen, M. Mozaffari, W. Saad, C. Yin, M. Debbah et al., Caching in the sky: Proactive deployment of cache-enabled unmanned aerial vehicles for optimized quality-of-experience, IEEE Journal on Selected Areas in Communications, vol.35, issue.5, pp.1046-1061, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01781981

A. Goldsmith, Wireless communications, 2005.

A. Al-Hourani, S. Kandeepan, and S. Lardner, Optimal LAP altitude for maximum coverage, IEEE Wireless Communications Letters, vol.3, issue.6, pp.569-572, 2014.

Y. Zeng, J. Xu, and R. Zhang, Energy minimization for wireless communication with rotary-wing UAV, IEEE Transactions on Wireless Communications, vol.18, issue.4, pp.2329-2345, 2019.

T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez et al., Continuous control with deep reinforcement learning, 2015.

D. Dias and L. H. Costa, CRAWDAD dataset coppe-ufrj/riobuses (v, 2018.