Reinforcement learning: an introduction, 2018. ,
The complexity of markov decision processes, Mathematics of Operations Research, vol.12, issue.3, pp.441-450, 1987. ,
TensorFlow: A System for Large-Scale Machine Learning, Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI '16). USENIX Association, 2016. ,
, Kapazitätsplanung mit SAP ®, 2014.
Supply Chain Management with APO: Structures, Modelling Approaches and Implementation Pecularities. 3 rd edn, 2009. ,
Hands-On Machine Learning with Scikit-Learn and TensorFlow. O'Reilly Media, 2017. ,
, Das intelligente Unternehmen: Maschinelles Lernen mit SAP zielgerichtet einsetzen, pp.51-62, 2019.
Tensorforce: a TensorFlow library for applied reinforcement learning, 2017. ,
, LIFT: Reinforcement Learning in Computer Systems by Learning from Demonstrations, 2018.
, TensorForce Documentation Release 0.3.3. media.readthedocs, 2018.
Human-level control through deep reinforcement learning, Nature, vol.518, pp.529-533, 2015. ,
, Continuous Deep Q-Learning with Modelbased Acceleration, 2016.
, Deep Reinforcement Learning with Double Qlearning, 2015.
Simple statistical gradient-following algorithms for connectionist reinforcement learning, Machine learning, vol.8, issue.3-4, pp.229-256, 1992. ,
Trust region policy optimization, Proceedings of the 32nd International Conference on Machine Learning (ICML-15), pp.1889-1897, 2017. ,
, Proximal Policy Optimization Algorithms, 2017.
, Asynchronous Methods for Deep Reinforcement Learning, 2016.
, Continuous control with deep reinforcement learning, 2016.
Visualizing Dataflow Graphs of Learning Models in TensorFlow, IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, vol.24, issue.1, pp.1-12, 2018. ,
, The Tensor Board repository on GitHub