Concurrent probabilistic temporal planning with policy-gradients, ICAPS, pp.10-17, 2007. ,
Top-down induction of first-order logical decision trees, Artificial Intelligence, vol.101, issue.1-2, pp.285-297, 1998. ,
DOI : 10.1016/S0004-3702(98)00034-4
A survey of monte carlo tree search methods. Computational Intelligence and AI in Games, IEEE Transactions on, vol.4, issue.1, pp.1-43, 2012. ,
Relational reinforcement learning, Machine learning, vol.43, issue.12, pp.7-52, 2001. ,
DOI : 10.1007/BFb0027307
machine., The Annals of Statistics, vol.29, issue.5, pp.1189-1232, 2001. ,
DOI : 10.1214/aos/1013203451
Trial-based heuristic tree search for finite horizon MDPs, ICAPS, 2013. ,
Bandit Based Monte-Carlo Planning, Machine Learning: ECML 2006, pp.282-293, 2006. ,
DOI : 10.1007/11871842_29
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.102.1296
Exploration in relational domains for model-based reinforcement learning, The Journal of Machine Learning Research, vol.13, issue.1, pp.3725-3768, 2012. ,
Planning with durative actions in stochastic domains, J. Artif. Intell. Res.(JAIR), vol.31, pp.33-82, 2008. ,
Inverse reinforcement learning in relational domains, International Joint Conferences on Artificial Intelligence, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01154650
Gradient-based boosting for statistical relational learning: The relational dependency network case, Machine Learning, pp.25-56, 2012. ,
DOI : 10.1007/s10994-011-5244-9
Learning to take concurrent actions, Advances in neural information processing systems, pp.1619-1626, 2002. ,
Coarticulation, Proceedings of the 22nd international conference on Machine learning , ICML '05, pp.720-727, 2005. ,
DOI : 10.1145/1102351.1102442
Temporal planning with mutual exclusion reasoning, IJCAI, pp.326-337, 1999. ,
Policy generation for continuoustime stochastic domains with concurrency, ICAPS, p.325, 2004. ,
Learning planning rules in noisy stochastic worlds, AAAI, pp.911-918, 2005. ,