A. A. and M. R. Szepesvári-c, Fitted Q-iteration in continuous action-space MDPs, NIPS, 2007.

B. C. Mollon-r and T. D. Deleage-g, Grid Deployment of Legacy Bioinformatics Applications with Transparent Data Access, 7th IEEE/ACM International Conference on Grid computing, pp.120-127, 2006.

C. D. Mcgough-a, The gridcc project, International Conference on Communication System Software and Middleware, pp.1-4, 2006.

E. D. and G. P. Wehenkel-l, Tree-based batch mode reinforcement learning, Journal of Machine Learning Research, vol.6, pp.503-556, 2005.

L. E. Al, Programming the Grid with gLite, 2006.

M. J. Mo´scicki, L. H. Bubak-m, and M. A. Sloot-p, Quality of service on the grid with user level scheduling, Cracow Grid Workshop, pp.119-129, 2007.

S. R. Barto-a, Reinforcement Learning, 1998.
DOI : 10.1016/B978-012526430-3/50003-9