P. Blanchet, Modular growing network architectures for td learning, From animals to animats 4, pp.343-352, 1996.

L. Bougrain and F. Alexandre, Unsupervised connectionist algorithms for clustering an environmental data set: A comparison, Neurocomputing, vol.28, issue.1-3, 1999.
DOI : 10.1016/S0925-2312(98)00123-4

URL : https://hal.archives-ouvertes.fr/inria-00099002

C. Boutilier, R. Dearden, and M. Goldszmidt, Stochastic dynamic programming with factored representations, Artificial Intelligence, vol.121, issue.1-2, pp.49-107, 2000.
DOI : 10.1016/S0004-3702(00)00033-3

URL : http://doi.org/10.1016/s0004-3702(00)00033-3

B. Digney, Emergent hierarchical control structures: Learning reactive /hierarchical relationships in reinforcement envi- ronments

B. Fritzke, A self-organizing network that can follow non-stationary distributions, ICANN'97: International Conference on Artificial Neural Networks, pp.613-618, 1997.
DOI : 10.1007/BFb0020222

M. Hauskrecht, N. Meuleau, L. Pack-kaelbling, T. Dean, and C. Boutilier, Hierarchical solution of Markov decision processes using macro-actions, pp.220-229

R. Jacobs, M. Jordan, and A. Barto, Task Decomposition Through Competition in a Modular Connectionist Architecture: The What and Where Vision Tasks, Cognitive Science, vol.I, issue.2
DOI : 10.1207/s15516709cog1502_2

L. Pack and K. , Hierarchical learning in stochastic domains: Preliminary results, International Conference on Machine Learning, pp.167-173, 1993.

J. Lange, H. Voigt, and D. Wolf, Growing artificial neural networks based on correlation measures, taske decomposition and local attention neurons, 1994.
DOI : 10.1109/icnn.1994.374482

P. Laroche, Processus Décisionnels de Markov appliqués à la planification sous incertitudes, 2000.

M. L. Littman, Algorithms for Sequential Decision Making, 1996.

J. Macqueen, Some methods of classification and analysis of multivariate observations, 1967.

R. Munos and A. Moore, Variable resolution discretization in optimal control, 1999.

R. Munos and A. Moore, Rates of convergence for variable resolution schemes in optimal control, International Conference on Machine Learning, 2000.

M. Puterman, Markov decision processes, 1994.
DOI : 10.1002/9780470316887

E. Rich and K. Knight, Artificial Intelligence, 1991.

A. Roe, S. Pallas, J. Hahm, and M. Sur, A map of visual space induced in primary auditory cortex, Science, vol.250, issue.4982, 1990.
DOI : 10.1126/science.2237432

R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, IEEE Transactions on Neural Networks, vol.9, issue.5, 1998.
DOI : 10.1109/TNN.1998.712192

G. Theocharous, K. Rohanimanesh, and S. Mahadevan, Learning and planning with hierarchical stochastic models for robot navigation, ICML 2000 Workshop on Machine Learning of Spatial Knowledge, 2000.