A Learning Algorithm for Boltzmann Machines*, Cognitive Science, vol.85, issue.1, pp.147-169, 1985. ,
DOI : 10.1207/s15516709cog0901_7
No unbiased estimator of the variance of k-fold cross-validation, Journal of Machine Learning Research, vol.5, pp.1089-1105, 2004. ,
Greedy layer-wise training of deep networks, Proc. of NIPS'07, pp.153-160, 2007. ,
Scaling learning algorithms towards ai', in Large-Scale Kernel Machines, 2007. ,
Curriculum learning, Proceedings of the 26th Annual International Conference on Machine Learning, ICML '09, 2009. ,
DOI : 10.1145/1553374.1553380
Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms, Neural Computation, vol.6, issue.7, 1998. ,
DOI : 10.1007/BF00058655
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.129.2536
An introduction to the bootstrap, of Monographs on Statistic and Applied Probability, 1993. ,
DOI : 10.1007/978-1-4899-4541-9
The difficulty of training deep architectures and the effect of unsupervised pre-training, Proc. of AISTATS'09, 2009. ,
The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2001. ,
Training Products of Experts by Minimizing Contrastive Divergence, Neural Computation, vol.22, issue.8, pp.1771-1800, 2002. ,
DOI : 10.1162/089976600300015385
A fast learning algorithm for deep belief nets', Neural Conputation, pp.1527-1554, 2006. ,
Reducing the Dimensionality of Data with Neural Networks, Science, vol.313, issue.5786, pp.313-504, 2006. ,
DOI : 10.1126/science.1127647
Exploring strategies for training deep neural networks', Journal of Machine Learning Research, vol.10, pp.1-40, 2009. ,
An empirical evaluation of deep architectures on problems with many factors of variation, Proceedings of the 24th international conference on Machine learning, ICML '07, pp.473-480, 2007. ,
DOI : 10.1145/1273496.1273556
Representational power of restricted boltzmann machines and deep belief networks, Neural Computation, vol.20, issue.6, pp.1631-1649, 2008. ,
Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations, Proceedings of the 26th Annual International Conference on Machine Learning, ICML '09, p.77, 2009. ,
DOI : 10.1145/1553374.1553453
Parallel neural computing based on network duplicating' , in Parallel Algorithms for Digital Image Processing, Computer Vision and Neural Networks, pp.305-340, 1993. ,
Estimating the number of clusters in a data set via the gap statistic, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.63, issue.2, pp.411-423, 2000. ,
DOI : 10.1111/1467-9868.00293
A New Learning Algorithm for Mean Field Boltzmann Machines, Proc. of the International Conference on Artificial Neural Networks (ICANN), 2002. ,
DOI : 10.1007/3-540-46084-5_57