sign in
english version rss feed

inria-00000549, version 1

A Statistical Learning Theory Approach of Bloat

Olivier Teytaud 1, Sylvain Gelly 1, Nicolas Bredeche 1, Marc Schoenauer () 1

Genetic and Evolutionary Computation Conference (2005)

Abstract: Code bloat, the excessive increase of code size, is an important is- sue in Genetic Programming (GP). This paper proposes a theoreti- cal analysis of code bloat in the framework of symbolic regression in GP, from the viewpoint of Statistical Learning Theory, a well grounded mathematical toolbox for Machine Learning. Two kinds of bloat must be distinguished in that context, depending whether the target function lies in the search space or not. Then, important mathematical results are proved using classical results from Sta- tistical Learning. Namely, the Vapnik-Cervonenkis dimension of programs is computed, and further results from Statistical Learn- ing allow to prove that a parsimonious fitness ensures Universal Consistency (the solution minimizing the empirical error does con- verge to the best possible error when the number of samples goes to infinity). However, it is proved that the standard method consisting in choosing a maximal program size depending on the number of samples might still result in programs of infinitely increasing size whith their accuracy; a more complicated modification of the fit- ness is proposed that theoretically avoids unnecessary bloat while nevertheless preserving the Universal Consistency.

  • 1:  TAO (INRIA Futurs)
  • INRIA – CNRS : UMR8623 – Université Paris XI - Paris Sud
  • Domain : Computer Science/Learning
 
  • inria-00000549, version 1
  • oai:hal.inria.fr:inria-00000549
  • From: 
  • Submitted on: Wednesday, 2 November 2005 11:05:13
  • Updated on: Wednesday, 2 November 2005 11:08:44
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...