A comparison of predictive measures of problem difficulty for classification with Genetic Programming

Leonardo Trujillo 1 Yuliana Martinez 1 Edgar Galvan-Lopez 2 Pierrick Legrand 3, 4
2 School of Computer Science and Electronic Engineering
CSEE - School of Computer Science and Electronic Engineering [Essex]
4 ALEA - Advanced Learning Evolutionary Algorithms
Inria Bordeaux - Sud-Ouest, UB - Université de Bordeaux, CNRS - Centre National de la Recherche Scientifique : UMR5251
Abstract : In the field of Genetic Programming (GP) a question exists that is difficult to solve; how can problem difficulty be determined? In this paper the overall goal is to develop predictive tools that estimate how difficult a problem is for GP to solve. Here we analyse two groups of methods. We call the first group Evolvability Indicators (EI), measures that capture how amendable the fitness landscape is to a GP search. The second are Predictors of Expected Performance (PEP), models that take as input a set of descriptive attributes of a problem and predict the expected performance of a GP system. These predictive variables are domain specific thus problems are described in the context of the problem domain. This paper compares an EI, the Negative Slope Coefficient, and a PEP model for a GP classifier. Results suggest that the EI does not correlate with the performance of GP classifiers. Conversely, the PEP models show a high correlation with GP performance. It appears that while an EI estimates the difficulty of a search, it does not necessarily capture the difficulty of the underlying problem. However, while PEP models treat GP as a computational black-box, they can produce accurate performance predictions.
Complete list of metadatas

Cited literature [25 references]  Display  Hide  Download

https://hal.inria.fr/hal-00757363
Contributor : Pierrick Legrand <>
Submitted on : Monday, November 26, 2012 - 4:54:47 PM
Last modification on : Thursday, May 2, 2019 - 2:10:05 PM
Long-term archiving on: Wednesday, February 27, 2013 - 3:46:47 AM

File

ERA_2012_NSC.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00757363, version 1

Collections

CNRS | INRIA | IMB

Citation

Leonardo Trujillo, Yuliana Martinez, Edgar Galvan-Lopez, Pierrick Legrand. A comparison of predictive measures of problem difficulty for classification with Genetic Programming. ERA 2012, Nov 2012, Tijuana, Mexico. ⟨hal-00757363⟩

Share

Metrics

Record views

653

Files downloads

386