RANSAC-GP: Dealing with Outliers in Symbolic Regression with Genetic Programming

Uriel Lopez; Leonardo Trujillo; Yuliana Martinez; Pierrick Legrand; Enrique Naredo; Sara Silva

Chapitre D'ouvrage Année : 2017

RANSAC-GP: Dealing with Outliers in Symbolic Regression with Genetic Programming

(1) , (1) , (1) , (2, 3, 4) , (5) , (6, 7)

1
2
3
4
5
6
7

Uriel Lopez

Fonction : Auteur

Instituto Tecnológico de Tijuana = Tijuana Institute of Technology [Tijuana]

Leonardo Trujillo

Fonction : Auteur

Instituto Tecnológico de Tijuana = Tijuana Institute of Technology [Tijuana]

Yuliana Martinez

Fonction : Auteur

Instituto Tecnológico de Tijuana = Tijuana Institute of Technology [Tijuana]

Pierrick Legrand

Fonction : Auteur
PersonId : 174782
IdHAL : legrand

Université de Bordeaux

Quality control and dynamic reliability

Institut de Mathématiques de Bordeaux

Enrique Naredo

Fonction : Auteur

Laboratorio Nacional de Geointeligencia

Sara Silva

Fonction : Auteur

Universidade de Lisboa = University of Lisbon

University of Coimbra [Portugal]

Résumé

Genetic programming (GP) has been shown to be a powerful tool for automatic modeling and program induction. It is often used to solve difficult symbolic regression tasks, with many examples in real-world domains. However, the robustness of GP-based approaches has not been substantially studied. In particular, the present work deals with the issue of outliers, data in the training set that represent severe errors in the measuring process. In general, a datum is considered an outlier when it sharply deviates from the true behavior of the system of interest. GP practitioners know that such data points usually bias the search and produce inaccurate models. Therefore, this work presents a hybrid methodology based on the RAndom SAmpling Consensus (RANSAC) algorithm and GP, which we call RANSAC-GP. RANSAC is an approach to deal with outliers in parameter estimation problems, widely used in computer vision and related fields. On the other hand, this work presents the first application of RANSAC to symbolic regression with GP, with impressive results. The proposed algorithm is able to deal with extreme amounts of contamination in the training set, evolving highly accurate models even when the amount of outliers reaches 90%.

Domaines

Intelligence artificielle [cs.AI]

Pierrick Legrand : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01911448

Soumis le : vendredi 2 novembre 2018-17:52:10

Dernière modification le : vendredi 26 avril 2024-14:01:07

Dates et versions

hal-01911448 , version 1 (02-11-2018)

Identifiants

HAL Id : hal-01911448 , version 1

Citer

Uriel Lopez, Leonardo Trujillo, Yuliana Martinez, Pierrick Legrand, Enrique Naredo, et al.. RANSAC-GP: Dealing with Outliers in Symbolic Regression with Genetic Programming. Genetic Programming. EuroGP 2017. Lecture Notes in Computer Science, vol 10196. Springer, Cham, Springer, 2017. ⟨hal-01911448⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA IMB INSMI INRIA2

68 Consultations

0 Téléchargements

RANSAC-GP: Dealing with Outliers in Symbolic Regression with Genetic Programming

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager