Codeplay: Autotelic Learning through Collaborative Self-Play in Programming Environments

Laetitia Teodorescu; Cédric Colas; Matthew Bowers; Thomas Carta; Pierre-Yves Oudeyer

Conference paper, Year: 2023


Laetitia Teodorescu

Role: Corresponding author
PersonId : 750783
IdHAL : laetitia-teodorescu

Flowing Epigenetic Robots and Systems

Cédric Colas

Role: Author
PersonId : 742663
IdHAL : cedric-colas
ORCID : 0000-0003-0212-427X

Flowing Epigenetic Robots and Systems

Massachusetts Institute of Technology

Matthew Bowers

Role: Author

Massachusetts Institute of Technology

Thomas Carta

Role: Author
PersonId : 750836
IdHAL : thomas-carta
ORCID : 0000-0003-3145-549X

Flowing Epigenetic Robots and Systems

Pierre-Yves Oudeyer

Role: Author
PersonId : 6675
IdHAL : pyoudeyer
ORCID : 0000-0002-9404-7613
IdRef : 081674481

Flowing Epigenetic Robots and Systems

Abstract

Autotelic learning is the training setup in which agents learn by setting their own goals and trying to achieve them. However, creatively generating freeform goals is challenging for autotelic agents. We present Codeplay, an algorithm that casts autotelic learning as a game between a Setter agent and a Solver agent: the Setter generates programming puzzles of appropriate difficulty and novelty for the Solver, and the Solver learns to solve them. Early experiments with the Setter demonstrate that one can effectively control the trade-off between the difficulty of a puzzle and its novelty by tuning the reward of the Setter, a code language model finetuned with deep reinforcement learning.
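The abstract does not give the Setter's reward function, so the following is only a minimal sketch of how a single weight could trade difficulty against novelty. Every name here (`difficulty_score`, `novelty_score`, `setter_reward`, `novelty_weight`, the 0.5 target solve rate, and the embedding-distance notion of novelty) is an assumption made for illustration and is not taken from the paper.

```python
import math
from typing import Sequence


def difficulty_score(solve_rate: float, target: float = 0.5) -> float:
    """Hypothetical 'appropriate difficulty' term.

    Returns 1.0 when the Solver's empirical solve rate equals `target`
    and decreases linearly to 0.0 at the solve rate farthest from it.
    """
    if not 0.0 <= solve_rate <= 1.0:
        raise ValueError("solve_rate must be in [0, 1]")
    worst_gap = max(target, 1.0 - target)
    return 1.0 - abs(solve_rate - target) / worst_gap


def novelty_score(
    embedding: Sequence[float],
    archive: Sequence[Sequence[float]],
) -> float:
    """Hypothetical novelty term.

    Uses the Euclidean distance to the nearest archived puzzle embedding,
    squashed into [0, 1). An empty archive counts as maximally novel.
    """
    if not archive:
        return 1.0
    nearest = min(math.dist(embedding, other) for other in archive)
    return nearest / (1.0 + nearest)


def setter_reward(
    solve_rate: float,
    embedding: Sequence[float],
    archive: Sequence[Sequence[float]],
    novelty_weight: float = 0.5,
) -> float:
    """Assumed convex combination of the two terms.

    novelty_weight = 0 rewards difficulty only; 1 rewards novelty only.
    """
    if not 0.0 <= novelty_weight <= 1.0:
        raise ValueError("novelty_weight must be in [0, 1]")
    return (
        (1.0 - novelty_weight) * difficulty_score(solve_rate)
        + novelty_weight * novelty_score(embedding, archive)
    )
```

In this sketch, raising `novelty_weight` pushes the Setter toward puzzles unlike those already in the archive, while lowering it favors puzzles the Solver solves about half the time. The actual reward shaping used in Codeplay may differ.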

Domains

Computer Science [cs]

Main file

_IMOL_Neurips2023__Codeplay-1.pdf (2.49 MB)

Origin: Files produced by the author(s)

https://hal.science/hal-04374993

Submitted on: Friday, 5 January 2024, 16:24:03

Last modified on: Thursday, 11 January 2024, 03:51:22

Dates and versions

hal-04374993, version 1 (05-01-2024)

Identifiers

HAL Id: hal-04374993, version 1

Cite

Laetitia Teodorescu, Cédric Colas, Matthew Bowers, Thomas Carta, Pierre-Yves Oudeyer. Codeplay: Autotelic Learning through Collaborative Self-Play in Programming Environments. IMOL 2023 - Intrinsically Motivated Open-ended Learning workshop at NeurIPS 2023, Dec 2023, New Orleans, United States. ⟨hal-04374993⟩

Collections

INRIA INRIA2
