CopyCAT: Taking Control of Neural Policies with Constant Attacks

Léonard Hussenot 1,2, Matthieu Geist 1, Olivier Pietquin 1
2: Scool, Inria Lille - Nord Europe, CRIStAL - Centre de Recherche en Informatique, Signal et Automatique de Lille (UMR 9189)
Abstract: We propose a new perspective on adversarial attacks against deep reinforcement learning agents. Our main contribution is CopyCAT, a targeted attack able to consistently lure an agent into following an outsider's policy. It is pre-computed and therefore fast to apply, which makes it usable in a real-time scenario. We show its effectiveness on Atari 2600 games in the novel read-only setting. In this setting, the adversary cannot directly modify the agent's state -- its representation of the environment -- but can only attack the agent's observation -- its perception of the environment. Directly modifying the agent's state would require write access to the agent's inner workings, and we argue that this assumption is too strong in realistic settings.
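The abstract describes the attack only at a high level: a perturbation is optimized once, offline, and then merely added to the agent's observation at deployment, which is what makes it fast enough for real time. The sketch below is a minimal PyTorch rendering of that idea under stated assumptions, not the authors' implementation; every name in it (precompute_mask, agent, epsilon, n_steps, outsider_policy) is hypothetical.

    import torch
    import torch.nn.functional as F

    def precompute_mask(agent, observations, target_action,
                        epsilon=0.05, lr=0.01, n_steps=500):
        # Optimize one constant additive perturbation (a "mask") that
        # pushes the agent's policy toward target_action on a whole batch
        # of observations. agent maps observations to action logits.
        delta = torch.zeros_like(observations[0], requires_grad=True)
        opt = torch.optim.Adam([delta], lr=lr)
        labels = torch.full((observations.shape[0],), target_action,
                            dtype=torch.long)
        for _ in range(n_steps):
            logits = agent(torch.clamp(observations + delta, 0.0, 1.0))
            loss = F.cross_entropy(logits, labels)
            opt.zero_grad()
            loss.backward()
            opt.step()
            with torch.no_grad():
                delta.clamp_(-epsilon, epsilon)  # bound the perturbation
        return delta.detach()

    # Offline: one mask per action the outsider's policy may request.
    # Online (read-only setting): only the observation is altered, never
    # the agent's internal state.
    #   masks = {a: precompute_mask(agent, obs_batch, a)
    #            for a in range(n_actions)}
    #   attacked_obs = torch.clamp(obs + masks[outsider_policy(obs)],
    #                              0.0, 1.0)

Because the masks are computed ahead of time, applying one at deployment is a single lookup and addition, consistent with the abstract's claim that the attack is usable in a real-time scenario.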
Document type: Conference papers

https://hal.inria.fr/hal-03162124
Contributor: Léonard Hussenot
Submitted on: Monday, March 8, 2021 - 4:03:45 PM
Last modification on: Wednesday, March 10, 2021 - 3:31:32 AM

File

  • 1905.12282.pdf (files produced by the author(s))

Identifiers

  • HAL Id: hal-03162124, version 1
  • ARXIV: 1905.12282

Citation

Léonard Hussenot, Matthieu Geist, Olivier Pietquin. CopyCAT: Taking Control of Neural Policies with Constant Attacks. AAMAS 2020 - 19th International Conference on Autonomous Agents and Multi-Agent Systems, May 2020, Virtual, New Zealand. ⟨hal-03162124⟩
