GraphiT: Encoding Graph Structure in Transformers

Grégoire Mialon; Dexiong Chen; Margot Selosse; Julien Mairal

Pré-Publication, Document De Travail Année : 2021

GraphiT: Encoding Graph Structure in Transformers

(1, 2) , (2) , (2) , (2)

1
2

Grégoire Mialon

Fonction : Auteur
PersonId : 1036976

Statistical Machine Learning and Parsimony

Apprentissage de modèles à partir de données massives

Dexiong Chen

Fonction : Auteur
PersonId : 1047920

Apprentissage de modèles à partir de données massives

Margot Selosse

Fonction : Auteur
PersonId : 1101748

Apprentissage de modèles à partir de données massives

Julien Mairal

Fonction : Auteur
PersonId : 1034832
ORCID : 0000-0001-6991-2110
IdRef : 152125256

Apprentissage de modèles à partir de données massives

Résumé

We show that viewing graphs as sets of node features and incorporating structural and positional information into a transformer architecture is able to outperform representations learned with classical graph neural networks (GNNs). Our model, GraphiT, encodes such information by (i) leveraging relative positional encoding strategies in self-attention scores based on positive definite kernels on graphs, and (ii) enumerating and encoding local sub-structures such as paths of short length. We thoroughly evaluate these two ideas on many classification and regression tasks, demonstrating the effectiveness of each of them independently, as well as their combination. In addition to performing well on standard benchmarks, our model also admits natural visualization mechanisms for interpreting graph motifs explaining the predictions, making it a potentially strong candidate for scientific applications where interpretation is important.

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

GraphiT.pdf (734.89 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Grégoire Mialon : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03256708

Soumis le : jeudi 10 juin 2021-14:04:36

Dernière modification le : vendredi 26 avril 2024-13:43:43

Archivage à long terme le : samedi 11 septembre 2021-18:52:51

Dates et versions

hal-03256708 , version 1 (10-06-2021)

Identifiants

HAL Id : hal-03256708 , version 1

Citer

Grégoire Mialon, Dexiong Chen, Margot Selosse, Julien Mairal. GraphiT: Encoding Graph Structure in Transformers. 2021. ⟨hal-03256708⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-PARIS UNIV-RENNES1 UGA CNRS INRIA IRISA INSMI LJK LJK_GI INRIA2 GENCI LJK-GI-THOTH PSL UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES MIAI ANR UR1-MATH-NUM

1824 Consultations

974 Téléchargements

GraphiT: Encoding Graph Structure in Transformers

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager