Expérimentations autour des architectures d'apprentissage par transfert pour l'extraction de relations biomédicales

Walid Hafiane; Joël Legrand; Yannick Toussaint; Adrien Coulet

Communication Dans Un Congrès Année : 2021

Experiments on transfer learning architectures for biomedical relation extraction

Expérimentations autour des architectures d'apprentissage par transfert pour l'extraction de relations biomédicales

(1) , (2) , (1) , (1, 3)

1
2
3

Walid Hafiane

Fonction : Auteur

Knowledge representation, reasonning

Joël Legrand

Fonction : Auteur
PersonId : 169652
IdHAL : joel-legrand
ORCID : 0000-0002-6508-7421

Natural Language Processing : representations, inference and semantics

Yannick Toussaint

Fonction : Auteur

Knowledge representation, reasonning

Adrien Coulet

Fonction : Auteur
PersonId : 1742
IdHAL : coulet
ORCID : 0000-0002-1466-062X
IdRef : 134214099

Knowledge representation, reasonning

Health data- and model- driven Knowledge Acquisition

Résumé

Relation extraction (RE) consists in identifying and structuring automatically relations of interest from texts. Recently, BERT improved the top performances for several NLP tasks, including RE. However, the best way to use BERT, within a machine learning architecture, and within a transfer learning strategy is still an open question since it is highly dependent on each specific task and domain. Here, we explore various BERT-based architectures and transfer learning strategies (i.e., frozen or fine-tuned) for the task of biomedical RE on two corpora. Among tested architectures and strategies, our *BERT-segMCNN with fine-tuning reaches performances higher than the state-of-the-art on the two corpora (1.73 % and 32.77 % absolute improvement on ChemProt and PGxCorpus corpora respectively). More generally, our experiments illustrate the expected interest of fine-tuning with BERT, but also the unexplored advantage of using structural information (with sentence segmentation), in addition to the context classically leveraged by BERT.

L'extraction de relations (ER) consiste à identifier et à structurer automatiquement des relations à partir de textes. Récemment, BERT a permis d'améliorer les performances de plusieurs tâches de TAL, dont l'ER. Cependant, la meilleure façon d'utiliser BERT, dans une architecture d'apprentissage automatique avec une stratégie par transfert reste une question ouverte, car elle dépend à la fois de la tâche et du domaine d'application. Dans ce travail, nous explorons diverses architectures d'ER qui s'appuient sur BERT et deux stratégies de transfert (gel des poids ou réglage fin) sur deux corpus biomédicaux. Parmi les architectures et stratégies de transfert testées, *BERT-segMCNN avec réglage fin atteint des performances supérieures à l'état de l'art sur les deux corpus (amélioration absolue de 1,73% et 32,77% sur ChemProt et PGxCorpus respectivement). Nos expériences illustrent l'intérêt attendu du réglage fin avec BERT, et de façon plus originale l'intérêt d'ajouter aux représentations de BERT une information structurelle en considérant la segmentation des phrases.

Domaines

Intelligence artificielle [cs.AI] Bio-informatique [q-bio.QM] Apprentissage [cs.LG] Traitement du texte et du document

Fichier principal

hafiane_et_al_fr.pdf (806.65 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Adrien Coulet : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-03073601

Soumis le : mercredi 16 décembre 2020-11:23:58

Dernière modification le : vendredi 19 avril 2024-16:19:00

Archivage à long terme le : mercredi 17 mars 2021-18:56:25

Dates et versions

hal-03073601 , version 1 (16-12-2020)

Identifiants

HAL Id : hal-03073601 , version 1

Citer

Walid Hafiane, Joël Legrand, Yannick Toussaint, Adrien Coulet. Expérimentations autour des architectures d'apprentissage par transfert pour l'extraction de relations biomédicales. EGC 2021 - 21ème édition de la conférence "Extraction et Gestion des Connaissances", Jan 2021, Montpellier / Virtuel, France. ⟨hal-03073601⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSERM EPHE CNRS INRIA GRID5000 CORDELIERS UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD PSL SORBONNE-UNIVERSITE SU-SCIENCES UP-SANTE SILECS ANR HUB-IA

268 Consultations

324 Téléchargements

Experiments on transfer learning architectures for biomedical relation extraction

Expérimentations autour des architectures d'apprentissage par transfert pour l'extraction de relations biomédicales

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager