Large Language Models as Instructors: A Study on Multilingual Clinical Entity Extraction

Simon Meoni; Theo Ryffel; Eric Villemonte de La Clergerie

doi:10.18653/v1/2023.bionlp-1.15

Communication Dans Un Congrès Année : 2023

Large Language Models as Instructors: A Study on Multilingual Clinical Entity Extraction

(1, 2) , (1) , (2)

1
2

Simon Meoni

Fonction : Auteur

Arkhn

Automatic Language Modelling and ANAlysis & Computational Humanities

Theo Ryffel

Fonction : Auteur

Arkhn

Eric Villemonte de La Clergerie

Fonction : Auteur
PersonId : 1179
IdHAL : eric-villemonte-de-la-clergerie

Automatic Language Modelling and ANAlysis & Computational Humanities

Résumé

In clinical and other specialized domains, data are scarce due to their confidential nature. This lack of data is a major problem when finetuning language models. Nevertheless, very large language models (LLMs) are promising for the medical domain but cannot be used directly in healthcare facilities due to data confidentiality issues. We explore an approach of annotating training data with LLMs to train smaller models more adapted to our problem. We show that this method yields promising results for information extraction tasks.

Domaines

Informatique [cs]

Fichier principal

2023.bionlp-1.15.pdf (840.38 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte
Licence : CC BY - Paternité

Eric Villemonte De La Clergerie : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04394012

Soumis le : lundi 15 janvier 2024-10:46:08

Dernière modification le : mercredi 24 janvier 2024-09:04:28

Dates et versions

hal-04394012 , version 1 (15-01-2024)

Identifiants

HAL Id : hal-04394012 , version 1
DOI : 10.18653/v1/2023.bionlp-1.15

Citer

Simon Meoni, Theo Ryffel, Eric Villemonte de La Clergerie. Large Language Models as Instructors: A Study on Multilingual Clinical Entity Extraction. The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, Jul 2023, Toronto, Canada. pp.178-190, ⟨10.18653/v1/2023.bionlp-1.15⟩. ⟨hal-04394012⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INRIA INRIA2

17 Consultations

19 Téléchargements

Large Language Models as Instructors: A Study on Multilingual Clinical Entity Extraction

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager