Deep Learning Transformer Models for Building a Comprehensive and Real-time Trauma Observatory: Development and Validation Study

Gabrielle Chenais; Cédric Gil-Jardiné; Hélène Touchais; Marta Avalos Fernandez; Benjamin Contrand; Eric Tellier; Xavier Combes; Loick Bourdois; Philippe Revel; Emmanuel Lagarde

doi:10.2196/40843

Journal article. JMIR AI. Year : 2023

Gabrielle Chenais

Function : Author
PersonId : 1196261
ORCID : 0000-0003-2006-6149

Bordeaux population health

Cédric Gil-Jardiné

Function : Author
PersonId : 1196262
ORCID : 0000-0001-5329-6405

Bordeaux population health

CHU Bordeaux

Hélène Touchais

Function : Author
PersonId : 1224814
ORCID : 0000-0003-1324-8542

Bordeaux population health

Marta Avalos Fernandez

Function : Author
PersonId : 742122
IdHAL : mavalosf
ORCID : 0000-0002-5471-2615
IdRef : 153689293

Statistics In System biology and Translational Medicine

Benjamin Contrand

Function : Author
PersonId : 1224815
ORCID : 0000-0002-2012-2676

Bordeaux population health

Eric Tellier

Function : Author
PersonId : 1224816
ORCID : 0000-0002-4627-5435

Bordeaux population health

Xavier Combes

Function : Author
PersonId : 1196263
ORCID : 0000-0001-6660-2168

Bordeaux population health

CHU Bordeaux

Loick Bourdois

Function : Author
PersonId : 1196264
ORCID : 0000-0003-0244-5591

Bordeaux population health

Philippe Revel

Function : Author
PersonId : 1224817
ORCID : 0000-0002-9221-9928

Roberval

Emmanuel Lagarde

Function : Author
PersonId : 1151175
ORCID : 0000-0001-8031-7400
IdRef : 110886410

Bordeaux population health

Abstract

Background: Public health surveillance relies on the collection of data, often in near-real time. Recent advances in natural language processing make it possible to envisage an automated system for extracting information from electronic health records.

Objective: To study the feasibility of setting up a national trauma observatory in France, we compared the performance of several automatic language processing methods in a multiclass classification task of unstructured clinical notes.

Methods: A total of 69,110 free-text clinical notes related to visits to the emergency departments of the University Hospital of Bordeaux, France, between 2012 and 2019 were manually annotated. Among these clinical notes, 32.5% (22,481/69,110) were traumas. We trained 4 transformer models (deep learning models that encompass attention mechanism) and compared them with the term frequency–inverse document frequency associated with the support vector machine method.

Results: The transformer models consistently performed better than the term frequency–inverse document frequency and a support vector machine. Among the transformers, the GPTanam model pretrained with a French corpus with an additional autosupervised learning step on 306,368 unlabeled clinical notes showed the best performance with a micro F1-score of 0.969.

Conclusions: The transformers proved efficient at the multiclass classification of narrative and medical data. Further steps for improvement should focus on the expansion of abbreviations and multioutput multiclass classification.
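The abstract reports model quality as a micro F1-score. As a minimal sketch of how that metric is computed for a multiclass task, the Python below pools true positives, false positives, and false negatives over all classes before computing F1. The function name `micro_f1` and the category labels are hypothetical and chosen for illustration only; the abstract does not list the study's classes, and this is not the authors' evaluation code.

```python
from collections import Counter


def micro_f1(y_true, y_pred):
    """Micro-averaged F1: pool TP, FP, and FN over all classes, then compute F1."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1   # correct prediction for this class
        else:
            fp[pred] += 1    # predicted class gains a false positive
            fn[truth] += 1   # true class gains a false negative

    total_tp = sum(tp.values())
    total_fp = sum(fp.values())
    total_fn = sum(fn.values())

    # F1 = 2*TP / (2*TP + FP + FN), using the pooled counts
    denom = 2 * total_tp + total_fp + total_fn
    return 2 * total_tp / denom if denom else 0.0


# Hypothetical labels, for illustration only (not the study's categories).
y_true = ["fall", "fall", "traffic", "assault", "non-trauma", "non-trauma"]
y_pred = ["fall", "traffic", "traffic", "assault", "non-trauma", "fall"]

score = micro_f1(y_true, y_pred)
print(f"micro F1 = {score:.3f}")
```

When each note receives exactly one label, every misclassification counts once as a false positive and once as a false negative, so the micro F1-score coincides with plain accuracy. That would no longer hold in the multioutput setting the conclusions mention as a next step.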

Keywords

deep learning; public health; trauma; emergencies; natural language processing; transformers

Domains

Artificial Intelligence [cs.AI]; Public health and epidemiology

Main file

Chenais et al 2023.pdf (302.07 KB)

Origin : Publisher files allowed on an open archive

Contributor : Marta Avalos

https://inria.hal.science/hal-03978442

Submitted on : Monday, October 16, 2023-12:17:52 PM

Last modification on : Wednesday, March 6, 2024-4:14:22 PM

Long-term archiving on: Wednesday, January 17, 2024-7:50:32 PM

Dates and versions

hal-03978442 , version 1 (16-10-2023)

Identifiers

HAL Id : hal-03978442 , version 1
DOI : 10.2196/40843

Cite

Gabrielle Chenais, Cédric Gil-Jardiné, Hélène Touchais, Marta Avalos Fernandez, Benjamin Contrand, et al. Deep Learning Transformer Models for Building a Comprehensive and Real-time Trauma Observatory: Development and Validation Study. JMIR AI, 2023, 2, pp.e40843. ⟨10.2196/40843⟩. ⟨hal-03978442⟩

Collections

INRIA UNIV-COMPIEGNE INRIA2 ROBERVAL U1219
