Improving patient clustering by incorporating structured label relationships in similarity measures - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2023

Improving patient clustering by incorporating structured label relationships in similarity measures

Résumé

Context Patient stratification is the cornerstone of numerous health studies, serving to enhance medicine efficacy estimation and facilitate patient matching. To stratify patients, similarity measured between patients can be computed from medical health records databases, such as medico-administrative databases. Importantly, the variables included in medico-administrative databases can be associated with labels, which can be organized in ontologies or other classification systems. However, to the best of our knowledge, the relevance of considering such label classification in the computation of patient similarity measures has been poorly studied. Objective We propose and evaluate several weighted versions of the Cosine similarity that consider structured label relationships to compute patient similarities from a medico-administrative database. Material and Methods As a use case, we analyze medicine reimbursements contained in the Échantillon Généraliste des Bénéficiaires , a French medico-administrative database. We compute the standard Cosine similarity between patients based on their medicine reimbursement. In addition, we computed a weighted Cosine similarity measure that includes variable frequencies and two weighted Cosine similarity measures that consider label relationships. We construct patient networks from each similarity measure and identify clusters of patients. We evaluate the performance of the different similarity measures with enrichment tests using information on chronic diseases. Results The similarity measures that include label relationships perform better to identify similar patients. Indeed, using these weighted measures, we identify distinct patient clusters with a higher number of chronic disease enrichments as compared to the other measures. Importantly, the enrichment tests provide clinically interpretable insights into these patient clusters. Conclusion Considering label relationships when computing patient similarities improves stratification of patients regarding their health status.
Fichier principal
Vignette du fichier
BMC_improving_patient_clustering.pdf (1.02 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-04208019 , version 1 (15-09-2023)

Identifiants

Citer

Judith Lambert, Anne-Louise Leutenegger, Anaïs Baudot, Anne-Sophie Jannot. Improving patient clustering by incorporating structured label relationships in similarity measures. 2023. ⟨hal-04208019⟩
36 Consultations
23 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More