PhenoLines: Phenotype Comparison Visualizations for Disease Subtyping via Topic Models - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2017

PhenoLines: Phenotype Comparison Visualizations for Disease Subtyping via Topic Models

Résumé

—PhenoLines is a visual analysis tool for the interpretation of disease subtypes, derived from the application of topic models to clinical data. Topic models enable one to mine cross-sectional patient comorbidity data (e.g., electronic health records) and construct disease subtypes—each with its own temporally evolving prevalence and co-occurrence of phenotypes—without requiring aligned longitudinal phenotype data for all patients. However, the dimensionality of topic models makes interpretation challenging, and de facto analyses provide little intuition regarding phenotype relevance or phenotype interrelationships. PhenoLines enables one to compare phenotype prevalence within and across disease subtype topics, thus supporting subtype characterization, a task that involves identifying a proposed subtype’s dominant phenotypes, ages of effect, and clinical validity. We contribute a data transformation workflow that employs the Human Phenotype Ontology to hierarchically organize phenotypes and aggregate the evolving probabilities produced by topic models. We introduce a novel measure of phenotype relevance that can be used to simplify the resulting topology. The design of PhenoLines was motivated by formative interviews with machine learning and clinical experts. We describe the co-operative design process, distill high-level tasks, and report on initial evaluations with machine learning experts and a medical domain expert. These results suggest that PhenoLines demonstrates promising approaches to support the characterization and optimization of topic models.
Fichier non déposé

Dates et versions

hal-01565556 , version 1 (19-07-2017)

Identifiants

  • HAL Id : hal-01565556 , version 1

Citer

Michael Glueck, Mahdi Pakdaman Naeini, Finale Doshi-Velez, Fanny Chevalier, Azam Khan, et al.. PhenoLines: Phenotype Comparison Visualizations for Disease Subtyping via Topic Models. IEEE Conference on Visual Analytics Science and Technology (IEEE VAST 2017), 2017, Phoenix, United States. ⟨hal-01565556⟩
260 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More