21830 articles – 15616 Notices  [english version]

hal-00705952, version 1

Clustering Trajectories of a Three-Way Longitudinal Dataset

Mireille Gettler Summa () 1, Bernard Goldfarb () 1, Maurizio Vichi () 2

Statistical Learning and data Science Taylor & Francis Group, Chapman & Hall (Ed.) (2012) 227

Résumé : Longitudinal data are widely used information for repeated observations of the same units over a period of time in order to investigate developmental trends across life span of units. Each object depicts, in the space of the features and of time, a trajectory describing its changes over time. Here trajectories are modeled according to three features: trend, velocity and acceleration. Clustering trajectories of a longitudinal data set is an important issue to assess similarities in the histories of the observed units that we fully discuss in this chapter. Starting from the Tucker model, widely used in psychometrics, we consider the optimal partition of trajectories that minimizes a distance accounting for trend, for velocity and for acceleration of trajectories. A Sequential Quadratic Programming algorithm is proposed to solve the clustering problem and its performance is evaluated by simulation

  • 1 :  CEntre de REcherches en MAthématiques de la DEcision (CEREMADE)
  • CNRS : UMR7534 – Université Paris IX - Paris Dauphine
  • 2 :  Dipartimento di Statistica
  • Università La Sapienza Roma
  • Domaine : Mathématiques/Statistiques
    Statistiques/Théorie
  • Mots-clés : Machine Learning – Statistical Methods – Data Mining – trajectories – T3Clus model – SQP algorithm
 
  • hal-00705952, version 1
  • oai:hal.archives-ouvertes.fr:hal-00705952
  • Contributeur : 
  • Soumis le : Lundi 11 Juin 2012, 11:03:37
  • Dernière modification le : Lundi 11 Juin 2012, 11:32:30