21791 articles – 15600 references  [version française]

hal-00705952, version 1

Clustering Trajectories of a Three-Way Longitudinal Dataset

Mireille Gettler Summa () 1, Bernard Goldfarb () 1, Maurizio Vichi () 2

Statistical Learning and data Science Taylor & Francis Group, Chapman & Hall (Ed.) (2012) 227

Abstract: Longitudinal data are widely used information for repeated observations of the same units over a period of time in order to investigate developmental trends across life span of units. Each object depicts, in the space of the features and of time, a trajectory describing its changes over time. Here trajectories are modeled according to three features: trend, velocity and acceleration. Clustering trajectories of a longitudinal data set is an important issue to assess similarities in the histories of the observed units that we fully discuss in this chapter. Starting from the Tucker model, widely used in psychometrics, we consider the optimal partition of trajectories that minimizes a distance accounting for trend, for velocity and for acceleration of trajectories. A Sequential Quadratic Programming algorithm is proposed to solve the clustering problem and its performance is evaluated by simulation

  • 1:  CEntre de REcherches en MAthématiques de la DEcision (CEREMADE)
  • CNRS : UMR7534 – Université Paris IX - Paris Dauphine
  • 2:  Dipartimento di Statistica
  • Università La Sapienza Roma
  • Domain : Mathematics/Statistics
    Statistics/Statistics Theory
  • Keywords : Machine Learning – Statistical Methods – Data Mining – trajectories – T3Clus model – SQP algorithm
 
  • hal-00705952, version 1
  • oai:hal.archives-ouvertes.fr:hal-00705952
  • From: 
  • Submitted on: Monday, 11 June 2012 11:03:37
  • Updated on: Monday, 11 June 2012 11:32:30