Do Common Educational Datasets Contain Static Information? A Statistical Study - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2021

Do Common Educational Datasets Contain Static Information? A Statistical Study

Résumé

In Intelligent Tutoring Systems (ITS), methods to choose the next exercise for a student are inspired from generic recommender systems, used, for instance, in online shopping or multimedia recommendation. As such, collaborative filtering, especially matrix factorization, is often included as apart of recommendation algorithms in ITS. One notable difference in ITS is the rapid evolution of users, who improve their performance, as opposed to multimedia recommendation where preferences are more static. This raises the following question: how reliably can we use matrix factorization, a tool tried and tested in a static environment, in a context where timelines seem to be of importance. In this article we tried to quantify empirically how much information can be extracted statically from datasets in education versus datasets in multimedia, as the quality of such information is critical to be able to accurately make predictions and recommendations. We found that educational datasets contain less static information compared to multi-media datasets, to the extent that vectors of higher dimensions only marginally increase the precision of the matrix factorization compared to a 1-dimensional characterization.These results show that educational datasets must be used with time information, and warn against the dangers of directly trying to use existing algorithms developed for static datasets.
Fichier principal
Vignette du fichier
EDM21_theo_AI4HI_SUB.pdf (790.27 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03526276 , version 1 (14-01-2022)

Identifiants

  • HAL Id : hal-03526276 , version 1

Citer

Théo Barollet, Florent Bouchez-Tichadou, Fabrice Rastello. Do Common Educational Datasets Contain Static Information? A Statistical Study. EDM 2021 - Conference on Educational Data Mining, Jun 2021, Paris / Virtual, France. pp.1-7. ⟨hal-03526276⟩
70 Consultations
37 Téléchargements

Partager

Gmail Facebook X LinkedIn More