The Ultimate Debian Database: Consolidating Bazaar Metadata for Quality Assurance and Data Mining

Lucas Nussbaum 1 Stefano Zacchiroli 2
1 ALGORILLE - Algorithms for the Grid
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : FLOSS distributions like RedHat and Ubuntu require a lot more complex infrastructures than most other FLOSS projects. In the case of community-driven distributions like Debian, the development of such an infrastructure is often not very organized, leading to new data sources being added in an impromptu manner while hackers set up new services that gain acceptance in the community. Mixing and matching data is then harder than should be, albeit being badly needed for Quality Assurance and data mining. Massive refactoring and integration is not a viable solution either, due to the constraints imposed by the bazaar development model. This paper presents the Ultimate Debian Database (UDD), which is the countermeasure adopted by the Debian project to the above ``data hell''. UDD gathers data from various data sources into a single, central SQL database, turning Quality Assurance needs that could not be easily implemented before into simple SQL queries. The paper also discusses the customs that have contributed to the data hell, the lessons learnt while designing UDD, and its applications and potentialities for data mining on FLOSS distributions.
Type de document :
Communication dans un congrès
7th IEEE Working Conference on Mining Software Repositories (MSR'2010), May 2010, Cape Town, South Africa. 2010
Liste complète des métadonnées

Littérature citée [21 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00502886
Contributeur : Lucas Nussbaum <>
Soumis le : vendredi 16 juillet 2010 - 08:47:34
Dernière modification le : jeudi 15 novembre 2018 - 20:26:59
Document(s) archivé(s) le : mardi 23 octobre 2012 - 10:26:38

Fichier

msr2010-udd.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00502886, version 1

Collections

Citation

Lucas Nussbaum, Stefano Zacchiroli. The Ultimate Debian Database: Consolidating Bazaar Metadata for Quality Assurance and Data Mining. 7th IEEE Working Conference on Mining Software Repositories (MSR'2010), May 2010, Cape Town, South Africa. 2010. 〈inria-00502886〉

Partager

Métriques

Consultations de la notice

603

Téléchargements de fichiers

277