Harvesting models from web 2.0 databases

Oscar Diaz 1 Gorka Puente 1 Javier Cánovas 2, 3 Jesus Garcia Molina 4
2 ATLANMOD - Modeling Technologies for Software Production, Operation, and Evolution
LINA - Laboratoire d'Informatique de Nantes Atlantique, Département informatique - EMN, Inria Rennes – Bretagne Atlantique
4 Modelum
Modelum - Departamento de Informática y Sistemas [Murcia]
Abstract : Data rather than functionality are the sources of competitive advantage for Web2.0 applications such as wikis, blogs and social networking websites. This valuable information might need to be capitalized by third-party applications or be subject to migration or data analysis. Model-Driven Engineering (MDE) can be used for these purposes. However, MDE first requires obtaining models from the wiki/blog/website database (a.k.a. model harvesting). This can be achieved through SQL scripts embedded in a program. However, this approach leads to laborious code that exposes the iterations and table joins that serve to build the model. By contrast, a Domain-Specific Language (DSL) can hide these "how" concerns, leaving the designer to focus on the "what", i.e. the mapping of database schemas to model classes. This paper introduces Schemol, a DSL tailored for extracting models out of databases which considers Web2.0 specifics. Web2.0 applications are often built on top of general frameworks (a.k.a. engines) that set the database schema (e.g.,MediaWiki, Blojsom). Hence, table names offer little help in automating the extraction process. In addition, Web2.0 data tend to be annotated. User-provided data (e.g., wiki articles, blog entries) might contain semantic markups which provide helpful hints for model extraction. Unfortunately, these data end up being stored as opaque strings. Therefore, there exists a considerable conceptual gap between the source database and the target metamodel. Schemol offers extractive functions and view-like mechanisms to confront these issues. Examples using Blojsom as the blog engine are available for download.
Type de document :
Article dans une revue
Software and Systems Modeling, Springer Verlag, 2013, 12 (1), pp.15-34. 〈10.1007/s10270-011-0194-z〉
Liste complète des métadonnées

Littérature citée [29 références]  Voir  Masquer  Télécharger

Contributeur : Javier Canovas <>
Soumis le : jeudi 16 mai 2013 - 16:04:22
Dernière modification le : vendredi 22 juin 2018 - 09:34:14
Document(s) archivé(s) le : samedi 17 août 2013 - 05:05:23


Fichiers produits par l'(les) auteur(s)



Oscar Diaz, Gorka Puente, Javier Cánovas, Jesus Garcia Molina. Harvesting models from web 2.0 databases. Software and Systems Modeling, Springer Verlag, 2013, 12 (1), pp.15-34. 〈10.1007/s10270-011-0194-z〉. 〈hal-00823323〉



Consultations de la notice


Téléchargements de fichiers