Harvesting models from web 2.0 databases - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue Software and Systems Modeling Année : 2013

Harvesting models from web 2.0 databases

Oscar Diaz
  • Fonction : Auteur
  • PersonId : 941426
Gorka Puente
  • Fonction : Auteur
  • PersonId : 941427
Jesus Garcia Molina
  • Fonction : Auteur
  • PersonId : 914122

Résumé

Data rather than functionality are the sources of competitive advantage for Web2.0 applications such as wikis, blogs and social networking websites. This valuable information might need to be capitalized by third-party applications or be subject to migration or data analysis. Model-Driven Engineering (MDE) can be used for these purposes. However, MDE first requires obtaining models from the wiki/blog/website database (a.k.a. model harvesting). This can be achieved through SQL scripts embedded in a program. However, this approach leads to laborious code that exposes the iterations and table joins that serve to build the model. By contrast, a Domain-Specific Language (DSL) can hide these "how" concerns, leaving the designer to focus on the "what", i.e. the mapping of database schemas to model classes. This paper introduces Schemol, a DSL tailored for extracting models out of databases which considers Web2.0 specifics. Web2.0 applications are often built on top of general frameworks (a.k.a. engines) that set the database schema (e.g.,MediaWiki, Blojsom). Hence, table names offer little help in automating the extraction process. In addition, Web2.0 data tend to be annotated. User-provided data (e.g., wiki articles, blog entries) might contain semantic markups which provide helpful hints for model extraction. Unfortunately, these data end up being stored as opaque strings. Therefore, there exists a considerable conceptual gap between the source database and the target metamodel. Schemol offers extractive functions and view-like mechanisms to confront these issues. Examples using Blojsom as the blog engine are available for download.

Domaines

Autre [cs.OH]
Fichier principal
Vignette du fichier
09SchemolFinal.pdf (2.55 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00823323 , version 1 (16-05-2013)

Identifiants

Citer

Oscar Diaz, Gorka Puente, Javier Cánovas, Jesus Garcia Molina. Harvesting models from web 2.0 databases. Software and Systems Modeling, 2013, 12 (1), pp.15-34. ⟨10.1007/s10270-011-0194-z⟩. ⟨hal-00823323⟩
494 Consultations
459 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More