Harvesting models from web 2.0 databases

Oscar Diaz 1 Gorka Puente 1 Javier Cánovas 2, 3 Jesus Garcia Molina 4
2 ATLANMOD - Modeling Technologies for Software Production, Operation, and Evolution
LINA - Laboratoire d'Informatique de Nantes Atlantique, Département informatique - EMN, Inria Rennes – Bretagne Atlantique
4 Modelum
Modelum - Departamento de Informática y Sistemas [Murcia]
Abstract : Data rather than functionality are the sources of competitive advantage for Web2.0 applications such as wikis, blogs and social networking websites. This valuable information might need to be capitalized by third-party applications or be subject to migration or data analysis. Model-Driven Engineering (MDE) can be used for these purposes. However, MDE first requires obtaining models from the wiki/blog/website database (a.k.a. model harvesting). This can be achieved through SQL scripts embedded in a program. However, this approach leads to laborious code that exposes the iterations and table joins that serve to build the model. By contrast, a Domain-Specific Language (DSL) can hide these "how" concerns, leaving the designer to focus on the "what", i.e. the mapping of database schemas to model classes. This paper introduces Schemol, a DSL tailored for extracting models out of databases which considers Web2.0 specifics. Web2.0 applications are often built on top of general frameworks (a.k.a. engines) that set the database schema (e.g.,MediaWiki, Blojsom). Hence, table names offer little help in automating the extraction process. In addition, Web2.0 data tend to be annotated. User-provided data (e.g., wiki articles, blog entries) might contain semantic markups which provide helpful hints for model extraction. Unfortunately, these data end up being stored as opaque strings. Therefore, there exists a considerable conceptual gap between the source database and the target metamodel. Schemol offers extractive functions and view-like mechanisms to confront these issues. Examples using Blojsom as the blog engine are available for download.
Document type :
Journal articles
Liste complète des métadonnées

Cited literature [29 references]  Display  Hide  Download

https://hal.inria.fr/hal-00823323
Contributor : Javier Canovas <>
Submitted on : Thursday, May 16, 2013 - 4:04:22 PM
Last modification on : Thursday, February 7, 2019 - 2:27:35 PM
Document(s) archivé(s) le : Saturday, August 17, 2013 - 5:05:23 AM

File

09SchemolFinal.pdf
Files produced by the author(s)

Identifiers

Citation

Oscar Diaz, Gorka Puente, Javier Cánovas, Jesus Garcia Molina. Harvesting models from web 2.0 databases. Software & Systems Modeling, Springer Verlag, 2013, 12 (1), pp.15-34. ⟨10.1007/s10270-011-0194-z⟩. ⟨hal-00823323⟩

Share

Metrics

Record views

680

Files downloads

523