Skip to Main content Skip to Navigation
Journal articles

Harvesting models from web 2.0 databases

Oscar Diaz 1 Gorka Puente 1 Javier Cánovas 2, 3 Jesus Garcia Molina 4 
2 ATLANMOD - Modeling Technologies for Software Production, Operation, and Evolution
LINA - Laboratoire d'Informatique de Nantes Atlantique, Département informatique - EMN, Inria Rennes – Bretagne Atlantique
4 Modelum
Modelum - Departamento de Informática y Sistemas [Murcia]
Abstract : Data rather than functionality are the sources of competitive advantage for Web2.0 applications such as wikis, blogs and social networking websites. This valuable information might need to be capitalized by third-party applications or be subject to migration or data analysis. Model-Driven Engineering (MDE) can be used for these purposes. However, MDE first requires obtaining models from the wiki/blog/website database (a.k.a. model harvesting). This can be achieved through SQL scripts embedded in a program. However, this approach leads to laborious code that exposes the iterations and table joins that serve to build the model. By contrast, a Domain-Specific Language (DSL) can hide these "how" concerns, leaving the designer to focus on the "what", i.e. the mapping of database schemas to model classes. This paper introduces Schemol, a DSL tailored for extracting models out of databases which considers Web2.0 specifics. Web2.0 applications are often built on top of general frameworks (a.k.a. engines) that set the database schema (e.g.,MediaWiki, Blojsom). Hence, table names offer little help in automating the extraction process. In addition, Web2.0 data tend to be annotated. User-provided data (e.g., wiki articles, blog entries) might contain semantic markups which provide helpful hints for model extraction. Unfortunately, these data end up being stored as opaque strings. Therefore, there exists a considerable conceptual gap between the source database and the target metamodel. Schemol offers extractive functions and view-like mechanisms to confront these issues. Examples using Blojsom as the blog engine are available for download.
Document type :
Journal articles
Complete list of metadata

Cited literature [29 references]  Display  Hide  Download
Contributor : Javier Canovas Connect in order to contact the contributor
Submitted on : Thursday, May 16, 2013 - 4:04:22 PM
Last modification on : Wednesday, April 27, 2022 - 3:59:27 AM
Long-term archiving on: : Saturday, August 17, 2013 - 5:05:23 AM


Files produced by the author(s)



Oscar Diaz, Gorka Puente, Javier Cánovas, Jesus Garcia Molina. Harvesting models from web 2.0 databases. Software and Systems Modeling, Springer Verlag, 2013, 12 (1), pp.15-34. ⟨10.1007/s10270-011-0194-z⟩. ⟨hal-00823323⟩



Record views


Files downloads