Data Integration through DL-LiteA Ontologies

Diego Calvanese, Giuseppe De Giacomo, Domenico Lembo, Maurizio Lenzerini, Antonella Poggi, Riccardo Rosati, and Marco Ruzzi

Revised Selected Papers of the 3rd Int. Workshop on Semantics in Data and Knowledge Bases (SDKB 2008). Volume 4925 of Lecture Notes in Computer Science. 2008.

The goal of data integration is to provide a uniform access to a set of heterogeneous data sources, freeing the user from the knowledge about where the data are, how they are stored, and how they can be accessed. One of the outcomes of the research work carried out on data integration in the last years is a clear conceptual architecture, comprising a global schema, the source schema, and the mapping between the source and the global schema. In this paper, we present a comprehensive approach to, and a complete system for, ontology-based data integration. In this system, the global schema is expressed in terms of a TBox of the tractable Description Logics DL-LiteA , the sources are relations, and the mapping language allows for expressing GAV sound mappings between the sources and the global schema. The mapping language has specific mechanisms for addressing the so-called impedance mismatch problem, arising from the fact that, while the data sources store values, the instances of concepts in the ontology are objects. By virtue of the careful design of the various languages used in our system, answering unions of conjunctive queries can be done through a very efficient technique (LOGSPACE with respect to data complexity) which reduces this task to standard SQL query evaluation. We also show that even very slight extensions of the expressive abilities of our system lead beyond this complexity bound.


@inproceedings{SDKB-2008,
   title = "Data Integration through DL-LiteA   Ontologies",
   year = "2008",
   author = "Diego Calvanese and De Giacomo, Giuseppe and Domenico
Lembo and Maurizio Lenzerini and Antonella Poggi and Riccardo Rosati and
Marco Ruzzi",
   editor = "Klaus-Dieter Schewe and Bernhard Thalheim",
   booktitle = "Revised Selected Papers of the 3rd Int. Workshop on Semantics
in Data and Knowledge Bases (SDKB 2008)",
   pages = "26--47",
   volume = "4925",
   publisher = "Springer",
   series = "Lecture Notes in Computer Science",
   doi = "10.1007/978-3-540-88594-8_2",
}
pdf url