http://www.dis.uniroma1.it/~calvanese/didattica/99-00-data-integration/
Corso di Dottorato in Ingegneria Informatica --
Università di Roma "La Sapienza"
Home page del Corso di
Data Integration
A.A. 1999-2000
Programma
- Introduction to data integration
What is data integration. Contexts where data integration is important.
Relevant problems in data integration.
- Achitectures for data integration
Read-only vs read/write integration. Materialized vs virtual integration.
Global-as-view vs local-as-view. Wrappers and mediators. Limitations
in accessing the sources.
- Source and global schema modeling
Languages for describing global schemas. Languages for describing
sources. Conceptual and logical description of schemas.
Inter-relationships between sources.
- Data extraction, cleaning, and reconciliation
Accessing the sources for extraction. The problem of data cleaning.
Reconciling data coming from different sources.
- Query answering in data integration systems
The problem of view-based query processing. Answering queries using
views. Query rewriting using views. View-based query processing in the
local-as-view approach.
- Semistructured data and data integration
The problem of semistructured data modeling. Languages and models for
expressing semistructured data. The language XML. Using semistructured
data in data integration.
- Query answering in semistructured data integration systems
The problem of view-based query processing with semistructured data.
Answering queries using views. Rewriting queries using views.
- Other topics presented by students
Calendario
Le lezioni si svolgeranno presso il Dipartimento di Informatica e Sistemistica,
Via Salaria, 113 II piano sempre alle ore 14 in aula C2,
nelle seguenti date:
- Dicembre 6, 2000
- Dicembre 7
- Dicembre 11
- Dicembre 12
- Dicembre 13
- Dicembre 14
- Dicembre 15
- Dicembre 18
- Dicembre 19
- Dicembre 20 - Seminario dello studente:
- Walter Crescenzi su
Wrapping di sorgenti
- Gennaio 26, 2001 - Seminari degli studenti:
- Monica Scannapieco su Data cleaning
- Massimo Mecella su EDOC and data
integration
- Antonino Virgillito su Tsimmis
- Febbraio 9 - Seminari degli studenti:
- Carlo Marchetti su Data materialization
- Andrea Santoro su Distributed data
- Fabio Zito su Algoritmi di risposta alle query
usando viste
- Andrea Cali su Limitazioni di accesso in
GAV
- Domenico Lembo su Limitazioni di accesso in
LAV
- Marco Pirrone su Memorizzazione di documenti
XML
- Massimo Romano su Schemi XML
Materiale didattico
Riferimenti bibliografici
-
Ullman, J. D. (1997).
Information integration using logical views.
In Proc. of the 6th Int. Conf. on Database Theory (ICDT'97),
volume 1186 of Lecture Notes in Computer Science, pages 19-40.
Springer-Verlag.
-
Hull, R. (1997).
Managing semantic heterogeneity in databases: A theoretical
perspective.
In Proc. of the 16th ACM SIGACT SIGMOD SIGART Symp. on
Principles of Database Systems (PODS'97).
-
Levy, A. Y., Mendelzon, A. O., Sagiv, Y., and Srivastava, D. (1995).
Answering queries using views.
In Proc. of the 14th ACM SIGACT SIGMOD SIGART Symp. on
Principles of Database Systems (PODS'95), pages 95-104.
-
Levy, A. Y. (1999).
Answering queries using views: A survey.
Technical report, University of Washinghton.
-
Abiteboul, S. and Duschka, O. (1998).
Complexity of answering queries using materialized views.
In Proc. of the 17th ACM SIGACT SIGMOD SIGART Symp. on
Principles of Database Systems (PODS'98), pages 254-265.
-
Grahne, G. and Mendelzon, A. O. (1999).
Tableau techniques for querying information sources through global
schemas.
In Proc. of the 7th Int. Conf. on Database Theory (ICDT'99),
volume 1540 of Lecture Notes in Computer Science, pages 332-347.
Springer-Verlag.
-
Calvanese, D., De Giacomo, G., Lenzerini, M., and Vardi, M. Y. (2000b).
What is query rewriting?
In Proc. of the 7th Int. Workshop on Knowledge Representation
meets Databases (KRDB 2000), pages 17-27. CEUR Electronic Workshop
Proceedings,
http://sunsite.informatik.rwth-aachen.de/Publications/CEUR-WS/Vol-29/.
-
Duschka, O. M., Genesereth, M. R., and Levy, A. Y. (2000).
Recursive query plans for data integration.
J. of Logic Programming, 43(1):49-73.
-
Buneman, P. (1997).
Semistructured data.
In Proc. of the 16th ACM SIGACT SIGMOD SIGART Symp. on
Principles of Database Systems (PODS'97), pages 117-121.
-
Abiteboul, S. (1997).
Querying semi-structured data.
In Proc. of the 6th Int. Conf. on Database Theory (ICDT'97),
pages 1-18.
-
Florescu, D., Levy, A., and Mendelzon, A. (1998).
Database techniques for the World-Wide Web: A survey.
SIGMOD Record, 27(3):59-74.
-
Calvanese, D., De Giacomo, G., Lenzerini, M., and Vardi, M. Y. (1999).
Rewriting of regular expressions and regular path queries.
In Proc. of the 18th ACM SIGACT SIGMOD SIGART Symp. on
Principles of Database Systems (PODS'99), pages 194-204.
-
Calvanese, D., De Giacomo, G., Lenzerini, M., and Vardi, M. Y. (2000).
Query processing using views for regular path queries with inverse.
In Proc. of the 19th ACM SIGACT SIGMOD SIGART Symp. on
Principles of Database Systems (PODS 2000), pages 58-66.
-
http://www.w3.org/