INODE: Building an End-to-End Data Exploration System in Practice [ExtendedVision]

Sihem Amer-Yahia, Georgia Koutrika, Frederic Bastian, Theofilos Belmpas, Martin Braschler, Ursin Brunner, Diego Calvanese, Maximilian Fabricius, Orest Gkini, Catherine Kosten, Davide Lanti, Antonis Litke, Hendrik Lücke-Tieke, Francesco Alessandro Massucci, Tarcisio Mendes de Farias, Alessandro Mosca, Francesco Multari, Nikolaos Papadakis, Dimitris Papadopoulos, Yogendra Patil, Aurélien Personnaz, Guillem Rull, Ana Claudia Sima, Ellery Smith, Dimitrios Skoutas, Srividya Subramanian, Guohui Xiao, and Kurt Stockinger

Technical Report, arXiv.org e-Print archive. CoRR Technical Report arXiv:2104.04194 2021. Available at http://arxiv.org/abs/2104.04194.

A full-fledged data exploration system must combine different access modalities with a powerful concept of guiding the user in the exploration process, by being reactive and anticipative both for data discovery and for data linking. Such systems are a real opportunity for our community to cater to users with different domain and data science expertise. We introduce INODE - an end-to-end data exploration system - that leverages, on the one hand, Machine Learning and, on the other hand, semantics for the purpose of Data Management (DM). Our vision is to develop a classic unified, comprehensive platform that provides extensive access to open datasets, and we demonstrate it in three significant use cases in the fields of Cancer Biomarker Research, Research and Innovation Policy Making, and Astrophysics. INODE offers sustainable services in (a) data modeling and linking, (b) integrated query processing using natural language, (c) guidance, and (d) data exploration through visualization, thus facilitating the user in discovering new insights. We demonstrate that our system is uniquely accessible to a wide range of users from larger scientific communities to the public. Finally, we briefly illustrate how this work paves the way for new research opportunities in DM.


@techreport{Corr-2021-INODE,
   title = "INODE: Building an End-to-End Data Exploration System in
Practice [ExtendedVision]",
   year = "2021",
   author = "Sihem Amer-Yahia and Georgia Koutrika and Frederic Bastian
and Theofilos Belmpas and Martin Braschler and Ursin Brunner and Diego
Calvanese and Maximilian Fabricius and Orest Gkini and Catherine Kosten
and Davide Lanti and Antonis Litke and Hendrik Lücke-Tieke and
Francesco Alessandro Massucci and Mendes de Farias, Tarcisio and Alessandro
Mosca and Francesco Multari and Nikolaos Papadakis and Dimitris
Papadopoulos and Yogendra Patil and Aurélien Personnaz and
Guillem Rull and Sima, Ana Claudia and Ellery Smith and Dimitrios Skoutas
and Srividya Subramanian and Guohui Xiao and Kurt Stockinger",
   institution = "arXiv.org e-Print archive",
   number = "arXiv:2104.04194",
   note = "Available at http://arxiv.org/abs/2104.04194",
}
pdf url