Process-aware Data Quality Assessment
Questions
Approach
Leitmotif: Solve an ordinary problem in DQ
Simple example (starting point)
Consult books (and hard papers) used only when the research get stucked
<fc #FF0000>Think rather then read!</fc>
<fc #FF0000>Use imagination</fc>
Properties of a BP language:
Model sophisticated properties include:
(Color) Petri Nets
<fc #FF0000>
Color Petri Nets == YAWL
</fc>
Drawbacks:
Data:
Supported via
Local and
Global variables
=/= fully fledged
SQL database
Still with proper restrictions one get very elegant model with clear semantics for representing BP.
References
Business Processes: A Database Propsective (2012)
A book by Daniel Deutch, Tova Milo
Summary:
Business Processes are important to consider because they are telling us the context in which the data is generated and manipulated (processes, users and goals). The ultimate goal is to create a declarative model and query language that will posses all advantages of the relational counter-part. We need a flow-and-data framework that will allow us developing analysis and optimization techniques as we have in the relational model. Therefore, this research is of a fundamental importance.
Highlights:
BP and data main challenges: A model and a query language for that model
Existing BP languages (e.g., BPEL) very weakly support explicit description of data operations. Instead they are hidden bellow some high-level code (e.g., in Java methods).
On the other hand, existing data flow languages developed within database community (Hull, Deutsch, Abitaboul, etc.), are implicitly defining a data flow (in a way it is hidden) by explicitly specifying data transformation.
Neither existing solution fulfills the starting goal of an elegant model and declarative query language.
Observations
Query languages for BP and data can perform analysis in:
Data Quality concepts, methodologies and techniques (2006)
a book by Carlo Batini and Monica Scannapieco
Foundations of Data Quality Management (2012)