Publication

Quality-Informed Process Mining: A Case for Standardised Data Quality Annotations

Journal Contribution - Journal Article

Real-life event logs, reflecting the actual executions of complex business processes, are faced with numerous data quality issues. Extensive data sanity checks and pre-processing are usually needed before historical data can be used as input to obtain reliable data-driven insights. However, most of the existing algorithms in process mining, a field focusing on data-driven process analysis, do not take any data quality issues or the potential effects of data pre-processing into account explicitly. This can result in erroneous process mining results, leading to inaccurate or misleading conclusions about the process under investigation. To address this gap, we propose data quality annotations for event logs, which can be used by process mining algorithms to generate quality-informed insights. Using a design science approach, requirements are formulated, which are leveraged to propose data quality annotations. Moreover, we present the 'Quality-Informed visual Miner' plug-in to demonstrate the potential utility and impact of data quality annotations. Our experimental results, utilising both synthetic and real-life event logs, show how the use of data quality annotations by process mining techniques can assist in increasing the reliability of performance analysis results.
Journal: ACM Transactions on Knowledge Discovery from Data
ISSN: 1556-4681
Issue: 5
Volume: 16
Pages: 1 - 47
Publication year: 2022
Keywords: Process Mining, Data Quality, Annotations, Metadata, Quality-Informed Performance Analysis, Quality-Informed Conformance Checking
Accessibility: Closed