Data quality thresholds

Introduction to Data Quality

Chrissy Bloom

Head of Enterprise Data Strategy & Governance

Data quality alert thresholds

Data quality alert threshold: a threshold set by either the data producer or consumer which will trigger an action when a data quality rule finds more issues than the threshold allows for

  • represented as either a count or percentage of records

table with alert thresholds, actions, and interpretations of examples of thresholds

Introduction to Data Quality

Importance of thresholds and alerts

table with examples of data quality results, possible alert thresholds, and actions

Introduction to Data Quality

Determining thresholds

  • Thresholds for alerts are based on criticality, priority, and impact of the data quality issue
  • More critical fields require a stricter, higher threshold to be met
  • Less critical fields require less stringent thresholds

positive line graph showing the relationship between criticality, priority, and impact and alert threshold

Introduction to Data Quality

Levels of alert thresholds

  • Level 1 - Warning - alerts when a threshold is breached and does not require rapid remediation
  • Level 2 - Critical issue alert - alerts when a threshold is breached and requires rapid remediation
  • Level 3 - Critical issue prevent - alerts when a threshold is breached and will stop downstream processes from loading data further in the data pipeline

depiction of three levels of alert thresholds: yellow is warn for least critical, orange is alert, and red is prevent for most critical

Introduction to Data Quality

Alert example

table with data quality rules, alert thresholds, alert levels and interpretations

Introduction to Data Quality

Let's practice!

Introduction to Data Quality

Preparing Video For Download...