Ensuring Data Quality Standards with DevOps

DevOps Concepts

Cem Sakarya

DevOps Risk Advisor

DevOps ensures good code

  Good code drives good data which leads to good ml.

  • Most software handle data
  • Good software drives high-quality data
  • DevOps helps maintain high-quality in software
DevOps Concepts

Data quality

 

  • How trusted is the information?
  • Most recent and accurate information
  • Can be hard and costly to achieve
  • Not all data requires the same quality
  • There are many elements to Data Quality
DevOps Concepts

Elements of data quality

  Data Quality elements: Accuracy, Completeness, Consistency, Relevance, Timeliness

 

  • Elements breaks down and define Data Quality
  • They should be traced and standardized
DevOps Concepts

   

Data Quality elements: Accuracy, Completeness, Consistency, Relevance, Timeliness; Accuracy Highlighted

   

  • Is the data correct in all details?
  • Should be closely monitored
DevOps Concepts

   

Data Quality elements: Accuracy, Completeness, Consistency, Relevance, Timeliness; Completeness Highlighted

   

  • How comprehensive is the data?
  • Check if data is lost when it is stored in a database
  • Check if data is lost when it is moved to different databases
DevOps Concepts

   

Data Quality elements: Accuracy, Completeness, Consistency, Relevance, Timeliness; Consistency Highlighted

   

  • Consistency is about the reliability of the information
  • It is also referred to as Data Integrity
  • Should be closely monitored and tested
DevOps Concepts

   

Data Quality elements: Accuracy, Completeness, Consistency, Relevance, Timeliness; Relevance Highlighted

   

  • Keeping and handling only the relevant information
  • Irrelevant information wastes time and resources
DevOps Concepts

   

Data Quality elements: Accuracy, Completeness, Consistency, Relevance, Timeliness; Timeliness Highlighted

   

  • How up to date is the information?
  • Old data could be replaced by more recent information
  • Raw data within microservices is not used in data analysis
  • Microservices talk to each other via API calls to get the most recent information
  • The data within microservices should be replicated at an acceptable frequency
DevOps Concepts

Let's practice!

DevOps Concepts

Preparing Video For Download...