A robust data Infrastructure

Data Fluency

Konstantinos Kattidis

Data Analytics Lead

Ecosystem enabling data products

Diagram showing data products

  • Data infrastructure enables the creation and sharing of data products
  • They are the resulting products from data processing, modeling, or analysis that are made available to users
Data Fluency

The meaning of data infrastructure

  • Data infrastructure refers to the hardware, software, databases, standards, and policies
  • It ensures data is available, reliable, and secure
  • Imagine data as water, and data infrastructure as the pipes and reservoirs that store and transport this water

Pipes showing the flow of data

Data Fluency

Implementing a data strategy

Data leader defining the strategy

  • Data-fluent organizations have a clear data strategy in place
  • This makes sure data is collected and made available to users across the organization in the best way possible
  • It ensures:
    • A single source of truth
    • Data is discoverable, compliant, actionable, and understood
Data Fluency

Enabling a single source of truth

A central data warehouse

  • A data-fluent organization ensures a single source of truth by creating one central place where all important data is stored, organized, and managed
  • For example, a centralized data warehouse
Data Fluency

Data standards and governance

The rules of the road

  • To ensure data is reliable, consistent, and accessible data-fluent organizations focus on setting the right standards and governance
  • Data standards provide the specific guidelines and formats that data should adhere to
  • Data governance is responsible for overseeing and enforcing the rules and policies related to data
Data Fluency

Enabling discoverability

Discovering data

  • Data discovery tools help users to effortlessly identify the data products they need
  • Users can read about the meaning of the datasets, the meaning of each column, and the source of data, learn who owns the data product
  • It helps understanding the data before using it
Data Fluency

Operationalization of data products

  • An important element of a strong data infrastructure is enabling a clear path to operationalization
  • It is about turning an idea or a model into something practical that the organization can use every day
  • Making sure the data experts can deploy their data products and make them available to users

Data expert sharing data products with users

Data Fluency

Let's practice!

Data Fluency

Preparing Video For Download...