Is your deployed AI system successful?

Yapay Zekayı Anlamak

Maarten Van den Broeck

Senior Content Developer at DataCamp

When to measure success?

MLOps methodology lifecycle

Yapay Zekayı Anlamak

When to measure success?

Measuring success across the MLOps lifecycle

Yapay Zekayı Anlamak

Measuring performance offline - accuracy

Training and validation data

Yapay Zekayı Anlamak

Measuring performance offline - accuracy

Training a model

Yapay Zekayı Anlamak

Measuring performance offline - accuracy

Validating a model with test examples

Yapay Zekayı Anlamak

Measuring performance offline - accuracy

Comparing predictions to actual labels

Yapay Zekayı Anlamak

Measuring performance offline - accuracy

Calculating accuracy

Yapay Zekayı Anlamak

Beyond accuracy - error and other metrics

Error metric in regression models

Metrics for search and recommendation engines: ranking quality -relevance of ranking items to the user-, diversity in search results or recommendations, etc.

Ranking of recommended products

Yapay Zekayı Anlamak

Measuring success in production

  • Model degradation: the measured metric value gets worse over the time

Model degradation

  • Business metrics: Key Performance Indicators (KPIs)
    • Indicator of performance and progress of organization objectives
Yapay Zekayı Anlamak

Risks: what could possibly go wrong?

Possible risks include:

  • Data bias
  • Lack of transparency
  • Ethical concerns
  • Dubious system reliability
  • Vulnerability to cyber threats

Proof-of-Concept (PoC):

  • Pilot demonstrator to validate feasibility and potential value + early risk identification
Yapay Zekayı Anlamak

Let's practice!

Yapay Zekayı Anlamak

Preparing Video For Download...