Introduction to Anomaly Detection in R
Alastair Rushworth
Data Scientist
Choose a high value
high_score <- quantile(sat$score, probs = 0.99)
high_score
99%
0.6228078
Binarize score
sat$binary_score <- as.numeric(score >= high_score)
Comparing true label and binarized score
table(sat$label, sat$binary_score)
0 1
0 5729 3
1 15 56
Anomalies correctly identified $\div$ Total anomalies
table(sat$label, sat$binary_score)
0 1
0 5729 3
1 15 56
recall <- 56 / (15 + 56)
recall
0.7887324
Anomalies correctly identified $\mathbf{\div}$ Total scored as anomalous
table(sat$label, sat$binary_score)
0 1
0 5729 3
1 15 56
precision <- 56 / (56 + 3)
precision
0.9491525
Introduction to Anomaly Detection in R