Distance between two observations

Cluster Analysis in R

Dmitriy (Dima) Gorenshteyn

Lead Data Scientist, Memorial Sloan Kettering Cancer Center

Distance vs. Similarity

Cluster Analysis in R

Distance vs. Similarity

       

$$Distance = 1 - Similarity$$
Cluster Analysis in R

Distance between two players

soccer_1

Cluster Analysis in R

Distance between two players

soccer_2

Cluster Analysis in R

Distance between two players

soccer_3

Cluster Analysis in R

Distance between two players

soccer_4

Cluster Analysis in R

Distance between two players

soccer_6

Cluster Analysis in R

Distance between two players

soccer_7

Cluster Analysis in R

Distance between two players

soccer_8

Cluster Analysis in R

Distance between two players

soccer_9

Cluster Analysis in R

Distance between two players

soccer_12

Cluster Analysis in R

dist() function

print(two_players)
     X  Y
BLUE 0  0
RED  9 12
dist(two_players, method = 'euclidean')
      BLUE
RED   15
Cluster Analysis in R

More than 2 observations

print(three_players)
       X  Y
BLUE   0  0
RED    9 12
GREEN -2 19
dist(three_players)
          BLUE      RED
RED   15.00000         
GREEN 19.10497 13.03840
Cluster Analysis in R

Let's practice!

Cluster Analysis in R

Preparing Video For Download...