Predictive Analytics using Networked Data in R
María Óskarsdóttir, Ph.D.
Postdoctoral researcher
Are the relationships between nodes important?
Are the labels randomly spread through the network or is there some structure?
Is the network homophilic?
# Heterophilicity
e_mixed / m_mixed
N <- 40 E <- 39 n_green <- 10 n_white <- 30 e_green <- 6 e_mixed <- 13
p <- 2 * E / N / (N-1) m_green <- n_green * (n_green-1)/2 * p m_mixed <- n_green * n_white * p
# Dyadicity e_green / m_green
2.666667
0.8666667
$\Rightarrow$ Homophilic
Predictive Analytics using Networked Data in R