HR Analytics: Predicting Employee Churn in R
Abhishek Trehan
People Analytics Practitioner
# Plot the distribution of compensation
ggplot(emp_tenure, aes(x = compensation)) +
geom_histogram()
# Plot the distribution of compensation across levels
ggplot(emp_tenure,
aes(x = level, y = compensation)) +
geom_boxplot()
$$ \text{Compa Ratio} = \frac{\text{ Actual Compensation }}{\text{Median Compensation}} $$
Compa-ratio of 1.2 or 120% means that the employee is paid 20% above the median pay
Compa-ratio of 1 or 100% means that the employee is paid exactly the median pay
Compa-ratio of 0.8 or 80% means that the employee is paid 20% below the median pay
# Derive Compa-ratio
emp_compa_ratio <- emp_tenure %>%
group_by(level) %>%
mutate(median_compensation = median(compensation),
compa_ratio = (compensation / median_compensation))
# Look at the median compensation for each level
emp_compa_ratio %>%
distinct(level, median_compensation)
# A tibble: 2 x 2
# Groups: level[2]
level median_compensation
<fct> <dbl>
1 Analyst 51840
2 Specialist 83496
HR Analytics: Predicting Employee Churn in R