Chi-squared test statistic

Inference for Categorical Data in R

Andrew Bray

Assistant Professor of Statistics at Reed College

Comparing bar plots

ch3v2-natspac-barplot-filled.png

ch3v2-natarms-barplot-filled.png

Hypothesis test

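library(infer)

# Template: `data`, `var1`, and `var2` are placeholders
null <- data %>%
  specify(var1 ~ var2) %>%
  hypothesize(null = "independence") %>%
  generate(reps = 100, type = "permute") %>%
  calculate(stat = ?)  # which statistic? (placeholder, not runnable as is)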
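
As a rough illustration of what the permute step does under the independence null, the base R sketch below shuffles one variable while leaving the other fixed. The data frame `toy` and its values are made up for illustration; they are not the survey data used in these slides.

```r
set.seed(1)

# Hypothetical toy data, not the survey data from the slides
toy <- data.frame(
  var1 = c("a", "a", "b", "b", "b", "a", "b", "a"),
  var2 = c("x", "y", "x", "y", "x", "x", "y", "y")
)

# One permutation replicate: shuffle var1, keep var2 as is
permuted <- toy
permuted$var1 <- sample(toy$var1)

# Shuffling breaks any link between the two variables but keeps
# each variable's own counts (the table margins) unchanged
table(toy$var1)
table(permuted$var1)

# The two-way table generally changes from replicate to replicate
table(permuted$var1, permuted$var2)
```

Repeating the shuffle many times and computing a statistic on each replicate gives the null distribution; which statistic to use is the open question on this slide.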

Choosing a statistic

ch3v2-two-bar-plots.png

Choosing a statistic

ch3v2-three-bar-plots.png

Choosing a statistic

observed_counts
             party
natspac       Dem Ind Rep
  TOO LITTLE    8  13   9
  ABOUT RIGHT  22  37  17
  TOO MUCH     13  22   8
expected_counts
             party
natspac        Dem  Ind  Rep
  TOO LITTLE   8.7 14.5  6.8
  ABOUT RIGHT 21.9 36.7 17.3
  TOO MUCH    12.4 20.8  9.8

ch3v2-natspac-barplot-stacked.png
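
The expected counts are what the table would look like if `natspac` and `party` were independent: each cell is its row total times its column total, divided by the grand total. A minimal base R sketch, with the observed counts typed in by hand from the slide (the construction via `matrix()` and `outer()` is an assumption about how one might do this, not necessarily how the slide's objects were built):

```r
# Observed counts copied from the slide
observed_counts <- matrix(
  c( 8, 13,  9,
    22, 37, 17,
    13, 22,  8),
  nrow = 3, byrow = TRUE,
  dimnames = list(
    natspac = c("TOO LITTLE", "ABOUT RIGHT", "TOO MUCH"),
    party   = c("Dem", "Ind", "Rep")
  )
)

# Under independence: expected = row total * column total / grand total
expected_counts <- outer(rowSums(observed_counts),
                         colSums(observed_counts)) / sum(observed_counts)
names(dimnames(expected_counts)) <- c("natspac", "party")

# Rounded to one decimal place for comparison with the slide
round(expected_counts, 1)
```

The expected table keeps the same row and column totals as the observed table, which is why the two stacked bar plots share the same overall proportions.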

Choosing a statistic

(observed_counts - expected_counts) ^ 2
             party
natspac         Dem   Ind   Rep
  TOO LITTLE  0.433 2.240 4.641
  ABOUT RIGHT 0.005 0.076 0.117
  TOO MUCH    0.349 1.492 3.284
sum((observed_counts - expected_counts) ^ 2)
12.63565

Chi-squared distance

ch3v2-annotated-three-bar-plots.png
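
The chi-squared statistic refines the sum of squared differences by dividing each squared difference by its expected count, so that a gap of a given size counts for more in a cell where few observations are expected. The sketch below works this out in base R from the counts on the earlier slides; the object names `sum_sq` and `chi_sq` are introduced here for illustration only.

```r
# Observed counts copied from the earlier slide
observed_counts <- matrix(
  c( 8, 13,  9,
    22, 37, 17,
    13, 22,  8),
  nrow = 3, byrow = TRUE,
  dimnames = list(
    natspac = c("TOO LITTLE", "ABOUT RIGHT", "TOO MUCH"),
    party   = c("Dem", "Ind", "Rep")
  )
)

# Expected counts under independence
expected_counts <- outer(rowSums(observed_counts),
                         colSums(observed_counts)) / sum(observed_counts)

# Unscaled sum of squared differences, as on the earlier slide
sum_sq <- sum((observed_counts - expected_counts)^2)
sum_sq

# Chi-squared statistic: scale each squared difference by its expected count
chi_sq <- sum((observed_counts - expected_counts)^2 / expected_counts)
chi_sq

# Cross-check against base R's built-in test statistic
unname(chisq.test(observed_counts, correct = FALSE)$statistic)
```

In the infer pipeline from the hypothesis test slide, this statistic is requested with `calculate(stat = "Chisq")`.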

Let's practice!
