Foundations of Inference in R
Jo Hardin
Instructor






Generating the distribution of a statistic under the null population shows whether the observed data are inconsistent with the null hypothesis
Original data
| Location | Cola | Orange |
|---|---|---|
| East | 28 | 6 |
| West | 19 | 7 |
$\hat{p}_\text{east} = 28/(28 + 6) = 0.82$
$\hat{p}_\text{west} = 19/(19 + 7) = 0.73$
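The two sample proportions can be checked with a short base R sketch. The `soda` data frame is rebuilt here from the table counts, one row per person; the column names `location` and `drink` follow the code used later in the deck, while the label `"orange"` is an assumption, since the source only names `"cola"`, `"east"`, and `"west"`.

```r
# Rebuild one row per person from the table counts.
# The label "orange" is an assumption; the source only names "cola".
soda <- data.frame(
  location = rep(c("east", "west"), times = c(34, 26)),
  drink = c(rep(c("cola", "orange"), times = c(28, 6)),   # east: 28 cola, 6 orange
            rep(c("cola", "orange"), times = c(19, 7)))   # west: 19 cola, 7 orange
)

# Proportion of cola drinkers within each location
p_hat <- tapply(soda$drink == "cola", soda$location, mean)
round(p_hat, 2)
```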
First shuffle, same as the original
| Location | Cola | Orange |
|---|---|---|
| East | 28 | 6 |
| West | 19 | 7 |

Second shuffle
| Location | Cola | Orange |
|---|---|---|
| East | 27 | 7 |
| West | 20 | 6 |

Third shuffle
| Location | Cola | Orange |
|---|---|---|
| East | 26 | 8 |
| West | 21 | 5 |

Fourth shuffle
| Location | Cola | Orange |
|---|---|---|
| East | 25 | 9 |
| West | 22 | 4 |

Fifth shuffle
| Location | Cola | Orange |
|---|---|---|
| East | 29 | 5 |
| West | 18 | 8 |
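Each shuffled table comes from permuting the drink labels while the locations stay fixed, so the row totals (34 and 26) and column totals (47 and 13) never change. A minimal base R sketch of one such shuffle follows. It uses `sample()` rather than the infer package, the seed is arbitrary, and the `"orange"` label is an assumption.

```r
set.seed(1)  # arbitrary seed, only so the sketch is reproducible

# One row per person; the label "orange" is an assumption
soda <- data.frame(
  location = rep(c("east", "west"), times = c(34, 26)),
  drink = c(rep(c("cola", "orange"), times = c(28, 6)),
            rep(c("cola", "orange"), times = c(19, 7)))
)

# Permute the drink labels; locations are left untouched
shuffled <- soda
shuffled$drink <- sample(soda$drink)

# Cell counts change from shuffle to shuffle, the margins do not
tab <- table(shuffled$location, shuffled$drink)
addmargins(tab)
```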








library(dplyr)

# Observed difference in the proportion of cola drinkers (west minus east)
soda %>%
  group_by(location) %>%
  summarize(prop_cola = mean(drink == "cola")) %>%
  summarize(diff(prop_cola))
# A tibble: 1 x 1
  `diff(prop_cola)`
              <dbl>
1       -0.09276018
library(infer)

# One permuted data set: shuffle drink across locations under independence
soda %>%
  specify(drink ~ location, success = "cola") %>%
  hypothesize(null = "independence") %>%
  generate(reps = 1, type = "permute") %>%
  calculate(stat = "diff in props", order = c("west", "east"))
# A tibble: 1 x 2
  replicate        stat
      <int>       <dbl>
1         1 -0.02488688
# Five permuted data sets give five draws from the null distribution
soda %>%
  specify(drink ~ location, success = "cola") %>%
  hypothesize(null = "independence") %>%
  generate(reps = 5, type = "permute") %>%
  calculate(stat = "diff in props", order = c("west", "east"))
# A tibble: 5 x 2
  replicate        stat
      <int>       <dbl>
1         1  0.04298643
2         2 -0.09276018
3         3  0.11085973
4         4  0.17873303
5         5 -0.16063348
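With many more permutations, the permuted statistics trace out the null distribution, and the observed difference can be placed within it. The sketch below is one possible way to do this in base R, using `sample()` and `replicate()` in place of the infer pipeline. The helper `diff_in_props()`, the 1000 repetitions, the seed, and the `"orange"` label are all illustrative assumptions.

```r
set.seed(47)  # arbitrary seed, only so the sketch is reproducible

# One row per person; the label "orange" is an assumption
soda <- data.frame(
  location = rep(c("east", "west"), times = c(34, 26)),
  drink = c(rep(c("cola", "orange"), times = c(28, 6)),
            rep(c("cola", "orange"), times = c(19, 7)))
)

# Hypothetical helper: proportion of cola in the west minus the east
diff_in_props <- function(drink, location) {
  p <- tapply(drink == "cola", location, mean)
  p[["west"]] - p[["east"]]
}

# Observed statistic, about -0.093 for these counts
obs_diff <- diff_in_props(soda$drink, soda$location)

# 1000 permuted statistics: shuffle drink, keep location fixed
null_diffs <- replicate(1000, diff_in_props(sample(soda$drink), soda$location))

# Share of permuted differences at or below the observed one
mean(null_diffs <= obs_diff)
```

If a large share of the permuted differences are at least as extreme as the observed one, the observed data are consistent with the null hypothesis of independence.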
