Relative error of point estimates

Sampling in R

Richie Cotton

Data Evangelist at DataCamp

Sample size is number of rows

coffee_ratings %>% 
  slice_sample(n = 300) %>% 
  nrow()
300
coffee_ratings %>% 
  slice_sample(prop = 0.25) %>% 
  nrow()
334

Various sample sizes

coffee_ratings %>% 
  summarize(mean_points = mean(total_cup_points)) %>% 
  pull(mean_points)

82.15
coffee_ratings %>% 
  slice_sample(n = 10) %>% 
  summarize(mean_points = mean(total_cup_points)) %>% 
  pull(mean_points)
82.82
coffee_ratings %>% 
  slice_sample(n = 100) %>% 
  summarize(mean_points = mean(total_cup_points)) %>% 
  pull(mean_points)
82.02
coffee_ratings %>% 
  slice_sample(n = 1000) %>% 
  summarize(mean_points = mean(total_cup_points)) %>% 
  pull(mean_points)
82.16

Relative errors

Population parameter

population_mean <- coffee_ratings %>% 
  summarize(mean_points = mean(total_cup_points)) %>% 
  pull(mean_points)

Point estimate

sample_mean <- coffee_ratings %>% 
  slice_sample(n = sample_size) %>% 
  summarize(mean_points = mean(total_cup_points)) %>% 
  pull(mean_points)

Relative error as a percentage

100 * abs(population_mean - sample_mean) / population_mean
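
As a worked example of the formula above, the sketch below plugs in the rounded means printed on the earlier slides (82.15 for the population; 82.82, 82.02 and 82.16 for samples of 10, 100 and 1000 rows). The helper name `relative_error_pct` is hypothetical and only wraps the slide's expression.

```r
# Values copied from the slides above (rounded to two decimals).
population_mean <- 82.15
sample_means <- c(`10` = 82.82, `100` = 82.02, `1000` = 82.16)

# Hypothetical helper wrapping the formula from the slide:
# relative error of an estimate, expressed as a percentage of the true value.
relative_error_pct <- function(estimate, truth) {
  100 * abs(truth - estimate) / truth
}

# Vectorized over the three sample means; names are the sample sizes.
rel_errors <- relative_error_pct(sample_means, population_mean)
print(round(rel_errors, 3))
```

With these rounded inputs the relative errors come out at roughly 0.82%, 0.16% and 0.01%. These are single random draws, so another run of the sampling code would give different values.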

Relative error vs. sample size

ggplot(errors, aes(sample_size, relative_error)) +
  geom_line() +
  geom_smooth(method = "loess")

Plot of relative error versus sample size.
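
The slides do not show how the `errors` data frame is built. One possible construction is sketched below, under loud assumptions: the population is a simulated stand-in for `coffee_ratings` (normal scores, 1338 rows), and the helper `relative_error_for` and the grid of sample sizes are hypothetical choices, not taken from the course.

```r
library(dplyr)

set.seed(2024)  # arbitrary seed so the sketch is reproducible

# ASSUMPTION: simulated stand-in for coffee_ratings, not the real dataset.
population <- tibble(total_cup_points = rnorm(1338, mean = 82, sd = 3))

# Population parameter: the mean over every row.
population_mean <- population %>%
  summarize(mean_points = mean(total_cup_points)) %>%
  pull(mean_points)

# Hypothetical helper: draw one simple random sample of n rows and
# return the relative error (in percent) of its mean.
relative_error_for <- function(n) {
  sample_mean <- population %>%
    slice_sample(n = n) %>%
    summarize(mean_points = mean(total_cup_points)) %>%
    pull(mean_points)
  100 * abs(population_mean - sample_mean) / population_mean
}

# Assumed grid of sample sizes; one sample is drawn per size.
sample_sizes <- seq(10, 1000, by = 10)
errors <- tibble(
  sample_size = sample_sizes,
  relative_error = vapply(sample_sizes, relative_error_for, numeric(1))
)
```

The resulting tibble has the two columns, `sample_size` and `relative_error`, that the `ggplot()` call above expects. Because each size is sampled only once, the curve is noisy, which is presumably why the slide adds a smoothed trend line.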

Let's practice!
