Galat relatif pada taksiran titik

Sampling di Python

James Chapman

Curriculum Manager, DataCamp

Ukuran sampel adalah jumlah baris

len(coffee_ratings.sample(n=300))
300
len(coffee_ratings.sample(frac=0.25))
334
Sampling di Python

Beragam ukuran sampel

coffee_ratings['total_cup_points'].mean()
82.15120328849028
coffee_ratings.sample(n=10)['total_cup_points'].mean()
83.027
coffee_ratings.sample(n=100)['total_cup_points'].mean()
82.4897
coffee_ratings.sample(n=1000)['total_cup_points'].mean()
82.1186
Sampling di Python

Galat relatif

Parameter populasi:

population_mean = coffee_ratings['total_cup_points'].mean()

Taksiran titik:

sample_mean = coffee_ratings.sample(n=sample_size)['total_cup_points'].mean()

Galat relatif (persen):

rel_error_pct = 100 * abs(population_mean-sample_mean) / population_mean
Sampling di Python

Galat relatif vs. ukuran sampel

import matplotlib.pyplot as plt
errors.plot(x="sample_size", 
            y="relative_error", 
            kind="line")
plt.show()

Sifat:

  • Sangat bising, terutama untuk sampel kecil
  • Amplitudo awalnya curam, lalu mendatar
  • Galat relatif menurun ke nol (saat ukuran sampel = populasi)

Plot garis galat relatif vs. ukuran sampel.

Sampling di Python

Ayo berlatih!

Sampling di Python

Preparing Video For Download...