Introduction to the Normal distribution

Statistical Thinking in Python (Part 1)

Justin Bois

Teaching Professor at the California Institute of Technology

Normal distribution

  • Describes a continuous variable whose PDF has a single symmetric peak.
Statistical Thinking in Python (Part 1)

Normal distribution

ch4-2.004.png

Statistical Thinking in Python (Part 1)

Normal distribution

ch4-2.005.png

Statistical Thinking in Python (Part 1)

Normal distribution

ch4-2.006.png

Statistical Thinking in Python (Part 1)

Normal distribution

ch4-2.007.png

Statistical Thinking in Python (Part 1)

Normal distribution

ch4-2.008.png

Statistical Thinking in Python (Part 1)

ch4-2.009.png

Statistical Thinking in Python (Part 1)

ch4-2.010.png

Statistical Thinking in Python (Part 1)

Comparing data to a Normal PDF

ch4-2.012.png

Statistical Thinking in Python (Part 1)

Checking Normality of Michelson data

import numpy as np
rng = np.random.default_rng()
mean = np.mean(michelson_speed_of_light)
std = np.std(michelson_speed_of_light)
samples = rng.normal(mean, std, size=10000)
x, y = ecdf(michelson_speed_of_light)
x_theor, y_theor = ecdf(samples)
Statistical Thinking in Python (Part 1)

Checking Normality of Michelson data

import matplotlib.pyplot as plt
import seaborn as sns
sns.set()
_ = plt.plot(x_theor, y_theor)
_ = plt.plot(x, y, marker='.', linestyle='none')
_ = plt.xlabel('speed of light (km/s)')
_ = plt.ylabel('CDF')
plt.show()
Statistical Thinking in Python (Part 1)

Checking Normality of Michelson data

ch4-2.029.png

Statistical Thinking in Python (Part 1)

Let's practice!

Statistical Thinking in Python (Part 1)

Preparing Video For Download...