Introduction to Statistics in Python
Maggie Matsui
Content Developer, DataCamp
x
doesn't tell us anything about y
0.75: as x
increases, y
increases
-0.75: as x
increases, y
decreases
import seaborn as sns
sns.scatterplot(x="sleep_total", y="sleep_rem", data=msleep)
plt.show()
import seaborn as sns sns.lmplot(x="sleep_total", y="sleep_rem", data=msleep, ci=None)
plt.show()
msleep['sleep_total'].corr(msleep['sleep_rem'])
0.751755
msleep['sleep_rem'].corr(msleep['sleep_total'])
0.751755
$$ r = \frac{1}{n - 1} \sum_{i=1}^{n} \frac{(x_i - \bar{x})(y_i - \bar{y})}{\sigma_x \cdot \sigma_y}$$
Introduction to Statistics in Python